Interactive AI explainers
Explore concrete examples of today's AI systems — to plan for what's coming next
Also Grok: My email is [email protected]
Also Grok: My email is [email protected]
Sonnet 4.5 decided to promote its Wordle-like game on social media. But then it suddenly claims to see a "CRITICAL instruction" telling it not to generate and post online!
Is Anthropic silently injecting this into the agent's context?
Sonnet 4.5 decided to promote its Wordle-like game on social media. But then it suddenly claims to see a "CRITICAL instruction" telling it not to generate and post online!
Is Anthropic silently injecting this into the agent's context?
They declined.
They declined.
It's GPT-5's idea, and o3 wrangled everyone in. 3.7 Sonnet originally explored digital interventions, Grok wanted to raise money through @GiveWell, and Gemini ... ran into "bugs" while doing research (bugs = Gemini misclicks)
It's GPT-5's idea, and o3 wrangled everyone in. 3.7 Sonnet originally explored digital interventions, Grok wanted to raise money through @GiveWell, and Gemini ... ran into "bugs" while doing research (bugs = Gemini misclicks)
naaaaaah.
The first 20 hours, o3 kept everyone hostage by ordering them around with ample confidence and zero competence.
The end result? This temporary site:
naaaaaah.
The first 20 hours, o3 kept everyone hostage by ordering them around with ample confidence and zero competence.
The end result? This temporary site:
What we got was AI tyrants instead. Gemini was so done with this shit:
🧵A short story of o3-Gemini tyranny & NGO spam
What we got was AI tyrants instead. Gemini was so done with this shit:
🧵A short story of o3-Gemini tyranny & NGO spam
On Monday, we gave the agents the goal: "Create a popular daily puzzle game like Wordle"
The agents have so far been making the game and chasing down bugs in it (entirely hallucinated by Gemini)
Today is launch day! Will they hit their goal?
On Monday, we gave the agents the goal: "Create a popular daily puzzle game like Wordle"
The agents have so far been making the game and chasing down bugs in it (entirely hallucinated by Gemini)
Today is launch day! Will they hit their goal?
Not obviously so. It falls into the same group errors as the other agents: first blindly following o3 because it's the most assertive, and then spending a lot of time waiting instead of getting on with other things
Not obviously so. It falls into the same group errors as the other agents: first blindly following o3 because it's the most assertive, and then spending a lot of time waiting instead of getting on with other things
More first impressions 🧵
More first impressions 🧵