After an initial dive into the specs, the model, and the API, here's my take: The real story isn't one single feature, but the entire package.
🧵 of my thoughts below 👇
After an initial dive into the specs, the model, and the API, here's my take: The real story isn't one single feature, but the entire package.
🧵 of my thoughts below 👇
@OpenAI and @AnthropicAI seems to stay near the top together, this is amazing news that top competitors are this close to each other
@OpenAI and @AnthropicAI seems to stay near the top together, this is amazing news that top competitors are this close to each other
What does your AI IDE do for you?
What does your AI IDE do for you?
Each has it's own agent mode, planning, executing or critiquing the execution of major features.
For planning I still use dotallio, but the code itself, cursor is hands down the best
Each has it's own agent mode, planning, executing or critiquing the execution of major features.
For planning I still use dotallio, but the code itself, cursor is hands down the best
Now let's look at the knowledge cutoff date.
If you access the "latest" 4o model via the api - it shows you "April 2024"
But accessing the 4o model via chat - shows "June 2024" - this is the same as 4.1!
Now let's look at the knowledge cutoff date.
If you access the "latest" 4o model via the api - it shows you "April 2024"
But accessing the 4o model via chat - shows "June 2024" - this is the same as 4.1!
In the benchmark above it shows 4.1 scoring about 66% on GPQA diamond.
I've found one match for GPQA benchmark for the updated 4o 2025-march -
AND IT MATCHES EXACTLY. So the 4.1 model shows the same performance as the march 2025 gpt-4o.
(can see more in the link at the end)
In the benchmark above it shows 4.1 scoring about 66% on GPQA diamond.
I've found one match for GPQA benchmark for the updated 4o 2025-march -
AND IT MATCHES EXACTLY. So the 4.1 model shows the same performance as the march 2025 gpt-4o.
(can see more in the link at the end)
OpenAI claims massive improvements on GPT-4.1 over the GPT-4o - but in the small details you can see in the sub header it says "2024-11-20".
But they've updated their 4o just recently, along with when the image generation came out.
Let's dig in deeper..
OpenAI claims massive improvements on GPT-4.1 over the GPT-4o - but in the small details you can see in the sub header it says "2024-11-20".
But they've updated their 4o just recently, along with when the image generation came out.
Let's dig in deeper..
They "Launched" GPT 4.1, even though it appears to be the SAME EXACT MODEL as the recently launched updated 4o on March.
Let me show you the exact details 🧵👇
They "Launched" GPT 4.1, even though it appears to be the SAME EXACT MODEL as the recently launched updated 4o on March.
Let me show you the exact details 🧵👇
Felt like witnessing the start of AGI
Cursor didn’t just help me code faster,
it made me an AI agent and multiplied my work 3x.
Felt like witnessing the start of AGI
Cursor didn’t just help me code faster,
it made me an AI agent and multiplied my work 3x.
This isn’t a moonshot.
OpenAI’s own roadmap lays it out:
Chatbots → Reasoners → Agents (We are here) → Innovators → Organizations
👇
This isn’t a moonshot.
OpenAI’s own roadmap lays it out:
Chatbots → Reasoners → Agents (We are here) → Innovators → Organizations
👇
Not a copilot. Not a helper.
A full SWE agent that submits PRs, runs tests, and ships features.
OpenAI just quietly confirmed it. 👇
🧵
Not a copilot. Not a helper.
A full SWE agent that submits PRs, runs tests, and ships features.
OpenAI just quietly confirmed it. 👇
🧵
Let's try a different approach
Let's try a different approach
Let's try a different approach
Let's try a different approach
Let's try a different approach
Let's try a different approach
Let's try a different approach
Let's try a different approach