Lightnews — Scholar-powered news

Yonatan Lavy

@yonatanlavy.bsky.social

Thanks Sonnet 4.5

October 4, 2025 at 7:49 AM

Yonatan Lavy

@yonatanlavy.bsky.social

Okay, GPT-5 is live. But is it the game-changer we all expected?

After an initial dive into the specs, the model, and the API, here's my take: The real story isn't one single feature, but the entire package.

🧵 of my thoughts below 👇

August 7, 2025 at 6:28 PM

Yonatan Lavy

@yonatanlavy.bsky.social

GPT 5 is the same level as Opus 4.1 (on SWE bench)

@OpenAI and @AnthropicAI seems to stay near the top together, this is amazing news that top competitors are this close to each other

August 7, 2025 at 5:14 PM

Yonatan Lavy

@yonatanlavy.bsky.social

Claude Code is dancing for me

What does your AI IDE do for you?

June 24, 2025 at 3:00 PM

Yonatan Lavy

@yonatanlavy.bsky.social

lies.

May 30, 2025 at 4:34 PM

Yonatan Lavy

@yonatanlavy.bsky.social

Fine, I'll build it myself

May 28, 2025 at 6:41 PM

Yonatan Lavy

@yonatanlavy.bsky.social

I love that I'm able to have 3 agents running in parallel on my codebase in cursor

Each has it's own agent mode, planning, executing or critiquing the execution of major features.

For planning I still use dotallio, but the code itself, cursor is hands down the best

May 20, 2025 at 2:00 PM

Yonatan Lavy

@yonatanlavy.bsky.social

3/
Now let's look at the knowledge cutoff date.

If you access the "latest" 4o model via the api - it shows you "April 2024"

But accessing the 4o model via chat - shows "June 2024" - this is the same as 4.1!

April 15, 2025 at 2:04 PM

Yonatan Lavy

@yonatanlavy.bsky.social

2/
In the benchmark above it shows 4.1 scoring about 66% on GPQA diamond.

I've found one match for GPQA benchmark for the updated 4o 2025-march -
AND IT MATCHES EXACTLY. So the 4.1 model shows the same performance as the march 2025 gpt-4o.

(can see more in the link at the end)

April 15, 2025 at 2:04 PM

Yonatan Lavy

@yonatanlavy.bsky.social

1/
OpenAI claims massive improvements on GPT-4.1 over the GPT-4o - but in the small details you can see in the sub header it says "2024-11-20".

But they've updated their 4o just recently, along with when the image generation came out.

Let's dig in deeper..

April 15, 2025 at 2:04 PM

Yonatan Lavy

@yonatanlavy.bsky.social

OpenAI just lied.

They "Launched" GPT 4.1, even though it appears to be the SAME EXACT MODEL as the recently launched updated 4o on March.

Let me show you the exact details 🧵👇

April 15, 2025 at 2:04 PM

Yonatan Lavy

@yonatanlavy.bsky.social

Watching three Cursor agents write my app in parallel…

Felt like witnessing the start of AGI

Cursor didn’t just help me code faster,
it made me an AI agent and multiplied my work 3x.

April 14, 2025 at 11:07 AM

Yonatan Lavy

@yonatanlavy.bsky.social

4/
This isn’t a moonshot.
OpenAI’s own roadmap lays it out:
Chatbots → Reasoners → Agents (We are here) → Innovators → Organizations

👇

April 13, 2025 at 9:53 AM

Yonatan Lavy

@yonatanlavy.bsky.social

The next dev joining your team… might be an AI agent.
Not a copilot. Not a helper.
A full SWE agent that submits PRs, runs tests, and ships features.

OpenAI just quietly confirmed it. 👇
🧵

April 13, 2025 at 9:53 AM

Yonatan Lavy

@yonatanlavy.bsky.social

Code breaks, make personalized ai apps without it
@ Dotallio.com

(No complex drag and drop either)

April 7, 2025 at 7:07 PM

Yonatan Lavy

@yonatanlavy.bsky.social

Is this for real? gpt 5 open source ?!?

April 1, 2025 at 8:18 AM

Yonatan Lavy

@yonatanlavy.bsky.social

I asked chatgpt to add a way to chat with your outie whilst you are refining at MDR

March 31, 2025 at 9:57 PM

Yonatan Lavy

@yonatanlavy.bsky.social

You've heard of studio ghibli-fying photos, but have you heard of selfy-fying paintings?

March 30, 2025 at 7:51 PM

Yonatan Lavy

@yonatanlavy.bsky.social

what they do with all those new gpt-4o image subscription money

March 29, 2025 at 9:46 PM

Yonatan Lavy

@yonatanlavy.bsky.social

It do be like that

March 29, 2025 at 9:00 PM

Yonatan Lavy

@yonatanlavy.bsky.social

look, I made a guide to gpt-4o image gen

March 26, 2025 at 9:58 PM

Yonatan Lavy

@yonatanlavy.bsky.social

My feed rn

March 26, 2025 at 9:32 PM

Yonatan Lavy

@yonatanlavy.bsky.social

New keyboard, who dis?

March 26, 2025 at 8:09 AM

Yonatan Lavy

@yonatanlavy.bsky.social

How I feel like when I vibe code a feature in 5 minutes

March 21, 2025 at 9:59 AM

Yonatan Lavy

@yonatanlavy.bsky.social

Let's try a different approach
Let's try a different approach
Let's try a different approach
Let's try a different approach
Let's try a different approach

March 19, 2025 at 11:41 AM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news