✨ http://rwiz.ai - Handling reviews with AI
🎹 http://pianocompanion.info - Chords dictionary app with 1M+ downloads.
🕹️ http://chordiq.info - Learn chords.
📝 desunit.com - my blog
What happens when reasoning keeps improving, but humans keep arguing using 2022 mental models?
... just saying.
But the old argument, "Just a parrot, repeating old stuff on loop", is getting weaker every month.
Yet people still say: AI can’t handle unknown equations... AI isn’t creative...
This is basically checkmate.
This does not mean AI can write 60% of math papers.
The results?
‼️ Top models get ~50–60% correct answers ‼️
GPT-5.2 - 60%.
Gemini-3-Pro is right behind
These are problems that:
> an average human cannot solve at all
> many math grads would struggle with
> minimal training contamination
> no memorization of a static benchmark
It’s won by whoever ships early, floods the market, and improves in public.
Waiting for v1.0 is how you end up losing to someone who ships v0.1 at scale... a really large scale.
But who cares?!
Volume creates learning loops →
Learning loops create cost drops →
Cost drops create adoption →
Adoption creates dominance
Exactly what we've seen with EVs. Same logic shows up in AI adoption.
- Proprietary data still matters
- Whoever owns the chat interface becomes the new aggregator
Yes, it’s painful, especially if you’ve spent years building beautiful UX, but the reality is simple:
Takeaways:
When the interface disappears, all that’s left is API vs API.
- interface
- data
That’s why vertical software could charge premium prices.
But it looks like LLMs change that.
The LLM chat becomes the interface.
LLMs are slow. Humans shouldn’t be idle while they think.
If you have experience and can keep context in your head, AI turns you into a force multiplier.
I always believed context switching is bad. And it probably still is.
> Manage parking spots
…and a lot more
Could this have been done this fast a couple of years ago? I doubt it.
> Not with this scope.
> Not just me.
> And definitely not while juggling other projects.
> Send invoices
> Collect cold water meter readings
> Ping tenants who forgot to submit them
You know how it works - happy wife, happy family.