Here's what I mostly write about in this account:
- Code / AI-related experiments, tools
- Security incidents that alarm me
- Engineering architecture, algorithms
- A few photos from time to time
Gated DeltaNet hybrids (Qwen3-Next, Kimi Linear), text diffusion, code world models, and small reasoning transformers.
🔗 magazine.sebastianraschka.com/p/beyond-sta...
www.planetary.org/space-images...
Full article: thenewstack.io/ken-thompson...
pytorch.org/blog/introdu...
More notes on my blog: simonwillison.net/2025/Oct/23/...
My notes here: simonwillison.net/2025/Oct/4/d...
#Brooklyn #bumpintheroad #Fujifilm #photography #cityscapes
blog.google/technology/r...
• Double video length → 4x more energy
• A 5-second clip ≈ 1 hour of microwave use
• AI already = ~20% of datacenter power
This raises tough questions:
• Can we innovate and stay energy-efficient?
• Should AI platforms report power metrics?