Ian Littman
ian.im
@ian.im
Software dev, historically PHP but currently Golang. Mod/infra @ phpc.social, co-organizer of longhornphp.com and mergephp.com, co-maintainer at Joind.in
If you want, I'll swap you a virtual ticket for an uncon talk at 19:30 UTC, and just bridge it through Discord. DM me somehow.
October 24, 2025 at 4:32 PM
So far it's rather introductory, thirty minutes in.

Gonna be in and out today due to meetings, among other things. Tomorrow should be clearer 😅
October 3, 2025 at 2:31 PM
Of course, the tricky bit is that big context windows take more VRAM, so you can only afford a smaller model with the same RAM. But a 1MM-token window also makes certain tasks possible, provided context rot isn't too bad.
September 7, 2025 at 9:39 PM
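A rough sketch of why long context eats memory: the KV cache grows linearly with the token count. The formula below is the standard one for transformer attention with grouped KV heads; the layer count, KV head count, and head dimension are hypothetical placeholders, not the specs of any model named above.

```python
def kv_cache_bytes(tokens: int, layers: int, kv_heads: int,
                   head_dim: int, bytes_per_value: int = 2) -> int:
    """Approximate KV cache size in bytes for a single sequence.

    The leading 2 accounts for one key tensor plus one value tensor per layer.
    bytes_per_value=2 assumes an fp16/bf16 cache (an assumption, not a given).
    """
    return 2 * layers * kv_heads * head_dim * bytes_per_value * tokens


# Hypothetical architecture, chosen only to make the arithmetic concrete.
HYPOTHETICAL_MODEL = dict(layers=48, kv_heads=4, head_dim=128)

if __name__ == "__main__":
    for tokens in (32_768, 1_000_000):
        gb = kv_cache_bytes(tokens, **HYPOTHETICAL_MODEL) / 1e9
        print(f"{tokens:>9,} tokens -> {gb:.1f} GB of KV cache")
```

Under those assumed numbers, a 1MM-token window needs on the order of 98 GB for the cache alone, which is memory that can't go to weights.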
What types of use cases? Guessing docs gen or code introspection/error explanation? For error explanation I've definitely had a bit better luck with local models (e.g. fp8 Qwen3 30B), which gives hope for a smallish code-specific model (<=32B).
September 7, 2025 at 9:08 PM
Added context: I use LLM codegen infrequently enough that my JetBrains All Products Pack subscription plus free tiers elsewhere cover all of my usage thus far.
September 7, 2025 at 8:45 PM
Note that I'm using Sonnet 4 as an example here because it can actually do useful code things, and we already know that Qwen Code 480B is, well, 480B, so you'd need a $10k Mac Studio to run that locally at fp8. Which...maybe that's worth the price of admission, but that's steeper than $300/mo!
September 7, 2025 at 8:29 PM
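The 480B-at-fp8 arithmetic above can be sketched as a back-of-the-envelope estimate: weights take roughly parameter count times bytes per weight. This ignores KV cache, activations, and runtime overhead, so treat the results as lower bounds; the function names are made up for illustration.

```python
def weight_memory_gb(params_billions: float, bits_per_weight: int) -> float:
    """Approximate memory for the weights alone, in decimal GB.

    One billion parameters at 8 bits each is about 1 GB.
    """
    return params_billions * bits_per_weight / 8


def max_params_billions(memory_gb: float, bits_per_weight: int) -> float:
    """Largest model (in billions of parameters) whose weights fit in memory_gb."""
    return memory_gb * 8 / bits_per_weight


if __name__ == "__main__":
    print(weight_memory_gb(480, 8))     # 480.0 GB for a 480B model at fp8
    print(weight_memory_gb(30, 8))      # 30.0 GB for a 30B model at fp8
    print(max_params_billions(112, 8))  # 112.0B params fit in 112 GB at fp8
    print(max_params_billions(112, 4))  # 224.0B params fit at 4-bit
```

So 112GB of usable memory caps out around 112B parameters at fp8, or roughly double that at 4-bit quantization, before leaving any room for context.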
Because if Sonnet 4 can be run in 128GB (more like 112GB net of other things), then it's conceivably possible to get an open-weights model that'll run efficiently locally, and cloud AI provider subsidy is no longer table stakes.

Could even argue this for 256/512GB, since linked DGX Sparks and the M3 Ultra exist
September 7, 2025 at 8:28 PM
Question is whether the pay-per-token pricing of either open-weights hosts or Google/Anthropic/OpenAI/xAI covers their costs. Obviously the rate limits on $300/mo plans don't, but curious about the rest.

Corollary question is whether Sonnet 4 is runnable in 128GB of unified memory 1/2
September 7, 2025 at 8:28 PM