Avik Dey
banner
avikdey.bsky.social
Avik Dey
@avikdey.bsky.social
Mostly Data, ML, OSS & Society • Stop chasing Approximately Generated Illusions; focus on Specialized Small LMs • To understand it well enough, learn to explain it simply • Shadow self of https://linkedin.com/in/avik-dey, have a beard now
Just wait till LLMs come prebuilt with world models. That will be the underpinnings, it’s innate cognitive structure, that will help it understand language - this time.
December 5, 2025 at 4:48 PM
I hope so too. But, sanity seems to be scarce these days.
November 28, 2025 at 7:18 PM
I dislike how we still can’t edit posts here - read as “Could be reading …”
November 28, 2025 at 5:42 PM
Could you be reading your tweets instead, that would keep him busy full time.
November 28, 2025 at 5:37 PM
For digital only products - yes. Unfortunately, it’s going to manifest itself in physical products as well. These models are still under human supervision when used in safety critical products. Most physical products are self use - that’s where any breakdown in reliability will be widely felt.
November 28, 2025 at 5:31 PM
This shift will run downstream too. Once “works some of the time” is acceptable for models, it will seep into product pipelines. From marketing gloss to development then eventually in the supply chain. Reliability becomes optional not just for the models but in the end product itself. It’s coming.
November 28, 2025 at 4:43 PM
LLM’s internal monologue:

Real world mode: “Bro … this decision tree looks like a whole forest.”

Eval mode: “Ahh, a single pruned leaf … perfect boys - I got this.”
November 27, 2025 at 8:54 PM
Some prefer spoken words over reading either by choice or sometimes handicap - so, I prefer both exist.
November 26, 2025 at 9:53 PM
While most of it is based on solving for his vivid imagination regarding AGI, there are some interesting tidbits sprinkled around. Like - while RL optimizes for the reward function, it also distorts the latent space such that the global performance actually degrades - was probably the best one.
November 25, 2025 at 11:13 PM
Because whenever she’s reporting “news” - it’s debatable.
November 25, 2025 at 6:50 PM
The darn thing is a tool that’s easily lost in the matrix woods without handholding from an expert in the domain at hand. The only reason for these comparisons, is so these companies can claim - look our “AI” can replace those 5 PhDs you were going to hire, for only $20k a month.
November 24, 2025 at 10:05 PM