interested in all things ai and distributed systems
alexzhang13.github.io/blog/2024/ef...
A re-recording of my NeurIPS tutorial on language modeling (plus some added content on the high-level state of things)
Blog + extra context: https://buff.ly/424VvLm
YouTube: https://buff.ly/40808l5
Slides: https://buff.ly/404jGa9
Evolution of AI-assisted coding features and developer interaction patterns. I go through the history of AI-assisted coding features, talk about how we interact with them, and use a gears analogy for the control vs. speed tradeoff.
sankalp.bearblog.dev/evolution-of...
A higher-level summary of the key decisions involved in scoping a problem, choosing a base model, optimization algorithm, etc. (+ some thoughts on OpenAI's RL Finetuning).
https://buff.ly/3ZpY5IR
📚 Material focused on instruction tuning, split into chat templates and supervised fine-tuning. There's more to the subject than this, but we're keeping things smol.
⏩ If you haven't already, try out module 1!
🧵
How to understand OpenAI's o1 models as really just one wacky, wonderful, long chain of thought.
A fun one trying to communicate intuitions for what large scale RL training does to LLMs. Much more to explore here in 2025!
zenml.io/llmops-datab...