Amirhossein Kazemnejad
banner
a-kazemnejad.bsky.social
Amirhossein Kazemnejad
@a-kazemnejad.bsky.social
Working on RL training of LLMs @Mila_Quebec.
Introducing nanoAhaMoment: Karpathy-style, single file RL for LLM library (<700 lines)

- super hackable
- no TRL / Verl, no abstraction💆‍♂️
- Single GPU, full param tuning, 3B LLM
- Efficient (R1-zero countdown < 10h)

comes with a from-scratch, fully spelled out YT video [1/n]
April 4, 2025 at 7:58 PM