Love building AI products.
Went rogue, built StyleShuffle at SFix.
Ex-physics PhD. Caltech.
It achieves state-of-the-art accuracy (79.6%) on the MATH benchmark with Qwen2.5-7B-Instruct, surpassing GPT-4o (76.6%) and Claude 3.5 (71.1%).
gist.github.com/yoavg/9142e5...
Link: www.ethanrosenthal.com/2024/11/19/y...
This is a very wonky post about configuring training loops for ML models 🧵