Girish Ganesan
girish0110.bsky.social
Girish Ganesan
@girish0110.bsky.social
Data science, generative art, ML interp
Reposted by Girish Ganesan
We outperform Llama 70B with Llama 3B on hard math by scaling test-time compute 🔥

How? By combining step-wise reward models with tree search algorithms :)

We're open sourcing the full recipe and sharing a detailed blog post 👇
December 16, 2024 at 5:08 PM