* Paper: arxiv.org/abs/2502.04671
* Code: github.com/trishullab/p...
Proofwala allows the collection of proof-step data from multiple proof assistants (Coq and Lean) and multilingual training. (1/3)
The o1/o3 path to math reasoning is based on LLMs and large-scale test-time search. We argue for a different path that uses formal proof assistants for
✅ creating high-quality synthetic data
✅ rigorous test-time feedback. (1/2)
If you work on frontier AI for math/reasoning, talk to George!
Our method, LaSR, conditions mutation/crossover operators on (1) an LLM's general domain knowledge, and (2) LLM-generated abstractions of high-performing programs. (1/2)
drive.google.com/file/d/1ybQx...
My X account has been hacked. The hacker changed the account email, and X won't return access to me because I don't know what it was changed to. 🤡
Oh well, good riddance and sorry I didn't quit sooner.