DM me ml questions
- Improvement varies a lot by dataset. There's a huge improvement on GSM8K, but the ARC improvement is at most 1.6%
arxiv.org/abs/2411.04282
arxiv.org/pdf/2110.14168