Gabriel Martín Blázquez
gabrielmb.com
Gabriel Martín Blázquez
@gabrielmb.com
ML Engineer @hf.co 🤗 Building tools for you to take care of your datasets like Argilla or distilabel!
SmolLM2 paper is out! We wrote a paper detailing the steps we took to train one of the best smol LM 🤏 out there: pre-training and post-training data, training ablations and some interesting findings 💡

Go check it out and don't hesitate to write your thoughts/questions in the comments section!
February 6, 2025 at 10:56 AM
distilabel ⚗️ reached the 2k ⭐️ on GitHub!
January 27, 2025 at 3:58 PM