momergul.bsky.social
@momergul.bsky.social
CS PhD Student @Cornell
Reposted
🌍Introducing BabyBabelLM: A Multilingual Benchmark of Developmentally Plausible Training Data!

LLMs learn from vastly more data than humans ever experience. BabyLM challenges this paradigm by focusing on developmentally plausible data

We extend this effort to 45 new languages!
October 15, 2025 at 10:53 AM
🚨Modeling Abstention via Selective Help-seeking

LLMs learn to use search tools to answer questions they would otherwise hallucinate on. But can this also teach them what they know vs not?

We introduce MASH that trains LLMs for search and gets abstentions for free!
October 2, 2025 at 7:40 PM
Reposted
The talk for our work on Retrospective Learning from Interactions, which will be in ACL (once I figure out how to squeeze it shorter)

Gist: autonomous post-training from conversational signals for LLM bootstrapping ... look ma, no annotations! no hand-holding! 🙌📈🚀

www.youtube.com/watch?v=qW8S...
Retrospective Learning from Interactions
YouTube video by Yoav Artzi
www.youtube.com
July 25, 2025 at 2:15 PM
Reposted
Close your books, test time!
The evaluation pipelines are out, baselines are released & the challenge is on

There is still time to join and
We are excited to learn from you on pretraining and human-model gaps

*Don't forget to fastEval on checkpoints
github.com/babylm/evalu...
📈🤖🧠
#AI #LLMS
May 9, 2025 at 2:20 PM