David Hall
dlwh.bsky.social
David Hall
@dlwh.bsky.social
Research Engineering Lead at @StanfordCRFM. I do NLP and foundation model things with JAX. Previously Semantic Machines, Microsoft, Berkeley, Breeze
Have a specific use case? Come to our Datashop to curate data and train models.
Here’s how we curated more math data:
github.com/marin-commun...
Check out the data:
marin.community/data-browser/
May 19, 2025 at 7:51 PM
Have a new algorithm for training? Choose your compute budget and get on the speedrun leaderboard: how fast can you drive down validation loss?
marin.community/speedrun/
May 19, 2025 at 7:51 PM
Marin (marin.community) repurposes GitHub, which has been successful for open-source *software*, for AI:
1. Preregister an experiment as a GitHub issue
2. Submit a PR, which implements the experiment in code
3. PR is reviewed by experts in the community
4. Watch the execution of the experiment live!
May 19, 2025 at 7:51 PM
Marin is a new "open lab" for developing foundation models. More than open weights, and even open source, with Marin we're committing to "open development": everything is documented and traceable, and anyone can contribute.
May 19, 2025 at 7:51 PM