Ben Burtenshaw
banner
benburtenshaw.bsky.social
Ben Burtenshaw
@benburtenshaw.bsky.social
Building tools for AI datasets. 😽
Looking in AI datasets. 🙀
Sharing clean open AI datasets. 😻

at https://bsky.app/profile/hf.co
The science team at Hugging Face reproduced and open source the seek r1. https://buff.ly/4jtbp8x
GitHub - huggingface/open-r1: Fully open reproduction of DeepSeek-R1
Fully open reproduction of DeepSeek-R1. Contribute to huggingface/open-r1 development by creating an account on GitHub.
buff.ly
January 27, 2025 at 10:00 AM
quiz app https://buff.ly/4atPzxo
dataset with questions https://buff.ly/3ClY9Sm
agents course we're working on https://buff.ly/4gehzqi
Dataset Quiz - a Hugging Face Space by burtenshaw
A quiz app for rows of a dataset
buff.ly
January 24, 2025 at 11:08 AM
Here's how it works:

- make a dataset of multiple choice questions
- duplicate the space add set the dataset repo
- log in and do the quiz
- submit the questions to create a new dataset

I made this to get ready for the agents course, but I hope it's useful for you projects too!
January 24, 2025 at 11:08 AM
Here's a blog post I wrote with the details https://buff.ly/4gVpudi
Gradio spaces are the perfect agent tools\!
A Blog post by ben burtenshaw on Hugging Face
huggingface.co
January 17, 2025 at 10:00 AM
Here's a collection with tools for:

- create a plotly visualisation
- get travel duration
- transcribe youtube video
- transform image

https://buff.ly/3PAU6od
Tools 4 Agents - a burtenshaw Collection
This is a collection of spaces on the hub that are useful for building agents. https://huggingface.co/docs/smolagents/en/tutorials/tools
buff.ly
January 15, 2025 at 10:00 AM
These should setup a few cool agent application, but if not it's easy to build a tool within a gradio application. Here's a guide:

https://buff.ly/3Wm2ZG1
Tools
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
buff.ly
January 15, 2025 at 10:00 AM
Here's the chapter on agent in smol course: https://buff.ly/3Cf5NOf
smol-course/8_agents at main · huggingface/smol-course
A course on aligning smol models. Contribute to huggingface/smol-course development by creating an account on GitHub.
github.com
January 13, 2025 at 10:00 AM
❓What we need now?
Most of use aren't building systems to solve frontier math problems on a daily basis. Shucks! That means we need reward models and representative datasets that represent the kinds of problems we're trying to solve. Crucially, in the domains and languages we're actually working!
December 27, 2024 at 11:00 AM
⏩ What does it mean for us builders?
As these approaches develop, we can use small models on our use cases, and increase inference for challenging domain specific tasks. This means that for most tasks models need minimal compute, but for complex tasks we'll scale up compute.
December 27, 2024 at 11:00 AM