Lightnews — Scholar-powered news

Ben Burtenshaw

@benburtenshaw.bsky.social

4.3K followers 210 following 190 posts

Building tools for AI datasets. 😽
Looking in AI datasets. 🙀
Sharing clean open AI datasets. 😻

at https://bsky.app/profile/hf.co


Ben Burtenshaw

@benburtenshaw.bsky.social

The science team at Hugging Face reproduced and open-sourced DeepSeek-R1. https://buff.ly/4jtbp8x

GitHub - huggingface/open-r1: Fully open reproduction of DeepSeek-R1

Fully open reproduction of DeepSeek-R1. Contribute to huggingface/open-r1 development by creating an account on GitHub.

buff.ly

January 27, 2025 at 10:00 AM

Ben Burtenshaw

@benburtenshaw.bsky.social

quiz app https://buff.ly/4atPzxo
dataset with questions https://buff.ly/3ClY9Sm
agents course we're working on https://buff.ly/4gehzqi

Dataset Quiz - a Hugging Face Space by burtenshaw

A quiz app for rows of a dataset

buff.ly

January 24, 2025 at 11:08 AM

Ben Burtenshaw

@benburtenshaw.bsky.social

Here's how it works:

- make a dataset of multiple choice questions
- duplicate the space and set the dataset repo
- log in and do the quiz
- submit the questions to create a new dataset

I made this to get ready for the agents course, but I hope it's useful for your projects too!
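As a rough illustration of the first and last steps above, here is a minimal sketch of a multiple-choice question dataset and a graded results dataset in plain Python. The field names (`question`, `options`, `answer`, `response`, `is_correct`) and the helper functions are assumptions made for this example, not the quiz Space's actual schema or code.

```python
import json

# Hypothetical rows: these column names are assumptions, not the Space's real schema.
QUESTIONS = [
    {"question": "What is 2 + 2?", "options": ["3", "4", "5", "22"], "answer": "4"},
    {
        "question": "Which of these is a Python list?",
        "options": ["(1, 2)", "{1, 2}", "[1, 2]", "'12'"],
        "answer": "[1, 2]",
    },
]


def validate(rows):
    """Check that every row has the expected fields and an answer among its options."""
    for i, row in enumerate(rows):
        missing = {"question", "options", "answer"} - row.keys()
        if missing:
            raise ValueError(f"row {i} is missing fields: {sorted(missing)}")
        if row["answer"] not in row["options"]:
            raise ValueError(f"row {i}: answer is not one of the options")


def grade(rows, responses):
    """Pair each question with the given response and mark it correct or not."""
    return [
        {"question": row["question"], "response": resp, "is_correct": resp == row["answer"]}
        for row, resp in zip(rows, responses)
    ]


def to_jsonl(rows):
    """Serialize rows as JSON Lines, one row per line."""
    return "\n".join(json.dumps(row) for row in rows)


validate(QUESTIONS)
results = grade(QUESTIONS, ["4", "(1, 2)"])
print(to_jsonl(results))
```

The JSON Lines text could then be saved to a file and uploaded as a new dataset repo; that upload step is left out here.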

January 24, 2025 at 11:08 AM

Ben Burtenshaw

@benburtenshaw.bsky.social

Here's a blog post I wrote with the details https://buff.ly/4gVpudi

Gradio spaces are the perfect agent tools!

A Blog post by ben burtenshaw on Hugging Face

huggingface.co

January 17, 2025 at 10:00 AM

Ben Burtenshaw

@benburtenshaw.bsky.social

Here's a collection with tools to:

- create a plotly visualisation
- get travel duration
- transcribe youtube video
- transform image

https://buff.ly/3PAU6od

Tools 4 Agents - a burtenshaw Collection

This is a collection of spaces on the hub that are useful for building agents. https://huggingface.co/docs/smolagents/en/tutorials/tools

buff.ly

January 15, 2025 at 10:00 AM

Ben Burtenshaw

@benburtenshaw.bsky.social

These should set up a few cool agent applications, but if not, it's easy to build a tool within a Gradio application. Here's a guide:

https://buff.ly/3Wm2ZG1

Tools

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

buff.ly

January 15, 2025 at 10:00 AM

Ben Burtenshaw

@benburtenshaw.bsky.social

Here's the chapter on agents in the smol course: https://buff.ly/3Cf5NOf

smol-course/8_agents at main · huggingface/smol-course

A course on aligning smol models. Contribute to huggingface/smol-course development by creating an account on GitHub.

github.com

January 13, 2025 at 10:00 AM

Ben Burtenshaw

@benburtenshaw.bsky.social

❓ What do we need now?
Most of us aren't building systems to solve frontier math problems on a daily basis. Shucks! That means we need reward models and datasets that represent the kinds of problems we're trying to solve. Crucially, in the domains and languages we're actually working in!

December 27, 2024 at 11:00 AM

Ben Burtenshaw

@benburtenshaw.bsky.social

⏩ What does it mean for us builders?
As these approaches develop, we can use small models on our use cases and increase inference-time compute for challenging domain-specific tasks. This means that most tasks need minimal compute, and we only scale up compute for complex tasks.
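One way to picture this is best-of-N sampling with a per-task compute budget: draw one candidate for easy tasks and many for hard ones, then keep the candidate a reward function scores highest. The sketch below is a toy under loud assumptions: `generate` and `reward` are stand-ins for a small model and a reward model, and the budget values are made up for illustration.

```python
import random


def generate(prompt, rng):
    """Stand-in for a small model: returns a random integer as its 'answer'."""
    return rng.randint(0, 20)


def reward(candidate, target):
    """Stand-in for a reward model: higher means closer to the target."""
    return -abs(candidate - target)


def samples_for(difficulty):
    """Hypothetical compute budget: more samples for harder tasks."""
    return {"easy": 1, "hard": 16}[difficulty]


def best_of_n(prompt, n, target, seed=0):
    """Draw n candidates and keep the one the reward function scores highest."""
    rng = random.Random(seed)
    candidates = [generate(prompt, rng) for _ in range(n)]
    return max(candidates, key=lambda c: reward(c, target))


prompt = "toy task"
easy_answer = best_of_n(prompt, samples_for("easy"), target=7)
hard_answer = best_of_n(prompt, samples_for("hard"), target=7)
print(easy_answer, hard_answer)
```

Because both calls use the same seed, the larger sample contains the single candidate from the smaller one, so the 16-sample answer never scores worse than the 1-sample answer here.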

December 27, 2024 at 11:00 AM
