José Francisco Calvo
jfcalvo.hf.co
Software Engineer at 🤗 Hugging Face: crafting smarter AI tools, empowering innovation, and bridging tech with creativity!
Reposted by José Francisco Calvo
People are flexing their end of year stats, so I made this app to show @hf.co hub stats in a tidy design!

Thanks @jfcalvo.hf.co and @ameeelie.bsky.social for the feature!
December 19, 2024 at 1:28 PM
🚀 Argilla v2.6.0 is here! 🎉

Let me show you how EASY it is to export your annotated datasets from Argilla to the Hugging Face Hub. 🤩

Take a look at this quick demo 👇

💁‍♂️ More info about the release at github.com/argilla-io/a...

#AI #MachineLearning #OpenSource #DataScience #HuggingFace #Argilla
December 19, 2024 at 12:39 PM
Reposted by José Francisco Calvo
Imagine creating custom datasets and training AI models WITHOUT writing a single line of code. We did and made it a reality.

@hf.co Synthetic Data Generator

Blog: huggingface.co/blog
Space: huggingface.co/spaces/argil...
GitHub: github.com/argilla-io/s...
December 16, 2024 at 3:37 PM
Reposted by José Francisco Calvo
Desperate to contribute to the development of Scots language AI. I've just contributed 16 examples to this dataset:

data-is-better-together-fineweb-c.hf.space/share-your-p...
December 12, 2024 at 1:44 PM
Reposted by José Francisco Calvo
I've just contributed 156 examples to the FineWeb 2 Spanish dataset:

data-is-better-together-fineweb-c.hf.space/share-your-p...

If you want to contribute, sign in with @hf.co and find your language
December 12, 2024 at 1:24 PM
Reposted by José Francisco Calvo
Most liked and most downloaded open-source AI models from 2022 to 2024

Interactive viz: aiworld.eu/embed/model/...
Discussion: huggingface.co/spaces/huggi...
December 4, 2024 at 8:37 AM
The great @benburtenshaw.bsky.social is running an open course on fine-tuning smol LLMs, and it’s seriously worth checking out.

If you’re into AI or just curious about how these small language models work, this could be right up your alley. Don’t miss it—it’s super interesting!

#AI #LLMs #Learning
For anyone interested in fine-tuning or aligning LLMs, I’m running this free and open course called smol course. It’s not a big deal, it’s just smol.

🧵>>
December 4, 2024 at 10:31 AM
🙌 I just wanted to share a few thoughts about the latest Argilla release, 2.5.0, as it's a pretty big one!

Argilla now has full support for webhooks, which means you can do some pretty cool stuff, like model training on the fly as annotations are created. 🤯

#MachineLearning #NLP #DataLabeling
December 2, 2024 at 11:14 AM
Reposted by José Francisco Calvo
We’re looking for an intern to join our SmolLM team! If you’re excited about training LLMs and building high-quality datasets, we’d love to hear from you. 🤗

US: apply.workable.com/huggingface/...
EMEA: apply.workable.com/huggingface/...
ML Research Engineer Internship, SmolLMs pretraining and datasets - EMEA Remote - Hugging Face
Here at Hugging Face, we’re on a journey to advance good Machine Learning and make it more accessible. Along the way, we contribute to the development of technology for the better. We have built the fa...
apply.workable.com
November 27, 2024 at 10:20 AM
Did you know that we’re adding a new feature to Argilla to export labeled datasets directly to the Hugging Face Hub? 🤔

We’re leveraging the Hugging Face datasets library for seamless integration, including defining span labeling.

Stay tuned for the release!🧠✨

#MachineLearning #NLP #DataLabeling
November 26, 2024 at 12:52 PM