Alvaro Bartolome
alvarobartt.bsky.social
Alvaro Bartolome
@alvarobartt.bsky.social
machine learning @hf.co
Pinned
here we go again!

i work at hugging face and here you can expect posts about machine learning (llms mainly), some rust, some nvim nerdy stuff and anything related to hugging face 🤗

posting is not easy for me, but i’ll try to do better from now on, support is highly appreciated!
🧨 I built something with #Zig!

`tokeni.zig` is a std-only implementation of the Byte Pair Encoding (BPE) algorithm in Zig for tokenizing sequences of text, used by OpenAI (among many others) to tokenize the text when pretraining their large language models!

github.com/alvarobartt/...
March 13, 2025 at 3:50 PM
For anyone interested in Zig I wrote a small post titled "How to read and parse JSON with Zig 0.13" that explains how to read JSON from a file with keys with different value types and how to access those values.
February 10, 2025 at 4:15 PM
love this quote "working smarter helps, but the real superpower is resting smarter"

a highly recommended read!
Perhaps unsurprisingly, I have a bunch of Opinions™ on work-life balance and time management. Lately, I've been particularly bugged by the common belief that employer incentives don't align with employee well-being. So, I wrote a thing (with graphs!): thesquareplanet.com/blog/about-4...
About 40 hours
I often hear, especially from folks working at younger companies and the tech giants, that the 40 hour workweek is a lie. That “the company” secretly (or not so secretly) always wants you to work more...
thesquareplanet.com
February 3, 2025 at 9:05 AM
🤗 Here's a simple script that calculates the required VRAM for serving DeepSeek R1 from @huggingface Hub safetensor's metadata!

P.S. The result of the script above is: "model_id='deepseek-ai/DeepSeek-R1' requires memory=756.716GB"
January 31, 2025 at 4:04 PM
hmm refactoring in zig is not as easy as it's in rust, even though seems fairly common too, right? or is it just me? 🤔
January 31, 2025 at 8:31 AM
stuff that matters takes time
January 29, 2025 at 12:28 PM
Reposted by Alvaro Bartolome
Last moments of closed-source AI 🪦 :
Hugging Face is openly reproducing the pipeline of 🐳 DeepSeek-R1. Open data, open training. open models, open collaboration.

🫵 Let's go!
github.com/huggingface/...
GitHub - huggingface/open-r1: Fully open reproduction of DeepSeek-R1
Fully open reproduction of DeepSeek-R1. Contribute to huggingface/open-r1 development by creating an account on GitHub.
github.com
January 25, 2025 at 2:36 PM
🐐 DeepSeek is not on the @hf.co Hub to take part, they are there to take over!

Amazing stuff from the DeepSeek team, ICYMI they recently released some reasoning models (DeepSeek-R1 and DeepSeek-R1-Zero), fully open-source, their performance is on par with OpenAI-o1 and it's MIT licensed!
January 23, 2025 at 1:45 PM
you can find so much gold in github gists wow, i was not a big fan because the discoverability doesn't seem great, but been exploring gists lately and so much gold stuff in there!
January 23, 2025 at 8:10 AM
in case anyone missed it, we're running a certified course on ai agents at hugging face starting on feb 2nd; the course is on how to build you own ai agents for different cool use cases built on top of open source!

👇 you can sign up in the link below, don't miss it!

bit.ly/hf-learn-age...
Hugging Face
Hugging Face Email Forms
bit.ly
January 22, 2025 at 12:24 PM
ok, here we go again 😅
January 22, 2025 at 8:23 AM
Not quite sure yet about how's following me here, but I may consider not just x-posting but also eventually post more random thoughts + content in Spanish, is that something you'd be interested in?
November 27, 2024 at 7:43 AM
here we go again!

i work at hugging face and here you can expect posts about machine learning (llms mainly), some rust, some nvim nerdy stuff and anything related to hugging face 🤗

posting is not easy for me, but i’ll try to do better from now on, support is highly appreciated!
November 20, 2024 at 9:30 AM
💡 Did you know that you can use over 13700 public open models and adapters on the @huggingface Hub for FREE?

You just need a free account on the Hugging Face Hub (you can also subscribe to PRO to increase the requests per hour)

More details on the thread 🧵
November 19, 2024 at 4:15 PM
Reposted by Alvaro Bartolome
Bluesky pro-tip: you can set your domain as a handle:

bsky.social/about/blog/4...

Bonus: you can keep your handle if you ever want to move to another server in the future.
How to set your domain as your handle - Bluesky
Using a domain as your handle helps with account identity, verification, and portability. Here's how to set your domain as your handle.
bsky.social
November 18, 2024 at 8:31 PM
where the nvim-nerds at in here?
November 19, 2024 at 10:51 AM