Current: Storage infra things 🤗 @hf.co
Former: Devops/WordPress things @lexblog.bsky.social and devex/cloud infra things at @pantheon.io
When I step away from the computer you'll find me with food, Moraine (our dog), or out in nature.
More at: huggingface.co/datasets/jsu...
Our use of AI is still in the early stages and none of what is to come is preordained. We can paint a different future for ourselves that is better than the dreary present of Web 2.0.
"Success is a lousy navigation system for the second half of life."
Potentially a lousy system for any half of your life, but more pressing as time goes on.
A little over a year ago, @hf.co acquired XetHub to unlock the next phase of growth in models and datasets. huggingface.co/blog/xethub-...
In April, there were 1,000 Hugging Face repos on Xet. Now every repo (over 6M) on the Hub is on Xet.
Good reminder that these genies still rely on solid infrastructure, good evaluations, and constant monitoring.
"Personal devices like glasses that understand our context because they can see what we see, hear what we hear, and interact with us throughout the day will become our primary computing devices."
I celebrated by reviving the early 2000s web design aesthetics that I love so much. Here's our dashboard showing our progress converting the Hub from Git LFS to Xet (and demonstrating my questionable design sensibilities).
"There have been a series of experiences that have helped me realize more of my agency, but I think the most important one was becoming a father"
💯💯💯💯
without any interruptions. Now we're migrating the rest of the Hub. We got this far by focusing on the community first.
Here's a deep dive on the infra making this possible and what's next: huggingface.co/blog/migrati...
Some fun tidbits in here, like how we use our NAT gateway as a cost sentinel. Cloud infra costs are no joke.
FineWeb-C v1 is complete! Communities worldwide have built their own educational quality datasets, proving that we don't need to wait for big tech to support languages.
Huge thanks to all who contributed!
huggingface.co/blog/davanst...
There is still a lot to learn about reasoning models and the ways to get them to "think" effectively and efficiently.
Head of Signal, Meredith Whittaker, on so-called "agentic AI" and the difference between how it's described in the marketing and what access and control it would actually require to work as advertised.
It makes questionable inventory and sales decisions, loses most of its money, and has an identity crisis.
Not *so* far off from how I would perform.
It will set at 21:11, 2 seconds earlier than the day before.
A month ago there were 5,500 users/orgs on Xet with 150K repos and 4PB. Today?
🤗 700,000 users/orgs
📈 350,000 repos
🚀 15PB