Ralith
ralith.com
Ralith
@ralith.com
Reposted by Ralith
December 29, 2025 at 12:55 AM
Reposted by Ralith
Package managers keep using git as a database, it never works out.

https://nesbitt.io/2025/12/24/package-managers-keep-using-git-as-a-database.html
Package managers keep using git as a database, it never works out
Using git as a database is a seductive idea. You get version history for free. Pull requests give you a review workflow. It’s distributed by design. GitHub will host it for free. Everyone already knows how to use it. Package managers keep falling for this. And it keeps not working out. ## Cargo The crates.io index started as a git repository. Every Cargo client cloned it. This worked fine when the registry was small, but the index kept growing. Users would see progress bars like “Resolving deltas: 74.01%, (64415/95919)” hanging for ages, the visible symptom of Cargo’s libgit2 library grinding through delta resolution on a repository with thousands of historic commits. The problem was worst in CI. Stateless environments would download the full index, use a tiny fraction of it, and throw it away. Every build, every time. RFC 2789 introduced a sparse HTTP protocol. Instead of cloning the whole index, Cargo now fetches files directly over HTTPS, downloading only the metadata for dependencies your project actually uses. (This is the “full index replication vs on-demand queries” tradeoff in action.) By April 2025, 99% of crates.io requests came from Cargo versions where sparse is the default. The git index still exists, still growing by thousands of commits per day, but most users never touch it. ## Homebrew GitHub explicitly asked Homebrew to stop using shallow clones. Updating them was “an extremely expensive operation” due to the tree layout and traffic of homebrew-core and homebrew-cask. Users were downloading 331MB just to unshallow homebrew-core. The .git folder approached 1GB on some machines. Every `brew update` meant waiting for git to grind through delta resolution. Homebrew 4.0.0 in February 2023 switched to JSON downloads for tap updates. The reasoning was blunt: “they are expensive to git fetch and git clone and GitHub would rather we didn’t do that… they are slow to git fetch and git clone and this provides a bad experience to end users.” Auto-updates now run every 24 hours instead of every 5 minutes, and they’re much faster because there’s no git fetch involved. ## CocoaPods CocoaPods is the package manager for iOS and macOS development. It hit the limits hard. The Specs repo grew to hundreds of thousands of podspecs across a deeply nested directory structure. Cloning took minutes. Updating took minutes. CI time vanished into git operations. GitHub imposed CPU rate limits. The culprit was shallow clones, which force GitHub’s servers to compute which objects the client already has. The team tried various band-aids: stopping auto-fetch on `pod install`, converting shallow clones to full clones, sharding the repository. The CocoaPods blog captured it well: “Git was invented at a time when ‘slow network’ and ‘no backups’ were legitimate design concerns. Running endless builds as part of continuous integration wasn’t commonplace.” CocoaPods 1.8 gave up on git entirely for most users. A CDN became the default, serving podspec files directly over HTTP. The migration saved users about a gigabyte of disk space and made `pod install` nearly instant for new setups. ## Go modules Grab’s engineering team went from 18 minutes for `go get` to 12 seconds after deploying a module proxy. That’s not a typo. Eighteen minutes down to twelve seconds. The problem was that `go get` needed to fetch each dependency’s source code just to read its go.mod file and resolve transitive dependencies. Cloning entire repositories to get a single file. Go had security concerns too. The original design wanted to remove version control tools entirely because “these fragment the ecosystem: packages developed using Bazaar or Fossil, for example, are effectively unavailable to users who cannot or choose not to install these tools.” Beyond fragmentation, the Go team worried about security bugs in version control systems becoming security bugs in `go get`. You’re not just importing code; you’re importing the attack surface of every VCS tool on the developer’s machine. GOPROXY became the default in Go 1.13. The proxy serves source archives and go.mod files independently over HTTP. Go also introduced a checksum database (sumdb) that records cryptographic hashes of module contents. This protects against force pushes silently changing tagged releases, and ensures modules remain available even if the original repository is deleted. ## Beyond package managers The same pattern shows up wherever developers try to use git as a database. Git-based wikis like Gollum (used by GitHub and GitLab) become “somewhat too slow to be usable” at scale. Browsing directory structure takes seconds per click. Loading pages takes longer. GitLab plans to move away from Gollum entirely. Git-based CMS platforms like Decap hit GitHub’s API rate limits. A Decap project on GitHub scales to about 10,000 entries if you have a lot of collection relations. A new user with an empty cache makes a request per entry to populate it, burning through the 5,000 request limit quickly. If your site has lots of content or updates frequently, use a database instead. Even GitOps tools that embrace git as a source of truth have to work around its limitations. ArgoCD’s repo server can run out of disk space cloning repositories. A single commit invalidates the cache for all applications in that repo. Large monorepos need special scaling considerations. ## The pattern The hosting problems are symptoms. The underlying issue is that git inherits filesystem limitations, and filesystems make terrible databases. **Directory limits.** Directories with too many files become slow. CocoaPods had 16,000 pod directories in a single Specs folder, requiring huge tree objects and expensive computation. Their fix was hash-based sharding: split directories by the first few characters of a hashed name, so no single directory has too many entries. Git itself does this internally with its objects folder, splitting into 256 subdirectories. You’re reinventing B-trees, badly. **Case sensitivity.** Git is case-sensitive, but macOS and Windows filesystems typically aren’t. Check out a repo containing both `File.txt` and `file.txt` on Windows, and the second overwrites the first. Azure DevOps had to add server-side enforcement to block pushes with case-conflicting paths. **Path length limits.** Windows restricts paths to 260 characters, a constraint dating back to DOS. Git supports longer paths, but Git for Windows inherits the OS limitation. This is painful with deeply nested node_modules directories, where `git status` fails with “Filename too long” errors. **Missing database features.** Databases have CHECK constraints and UNIQUE constraints; git has nothing, so every package manager builds its own validation layer. Databases have locking; git doesn’t. Databases have indexes for queries like “all packages depending on X”; with git you either traverse every file or build your own index. Databases have migrations for schema changes; git has “rewrite history and force everyone to re-clone.” The progression is predictable. Start with a flat directory of files. Hit filesystem limits. Implement sharding. Hit cross-platform issues. Build server-side enforcement. Build custom indexes. Eventually give up and use HTTP or an actual database. You’ve built a worse version of what databases already provide, spread across git hooks, CI pipelines, and bespoke tooling. None of this means git is bad. Git excels at what it was designed for: distributed collaboration on source code, with branching, merging, and offline work. The problem is using it for something else entirely. Package registries need fast point queries for metadata. Git gives you a full-document sync protocol when you need a key-value lookup. If you’re building a package manager and git-as-index seems appealing, look at Cargo, Homebrew, CocoaPods, Go. They all had to build workarounds as they grew, causing pain for users and maintainers. The pull request workflow is nice. The version history is nice. You will hit the same walls they did.
nesbitt.io
December 24, 2025 at 4:49 PM
Reposted by Ralith
it's nice out
December 23, 2025 at 3:03 AM
Reposted by Ralith
this is also an absurd scenario tho, since the frontier of mathematical exploration grows rather than shrinking as new discoveries are made and inevitably there will be matters the mechanical hypermathematicians haven't gotten around to yet, whatever their priorities are
if a future ai system got so incredibly good at math as to make human contribution to mathematics utterly useless by comparison i don't think abandoning the pursuit of mathematical knowledge would make sense for me personally tbh, i think id probably just ask the ai an incessant stream of questions
December 11, 2025 at 1:36 AM
Reposted by Ralith
if a future ai system got so incredibly good at math as to make human contribution to mathematics utterly useless by comparison i don't think abandoning the pursuit of mathematical knowledge would make sense for me personally tbh, i think id probably just ask the ai an incessant stream of questions
December 11, 2025 at 1:24 AM
Reposted by Ralith
beaterCore demo is out on @itch.io!
Buy, repair, restore, and race your dream project car. Take part in racing events and work your way through the career mode!

the-hellbox.itch.io/beatercore

Wishlist on steam:
store.steampowered.com/app/3711050/...

#gamedev #indiedev #rustlang #racinggame
beaterCore by The HellBox
Racing game focused on car repair and maintenance
the-hellbox.itch.io
December 4, 2025 at 5:25 PM
Reposted by Ralith
New post: a defense of lock poisoning in Rust.

Followup to recent discussion: decided to write about lock poisoning, looking at the arguments on each side, and informed by our experience at @oxide.computer dealing with the parallel problem of unexpected async cancellations

Please give it a read!
In defense of lock poisoning in Rust · sunshowers
It's worth retaining one of multithreaded Rust's most valuable features.
sunshowers.io
December 2, 2025 at 7:37 PM
Reposted by Ralith
With the @openuk.bsky.social Awards coming up, we’re excited that Rustls — a memory-safe TLS library — is shortlisted in two categories, and Creator Joe Birr-Pixton is also recognized individually.

The Rust Foundation is proud to support Rustls through the Rust Innovation Lab 🧡
Rustls Shortlisted for Two 2025 OpenUK Awards - The Rust Foundation
The Rust Foundation is delighted to congratulate Rustls for being shortlisted in the Open Source Software and Security categories of the OpenUK Awards 2025 — and Joe Birr-Pixton, Rustls Creator, for…
rustfoundation.org
December 2, 2025 at 7:45 PM
Reposted by Ralith
"you can't solve the quintic" skill issue
November 30, 2025 at 7:22 PM
Reposted by Ralith
And it's up! All together now...

🎶 Somebody told me
the user provider
should use an adaptor
to proxy the query
factory builder... 🎶

www.youtube.com/watch?v=p03o...
November 28, 2025 at 3:15 PM
Reposted by Ralith
"ansible replacement plan" update: I made facet-kdl /really good/ (in screenshot: miette reports)

finishing that up, then moving on to the other components.
November 26, 2025 at 7:33 PM
Reposted by Ralith
“Schemaless” and “self-describing” are the dynamic typing of data formats: you put more crap into the runtime representation, making it perform worse and in exchange you get worse reliability because errors are identified less reliably. Lose/lose
November 23, 2025 at 11:05 AM
Reposted by Ralith
New #rustlang blog post, about "move expressions", an idea that came up after my last post on ergonomic ref counting:

smallcultfollowing.com/babysteps/bl...
Move Expressions · baby steps
smallcultfollowing.com
November 21, 2025 at 4:34 PM
Reposted by Ralith
Pretty cool that Rust is big enough to take down the internet. :D
November 20, 2025 at 12:00 AM
Reposted by Ralith
sorry i just want to circle back to

> we’re wolves not sheep

one more time. i don’t think you know what either of those animals are like. for one thing, a sheep will kill another over an insult way more readily than a wolf will
November 17, 2025 at 8:53 PM
Reposted by Ralith
Avoiding this kind of shell script that doesn't work very well is exactly why async Rust is so incredibly powerful.

Nextest's signal handling is a cleanly designed part of the system that produces events consumed by the same event loops/actors as all other messages, thanks to async Rust
November 17, 2025 at 4:00 AM
Reposted by Ralith
let me introduce y'all to archiveofourown.org/works/3243470
November 16, 2025 at 6:49 PM
Reposted by Ralith
The beautiful river LDFLAGS
November 15, 2025 at 12:14 AM
Reposted by Ralith
oh god oh fuck
I thought the vscode ssh extension's automatic port forwarding feature was neat and assumed it was doing something smart like watching procfs but no it's looking for localhost URIs in stdout
November 11, 2025 at 2:40 PM
Reposted by Ralith
A meteor, the Milky Way and Mount Fuji captured by photographer Hayata Suzuki.
November 5, 2025 at 9:09 PM
Reposted by Ralith
November 2, 2025 at 2:47 PM
Reposted by Ralith
ladies and gentlemen...we got him
October 30, 2025 at 7:10 PM