Reuben Bond
banner
rbn.bsky.social
Reuben Bond
@rbn.bsky.social
Distributed Systems & databases person. Works at Microsoft on Orleans & Aspire
Reposted by Reuben Bond
Our SIGMOD paper with our friends at Tsinghua + @wesmckinney.com + @pateljm.bsky.social on creating a next generation open-source data file format is out. F3 is a future-proof file format avoids the mistakes of Parquet.
📄 Paper: db.cs.cmu.edu/papers/2025/...
📁 Code: github.com/future-file-...
October 1, 2025 at 1:49 PM
Reposted by Reuben Bond
Why, hello there! 🪟

Thanks @rbn.bsky.social :D
September 16, 2025 at 7:07 PM
Reposted by Reuben Bond
In the last week I probably explained Rateless Set Reconciliation to a dozen other scientists. What an amazing paper and result, and already one year old.
April 7, 2025 at 6:55 PM
What's a good discord server for people building things in the AI/LLM space?
January 4, 2025 at 10:08 PM
Reposted by Reuben Bond
Interesting workshop and a lovely community, would very much recommend!
December 15, 2024 at 4:04 PM
Reposted by Reuben Bond
Hi everyone! I'm a co-founder @dbos.dev, where we're building a serverless platform for highly reliable applications. I love conversations about databases, distributed systems, and anything technical. Thanks @qianli.dev for introducing me to Bluesky, and looking forward to meeting people here! 🦋
December 13, 2024 at 12:58 AM
Reposted by Reuben Bond
NULL BITMAP: in honour of DSQL's release last week I wrote about what the point of having extremely precise trustworthy clocks is buttondown.com/jaffray/arch...
What are the Magical Clocks for?
At Re:Invent last week, AWS announced DSQL, their new serverless SQL database. As a fan of distributed SQL databases I have been enjoying reading about the...
buttondown.com
December 9, 2024 at 7:07 PM
CRDTs fit a system model that has hardly any overlap with datacenter-based applications. If I'm wrong, please point to datacenter-based apps which have benefited from the application of CRDTs
December 8, 2024 at 5:26 PM
Reposted by Reuben Bond
New blog post on the fun new hardware advancements which databases can leverage for great gains, and why the cloud means it doesn't matter that they exist. 🫠

transactional.blog/b...
November 20, 2024 at 12:13 AM
Reposted by Reuben Bond
ACM Symposium on Cloud Computing (SoCC) is next week! Lots of great papers on serverless, cloud, and AI/ML systems. I appreciate the push on diversity: both keynote speakers are women, plus a women in systems meetup! (This was my first time as a program committee member🤗)
acmsocc.org/2024/index.h...
2024 ACM Symposium on Cloud Computing
acmsocc.org
November 14, 2024 at 9:02 PM
Reposted by Reuben Bond
We've now published a blog about how LinkedIn runs large stateful apps with a generic operator on Kubernetes. www.linkedin.com/blog/enginee...

Don’t forget to come to our #KubeCon session on Thursday.
November 13, 2024 at 4:12 PM
Reposted by Reuben Bond
I made a substantial update to my "messaging and eventing resources" page, adding the CNCF xRegistry project and related subprojects, pending proposals for Apache Avro, a section on Microsoft Fabric RTI, and a new talk recording. github.com/clemensv/mes...
GitHub - clemensv/messaging: Resource collection for messaging and eventing
Resource collection for messaging and eventing. Contribute to clemensv/messaging development by creating an account on GitHub.
github.com
November 13, 2024 at 10:52 AM
Reposted by Reuben Bond
Spent some time trying out #dotnet Aspire last night. Definitely impressed so far. I was surprised how easy it was to pull in non dotnet workloads too.
November 10, 2024 at 4:22 PM
Distributed Systems & #DataBS folks, please welcome @lalithsuresh.bsky.social! Lalith authored my favorite paper on group membership, RAPID. Instead of building strongly consistent membership atop weak (eg, Raft + SWIM), RAPID delivers consistency from the get-go www.usenix.org/system/files...
November 11, 2024 at 4:53 PM
CASPaxos is a Paxos variant which implements a linearizable (compare and set) register without needing logs. It's the simplest practical consensus algorithm, IMO. I wrote more about it here: reubenbond.github.io/posts/caspaxos
reublog - CASPaxos
reublog
reubenbond.github.io
November 3, 2024 at 12:11 AM
Reposted by Reuben Bond
Over communicate if you’re remote. And map your communication style and message to the audience.
If you want to be a senior engineer, make sure you’re working on communication. Engineering leaders don’t have time to look at code, they rely on comments from engineers that give them sense checks on quality, etc. mostly that comes from engineers who are senior and above.

Communicate.
November 2, 2024 at 2:58 PM
Scan RAM without trashing your CPU caches by leveraging non-temporal instructions, Memory Type Range Registers, or cache line flushing. Obscure, but perhaps useful for minimizing the impact of garbage collectors / LSM compaction. dl.acm.org/doi/pdf/10.1...
November 1, 2024 at 9:07 PM
Reposted by Reuben Bond
Hello friends! I'm so excited to join Bluesky here -- it feels like a cleaner and safer social platform.

The first thing I did was to follow this instruction to set my domain as my handle. It's super simple! bsky.social/about/blog/4...
How to set your domain as your handle - Bluesky
Using a domain as your handle helps with account identity, verification, and portability. Here's how to set your domain as your handle.
bsky.social
November 1, 2024 at 6:19 PM
Reposted by Reuben Bond
Is it end of the road for RocksDB in stream processing?

Disaggregated state is the clearly superior architecture, with @responsive.dev investing heavily in SlateDB.io while Flink 2.0 has forked RocksDB.

Here's why we've bet on SlateDB for Kafka Streams: www.responsive.dev/blog/why-sla...
If not RocksDB, then what? Why SlateDB is the right choice for Stream Processing.
Why we think SlateDB is the right foundation for a state storage service for Kafka Streams.
www.responsive.dev
October 29, 2024 at 4:51 PM
Disaggregated storage pushes us to reexamine past ideas under a new light. Phil Bernstein gives middle-tier distributed transactions another look in this paper sites.computer.org/debull/A19ju.... What other examples come to mind?
October 27, 2024 at 4:47 PM
Databases built on disaggregated storage (blobs) leverage reconfiguration rather than quorums. You don't need Paxos in your database if the storage layer guarantees linearizability and durability already.
Data Replication Design Spectrum by @alexmillerdb.bsky.social (transactional.blog/blog/2024-da...) pairs well with Categorizing How Distributed Databases Utilize Consensus Algorithms by Adam Prout (medium.com/@adamprout/c...). Both deserve a read if you are interested in distributed databases.
Categorizing How Distributed Databases Utilize Consensus Algorithms
Distributed databases generally fall into two camps when it comes to architectures for maintaining high availability (HA) [1]. Both…
medium.com
October 27, 2024 at 4:09 PM
Reposted by Reuben Bond
Talking about Database Internals, I want to let y’all know that I just released another internals episode with the one and only @gwenshap.bsky.social

If you want to know why Nile Postgres was created and how it supports multitenant SaaS, tune in NOW

youtu.be/WZ7t0vqGKoo
October 26, 2024 at 4:34 PM
Data Replication Design Spectrum by @alexmillerdb.bsky.social (transactional.blog/blog/2024-da...) pairs well with Categorizing How Distributed Databases Utilize Consensus Algorithms by Adam Prout (medium.com/@adamprout/c...). Both deserve a read if you are interested in distributed databases.
Categorizing How Distributed Databases Utilize Consensus Algorithms
Distributed databases generally fall into two camps when it comes to architectures for maintaining high availability (HA) [1]. Both…
medium.com
October 25, 2024 at 11:47 PM
Orleans can automatically migrate frequently communicating grains to the same hosts, reducing network calls. Here's an app running on a real cluster showing grains (small circles) being migrated between hosts to eliminate network calls (red lines).
October 25, 2024 at 6:14 PM
Stateful services: low latency, efficiency, scalability - pick three - Atul Adya (Databricks): hpts.ws/papers/2024/...
Atul previously built Slicer at Google. Dicer is an evolution of that for Databricks. This presentation at HPTS resonated strongly with me - if only we had a recording
hpts.ws
October 25, 2024 at 3:55 PM