Simon Späti 🏔️
banner
ssp.sh
Simon Späti 🏔️
@ssp.sh
Dad. Technical Author, Data Engineer.
Data practitioner (20y) • Writing at ssp.sh since 2015.

Focused on the craft of data engineering & storytelling.
📚 vault.ssp.sh • 📖 @dedp.online

❯ #dataengineering, #opensource, #writing, #obsidian, #neovim
Pinned
Hello bsky👋🏻. In case we haven't met: Simon from 🇨🇭 I bought my first domain sspaeti.com in 2008 and had a popular party site with HTML/CSS/PHP.

Since 2015, I'm crafting open-source data engineering essays on ssp.sh, vault.ssp.sh and book.ssp.sh. Hacking at gh.ssp.sh. I recently started freelancing.
November 12, 2025 at 9:00 PM
What writing devices are you using? Mixing up makes all the fun, no? 😉
November 11, 2025 at 8:37 PM
Create a rundown on the current streaming (and stream processing) landscape.

As @stanislavkozlovski.bsky.social explains, `streaming != stream processing`.
Event Streaming is Topping Out
There's no money left in real-time event streaming. A major consolidation wave is beginning.
bigdata.2minutestreaming.com
November 11, 2025 at 9:39 AM
Has anyone exported cost data from AWS, Google, or Cloudflare? What's the easiest (most automated) way?

There is creating a cost report and exporting to S3, but is there no way via CLI/API? 🤔
November 10, 2025 at 6:22 PM
Modern OLAP systems. Still not dead.
Modern OLAP Systems
With modern [[OLAP]] systems, you replace your [[Traditional OLAP Cubes]] one-to-one with another technology. Therefore, you keep everything the same on your current architecture but replace your cube...
www.ssp.sh
November 10, 2025 at 10:26 AM
Reposted by Simon Späti 🏔️
How are people working collaboratively with Markdown and Obsidian?

I use @hackmd.io synced with git. I made a video of how I use it, you can also see my note on it: www.ssp.sh/brain/hackmd.

Hacking my way with symlinks to use Wikilinks `[[]]` everywhere and auto-convert to links on my websites.
Efficient Markdown Collaboration (HackMD, Obsidian, Neovim, VSCode)
YouTube video by Simon Späti
youtu.be
June 12, 2025 at 8:50 AM
What a beautiful write-up. Be aware of the 'start scrolling'.
www.whitenoise.email/p/when-you-s...
November 9, 2025 at 4:08 PM
What's stopping you from reading the #dataengineering newsletter and the latest blog posts like this?

These are my RSS feeds in the terminal on the left, and the feeds (queries), their URLs, and configs on the right.
November 7, 2025 at 4:20 PM
Reposted by Simon Späti 🏔️
Consolidations in the hashtag#dataengineering market are happening fast. Tools from the hashtag#ModernDataStack get unified into unified data platforms.

The latest:
- Fivetran → dbt
- Fivetran → SQLMesh
- Soda → nannyML
- Snowflake → Crunchy Data
- Databricks → Neon
- Fivetran → Census
- dbt → SDF
October 13, 2025 at 6:43 PM
Reposted by Simon Späti 🏔️
Wow, what a presentation by @hannes.muehleisen.org about the history of data architecture with its changes in architecture from 1985 to 2025 with DuckDB and in general.

I took a lot of notes, some of which are illustrated below in my current Obsidian Vault.
November 4, 2025 at 8:58 AM
Reposted by Simon Späti 🏔️
Updated many people in data engineering since, but who is missing?

I also added other interesting curated lists:
> Whitepapers: www.ssp.sh/brain/data-e...
> Books: www.ssp.sh/brain/books-...
> Learning: www.ssp.sh/brain/learni...
> YT: www.ssp.sh/brain/data-e...
> Blogs: www.ssp.sh/brain/data-e...
I am sharing my list of the people in data engineering, who is missing?

🗒️ https://www.ssp.sh/brain/people-of-data-engineering/
November 5, 2024 at 7:57 AM
What other YouTube Channels are out there for Data Engineering that you like?

List and links: www.ssp.sh/brain/data-e...
November 3, 2025 at 1:59 PM
I just updated the programmatically generated image for my second brain notes. What do you think, better? 🤔

Check below the before and after. The note below also explains how I achieved this using Rust + SVG, with conversion to WebP.
November 2, 2025 at 4:28 PM
This is really cool. I added the example to my second brain, and it just works.

www.ssp.sh/brain/run-du...
November 2, 2025 at 1:45 PM
Books of Data Engineering. Which one is your favorite, and what are you reading?

Also, what books do you read that are not related to DE, but have influenced how you think or design data work?
Books of Data Engineering
Below are books related to data engineering. Starters and general knowledge: Data Engineering Design Patterns (DEDP) - My partially free online book to get you started :) The Data Warehouse Toolk...
www.ssp.sh
November 2, 2025 at 9:53 AM
I really enjoy Railway.com. Anyone else using them?

I just moved to self host my website analytics (GoatCounter) there, and its so simple and convenient.
Railway
Railway is an infrastructure platform where you can provision infrastructure, develop with that infrastructure locally, and then deploy to the cloud.
Railway.com
October 30, 2025 at 8:40 PM
We did a thing: three amazing data engineers and I answered the 10 most-asked questions on r/dataengineering.

These genuine insights help you understand the current state of #dataengineering. Obviously, I'm biased since I pulled together the questions & answered them myself, but it out for yourself
4 Senior Data Engineers Answer 10 Top Reddit Questions - MotherDuck Blog
A great panel answering the most voted/commented data questions on Reddit | Reading time: 27 min read
motherduck.com
October 30, 2025 at 2:34 PM
Very interesting paper.
www.oecd.org/en/publicati...
October 30, 2025 at 10:35 AM
Is anybody using git for data right now? What is your workflow? How do you integrate the full stack into Git-style working?

Do you use database cloning mechanisms, or are you using LakeFS and similar tools?

I'm currently writing about it, and I found that there are so many ways to do this.
October 30, 2025 at 9:55 AM
Found on Reddit:

> Can AI effectively do the jobs and an the things it's supposedly able to do? it doesn't matter. what matters is that the people in charge think that it can, and they're willing to find out and they don't care about the collateral damage to people's careers and livelihood.
pheezy42's comment on "AWS tech writers majorly impacted by today's layoffs"
Explore this conversation and more from the technicalwriting community
www.reddit.com
October 29, 2025 at 8:59 AM
Opinionated data stack vs. modern data stack 🤔
October 27, 2025 at 1:03 PM
It is this time of the year. Exchanging the beauty of the outside view and fresh air with the hard but effective grind.

Preparing, making myself fit until next year when the flowers start to open.
October 26, 2025 at 8:12 PM
Great insights are coming from *having lived*. You can’t share great ideas out of thin air. Either you had a hard time, you have lived different places (mini retirements), you encountered many problems, and overcame obstacles.

These all make great insights, articles to share. Ideas worth sharing.
Ideas Worth Sharing
Great insights are coming from having lived. You can’t share great ideas out of thin air. Either you had a hard time, you have lived in different places ([[mini retirements]]), you encountered many pr...
www.ssp.sh
October 23, 2025 at 10:02 PM