Julien Le Dem
julien.ledem.net
Julien Le Dem
@julien.ledem.net
Principal Engineer, Founder, Angel, Advisor, OSS.
LFAI&data: OpenLineage, Marquez, ASF: Parquet, Arrow, Iceberg, 🐖
he/him.
Me: https://julien.ledem.net/
Blog: https://sympathetic.ink
Good evening golden gate.
November 9, 2025 at 1:08 AM
Higher latency but higher throughput on improving the overall data ecosystem.
"if you want to go fast, go alone; If you want to go far, go together"
New Apache Parquet Community page is up: parquet.apache.org/community/
November 7, 2025 at 9:17 PM
Reposted by Julien Le Dem
"if you want to go fast, go alone; If you want to go far, go together"
New Apache Parquet Community page is up: parquet.apache.org/community/
November 7, 2025 at 8:06 PM
Reposted by Julien Le Dem
The future of data connectivity is columnar. Today we launched
@columnar.tech to accelerate the shift from slow, row-oriented APIs like ODBC and JDBC to >10x faster alternatives powered by @arrow.apache.org. Learn more 👇
Announcing Columnar
Back to the future of data connectivity
columnar.tech
October 29, 2025 at 10:51 PM
Experimenting with the laser cutter to make 3d objects.
November 3, 2025 at 12:21 AM
This just went in the oven
October 31, 2025 at 12:40 AM
If you missed my talk: "Data Observability and OpenLineage" at the Datadog summit in SF, here is your chance to catch up on the recording.
www.youtube.com/watch?v=uhNo...
Data Observability and OpenLineage
YouTube video by Datadog
www.youtube.com
October 28, 2025 at 10:30 PM
Another earthquake
October 26, 2025 at 2:33 PM
Great time seeing Garbage at the Warfield tonight.
October 25, 2025 at 5:52 AM
I'm speaking tomorrow at "Embracing AI in Open Source"
If you've been wondering why we see a flurry of new columnar formats, come see me present "Column Storage for the AI Era" I'll talk about what has changed, new advances in data encoding and how that's pushing Parquet to evolve.
luma.com/pxikwty3
Next-Gen Data Engineering: Embracing AI in Open Source · Luma
Next-Gen Data Engineering: Embracing AI in Open Source Join us October 23rd at the Silicon Valley AI Hub in Snowflake’s Menlo Park campus for an evening…
luma.com
October 22, 2025 at 9:47 PM
Who’s solving the Louvre robbery?
Right answers only
October 20, 2025 at 5:12 AM
First attempt at making prints with the laser cutter. I don’t know what I’m doing but happy to have made something.
October 19, 2025 at 8:28 PM
And then he said:
« Bring me to your leader! »
October 17, 2025 at 1:29 AM
Felt it
October 16, 2025 at 4:45 PM
Reposted by Julien Le Dem
"It is not 100% clear to me how a new file format (or three) will drive additional ecosystem adoption :thinking:"

However, I absolutely think this adds to the pressure for Parquet to evolve.

Speaking of, anyone interested in helping add new encodings to parquet?
lists.apache.org/thread/djnbb...
October 1, 2025 at 7:21 PM
"Why Datadog Chose Airflow 3: Multi-Tenancy, Observability, and the Future of Event-Driven Workflows"
Zach and I will be talking about how Datadog adopted Airflow 3 at the Airflow summit next week. Come say hi!
airflowsummit.org/sessions/202...
October 1, 2025 at 3:39 PM
Columnar file formats are hot!
Our SIGMOD paper with our friends at Tsinghua + @wesmckinney.com + @pateljm.bsky.social on creating a next generation open-source data file format is out. F3 is a future-proof file format avoids the mistakes of Parquet.
📄 Paper: db.cs.cmu.edu/papers/2025/...
📁 Code: github.com/future-file-...
October 1, 2025 at 3:08 PM
I'm trying to understand a bit better real life deployments of open source Clickhouse.
If you're using it, what does your deployment look like?
September 23, 2025 at 11:05 PM
Reposted by Julien Le Dem
Berkeley just experienced a small earthquake.

Check USGS for the official earthquake magnitude: earthquake.usgs.gov/earthquakes/...

Remember to drop, cover, and hold on during earthquake shaking.
Latest Earthquakes
earthquake.usgs.gov
September 22, 2025 at 10:03 AM
Woken up by the earth quake!
September 22, 2025 at 9:59 AM
Catching up in person at the Community over Code conference.
Nice to see you all!
September 12, 2025 at 5:46 PM
I’ll be at the Community over Code Conference in Minneapolis on Thursday and Friday. Come say hi if you’re around.
I’m speaking about the deconstructed database Thursday at noon.
communityovercode.org/schedule/
Sessions Schedule
Please note: All session times are in Central Daylight Time (UTC -5). You must be registered for Community Over Code NA 2025 to participate in the sessions. If you have not yet registered but would…
communityovercode.org
September 10, 2025 at 3:06 AM
A cool blog post by Qi Zhu, Jigao Luo and @andrewlamb1111.bsky.social on embedding custom indices in Parquet files while staying compatible with the standard.

datafusion.apache.org/blog/2025/07...
Embedding User-Defined Indexes in Apache Parquet Files - Apache DataFusion Blog
datafusion.apache.org
July 17, 2025 at 12:59 AM
🇫🇷🥐🥖🛫📼😴🛬🌉🥑
July 10, 2025 at 2:06 PM
New profile pic?
July 9, 2025 at 12:06 PM