#DuckDB's
🇫🇮 ​We are hosting a pub session next week during the @helsinkidataweek.bsky.social, where you can chat with DuckDB's co-creator, @hannes.muehleisen.org and have a drink with members of the DuckDB community.

🎟️ Sign up on Luma: luma.com/s5sl9qxx
October 20, 2025 at 2:29 PM
We published a new deep dive by Laurens Kuiper, who recently redesigned DuckDB's sort.

One data point: ordering the TPC-H SF100 lineitem table with the memory limit set to 30 GB is 3× faster in DuckDB v1.4 than in v1.3.

Read more at duckdb.org/2025/09/24/s...
Redesigning DuckDB's Sort, Again
After four years, we've decided to redesign DuckDB's sort implementation, again. In this post, we present and evaluate the new design.
duckdb.org
September 25, 2025 at 6:35 PM
This can be improved in a lot of ways. One of my goals is to bring DuckDB's UnaryExecutor (C++) to Zig. I need to learn more Zig first, but I'm pretty sure it's doable.
February 4, 2025 at 10:19 AM
Bonus Drop #81 (2025-04-06): Bonus²

This weekend Bonus rop discusses DuckDB's integration with dbt for effective data transformation, enhancing workflows with features like spatial functions and reverse ETL. It also highlights shell command efficiency improvement tools and introduces […]
Original post on mastodon.social
mastodon.social
April 6, 2025 at 6:25 PM
HN users discussed DuckDB's role in data engineering. They shared experiences, desired features (like Iceberg writes), & how they use it. "Airport for DuckDB" announcement sparked interest in extending its capabilities. #DuckDB 1/6
May 24, 2025 at 9:00 PM
Never gonna not 💙 #DuckDB's built-in bar chart function
June 21, 2025 at 11:34 AM
More magicks, this time for DuckDB's new DuckLake!

Use local or SharePoint-ed Excel as a DuckLake metadata catalog.

github.com/gregwdata/du...
GitHub - gregwdata/ducklakexl: Use Excel as a metadata catalog for DuckLake 🤪
Use Excel as a metadata catalog for DuckLake 🤪. Contribute to gregwdata/ducklakexl development by creating an account on GitHub.
github.com
June 4, 2025 at 1:41 PM
DuckDB’s installation page is… nice. Pick the version, env, package, and platform. Off you go.
August 16, 2022 at 2:47 AM
As a side note, '.by=' is the way to group when using duckDB's tidyverse wrapper, `duckplyr`, otherwise duckDB will drop back to using `dplyr` and lose the speed advantage.
November 17, 2024 at 4:54 PM
Tinkering with dltHub's REST API source to ingest data without having to write any bespoke code, and DuckDB's new local web UI, and I feel so spoiled with how good data tooling is getting!
March 21, 2025 at 3:43 PM
Correct. DuckDB's code path is a more generic one that also includes running the CSV sniffer to identify the CSV file's dialect, column names and types.
December 2, 2024 at 8:50 PM
*Says the daily prayer that someone with l33t CSS sk1llz PRs into DuckDB's UI code to support dark mode.*
July 8, 2025 at 11:15 AM
The things that have made this harder than I would have thought are
1. Duckdb's excellent json parser only works on SELECT statements! Had to create a whole separate thing just to safely pop "CREATE OR REPLACE TABLE" off the front of queries so duckdb can parse the rest. github.com/bmschmidt/da...
github.com
December 24, 2024 at 9:04 PM
"Which Parsing Approach?"

DuckDB's PEG parser post made me go down the rabbit hole and re-read on parsing for a bit. This is a nice overview on different approaches, by @ltratt.bsky.social.

tratt.net/laurie/blog/...
November 24, 2024 at 10:45 AM
Seeing something like this on a software’s website gives me confidence. Telling me when NOT to use a tool makes me trust the tool more.
September 20, 2023 at 10:50 PM
Today's Bonus [Knowledge] Drop takes you on my journey to add CIDR handling to DuckDB's INET extension, which stores IPv4 & IPv6 addresses. It outlines the process of developing a DuckDB function to expand CIDR notation into a list of IP addresses, detailing technical steps and functions involved.
Bonus Drop #70 (2024-12-14): Knowledge Drop
Today’s Bonus [Knowledge] Drop takes you on my journey to add CIDR handling to DuckDB’s INET extension, which stores IPv4 and IPv6 addresses. It outlines the process of developing a fun…
dailydrop.hrbrmstr.dev
December 14, 2024 at 6:45 PM
Drop #560 (2024-11-22): Quack • Post • Code:

Today's Drop features DuckDB's API integration with Hono.js for SQL queries, turning RSS feeds into automated Bluesky posts, and resources for learning Deno 2.x.

http://dailydrop.hrbrmstr.dev/2024/11/22/drop-560-2024-11-22-quack-post-code/
November 24, 2024 at 3:30 PM
Well… sorry, I guess? 😅 I could not find a smaller test case to trigger a bug in DuckDB‘s old bottom up decorrelation, so I’ve added this monstrosity as a test case. Do you mind sharing why do you need to debug it?
June 6, 2025 at 10:41 PM
All true, ofc.

I thought it was cool that with some combination of dd and show_query() you could get it to write as far as possible DuckDB's prefered SQL and call its native functions.

This was aimed more at the SQL-forgetter and rusty DBA whisperer (me) than the devoted dplyr-user (err, also me)
August 15, 2025 at 9:13 PM
It's only 2.2gb in DuckDB's default disk persistence format! It's tiny ass artisanal data! It's small batch IPA data.
March 16, 2025 at 11:00 PM
Thanks! I'm inclined towards not filling issues as I don't think there is much missing from duckdb's side. People need to catch up :)
October 16, 2023 at 7:48 AM
But what about larger teams that don’t want to run DuckDB locally? What about teams and enterprises with a larger Postgres investment. Connecting DuckDB’s engine with Postgres is where we come in.
November 1, 2024 at 4:26 PM
DuckDB’s DuckLake: A Simplified Lakehouse Solution

DuckDB's DuckLake is emerging as a compelling alternative to Delta Lake and Apache Iceberg, offering simplicity and efficiency in data lakehouse architectures. Here’s a breakdown of its key features and potential improvements: Simplicity: DuckLake…
DuckDB’s DuckLake: A Simplified Lakehouse Solution
DuckDB's DuckLake is emerging as a compelling alternative to Delta Lake and Apache Iceberg, offering simplicity and efficiency in data lakehouse architectures. Here’s a breakdown of its key features and potential improvements: Simplicity: DuckLake simplifies backend implementations compared to Delta Lake and Iceberg. Iceberg Compatibility: Iceberg table import/export is in development, with compaction support already available in the extension. Integration Needs: Future integrations with AWS Athena (Trino/Presto) and Spark via JNI would enhance usability.
undercodetesting.com
May 28, 2025 at 5:48 AM
Today I hacked together my first DuckDB extension in pure C. No C++. No Rust calling the C API. No CMake. Just me, a single .c file, and the compiler.

Funnily enough, even the C API demo in DuckDB's source code was written using C++. 😄

How's your Sunday going?
January 5, 2025 at 3:40 PM