Carlo Piovesan
banner
carlo.piovesan.io
Carlo Piovesan
@carlo.piovesan.io
1.1K followers 190 following 130 posts
Compiling @duckdb
Posts Media Videos Starter Packs
Reposted by Carlo Piovesan
We took Canada’s Spatial Access Measures dataset (big, clunky CSVs) → turned it into a single GeoParquet file.

Add DuckDB-WASM + deck.gl & you get
- instant queries
- smooth maps
- no backend

Public data, but actually usable.
developmentseed.org/blog/2025-10...

@saadiqmohiuddin.bsky.social
Reposted by Carlo Piovesan
📣 New blog post by @dtenwolde.bsky.social.

🕸️ In this post, we show how to use DuckDB and the DuckPGQ community extension to analyze financial data for fraudulent patterns with the SQL/PGQ graph syntax that's part of SQL:2023.

📖 Visit duckdb.org/2025/10/22/d... to read the post.
Reposted by Carlo Piovesan
In a new guest blog post, Mark Harrison (Madhive) presents Frozen DuckLakes. These are read-only DuckLakes that only require object storage and are frozen in time. He shows a process for creating and freezing DuckLakes, and demonstrates their use and performance.

📖 ducklake.select/2025/10/24/f...
Should be live, in duckdb v1.4.0:
```
INSTALL dns FROM community;
LOAD dns;
CREATE TABLE websites AS WITH websites(endpoint) AS (VALUES ('google.com'), ('duckdb.org'), ('community-extensions.duckdb.org')) FROM websites;
SELECT dns_lookup(endpoint) FROM websites;
```
duckdb.org/community_ex...
dns
DuckDB Community Extensions The DNS Extension enables DNS lookups and reverse DNS lookups from within DuckDB
duckdb.org
Reposted by Carlo Piovesan
Had a blast speaking about query.farm at the #DuckDB Amsterdam Meetup #3 where I had time to spend some time with my heroes @carlo.piovesan.io @samansmink.bsky.social @szarnyasg.org Tania Bogatsch and so many more awesome people! Thank You @duckdb.org for giving space to the community! #opensource
Reposted by Carlo Piovesan
DuckDB @duckdb.org · Sep 2
We just launched the “DuckDB in Science” site, a curated collection of papers, lectures and podcasts about DuckDB in research: duckdb.org/science/

🎡 If you would like to learn more about DuckDB in Science, consider joining our meetup in London this Thursday: www.meetup.com/duckdb/event...
Reposted by Carlo Piovesan
DuckDB @duckdb.org · Aug 18
🎓 On September 4, we are hosting a new kind of meetup in London which will focus on the use of DuckDB in Science and Education!

⚡️ We still have some spots for lightning talks. If you're working with DuckDB in your research and/or classroom, consider sharing your story!

🔗 duckdb.org/events/2025/...
DuckDB Meetup on Science and Education in London
DuckDB is an in-process SQL database management system focused on analytical query processing. It is designed to be easy to install and easy to use. DuckDB has no external dependencies. DuckDB has bin...
duckdb.org
Reposted by Carlo Piovesan
DuckDB @duckdb.org · Aug 8
We just published a deep dive on spatial joins in DuckDB by @maxxen.bsky.social.

In this blog post, Max explains how spatial joins evolved in DuckDB and how the current operator harnesses R-Trees.

Read the full post at duckdb.org/2025/08/08/s...
Is anyone aware whether it’s possible to set CORS policies on the R2 (@cloudflare.social) Iceberg REST catalog?
cloudflare.social
Reposted by Carlo Piovesan
Weekend project, using R2 + DuckDB wasm to build a low price frontend for big data analytics crimede-coder.com/graphs/GSUBo...
GSU Book Prices
GSU Book Prices Tables
crimede-coder.com
Quack!!
Beyond excited to share that I'm now working with DuckDB Labs!

My first priority will be the DuckDB Rust client, which will receive a lot more love going forward.

🎉 🦆 ❤️
Signing a new freelance contract is always exciting. Being able to work on open source makes it doubly so.

Will share more soon. 😊
Reposted by Carlo Piovesan
Beyond excited to share that I'm now working with DuckDB Labs!

My first priority will be the DuckDB Rust client, which will receive a lot more love going forward.

🎉 🦆 ❤️
Signing a new freelance contract is always exciting. Being able to work on open source makes it doubly so.

Will share more soon. 😊
Reposted by Carlo Piovesan
My #PyConUS talk is up! Tame data science file zoos with @duckdb.org!
Read files of many shapes from any cloud, including your messiest CSVs. Then, organize all of your input tables, output tables, views, and relationships in one DuckDB file. No final_v3.csv!
youtu.be/Yq-hxgysWLQ?...
Taming file zoos: Data science with DuckDB database files - Alex Monahan
YouTube video by PyCon US
youtu.be
Reposted by Carlo Piovesan
Yesterday DuckDB Labs announced DuckLake, a new Open data and catalog format. Here is a bit about why it is so awesome, how it solves some key problems with existing technologies, and why we're excited to host it in the cloud at MotherDuck.
S3 isn’t a database. But what if your lakehouse finally acted like one?

@jrdntgn.bsky.social shares why the current generation of lakehouse formats like Iceberg and Delta Lake might be missing the point—and how DuckLake, a new table format, changes the game.

motherduck.com/blog/ducklak...
A Duck Walks into a Lake - MotherDuck Blog
DuckDB introduces a new table format, what does it mean for the future of data lakes ? | Reading time: 9 min read
motherduck.com
Reposted by Carlo Piovesan
Ducklake is a really elegant solution to the data lake. Radically simple and completely open.
duckdb.org DuckDB @duckdb.org · May 27
Today we're launching DuckLake, an integrated data lake and catalog format powered by SQL. DuckLake unlocks next-generation data warehousing where compute is local, consistency central, and storage scales till infinity. ⁠ducklake is an open standard and we implemented it in the "ducklake" extension.
Classing "cache invalidation and naming things" are the most complex problems.

I think DuckLake is cute, but naming stuff is hard.
And the biggest winner is... DuckDB support for running queries on [remote] parquet files.
This happens to be a basic building block of DuckLake and a few other alternative lake technologies
duckdb.org DuckDB @duckdb.org · May 27
Today we're launching DuckLake, an integrated data lake and catalog format powered by SQL. DuckLake unlocks next-generation data warehousing where compute is local, consistency central, and storage scales till infinity. ⁠ducklake is an open standard and we implemented it in the "ducklake" extension.
Reposted by Carlo Piovesan
DuckDB @duckdb.org · May 27
Today we're launching DuckLake, an integrated data lake and catalog format powered by SQL. DuckLake unlocks next-generation data warehousing where compute is local, consistency central, and storage scales till infinity. ⁠ducklake is an open standard and we implemented it in the "ducklake" extension.
Reposted by Carlo Piovesan
DuckDB @duckdb.org · May 23
New post by Pedro Holanda, Ian Cook, Dewey Dunnington and Bryce Mecum:

Arrow IPC Support in DuckDB

This blog post introduces the new arrow community extension, which boosts interoperability by allowing to consume/produce data through the Arrow IPC Serialization Format.

duckdb.org/2025/05/23/a...