Iceberg kinda makes DWH on Cloudflare R2 feasible given it has S3-compatible API. Sippy seems cool
Iceberg kinda makes DWH on Cloudflare R2 feasible given it has S3-compatible API. Sippy seems cool
github.com/deepseek-ai/...
github.com/deepseek-ai/...
Imagine if a dbt DAG resulted in a single logical plan that operates across multiple engines.
and then they can optimize the plan before execution as well! 🤯🤯🤯
Imagine if a dbt DAG resulted in a single logical plan that operates across multiple engines.
and then they can optimize the plan before execution as well! 🤯🤯🤯
it's what enables:
1. all this interchangeability of query engines
2. (likely) using duckdb in a distributed environment in the first place
it's what enables:
1. all this interchangeability of query engines
2. (likely) using duckdb in a distributed environment in the first place
Very bullish on this future of right tool for right job and making it as simple as a config
news.ycombinator.com/item?id=4323...
Very bullish on this future of right tool for right job and making it as simple as a config
news.ycombinator.com/item?id=4323...
TIRED: "big vs. small" & "distributed vs single-node"
WIRED: tactical deployment of single-node query engines within distributed frameworks.
another great example is Apache Comet which plugs DataFusion into Spark to accelerate single-node operations resulting in overal Spark performance speedups
TIRED: "big vs. small" & "distributed vs single-node"
WIRED: tactical deployment of single-node query engines within distributed frameworks.
another great example is Apache Comet which plugs DataFusion into Spark to accelerate single-node operations resulting in overal Spark performance speedups
Within the year, we'll to see this new paradigm catch on. Future examples will probably be using DataFusion not DuckDB.
Within the year, we'll to see this new paradigm catch on. Future examples will probably be using DataFusion not DuckDB.
Why didn't they use {TOOL}? My guesses
❌ dbt: they wanted Python Dataframe API
❌ Airflow: not as close to metal as Ray
❌ pytorch or ray[data]: idk tbh
Why didn't they use {TOOL}? My guesses
❌ dbt: they wanted Python Dataframe API
❌ Airflow: not as close to metal as Ray
❌ pytorch or ray[data]: idk tbh
what you've made changes the game imho. now I need the same for all the Slacks & Discords I'm in.
what you've made changes the game imho. now I need the same for all the Slacks & Discords I'm in.