Jason Miles
@edudatasci.net
12 followers
39 following
28 posts
Practicing Data & AI specialist in California. Opinions and views are my own.
Posts
Media
Videos
Starter Packs
Jason Miles
@edudatasci.net
· 11d
The Microsoft Fabric Delta Change Data Feed (CDF)
In Microsoft Fabric you’re sitting on top of Delta Lake tables in OneLake. If you flip on Delta Change Data Feed (CDF) for those tables, Delta will record row‑level inserts, deletes, and updates (including pre‑/post‑images for updates) and let you read just the changes between versions. That makes incremental processing for SCDs (Type 1/2) and Data Vault satellites dramatically simpler and cheaper because you aren’t rescanning entire tables—just consuming the “diff.” Fabric’s Lakehouse fully supports this because it’s natively Delta; Mirrored databases land in OneLake as Delta too, but (as of September 2025) Microsoft hasn’t documented a supported way to
edudatasci.net
Jason Miles
@edudatasci.net
· 12d
Analytics Governance: the Missing Middle of the Information Governance Stack
Most organizations have matured data governance (quality, ownership, catalogs) and are racing to formalize AI governance (risk, bias, safety, model monitoring). Application governance (SDLC, access, change control) keeps production systems stable. But the layer where business decisions actually touch numbers—analytics—often sits in a gray zone. KPI definitions live in wikis, dashboards implement subtle variations of the “same” metric, and spreadsheets quietly fork the math.
edudatasci.net
Jason Miles
@edudatasci.net
· 13d
Why Star Schemas Make Analysts Faster (and Happier)
If you live in spreadsheets or SQL all day, the “one big table” (OBT) feels like home. Everything you need is right there: one row per thing, a column for every attribute, and no joins to worry about. It’s a great way to explore data fast—until it isn’t. This post explains, in plain language, why the star schema pays you back every day you analyze data, and how it keeps the speed you love without the headaches you’ve learned to live with.
edudatasci.net
Jason Miles
@edudatasci.net
· 15d
A New Paradigm For Data Teams: The Changing Role of the Data Visualization Engineer
When teams build warehouses the old way—source → bronze → silver → gold → semantic—visualization and semantic specialists are invited in at the end. Their job looks reactive: wire up a few visuals, name some measures, make it load fast enough. They inherit whatever the pipeline produced, then try to make meaning out of it. The failure mode is predictable: pixel‑perfect charts sitting on semantic quicksand, with definitions that shift underfoot and performance that depends on structures no one designed for the questions at hand.
edudatasci.net
Jason Miles
@edudatasci.net
· 18d
FabCon Feature: Fabric Real‑Time Intelligence
Real‑Time Intelligence (RTI) is the part of Fabric that treats events and logs as first‑class citizens: you connect live streams, shape them, persist them, query them with KQL or SQL, visualize them, and trigger actions—all without leaving the SaaS surface. Concretely, RTI centers on Eventstream (ingest/transform/route), Eventhouse (KQL databases), Real‑Time Dashboards / Map, and Activator (detect patterns and act). That tight loop—capture → analyze → visualize/act—now covers everything from IoT telemetry to operational logs and clickstream analytics.
edudatasci.net
Jason Miles
@edudatasci.net
· 19d
Information Governance: The Backbone That Unifies Data, AI, Applications, and Analytics
Information governance (IG) is the strategy, accountability, and control system for how an organization collects, classifies, uses, protects, shares, retains, and disposes of information across its entire lifecycle. It is: Scope‑wide: Covers structured data, unstructured content, model artifacts, code, dashboards, and records (including legal/records management and privacy). Lifecycle‑aware: From intake and creation → active use → archival → retention/disposition and legal holds.
edudatasci.net
Jason Miles
@edudatasci.net
· 20d
Baselines Over Buzzwords: From Warehouse to Lakehouse
If you’ve built data systems long enough, you’ve lived through at least three architectural moods: the tidy certainty of Kimball and Inmon, the anarchic freedom of “throw everything in the data lake to ingest quickly,” and today’s lakehouse, which tries to keep our speed without losing our sanity. I've always cared less about labels and more about baselines—clear, durable expectations that make change safe.
edudatasci.net
Jason Miles
@edudatasci.net
· 22d
A New Paradigm For Data Teams: The real bottleneck isn’t data, it’s definition
Most data teams still run a tidy assembly line: ingest sources into bronze, standardize into silver, curate into gold, and only then wire up a semantic model for BI. That sounds rigorous—but it puts the business contract (grain, conformed dimensions, measure logic, security scope, and SLOs) at the very end. By the time the organization finally argues about what “AUM” or a compliant “time‑weighted return” …
edudatasci.net
Jason Miles
@edudatasci.net
· 25d
FabCon Feature: Purview
On edudatasci.net, I keep data mesh grounded in four behaviors: domains own data; data as a product; a small self‑serve platform; and federated governance (policies expressed as code and applied consistently). I also use foundational vs derived data products as a practical way to think about scope and ownership, and I recommend publishing products in Purview’s Unified Catalog so ownership, access and SLOs are discoverable to the org, not just the team that built them.
edudatasci.net
Jason Miles
@edudatasci.net
· 26d
Certifications in IT
I hold a lot of certifications. That’s a personal choice, not a creed. I like challenging myself against a test to prove I've learned something and lets me prove to myself that I've actually successfully learned something. The “certs or no certs?” debate is as eternal—and as spicy—as vi vs. Emacs (or “eMacs,” if you’re trolling your coworkers). Different corners of computing answer that question differently, for good reasons.
edudatasci.net
Jason Miles
@edudatasci.net
· Apr 27
Jason Miles
@edudatasci.net
· Apr 25
Jason Miles
@edudatasci.net
· Jan 19
Jason Miles
@edudatasci.net
· Jan 12
Jason Miles
@edudatasci.net
· Jan 12
Jason Miles
@edudatasci.net
· May 9
Jason Miles
@edudatasci.net
· Mar 21
Reposted by Jason Miles
Jason Miles
@edudatasci.net
· Jan 6