Datafold vs dbt-expectations.
Datafold and dbt-expectations both anchor in quality & testing — 6 dimensions differ, 4 hold. Below: posture, coverage diff, and capability matrix.
What each is betting on.
Open-source data-diff was deprecated May 2024; vendor has since repositioned around AI-powered data engineering automation. Cloud product still ships data diff, monitors, and column-level lineage.
Apache-2.0 dbt package, not a company. The original repo (calogica/dbt-expectations) was marked no longer maintained on 2024-12-18; active development forked to metaplane/dbt-expectations and the dbt Package Hub listing now publishes under the `metaplane` namespace (latest 0.10.x, dbt Fusion-compatible). Metaplane was itself acquired by Datadog (announced 2025-04-23). The package remains free and Apache-2.0 — it was never sold or made proprietary.
Each tool's current strategic narrative, verbatim from its profile.
Spec sheet diff.
| Datafold | dbt-expectations | |
|---|---|---|
| Vendor | Datafold | Metaplane (Datadog) |
| Deployment | SaaS · Self-hosted | Self-hosted only |
| License | Proprietary | Open source |
| Pricing | From $799 | OSS · paid tiers |
| OSS self-host | No | Yes |
| HQ | San Francisco, CA | — |
| Authoring style | Code-first + GUI | YAML |
Both share Primary cluster: Quality & testing · Free tier: Yes · dbt integration: Native · OpenLineage: None · Founded: 2020 · Status: ● active · Test paradigm: Assertion-based
Each tool's center of gravity.
| Cluster | Datafold | dbt-expectations |
|---|---|---|
| Lineage & metadata | 3/3 | 0/3 |
| Quality & testing | 3/3primary | 3/3primary |
| Catalog & discovery | 0/3 | 0/3 |
Scored 0–3 per cluster on the same rubric across all tools. A 0 means the cluster isn't the tool's focus, not that the feature is absent. See the methodology.
Where they cover different ground.
The declared feature set.
3 of 6 declared features differ — listed first.
These are each tool's self-declared key_features; a blank dot means
undeclared, not impossible.
| Feature | Datafold | dbt-expectations |
|---|---|---|
| Pre-Merge Diffing Quality & testing | ||
| Warehouse-Native Monitoring Quality & testing | ||
| Column-Level Lineage Lineage & metadata | ||
| Assertion-Based Testing Quality & testing | ||
| dbt-Native Testing Quality & testing | ||
| Schema Change Detection Quality & testing |
Where they disagree.
Quality & testing
3 of 13 differ| Datafold | dbt-expectations | |
|---|---|---|
| Circuit breaker | ||
| Root-cause UI | ||
| Column profiling |
When to pick each.
Analytics engineering teams with mature dbt practices and a code review culture, who feel the pain of "we merged the change and broke a downstream dashboard a week later." Datafold's defining capability is showing what a model change will do to production output before the PR merges — a deeply different shape of tool from post-merge monitoring. Particularly strong for teams running large-scale warehouse migrations, where automated parity validation across thousands of tables is the difference between a six-month migration and an eighteen-month one.
dbt-centric analytics-engineering teams that already run dbt test in CI and want a broad library of declarative, in-warehouse assertions — value ranges, regex and pattern matching, schema shape, and distributional bounds (mean, median, stdev, quantiles) — with zero added cost or infrastructure. It is the natural first step up from dbt's four built-in tests (unique, not_null, accepted_values, relationships) for a team that wants richer checks without leaving the dbt workflow.
What each does best.
Datafold stands out for
- Pre-merge data diffing is genuinely category-defining; no competitor does this as well
- Column-level lineage derived from SQL static analysis catches dependencies that query-log parsing misses
- Strong dbt and CI integration — testing happens in the same workflow as code review
- Cross-database diffing makes warehouse migrations dramatically less risky
dbt-expectations stands out for
- Free and Apache-2.0 with no paid tier, no SaaS, and no lock-in — the only cost is your own warehouse compute
- A library of 50+ assertions far beyond dbt's four built-ins (value ranges, regex, schema shape, distributional bounds)
- Fully native to dbt — declared in YAML, run by dbt test / dbt build, inheriting dbt severity levels, CI, and run artifacts; the current fork release is dbt Fusion-compatible
- Push-down execution across Postgres, Snowflake, BigQuery, DuckDB, Spark, and Trino
Tools both also compete with.
A note on this comparison.
Every capability value above traces to Datafold or dbt-expectations's own structured spec, which links back to its source — nothing here is averaged or smoothed across the two.
Notice something inaccurate? Send a correction.