Data Stack Index / v 02.06
Verified 2026·05·30
Send a correction
§ Open source

Best data tools
for open-source teams.

Run it yourself — Apache-2.0 and friends. 7 indexed, 7 open source.

01
Why these

What fits.

Open-source data tooling trades licence cost for operational cost: you run it, tune it, and upgrade it. These are the open-licensed options teams run at production scale — the catalogs, the dbt-native test layers, and the lineage reference implementations — where the deciding factor is your platform-engineering appetite, not the price.

02
7 tools

The shortlist.

How this list sorts.

Open-source options sort first, then alphabetical — no editorial ranking, no paid placement. Every entry matches a structured field on the tool profile; see the methodology, or compare any two on the comparisons page.