lakeFS
Treeverse
Git-like version control for data lakes over your existing object storage.
Open-source data version control that turns object storage (S3, GCS, Azure Blob, MinIO) into Git-like repositories. Teams branch, commit, merge, and roll back petabyte-scale data lakes for isolated experimentation, reproducible ML pipelines, data-quality gates, and compliance lineage — without copying data. Integrates with Spark, Trino, Databricks, Delta Lake, and Iceberg.
- data-versioning
- data-lake
- mlops
- reproducibility
- +2