DVC

Open-source Git extension for versioning data, models, and ML experiments.

Category: Data Ops
Pricing: FREE
Platforms: CLI
Models: Model-agnostic
Verified: Jun 21, 2026

DVC (Data Version Control) brings software-engineering practices to machine learning: it versions datasets, models, and pipelines alongside code in any Git repository, storing large files in your own remote storage while keeping lightweight pointers in Git. It enables reproducible experiments, data and model lineage, and pipeline orchestration from the command line.
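
The "lightweight pointers" idea can be illustrated with a toy sketch. This is not DVC's implementation or file format: the `track` and `restore` helpers, the `.pointer.json` suffix, and the local `cache` directory standing in for remote storage are all hypothetical names chosen for illustration. The sketch only shows the general pattern of storing a large payload under its content hash and keeping a small metadata file that could be committed to Git.

```python
import hashlib
import json
import tempfile
from pathlib import Path


def track(data_file: Path, cache_dir: Path) -> Path:
    """Copy data_file into a content-addressed cache and write a small pointer file."""
    content = data_file.read_bytes()
    digest = hashlib.sha256(content).hexdigest()

    # Store the payload under its hash; this directory stands in for remote storage.
    cache_dir.mkdir(parents=True, exist_ok=True)
    (cache_dir / digest).write_bytes(content)

    # The pointer holds only metadata, so it stays small however large the data is.
    pointer = data_file.with_name(data_file.name + ".pointer.json")
    pointer.write_text(
        json.dumps({"sha256": digest, "size": len(content), "path": data_file.name})
    )
    return pointer


def restore(pointer: Path, cache_dir: Path) -> bytes:
    """Read the hash recorded in a pointer file and fetch the matching payload."""
    meta = json.loads(pointer.read_text())
    return (cache_dir / meta["sha256"]).read_bytes()


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        root = Path(tmp)
        data = root / "train.csv"
        data.write_bytes(b"id,label\n1,cat\n2,dog\n")

        ptr = track(data, root / "cache")
        # The pointer round-trips to the original bytes.
        assert restore(ptr, root / "cache") == data.read_bytes()
        print(ptr.name, ptr.stat().st_size, "bytes")
```

Because the pointer records a content hash, checking out an older commit of the pointer file is enough to identify exactly which version of the data belongs to that commit.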

Pros & cons

Pros
  • Free and open source
  • Versions data and models with Git
  • No server to operate
  • Works with any storage backend
  • Reproducible ML pipelines

Cons
  • CLI-centric learning curve
  • Large-scale lakes better served by lakeFS
  • Roadmap now tied to lakeFS

View all Data Ops
  • lakeFS
    Data Ops · FREEMIUM · Open core

    Treeverse

    Git-like version control for data lakes over your existing object storage.

    Open-source data version control that turns object storage (S3, GCS, Azure Blob, MinIO) into Git-like repositories. Teams branch, commit, merge, and roll back petabyte-scale data lakes for isolated experimentation, reproducible ML pipelines, data-quality gates, and compliance lineage — without copying data. Integrates with Spark, Trino, Databricks, Delta Lake, and Iceberg.

    Pro: Open source (Apache 2.0)
    Con: Operational overhead to self-host
    • data-versioning
    • data-lake
    • mlops
    • reproducibility
    • +2
  • MLflow
    Observability · FREE · OSS

    Linux Foundation

    Open-source platform for the ML and GenAI lifecycle.

    MLflow is an open-source platform for managing the full machine-learning and GenAI lifecycle — experiment tracking, model registry, deployment, and, more recently, LLM/agent observability. Its GenAI stack adds OpenTelemetry-based tracing, systematic evaluation with built-in metrics and LLM judges, and prompt versioning. Framework- and provider-agnostic, it runs on your own infrastructure with no vendor lock-in.

    Pro: Fully open source (Apache-2.0), no lock-in
    Con: Self-hosting adds operational overhead
    • llmops
    • tracing
    • evaluation
    • mlops
    • +1
  • Comet
    Assistant · FREEMIUM

    Perplexity

    Perplexity's AI browser with a sidebar assistant that acts across your tabs.

    An AI-native Chromium browser from Perplexity that puts a persistent assistant in the sidebar — it summarizes pages, compares products, and takes actions like drafting emails or managing tabs using the context of what you're browsing. Perplexity search is the default, with Deep Research and voice mode built in. Free on Windows, macOS, Android, and iOS; an optional $5/month Comet Plus tier unlocks premium publisher content.

    Pro: Free on desktop and mobile
    Con: Closed source
    • browser
    • ai-assistant
    • agentic
    • search