Skip to content

VisionCVAT.ai

CVAT

Open-source annotation platform for vision AI datasets.

Category
Vision
Pricing
FREEMIUM
Source
Open core
Hosting
Hybrid
Platforms
WebAPI
Models
Model-agnostic
Verified
Jun 11, 2026

Data-labeling suite for images, video, and 3D: bounding boxes, polygons, segmentation, keypoints, and object tracking, with AI-assisted labeling via SAM and custom models through its API and SDK. Ships as the MIT-licensed Community edition to self-host, the hosted CVAT Online with free and paid plans, or a self-hosted Enterprise tier.

Pros & cons

  • Free MIT-licensed self-hosted edition
  • Boxes, polygons, keypoints, 3D, and video
  • SAM 2/3-assisted auto-labeling
  • Large community: 15k+ GitHub stars
  • Self-hosting needs Docker ops effort
  • Annotation only — no model training built in

Tags

Further reading

View all Vision
  • View Label Studio details
    Data OpsFREEMIUMOpen core

    Label Studio

    HumanSignal

    Open-source multi-type data labeling and AI evaluation.

    Widely-used open-source tool for labeling and annotating data across images, text, audio, video, and time-series, with a standardized export format for training and fine-tuning. ML backends can pre-label data to speed up human review, and it increasingly doubles as a human-in-the-loop AI evaluation surface. Maintained by HumanSignal, which offers a hosted Starter tier and Label Studio Enterprise.

    Worth knowing

    Maker Heartex rebranded to HumanSignal in June 2023; Label Studio has labeled 200M+ data points.

    • data-labeling
    • open-source
    • annotation
    • human-in-the-loop
    • +1
  • View Roboflow details
    VisionFREEMIUM

    Roboflow

    Roboflow

    Vision MLOps end-to-end. Annotate, train, deploy.

    Annotation tooling, auto-labelling, hosted training, and edge deployment for computer-vision projects. Strong default when you're shipping a custom vision model rather than reaching for a multimodal LLM.

    Worth knowing

    Its Roboflow Universe is one of the largest public computer-vision dataset and model hubs; $40M Series B led by GV in 2024.

    • annotation
    • training
    • deployment
    • edge
  • View Encord details
    VisionPAID

    Encord

    Encord

    Data platform to curate, label, and manage AI training data.

    An enterprise data development platform for preparing high-quality training data across images, video, documents, audio, DICOM, and 3D point clouds. It pairs AI-assisted labeling (SAM auto-segmentation, object tracking) with data curation, model evaluation, and workflow tooling, plus LLM-powered data agents for document tasks. Used heavily in medical imaging, robotics, and other physical-AI domains.

    Worth knowing

    YC W21 company founded by two ex-high-frequency traders; raised a $30M Series B led by Next47 in 2024.

    • data-annotation
    • training-data
    • computer-vision
    • medical-imaging
    • +1
  • View Supervisely details
    VisionFREEMIUM

    Supervisely

    Supervisely

    All-in-one computer vision platform to curate, label, and train models.

    A unified computer vision platform covering data curation, annotation, model training, and deployment across images, video, 3D point clouds, and medical imagery. AI-assisted labeling, experiment tracking, and a large catalog of installable apps make it customizable for most CV workflows. Free for researchers and small teams; Pro and self-hostable Enterprise editions for companies.

    Worth knowing

    Grew out of Deep Systems, a deep-learning consultancy its founders built in 2013, before launching as a product in 2017.

    • computer-vision
    • data-annotation
    • labeling
    • model-training
    • +1