Move AI

Markerless 3D motion capture from ordinary video.

Categories: Vision3D
Pricing: FREEMIUM
Source: Proprietary
Hosting: Cloud
Platforms: WebiOSAPI
Models: Self-contained (on-device)
Verified: Jun 15, 2026

Markerless motion-capture technology that turns 2D video into broadcast-quality 3D animation data using computer vision, biomechanics, and physics. The Move One app captures motion from a single iPhone, while multi-camera setups serve studio production; output exports to FBX and USD for game engines and animation pipelines. Used by studios including Ubisoft, Sony, and Disney.

Capabilities 1

What it actually does — grouped by capability family.

Video understanding (primary capability)

Pros & cons

No markers, suits, or specialist hardware
Multi-camera option for studio quality
Exports to FBX and USD
Used by major studios

Cloud processing; credit-based pricing
Single-camera accuracy trails multi-camera
Capture length capped on lower tiers

View Bucket Robotics details
VisionPAID
Bucket Robotics
Bucket Robotics
Computer-vision defect detection for manufacturing, trained from CAD.
Bucket Robotics builds computer-vision defect detection for manufacturing that trains from CAD files and synthetic data instead of hand-labeled photos. It generates simulated defects — burn marks, bumps, breaks — from the CAD that every modern part already has, producing production-ready vision models that deploy in minutes and adapt as parts and lines change. The system integrates into existing production lines without adding new hardware, and has drawn early customers in automotive and defense.
No hand-labeling — trains from CAD
Early-stage (founded 2024, small team)
- computer-vision
- manufacturing
- defect-detection
- synthetic-data
- +1
Open
View Memories.ai details
VisionFREEMIUM
Memories.ai
Memories.ai
A 'visual memory' layer for AI — search and reason over huge video libraries.
Video understanding platform built around a large visual memory model. It ingests long-form and large-scale video, then supports natural-language search, transcription, clip retrieval, and content analysis with unlimited video context. Applied to security and surveillance review, sports analytics, media production, and robotics, with a free playground and on-device processing options.
Handles very long and large video sets
Newer, smaller track record
- video-understanding
- visual-memory
- video-search
- multimodal
Open
View T-Rex Label details
VisionFREEMIUM
T-Rex Label
Visincept (IDEA Research)
Zero-shot AI image annotation that batch-labels with visual prompts.
T-Rex Label is a browser-based image annotation tool built on the T-Rex2 open-set detection model. Point it at one example and its visual-prompt, zero-shot detection finds and labels matching objects across an entire dataset—no training or fine-tuning required—handling dense, occluded, and varied-lighting scenes. It exports to COCO and YOLO formats and integrates with tools like Roboflow and Labelbox.
No per-class training needed
Browser-only, no offline mode
- image-annotation
- object-detection
- zero-shot
- dataset-labeling
- +1
Open
View Matroid details
VisionPAID
Matroid
Matroid
No-code computer vision to detect anything in images and video.
Matroid is an enterprise platform for building custom computer vision detectors without writing code. Non-programmers train detectors to find objects, defects, people, events, and actions, then deploy them against any existing camera or video feed. It is widely used for industrial visual inspection and quality control, where it can flag cracks, weld defects, and assembly errors in real time.
Non-programmers train custom detectors
Enterprise sales, no public pricing
- computer-vision
- no-code
- object-detection
- manufacturing
- +1
Open
View Mixpeek details
VisionFREEMIUM
Mixpeek
Mixpeek
Find any scene in your video and multimodal library.
Mixpeek is a multimodal retrieval API for searching across video, images, audio, and documents with natural language. It extracts and indexes structured features — faces, scenes, transcripts, OCR, and embeddings — over object storage like S3, GCS, and R2, then runs hybrid dense, sparse, and BM25 search with reranking. Cross-modal joins let a single query combine signals such as faces, spoken phrases, and on-screen text.
Searches video, image, audio, and docs
Developer/API-first, not no-code
- multimodal-search
- video-search
- retrieval
- embeddings
- +1
Open
View Optifye details
VisionPAID
Optifye
Optifye
Computer-vision monitoring of factory-floor efficiency from existing cameras.
Optifye is an AI computer-vision platform for manufacturing operations. It connects to a plant's existing IP/CCTV cameras to measure per-operator cycle times, detect bottlenecks and check standard-operating-procedure compliance in real time, then turns that into efficiency analytics and automated production reports. It targets labour-intensive lines across automotive, apparel, welding, medical and electronics manufacturing.
Detects production bottlenecks in real time
Worker-surveillance and privacy concerns
- computer-vision
- manufacturing
- operations
- monitoring
- +1
Open
View Datature details
VisionFREEMIUM
Datature
Datature
Build and deploy computer-vision models without code.
Datature is an end-to-end, no-code platform for computer vision. Its Label module provides AI-assisted, pixel-perfect annotation with multi-annotator review; Train offers drag-and-drop model building with hyperparameter tuning; and Deploy ships models to edge or cloud via API. It supports image classification, object detection, keypoint annotation, and semantic segmentation across industries from healthcare to manufacturing.
Covers label, train and deploy in one place
Pricing not transparent on the site
- computer-vision
- no-code
- annotation
- mlops
- +1
Open
View olmOCR details
VisionFREEOSS
olmOCR
Allen Institute for AI
Open-source OCR that converts PDFs and scans into clean, structured text.
olmOCR is an open-source toolkit from the Allen Institute for AI that turns PDFs and document images into clean, reading-order plain text, preserving tables, equations, and handwriting. It runs a fine-tuned 7B vision-language model with a document-anchoring prompting technique, and is built for cheap, dataset-scale conversion for LLM training and retrieval. Released with model weights, training data, and inference code; runs on your own GPUs or via third-party inference providers.
Ships weights, training data, and code
Requires a capable GPU to self-host
- ocr
- open-source
- pdf
- document-parsing
- +1
Open
View Hyperscience details
VisionPAID
Hyperscience
Hyperscience
Enterprise document processing that turns messy paperwork into structured data.
Hyperscience is an enterprise intelligent document processing (IDP) platform that reads, classifies, and extracts data from forms, invoices, and handwritten paperwork at high accuracy. It trains custom machine-learning models per document type and routes low-confidence cases to humans, targeting straight-through automation for high-volume back-office workflows. Sold to large enterprises and government agencies, with cloud, private-cloud, and air-gapped on-prem deployment options.
Routes low-confidence cases to humans
Enterprise sales, no public pricing
- document-processing
- idp
- ocr
- enterprise
- +1
Open
View VLM Run details
VisionFREEMIUM
VLM Run
Autonomi AI
Unified API gateway that extracts structured JSON from images, video, and documents.
VLM Run is a developer platform for visual AI that returns reliable structured JSON from images, video, and documents through a single API, combining hyper-specialized vision-language models with computer-vision tools for tasks like document parsing, structured OCR, object detection, and segmentation. It offers fine-tuning to specialize models for a domain, dashboards, and flexible deployment. The platform is operated by Autonomi AI.
One API for images, video, and documents
Pro tier jumps to $799/mo
- visual-ai
- document-extraction
- vision-language-model
- ocr
Open

Open Move AI

Move AI

Capabilities 1

Pros & cons

Tags

Further reading

Bucket Robotics

Memories.ai

T-Rex Label

Matroid

Mixpeek

Optifye

Datature

olmOCR

Hyperscience

VLM Run