AI Apps Directory
The signal, not the hype.
- 803Apps
- 44Categories
- 145 Open source, 545 Free to try, 186 Self-host
View pyannoteAI details AudioFREEMIUMOpen corepyannoteAI
pyannoteAI
Speaker intelligence — diarization that tells who spoke when.
pyannoteAI turns conversational audio into speaker-attributed transcripts: it identifies speakers, separates overlapping voices, and provides speaker metadata. Built on the widely used open-source pyannote.audio library, it adds a premium REST API and Python SDK with higher accuracy and near real-time speed.
Worth knowing
Its open-source pyannote.audio library sees 45M+ monthly downloads; the Paris company raised an €8.1M seed in April 2025.
- audio
- speaker-diarization
- speech
- open-source
- +1
View Hume AI details VoiceFREEMIUMHume AI
Hume AI
Emotionally intelligent voice AI — speech-to-speech (EVI) and expressive TTS (Octave).
Hume builds voice AI tuned to emotional expression. Its Empathic Voice Interface (EVI) is a speech-to-speech system that reads vocal tone, handles interruptions and back-channeling, and can front any external LLM. Octave is its expressive text-to-speech model with voice design, cloning, and modulation.
Worth knowing
Founded by former Google AI researcher Alan Cowen; raised a $50M Series B in 2024 led by EQT Ventures.
- voice
- speech-to-speech
- tts
- emotion-ai
- +1
View Genesis details RoboticsFREEOSSGenesis
Genesis AI
A generative, ultra-fast physics platform for robotics and embodied AI.
An open-source simulation platform for general-purpose robotics and embodied-AI learning, rebuilt from the ground up in pure Python. Combines a unified multi-physics engine, a photorealistic renderer, and a generative data engine that turns natural-language prompts into simulation scenes and training data.
Worth knowing
Began as a Dec 2024 academic project; its backer Genesis AI left stealth in July 2025 with a $105M seed co-led by Eclipse and Khosla.
- robotics
- simulation
- physics-engine
- embodied-ai
- +1
View LeRobot details RoboticsFREEOSSLeRobot
Hugging Face
Open-source models, datasets, and tools for real-world robotics in PyTorch.
Hugging Face's robotics stack — pretrained policies (ACT, Diffusion Policy, SmolVLA, Pi0), shared datasets, simulation environments, and designs for low-cost manipulator kits. Aims to be the 'Transformers for robotics', lowering the barrier to training and sharing robot-learning models.
Worth knowing
Built by Hugging Face as the 'Transformers for robotics'; NVIDIA partnered on it in 2025 to accelerate open-source robot learning.
- robotics
- open-source
- pytorch
- imitation-learning
- +1
View Papago details TranslationFREEPapago
Naver
Naver's free neural translator, tuned for East Asian languages.
Papago is Naver's free AI translation app, covering 14 languages with text, voice, image, and handwriting input. Its neural machine translation is especially strong on East Asian pairs — Korean to and from Japanese, English, and Chinese — making it a go-to for travel and study across the region. Available on the web and as iOS and Android apps.
Worth knowing
Built by South Korea's Naver and launched in 2017; 'Papago' is Esperanto for 'parrot,' nodding to its language focus.
- translation
- korean
- east-asian
- mobile
- +1
View SYSTRAN details TranslationPAIDSYSTRAN
ChapsVision
Enterprise neural machine translation with on-prem options.
SYSTRAN is one of the oldest machine-translation companies, now offering Pure Neural Machine Translation across 55+ languages and 150+ language pairs. It translates documents, web pages, and speech with domain-tuned models for legal, financial, defense, pharmaceutical, and industrial use, and supports 30+ file formats. Available as managed SaaS, on-premise server, or private cloud — built for organizations needing confidential, high-volume translation under strict compliance.
Worth knowing
Founded 1968; powered Google's language tools until 2007 and Yahoo Babel Fish until 2012; acquired by France's ChapsVision in 2024.
- translation
- enterprise
- neural-mt
- self-host
- +1
View Confident AI details EvalFREEMIUMConfident AI
Confident AI
The AI quality platform from the team behind DeepEval.
Confident AI is the hosted platform built on top of DeepEval, the open-source LLM evaluation framework. It adds dataset and test management, research-backed metrics, production tracing and monitoring, adversarial red teaming, and governance dashboards so teams can benchmark, observe, and safeguard LLM apps across the dev-to-prod loop. Python and TypeScript SDKs plug into CI and OpenTelemetry, with managed cloud and enterprise self-hosting.
Worth knowing
Builds the open-source DeepEval framework (~2M evals/day) and raised a $2.2M YC-backed seed in Aug 2025.
- eval
- observability
- red-teaming
- llm-as-judge
- +1
View Akiflow details ProductivityPAIDAkiflow
Akiflow
One app for tasks and calendars, powered by AI.
Akiflow is a keyboard-first daily planner that pulls tasks from email, Slack, Notion, and dozens of other tools into a single inbox, then lets you time-block them onto a unified calendar. Its built-in assistant, Aki, handles AI scheduling, prioritization, daily briefings, and natural-language and voice commands. It runs on macOS, Windows, web, iOS, and Android with real-time sync.
Worth knowing
Backed by Y Combinator; the keyboard-first command bar predates the 'Aki' AI assistant, which was layered on later.
- task-management
- calendar
- time-blocking
- planner
View Spinach AI details MeetingFREEMIUMSpinach AI
Spinach (StayIn, Inc.)
An AI meeting assistant built for agile teams.
Spinach AI joins video calls to record, transcribe, and summarize meetings, then turns them into clear notes, decisions, and action items. Designed for agile teams, it runs standups, retros, and sprint syncs and pushes follow-ups into the tools engineers already use. It integrates with Zoom, Google Meet, Microsoft Teams, and Webex plus 30+ destinations including Slack, Jira, Salesforce, HubSpot, and Notion.
Worth knowing
Backed by Y Combinator and strategic investors Zoom and Atlassian; started in 2021 as an AI standup tool for agile engineering teams.
- meeting-notes
- scrum
- standup
- action-items
View Novita AI details InferencePAIDNovita AI
Novita AI
One API for 120+ AI models, plus agent sandboxes and GPU cloud.
Novita AI is an AI and agent cloud for developers that combines serverless model APIs with on-demand compute. A single API serves 120+ text, image, audio, video, and vision models, while Agent Sandbox provides isolated runtimes for tool-using agents and the GPU cloud offers dedicated instances, serverless GPUs, and bare-metal clusters. It advertises sub-50ms time-to-first-token and startup-friendly, usage-based pricing.
Worth knowing
An official Hugging Face inference provider, serving open models to Hugging Face's 5M+ developers via a 'Deploy on Novita' experience.
- inference
- agent-sandbox
- gpu-cloud
- llm-api
View DeepInfra details InferencePAIDDeepInfra
DeepInfra
Low-cost, pay-as-you-go API access to 100+ AI models.
DeepInfra is a cloud inference platform that lets developers run open and proprietary models through a simple, OpenAI-compatible API without managing hardware. It serves text generation, embeddings, image/audio/video, and speech models with token-based, pay-as-you-go pricing, and offers DeepCluster dedicated NVIDIA GPU capacity for heavier workloads. It is SOC 2 and ISO 27001 certified with a zero data-retention policy.
Worth knowing
Raised a $107M Series B in May 2026 (investors include Nvidia and Samsung Next) and processes roughly 5 trillion tokens a week.
- inference
- open-models
- gpu-cloud
- llm-api
View Cloudflare Vectorize details Vector DBFREEMIUMCloudflare Vectorize
Cloudflare
A globally distributed vector database built into Cloudflare Workers.
Vectorize is Cloudflare's vector database for building AI-powered apps on its Workers platform. It stores and queries embeddings for semantic search, recommendation, classification, and RAG, and cross-references results against data in R2, D1, and KV. Embeddings can come from Workers AI or external providers like OpenAI, and indexes are configured via the dashboard, Wrangler CLI, or REST API.
Worth knowing
Reached general availability in 2024, when its per-index capacity jumped 25× from 200,000 to 5 million vectors.
- vector-database
- rag
- edge
- embeddings
View Paperguide details ResearchFREEMIUMPaperguide
Paperguide
AI research assistant and reference manager for academics.
Paperguide is an all-in-one academic research platform combining an AI assistant, reference manager, and chat-with-PDF in one workspace. It searches papers with real citations, runs literature-review workbooks, extracts data into tables, annotates PDFs, and drafts with an AI writer and plagiarism checker. Built for students, educators, and research teams, with a browser extension to save sources.
- academic-research
- reference-manager
- literature-review
- chat-with-pdf
View Neuphonic details VoiceFREEMIUMOpen coreNeuphonic
Neuphonic
Ultra-low-latency text-to-speech that runs on-device.
Neuphonic is a voice-AI company building text-to-speech and voice cloning that run locally with very low latency. Its cloud API targets real-time voice agents, and in October 2025 it open-sourced NeuTTS Air, a 748M-parameter speech language model that runs on CPU via llama.cpp and clones a voice from a few seconds of audio. Aimed at private, offline, and voice-agent use cases.
Worth knowing
Open-sourced NeuTTS Air in Oct 2025, an Apache-2.0 748M-param TTS model that runs on CPU and clones a voice from ~3 seconds of audio.
- text-to-speech
- voice-cloning
- on-device
- open-source
View Lamini details Fine-tuningPAIDLamini
Lamini
Enterprise platform to tune and run open LLMs in your own environment.
Lamini is an enterprise LLM platform for fine-tuning open models and serving them, designed to run on-prem, in a VPC, or on Lamini's cloud — including on AMD GPUs. It pairs tuning (LoRA/PEFT and memory tuning to reduce hallucinations) with an inference stack and agentic pipelines, accessed via a Python client, REST API, or web UI. Built for teams that need to keep models and data in-house.
Worth knowing
Co-founded by Greg Diamos, a creator of the MLPerf benchmark, and Stanford's Sharon Zhou; backed by Andrew Ng and Andrej Karpathy.
- fine-tuning
- llm
- enterprise
- on-prem
View Sapling details SupportFREEMIUMSapling
Sapling Intelligence
AI writing assistant for customer-facing teams.
Sapling is a language-model toolkit that gives customer-facing teams real-time autocomplete, grammar and tone suggestions, and reusable snippets inside helpdesks and CRMs like Zendesk, Salesforce, and Intercom. It exposes an API and SDK to embed the same suggestions into custom apps, plus a browser extension for everyday writing. The company also ships a widely cited AI-content detector.
Worth knowing
Y Combinator (W19) startup founded in 2018 by Stanford NLP researcher Ziang Xie.
- customer-support
- autocomplete
- grammar
- snippets
View Writefull details WritingFREEMIUMWritefull
Writefull (Digital Science)
AI language feedback and editing for academic writing.
Writefull is an academic writing assistant whose language models are trained on millions of published journal articles, giving discipline-aware language feedback, paraphrasing, and copyediting. It generates titles and abstracts, checks LaTeX, and runs automated language revision over a full manuscript. Works on the web and integrates with Microsoft Word and Overleaf.
Worth knowing
Part-owned by Digital Science (owner of Overleaf, Dimensions, and Figshare) since 2018, and fully acquired by it in November 2023.
- academic-writing
- editing
- paraphrasing
- latex
View Songscription details AudioFREEMIUMSongscription
Songscription
Turn any audio into sheet music, MIDI, and guitar tabs with AI.
Songscription transcribes audio recordings into readable notation — sheet music, MIDI, MusicXML, and GuitarPro tabs — across instruments including piano, guitar, bass, strings, horns, drums, and vocals. Often described as a 'Shazam for sheet music', it automates a task that traditionally took hours by ear. A free tier covers unlimited 30-second clips, with paid plans for longer recordings and more export formats.
Worth knowing
Stanford-founded; its model builds on co-founder Tim Beyer's research, backed by Reach Capital through Stanford's StartX accelerator.
- music-transcription
- sheet-music
- midi
View IntellCRE details Real EstatePAIDIntellCRE
IntellCRE
AI deal-flow engine for commercial real estate — underwrite and market in minutes.
IntellCRE automates the commercial real estate workflow: it sources listings and market data, runs underwriting and financial modeling, and turns the result into investor-ready marketing collateral. From one deal it can generate brochures, offering memorandums, broker opinions of value, pitch decks, and listing websites on brand. It targets brokers, investors, lenders, and analysts working multifamily and CRE deals.
Worth knowing
Runs on a proprietary database of 150M+ U.S. commercial properties, auto-sourcing comps and market data for each deal.
- cre
- underwriting
- deal-marketing
View Voyage details GamingFREEMIUMVoyage
Latitude
AI-native RPG platform where every world is built and played from a prompt.
Voyage lets anyone describe a world in natural language and drops them inside it as a playable, text-based RPG. Built on Latitude's World Engine, characters carry their own motivations and memories, storylines emerge from player choices, and the world keeps evolving even after you log off. Up to four players can share a world in co-op.
Worth knowing
Built by Latitude, the AI Dungeon studio; its World Engine took five years to develop and is backed by Google's AI Futures Fund.
- ai-rpg
- worldbuilding
- interactive-fiction
View WellSaid details VoiceFREEMIUMWellSaid
WellSaid Labs
Enterprise AI text-to-speech with voices licensed from real voice actors.
An enterprise-grade AI voice generator that produces realistic voiceovers from scripts. It offers 120+ voices across languages and accents — modeled on licensed recordings by real voice actors — plus a studio for script import and audio tuning, team workspaces, pronunciation libraries, Adobe integrations, and an API for products, LMS platforms, and IVRs.
Worth knowing
Spun out of the Allen Institute for AI (AI2) incubator in 2019; raised a $10M Series A led by FUSE in 2021.
- text-to-speech
- voiceover
- tts
- enterprise
- +1
View Move AI details VisionFREEMIUMMove AI
Move AI
Markerless 3D motion capture from ordinary video — even a single iPhone.
Markerless motion-capture technology that turns 2D video into broadcast-quality 3D animation data using computer vision, biomechanics, and physics. The Move One app captures motion from a single iPhone, while multi-camera setups serve studio production; output exports to FBX and USD for game engines and animation pipelines. Used by studios including Ubisoft, Sony, and Disney.
Worth knowing
Founded 2019 in London; raised a $10M (£8.2M) 2023 seed backed by Warner Music Group and Animoca Brands.
- motion-capture
- markerless
- 3d-animation
- mocap
- +1
View Happy Scribe details AudioFREEMIUMHappy Scribe
Happy Scribe
Transcription, subtitles, and AI meeting notes in 150+ languages.
A transcription and subtitling platform that turns calls, interviews, and recordings into accurate, searchable text. It offers instant AI transcription, a professional human transcriber network for 99%+ accuracy, automatic subtitles and translation across 150+ languages, and an AI notetaker that joins Google Meet, Microsoft Teams, and Zoom calls.
Worth knowing
Bootstrapped since 2017, founded by two Dublin City University students after a class assignment to transcribe research interviews.
- transcription
- subtitles
- translation
- notetaker
- +1
View TrueFoundry details InfraPAIDTrueFoundry
TrueFoundry
Enterprise AI gateway and deployment platform that runs in your own cloud.
A unified platform for deploying, scaling, and governing LLM and agentic AI systems. It pairs an AI gateway that routes and orchestrates calls across providers with infrastructure for hosting models (vLLM, TGI, Triton), fine-tuning, and full-stack observability — deployed inside your own VPC, on-prem, or air-gapped environment with enterprise RBAC and audit logging.
Worth knowing
Raised a $19M Series A led by Intel Capital in February 2025, bringing total funding to about $21M.
- ai-gateway
- model-deployment
- mlops
- enterprise
- +1