AI Apps Directory

The signal, not the hype.

  • 803 Apps
  • 44 Categories
  • 145 Open source, 545 Free to try, 186 Self-host
  • View pyannoteAI details
    Audio · FREEMIUM · Open core

    pyannoteAI

    pyannoteAI

    Speaker intelligence — diarization that tells who spoke when.

    pyannoteAI turns conversational audio into speaker-attributed transcripts: it identifies speakers, separates overlapping voices, and provides speaker metadata. Built on the widely used open-source pyannote.audio library, it adds a premium REST API and Python SDK with higher accuracy and near real-time speed.

    Worth knowing

    Its open-source pyannote.audio library sees 45M+ monthly downloads; the Paris-based company raised an €8.1M seed round in April 2025.

    • audio
    • speaker-diarization
    • speech
    • open-source
  • View Hume AI details
    Voice · FREEMIUM

    Hume AI

    Hume AI

    Emotionally intelligent voice AI — speech-to-speech (EVI) and expressive TTS (Octave).

    Hume builds voice AI tuned to emotional expression. Its Empathic Voice Interface (EVI) is a speech-to-speech system that reads vocal tone, handles interruptions and back-channeling, and can front any external LLM. Octave is its expressive text-to-speech model with voice design, cloning, and modulation.

    Worth knowing

    Founded by former Google AI researcher Alan Cowen; raised a $50M Series B in 2024 led by EQT Ventures.

    • voice
    • speech-to-speech
    • tts
    • emotion-ai
  • View Genesis details
    Robotics · FREE · OSS

    Genesis

    Genesis AI

    A generative, ultra-fast physics platform for robotics and embodied AI.

    An open-source simulation platform for general-purpose robotics and embodied-AI learning, rebuilt from the ground up in pure Python. Combines a unified multi-physics engine, a photorealistic renderer, and a generative data engine that turns natural-language prompts into simulation scenes and training data.

    Worth knowing

    Began as a Dec 2024 academic project; its backer Genesis AI left stealth in July 2025 with a $105M seed co-led by Eclipse and Khosla.

    • robotics
    • simulation
    • physics-engine
    • embodied-ai
  • View LeRobot details
    Robotics · FREE · OSS

    LeRobot

    Hugging Face

    Open-source models, datasets, and tools for real-world robotics in PyTorch.

    Hugging Face's robotics stack — pretrained policies (ACT, Diffusion Policy, SmolVLA, Pi0), shared datasets, simulation environments, and designs for low-cost manipulator kits. Aims to be the 'Transformers for robotics', lowering the barrier to training and sharing robot-learning models.

    Worth knowing

    Built by Hugging Face as the 'Transformers for robotics'; NVIDIA partnered on it in 2025 to accelerate open-source robot learning.

    • robotics
    • open-source
    • pytorch
    • imitation-learning
  • View Papago details
    Translation · FREE

    Papago

    Naver

    Naver's free neural translator, tuned for East Asian languages.

    Papago is Naver's free AI translation app, covering 14 languages with text, voice, image, and handwriting input. Its neural machine translation is especially strong on East Asian pairs — Korean to and from Japanese, English, and Chinese — making it a go-to for travel and study across the region. Available on the web and as iOS and Android apps.

    Worth knowing

    Built by South Korea's Naver and launched in 2017; 'Papago' is Esperanto for 'parrot,' nodding to its language focus.

    • translation
    • korean
    • east-asian
    • mobile
  • View SYSTRAN details
    Translation · PAID

    SYSTRAN

    ChapsVision

    Enterprise neural machine translation with on-prem options.

    SYSTRAN is one of the oldest machine-translation companies, now offering Pure Neural Machine Translation across 55+ languages and 150+ language pairs. It translates documents, web pages, and speech with domain-tuned models for legal, financial, defense, pharmaceutical, and industrial use, and supports 30+ file formats. Available as managed SaaS, on-premise server, or private cloud — built for organizations needing confidential, high-volume translation under strict compliance.

    Worth knowing

    Founded 1968; powered Google's language tools until 2007 and Yahoo Babel Fish until 2012; acquired by France's ChapsVision in 2024.

    • translation
    • enterprise
    • neural-mt
    • self-host
  • View Confident AI details
    Eval · FREEMIUM

    Confident AI

    Confident AI

    The AI quality platform from the team behind DeepEval.

    Confident AI is the hosted platform built on top of DeepEval, the open-source LLM evaluation framework. It adds dataset and test management, research-backed metrics, production tracing and monitoring, adversarial red teaming, and governance dashboards so teams can benchmark, observe, and safeguard LLM apps across the dev-to-prod loop. Python and TypeScript SDKs plug into CI and OpenTelemetry, with managed cloud and enterprise self-hosting.

    Worth knowing

    Builds the open-source DeepEval framework (~2M evals/day) and raised a $2.2M YC-backed seed in Aug 2025.

    • eval
    • observability
    • red-teaming
    • llm-as-judge
  • View Akiflow details
    Productivity · PAID

    Akiflow

    Akiflow

    One app for tasks and calendars, powered by AI.

    Akiflow is a keyboard-first daily planner that pulls tasks from email, Slack, Notion, and dozens of other tools into a single inbox, then lets you time-block them onto a unified calendar. Its built-in assistant, Aki, handles AI scheduling, prioritization, daily briefings, and natural-language and voice commands. It runs on macOS, Windows, web, iOS, and Android with real-time sync.

    Worth knowing

    Backed by Y Combinator; the keyboard-first command bar predates the 'Aki' AI assistant, which was layered on later.

    • task-management
    • calendar
    • time-blocking
    • planner
  • View Spinach AI details
    Meeting · FREEMIUM

    Spinach AI

    Spinach (StayIn, Inc.)

    An AI meeting assistant built for agile teams.

    Spinach AI joins video calls to record, transcribe, and summarize meetings, then turns them into clear notes, decisions, and action items. Designed for agile teams, it runs standups, retros, and sprint syncs and pushes follow-ups into the tools engineers already use. It integrates with Zoom, Google Meet, Microsoft Teams, and Webex plus 30+ destinations including Slack, Jira, Salesforce, HubSpot, and Notion.

    Worth knowing

    Backed by Y Combinator and strategic investors Zoom and Atlassian; started in 2021 as an AI standup tool for agile engineering teams.

    • meeting-notes
    • scrum
    • standup
    • action-items
  • View Novita AI details
    Inference · PAID

    Novita AI

    Novita AI

    One API for 120+ AI models, plus agent sandboxes and GPU cloud.

    Novita AI is an AI and agent cloud for developers that combines serverless model APIs with on-demand compute. A single API serves 120+ text, image, audio, video, and vision models, while Agent Sandbox provides isolated runtimes for tool-using agents and the GPU cloud offers dedicated instances, serverless GPUs, and bare-metal clusters. It advertises sub-50ms time-to-first-token and startup-friendly, usage-based pricing.

    Worth knowing

    An official Hugging Face inference provider, serving open models to Hugging Face's 5M+ developers via a 'Deploy on Novita' experience.

    • inference
    • agent-sandbox
    • gpu-cloud
    • llm-api
  • View DeepInfra details
    Inference · PAID

    DeepInfra

    DeepInfra

    Low-cost, pay-as-you-go API access to 100+ AI models.

    DeepInfra is a cloud inference platform that lets developers run open and proprietary models through a simple, OpenAI-compatible API without managing hardware. It serves text generation, embeddings, image/audio/video, and speech models with token-based, pay-as-you-go pricing, and offers DeepCluster dedicated NVIDIA GPU capacity for heavier workloads. It is SOC 2 and ISO 27001 certified with a zero data-retention policy.

    Worth knowing

    Raised a $107M Series B in May 2026 (investors include Nvidia and Samsung Next) and processes roughly 5 trillion tokens a week.

    • inference
    • open-models
    • gpu-cloud
    • llm-api
  • View Cloudflare Vectorize details
    Vector DB · FREEMIUM

    Cloudflare Vectorize

    Cloudflare

    A globally distributed vector database built into Cloudflare Workers.

    Vectorize is Cloudflare's vector database for building AI-powered apps on its Workers platform. It stores and queries embeddings for semantic search, recommendation, classification, and RAG, and cross-references results against data in R2, D1, and KV. Embeddings can come from Workers AI or external providers like OpenAI, and indexes are configured via the dashboard, Wrangler CLI, or REST API.

    Worth knowing

    Reached general availability in 2024, when its per-index capacity jumped 25× from 200,000 to 5 million vectors.

    • vector-database
    • rag
    • edge
    • embeddings
  • View Paperguide details
    Research · FREEMIUM

    Paperguide

    Paperguide

    AI research assistant and reference manager for academics.

    Paperguide is an all-in-one academic research platform combining an AI assistant, reference manager, and chat-with-PDF in one workspace. It searches papers with real citations, runs literature-review workbooks, extracts data into tables, annotates PDFs, and drafts with an AI writer and plagiarism checker. Built for students, educators, and research teams, with a browser extension to save sources.

    • academic-research
    • reference-manager
    • literature-review
    • chat-with-pdf
  • View Neuphonic details
    Voice · FREEMIUM · Open core

    Neuphonic

    Neuphonic

    Ultra-low-latency text-to-speech that runs on-device.

    Neuphonic is a voice-AI company building text-to-speech and voice cloning that run locally with very low latency. Its cloud API targets real-time voice agents, and in October 2025 it open-sourced NeuTTS Air, a 748M-parameter speech language model that runs on CPU via llama.cpp and clones a voice from a few seconds of audio. Aimed at private, offline, and voice-agent use cases.

    Worth knowing

    Open-sourced NeuTTS Air in Oct 2025, an Apache-2.0 748M-param TTS model that runs on CPU and clones a voice from ~3 seconds of audio.

    • text-to-speech
    • voice-cloning
    • on-device
    • open-source
  • View Lamini details
    Fine-tuning · PAID

    Lamini

    Lamini

    Enterprise platform to tune and run open LLMs in your own environment.

    Lamini is an enterprise LLM platform for fine-tuning open models and serving them, designed to run on-prem, in a VPC, or on Lamini's cloud — including on AMD GPUs. It pairs tuning (LoRA/PEFT and memory tuning to reduce hallucinations) with an inference stack and agentic pipelines, accessed via a Python client, REST API, or web UI. Built for teams that need to keep models and data in-house.

    Worth knowing

    Co-founded by Greg Diamos, a creator of the MLPerf benchmark, and Stanford's Sharon Zhou; backed by Andrew Ng and Andrej Karpathy.

    • fine-tuning
    • llm
    • enterprise
    • on-prem
  • View Sapling details
    Support · FREEMIUM

    Sapling

    Sapling Intelligence

    AI writing assistant for customer-facing teams.

    Sapling is a language-model toolkit that gives customer-facing teams real-time autocomplete, grammar and tone suggestions, and reusable snippets inside helpdesks and CRMs like Zendesk, Salesforce, and Intercom. It exposes an API and SDK to embed the same suggestions into custom apps, plus a browser extension for everyday writing. The company also ships a widely cited AI-content detector.

    Worth knowing

    Y Combinator (W19) startup founded in 2018 by Stanford NLP researcher Ziang Xie.

    • customer-support
    • autocomplete
    • grammar
    • snippets
  • View Writefull details
    Writing · FREEMIUM

    Writefull

    Writefull (Digital Science)

    AI language feedback and editing for academic writing.

    Writefull is an academic writing assistant whose language models are trained on millions of published journal articles, giving discipline-aware language feedback, paraphrasing, and copyediting. It generates titles and abstracts, checks LaTeX, and runs automated language revision over a full manuscript. Works on the web and integrates with Microsoft Word and Overleaf.

    Worth knowing

    Part-owned by Digital Science (owner of Overleaf, Dimensions, and Figshare) since 2018, and fully acquired by it in November 2023.

    • academic-writing
    • editing
    • paraphrasing
    • latex
  • View Songscription details
    Audio · FREEMIUM

    Songscription

    Songscription

    Turn any audio into sheet music, MIDI, and guitar tabs with AI.

    Songscription transcribes audio recordings into readable notation — sheet music, MIDI, MusicXML, and GuitarPro tabs — across instruments including piano, guitar, bass, strings, horns, drums, and vocals. Often described as a 'Shazam for sheet music', it automates a task that traditionally took hours by ear. A free tier covers unlimited 30-second clips, with paid plans for longer recordings and more export formats.

    Worth knowing

    Stanford-founded; its model builds on co-founder Tim Beyer's research, backed by Reach Capital through Stanford's StartX accelerator.

    • music-transcription
    • sheet-music
    • midi
  • View IntellCRE details
    Real Estate · PAID

    IntellCRE

    IntellCRE

    AI deal-flow engine for commercial real estate — underwrite and market in minutes.

    IntellCRE automates the commercial real estate workflow: it sources listings and market data, runs underwriting and financial modeling, and turns the result into investor-ready marketing collateral. From one deal it can generate brochures, offering memorandums, broker opinions of value, pitch decks, and listing websites on brand. It targets brokers, investors, lenders, and analysts working multifamily and CRE deals.

    Worth knowing

    Runs on a proprietary database of 150M+ U.S. commercial properties, auto-sourcing comps and market data for each deal.

    • cre
    • underwriting
    • deal-marketing
  • View Voyage details
    Gaming · FREEMIUM

    Voyage

    Latitude

    AI-native RPG platform where every world is built and played from a prompt.

    Voyage lets anyone describe a world in natural language and drops them inside it as a playable, text-based RPG. Built on Latitude's World Engine, characters carry their own motivations and memories, storylines emerge from player choices, and the world keeps evolving even after you log off. Up to four players can share a world in co-op.

    Worth knowing

    Built by Latitude, the AI Dungeon studio; its World Engine took five years to develop and is backed by Google's AI Futures Fund.

    • ai-rpg
    • worldbuilding
    • interactive-fiction
  • View WellSaid details
    Voice · FREEMIUM

    WellSaid

    WellSaid Labs

    Enterprise AI text-to-speech with voices licensed from real voice actors.

    An enterprise-grade AI voice generator that produces realistic voiceovers from scripts. It offers 120+ voices across languages and accents — modeled on licensed recordings by real voice actors — plus a studio for script import and audio tuning, team workspaces, pronunciation libraries, Adobe integrations, and an API for products, LMS platforms, and IVRs.

    Worth knowing

    Spun out of the Allen Institute for AI (AI2) incubator in 2019; raised a $10M Series A led by FUSE in 2021.

    • text-to-speech
    • voiceover
    • tts
    • enterprise
  • View Move AI details
    Vision · FREEMIUM

    Move AI

    Move AI

    Markerless 3D motion capture from ordinary video — even a single iPhone.

    Markerless motion-capture technology that turns 2D video into broadcast-quality 3D animation data using computer vision, biomechanics, and physics. The Move One app captures motion from a single iPhone, while multi-camera setups serve studio production; output exports to FBX and USD for game engines and animation pipelines. Used by studios including Ubisoft, Sony, and Disney.

    Worth knowing

    Founded 2019 in London; raised a $10M (£8.2M) 2023 seed backed by Warner Music Group and Animoca Brands.

    • motion-capture
    • markerless
    • 3d-animation
    • mocap
  • View Happy Scribe details
    Audio · FREEMIUM

    Happy Scribe

    Happy Scribe

    Transcription, subtitles, and AI meeting notes in 150+ languages.

    A transcription and subtitling platform that turns calls, interviews, and recordings into accurate, searchable text. It offers instant AI transcription, a professional human transcriber network for 99%+ accuracy, automatic subtitles and translation across 150+ languages, and an AI notetaker that joins Google Meet, Microsoft Teams, and Zoom calls.

    Worth knowing

    Bootstrapped since 2017, founded by two Dublin City University students after a class assignment to transcribe research interviews.

    • transcription
    • subtitles
    • translation
    • notetaker
  • View TrueFoundry details
    Infra · PAID

    TrueFoundry

    TrueFoundry

    Enterprise AI gateway and deployment platform that runs in your own cloud.

    A unified platform for deploying, scaling, and governing LLM and agentic AI systems. It pairs an AI gateway that routes and orchestrates calls across providers with infrastructure for hosting models (vLLM, TGI, Triton), fine-tuning, and full-stack observability — deployed inside your own VPC, on-prem, or air-gapped environment with enterprise RBAC and audit logging.

    Worth knowing

    Raised a $19M Series A led by Intel Capital in February 2025, bringing total funding to about $21M.

    • ai-gateway
    • model-deployment
    • mlops
    • enterprise