Skip to content

AI Apps Directory

The signal, not the hype.

  • 808Apps
  • 44Categories
  • 145 Open source, 549 Free to try, 187 Self-host
  • View VLM Run details
    VisionFREEMIUM

    VLM Run

    Autonomi AI

    Unified API gateway that extracts structured JSON from images, video, and documents.

    VLM Run is a developer platform for visual AI that returns reliable structured JSON from images, video, and documents through a single API, combining hyper-specialized vision-language models with computer-vision tools for tasks like document parsing, structured OCR, object detection, and segmentation. It offers fine-tuning to specialize models for a domain, dashboards, and flexible deployment. The platform is operated by Autonomi AI.

    • visual-ai
    • document-extraction
    • vision-language-model
    • ocr
  • View Notte details
    AutomationFREEMIUM

    Notte

    Notte Labs

    Framework and browser platform for building reliable AI web agents.

    Notte is a full-stack platform for building and running AI web agents: it turns any website into a structured action API so agents can click, fill forms, and navigate flows instead of scraping raw text. It offers managed cloud browser sessions, serverless browser functions, credential vaults, proxies, CAPTCHA handling, session replays, and observability, with compatibility for Playwright, Puppeteer, Selenium, and CDP tooling. A source-available framework on GitHub underpins the hosted platform.

    Worth knowing

    YC-backed; its framework ships under the Server Side Public License (SSPL) — source-available, not OSI-approved open source.

    • web-agents
    • browser-automation
    • serverless
    • source-available
  • View HoloTab details
    Browser Ext.FREE

    HoloTab

    H Company

    Free Chrome extension that runs an AI agent to complete tasks across any website.

    HoloTab is a browser extension from H Company that deploys a computer-use AI agent inside Chrome: describe a task in natural language and it navigates sites, clicks, fills forms, and makes decisions like a person would, without technical setup. A Routines feature lets you record a task once and re-run or schedule it later. It is aimed at everyday users rather than developers.

    Worth knowing

    Launched April 2026 by Paris-based H Company, the French lab that raised ~$220M; runs on its in-house Holo 3 computer-use model.

    • browser-agent
    • computer-use
    • web-automation
    • chrome-extension
  • View CuspAI details
    SciencePAID

    CuspAI

    CuspAI

    AI platform that designs new materials on demand, like a search engine for molecules.

    CuspAI is a materials-discovery company building an AI platform that takes a target set of properties and generates viable, synthesizable material candidates, with carbon capture and other sustainability applications as early focuses. It combines generative AI foundation models, deep learning, and molecular simulation into one loop to shortcut conventional trial-and-error discovery. It is engaged as an enterprise platform and research partner rather than a self-serve product.

    Worth knowing

    Co-founded by AI pioneer Max Welling; closed a $100M Series A in Sept 2025 co-led by NEA and Temasek, with NVIDIA among backers.

    • materials-discovery
    • generative-ai
    • molecular-simulation
    • deep-tech
  • View Heidi details
    HealthcareFREEMIUM

    Heidi

    Heidi Health

    AI medical scribe that turns the consultation into structured clinical notes.

    Heidi is an AI scribe for clinicians that listens to a consultation and drafts structured notes, referral letters, and patient handouts in real time, with customizable templates and support for 100+ languages. It pairs the Scribe with Evidence (clinical lookup), Remote (a wearable capture mode), and Comms for follow-up. A free tier covers unlimited basic consults, while paid tiers add EHR push and team features.

    Worth knowing

    Melbourne-based; raised a $65M USD Series B in Oct 2025 at a ~$465M valuation, and opened the round for clinicians to invest.

    • ai-scribe
    • clinical-documentation
    • ambient-ai
    • healthcare
  • View pyannoteAI details
    AudioFREEMIUMOpen core

    pyannoteAI

    pyannoteAI

    Speaker intelligence — diarization that tells who spoke when.

    pyannoteAI turns conversational audio into speaker-attributed transcripts: it identifies speakers, separates overlapping voices, and provides speaker metadata. Built on the widely used open-source pyannote.audio library, it adds a premium REST API and Python SDK with higher accuracy and near real-time speed.

    Worth knowing

    Its open-source pyannote.audio library sees 45M+ monthly downloads; the Paris company raised an €8.1M seed in April 2025.

    • audio
    • speaker-diarization
    • speech
    • open-source
    • +1
  • View Hume AI details
    VoiceFREEMIUM

    Hume AI

    Hume AI

    Emotionally intelligent voice AI — speech-to-speech (EVI) and expressive TTS (Octave).

    Hume builds voice AI tuned to emotional expression. Its Empathic Voice Interface (EVI) is a speech-to-speech system that reads vocal tone, handles interruptions and back-channeling, and can front any external LLM. Octave is its expressive text-to-speech model with voice design, cloning, and modulation.

    Worth knowing

    Founded by former Google AI researcher Alan Cowen; raised a $50M Series B in 2024 led by EQT Ventures.

    • voice
    • speech-to-speech
    • tts
    • emotion-ai
    • +1
  • View Genesis details
    RoboticsFREEOSS

    Genesis

    Genesis AI

    A generative, ultra-fast physics platform for robotics and embodied AI.

    An open-source simulation platform for general-purpose robotics and embodied-AI learning, rebuilt from the ground up in pure Python. Combines a unified multi-physics engine, a photorealistic renderer, and a generative data engine that turns natural-language prompts into simulation scenes and training data.

    Worth knowing

    Began as a Dec 2024 academic project; its backer Genesis AI left stealth in July 2025 with a $105M seed co-led by Eclipse and Khosla.

    • robotics
    • simulation
    • physics-engine
    • embodied-ai
    • +1
  • View LeRobot details
    RoboticsFREEOSS

    LeRobot

    Hugging Face

    Open-source models, datasets, and tools for real-world robotics in PyTorch.

    Hugging Face's robotics stack — pretrained policies (ACT, Diffusion Policy, SmolVLA, Pi0), shared datasets, simulation environments, and designs for low-cost manipulator kits. Aims to be the 'Transformers for robotics', lowering the barrier to training and sharing robot-learning models.

    Worth knowing

    Built by Hugging Face as the 'Transformers for robotics'; NVIDIA partnered on it in 2025 to accelerate open-source robot learning.

    • robotics
    • open-source
    • pytorch
    • imitation-learning
    • +1
  • View Papago details
    TranslationFREE

    Papago

    Naver

    Naver's free neural translator, tuned for East Asian languages.

    Papago is Naver's free AI translation app, covering 14 languages with text, voice, image, and handwriting input. Its neural machine translation is especially strong on East Asian pairs — Korean to and from Japanese, English, and Chinese — making it a go-to for travel and study across the region. Available on the web and as iOS and Android apps.

    Worth knowing

    Built by South Korea's Naver and launched in 2017; 'Papago' is Esperanto for 'parrot,' nodding to its language focus.

    • translation
    • korean
    • east-asian
    • mobile
    • +1
  • View SYSTRAN details
    TranslationPAID

    SYSTRAN

    ChapsVision

    Enterprise neural machine translation with on-prem options.

    SYSTRAN is one of the oldest machine-translation companies, now offering Pure Neural Machine Translation across 55+ languages and 150+ language pairs. It translates documents, web pages, and speech with domain-tuned models for legal, financial, defense, pharmaceutical, and industrial use, and supports 30+ file formats. Available as managed SaaS, on-premise server, or private cloud — built for organizations needing confidential, high-volume translation under strict compliance.

    Worth knowing

    Founded 1968; powered Google's language tools until 2007 and Yahoo Babel Fish until 2012; acquired by France's ChapsVision in 2024.

    • translation
    • enterprise
    • neural-mt
    • self-host
    • +1
  • View Confident AI details
    EvalFREEMIUM

    Confident AI

    Confident AI

    The AI quality platform from the team behind DeepEval.

    Confident AI is the hosted platform built on top of DeepEval, the open-source LLM evaluation framework. It adds dataset and test management, research-backed metrics, production tracing and monitoring, adversarial red teaming, and governance dashboards so teams can benchmark, observe, and safeguard LLM apps across the dev-to-prod loop. Python and TypeScript SDKs plug into CI and OpenTelemetry, with managed cloud and enterprise self-hosting.

    Worth knowing

    Builds the open-source DeepEval framework (~2M evals/day) and raised a $2.2M YC-backed seed in Aug 2025.

    • eval
    • observability
    • red-teaming
    • llm-as-judge
    • +1
  • View Akiflow details
    ProductivityPAID

    Akiflow

    Akiflow

    One app for tasks and calendars, powered by AI.

    Akiflow is a keyboard-first daily planner that pulls tasks from email, Slack, Notion, and dozens of other tools into a single inbox, then lets you time-block them onto a unified calendar. Its built-in assistant, Aki, handles AI scheduling, prioritization, daily briefings, and natural-language and voice commands. It runs on macOS, Windows, web, iOS, and Android with real-time sync.

    Worth knowing

    Backed by Y Combinator; the keyboard-first command bar predates the 'Aki' AI assistant, which was layered on later.

    • task-management
    • calendar
    • time-blocking
    • planner
  • View Spinach AI details
    MeetingFREEMIUM

    Spinach AI

    Spinach (StayIn, Inc.)

    An AI meeting assistant built for agile teams.

    Spinach AI joins video calls to record, transcribe, and summarize meetings, then turns them into clear notes, decisions, and action items. Designed for agile teams, it runs standups, retros, and sprint syncs and pushes follow-ups into the tools engineers already use. It integrates with Zoom, Google Meet, Microsoft Teams, and Webex plus 30+ destinations including Slack, Jira, Salesforce, HubSpot, and Notion.

    Worth knowing

    Backed by Y Combinator and strategic investors Zoom and Atlassian; started in 2021 as an AI standup tool for agile engineering teams.

    • meeting-notes
    • scrum
    • standup
    • action-items
  • View Novita AI details
    InferencePAID

    Novita AI

    Novita AI

    One API for 120+ AI models, plus agent sandboxes and GPU cloud.

    Novita AI is an AI and agent cloud for developers that combines serverless model APIs with on-demand compute. A single API serves 120+ text, image, audio, video, and vision models, while Agent Sandbox provides isolated runtimes for tool-using agents and the GPU cloud offers dedicated instances, serverless GPUs, and bare-metal clusters. It advertises sub-50ms time-to-first-token and startup-friendly, usage-based pricing.

    Worth knowing

    An official Hugging Face inference provider, serving open models to Hugging Face's 5M+ developers via a 'Deploy on Novita' experience.

    • inference
    • agent-sandbox
    • gpu-cloud
    • llm-api
  • View DeepInfra details
    InferencePAID

    DeepInfra

    DeepInfra

    Low-cost, pay-as-you-go API access to 100+ AI models.

    DeepInfra is a cloud inference platform that lets developers run open and proprietary models through a simple, OpenAI-compatible API without managing hardware. It serves text generation, embeddings, image/audio/video, and speech models with token-based, pay-as-you-go pricing, and offers DeepCluster dedicated NVIDIA GPU capacity for heavier workloads. It is SOC 2 and ISO 27001 certified with a zero data-retention policy.

    Worth knowing

    Raised a $107M Series B in May 2026 (investors include Nvidia and Samsung Next) and processes roughly 5 trillion tokens a week.

    • inference
    • open-models
    • gpu-cloud
    • llm-api
  • View Cloudflare Vectorize details
    Vector DBFREEMIUM

    Cloudflare Vectorize

    Cloudflare

    A globally distributed vector database built into Cloudflare Workers.

    Vectorize is Cloudflare's vector database for building AI-powered apps on its Workers platform. It stores and queries embeddings for semantic search, recommendation, classification, and RAG, and cross-references results against data in R2, D1, and KV. Embeddings can come from Workers AI or external providers like OpenAI, and indexes are configured via the dashboard, Wrangler CLI, or REST API.

    Worth knowing

    Reached general availability in 2024, when its per-index capacity jumped 25× from 200,000 to 5 million vectors.

    • vector-database
    • rag
    • edge
    • embeddings
  • View Paperguide details
    ResearchFREEMIUM

    Paperguide

    Paperguide

    AI research assistant and reference manager for academics.

    Paperguide is an all-in-one academic research platform combining an AI assistant, reference manager, and chat-with-PDF in one workspace. It searches papers with real citations, runs literature-review workbooks, extracts data into tables, annotates PDFs, and drafts with an AI writer and plagiarism checker. Built for students, educators, and research teams, with a browser extension to save sources.

    • academic-research
    • reference-manager
    • literature-review
    • chat-with-pdf
  • View Neuphonic details
    VoiceFREEMIUMOpen core

    Neuphonic

    Neuphonic

    Ultra-low-latency text-to-speech that runs on-device.

    Neuphonic is a voice-AI company building text-to-speech and voice cloning that run locally with very low latency. Its cloud API targets real-time voice agents, and in October 2025 it open-sourced NeuTTS Air, a 748M-parameter speech language model that runs on CPU via llama.cpp and clones a voice from a few seconds of audio. Aimed at private, offline, and voice-agent use cases.

    Worth knowing

    Open-sourced NeuTTS Air in Oct 2025, an Apache-2.0 748M-param TTS model that runs on CPU and clones a voice from ~3 seconds of audio.

    • text-to-speech
    • voice-cloning
    • on-device
    • open-source
  • View Lamini details
    Fine-tuningPAID

    Lamini

    Lamini

    Enterprise platform to tune and run open LLMs in your own environment.

    Lamini is an enterprise LLM platform for fine-tuning open models and serving them, designed to run on-prem, in a VPC, or on Lamini's cloud — including on AMD GPUs. It pairs tuning (LoRA/PEFT and memory tuning to reduce hallucinations) with an inference stack and agentic pipelines, accessed via a Python client, REST API, or web UI. Built for teams that need to keep models and data in-house.

    Worth knowing

    Co-founded by Greg Diamos, a creator of the MLPerf benchmark, and Stanford's Sharon Zhou; backed by Andrew Ng and Andrej Karpathy.

    • fine-tuning
    • llm
    • enterprise
    • on-prem
  • View Sapling details
    SupportFREEMIUM

    Sapling

    Sapling Intelligence

    AI writing assistant for customer-facing teams.

    Sapling is a language-model toolkit that gives customer-facing teams real-time autocomplete, grammar and tone suggestions, and reusable snippets inside helpdesks and CRMs like Zendesk, Salesforce, and Intercom. It exposes an API and SDK to embed the same suggestions into custom apps, plus a browser extension for everyday writing. The company also ships a widely cited AI-content detector.

    Worth knowing

    Y Combinator (W19) startup founded in 2018 by Stanford NLP researcher Ziang Xie.

    • customer-support
    • autocomplete
    • grammar
    • snippets
  • View Writefull details
    WritingFREEMIUM

    Writefull

    Writefull (Digital Science)

    AI language feedback and editing for academic writing.

    Writefull is an academic writing assistant whose language models are trained on millions of published journal articles, giving discipline-aware language feedback, paraphrasing, and copyediting. It generates titles and abstracts, checks LaTeX, and runs automated language revision over a full manuscript. Works on the web and integrates with Microsoft Word and Overleaf.

    Worth knowing

    Part-owned by Digital Science (owner of Overleaf, Dimensions, and Figshare) since 2018, and fully acquired by it in November 2023.

    • academic-writing
    • editing
    • paraphrasing
    • latex
  • View Songscription details
    AudioFREEMIUM

    Songscription

    Songscription

    Turn any audio into sheet music, MIDI, and guitar tabs with AI.

    Songscription transcribes audio recordings into readable notation — sheet music, MIDI, MusicXML, and GuitarPro tabs — across instruments including piano, guitar, bass, strings, horns, drums, and vocals. Often described as a 'Shazam for sheet music', it automates a task that traditionally took hours by ear. A free tier covers unlimited 30-second clips, with paid plans for longer recordings and more export formats.

    Worth knowing

    Stanford-founded; its model builds on co-founder Tim Beyer's research, backed by Reach Capital through Stanford's StartX accelerator.

    • music-transcription
    • sheet-music
    • midi
  • View IntellCRE details
    Real EstatePAID

    IntellCRE

    IntellCRE

    AI deal-flow engine for commercial real estate — underwrite and market in minutes.

    IntellCRE automates the commercial real estate workflow: it sources listings and market data, runs underwriting and financial modeling, and turns the result into investor-ready marketing collateral. From one deal it can generate brochures, offering memorandums, broker opinions of value, pitch decks, and listing websites on brand. It targets brokers, investors, lenders, and analysts working multifamily and CRE deals.

    Worth knowing

    Runs on a proprietary database of 150M+ U.S. commercial properties, auto-sourcing comps and market data for each deal.

    • cre
    • underwriting
    • deal-marketing