Skip to content

AutomationAskUI

AskUI

Vision-based agents that automate any UI across desktop, mobile, and web.

Categories
AutomationVision
Pricing
PAID
Hosting
Hybrid
Platforms
macOSWindowsLinuxAPI
Models
Multi-model
Verified
Jun 14, 2026

AskUI builds and deploys computer-use agents that visually detect and operate on-screen elements across operating systems — desktop, mobile, web, embedded, and automotive HMIs — without relying on selectors or accessibility trees. Its AgentOS runtime and Python SDK let teams automate UI testing and workflows and route between models like Claude, Gemini, and OpenAI. It is aimed at enterprise automation and QA.

Pros & cons

  • Vision-based, no selectors needed
  • Cross-platform incl. embedded/HMI
  • Model-agnostic (Claude/Gemini/OpenAI)
  • Python SDK for automation and QA
  • Enterprise-grade deployment
  • Enterprise focus, niche audience
  • Pricing not fully public
  • Smaller funding than rivals
  • Heavier setup than no-code RPA

Tags

View all Automation
  • View Skyvern details
    AutomationFREEMIUMOpen core

    Skyvern

    Skyvern

    Automate browser-based workflows on any website with AI.

    An AI agent that completes browser workflows — form fills, logins, data extraction, multi-step flows — by combining computer vision with LLMs rather than hand-written selectors, so a single agent generalizes across sites it has never seen. Run it via the cloud app and API or self-host the open-source engine; bring your own model (OpenAI, Anthropic, Gemini, or local Ollama).

    Worth knowing

    A Y Combinator S23 startup founded by ex-Faire and ex-Lyft engineers Suchintan Singh and Shuchang Zheng.

    • browser-automation
    • computer-vision
    • open-source
    • agents
    • +1
  • View Stagehand details
    AutomationFREEOSS

    Stagehand

    Browserbase

    Open-source SDK for building reliable AI browser agents.

    Stagehand is an open-source (MIT) SDK from Browserbase for building browser agents in TypeScript or Python. It exposes act(), extract(), and observe() primitives so you can drive a page with natural language while keeping deterministic code wherever you need it. v3 dropped its hard Playwright dependency for a modular Chrome DevTools Protocol driver.

    Worth knowing

    Built by Browserbase; its v3 dropped the hard Playwright dependency for a CDP-native driver, ~44% faster on complex DOM work.

    • browser-automation
    • web-agents
    • sdk
    • open-source
  • View Browse AI details
    AutomationFREEMIUM

    Browse AI

    Browse AI

    Scrape and monitor data from any website with no code.

    Browse AI lets you train a 'robot' by pointing at the data you want on a web page; it learns the pattern and then extracts that data on demand, on a schedule, or watches the page and alerts you when it changes. Results export to Google Sheets, Airtable, CSV/JSON, or a REST API, and connect onward through Zapier and Make. Built-in change monitoring and scheduled alerts make it suited to price tracking, job and listing alerts, and competitor watching.

    • web-scraping
    • monitoring
    • no-code
    • automation
    • +1
  • View Bardeen details
    AutomationFREEMIUM

    Bardeen

    Bardeen

    AI browser automation for scraping, enriching, and reaching leads.

    A no-code automation platform built as a Chrome extension. Bardeen scrapes data from any website you can open, qualifies and enriches it with AI, and pushes results into tools like Google Sheets, Airtable, and Notion. Its AI builder turns plain-English descriptions into reusable playbooks, and the product is now focused on sales and GTM workflows — lead sourcing, qualification, and contact enrichment.

    Worth knowing

    Raised a $15.3M Series A in 2022 led by Insight Partners, later adding GTM backing from Dropbox and HubSpot.

    • browser-automation
    • scraping
    • no-code
    • lead-enrichment
    • +1