Skip to content

AutomationAmazon

Amazon Nova Act

Build reliable AI agents that automate UI workflows in a browser.

Categories
AutomationAgent
Pricing
FREEMIUM
Hosting
Cloud
Models
Single model (proprietary)
Verified
Jun 21, 2026

An Amazon model and SDK for building agents that take actions in a web browser, breaking workflows into small, testable natural-language steps for reliability at scale. Developers script and chain browser actions in Python, and run them as production UI-automation agents. Available as an AWS service after a 2025 research preview.

Pros & cons

  • Reliability-focused, step-by-step action model
  • Python SDK to script and chain browser actions
  • Native AWS integration and scaling
  • Reports ~90% success on UI workflows
  • Tied to the AWS ecosystem
  • Usage-based pricing at production scale
  • Several founding Adept leads have since left

Tags

Further reading

View all Automation
  • View Browser Use details
    AgentFREEMIUMOpen core

    Browser Use

    Browser Use

    Open-source browser automation for AI agents.

    MIT-licensed Python framework that connects AI agents to a real browser to navigate, fill forms, extract data, and run multi-step web workflows. It parses the live DOM into a structured view so any model — Claude, GPT, Gemini, Qwen, DeepSeek — can act on it. A hosted Browser Use Cloud adds managed sessions and a fully-hosted agent.

    Top open-source WebVoyager score (~89%)
    Success drops on bot-protected sites
    • browser-automation
    • open-source
    • web-agent
    • playwright
  • View Skyvern details
    AutomationFREEMIUMOpen core

    Skyvern

    Skyvern

    Automate browser-based workflows on any website with AI.

    An AI agent that completes browser workflows — form fills, logins, data extraction, multi-step flows — by combining computer vision with LLMs rather than hand-written selectors, so a single agent generalizes across sites it has never seen. Run it via the cloud app and API or self-host the open-source engine; bring your own model (OpenAI, Anthropic, Gemini, or local Ollama).

    Vision+LLM, no hand-written selectors
    AGPL-3.0 limits commercial reuse
    • browser-automation
    • computer-vision
    • open-source
    • agents
    • +1
  • View Stagehand details
    AutomationFREEOSS

    Stagehand

    Browserbase

    Open-source SDK for building reliable AI browser agents.

    Stagehand is an open-source (MIT) SDK from Browserbase for building browser agents in TypeScript or Python. It exposes act(), extract(), and observe() primitives so you can drive a page with natural language while keeping deterministic code wherever you need it. v3 dropped its hard Playwright dependency for a modular Chrome DevTools Protocol driver.

    Open source (MIT)
    SDK, not a no-code tool
    • browser-automation
    • web-agents
    • sdk
    • open-source
  • View Browserbase details
    InfraFREEMIUM

    Browserbase

    Browserbase

    Headless browser infrastructure for AI agents.

    Managed cloud fleet of headless browsers that let AI agents browse, authenticate, and act on the web at scale. Sessions ship with stealth proxies, automated CAPTCHA solving, and observability, driven via API or the open-source Stagehand framework. Usage-based billing on top of a monthly base plan.

    Stealth proxies + CAPTCHA solving built in
    Usage-based cost on a monthly base
    • browser-automation
    • agents
    • headless-browser
    • web-scraping