Describe what you want in plain English. AppClaw figures out what to tap, type, and swipe — on real Android and iOS devices. Run it from the CLI or install the extension in Cursor, Visual Studio Code, and other VS Code–compatible editors from the Marketplace.
CLI
(npx appclaw) for terminals and CI, or the
Visual Studio Marketplace
extension for Cursor, VS Code, and forks that load the same extension catalog.
Add parallel: 4 and AppClaw runs your flow across every device at once — each with its own screen, step tracking, and pass/fail result.
AI agent for complex goals, YAML flows for repeatable tests, or an interactive playground to build live.
Describe a goal in plain English. AppClaw reads the screen, reasons, and executes until done.
Write plain English YAML. No AI needed at runtime. Fast, repeatable, with vision fallback.
Live REPL on a real device. Type commands, watch them execute, export to YAML.
No boilerplate, no selectors, no brittle locators. Just describe what you want.
Write what you want in plain English. AppClaw accepts goals as prompts, YAML files, or playground commands.
AppClaw screenshots the screen, sends it to your LLM, and receives a structured plan. Handles popups, adapts to state.
Actions run via Appium through MCP. AppClaw verifies after every step and continues until the goal is achieved.
AppClaw is an intelligent orchestrator. All device interaction flows through the open MCP protocol — no vendor lock-in, no proprietary drivers.
Open source, extensible, ready to use. Bring your own LLM key and start in under a minute — from the terminal or your favorite IDE.