Built for QA, mobile developers, and automation teams who need real device control, not brittle scripts. AppClaw is an on-device AI agent: describe a goal in plain English — it reads the screen, plans the next action, and runs it on Android and iOS.
Works with Claude, GPT-4, Gemini, Groq, and Ollama
Works with any LLM provider
Every step follows the same agentic loop until the goal is done or max steps are reached.
Reads the live screen via appium-mcp's page source. Parses native XML into a structured list of UI elements.
Sends the goal + current screen state to an LLM. Gets back a JSON action decision — what to tap, type, or swipe next.
Executes the chosen action via appium-mcp tools — click, set_value, scroll, swipe, launch, and 26 more.
Loops back to step 1 with the new screen state. Continues until the goal is achieved or max steps reached.
AppClaw is the decision-maker. appium-mcp handles the device. Clean separation via MCP protocol.
AppClaw consumes appium-mcp's 32 tools over stdio or SSE. It never touches the device directly — it only decides which tool to call next.
Tap, type, swipe, screenshot, launch, install, and more
Swap between Claude, GPT-4, Gemini, Groq, or local Ollama
Complex multi-app goals are split into sequential sub-goals
USB, emulator, simulator, or remote device farms via SSE
Built-in intelligence for the hardest parts of mobile automation.
From simple taps to multi-step compound operations. Smart typing detects non-editable wrappers and finds the real input. submit_message works across WhatsApp, Telegram, Slack, and more.
Record any goal execution and replay it without LLM costs. The replayer doesn't blindly repeat coordinates — it reads the current screen, matches elements, and adapts to layout changes.
Complex multi-app tasks are automatically broken into sequential sub-goals. "Copy the weather and send it on Slack" becomes 4 focused steps, each tracked and executed independently.
Install, add your LLM credentials, connect an Android or iOS device, and run. Four steps to your first AI-driven goal on real hardware or simulators.
Requires Node.js 18+. Install AppClaw globally from npm.
Create a .env file and set your provider and API key. Works with Anthropic, OpenAI, Google, Groq, or local Ollama. AppClaw is free to run; you bring your own key and pay your LLM provider per their pricing (or use a local model).
Plug in an Android phone or iPhone, or use an Android emulator / iOS Simulator. Verify Android with adb devices; for iOS, use your usual Xcode / Appium pairing. Then start the Appium server.
Pass any goal in plain English, run a declarative YAML flow, or use interactive mode. AppClaw connects to your device and executes autonomously.
Open source under Apache 2.0 — no license fee for AppClaw. You supply your own LLM API key (provider charges apply) or run a local model. Automate Android and iOS apps with AI today.
Apache 2.0 · BYO LLM key · Community driven