How can I give my Android AI agent full screen control?

Use self-ADB — Android 11+ supports wireless ADB debugging, and you can connect to it from Termux on the same device. No PC needed. The phone connects to itself, unlocking full UI automation: tapping, swiping, typing, screenshots, and app launching.

How do I set up self-ADB in Termux?

Install android-tools (pkg install android-tools), enable Wireless Debugging in Developer Options, pair Termux using 'adb pair' with the pairing code, then connect using 'adb connect' with the device IP:port. Run 'adb devices' to verify connection.

What can an AI agent do with self-ADB on Android?

With self-ADB, your AI can tap/swipe the screen ('adb shell input tap'), take screenshots ('adb shell screencap'), launch any app ('adb shell am start'), type into apps ('adb shell input text'), and automate Chrome via intents: nearly everything a human can do by touching the screen.

How do I keep self-ADB working after reboot?

Wireless debugging disconnects on reboot, and the connection port usually changes when it is switched back on. Create a reconnect script that runs 'adb connect' with the current device IP:port. You may need to re-pair if the pairing expired. Also disable battery optimization for Termux to prevent Android from killing it.

What is mobile-use and how does it compare to self-ADB?

mobile-use is a polished open-source alternative that controls Android/iOS via natural language from a host machine. It's easier to set up (Docker one-liner) and extracts structured data (JSON), but requires a separate computer. Self-ADB is fully self-contained on the phone with lower latency.

What hardware do I need for an autonomous Android AI agent?

Any Android 11+ phone works. The author uses a Google Pixel 7 Pro (128GB, 12GB RAM) purchased used for $275. Install Termux from F-Droid (not Play Store), Termux:API, and OpenClaw. Keep the phone plugged in for 24/7 operation.

Why doesn't Chrome extension relay or Playwright work on Android?

Chrome extension relay requires desktop Chrome — Android Chrome doesn't support extensions. Playwright/Puppeteer need a full browser binary, and running Chromium in proot-distro is slow and fragile. Self-ADB with Chrome intents is simpler and more reliable.

How I Turned an Android Phone into a Fully Autonomous AI Agent

By Prahlad Menon | Published 2026-02-27 | 5 min read

This guide assumes you’ve already set up OpenClaw on your Android phone using Termux. If you haven’t, start with my comparison guide: AI Agents on Android: DroidClaw vs OpenClaw vs Termux Options Compared.

Where this guide picks up: You have OpenClaw running in Termux, you can chat with your AI via Telegram, and basic Termux:API commands work (camera, location, battery status). But you’ve hit a wall — how do you get the AI to actually control apps? Tap buttons? Navigate Chrome? Take screenshots of what’s on screen?

That’s what this guide solves.

The Problem

Out of the box, OpenClaw in Termux can:

✅ Chat via Telegram/WhatsApp/Signal
✅ Search the web
✅ Access phone hardware via Termux:API (camera, GPS, SMS, sensors)
✅ Run shell commands and scripts

But it cannot:

❌ Tap or swipe on the screen
❌ Control apps like Chrome, YouTube, or WhatsApp
❌ Take screenshots of what’s currently displayed
❌ Type into arbitrary apps

These require ADB access — and most guides assume you’re connecting from a PC. But what if the phone needs to control itself?

The Breakthrough: Self-ADB

Here’s what most people don’t realize: Android 11+ supports wireless ADB debugging, and you can connect to it from Termux running on the same device.

No PC. No second phone. The phone connects to itself.

This unlocks full UI automation: tapping, swiping, typing, screenshots, launching apps — everything ADB can do, available to your AI agent.

Prerequisites

Before continuing, make sure you have:

Termux installed from F-Droid (not Play Store)
OpenClaw installed and running (npm install -g openclaw && openclaw setup)
Termux:API app installed from F-Droid
Termux:API package installed (pkg install termux-api)
OpenClaw connected to Telegram (or your preferred chat platform)

If any of these aren’t set up, see the setup comparison guide first.

Step 1: Install ADB in Termux

Open Termux and run:

pkg install android-tools

This gives you the adb command inside Termux.

Step 2: Enable Wireless Debugging

On your phone:

1. Go to Settings → About Phone
2. Tap “Build Number” 7 times to enable Developer Options
3. Go to Settings → Developer Options
4. Scroll down and enable Wireless debugging

Step 3: Pair Termux to Your Own Phone

This is the magic step.

1. In Wireless debugging settings, tap “Pair device with pairing code”
2. You’ll see an IP:port (like 192.168.1.42:37123) and a 6-digit code
3. In Termux, run:

adb pair 192.168.1.42:37123

4. Enter the 6-digit code when prompted

You should see: Successfully paired
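If you would rather script the pairing step, `adb pair` also accepts the code as a second argument. Here is a minimal sketch; `pair_self` is a hypothetical helper name, and the checks only catch obvious typos before adb is called.

```shell
# Hypothetical helper: validate the pairing endpoint and code, then pair.
# Usage: pair_self 192.168.1.42:37123 123456
pair_self() {
    local endpoint code port
    endpoint=$1
    code=$2

    # Expect HOST:PORT with a numeric port.
    case $endpoint in
        *:*) port=${endpoint##*:} ;;
        *)   echo "expected HOST:PORT, got '$endpoint'" >&2; return 2 ;;
    esac
    case $port in
        ''|*[!0-9]*) echo "port must be numeric: '$port'" >&2; return 2 ;;
    esac

    # The pairing code shown on the Wireless debugging screen is six digits.
    case $code in
        [0-9][0-9][0-9][0-9][0-9][0-9]) ;;
        *) echo "pairing code must be 6 digits" >&2; return 2 ;;
    esac

    adb pair "$endpoint" "$code"
}
```

The pairing port and code change every time you open the pairing dialog, so both are passed as arguments rather than hard-coded.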

Step 4: Connect

1. Go back to the main Wireless debugging screen
2. Note the IP:port shown under your device name (this is different from the pairing port)
3. In Termux, run:

adb connect 192.168.1.42:41234
adb devices

You should see your device listed as device (not offline or unauthorized).
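An agent cannot eyeball the `adb devices` output, so it helps to turn that check into something a script can test. A small sketch, assuming the usual two-column output (serial, then state); `device_state` and `is_ready` are hypothetical helper names.

```shell
# Read `adb devices` output on stdin and print the state reported for one
# serial ("device", "offline", "unauthorized"), or nothing if it is not listed.
device_state() {
    awk -v serial="$1" '$1 == serial { print $2 }'
}

# Succeeds only when the given endpoint is listed with state "device".
is_ready() {
    [ "$(adb devices | device_state "$1")" = "device" ]
}
```

For example, `is_ready 192.168.1.42:41234 || echo "not connected"` gives the agent a clear signal before it tries any screen commands.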

That’s it. Termux now has full ADB control over your phone.

What This Unlocks (Behind the Scenes)

Now your AI agent can execute these commands automatically when you chat with it. You don’t type these — the AI runs them behind the scenes when you make requests via Telegram.

Screen Control

When you say “Tap the center of the screen”, the AI runs (coordinates shown for a 1080×1920 display):

adb shell input tap 540 960
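The point 540 960 is only the center on a 1080×1920 display. One way to avoid hard-coding it is to read the resolution from `adb shell wm size` and compute the midpoint. This is a sketch; `screen_center` and `tap_center` are hypothetical helper names.

```shell
# Read `adb shell wm size` output on stdin and print "X Y" for the screen center.
# If an "Override size" line follows "Physical size", the override wins.
screen_center() {
    awk -F': ' '
        /size:/ { split($2, d, "x"); w = d[1]; h = d[2] }
        END     { if (w > 0 && h > 0) print int(w / 2), int(h / 2) }
    '
}

# Tap the center of whatever screen this device has.
tap_center() {
    set -- $(adb shell wm size | tr -d '\r' | screen_center)
    if [ "$#" -eq 2 ]; then
        adb shell input tap "$1" "$2"
    else
        echo "could not read screen size" >&2
        return 1
    fi
}
```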

When you say “Swipe up” (which scrolls the page down), it runs:

adb shell input swipe 540 1500 540 500

When you say “Press the back button”, it runs:

adb shell input keyevent 4

When you say “Type ‘hello world’”, it runs:

adb shell input text "hello%sworld"

(adb shell input text does not accept literal spaces, so each space is written as %s.)
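The space substitution is easy to wrap in a helper so the agent can pass ordinary text. A minimal sketch with hypothetical function names; it handles spaces only.

```shell
# Encode a string for `adb shell input text`: spaces become %s.
# Only spaces are handled here; other shell-special characters
# (quotes, &, ;, parentheses) would still need escaping for the device shell.
encode_input_text() {
    printf '%s' "$1" | sed 's/ /%s/g'
}

# Type a phrase into whichever field currently has focus.
type_text() {
    adb shell input text "$(encode_input_text "$1")"
}
```

With this in place, `type_text "hello world"` sends the same command as the example above.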

Browser Control

When you say “Open themenonlab.com in Chrome”, the AI runs:

adb shell am start -a android.intent.action.VIEW \
    -d "https://www.themenonlab.com" \
    -n com.android.chrome/com.google.android.apps.chrome.Main

When you say “Open YouTube and search for lofi beats”, it runs:

adb shell am start -a android.intent.action.VIEW \
    -d "https://www.youtube.com/results?search_query=lofi+beats"
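Search URLs like this one need the query encoded before they go into the intent. Here is a rough sketch of how that could be done in plain shell; `urlencode_query` and `youtube_search_url` are hypothetical helpers, and the encoder assumes ASCII input.

```shell
# Percent-encode a search phrase for a URL query string (spaces become "+").
# Sketch for ASCII input; non-ASCII text would need byte-wise encoding.
urlencode_query() {
    local s c out
    s=$1
    out=
    while [ -n "$s" ]; do
        c=${s%"${s#?}"}      # first character
        s=${s#?}             # rest of the string
        case $c in
            [a-zA-Z0-9._~-]) out=$out$c ;;
            ' ')             out=$out+ ;;
            *)               out=$out$(printf '%%%02X' "'$c") ;;
        esac
    done
    printf '%s\n' "$out"
}

# Build the YouTube results URL for a search phrase.
youtube_search_url() {
    printf 'https://www.youtube.com/results?search_query=%s\n' "$(urlencode_query "$1")"
}
```

`youtube_search_url "lofi beats"` produces the same URL used in the command above, ready to pass to `am start -d`.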

Screenshots

When you say “Take a screenshot and send it to me”, the AI runs:

adb shell screencap -p /data/local/tmp/screen.png
adb pull /data/local/tmp/screen.png ~/screen.png

Then it sends screen.png to you via Telegram.
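As an alternative to the two-step capture and pull, `adb exec-out` can stream the PNG straight into a file in Termux. A sketch, with `capture_screen` as a hypothetical helper name and a timestamped default filename of my own choosing:

```shell
# Capture the screen directly into a local file with `adb exec-out`,
# skipping the temporary file on the device. Prints the path on success.
capture_screen() {
    local dest
    dest=${1:-$HOME/screen-$(date +%Y%m%d-%H%M%S).png}
    adb exec-out screencap -p > "$dest" || return 1
    # An empty file usually means the capture failed (for example, adb is disconnected).
    [ -s "$dest" ] || { rm -f "$dest"; return 1; }
    printf '%s\n' "$dest"
}
```

Because the function prints the saved path, the agent can hand that path directly to whatever sends the image to Telegram.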

App Launching

When you say “Open Settings”, it runs:

adb shell am start -a android.settings.SETTINGS

When you say “Open the camera app”, it runs:

adb shell am start -a android.media.action.IMAGE_CAPTURE
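Not every app has a convenient intent action. When you only know the package name, one common approach is to let the `monkey` tool resolve the launcher activity. The helpers below are a sketch with hypothetical names.

```shell
# Launch an app's main activity by package name using the `monkey` tool,
# which looks up the LAUNCHER activity for the package.
launch_app() {
    adb shell monkey -p "$1" -c android.intent.category.LAUNCHER 1
}

# List installed package names, stripping the "package:" prefix that pm prints.
list_packages() {
    adb shell pm list packages | tr -d '\r' | sed 's/^package://'
}
```

For example, `list_packages | grep chrome` finds the package, and `launch_app com.android.chrome` opens it.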

Keeping It Working

Reconnect After Reboot

Wireless debugging disconnects when the phone restarts. Add a reconnect script:

cat > ~/reconnect-adb.sh << 'EOF'
#!/data/data/com.termux/files/usr/bin/bash
# Update the IP:port to match your Wireless Debugging screen
adb connect 192.168.1.42:41234
EOF
chmod +x ~/reconnect-adb.sh

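After a reboot, re-enable Wireless debugging if Android turned it off and check the IP:port on the Wireless debugging screen, because the port usually changes each time the feature is switched on. Update the script if needed, then run ~/reconnect-adb.sh. You may need to re-pair if the pairing expired.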
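A single `adb connect` can fail if Wi-Fi is still coming up, so a retry loop is a reasonable extension of the script. This sketch takes the endpoint as an argument, since the port may differ from the one saved in the script. It checks the printed message rather than the exit status, on the assumption that some adb versions exit 0 even when the connection fails. `reconnect_adb` is a hypothetical name.

```shell
# Retry `adb connect` until adb reports a connection or attempts run out.
# Usage: reconnect_adb HOST:PORT [ATTEMPTS] [DELAY_SECONDS]
reconnect_adb() {
    local endpoint attempts delay i out
    endpoint=$1
    attempts=${2:-5}
    delay=${3:-3}
    i=1
    while [ "$i" -le "$attempts" ]; do
        out=$(adb connect "$endpoint" 2>&1)
        case $out in
            # Matches both "connected to ..." and "already connected to ...".
            *"connected to $endpoint"*) echo "connected on attempt $i"; return 0 ;;
        esac
        echo "attempt $i failed: $out" >&2
        sleep "$delay"
        i=$((i + 1))
    done
    return 1
}
```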

Auto-Start OpenClaw

If you haven’t already, install Termux:Boot from F-Droid:

mkdir -p ~/.termux/boot
cat > ~/.termux/boot/start-openclaw.sh << 'EOF'
#!/data/data/com.termux/files/usr/bin/bash
sleep 15
openclaw gateway start
EOF
chmod +x ~/.termux/boot/start-openclaw.sh
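A slightly more defensive variant of the boot script might hold a wake lock and log what happened, which makes failed starts easier to diagnose. This is a sketch, not the script I run: `BOOT_DELAY` and `LOG_FILE` are hypothetical variables, not OpenClaw settings, while `termux-wake-lock` is the standard Termux command.

```shell
# Hypothetical knobs for this sketch; override them in the environment if needed.
BOOT_DELAY=${BOOT_DELAY:-15}
LOG_FILE=${LOG_FILE:-$HOME/openclaw-boot.log}

boot_sequence() {
    # Ask Android to keep the CPU awake for Termux.
    termux-wake-lock
    # Give the network time to come up before starting the gateway.
    sleep "$BOOT_DELAY"
    {
        echo "boot: starting gateway at $(date)"
        openclaw gateway start
    } >> "$LOG_FILE" 2>&1
}
```

To use it from Termux:Boot, the body of the boot script would define these and end with a call to `boot_sequence`.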

Battery Optimization

Prevent Android from killing Termux:

Settings → Apps → Termux → Battery → Unrestricted
Settings → Apps → Termux:API → Battery → Unrestricted
Settings → Battery → Battery Saver → OFF (when plugged in 24/7)
Developer Options → Stay awake while charging → ON
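Once self-ADB is connected, some of these settings can probably be applied from the shell as well. The sketch below rests on an assumption: the Doze allow-list is close to, but not the same thing as, the "Unrestricted" battery mode, so check the result in Settings afterwards. `keep_alive_settings` is a hypothetical name.

```shell
# Apply keep-alive settings over ADB instead of tapping through Settings.
# Assumption: the Doze allow-list approximates the "Unrestricted" battery mode.
keep_alive_settings() {
    # Exempt Termux and Termux:API from Doze.
    adb shell dumpsys deviceidle whitelist +com.termux
    adb shell dumpsys deviceidle whitelist +com.termux.api
    # Stay awake while charging: 3 = AC (1) + USB (2).
    adb shell settings put global stay_on_while_plugged_in 3
}
```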

The Full Picture

With self-ADB working, your AI agent now has:

Capability	How
Chat anywhere	Telegram/WhatsApp via OpenClaw
Web search	Brave Search API
Take photos	`termux-camera-photo`
Get location	`termux-location`
Send SMS	`termux-sms-send`
Tap/swipe/type	`adb shell input` ← NEW
Screenshots	`adb shell screencap` ← NEW
Control any app	`adb shell am start` ← NEW
Browser automation	Chrome via intents ← NEW

The Termux:API gives you hardware access. Self-ADB gives you UI control. Together, your AI can do almost anything a human can do with the phone.
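Tapping with `adb shell input` needs coordinates, and so far they have been hard-coded. One possible way to look them up is `adb shell uiautomator dump`, which writes the current view hierarchy as XML, including a `bounds` attribute for each element. The helpers below are a rough sketch with hypothetical names. The text match is a plain substring search, not a real XML parser.

```shell
# Print "X Y" for the center of a uiautomator bounds string such as "[0,100][200,300]".
bounds_center() {
    printf '%s\n' "$1" | tr -c '0-9\n' ' ' | awk '
        NF == 4 { print int(($1 + $3) / 2), int(($2 + $4) / 2) }'
}

# Read a UI dump on stdin, find the first node whose text attribute equals $1,
# and print that node's bounds attribute.
bounds_for_text() {
    tr '>' '\n' | grep -F "text=\"$1\"" | head -n 1 |
        sed -n 's/.*bounds="\([^"]*\)".*/\1/p'
}

# Example flow on the device (not run here):
#   adb shell uiautomator dump /sdcard/window_dump.xml
#   b=$(adb shell cat /sdcard/window_dump.xml | bounds_for_text "Send")
#   adb shell input tap $(bounds_center "$b")
```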

What Didn’t Work (And Why)

Chrome Extension Relay — Requires desktop Chrome with the extension. Android Chrome doesn’t support extensions.

Playwright/Puppeteer — Need a full browser binary. Running Chromium in proot-distro is slow and fragile. Self-ADB with Chrome intents is simpler and more reliable.

Accessibility Services — Require a proper Android app, not Termux. More setup complexity than self-ADB.

Alternative: mobile-use (If You Want a Ready-Made Solution)

Updated March 2026

If setting up self-ADB feels like too much work, there’s now a polished open-source alternative: mobile-use from Minitap AI.

mobile-use is an AI agent that controls Android and iOS devices using natural language. It understands your commands and interacts with the UI to perform tasks — from sending messages to navigating complex apps.

Key differences from the DIY approach:

Feature	Self-ADB (this guide)	mobile-use
Setup	Manual pairing, scripts	Docker one-liner or Python install
iOS support	❌ No	✅ Yes (simulators)
AI integration	You build it	Built-in multi-LLM support
Data extraction	Manual screencap + vision	Native structured output (JSON)
Benchmark	DIY	#1 on AndroidWorld (100% completion)

Quick start with Docker:

# Linux/macOS (device connected via USB or emulator running)
bash ./mobile-use.sh \
  "Open Gmail, find first 3 unread emails, and list their sender and subject line" \
  --output-description "A JSON list of objects, each with 'sender' and 'subject' keys"

The agent will navigate Gmail, extract the data, and return structured JSON — no screenshots, no parsing, no manual automation.

Where mobile-use shines:

Complex UI navigation (finding comment boxes, tapping dynamic elements)
Apps with unresponsive elements that don’t work with coordinate-based taps
Data extraction that needs LLM reasoning about what’s on screen
When you’re already running an agent on a separate machine anyway

Where self-ADB is still better:

Fully self-contained on the phone (no separate computer needed)
Simple tasks that already work well (open URLs, screenshots, basic typing)
Lower latency — no network round-trip to a host machine
Integration with on-device agents like OpenClaw or Clawdbot in Termux

The key constraint: mobile-use requires a desktop/server running the Python agent, which connects to your phone via ADB. If you want the phone to control itself without external dependencies, self-ADB is still the way.

mobile-use on GitHub | Benchmark results | Research paper

Why This Matters

Most guides stop at “chat with your AI from Telegram.” That’s useful, but limited.

Self-ADB turns your phone into a fully autonomous agent. Need to check a website that requires JavaScript? Open it in Chrome, wait, screenshot. Need to send a WhatsApp message to someone not in your API contacts? Open the app, navigate, type, send.
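The "open it in Chrome, wait, screenshot" flow can be strung together from the commands shown earlier. A minimal sketch: `snapshot_url` is a hypothetical helper, the 8-second default wait is a guess, and the URL is wrapped in single quotes for the device shell, so it must not contain single quotes itself.

```shell
# Open a URL in Chrome, wait for it to render, then screenshot it.
# Usage: snapshot_url URL [OUTPUT_FILE] [WAIT_SECONDS]
snapshot_url() {
    local url out wait_s
    url=$1
    out=${2:-$HOME/page.png}
    wait_s=${3:-8}

    # Single quotes keep characters like & from being interpreted on the device.
    adb shell am start -a android.intent.action.VIEW -d "'$url'" \
        -n com.android.chrome/com.google.android.apps.chrome.Main >/dev/null || return 1
    sleep "$wait_s"    # fixed delay; a slow page may need longer
    adb shell screencap -p /data/local/tmp/page.png || return 1
    adb pull /data/local/tmp/page.png "$out" >/dev/null || return 1
    printf '%s\n' "$out"
}
```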

The AI now has hands.

That old phone in your drawer? It just became a 24/7 autonomous agent that can control itself.

My setup: Google Pixel 7 Pro (128GB, 12GB RAM) — purchased used for $275

Previous guide: AI Agents on Android: Comparing Your Options

Questions? Find me on X @themedcave