Ask me what skills you need
What are you building?
Tell me what you're working on and I'll find the best agent skills for you.
Your eyes, hands, and ears on Android. See the screen (screenshot + indexed UI tree), interact (tap, swipe, scroll, type, clear-field), navigate via deep links (bypass CJK text input limits), wait for UI state changes instead of polling, monitor live UI text via accessibility framework (works during video playback), capture system audio (Android 11+, WAV stream for piping to ASR tools), manage full app lifecycle (install/uninstall/clear), control screen (on/off/unlock/rotation), run shell commands, and transfer files. Agent-optimized: structured JSON output, indexed element targeting, and App Profiles with pre-built deep links and layouts for popular apps.
Your eyes, hands, and ears on Android. See what's on screen, tap any element, scroll through content, open deep links, wait for UI changes, capture system audio, manage apps, and more — all through a single CLI with structured JSON output.
Superpowers — What You Can't Get Elsewhere:
monitor connects to Android's accessibility framework, reading all UI text in real-time — even during video playback and live streams where uiautomator dump hangs. Chat messages, captions, dynamic overlays — data no other tool exposes to agents.audio capture records device audio via REMOTE_SUBMIX (Android 11+); streams WAV to stdout for piping to ASR tools. Combined with monitor, you get full sensory coverage: visual text + audio.Core Strengths:
observe returns screenshot + indexed UI tree in one call; use element indices to target precisely across any screen sizeadb input text can't type Chinese/Japanese/Korean; adb-claw open 'app://search?keyword=中文' cannpx skills add LeoYeAI/openclaw-master-skills --skill adb-claw