Skip to content

Automation Skill

Once you’ve enabled automation, your AI bot gains the ability to control your phone’s UI using Python and uiautomator2. This page explains how to use this skill.

The automation skill allows your bot to:

  • Open and navigate apps
  • Tap buttons and menu items
  • Input text (including Chinese/CJK)
  • Read screen content
  • Perform complex multi-step workflows

Before using automation commands, ensure:

  1. Shizuku is running
  2. u2 service is running

Simply tell your bot what you want to do on your phone. For example:

  • “Open WeChat and send ‘Hello’ to John”
  • “Take a screenshot of my current screen”
  • “Open Settings and check my battery status”
  • “Post a photo to Instagram with caption ‘Beautiful sunset’”

The bot will break down complex tasks into step-by-step actions, observing the screen after each step.

The automation follows an observe → act → observe pattern:

  1. Observe: Read current screen state (which app, what’s visible)
  2. Act: Perform one action (tap, type, swipe)
  3. Observe: Check the result before next action

This step-by-step approach ensures reliability, especially for dynamic app UIs.

  • Open apps by name
  • Tap buttons, menu items, icons
  • Swipe and scroll
  • Press back/home buttons
  • Type in search boxes, chat inputs, forms
  • Supports Chinese and other languages via FastInputIME
  • Clear and replace text
  • Get current app and activity
  • Check if elements exist
  • Read text content from screen
"Open Twitter"

The bot will find and launch the Twitter app.

"Open WhatsApp, find the chat with Mom, and send 'I'll be home for dinner'"

The bot will:

  1. Open WhatsApp
  2. Search for Mom’s chat
  3. Tap to open the conversation
  4. Type the message
  5. Tap send
"Open Settings and tell me my phone's Android version"

The bot will navigate to About Phone and read the version info.

Instead of “post something on social media”, say “open Instagram, create a new post with the most recent photo, and add the caption ‘Sunday vibes’”.

Complex flows take time. The bot waits for screens to load and verifies each step.

If something goes wrong, the bot will try to recover or ask for guidance.

  • No password entry: For security, avoid asking the bot to enter passwords
  • App-specific quirks: Some apps have custom UIs that may need special handling
  • Network-dependent: App actions require working internet connection
  • Screen must be on: Automation requires an active display

Go to Automation Panel and tap “Start u2 Service”.

  • Make sure the target app is fully loaded
  • Try being more specific about what to tap
  • Check if the app needs to be updated
  • The bot will automatically try different input methods
  • For Chinese text, FastInputIME is used for reliability
  • Say “stop” or “cancel” to interrupt
  • Check if the app is showing a popup or dialog