Skip to content

feat(android): add UIAutomator hierarchy dump, parsing, and agent tool#251

Open
mlikasam-askui wants to merge 2 commits intomainfrom
feat/android-uiautomator-hierarchy-tool
Open

feat(android): add UIAutomator hierarchy dump, parsing, and agent tool#251
mlikasam-askui wants to merge 2 commits intomainfrom
feat/android-uiautomator-hierarchy-tool

Conversation

@mlikasam-askui
Copy link
Contributor

Summary

Dump the current screen with uiautomator dump, parse the XML into a flat list of views (text, ids, content-desc, bounds, tap centers), and expose it as AndroidGetUIAutomatorHierarchyTool for agents when screenshots are weak or you want structured UI data.

Notes

  • Wired via get_ui_elements() on Android AgentOs / PpAdbAgentOs and facade.
  • Includes pdm.lock updates.

Add UIElement and UIElementCollection to parse UIAutomator window-dump XML
from normalized shell output (bounds, text, resource-id, content-desc,
clickable, etc.).

Expose get_ui_elements() on Android AgentOs and implement it in the facade
and PpAdb path so callers get a flattened hierarchy string.

Register AndroidGetUIAutomatorHierarchyTool in the Android tool store for
act flows that need structure instead of screenshots.

Refresh pdm.lock for the otel dependency group and OpenTelemetry-related
package updates.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant