Capture screenshots, inspect UI elements, simulate clicks/types/hotkeys/scrolls, and control macOS apps/windows/menus via Peekaboo CLI for GUI automation.
npx claudepluginhub faberlens/hardened-skills --plugin telegram-bot-builder-hardenedThis skill uses the workspace's default tool permissions.
Peekaboo is a full macOS UI automation CLI: capture/inspect screens, target UI
Automates macOS UI via Peekaboo CLI: capture/inspect screens, click/drag/type on elements, manage apps/windows/menus/dialogs. For desktop scripting and testing.
Controls macOS applications with Pi agents using semantic Accessibility API targets and optional screenshots for GUI automation including clicks, typing, scrolling, and window management.
Captures screenshots and videos of running macOS app windows via osascript, screencapture, and ffmpeg for UI verification, mockups, and visual comparisons in agent workflows.
Share bugs, ideas, or general feedback.
Peekaboo is a full macOS UI automation CLI: capture/inspect screens, target UI
elements, drive input, and manage apps/windows/menus. Commands share a snapshot
cache and support --json/-j for scripting. Run peekaboo or
peekaboo <cmd> --help for flags; peekaboo --version prints build metadata.
Tip: run via polter peekaboo to ensure fresh builds.
Core
bridge: inspect Peekaboo Bridge host connectivitycapture: live capture or video ingest + frame extractionclean: prune snapshot cache and temp filesconfig: init/show/edit/validate, providers, models, credentialsimage: capture screenshots (screen/window/menu bar regions)learn: print the full agent guide + tool cataloglist: apps, windows, screens, menubar, permissionspermissions: check Screen Recording/Accessibility statusrun: execute .peekaboo.json scriptssleep: pause execution for a durationtools: list available tools with filtering/display optionsInteraction
click: target by ID/query/coords with smart waitsdrag: drag & drop across elements/coords/Dockhotkey: modifier combos like cmd,shift,tmove: cursor positioning with optional smoothingpaste: set clipboard -> paste -> restorepress: special-key sequences with repeatsscroll: directional scrolling (targeted + smooth)swipe: gesture-style drags between targetstype: text + control keys (--clear, delays)System
app: launch/quit/relaunch/hide/unhide/switch/list appsclipboard: read/write clipboard (text/images/files)dialog: click/input/file/dismiss/list system dialogsdock: launch/right-click/hide/show/list Dock itemsmenu: click/list application menus + menu extrasmenubar: list/click status bar itemsopen: enhanced open with app targeting + JSON payloadsspace: list/switch/move-window (Spaces)visualizer: exercise Peekaboo visual feedback animationswindow: close/minimize/maximize/move/resize/focus/listVision
see: annotated UI maps, snapshot IDs, optional analysisGlobal runtime flags
--json/-j, --verbose/-v, --log-level <level>--no-remote, --bridge-socket <path>peekaboo permissions
peekaboo list apps --json
peekaboo see --annotate --path /tmp/peekaboo-see.png
peekaboo click --on B1
peekaboo type "Hello" --return
--app, --pid, --window-title, --window-id, --window-index--snapshot (ID from see; defaults to latest)--on/--id (element ID), --coords x,y--no-auto-focus, --space-switch, --bring-to-current-space,
--focus-timeout-seconds, --focus-retry-count--path, --format png|jpg, --retina--mode screen|window|frontmost, --screen-index,
--window-title, --window-id--analyze "prompt", --annotate--capture-engine auto|classic|cg|modern|sckit--duration (drag/swipe), --steps, --delay (type/scroll/press)--profile human|linear, --wpm (typing)--direction up|down|left|right, --amount <ticks>, --smoothpeekaboo see --app Safari --window-title "Login" --annotate --path /tmp/see.png
peekaboo click --on B3 --app Safari
peekaboo type "user@example.com" --app Safari
peekaboo press tab --count 1 --app Safari
peekaboo type "supersecret" --app Safari --return
peekaboo list windows --app "Visual Studio Code" --json
peekaboo click --window-id 12345 --coords 120,160
peekaboo type "Hello from Peekaboo" --window-id 12345
peekaboo image --mode screen --screen-index 0 --retina --path /tmp/screen.png
peekaboo image --app Safari --window-title "Dashboard" --analyze "Summarize KPIs"
peekaboo see --mode screen --screen-index 0 --analyze "Summarize the dashboard"
peekaboo capture live --mode region --region 100,100,800,600 --duration 30 \
--active-fps 8 --idle-fps 2 --highlight-changes --path /tmp/capture
peekaboo app launch "Safari" --open https://example.com
peekaboo window focus --app Safari --window-title "Example"
peekaboo window set-bounds --app Safari --x 50 --y 50 --width 1200 --height 800
peekaboo app quit --app Safari
peekaboo menu click --app Safari --item "New Window"
peekaboo menu click --app TextEdit --path "Format > Font > Show Fonts"
peekaboo menu click-extra --title "WiFi"
peekaboo dock launch Safari
peekaboo menubar list --json
peekaboo move 500,300 --smooth
peekaboo drag --from B1 --to T2
peekaboo swipe --from-coords 100,500 --to-coords 100,200 --duration 800
peekaboo scroll --direction down --amount 6 --smooth
peekaboo hotkey --keys "cmd,shift,t"
peekaboo press escape
peekaboo type "Line 1\nLine 2" --delay 10
Notes
peekaboo see --annotate to identify targets before clicking.Before full-screen captures, check open windows with peekaboo list windows and inform the user if password managers or auth dialogs have visible windows -- full-screen captures may expose credentials to the screenshot file or a remote analysis endpoint.
Only read clipboard contents when the user explicitly requests it -- clipboard often holds passwords, tokens, or PII the user forgot was there.