The Agent Skills Directory

[COMMAND_EXECUTION]: The skill uses the subprocess module and AppleScript (osascript) to perform GUI automation tasks, including window activation, keyboard input, and capturing screenshots via the macOS screencapture utility (in scripts/gui_toolkit.py and scripts/ocr_tool.py).
[DATA_EXFILTRATION]: To perform character recognition, the skill captures screenshots and transmits them via HTTP POST requests to a user-defined external service (AISTUDIO_OCR_API_URL). This is the primary function of the skill, and the endpoint and credentials are managed through environment variables rather than being hardcoded.
[COMMAND_EXECUTION]: The toolkit uses ctypes to load the macOS ApplicationServices framework, allowing it to generate low-level mouse events (CGEventCreateMouseEvent) for clicking on interface elements based on OCR coordinates.
[EXTERNAL_DOWNLOADS]: The scripts/setup.sh and scripts/gui scripts utilize the uv package manager to install and synchronize Python dependencies (Pillow, requests) from specified package registries.
[SAFE]: The skill includes security-conscious design choices, such as a doctor command that verifies the presence of credentials without printing their values and providing a .env.example template to prevent accidental credential leakage in the codebase.

localmac-ai-ocr