pdf-to-structured

Pass

Audited by Gen Agent Trust Hub on Mar 5, 2026

Risk Level: SAFE
Full Analysis
  • [SAFE]: All components of the skill, including the scripts and instructions, are consistent with the stated goal of document data extraction and show no signs of malicious intent or obfuscation.- [EXTERNAL_DOWNLOADS]: The skill recommends installing well-known and reputable open-source libraries (pandas, pdfplumber, pytesseract, pypdf) and the Tesseract OCR engine from official sources.- [COMMAND_EXECUTION]: The skill utilizes the local Tesseract binary to perform OCR on scanned documents, which is a standard and documented requirement for this functionality.- [DATA_EXFILTRATION]: No network activity or unauthorized data transmission patterns were detected; the skill is designed for local file processing.- [PROMPT_INJECTION]: No direct prompt injection or safety bypass instructions were found. The skill has an indirect prompt injection surface because it processes untrusted PDF data. Ingestion points: text and table extraction in SKILL.md; Boundary markers: not present; Capability inventory: filesystem read/write and local binary execution (Tesseract); Sanitization: basic data cleaning via pandas is present, though no specific LLM instruction sanitization is implemented.
Audit Metadata
Risk Level
SAFE
Analyzed
Mar 5, 2026, 04:28 AM