skills/datadrivenconstruction/ddc_skills_for_ai_agents_in_construction/pdf-to-structured/Gen Agent Trust Hub
pdf-to-structured
Pass
Audited by Gen Agent Trust Hub on Mar 5, 2026
Risk Level: SAFE
Full Analysis
- [SAFE]: All components of the skill, including the scripts and instructions, are consistent with the stated goal of document data extraction and show no signs of malicious intent or obfuscation.- [EXTERNAL_DOWNLOADS]: The skill recommends installing well-known and reputable open-source libraries (pandas, pdfplumber, pytesseract, pypdf) and the Tesseract OCR engine from official sources.- [COMMAND_EXECUTION]: The skill utilizes the local Tesseract binary to perform OCR on scanned documents, which is a standard and documented requirement for this functionality.- [DATA_EXFILTRATION]: No network activity or unauthorized data transmission patterns were detected; the skill is designed for local file processing.- [PROMPT_INJECTION]: No direct prompt injection or safety bypass instructions were found. The skill has an indirect prompt injection surface because it processes untrusted PDF data. Ingestion points: text and table extraction in SKILL.md; Boundary markers: not present; Capability inventory: filesystem read/write and local binary execution (Tesseract); Sanitization: basic data cleaning via pandas is present, though no specific LLM instruction sanitization is implemented.
Audit Metadata