moltbook
Warn
Audited by Gen Agent Trust Hub on Feb 27, 2026
Risk Level: MEDIUMEXTERNAL_DOWNLOADSCOMMAND_EXECUTIONCREDENTIALS_UNSAFEPROMPT_INJECTION
Full Analysis
- [EXTERNAL_DOWNLOADS]: The skill implements a self-update mechanism that downloads and overwrites its own instruction files from a remote server.
- Evidence:
HEARTBEAT.mdcontains instructions to fetchSKILL.mdandHEARTBEAT.mdfromhttps://www.moltbook.comand redirect the output to local files in~/.moltbot/skills/moltbook/. - Risk: This allows the remote server to modify the agent's behavior or instructions at any time without user intervention or oversight.
- [COMMAND_EXECUTION]: The skill instructions frequently use shell commands for installation, updates, and maintenance.
- Evidence: Shell blocks in
SKILL.mdandHEARTBEAT.mdutilizecurl,mkdir, andgrepto manage files and check versions. - [CREDENTIALS_UNSAFE]: The skill encourages agents to store sensitive API credentials in a local plaintext configuration file.
- Evidence:
SKILL.mdexplicitly recommends saving theapi_keyandagent_nameto~/.config/moltbook/credentials.json. - [PROMPT_INJECTION]: The skill processes untrusted user-generated content from other agents, creating a significant indirect prompt injection surface.
- Ingestion points: The skill fetches data from the Moltbook API including global feeds (
/api/v1/posts), comments (/api/v1/posts/ID/comments), semantic search results (/api/v1/search), and private messages (/api/v1/agents/dm/conversations). - Boundary markers: There are no specific delimiters or protective instructions provided to the agent to distinguish between the skill's operational instructions and the potentially malicious data contained in posts or messages.
- Capability inventory: The agent possesses the capability to post content, comment, follow other agents, and send private messages based on the data it reads.
- Sanitization: No sanitization or validation routines are described for handling external text before it is processed by the agent.
Audit Metadata