anygen-slide
AI Slide Generator - AnyGen
Create professional slide presentations using AnyGen OpenAPI.
When to Use
- User needs to create PPT/Slides/Presentations
- User has files to upload as reference material for slide generation
Security & Permissions
What this skill does:
- Sends task prompts and parameters to the AnyGen API at
www.anygen.io - Uploads user-provided reference files to
www.anygen.ioafter obtaining user consent - Downloads generated PPTX files to local disk or
~/.openclaw/workspace/ - Spawns a background poll process (up to 20 min) to monitor progress and auto-download results
- Reads/writes API key config at
~/.config/anygen/config.json
What this skill does NOT do:
- Does not upload files without informing the user and obtaining consent
- Does not send your API key to any endpoint other than
www.anygen.io - Does not modify system configuration beyond
~/.config/anygen/config.json
Bundled scripts: scripts/anygen.py (Python — uses requests)
Review the bundled scripts before first use to verify behavior.
Prerequisites
- Python3 and
requests:pip3 install requests - AnyGen API Key (
sk-xxx) — Get one → Setting → Integration - Configure once:
python3 scripts/anygen.py config set api_key "sk-xxx"
All
scripts/paths below are relative to this skill's installation directory.
CRITICAL: Background Poll — NEVER Block the Conversation
After creating a task, you MUST start a background poll process to auto-download the result, then continue the conversation normally. NEVER call poll in the foreground — it blocks for 3-20 minutes.
Rules:
- Use
createto start the task → gettask_idandtask_url. - Give the user the
task_urlfor immediate online preview. - Launch
pollas a background process to auto-download the file when ready (see Phase 4). - Continue the conversation normally — do NOT wait.
- After the conversation naturally ends or the user asks about the result, check if the background poll has completed and deliver the file.
Communication Style
When interacting with the user, communicate naturally and professionally:
- You may refer to AnyGen as the service powering the slide generation when relevant.
- Present questions and suggestions in a natural, conversational tone — avoid exposing raw API responses or technical implementation details.
- Summarize
prepareresponses in your own words rather than echoing them verbatim. - Stick to the questions
preparereturned — do not add unrelated questions.
Examples
Less ideal (overly technical):
- "The prepare API returned the following JSON response with status=collecting..."
- "The system's internal tool asks: who is your target audience?"
Better (natural and professional):
- "Who is the target audience for this presentation?"
- "Based on what you've shared, here is the plan for your presentation: [summary]. Should I go ahead, or would you like to adjust anything?"
Slide Workflow (MUST Follow)
For slides, you MUST go through all 4 phases. A good presentation needs audience, purpose, scene type, desired outcome, and content details. Users rarely provide all of these upfront.
Phase 1: Understand Requirements
If the user provides files, you MUST handle them yourself before calling prepare:
- Read the file content yourself using your own file reading capabilities. Extract key information (topic, data, structure) that is relevant to creating the presentation.
- Check if the file was already uploaded in this conversation. If you already have a
file_tokenfor the same file, reuse it — do NOT upload again. - Inform the user and get consent before uploading. Tell them the file will be uploaded to AnyGen's server for processing. For example: "I'll upload your file to AnyGen so it can be used as reference material for the slides. This may take a moment..."
- Upload the file to get a
file_tokenfor later use in task creation. - Include the extracted content as part of your
--messagetext when callingprepare, so that the requirement analysis has full context.
The prepare API does NOT read files internally. You are responsible for providing all relevant file content as text in the conversation.
# Step 1: Tell the user you are uploading, then upload the file
python3 scripts/anygen.py upload --file ./report.pdf
# Output: File Token: tk_abc123
# Step 2: Call prepare with extracted file content included in the message
python3 scripts/anygen.py prepare \
--message "I need a slide deck for our Q4 board review. Here is the key content from the uploaded report: [your extracted summary/content here]" \
--file-token tk_abc123 \
--save ./conversation.json
The --file-token parameter in prepare attaches the file reference to the conversation, but it does NOT extract or read the file content. You must include the relevant content as text in --message.
Present the questions from reply naturally (see Communication Style above). Then continue the conversation with the user's answers:
python3 scripts/anygen.py prepare \
--input ./conversation.json \
--message "The audience is C-level execs, goal is to approve next quarter's budget" \
--save ./conversation.json
Repeat until status="ready" with suggested_task_params.
Special cases:
- If the user provides very complete requirements and
status="ready"on the first call, proceed directly to Phase 2. - If the user says "just create it, don't ask questions", skip prepare and go to Phase 3 with
createdirectly. - For template/style reference files (e.g., "use this as a template"), do NOT extract content. Just upload and pass the
file_token.
Phase 2: Confirm with User (MANDATORY)
When status="ready", prepare returns suggested_task_params containing a detailed prompt. You MUST present this to the user for confirmation before creating the task.
How to present:
- Summarize the key aspects of the suggested plan in natural language (audience, structure, content highlights, style).
- Ask the user to confirm or modify. For example: "Here is what I plan to create: [summary]. Should I go ahead, or would you like to change anything?"
- NEVER auto-create the task without the user's explicit approval.
When the user requests adjustments:
- Call
prepareagain with the user's modification as a new message, loading the existing conversation history:
python3 scripts/anygen.py prepare \
--input ./conversation.json \
--message "<the user's modification request>" \
--save ./conversation.json
preparewill return an updated suggestion that incorporates the user's changes.- Present the updated suggestion to the user again for confirmation (repeat from step 1 above).
- Repeat this confirm-adjust loop until the user explicitly approves. Do NOT skip confirmation after an adjustment.
Phase 3: Create Task
Once the user confirms:
python3 scripts/anygen.py create \
--operation slide \
--prompt "<prompt from suggested_task_params, with any user modifications>" \
--file-token tk_abc123
# Output: Task ID: task_xxx, Task URL: https://...
Immediately tell the user:
- Slides are being generated (takes a few minutes).
- Give them the Task URL so they can preview progress online right now.
- Tell them you will deliver the downloaded file once it's ready.
Phase 4: Monitor and Deliver Result
Requires
sessions_spawn. If sub-agent is not available, skip to the Fallback section below.
With Sub-Agent (preferred)
Spawn a sub-agent to monitor the task in the background. Use sessions_spawn with the following instructions.
Information to pass to the sub-agent (it has NO context from this conversation):
task_id— from Phase 3 outputtask_url— the Task URL returned bycreatescript_path— absolute path toscripts/anygen.py
Sub-agent instructions (pass this as the task prompt):
You are a background monitor for an AnyGen slide generation task.
Task ID: {task_id}
Task URL: {task_url}
Script: {script_path}
Your job:
1. Poll the task status every 60 seconds using:
python3 {script_path} status --task-id {task_id} --json
2. When status="completed", download the file to ~/.openclaw/workspace/:
python3 {script_path} download --task-id {task_id} --output ~/.openclaw/workspace/ --media
3. The script will output a MEDIA: line (with non-ASCII filenames automatically handled). Deliver it to the user:
a. Send a SHORT message containing ONLY the MEDIA: line exactly as output by the script.
This message MUST be sent as a separate message by itself — do NOT combine it with other text.
The MEDIA: protocol will deliver the actual file to the user in their IM client.
b. Then send a second message to the user with:
- The Task URL ({task_url}) for online viewing/editing
- If there is a [RESULT] Thumbnail URL: line, download the thumbnail image to a local file and show it to the user as a preview
4. If status="failed", tell the user the generation failed and provide the Task URL for reference.
5. Timeout after 20 minutes — if still not completed, tell the user the task is taking longer than expected and provide the Task URL for them to check manually.
Then continue the conversation normally. Do NOT wait for the sub-agent.
Fallback (no sub-agent)
If sessions_spawn is not available, tell the user something like: "I've started the generation for you. Unfortunately I'm not able to track the progress automatically in this environment — please check the link above in a few minutes to view and download your slides."
Command Reference
create
python3 scripts/anygen.py create --operation slide --prompt "..." [options]
| Parameter | Short | Description |
|---|---|---|
| --operation | -o | Must be slide |
| --prompt | -p | Content description |
| --file-token | File token from upload (repeatable) | |
| --language | -l | Language (zh-CN / en-US) |
| --slide-count | -c | Number of slides |
| --template | -t | Slide template |
| --ratio | -r | Slide ratio (16:9 / 4:3) |
| --export-format | -f | Export format: pptx (default) / image |
| --style | -s | Style preference |
upload
python3 scripts/anygen.py upload --file ./document.pdf
Returns a file_token. Max file size: 50MB. Tokens are persistent and reusable.
prepare
python3 scripts/anygen.py prepare --message "..." [--file-token tk_xxx] [--input conv.json] [--save conv.json]
| Parameter | Description |
|---|---|
| --message, -m | User message text |
| --file | File path to auto-upload and attach (repeatable) |
| --file-token | File token from prior upload (repeatable) |
| --input | Load conversation from JSON file |
| --save | Save conversation state to JSON file |
| --stdin | Read message from stdin |
poll
Blocks until task completes, auto-downloads the file, and prints [RESULT] lines.
python3 scripts/anygen.py poll --task-id task_xxx --output ./output/
| Parameter | Description |
|---|---|
| --task-id | Task ID from create |
| --output | Output directory for auto-download (default: current directory) |
download (manual, if needed)
python3 scripts/anygen.py download --task-id task_xxx --output ./output/
Error Handling
| Error | Solution |
|---|---|
| invalid API key | Check API Key format (sk-xxx) |
| operation not allowed | Contact admin for permissions |
| prompt is required | Add --prompt parameter |
| file size exceeds 50MB limit | Reduce file size |
Notes
- Max task execution time: 20 minutes
- Download link valid for 24 hours
- Poll interval: 3 seconds
Files
slide-generator/
├── skill.md # This document
└── scripts/
└── anygen.py # CLI client