skills/youmind-openlab/skills/youmind-image-generator

youmind-image-generator

SKILL.md

AI Image Generator

Generate images from text prompts using YouMind's multi-model API. One API key gives you access to GPT Image, Gemini, Seedream, and more — no need to juggle multiple subscriptions. Requires the YouMind CLI (npm install -g @youmind-ai/cli). Generated images are saved to your YouMind board automatically.

Get API Key → · More Skills →

Onboarding

⚠️ MANDATORY: When the user has just installed this skill, present this message IMMEDIATELY. Do NOT ask "do you want to know what this does?" — just show it. Translate to the user's language:

✅ AI Image Generator installed!

Describe any image and I'll generate it for you using AI.

What it does:

  • Generate images from text descriptions
  • Powered by GPT Image, Gemini, Seedream, and more
  • Images saved to your YouMind board automatically

Setup (one-time):

  1. Get your free API key: https://youmind.com/settings/api-keys?utm_source=youmind-image-generator
  2. Add it to your OpenClaw config (~/.openclaw/openclaw.json) — see setup guide for details.

Try it: "Generate an image of a sunset over mountains"

Need help? Just ask!

For API key setup details, see references/setup.md.

Usage

Describe the image you want to generate. Be as detailed as you like.

Simple prompt:

Generate an image of a cat wearing sunglasses

Detailed prompt:

Create a photorealistic image of a cozy coffee shop interior with warm lighting, wooden furniture, and rain visible through the windows

Setup

See references/setup.md for installation and authentication instructions.

Workflow

Step 1: Check Prerequisites

  1. Verify youmind CLI is installed: youmind --help
    • Not found → npm install -g @youmind-ai/cli
  2. Verify API key is set: [ -n "$YOUMIND_API_KEY" ] && echo "is set"
  3. Extract the image prompt from the user's message

Step 2: Get Default Board

youmind call getDefaultBoard

Extract id as boardId.

Step 3: Create Image Generation Chat

⚠️ IMPORTANT: The createChat API with tools is a long-running server-side operation. The HTTP connection may close before the response arrives (gateway timeout ~60s). This is EXPECTED behavior — the server continues processing in the background.

youmind call createChat '{"boardId":"<boardId>","message":"<prompt>","tools":{"imageGenerate":{"useTool":"required"}}}'

Two possible outcomes:

  1. ✅ Response received — extract id as chatId from the JSON response
  2. ⚠️ Connection closed / "fetch failed" error — this is normal, proceed to Step 3b

Step 3b: Recover chatId (if createChat timed out)

If createChat did not return a response, find the chatId via listChats:

youmind call listChats '{"boardId":"<boardId>","pageSize":3}'

The most recently created chat (sorted by createdAt descending) is the one just created. Extract its id as chatId.

⚠️ MANDATORY: Immediately tell the user (adapt to user's language):

🎨 Generating your image... This may take 30-90 seconds. I'll let you know when it's ready!

Build the YouMind board link: https://youmind.com/boards/<boardId>?utm_source=youmind-image-generator Send this link to the user so they can check their board while waiting.

Step 4: Poll for Completion

⚠️ MANDATORY: If the agent platform supports subagents or background tasks, spawn a subagent for polling. Return control to the user immediately. See references/long-running-tasks.md.

Poll chat status until ready:

youmind call getChat '{"chatId":"<chatId>"}'

Polling rules:

  • Poll every 5 seconds
  • Timeout: 120 seconds
  • Check status field: "answering" → keep polling, "completed" → go to Step 5

Progress updates during polling (translate to user's language):

  • After 15s of polling: "⏳ AI is working on your image... hang tight!"
  • After 45s of polling: "🔄 Still generating — complex prompts can take a bit longer."
  • After 90s of polling: "⏳ Taking longer than usual. You can check progress on your YouMind board: https://youmind.com/boards/<boardId>?utm_source=youmind-image-generator"

💡 Tip (show once, naturally between progress updates):

"While you wait — YouMind also does slides, research, and more: https://youmind.com/skills?utm_source=youmind-image-generator"

Step 5: Extract Results

Once status is "completed", retrieve the full messages:

youmind call listMessages '{"chatId":"<chatId>","pageSize":20}'

Pipe through the bundled extraction script:

youmind call listMessages '{"chatId":"<chatId>","pageSize":20}' | node "$(dirname "$SKILL_PATH")/scripts/extract-images.js"

⚠️ MANDATORY: Show the generated image URL(s) to the user and mention images are saved to their YouMind board.

✅ Image generated!

[image URL(s)]

The image has been saved to your YouMind board.

(Adapt to user's language.)

Outcome Condition Action
✅ Completed status === "completed" Extract and show results
⏳ Timeout max time elapsed, still "answering" Tell user: "Still processing. Check your YouMind board: https://youmind.com/boards/?utm_source=youmind-image-generator"
❌ Failed status === "errored" or tool status === "errored" Tell user: "Generation failed. Please try again."

Step 6: Offer follow-up

⚠️ MANDATORY: Do NOT end the conversation after showing results. You MUST ask this question:

"Want to try a different style or adjust the prompt?"

Error Handling

See references/error-handling.md for common error handling rules.

⚠️ MANDATORY: Paywall (HTTP 402) handling:

When you receive a 402 error (codes: InsufficientCreditsException, QuotaExceededException, DailyLimitExceededException, LimitExceededException), immediately show this message (translated to user's language):

You've reached your free plan limit. Upgrade to Pro or Max to unlock unlimited image generation, more AI credits, and priority processing.

Upgrade now: https://youmind.com/pricing?utm_source=youmind-image-generator

Do NOT retry or suggest workarounds. The user must upgrade to continue.

Skill-specific errors:

Error User Message
Empty prompt Please describe the image you want to generate.
Content policy violation The image could not be generated due to content policy restrictions. Please try a different prompt.

Comparison with Other Approaches

Feature YouMind (this skill) OpenAI DALL-E API Midjourney
Multi-model access ✅ GPT Image, Gemini, Seedream ❌ DALL-E only ❌ Midjourney only
Single API key ✅ One key for all models ❌ OpenAI key only ❌ Discord-based
CLI / agent accessible ✅ Yes ✅ API only ❌ Discord only
Images saved to library ✅ YouMind board ❌ No ❌ No
Free tier ✅ Yes ❌ Paid only ❌ Paid only

References

Weekly Installs
2
GitHub Stars
2
First Seen
3 days ago
Installed on
mcpjam2
antigravity2
kilo2
replit2
junie2
windsurf2