agenticflow-llm-models
AgenticFlow LLM Models
Choose the right model for your agent based on capability needs, speed requirements, and reasoning depth. Use built-in credits for all models listed here.
When NOT to use this skill
Use agenticflow-built-in-credits skill instead for pricing, credits, or billing questions. Use agenticflow-mcp skill if they need external API keys (BYOK). This skill covers model selection and capabilities, not account management or credit usage.
Orient first
af bootstrap --json
Extract models[] — this is the source of truth for available models in your workspace. Never hardcode model lists; they change between CLI releases and backend deployments. The models below are recommendations, but the live models[] array is the final authority.
Discover & health
af changelog --json # What's new in the CLI — model additions/removals
af context --json # AI agent orientation, env vars, invocation guidance
af bootstrap --strict --json # Health check — exits non-zero if degraded
af bootstrap returns an invocation block and data_fresh boolean. If data_fresh: false, the backend is degraded — don't rely on stale model data from a degraded response. af bootstrap --strict exits non-zero when the backend is unhealthy, so CI/automation can abort before choosing models against a degraded workspace.
Verification rule: Before recommending any model, check
models[]fromaf bootstrap --json. If a model is absent from that list, warn the user and fall back to a confirmed model.
Author's Top 3 Recommendations
These are the author's personal picks based on reliability, reasoning quality, and speed:
| Rank | Model | Role | Why |
|---|---|---|---|
| 1st | deepseek-v4-flash |
Primary default | Best all-rounder — strong reasoning, reliable tool use, good speed. Replaces the older GLM 4.7 Flash default. |
| 2nd | gemini-2.5-flash-lite |
Fallback / media | Fastest option with media support. Use when speed matters more than reasoning depth. |
| 3rd | qwen-3.5-flash |
Deep verification | Deepest thinker — use when reasoning depth and verification matter most. |
Note:
deepseek-v4-flashandgemini-2.5-flash-litemay not appear in every CLI release's hardcodedKNOWN_MODELSlist if they ship between releases. Always verify againstaf bootstrap --json > models[]. If absent, the backend may still serve them — proceed with a dry-run to confirm.
Upstream Canonical Models
The CLI's built-in validator recognizes these models as of v1.10.5. They are always safe to use:
agenticflow/deepseek-v3.2
agenticflow/gemma-4-31b-it
agenticflow/gemma-4-26b-a4b-it
agenticflow/gemini-2.0-flash
agenticflow/gpt-4o-mini
agenticflow/qwen-3.5-flash
Model characteristics
| Model | Speed | Reasoning | Best for |
|---|---|---|---|
agenticflow/deepseek-v3.2 |
Medium | Strong | Reliable reasoning, good tool use |
agenticflow/gemma-4-31b-it |
Fast | Light | General purpose, high-volume |
agenticflow/gemma-4-26b-a4b-it |
Fast | Light | General purpose, slightly smaller context |
agenticflow/gemini-2.0-flash |
Fast | Light | Deprecated — still served but being replaced by 2.5 Flash Lite |
agenticflow/gpt-4o-mini |
Fast | Light | Default for blueprints (v1.8.1+) — follows system prompts reliably, good for tool calling |
agenticflow/qwen-3.5-flash |
Medium-Deep | Very strong | Deep verification, complex reasoning |
Default model change (v1.8.1+)
- Before v1.8.1: Default was
agenticflow/gemini-2.0-flash - After v1.8.1: Default is
agenticflow/gpt-4o-mini - Reason for change: Gemini 2.0 Flash refuses
web_searchon "latest X" prompts citing knowledge cutoff, even with explicit system prompt rules. GPT-4o-mini follows system prompts and calls tools reliably.
Model selection guide
Need a default?
# Author's primary recommendation — deepseek-v4-flash
af agent create --body '{"name":"My Agent","model":"deepseek-v4-flash","project_id":"<id>"}' --json
# Or use the upstream blueprint default — gpt-4o-mini
af agent create --body '{"name":"My Agent","model":"agenticflow/gpt-4o-mini","project_id":"<id>"}' --json
# For maximum speed with media support
af agent create --body '{"name":"My Agent","model":"gemini-2.5-flash-lite","project_id":"<id>"}' --json
Need reasoning?
- Deep verification:
qwen-3.5-flash— deepest thinker - Reliable tool use + reasoning:
deepseek-v3.2ordeepseek-v4-flash
Need speed only?
- Fastest correct:
gpt-4o-miniorgemma-4-31b-it
Workforce model selection
All agents in a workforce inherit the model:
# Default (v1.8.1+)
af workforce init --blueprint dev-shop --model agenticflow/gpt-4o-mini --name "My Team" --json
# Or use author's primary pick
af workforce init --blueprint dev-shop --model deepseek-v4-flash --name "My Team" --json
Reasoning configuration
Expose reasoning tokens (where supported):
af schema agent --field model_user_config --json
Check the full agent schema (for all fields):
af schema agent --json
For models with hidden reasoning (e.g. some Gemini variants), configure via thinking_config to expose reasoning tokens.
Verify model availability
# Always dry-run first
af agent create --body @agent.json --dry-run --json
The CLI validates the model string at create time. Typos fail fast with an actionable hint listing known models. If you pass a vendor/model-name-shaped string not in the known list, the CLI warns but allows it to proceed — so brand-new models work before the CLI is updated.
Avoid these models
Based on upstream changelog and known issues:
| Model | Issue |
|---|---|
agenticflow/gemini-2.0-flash |
Deprecated, replaced by 2.5 Flash Lite. Still served but default changed to gpt-4o-mini. |
agenticflow/gemini-2.0-flash-lite |
Deprecated, replaced by 2.5 Flash Lite |
In general, if a model is absent from
af bootstrap --json > models[], it may have been deprecated or renamed. Check thehintfield on 400/422 errors for alternatives.
Fallback model guide
If your preferred model is unavailable:
- Run
af bootstrap --jsonand checkmodels[] - Pick the closest match from the confirmed list above
- Use
--dry-runon create to validate before deploying - For reasoning-heavy tasks: fall back to
qwen-3.5-flashordeepseek-v3.2 - For speed-first tasks: fall back to
gpt-4o-miniorgemma-4-31b-it
Cleanup
Test agents consume credits. Delete when done:
af agent delete --agent-id <id> --json
On errors
- 400 / Invalid model → Check
models[]from bootstrap; model may have been renamed or is not yet in the CLI's hardcoded list. Try--dry-runfirst. - 402 / Payment Required → Model requires credits; see
agenticflow-built-in-creditsskill - 422 / Model not available → Model temporarily unavailable; the
hintsuggests alternatives - finish_reason=length → Increase
max_tokensinmodel_user_config
When hint is non-empty, follow it before retrying.
More from antongulin/agenticflow-ai-skills
agenticflow-mcp
Attach external tool providers (Google Docs, Google Sheets, Slack, Notion, GitHub, Apify, etc.) to an AgenticFlow agent via MCP clients. Use when the user wants their agent to read or write external data, call third-party APIs, save outputs to a doc/sheet, or use any tool beyond the model's built-in knowledge. Covers `af mcp-clients list --name-contains`, `af mcp-clients inspect --id` (classify pattern before attach), and the Pipedream vs Composio write-capability distinction — critical for parametric writes. Route traffic through the `af` CLI; the standalone `agenticflow-mcp` server repo lags the CLI and is not recommended.
4agenticflow-agent
Create, run, and iterate on a single AgenticFlow AI agent — one chat endpoint, one assistant, one persona. Use when the user wants a customer-facing bot, a support assistant, a single task agent, or a prompt experiment. Choose this skill over agenticflow-workforce when there's no orchestration between roles (no handoff, no coordinator → workers). Covers `af agent create/update/run/delete`, the `--patch` partial-update pattern for iteration, `af schema agent --field <name>` for nested payload shapes (including suggested_messages, mcp_clients, response_format), the `model_user_config` / `code_execution_tool_config` settings, and safe iteration loops.
4agenticflow-workforce
Deploy and operate a multi-agent AgenticFlow workforce — a DAG of agents that hand off to each other (trigger → coordinator → worker agents → output). Use when the user asks for a team, pipeline, or multi-agent system: research-then-write, triage-then-specialist, dev shop, marketing agency, sales team, content studio, support center, Amazon seller team. Choose this skill over agenticflow-agent when the ask mentions 'team', 'workforce', 'pipeline', 'multiple agents', 'delegation', 'handoff', or names a built-in blueprint. Provides the `af workforce *` command surface, blueprint decisions, graph wiring, MCP attach recipes, and public URL publishing.
4agenticflow-built-in-credits
Use AgenticFlow's built-in features and account credits first — before adding external API keys (BYOK). Use this skill whenever the user asks about image generation without API keys, wants to use their existing credits, asks about built-in vs BYOK, or mentions agenticflow_generate_image, web_search, web_retrieval, or credit-efficient workflows. BYOK is only for extension when unsatisfied or explicitly requested.
4