novita-docs
Novita AI Platform Reference
Complete documentation for the Novita AI platform - an AI & Agent Cloud for developers.
When to Use This Skill
Load this skill when the user asks about:
- Novita AI products: Model APIs, GPU instances, serverless GPUs, agent sandbox
- Model information: "What models does Novita support?", model pricing, capabilities
- API guidance: How to use APIs, authentication, endpoints, parameters
- Pricing and billing: Cost estimates, billing queries, payment methods
- Integrations: LangChain, LlamaIndex, Cursor, and 30+ other tools
- Design system: Colors, typography, buttons, navigation, icons, logo
- Getting started: Quickstart guides, FAQs, setup instructions
- Troubleshooting: Error codes, common issues, support
Quick Reference
| Resource | URL |
|---|---|
| Website | https://novita.ai |
| Model Catalog | https://novita.ai/models (200+ models) |
| Documentation | https://novita.ai/docs |
| Pricing | https://novita.ai/pricing |
| Console | https://novita.ai/console |
| API Base URL | https://api.novita.ai/openai |
| Support | support@novita.ai |
| Discord | https://discord.gg/YyPRAzwp7P |
🔍 Quick: Query Available Models
Most common question: "What models does Novita support?"
Query Methods
1. Web Catalog (human-friendly):
- Browse 200+ models at https://novita.ai/models
- Filter by type: LLM, image, video, audio, embeddings
2. API Endpoint (automation):
curl https://api.novita.ai/openai/v1/models \
-H "Authorization: Bearer <YOUR_API_KEY>"
Returns: Model ID, pricing per million tokens, context size, description
Model Categories
- LLMs: 100+ models (Llama, Qwen, DeepSeek, Mistral, etc.)
- Image Generation: Flux, Stable Diffusion, SDXL
- Video: Wan 2.6, CogVideoX
- Audio: TTS, voice cloning
- Embeddings: Text embedding models
Quick Links
| Task | Reference |
|---|---|
| List all models via API | list-models.md |
| Get specific model info | retrieve-model.md |
| Recommended LLMs | llm/recommended.md |
| Image model APIs | api-reference/image-apis/ |
| Model API guides | model-apis/ |
Pro Tip: Always call /v1/models API first for the latest model list and current pricing.
How to Use This Documentation
1. Start Here
- New users: See getting-started/ - company overview, quickstart, FAQ
- Model queries: Check the "Quick: Query Available Models" section above
- API help: Jump to specific API reference sections below
2. Find Documentation by Category
Product Guides (usage and features):
- getting-started/ - Overview, quickstart, product pages
- llm/ - LLM API guides (16 files)
- model-apis/ - Model API guides (11 files)
- gpu-instance/ - GPU instances (14 files)
- serverless-gpus/ - Serverless GPUs (6 files)
- sandbox/ - Agent Sandbox (43 files)
- integrations/ - 30+ integration guides
API Reference (endpoints and parameters):
- api-reference/basic/ - Auth, billing (6 files)
- api-reference/llm/ - LLM endpoints (16 files)
- api-reference/image-apis/ - Image/video APIs (48 files)
- api-reference/gpu-instance/ - GPU APIs (2 files)
Support:
Design System:
- design-system/ - UI/UX specs (7 files)
3. File Naming Convention
Files are organized by category:
references/
├── getting-started/ # Product overviews and quickstart
├── llm/ # LLM feature guides
├── model-apis/ # Model API guides
├── gpu-instance/ # GPU instance guides
├── serverless-gpus/ # Serverless GPU guides
├── sandbox/ # Agent Sandbox docs (with subdirs)
├── integrations/ # Third-party tool integrations
├── api-reference/ # API endpoint documentation
│ ├── basic/ # Auth, billing APIs
│ ├── llm/ # LLM API endpoints
│ ├── image-apis/ # Image/video API endpoints
│ └── gpu-instance/ # GPU instance APIs
├── billing/ # Billing and payment
├── team/ # Team management
└── design-system/ # UI/UX design specs
📚 Documentation Index
Core Product Documentation
Getting Started (8 files)
- company-overview.md - Company overview, products, testimonials
- gpus.md - GPU Cloud product overview
- sandbox.md - Agent Sandbox product overview
- gpu-baremetal.md - Bare metal GPU servers
- introduction.md - Platform introduction
- quickstart.md - Quick start guide
- faq.md - Frequently asked questions
- error-handling.md - Error handling
LLM Guides (17 files) Core: api · batch-api · function-calling · vision · reasoning · structured-outputs · prompt-cache · rate-limits · monitoring · observability-metrics · dedicated-endpoint · playgrounds · recommended
Model APIs (11 files) overview · sdks · dedicated-endpoints · training-guidance · custom-model · sampler · vae · clip-skip · rate-limits · v2-to-v3-migration · configure-custom-s3-bucket
GPU Instance (14 files) overview · overview-guide · choose-a-gpu · pricing · quickstart-* (5 files) · jupyterlab · save-image · upgrade-instance · edit-instance · image-prewarm
Serverless GPUs (6 files) overview · pricing · quickstart-* (4 files)
Agent Sandbox (43 files organized in subdirectories) Core: overview · pricing · sdk-and-cli
Quickstart: your-first-sandbox · introduction · installation · quick-start · frameworks · advanced
CLI: overview · auth · spawn · list · shutdown
Commands: overview · background · streaming
Filesystem: overview · read-write · upload · download · watch
Lifecycle: overview · clone · list · idle-timeout
Template: overview · customize-cpu-ram · start-cmd · ready-cmd · version-management
More: console · connect · internet-access · environment-variables · metadata · metrics · mount-cloudstorage
Integrations (30 tools) langchain · llamaindex · huggingface · cursor · dify · browseruse · skyvern · gradio · anythingllm · axolotl · chatbox · claude-code · codecompanion · continue · deepsearcher · docsgpt · helicone · kohya-ss-gui · langflow · langfuse · litellm · lobechat · lollms-webui · openai-agents-sdk · owl · pageassist · portkey · verba
API Reference
Basic APIs (6 files) authentication · error-code · get-user-balance · query-*-billing (3 files)
LLM APIs (16 files) list-models · retrieve-model · create-chat-completion · create-completion · create-embeddings · create-rerank · create-batch · cancel-batch · list-batches · retrieve-batch · list-files · upload-batch-input-file · query-file · retrieve-file-content · delete-file
Image/Video APIs (54 files) introduction
Core APIs: txt2img · img2img · inpainting · upscale · image-upscaler · remove-background · image-to-prompt · eraser · remove-text · replace-background · merge-face · reimagine · video-merge-face · task-result
Flux Models: flux-1-schnell · flux-1-kontext-dev · flux-1-kontext-max · flux-1-kontext-pro · flux-2-dev · flux-2-flex · flux-2-pro
Other Models: seedream-* (3) · glm-image · hunyuan-image-3 · qwen-* (2) · z-image-turbo · z-image-turbo-lora
Training: create-style-training · create-subject-training · list-training-task · get-training-images-url
Other: glm-tts-voice-clone · webhook
GPU Instance APIs (2 files) create-instance · list-clusters
Support & Design System
Billing (4 files) budgets · auto-top-up · payment-methods · low-balance-alert
Team (1 file) team-management
Design System (7 files) overview · typography · colors · buttons · navigation · icons · logo
Common Tasks
Start with Model APIs
- Get API key from https://novita.ai/console
- Set base URL to
https://api.novita.ai/openai - Call
/v1/modelsto list available models - Use OpenAI-compatible APIs for chat completions
- See llm/api.md for details
Launch GPU Instance
- Go to https://novita.ai/gpus-console/explore
- Choose GPU or template
- Configure and launch
- Connect via SSH or web terminal
- See gpu-instance/ for details
Create Serverless Endpoint
- Prepare container image
- Go to https://novita.ai/gpus-console/serverless
- Create endpoint with scale policy
- Test and deploy
- See serverless-gpus/ for details
Start Agent Sandbox
- Install SDK or CLI
- Create sandbox with desired resources
- Run commands or upload code
- Pause/resume as needed
- See sandbox/ for details
Integrate with Framework
- Get Novita API key
- Set base URL to
https://api.novita.ai/openai - Update model names as needed
- See integrations/ for specific guides
Support & Resources
- Documentation: https://novita.ai/docs
- Email: support@novita.ai
- Discord: https://discord.gg/YyPRAzwp7P
- FAQ: https://novita.ai/docs/guides/faq
- Status Page: https://status.novita.ai/
More from jaxzhang-svg/novita-skills
twitterapi-cli
Twitter data retrieval CLI tool. Use when user requests Twitter data - user profiles, follower counts, tweet searches, user timelines, follower/following lists, or Twitter user metrics. Supports field filtering for structured output. For detailed API endpoint documentation, see the original API skill at https://docs.twitterapi.io/skill.md
18novita-social-monitor
Social media intelligence monitoring for Novita. Use when systematically browsing X/Twitter accounts from personal following list (@Jax_Zhang_4R) to gather AI industry intelligence. Performs sequential account review with per-account record keeping (including original tweet links), followed by comprehensive summary. Always use @skills/twitterapi-cli for data retrieval.
10get-started
Get started with Novita Skills. Use when user wants to know what skills are available, needs help installing team skills, wants to contribute new skills, asks about team capabilities, or needs recommendations for which skills to install. Provides an overview of all team skills, contribution guidelines, and helps users discover and install the right skills.
6