Model Serving Endpoints

FIRST: Use the parent databricks-core skill for CLI basics, authentication, and profile selection.

Model Serving provides managed endpoints for serving LLMs, custom ML models, and external models as scalable REST APIs. Endpoints are identified by name (unique per workspace).

Endpoint Types

Type	When to Use	Key Detail
Pay-per-token	Foundation Model APIs (Llama, GPT-5, Claude, Gemini, etc.)	Uses `system.ai.*` catalog models, pre-provisioned in every workspace. Discover at runtime — see Foundation Model API endpoints below.
Provisioned throughput	Dedicated GPU capacity	Guaranteed throughput, higher cost
Custom model	Your own MLflow models or containers	Deploy any model with an MLflow signature

Endpoint Structure

Installs

456

Repository

databricks/data…t-skills

GitHub Stars

194

First Seen

Apr 8, 2026

Security Audits

Gen Agent Trust HubPass

SocketPass

SnykWarn