MLflow Prompt Registry Patterns

When to Use

Use this skill when implementing agents that need:

Versioned prompts: Track prompt changes over time
A/B testing: Compare champion vs challenger prompts
Runtime updates: Change prompts without code deployment
Governance: Audit prompt changes and rollback capability

Core Principles

1. Single Source of Truth

All prompts stored in Unity Catalog table (agent_config)
Versioned in MLflow as artifacts
Never hardcoded in agent code
Runtime loading by alias (production, staging, champion, challenger)

2. Prompt Versioning

Each prompt update creates new MLflow run
Prompts registered as artifacts in experiment
Aliases point to specific versions
Easy rollback to previous versions

3. A/B Testing Support

Champion/challenger pattern
Load different prompts by alias
Track performance in evaluation
Promote winner to production

Unity Catalog Table Schema

CREATE TABLE {catalog}.{schema}.agent_config (
    config_key STRING NOT NULL,
    config_value STRING NOT NULL,
    config_type STRING NOT NULL,  -- 'prompt', 'setting', 'metadata'
    version INT NOT NULL,
    created_at TIMESTAMP NOT NULL,
    created_by STRING NOT NULL,
    description STRING,
    tags MAP<STRING, STRING>,
    CONSTRAINT pk_agent_config PRIMARY KEY (config_key, version)
)
CLUSTER BY AUTO
COMMENT 'Agent configuration storage including prompts, settings, and metadata';

Why Unity Catalog + MLflow?

Storage	Purpose	Benefits
Unity Catalog Table	Runtime prompt retrieval	Fast reads, SQL queryable, governed
MLflow Artifacts	Versioning & experiment tracking	Git-like history, rollback, lineage

Quick Loading Pattern

from pyspark.sql import SparkSession

def load_prompt(
    spark: SparkSession,
    catalog: str,
    schema: str,
    prompt_key: str,
    version: int = None
) -> str:
    """Load prompt from Unity Catalog agent_config table."""
    table_name = f"{catalog}.{schema}.agent_config"
    
    if version:
        query = f"""
            SELECT config_value
            FROM {table_name}
            WHERE config_key = '{prompt_key}'
              AND version = {version}
              AND config_type = 'prompt'
        """
    else:
        query = f"""
            SELECT config_value
            FROM {table_name}
            WHERE config_key = '{prompt_key}'
              AND config_type = 'prompt'
            ORDER BY version DESC
            LIMIT 1
        """
    
    result = spark.sql(query).collect()
    if not result:
        raise ValueError(f"Prompt not found: {prompt_key}")
    
    return result[0][0]

# Usage
spark = SparkSession.builder.getOrCreate()
orchestrator_prompt = load_prompt(spark, catalog, schema, "orchestrator")

Common Mistakes to Avoid

❌ DON'T: Hardcode Prompts in Agent

# BAD: Prompts embedded in code
class Agent:
    PROMPT = """You are a helpful assistant..."""  # ❌ No versioning!

✅ DO: Load from Registry

# GOOD: Prompts loaded from Unity Catalog
class Agent:
    def __init__(self):
        self._prompts = self._load_prompts_from_uc()

❌ DON'T: Skip MLflow Logging

# BAD: Only in Unity Catalog
register_to_table(prompts)  # ❌ No experiment tracking!

✅ DO: Dual Storage

# GOOD: Both Unity Catalog and MLflow
register_to_table(prompts)  # ✅ Runtime access
register_to_mlflow(prompts)  # ✅ Versioning & lineage

❌ DON'T: Ignore SQL Injection

# BAD: Unsanitized prompt text
spark.sql(f"INSERT INTO table VALUES ('{prompt}')")  # ❌ SQL injection!

✅ DO: Escape Single Quotes

# GOOD: Escaped quotes
sanitized = prompt.replace("'", "''")
spark.sql(f"INSERT INTO table VALUES ('{sanitized}')")  # ✅ Safe

prompt-registry-patterns

MLflow Prompt Registry Patterns

When to Use

Core Principles

1. Single Source of Truth

2. Prompt Versioning

3. A/B Testing Support

Unity Catalog Table Schema

Why Unity Catalog + MLflow?

Quick Loading Pattern

Common Mistakes to Avoid

❌ DON'T: Hardcode Prompts in Agent

✅ DO: Load from Registry

❌ DON'T: Skip MLflow Logging

✅ DO: Dual Storage

❌ DON'T: Ignore SQL Injection

✅ DO: Escape Single Quotes

Validation Checklist

References

Detailed Patterns

Setup Scripts

Official Documentation