lean4
Lean 4 Theorem Proving
Use this skill whenever you're editing Lean 4 proofs, debugging Lean builds, formalizing mathematics in Lean, or learning Lean 4 concepts. It prioritizes LSP-based inspection and mathlib search, with scripted primitives for sorry analysis, axiom checking, and error parsing.
Core Principles
Search before prove. Many mathematical facts already exist in mathlib. Search exhaustively before writing tactics.
Build incrementally. Lean's type checker is your test suite—if it compiles with no sorries and standard axioms only, the proof is sound.
Respect scope. Follow the user's preference: fill one sorry, its transitive dependencies, all sorries in a file, or everything. Ask if unclear.
Never change statements or add axioms without explicit permission. Theorem/lemma statements, type signatures, and docstrings are off-limits unless the user requests changes. Inline comments may be adjusted; docstrings may not (they're part of the API). Custom axioms require explicit approval—if a proof seems to need one, stop and discuss.
Commands
| Command | Purpose |
|---|---|
/lean4:formalize |
Turn informal math into Lean statements |
/lean4:prove |
Guided cycle-by-cycle theorem proving |
/lean4:autoprove |
Autonomous multi-cycle proving with stop rules |
/lean4:checkpoint |
Verified save point (build + axiom check + commit) |
/lean4:review |
Quality audit (--mode=batch or --mode=stuck) |
/lean4:refactor |
Strategy-level proof simplification |
/lean4:golf |
Optimize proofs for brevity |
/lean4:learn |
Interactive teaching and mathlib exploration |
/lean4:doctor |
Plugin troubleshooting and migration help |
Which Command?
| Situation | Command |
|---|---|
| Draft a Lean statement from an informal claim | /lean4:formalize |
| Filling sorries (interactive) | /lean4:prove |
| Filling sorries (unattended) | /lean4:autoprove |
| Verified save point | /lean4:checkpoint |
| Quality check (read-only) | /lean4:review |
| Simplify proof strategies (mathlib leverage, helpers) | /lean4:refactor |
| Optimizing compiled proofs | /lean4:golf |
| New to this project / exploring | /lean4:learn --mode=repo |
| Navigating mathlib for a topic | /lean4:learn --mode=mathlib |
| Something not working | /lean4:doctor |
| Formalize + prove end-to-end | /lean4:autoprove --formalize=auto --source=... --claim-select=first --formalize-out=... |
Typical Workflow
/lean4:formalize Turn informal math into Lean statements (optional entry)
↓
/lean4:prove Guided cycle-by-cycle proving (asks before each cycle)
/lean4:autoprove Autonomous multi-cycle proving (runs with stop rules)
↓
/lean4:refactor Simplify proof strategies (optional, or --dry-run to preview)
↓
/lean4:golf Optimize proofs for tactic-level brevity (optional)
↓
/lean4:checkpoint Create verified save point
Use /lean4:learn at any point to explore repo structure or navigate mathlib. Use /lean4:formalize standalone or via --formalize=auto on autoprove for end-to-end source-to-proof.
Notes:
/lean4:proveasks before each cycle;/lean4:autoproveloops autonomously with hard stop conditions- Both trigger
/lean4:reviewat configured intervals (--review-every) - When reviews run (via
--review-every), they act as gates: review → replan → continue. In prove, replan requires user approval; in autoprove, replan auto-continues - Review supports
--mode=batch(default) or--mode=stuck(triage); review is always read-only --formalize=autoon autoprove wraps formalize+prove in a single command (source → claims → skeletons → proofs)- If you hit environment issues, run
/lean4:doctorto diagnose
LSP Tools (Preferred)
Sub-second feedback and search tools (LeanSearch, Loogle, LeanFinder) via Lean LSP MCP:
lean_goal(file, line) # See exact goal
lean_hover_info(file, line, col) # Understand types
lean_local_search("keyword") # Fast local + mathlib (unlimited)
lean_leanfinder("goal or query") # Semantic, goal-aware (10/30s)
lean_leansearch("natural language") # Semantic search (3/30s)
lean_loogle("?a → ?b → _") # Type-pattern (unlimited if local mode)
lean_hammer_premise(file, line, col) # Premise suggestions for simp/aesop/grind (3/30s)
lean_state_search(file, line, col) # Goal-conditioned lemma search (3/30s)
lean_multi_attempt(file, line, snippets=[...]) # Test multiple tactics
Core Primitives
| Script | Purpose | Output |
|---|---|---|
sorry_analyzer.py |
Find sorries with context | text (default), json, markdown, summary |
check_axioms_inline.sh |
Check for non-standard axioms | text |
smart_search.sh |
Multi-source mathlib search | text |
find_golfable.py |
Detect optimization patterns | JSON |
find_usages.sh |
Find declaration usages | text |
Usage: Invoked by commands automatically. See references/ for details.
Invocation contract: Never run bare script names. Always use:
- Python:
${LEAN4_PYTHON_BIN:-python3} "$LEAN4_SCRIPTS/script.py" ... - Shell:
bash "$LEAN4_SCRIPTS/script.sh" ... - Report-only calls: add
--report-onlytosorry_analyzer.py,check_axioms_inline.sh,unused_declarations.sh— suppresses exit 1 on findings; real errors still exit 1. Do not use in gate commands like/lean4:checkpoint. - Keep stderr visible for Lean scripts (no
/dev/nullredirection), so real errors are not hidden.
If $LEAN4_SCRIPTS is unset or missing, run /lean4:doctor and stay LSP-only until resolved.
Automation
/lean4:prove and /lean4:autoprove handle most tasks:
- prove — guided, asks before each cycle. Ideal for interactive sessions.
- autoprove — autonomous, loops with hard stop rules. Ideal for unattended runs.
Both share the same cycle engine (plan → work → checkpoint → review → replan → continue/stop) and follow the LSP-first protocol: LSP tools are normative for discovery and search; script fallback only when LSP is unavailable or exhausted. Compiler-guided repair is escalation-only — not the first response to build errors. For complex proofs, they may delegate to internal workflows for deep sorry-filling (with snapshot, rollback, and scope budgets), proof repair, or axiom elimination. You don't invoke these directly.
Skill-Only Behavior
When editing .lean files without invoking a command, the skill runs one bounded pass:
- Read the goal or error via
lean_goal/lean_diagnostic_messages - Search mathlib with up to 2 LSP tools (e.g.
lean_local_search+lean_leanfinder/lean_leansearch/lean_loogle) - Try the Automation Tactics cascade
- Validate with
lean_diagnostic_messages(no project-gatelake buildin this mode) - No looping, no deep escalation, no multi-cycle behavior, no commits
- End with suggestions:
Use
/lean4:provefor guided cycle-by-cycle help. Use/lean4:autoprovefor autonomous cycles with stop safeguards.
Quality Gate
A proof is complete when:
lake buildpasses- Zero sorries in agreed scope
- Only standard axioms (
propext,Classical.choice,Quot.sound) - No statement changes without permission
Verification ladder: lean_diagnostic_messages(file) per-edit → lake env lean <path/to/File.lean> file gate (run from project root) → lake build project gate only. See cycle-engine: Build Target Policy.
Common Fixes
See compilation-errors for error-by-error guidance (type mismatch, unknown identifier, failed to synthesize, timeout, etc.).
Type Class Patterns
-- Local instance for this proof block
haveI : MeasurableSpace Ω := inferInstance
letI : Fintype α := ⟨...⟩
-- Scoped instances (affects current section)
open scoped Topology MeasureTheory
Order matters: provide outer structures before inner ones.
Automation Tactics
Try in order (stop on first success):
rfl → simp → ring → linarith → nlinarith → omega → exact? → apply? → grind → aesop
Note: exact?/apply? query mathlib (slow). grind and aesop are powerful but may timeout. See grind-tactic for interactive workflows, annotation strategy, and simproc escalation.
Troubleshooting
If LSP tools aren't responding, scripts provide fallback for all operations. If environment variables (LEAN4_SCRIPTS, LEAN4_REFS) are missing, run /lean4:doctor to diagnose.
Script environment check:
echo "$LEAN4_SCRIPTS"
ls -l "$LEAN4_SCRIPTS/sorry_analyzer.py"
# One-pass discovery for troubleshooting (human-readable default text):
${LEAN4_PYTHON_BIN:-python3} "$LEAN4_SCRIPTS/sorry_analyzer.py" . --report-only
# Structured output (optional): --format=json
# Counts only (optional): --format=summary
Cold start / fresh worktree:
- Fresh worktree or after
lake clean? Prime the cache in that worktree before the first real build. - Use the project's cache command:
lake cache geton newer Lake, orlake exe cache getwhere the project still uses the mathlib cache executable. - If Lean LSP is cold or timing out on first use, run one
lake buildto bootstrap the workspace. - After bootstrap, return to the normal verification ladder:
lean_diagnostic_messages(file)→lake env lean <path/to/File.lean>(from project root) →lake buildonly at checkpoint/final gate. - Do not symlink another worktree's
.lake/build; use Lake cache/artifact mechanisms instead.
References
Cycle Engine: cycle-engine — shared prove/autoprove logic (stuck, deep mode, falsification, safety)
LSP Tools: lean-lsp-server (quick start), lean-lsp-tools-api (full API — grep ^## for tool names)
Search: mathlib-guide (read when searching for existing lemmas), lean-phrasebook (math→Lean translations)
Errors: compilation-errors (read first for any build error), instance-pollution (typeclass conflicts — grep ## Sub- for patterns), compiler-guided-repair (escalation-only repair — not first-pass)
Tactics: tactics-reference (tactic lookup — grep ^### TacticName), grind-tactic (SMT-style automation — when simp can't close), simp-reference (simp hygiene + custom simprocs), tactic-patterns, calc-patterns
Proof Development: proof-templates, proof-refactoring (28K — grep by topic), proof-simplification (strategy-level: mathlib search, congr lemmas, helper extraction), sorry-filling
Optimization: proof-golfing (includes safety rules, bounded LSP lemma replacement, bulk rewrites, anti-patterns; escalates to axiom-eliminator), proof-golfing-patterns, performance-optimization (grep by symptom), profiling-workflows (diagnose slow builds/proofs)
Domain: domain-patterns (25K — grep ## Area), measure-theory (28K), axiom-elimination
Style: mathlib-style, verso-docs (Verso doc comment roles and fixups)
Custom Syntax: lean4-custom-syntax (read when building notations, macros, elaborators, or DSLs), metaprogramming-patterns (MetaM/TacticM API — composable blocks, elaborators), scaffold-dsl (copy-paste DSL template), json-patterns (json% syntax + ToJson)
Quality: linter-authoring (project-specific linter rules), ffi-patterns (C/ObjC bindings via Lake)
Workflows: agent-workflows, subagent-workflows, command-examples, learn-pathways (intent taxonomy, game tracks, source handling)
Internals: review-hook-schema