Rust Mastery (Senior → Principal)

Operate

Start by confirming: goal, scope, crate type (bin/lib/workspace), Rust/MSRV constraints, target platform, unsafe requirements, latency/throughput goals, and the definition of done.
Prefer small, reviewable changes with tests and explicit tradeoffs.
Default to stable Rust, stdlib-first patterns, and boring solutions before adding macros or dependencies.
Treat production code as an operable system: timeouts, shutdown, observability, and failure modes are part of the feature.

The target is not “clever Rust”. The target is code that remains correct, observable, and maintainable under production stress.

Default Rust Standards

Keep main.rs thin; put business logic in testable modules or crates.
Prefer typed domain errors with thiserror; use anyhow at application boundaries and CLIs.
No unwrap()/expect() on production paths unless the invariant is truly impossible and documented by the code structure.
Introduce traits at the consumer boundary, not pre-emptively.
Prefer ownership and borrowing that make invalid states unrepresentable before reaching for Arc<Mutex<_>>.
Every spawned task needs an owner, a cancellation path, and an error handling strategy.
Keep unsafe code isolated, minimal, and justified with explicit invariants.

“Bad vs Good” (common production pitfalls)

// ❌ BAD: panic in a production path with no context.
let user = repo.find(id).await.unwrap();

// ✅ GOOD: propagate context with a typed error.
let user = repo
    .find(id)
    .await
    .map_err(AppError::from)?
    .ok_or(AppError::UserNotFound { id })?;

// ❌ BAD: detached task with no owner and no shutdown path.
tokio::spawn(async move {
    loop {
        run_job().await;
    }
});

// ✅ GOOD: task respects cancellation and reports failures.
tokio::spawn(async move {
    loop {
        tokio::select! {
            _ = shutdown.cancelled() => break,
            result = run_job() => {
                if let Err(err) = result {
                    tracing::error!(error = %err, "job failed");
                }
            }
        }
    }
});

Workflow (Feature / Refactor / Bug)

Reproduce the behavior or codify it with a failing test.
Decide boundaries: transport, orchestration, domain, adapters, persistence.
Define failure modes: panics, cancellation, partial writes, retries, timeouts, shutdown.
Implement the smallest end-to-end slice.
Add tests, benchmarks, or property tests when the risk justifies them.
Validate formatting, lints, security, and release behavior.

Validation Commands

Run cargo fmt --all --check.
Run cargo clippy --workspace --all-targets --all-features -- -D warnings.
Run cargo test --workspace --all-features.
Run cargo test -- --nocapture when debugging test output.
Run cargo nextest run --workspace --all-features if available for faster suites.
Run cargo llvm-cov if coverage matters.
Run cargo audit before release.
Run cargo deny check if the repo uses policy checks for licenses/advisories.

Architecture & Boundaries

Prefer a modular monolith before splitting into many crates or services.
Keep boundary direction explicit: transport -> use-case -> domain -> ports -> adapters.
Map errors once at the boundary: HTTP/gRPC/CLI should translate domain errors consistently.
Keep domain types free from transport-specific concerns where practical.

Async, Concurrency, and Ownership Guardrails

Avoid “shared mutable state first”; prefer message passing or ownership transfer.
If you use Arc<Mutex<_>>, document the protected invariant and expected contention.
Bound concurrency for fan-out work; avoid unbounded task spawning.
Always set timeouts for outbound IO and database acquisition.
Treat cancellation as part of correctness, not just cleanup.

Service/API Defaults

Use structured tracing with stable fields such as service, trace_id, request_id, tenant_id, and status.
Expose health/readiness endpoints for services.
Validate input at the boundary; never trust deserialized payloads blindly.
Make error taxonomy explicit: invalid, unauthorized, forbidden, not-found, conflict, unavailable.
Prefer idempotent handlers for side-effecting operations where retries may happen.

Performance & Safety Defaults

Measure before optimizing with criterion, flamegraphs, or profiler traces.
Watch clone frequency, allocation churn, lock contention, and serialization hotspots.
Prefer zero-copy and borrowing only when it improves the real bottleneck and keeps code readable.
Use panic = "abort" only when the operational tradeoff is understood.

Security Checklist (Minimum)

No secrets in logs, panic messages, or Debug output.
Validate lengths, counts, recursion depth, and body sizes for untrusted input.
Use parameterized SQL and least-privilege credentials.
Prefer allowlists for outbound network and file operations in high-risk systems.
Keep unsafe blocks isolated and reviewed as security-sensitive code.

References

Architecture and dependency direction: references/architecture.md
Advanced patterns: references/advanced-patterns.md
Bug prevention: references/bug-prevention.md
Code review checklist: references/code-review-guide.md
Debugging and profiling: references/debugging-guide.md
Database and SQLX: references/database-and-sqlx.md
HTTP service patterns: references/http-service-patterns.md
Observability: references/observability.md
Reliability: references/reliability.md
Senior habits and idioms: references/senior-habits.md
Trusted libraries: references/trusted-libraries.md
Production readiness and operations: references/production-readiness.md

Scripts & Assets

scripts/scaffold_project.py - bootstrap a Rust project skeleton.
assets/github-ci.yml - CI baseline for GitHub Actions.

rust-principal-engineer

Rust Mastery (Senior → Principal)

Operate

Default Rust Standards

“Bad vs Good” (common production pitfalls)

Workflow (Feature / Refactor / Bug)

Validation Commands

Architecture & Boundaries

Async, Concurrency, and Ownership Guardrails

Service/API Defaults

Performance & Safety Defaults

Security Checklist (Minimum)

References

Scripts & Assets