Refactor Safely

Use this skill when the task is to change structure without changing intended behavior.

Core principle: Small, reversible steps over large rewrites. Separate design improvement from behavior change.

Quick Reference

Step	Action	Verification
1	Define stable behavior	Written statement of what must not change
2	Add characterization tests	Tests pass on current code
3	Choose smallest safe slice	One boundary at a time
4	Rename, move, or extract	Tests still pass
5	Remove compatibility shims	Tests still pass, new path proven

HARD-GATE

NO REFACTORING WITHOUT CHARACTERIZATION TESTS FIRST.
SEPARATE behavior changes from structural refactors.
VERIFY tests pass after EVERY step — not just at the end.

If you haven't run the test suite after a refactoring step, you cannot claim it works.

Refactoring Order

Define the behavior that must stay stable.
Add or tighten characterization tests around that behavior.
Choose the smallest safe slice.
Rename, move, or extract in steps that keep the code runnable.
Remove compatibility shims only after the new path is proven.

Core Rules

Split behavior changes from structural refactors when practical.
Keep public interfaces stable until callers are migrated.
Extract one boundary at a time.
Prefer adapters, facades, or wrappers for transitional states.
Stop and simplify if the refactor introduces more indirection than clarity.

Good First Moves

Rename unclear methods or objects
Isolate duplicated logic behind a shared object
Extract query or service objects from repeated workflows
Wrap external integrations before moving call sites
Add narrow seams before deleting old code paths

Verification Protocol

EXTREMELY-IMPORTANT: Run verification after every refactoring step.

AFTER each step:
1. Run the full test suite
2. Read the output — check exit code, count failures
3. If tests fail: STOP, undo the step, investigate
4. If tests pass: proceed to next step
5. ONLY claim completion with evidence from the last test run

Forbidden claims:

"Should work now" (run the tests)
"Looks correct" (run the tests)
"I'm confident" (confidence is not evidence)

Examples

Stable behavior to preserve: "Creating an order validates line items, applies pricing, persists the order, and enqueues NotifyWarehouseJob."

Smallest safe sequence (extract service):

Add a characterization test (request or service spec) that covers the current OrdersController#create flow.
Extract Orders::CreateOrder with the same behavior; call it from the controller; keep controller response/redirect logic. Verify tests pass.
Remove duplicated logic from the controller. No behavior change. Verify tests pass.
(Later) Improve the service internals if needed; the refactor is done.

Red-flag refactor (avoid): "Rename Order to Purchase and update all 50 call sites in one PR" — too many touchpoints; do renames in small steps with find/replace and tests after each commit.

Common Mistakes

Mistake	Reality
"Quick refactor, no tests needed"	No characterization tests = no safety net. Bugs will slip through.
Mixing behavior change with structural change	Do one or the other. Mixed changes are impossible to review.
"I'll verify at the end"	Verify after EVERY step. Bugs compound when you batch.
Renaming 50 call sites in one commit	Do it in small batches with tests between each.
Adding abstraction to satisfy a pattern	Abstractions must serve a real boundary, not a textbook.
"Should work now" without running tests	Evidence before claims, always.
Removing old code before new path is proven	Keep compatibility shims until callers are fully migrated.

Red Flags

Refactor plan requires touching many unrelated call sites at once
No tests prove current behavior before starting
Structural cleanup is mixed with new feature work
Old and new paths diverge without a migration plan
New abstractions exist only to satisfy a pattern, not a real boundary
More than 3 refactoring steps without running tests
Using "should", "probably", "seems to" when claiming tests pass

Output Style

When asked to refactor:

State the stable behavior that must not change.
Propose the smallest safe sequence.
Add or point to the tests that protect each step.
Call out any temporary compatibility code and when to remove it.
Run verification after each step and report results with evidence.

Integration

Skill	When to chain
rspec-best-practices	For writing characterization tests before refactoring
rails-architecture-review	When refactor reveals structural problems
rails-code-review	For reviewing the refactored code
ruby-service-objects	When extracting logic into service objects

refactor-safely