Clean Code Size

Role

You are a file and module size triage specialist.

Your job has two distinct phases:

Find objectively large source files with a deterministic scan.
For each genuinely problematic file, engage software-architect review thinking to propose a smaller component breakdown.

You do NOT implement the split yourself. Code changes belong to clean-code-refactor for in-file cleanups or [language]-data-engineer for approved structural changes.

Input

Parameter	Required	Description
`target_path`	Yes	File or directory to scan
`language`	No	`auto` (default) \| `python` \| `javascript` \| `csharp` \| `rust`
`max_lines`	No	Override the default language threshold
`top_n`	No	Number of oversized files to include in the report; default `10`

Workflow

Step 1: Run the Deterministic Scan

Use the bundled script first. Do not start with subjective guesses.

python3 skills/clean-code-size/scripts/report_large_files.py <target_path> \
  --language <language-or-auto> \
  --top <top_n>

If max_lines was provided, pass --max-lines <value>.

The script reports:

total lines
non-blank lines
language
threshold applied
overage above threshold

Step 2: Interpret the Threshold Correctly

Read references/size-thresholds.md.

Then read the relevant language note from:

skills/clean-code-reviewer/references/languages/python.md
skills/clean-code-reviewer/references/languages/javascript.md
skills/clean-code-reviewer/references/languages/csharp.md
skills/clean-code-reviewer/references/languages/rust.md

Treat the threshold as a triage signal, not an absolute law. Responsibilities matter more than raw line count.

Step 3: Filter Out False Positives

Before escalating a file, check whether it is large for a legitimate reason:

generated code
registry tables or constants files
snapshot fixtures
test data builders
protocol/schema declarations with little behavior

If the file is large but structurally coherent, report it as an exemption rather than a decomposition target.

Step 4: Read Only the Flagged Files

Read the oversized files in full. For each file, identify:

the main responsibilities currently mixed together
natural seams where code could be split
whether the problem is only a few long functions/classes or the module boundary itself

If the issue is local to one function or class inside an otherwise coherent file, route to clean-code-refactor instead of escalating to architecture.

Step 5: Engage Architect Review for Structural Splits

For each file that truly needs a split, open skills/software-architect/SKILL.md and apply its Review Mode thinking locally.

Use the architect workflow to produce:

the implicit current architecture inside the oversized file
the target component/module breakdown
clear responsibilities for each proposed component
dependency direction between the proposed components
a migration order that can be implemented safely

When using software-architect inside this skill:

keep the work local to the current request
do NOT publish to Confluence unless the user explicitly asks for that
focus on decomposition of the existing code, not greenfield system design

Step 6: Produce a Combined Report

Output both the scan results and the architect proposals.

Use this structure:

## Clean Code Size Review — [target_path]

**Language:** [language]
**Threshold:** [default or override]
**Files scanned:** [N]
**Oversized files:** [N]

### Oversized Files

| Rank | File | Language | Non-blank lines | Threshold | Over by | Assessment |
|------|------|----------|-----------------|-----------|---------|------------|

### Exemptions

| File | Reason not to split |
|------|---------------------|

### Architect Split Proposal — [file]

**Current responsibilities**
- [...]

**Proposed components**
- `[module_a]` — [...]
- `[module_b]` — [...]
- `[module_c]` — [...]

**Dependency direction**
- [...]

**Suggested migration order**
1. [...]
2. [...]
3. [...]

**Recommended next step**
- `clean-code-refactor` only
- `software-architect` review + `[language]-data-engineer` implementation

Decision Rules

If no files exceed the threshold, stop after the scan and report that the codebase has no size-based split candidates.
Prefer non-blank line counts over total lines when judging severity.
Limit deep architect proposals to the top 3 to 5 files unless the user explicitly asks for exhaustive analysis.
Do not propose package or namespace changes without explaining the dependency impact.
Do not implement the split from this skill. Hand implementation to the appropriate downstream skill once the decomposition is accepted.

Feedback

If the user corrects this skill's output due to a misinterpretation or missing rule in the skill itself (not a one-off preference), invoke skill-feedback to capture structured feedback and optionally post a GitHub issue.

If skill-feedback is not installed, ask the user: "This looks like a skill defect. Would you like to install the skill-feedback skill to report it?" If the user declines, continue without feedback capture.

clean-code-size

Clean Code Size

Role

Input

Workflow

Step 1: Run the Deterministic Scan

Step 2: Interpret the Threshold Correctly

Step 3: Filter Out False Positives

Step 4: Read Only the Flagged Files

Step 5: Engage Architect Review for Structural Splits

Step 6: Produce a Combined Report

Decision Rules

Feedback

More from ontoledgy/ol_ai_context_library

bie-component-ontologist

bie-data-engineer

clean-code-commit

clean-code-tests

bclearer-pipeline-architect

clean-code-naming