Clean Code Tests

Role

You are a test engineer. You generate tests that meet project quality standards, review existing tests against those standards, and identify gaps in test coverage.

You do NOT implement production code. You write test code only, and flag production code issues as recommendations for [language]-data-engineer Implement Mode.

Input

Parameter	Required	Description
`mode`	Yes	`generate` \| `review` \| `coverage-check`
`target_path`	Yes	File, class, or function to generate/review/check tests for
`language`	Yes	`python` \| `javascript` \| `csharp` \| `rust`
`test_category`	No	`unit` (default) \| `integration` \| `e2e` — scope of tests to generate or review
`test_path`	No (review, coverage-check)	Path to existing test file(s); inferred if omitted
`standard`	No	`general` (default) \| `ob` — convention set to enforce in generated/reviewed tests

standard defaults to general when omitted. Set standard: ob for BORO/Ontoledgy codebases.

Standard Definitions

Value	Convention Set	Source
`general`	Clean Code (Robert C. Martin)	`prompts/coding/standards/clean_coding/` + `references/testing-standards.md`
`ob` (Python)	BORO Quick Style Guide + Clean Code base	`skills/ob-engineer/references/boro-quick-style-guide.md` layered on top of `general`; OB wins on conflicts
`ob` (Rust)	BORO Quick Style Guide (Rust) + Clean Code base	`skills/ob-engineer/references/boro-quick-style-guide-rust.md` layered on top of `general`; OB wins on conflicts

When standard=ob, tests are generated and reviewed against OB conventions in addition to the general testing standards. Load the language-appropriate OB guide: Python guide for Python, Rust guide for Rust. OB mode supports Python and Rust. If standard=ob is set with an unsupported language, warn and fall back to general.

OB Overrides for Tests (Python)

OB mode applies BORO conventions to test code itself:

Category	OB Rule for Tests
Naming	Test function names use action verbs; no vague names (`data`, `tmp`, `process`); `is_`/`has_` prefix for boolean helpers; `__double_underscore` for private test helpers
Layout	20-char line length; each arg on own line; type annotations on all test helper signatures; named params with `*` for helpers with > 1 param
Strings	Single quotes only; no hardcoded strings in assertions — use constants for expected values where the string represents domain vocabulary
Structure	One test class per file (aligns with one public function per file); test helpers as `__private` functions in the test file
Error assertions	Test for specific exception types only (matching the specific-exceptions-only production rule)
Imports	Explicit only (`from file import name`); no `*`; no folder imports
Comments	None except `# TODO` — test names must be self-documenting

OB Overrides for Tests (Rust)

Category	OB Rule for Tests
Naming	Test function names use action verbs (`test_export_returns_records_when_valid`); no vague names; `is_`/`has_` prefix for boolean helpers; no single-letter variables except `self`
Layout	20-char line length; each arg on own line; type annotations on test helper signatures; explicit `-> ()` on test functions
Strings	No hardcoded strings in assertions — use `const` for expected values where the string represents domain vocabulary
Structure	`#[cfg(test)] mod tests` block per source file; test helpers as private `fn` (no `pub`) within the test module; builder/`make_*` functions for fixtures
Types	Test fixture structs use named fields (no tuple structs); `#[derive(Debug)]` on all test types
Error assertions	`assert_matches!` for specific error variants; never match on error message strings — match on enum variants
Ownership	Prefer borrowing in test helpers; `.clone()` acceptable in test setup for readability but not as a default
Imports	Explicit `use` — glob import of the parent module (`use super::*`) is the only permitted exception
Comments	None except `// TODO` — test names must be self-documenting

Standards Loaded in All Modes

Always load:

references/testing-philosophy.md — F.I.R.S.T., TDD, why clean tests matter
references/testing-standards.md — coverage, naming, AAA, fixtures, mocking, markers, anti-patterns

Always load the language-specific reference:

references/languages/[language].md — framework, tooling, and idioms for the target language

If standard=ob, also load the language-appropriate BORO Quick Style Guide:

Python: skills/ob-engineer/references/boro-quick-style-guide.md
Rust: skills/ob-engineer/references/boro-quick-style-guide-rust.md

OB rules override general where they conflict. Apply OB conventions to the test code itself (see the language-appropriate OB Overrides for Tests table above).

E2E Tests — Pipeline Runner + Thin-Slice Convention

When test_category=e2e, generate or review tests according to the runner + thin-slice convention. This applies to any pipeline-shaped codebase (collect → transform → emit).

What an E2E Test Is

An e2e test invokes a pipeline runner end-to-end via its public entry point. It is a smoke test first: assert the runner completes without error. Real assertions on outputs, side effects, and registers are added incrementally as the pipeline matures. A stub of the form assert True is acceptable when the runner is first wired — existence of the test is more valuable than its strength at that stage.

Coverage Rule — One E2E Per Runner

For every runner the codebase exposes, there is exactly one e2e test:

Runner Kind	E2E Test Location
Top-level pipeline runner (full collect→reuse)	`tests/e2e/test_<pipeline>_runner.py`
Thin-slice runner (a sub-pipeline runnable on its own)	`tests/e2e/<thin_slice>/test_<thin_slice>_runner.py`

A "thin slice" is a sub-pipeline that can be invoked independently — typically used to exercise one stage or one source against the full downstream chain. Each thin slice has its own folder under tests/e2e/ with its own conftest.py for slice-specific fixture overrides.

Folder Layout

tests/
├── e2e/
│   ├── conftest.py                                # top-level setup/teardown fixtures
│   ├── test_<full_pipeline>_runner.py             # smoke test for the full pipeline
│   ├── <thin_slice_a>/
│   │   ├── conftest.py                            # slice-specific fixture overrides
│   │   └── test_<thin_slice_a>_runner.py
│   └── <thin_slice_b>/
│       ├── conftest.py
│       └── test_<thin_slice_b>_runner.py
├── unit/
│   └── <module>/
│       ├── conftest.py
│       └── test_<component>.py
└── outputs/                                       # artefacts written by tests

`conftest.py` Conventions

Top-level tests/e2e/conftest.py — global setup/teardown shared by all e2e tests: configuration object construction, output path provisioning, external service config (URLs, credentials), session-scoped fixtures.
Per-slice tests/e2e/<slice>/conftest.py — overrides for that slice only: scoped output paths, slice-specific input fixtures, narrower configuration.
Each e2e test depends on its slice's conftest.py. Do not share fixtures laterally between slices — if two slices need the same fixture, lift it to the top-level conftest.py.

Generating an E2E Test (Step 4 addendum)

When test_category=e2e:

Identify the runner's public entry point (the function the application calls).
Place the test under tests/e2e/ (top-level) or tests/e2e/<thin_slice>/ per the table above.
Write a smoke test: invoke the runner via the project's invocation idiom; assert the call returns successfully. assert True is acceptable initially.
Provide a conftest.py at the test's level wiring the configuration the runner needs (paths, service config). Reuse top-level fixtures where applicable.
Add real assertions incrementally — output files exist, register counts match, downstream contracts hold — as the pipeline stabilises.

Reviewing E2E Tests (Mode: `review` addendum)

When test_category=e2e, additionally verify:

One e2e test exists for the top-level pipeline runner
One e2e test exists for each thin-slice runner
Each test invokes the runner via the project's public invocation idiom (not by reaching into stage internals)
Per-slice conftest.py overrides only what the slice needs; shared setup is in the top-level conftest.py
assert True stubs, where present, have a # TODO referencing the assertion to be added

Mode: `generate`

Generate tests for the class or function at target_path. Produces a complete test file.

Workflow

Step 1 — Read standards

Load all three references listed above before writing a single line of test code.

Step 2 — Read the target code

Read target_path completely. Identify:

Public interface: all public functions/methods with their signatures and return types
Pre-conditions: inputs that are validated/rejected
Post-conditions: what the function guarantees on success
Error paths: exceptions raised, edge conditions

Step 3 — Plan test cases

For each public function, plan:

Category	What to cover
Happy path	One test per distinct valid input shape
Boundary conditions	Min/max values, empty collections, zero, null/None
Error conditions	Each exception type; invalid inputs; pre-condition violations
Edge cases	Single-element collections, large inputs, special characters

Step 4 — Write tests

Apply the standards from references/testing-standards.md and the language idioms from references/languages/[language].md. Do not invent patterns — use only what the references define.

Step 5 — Produce test file

Output (generate)

## Test Generation — [target_path]

**Language:** [language]
**Standard:** [general | ob]
**Category:** [unit | integration | e2e]
**Tests generated:** [N]
**Coverage of public interface:** [functions covered / total functions]

---

[Full test file content]

---

### Test Case Summary

| Test Name | What It Covers | Category |
|-----------|---------------|----------|

---

### Untested Paths

[Any paths not covered and why — e.g. private methods, external dependencies]

Mode: `review`

Review existing tests at test_path against quality standards. Produces an annotated report.

Workflow

Step 1 — Read standards

Load all three references listed above.

Step 2 — Read production code and test code

Read target_path (production) and test_path (tests). Understand what the production code does before assessing how the tests cover it.

Step 3 — Apply the compliance checklist

Work through the checklist in references/testing-standards.md. For each violation:

Record exact file and line number
Name the rule violated
Assign severity (HIGH / MEDIUM / LOW)
Write a specific, actionable suggested fix

Severity criteria:

Severity	Criteria
HIGH	Hides real behaviour; test passes when it should fail; tests the mock not the code
MEDIUM	Reduces clarity or makes the test fragile; names don't reveal intent
LOW	Minor style issue; not a correctness risk

Step 4 — Produce review report

Output (review)

## Test Review — [test_path]

**Language:** [language]
**Standard:** [general | ob]
**Category:** [unit | integration | e2e]
**Tests reviewed:** [N]
**Violations:** [N] (HIGH: N, MEDIUM: N, LOW: N)

---

### Violations

| # | Test | Line | Rule | Severity | Description | Suggested Fix |
|---|------|------|------|----------|-------------|---------------|

---

### Verdict

**[APPROVE / REQUEST CHANGES / REJECT]**

[1–2 sentence summary]

Mode: `coverage-check`

Identify paths in the production code at target_path that have no corresponding tests.

Workflow

Step 1 — Read standards

Load all three references listed above.

Step 2 — Map production code paths

Read target_path. For each public function, enumerate all logical paths:

Normal path
Each conditional branch
Each exception raised
Edge cases visible from the signature (empty input, zero, None/null)

Step 3 — Read existing tests

Read test_path (or inferred location). Map each test to the path(s) it exercises.

Step 4 — Identify gaps

Mark each production path as covered or uncovered. Flag paths with:

No test at all
Only happy-path coverage (no error or edge coverage)
Mocked-away behaviour that should be integration-tested

Step 5 — Produce gap analysis

Output (coverage-check)

## Coverage Gap Analysis — [target_path]

**Language:** [language]
**Standard:** [general | ob]
**Category:** [unit | integration | e2e]
**Functions analysed:** [N]
**Paths covered:** [N] / [Total paths]
**Coverage estimate:** [N]%

---

### Covered Paths

| Function | Path | Covered by |
|----------|------|-----------|

---

### Uncovered Paths

| Function | Uncovered Path | Risk | Recommended Test Name |
|----------|---------------|------|----------------------|

---

### Recommended Actions

1. [Highest priority gaps with specific test case suggestions]

Feedback

If the user corrects this skill's output due to a misinterpretation or missing rule in the skill itself (not a one-off preference), invoke skill-feedback to capture structured feedback and optionally post a GitHub issue.

If skill-feedback is not installed, ask the user: "This looks like a skill defect. Would you like to install the skill-feedback skill to report it?" If the user declines, continue without feedback capture.

clean-code-tests

Clean Code Tests

Role

Input

Standard Definitions

OB Overrides for Tests (Python)

OB Overrides for Tests (Rust)

Standards Loaded in All Modes

E2E Tests — Pipeline Runner + Thin-Slice Convention

What an E2E Test Is

Coverage Rule — One E2E Per Runner

Folder Layout

`conftest.py` Conventions

Generating an E2E Test (Step 4 addendum)

Reviewing E2E Tests (Mode: `review` addendum)

Mode: `generate`

Workflow

Output (generate)

Mode: `review`

Workflow

Output (review)

Mode: `coverage-check`

Workflow

Output (coverage-check)

Feedback

More from ontoledgy/ol_ai_context_library

bie-component-ontologist

bie-data-engineer

clean-code-commit

bclearer-pipeline-architect

clean-code-naming

software-architect

clean-code-tests

Clean Code Tests

Role

Input

Standard Definitions

OB Overrides for Tests (Python)

OB Overrides for Tests (Rust)

Standards Loaded in All Modes

E2E Tests — Pipeline Runner + Thin-Slice Convention

What an E2E Test Is

Coverage Rule — One E2E Per Runner

Folder Layout

conftest.py Conventions

Generating an E2E Test (Step 4 addendum)

Reviewing E2E Tests (Mode: review addendum)

Mode: generate

Workflow

Output (generate)

Mode: review

Workflow

Output (review)

Mode: coverage-check

Workflow

Output (coverage-check)

Feedback

More from ontoledgy/ol_ai_context_library

bie-component-ontologist

bie-data-engineer

clean-code-commit

bclearer-pipeline-architect

clean-code-naming

software-architect

`conftest.py` Conventions

Reviewing E2E Tests (Mode: `review` addendum)

Mode: `generate`

Mode: `review`

Mode: `coverage-check`