drupal-major-upgrade-validation

Installation

SKILL.md

Drupal Major Upgrade Validation

Validate a Drupal major version upgrade by capturing a behavioral baseline on the current stable version, applying the upgrade, and running the same tests again on the new version. Produce a structured comparison report.

This skill is project-agnostic. It discovers how to run commands, which containers to use, and what URLs to hit by reading the project's AGENTS.md and other skills loaded in the current session.

Before You Start

Prerequisites

Two branches must exist: a stable branch (e.g., develop) running the current Drupal major version, and an upgrade branch with the new version already prepared (dependencies updated, patches ported, configuration adjusted).
Docker environment must be operational. The project must be buildable with docker compose build && docker compose up -d on both branches.
Browser automation tool. This skill uses playwright-cli for all browser interactions. Load the playwright-cli skill if not already loaded. Check availability:
```
command -v playwright-cli 2>/dev/null || npx -y @playwright/cli --version
```
Container context. Load the sparkfabrik-drupal-containers skill (skills/drupal/pkg-skills/) if not already loaded. It documents how to run commands inside containers, access services, discover URLs, and use make targets. All container and tooling commands in this skill follow the patterns described there.

First Interaction

Ask the user for:

Stable branch name (e.g., develop, main)
Upgrade branch name (e.g., feat/drupal-11, 497-D11-update)
Which pages to test -- or offer the default test plan (see below)
Admin credentials for backend testing (username and password)
Known local-only status report issues to ignore (e.g., missing private files directory, disabled trusted host patterns)

Default Test Plan

If the user does not provide a custom test plan, build one dynamically by navigating the site's main menu and sampling subpages.

Frontend: Menu-Driven Discovery

Homepage: Navigate to / and take a screenshot. Record title and console errors.
Extract main navigation links: Use playwright-cli to collect all first-level links from the site's primary navigation menu (typically <nav>, .menu--main, #block-mainnavigation, or the first <ul> inside the header). Record each link's text and href.
Visit every first-level menu item: Navigate to each link extracted in step 2. For each page:
- Wait for full load
- Record the page title
- Record console errors
- Take a screenshot
Random subpage sampling: For each first-level page that is a listing (contains links to detail pages, e.g., a blog index, product catalog, or news archive), randomly pick 1-2 items from the listing and navigate to them. Take a screenshot of each. This validates that detail pages and internal linking survive the upgrade.
Search: If a search form or /search path exists, execute a search with a simple term (e.g., the site name) and screenshot the results page.

Backend: Fixed Admin Pages

After frontend testing, log in with the credentials provided by the user and test these fixed admin paths:

Test	URL Path	Checks
Admin login	`/user/login`	Login form works, redirects to dashboard
Content list	`/admin/content`	Table renders with content items
Create content	`/node/add/<first-type>`	Form renders, all fields present (actual submission tested in editorial checks)
Site configuration	`/admin/config/system/site-information`	Settings page renders
Status report	`/admin/reports/status`	Check for errors and warnings (see below)
Modules list	`/admin/modules`	Module list renders
Media library	`/admin/content/media`	Media grid/table renders (if enabled)

Status Report Check

The /admin/reports/status page is a critical validation point. After taking a screenshot, inspect the page for error and warning rows. Classify each item:

Errors/warnings that also appear on the baseline (D10): these are pre-existing and not caused by the upgrade. Note them but do not mark as FAIL.
New errors/warnings that only appear after the upgrade: these may indicate a real regression. Record them and mark the status report check as WARN or FAIL.
Known local-only issues that are expected in local development and do not appear in production (e.g., missing private files directory, disabled trusted_host_patterns). Ask the user during the first interaction if there are known local-only status report items to ignore.

Numbering Convention

Number screenshots sequentially as they are taken: 01-homepage.png, 02-<menu-item-slug>.png, etc. The exact number of screenshots depends on how many menu items and subpages the site has. Record the full test manifest (number, name, URL, type) so Phase 4 can replay the exact same sequence.

Workflow

Phase 1: Discover Project Tooling

Before touching any branch, gather project context from the sparkfabrik-drupal-containers skill and the project's AGENTS.md:

Container commands: The sparkfabrik-drupal-containers skill defines how to run drush, composer, and other commands inside the tools container. Use those patterns throughout this workflow.
Site URL: Use the method from the skill to discover the local site URL (e.g., fs-cli pkg:get-urls from the host, or the container service hostname from inside a container).
Build command: Read the project's AGENTS.md to identify the site build command (e.g., make, bin/robo build-app, or custom scripts) and any hook system for pre/post install.
Command safety policy: Read the project's AGENTS.md for the command safety policy. Commands like composer install, drush deploy, and drush updb typically require user confirmation.

Phase 2: Baseline (Stable Branch)

Goal: Capture the current site's behavior as the reference baseline.

Switch to stable branch:

git stash  # if needed
git checkout <stable-branch>

Build the environment:

docker compose build
docker compose up -d

Install dependencies and build the site using the container commands from the sparkfabrik-drupal-containers skill. This typically involves:
- Running composer install inside the tools container
- Running the site build/install command (from AGENTS.md)
- Waiting for the site to be fully operational
Verify Drupal version: Run drush status --field=drupal-version inside the tools container. Record the exact version (e.g., 10.4.3).
Run the test plan using playwright-cli:

Create a directory for baseline screenshots:
```
mkdir -p .playwright-cli/baseline
```
Execute the Default Test Plan (or the user's custom plan):
- Start with the homepage
- Extract and visit all first-level main navigation links
- Randomly sample 1-2 subpages from each listing page
- Test search if available
- Log in to the backend and test fixed admin pages
For every page visited:
- Wait for the page to fully load
- Record the page title
- Record any console errors
- Take a full-page screenshot that captures the entire page, not just the visible viewport. Use playwright-cli run-code with Playwright's native full-page option:
```
playwright-cli run-code "async page => {
  await page.screenshot({ path: '.playwright-cli/baseline/NN-test-name.png', fullPage: true });
}"
```
  This scrolls the page internally and stitches the result into a single image.
- For form pages, verify that fields are present and interactive
Save the test manifest -- the ordered list of (number, name, URL, type) for every page tested. This manifest is replayed identically in Phase 4.
Record baseline results in memory:
- Drupal version
- Per-page: title, console error count, screenshot path, pass/fail, notes
- Total: pages tested, pages passed, total console errors

Phase 3: Upgrade

Goal: Switch to the upgrade branch and get the new version running.

Switch to upgrade branch:
```
git checkout <upgrade-branch>
```

Rebuild the environment:

docker compose build
docker compose up -d

Install dependencies: Run composer install inside the tools container.
Apply database updates and import configuration: Run drush deploy -y inside the tools container. This executes drush updatedb (database updates) followed by drush config:import (configuration sync). Monitor the output carefully:
- Record the number of update hooks executed
- Note any warnings or errors
- If drush deploy is not available, run the two commands separately: drush updatedb -y then drush config:import -y
Rebuild caches: Run drush cr inside the tools container.
Verify new Drupal version: Run drush status --field=drupal-version inside the tools container. Record the exact version (e.g., 11.1.6).

Phase 4: Validation (Upgraded Version)

Goal: Run the identical test plan against the upgraded site.

Create validation screenshot directory:
```
mkdir -p .playwright-cli/validation
```
Replay the test manifest saved in Phase 2. Visit the exact same URLs in the same order, using the same screenshot names. Do not re-discover the menu or re-sample subpages -- use the recorded manifest so the comparison is apples-to-apples. Save full-page screenshots to the validation/ directory using the same run-code technique from Phase 2:
```
playwright-cli run-code "async page => {
  await page.screenshot({ path: '.playwright-cli/validation/NN-test-name.png', fullPage: true });
}"
```
Editorial checks: Validate that content creation workflows still function after the upgrade. All test content created in this step is deleted at the end.

a. Discover a content type: Run drush entity:list --type=node_type (or visit /admin/structure/types) to find available content types. Pick the most common one (e.g., article, page, or the project's primary type).

b. Create a test node: Navigate to /node/add/<type> with playwright-cli. Fill in the required fields (at minimum the title, e.g., [UPGRADE-TEST] Editorial Check). If a body field exists, enter a short paragraph. Submit the form. Verify the node saves and the canonical page renders. Take a screenshot and record the new node's ID from the URL.

c. Upload a test media item: Navigate to /admin/content/media and use the "Add media" action (or /media/add/image if known). Upload a small test image, fill in required fields (name: [UPGRADE-TEST] Media Check), and save. Verify the media entity appears in the media library. Record the media ID.

d. Edit the test node: Navigate to /node/<nid>/edit. Change the title (e.g., append - Edited). Save and verify the updated title renders on the canonical page. Take a screenshot.

e. Custom module smoke tests: Custom modules are the most likely source of upgrade regressions. Perform lightweight checks on each custom module:

i. Discover custom modules: List modules under web/modules/custom/ (and addons/ if it exists). For each enabled custom module, gather: - Routes: check <module>.routing.yml for custom page/form routes - Entity types: check for @ContentEntityType or @ConfigEntityType annotations/attributes in src/Entity/ - Plugins: check for block, field formatter, or field widget plugins in src/Plugin/

ii. Visit custom routes: For each route that defines an accessible page (not an API endpoint or callback), navigate to it with playwright-cli. Verify the page renders without a fatal error (no white screen, no Drupal error page). Take a screenshot.

iii. Test custom entity forms: If a custom module defines content entities with add/edit forms, navigate to the add form. Verify the form renders and all fields are present. If the entity is simple enough (just a title/label), create a test entity with the [UPGRADE-TEST] prefix, verify it saves, then add it to the cleanup list. Do not attempt complex multi-field entity creation -- the goal is a smoke test, not exhaustive testing.

iv. Verify custom blocks: If a custom module provides block plugins, check that the blocks appear on the block layout page (/admin/structure/block) and are instantiable.

Record pass/fail per custom module. If a module has no routes, entities, or visible plugins, note it as "no testable surface" and skip.

f. Cleanup -- delete test content: Delete all test content created in this phase: the test node, test media item, and any custom entities created during the custom module smoke tests. Use the admin UI or drush commands:
```
drush entity:delete node <nid>
drush entity:delete media <mid>
drush entity:delete <entity_type> <id>  # for each custom entity
```
Verify each is gone by confirming 404/403 on their canonical URLs.

g. Record editorial results: Note pass/fail for each step (create, upload, edit, custom module checks, cleanup). If any step fails, record the error and take a screenshot.
Record validation results with the same structure as the baseline, plus the editorial check results.

Phase 5: Comparison Report

Goal: Produce a structured comparison report as a markdown file.

Generate a file named UPGRADE-VALIDATION-REPORT.md in the project root with the following structure:

# Drupal Upgrade Validation Report

**Date:** YYYY-MM-DD
**Project:** <project-name>
**Baseline branch:** <stable-branch>
**Upgrade branch:** <upgrade-branch>
**Baseline version:** Drupal <X.Y.Z>
**Upgraded version:** Drupal <X.Y.Z>

## Summary

| Metric                 | Baseline | Upgraded | Status         |
|------------------------|----------|----------|----------------|
| Drupal version         | X.Y.Z    | X.Y.Z    | --             |
| Pages tested           | N        | N        | --             |
| Pages passed           | N        | N        | PASS/FAIL      |
| Console errors (total) | N        | N        | PASS/WARN/FAIL |
| Editorial checks       | --       | 4/4      | PASS/FAIL      |
| Custom module checks   | --       | N/N      | PASS/FAIL      |
| Update hooks executed  | --       | N        | --             |

**Overall result:** PASS / FAIL

## Upgrade Details

- **Update hooks:** N hooks executed successfully
- **Configuration import:** clean / N conflicts
- **Notable changes:** (list any significant changes observed, e.g., theme
  changes, jQuery version bump, new admin UI)

## Page-by-Page Comparison

| #  | Page     | Baseline Title | Upgraded Title | Baseline Errors | Upgraded Errors | Visual            | Status |
|----|----------|----------------|----------------|-----------------|-----------------|-------------------|--------|
| 1  | Homepage | ...            | ...            | N               | N               | identical/changed | PASS   |
| 2  | ...      | ...            | ...            | ...             | ...             | ...               | ...    |

## Editorial Checks (Upgraded Version Only)

| Step          | Action                                       | Status    | Notes |
|---------------|----------------------------------------------|-----------|-------|
| Create node   | Created `<type>` node (nid: N)               | PASS/FAIL | ...   |
| Upload media  | Uploaded test image (mid: N)                  | PASS/FAIL | ...   |
| Edit node     | Changed title, verified update                | PASS/FAIL | ...   |
| Cleanup       | Deleted all test content                      | PASS/FAIL | ...   |

## Custom Module Smoke Tests (Upgraded Version Only)

| Module              | Routes | Entities | Blocks | Status    | Notes |
|---------------------|--------|----------|--------|-----------|-------|
| `my_custom_module`  | 2/2    | 1/1      | 0      | PASS/FAIL | ...   |
| `another_module`    | --     | --       | --     | skipped   | no testable surface |

## Console Errors

### Baseline (Drupal X)
<list any console errors per page, or "None">

### Upgraded (Drupal X)
<list any console errors per page, or "None">

## Screenshots

Baseline screenshots: `.playwright-cli/baseline/`
Validation screenshots: `.playwright-cli/validation/`

## Recommendations

<list any issues found, recommended fixes, or follow-up actions>

Posting Results to External Systems

IMPORTANT: Never post reports, comments, or any content to external systems (GitLab issues, GitHub PRs, Slack, etc.) without explicit human approval.

Before posting:

Show the full report content to the user in the conversation.
Ask for explicit confirmation to post (e.g., "Shall I post this to issue #N?").
Only post after the user approves. If the user requests changes, revise and show again before posting.

This applies to all external interactions: creating issue comments, uploading attachments referenced in reports, updating issue labels or status, etc.

Status Classification

PASS: Page works identically or better in the upgraded version
WARN: Minor differences detected (e.g., visual changes from theme updates, different console warning counts) but functionality is preserved
FAIL: Page is broken, missing content, or has new critical errors

Overall Result Rules

PASS: All pages pass, no new critical console errors
FAIL: Any page fails, or critical functionality is broken

Tips

S3FS vs local files: Some projects switch file storage backends between versions. Image URLs may change (e.g., /sites/default/files/ to /s3/files/). This is expected and not a failure -- note it in the report but mark as PASS.
Admin theme changes: Major Drupal upgrades often include admin theme updates (e.g., Gin 3.x to 4.x). Visual differences in the admin UI are expected. Focus on functional correctness (forms work, content saves) rather than pixel-perfect comparison.
jQuery version: Drupal 11 ships jQuery 4.x (up from 3.x in D10). This can cause console errors in contrib modules that use deprecated jQuery APIs. Note these but they are usually non-blocking.
Deprecated theme engines: If the project uses classy or stable as a base theme, these are removed from Drupal core in D11 and available as contrib. Check that the contrib versions are in composer.json on the upgrade branch.
Large test plans: For projects with many pages, prioritize: homepage, main content types, search, admin login, and content creation. Skip repetitive variations of the same page type.
Full-page screenshots: Always use page.screenshot({ fullPage: true }) via playwright-cli run-code instead of the plain playwright-cli screenshot command. The plain command only captures the visible viewport, missing content below the fold. Full-page mode scrolls internally and stitches the entire page into a single image, which is essential for detecting layout or rendering issues further down the page.
Resuming interrupted validations: If the process is interrupted, check which screenshots already exist in .playwright-cli/baseline/ and .playwright-cli/validation/ to determine where to resume.
Database seed vs config install: If the project uses a database seed for local builds, ensure the seed is from the stable branch before starting. The upgrade branch should use drush deploy on top of the seeded database, not a fresh config install.
Editorial check cleanup: The editorial checks create a test node and a media item on the upgraded site, then delete them after verification. The title prefix [UPGRADE-TEST] makes these easy to identify if cleanup is interrupted. If a previous run was interrupted, search for nodes/media with that prefix and delete them before re-running.
Custom module smoke tests: The custom module checks are intentionally shallow -- they verify that routes respond, forms render, and entities can be created. They do not test business logic, permissions matrices, or complex workflows. If a custom module has no routes, entity types, or block plugins, it is skipped. Modules that only provide services, event subscribers, or hooks without visible UI surface are noted as "no testable surface".

Related skills

More from sparkfabrik/sf-awesome-copilot

Installs

Repository

sparkfabrik/sf-…-copilot

GitHub Stars

First Seen

Apr 18, 2026

Security Audits

Gen Agent Trust HubPass

SocketPass

SnykFail