Demo Flow Runner

You generate and run Playwright browser automation scripts from natural language scenario descriptions, with full recording (HAR + video + click visualization).

Preamble (run first)

# Check skill lib
if [ -f ".claude/skills/demo/lib/index.js" ]; then
  echo "SKILL_LIB: ready"
else
  echo "SKILL_LIB: missing"
fi

# Check Playwright
if npx playwright --version >/dev/null 2>&1; then
  echo "PLAYWRIGHT: $(npx playwright --version 2>&1)"
else
  echo "PLAYWRIGHT: missing"
fi

# Check Chromium
if [ -d "$(npx playwright install --dry-run chromium 2>/dev/null | head -1)" ] || npx playwright install --dry-run chromium 2>&1 | grep -q "is already installed"; then
  echo "CHROMIUM: ready"
else
  echo "CHROMIUM: missing"
fi

# Check ffmpeg (required for video)
if command -v ffmpeg >/dev/null 2>&1; then
  echo "FFMPEG: $(ffmpeg -version 2>&1 | head -1)"
else
  echo "FFMPEG: missing (video conversion will fail)"
fi

# Check for .demoflow
if [ -d ".demoflow" ]; then
  echo "DEMOFLOW: initialized"
  ls .demoflow/scenarios/ 2>/dev/null | head -10
else
  echo "DEMOFLOW: not initialized (run /demo init)"
fi

If SKILL_LIB is missing: the skill was not installed correctly. Tell the user to run npx skills add llmer/skill-demoflow.

If PLAYWRIGHT is missing: run npm install --save-dev @playwright/test.

If CHROMIUM is missing: run npx playwright install chromium.

If FFMPEG is missing: warn the user that video conversion requires ffmpeg. They can install it with brew install ffmpeg (macOS) or apt install ffmpeg (Linux). Recording will still work but MP4 output will be skipped.

Subcommands

Check $ARGUMENTS first:

init → Run the Init Flow to explore the project and scaffold .demoflow/
list → List available scenarios from .demoflow/scenarios/ and targets from .demoflow/targets/
Anything else → Run a scenario (see Run Flow below)

Init Flow

When $ARGUMENTS is init, you explore the project and bootstrap .demoflow/.

What to do

Ensure runtime dependencies are installed:
- Verify the skill lib exists: .claude/skills/demo/lib/index.js (installed by skills add)
- Check if @playwright/test is in package.json. If not: npm install --save-dev @playwright/test
- Check if Playwright Chromium is available. If not: npx playwright install chromium
Explore the codebase using the Explore agent:
- Find all routes/pages (look for app/ or pages/ directory structures, route files, page components)
- Identify the auth flow (login, signup, verify, OAuth — look for auth pages, middleware, session handling)
- Find key interactive components (forms, wizards, modals, dialogs)
- Detect the framework (Next.js, Remix, SvelteKit, etc.) and UI library (shadcn, MUI, etc.)
- Look for existing environment config (.env.example, .env.local) to discover URLs, API endpoints, email services
- Check if Supabase, Mailpit, or other local services are configured (docker-compose, supabase/config.toml)
Generate .demoflow/context.md from what you discover:
- Auth flow (login page selectors, OTP input, submit buttons)
- Key pages and their interactive elements
- Navigation patterns (sidebar, breadcrumbs, tabs, header links)
- Modals/dialogs that might appear during flows
- Rate limits or gotchas you spot in the code
- Keep it concise — focus on what a Playwright script would need to know
Generate target configs in .demoflow/targets/:
- local.md — detect the dev server URL (from package.json scripts, .env), and if there's a local email server (Mailpit/Inbucket), configure auto-OTP
- production.md (if a prod URL is discoverable) — use prompt strategy for OTP
- Ask the user to confirm URLs and credentials you can't find
Suggest 3-5 scenarios based on the flows you discover:
- The primary happy path (signup → first action → core feature)
- A CRUD flow (create, read, update, delete of the main entity)
- Navigation coverage (visit every major section)
- An edge case or error flow if obvious
- Present these to the user and ask which to generate. Write the selected ones to .demoflow/scenarios/

Create the directory structure if it doesn't exist:

.demoflow/
├── context.md
├── targets/
│   ├── local.md
│   └── production.md
└── scenarios/
    └── (suggested scenarios)

Ensure scripts/ is excluded from the build:
- Read tsconfig.json and check if scripts is in the exclude array
- If not, add "scripts" to the exclude array — Playwright imports are Node-only and will break framework builds (Next.js, Remix, etc.) if included in the compilation scope
- If there's no exclude array, create one with ["scripts"]
Report what was created and suggest the user review context.md for accuracy.

Run Flow

Step 1: Resolve the scenario

If $ARGUMENTS matches a file in .demoflow/scenarios/ (by name, with or without .md), read that file. Otherwise treat $ARGUMENTS as an inline flow description.

If --target <name> is present in arguments, use that target. Otherwise use the target: field from the scenario's ## Config section.

Step 2: Read target + context

Read .demoflow/targets/{target-name}.md — this is the DUT (Device Under Test) config that tells you the base URL, how to handle auth/OTP, timeouts, and environment-specific behavior
Read .demoflow/context.md if it exists — app-specific UI patterns, selectors, navigation hints
Read the scenario file (or use the inline description)
If needed, read relevant source files to understand selectors and page structure

Step 3: Generate a Playwright script

Write a complete Playwright script to scripts/demo-run.ts that:

Imports from the skill lib (path relative to scripts/):

import { launchWithRecording, finalize, requestInput, pauseRecording, resumeRecording } from '../.claude/skills/demo/lib/index.js'

Uses the target config to set:
- BASE_URL from the target's Connection section
- TEST_EMAIL from the target's Connection section
- OTP strategy: if target says strategy: prompt, use requestInput(). If target provides Mailpit API details, auto-fetch the OTP.
- Timeouts from the target's Behavior section
Launches a headed browser with HAR + video recording
Executes every step from the scenario
Takes error screenshots on failure
Calls finalize() in a finally block to ensure recordings are saved

Output directory: output/{scenario-name}/

Key pattern:

import { launchWithRecording, finalize, requestInput } from '../.claude/skills/demo/lib/index.js'

const BASE_URL = '...'         // from target
const TEST_EMAIL = '...'       // from target
const WALLET_TIMEOUT = 180_000 // from target

async function main() {
  const session = await launchWithRecording({
    outputDir: 'output/scenario-name',
    headed: true,
    slowMo: 100,
    desktopFrame: true, // wraps video in macOS desktop + browser chrome
  })
  const { page } = session

  try {
    // Steps here, using target-specific values

    // When waiting for user input, pass session to auto-trim idle time from video:
    const otp = await requestInput(session.outputDir, 'Enter the OTP code', { session })
  } catch (err) {
    await page.screenshot({ path: 'output/scenario-name/error.png' })
    throw err
  } finally {
    const result = await finalize(session)
    console.log('HAR:', result.harPath)
    console.log('Video:', result.mp4Path)
  }
}

main()

Step 4: Run the script

Run the script in the background: npx tsx scripts/demo-run.ts
Monitor for the input signal file: poll output/{name}/.waiting-for-input
When the signal appears, read its contents (the prompt message) and ask the user for the value
When the user responds, write their answer to output/{name}/.input-value
The script will pick it up and continue automatically
Wait for the script to complete

Step 5: Report results

Tell the user:

Target used (local/production/etc.)
HAR file path
Video file path (MP4)
Any errors that occurred
Summary of what was captured

Recording Library Reference

All functions are exported from the skill lib at .claude/skills/demo/lib/index.js.

`launchWithRecording(options) → RecordingSession`

Launch a Chromium browser with full recording: HAR capture, video, and click visualization.

Option	Type	Default	Description
`outputDir`	`string`	required	Output directory for HAR, video, screenshots. Created if missing.
`viewport`	`{ width, height }`	`1280x720`	Browser viewport size. Also sets video resolution.
`headed`	`boolean`	`true`	Show the browser window. Set `false` for CI.
`slowMo`	`number`	`100`	Delay between actions in ms. Higher = more readable video.
`ignoreHTTPSErrors`	`boolean`	`true`	Bypass HTTPS certificate errors (useful for local dev).
`desktopFrame`	`boolean \| DesktopFrameOptions`	`true`	Wrap video in desktop chrome. See Desktop Frame.

Returns a RecordingSession with browser, context, page, and outputDir.

`finalize(session) → RecordingResult`

Close the browser, convert video to MP4, apply desktop frame compositing. Must be called in a finally block — skipping this loses the HAR and video.

Returns:

Field	Type	Description
`harPath`	`string`	Path to the HAR file (always present)
`mp4Path`	`string \| null`	Path to MP4 video (`null` if ffmpeg missing)
`webmPath`	`string \| null`	Path to raw WebM video

Pipeline: close browser → rename WebM → trim pauses (if any) → convert to MP4 → composite with desktop frame → clean up temps.

`requestInput(outputDir, message, options?) → string`

Pause the script and wait for external input (OTP codes, 2FA tokens, manual confirmation).

Option	Type	Default	Description
`session`	`RecordingSession`	—	If provided, video is auto-paused while waiting and idle time is trimmed from final video.
`timeoutMs`	`number`	`300000`	Max wait time in ms before throwing.

Writes .waiting-for-input signal file. Polls for .input-value response file. Auto-cleans both files after input is received.

`pauseRecording(session)` / `resumeRecording(session)`

Manually mark idle periods. The paused segments are trimmed from the final MP4 using ffmpeg trim + concat filters. Use when you have a known wait that isn't handled by requestInput.

pauseRecording(session)
await someSlowOperation()
resumeRecording(session)

`provideInput(outputDir, value)`

Write input from the skill/CLI side. Called by the skill runner after getting the value from the user.

`checkWaiting(outputDir) → string | null`

Check if a script is waiting for input. Returns the prompt message or null.

Desktop Frame

Videos are composited onto a desktop OS frame (titlebar + window chrome + wallpaper background) for polished output.

Options

Pass to launchWithRecording({ desktopFrame: ... }):

Value	Behavior
`true` (default)	macOS Sonoma style frame
`false`	No frame — raw viewport video
`{ style: 'macos' }`	macOS Sonoma with traffic lights and tab
`{ style: 'windows-xp' }`	Windows XP with IE chrome, taskbar, and Start button
`{ title: 'My App' }`	Override the tab/titlebar text (default: page title at finalize)
`{ resolution: { width: 1920, height: 1080 } }`	Desktop resolution (default: 1920x1080)

Frame anatomy

macOS (default):

Dark gradient wallpaper (purple/blue tones)
Window with rounded corners (top and bottom), drop shadow
Titlebar: traffic lights (red/yellow/green) + centered tab with page title
Content area: your recorded video

Windows XP:

Blue sky + green hills wallpaper
Blue gradient titlebar with IE icon + minimize/maximize/close
Address bar with URL + Go button
XP taskbar at bottom with green Start button + clock

How it works

Browser records WebM video at viewport size
finalize() converts WebM → MP4 (with pause trimming if needed)
Playwright renders the frame HTML to a PNG at the desktop resolution
ffmpeg overlays the MP4 onto the frame PNG at the calculated content position
Framed MP4 replaces the original

The frame is a static PNG — it doesn't change during the video. The page title shown in the tab is captured from the page at finalize time.

Click Visualization

Every page automatically gets a click visualization script injected via addInitScript. When the user clicks anywhere:

A red circle (30px, semi-transparent) appears at the click point
The circle expands to 2.5x and fades out over 900ms
Removed from DOM after 1200ms

This is captured in the video recording — no post-processing needed. The visualization works across all page navigations (re-injected on each new page load).

Output Files

After a successful run, output/{scenario-name}/ contains:

File	Description
`recording.har`	Full network capture (importable in Chrome DevTools Network tab)
`recording.mp4`	Polished video with click indicators + desktop frame
`recording.webm`	Raw video from Playwright (pre-conversion)
`error.png`	Screenshot at point of failure (only on error)

If desktop frame is disabled, recording.mp4 is the raw viewport video without chrome.

Target Resolution

Targets are Markdown files in .demoflow/targets/ that describe the runtime environment. They contain:

Connection: URL and email/credentials
Auth: How to handle OTP (auto via Mailpit, or prompt the user)
Behavior: Timeouts, headed/headless, slow motion
Notes: Environment-specific gotchas

When generating the script, read the target file and use its values. If a value uses ${RUN_ID}, generate a unique ID (e.g. Date.now().toString(36)).

Handling Directives

When you see these in the scenario, handle them in the generated script:

Directive	Generated Code
`[prompt: message]`	`const val = await requestInput(outputDir, "message", { session })`
`[save: var from url]`	`const var = page.url().match(/pattern/)[1]`
`[pause: Ns]`	`await new Promise(r => setTimeout(r, N * 1000))`
`[assert: condition]`	Appropriate Playwright assertion
`[screenshot: name]`	`await page.screenshot({ path: join(outputDir, 'name.png') })`

Script Generation Guidelines

Use target-specified timeouts (don't hardcode)
Add await new Promise(r => setTimeout(r, 500)) between actions for video readability
Use Playwright's text-based selectors when possible: page.click('button:has-text("Create Trust")')
Handle dialogs/modals that might appear unexpectedly
Add retry logic for flaky operations
Always wrap the main flow in try/catch/finally with finalize() in finally
Never skip finalize() — even on error, it saves the HAR and whatever video was captured
Prefer page.getByRole() and page.getByText() over CSS selectors
Use page.waitForURL() after navigation actions to ensure the page has loaded
Add page.waitForLoadState('networkidle') before screenshots or assertions on dynamic pages

Tips

Always call finalize() in finally. If you skip it, the HAR is lost and the browser process leaks.
Pass { session } to requestInput(). This auto-pauses video during idle waits so the final MP4 doesn't have dead time.
Use slowMo for video quality. 100ms is a good default. Bump to 200-300ms for demos where you want the viewer to see each step clearly.
Use pauseRecording / resumeRecording around long waits. Any operation where the screen is static for >2s should be trimmed.
Set viewport to match your target audience. 1280x720 is a safe default. Use 1920x1080 for full-HD demos, but note the desktop frame adds chrome around it.
The desktop frame title is captured at finalize. Navigate to the most meaningful page before the script ends so the title bar shows something useful.
Use desktopFrame: { style: 'windows-xp' } for fun. It generates a full Windows XP Internet Explorer chrome with taskbar. Great for retro-themed demos.
Check for ffmpeg before running. Without it, you get HAR + WebM but no MP4 and no desktop frame compositing.

Completion Status

When the run finishes, report status:

DONE — Scenario completed. HAR + video saved. Report file paths.
DONE_WITH_CONCERNS — Completed, but with issues (flaky selectors, slow loads, skipped steps). List each concern.
BLOCKED — Cannot proceed. State what is blocking (missing dependency, app not running, auth failed) and what was tried.
NEEDS_CONTEXT — Missing information required to continue (no target URL, unknown auth flow, ambiguous scenario steps). State exactly what you need.

Example Invocations

/demo init — explore the project, generate context + targets + suggested scenarios
/demo list — show available scenarios and targets
/demo workspace-switching — runs .demoflow/scenarios/workspace-switching.md with its default target
/demo workspace-switching --target production — override to use production target
/demo "log in and navigate to the dashboard" — generates from inline description using default target

demo