paper-to-skill

Installation

SKILL.md

Paper-to-Skill Pipeline

Transform research papers into production-grade skill packages. The pipeline extracts the actionable methodology from a paper, structures it as a skill specification, and feeds it through co-evolutionary refinement to produce a validated package.

This closes the loop between research and practice: a paper published today can become an executable skill tomorrow, without manual authoring.

Reference Files

File	Contents	Load When
`references/extraction-patterns.md`	Patterns for extracting methodology from papers	Always

Prerequisites

The to-markdown skill (for PDF/document conversion)
The research-critique skill (for paper analysis)
The test-engineer agent (for co-evolutionary skill generation)

Workflow

Phase 1: Paper Intake

Accept the paper in any supported format:

Input Format	Action
arXiv ID (e.g., 2604.01687)	Fetch via `https://arxiv.org/abs/<id>`, convert PDF
arXiv URL	Extract ID, fetch and convert
PDF file path	Convert using `to-markdown` skill
URL to paper	Fetch via `WebFetch`, convert if PDF
Pasted text	Use directly

For PDF conversion, invoke the to-markdown skill:

Convert this PDF to clean markdown, preserving section structure, tables, equations, and algorithm pseudocode. Drop references section but keep inline citations.

Phase 2: Critical Analysis

Invoke the research-critique skill on the converted paper:

Analyze this paper focusing on:

Core contribution: what is the novel methodology?

Algorithm description: extract the step-by-step procedure

Input/output specification: what goes in, what comes out?

Key parameters and their valid ranges

Claimed results and the evidence supporting them

Failure modes and limitations acknowledged by the authors

Prerequisites and dependencies (tools, data, compute)

The critique output becomes the foundation for the skill specification.

Phase 3: Skill Specification Extraction

From the critique output, build a structured skill specification:

specification:
  name: <kebab-case derived from paper's methodology name>
  domain: <paper's application domain>
  source_paper:
    title: <paper title>
    arxiv_id: <if available>
    url: <paper URL>
    authors: <first author et al.>
    date: <publication date>
  
  capabilities:
    - <capability 1 derived from the methodology>
    - <capability 2>
    - <capability 3>
  
  input_format: <what the skill accepts>
  output_format: <what the skill produces>
  
  algorithm_steps:
    - step: 1
      description: <from paper's algorithm>
      parameters: [<key params with ranges>]
    - step: 2
      description: <next step>
  
  failure_modes:
    - <from paper's limitations section>
  
  example_tasks:
    - <task 1 the methodology would solve>
    - <task 2>
    - <task 3>

Extraction rules:

Prefer the paper's own algorithm pseudocode over prose descriptions
Include parameter ranges from the paper's experiments (e.g., "learning rate: 0.001-0.01")
Map the paper's terminology to armory conventions (e.g., "module" → "skill", "pipeline" → "workflow")
If the paper describes multiple variants, extract the best-performing one

See references/extraction-patterns.md for patterns specific to common paper types.

Phase 4: Skill Generation

Hand off the specification to the test-engineer agent for co-evolutionary generation:

Evolve a skill for: [specification.domain]

Capabilities: [specification.capabilities] Algorithm: [specification.algorithm_steps] Input: [specification.input_format] Output: [specification.output_format] Failure modes: [specification.failure_modes] Example tasks: [specification.example_tasks]

Source: [specification.source_paper.title] ([specification.source_paper.url])

The test-engineer runs its full co-evolutionary loop (generate → verify → oracle → refine) using the specification as the task description.

Phase 5: Attribution and Finalization

Ensure the generated skill properly attributes the source paper:

Frontmatter: Add source: <paper_url> to the metadata

Body: Include an attribution section at the end of SKILL.md:

## Attribution

This skill implements the methodology from:
> <paper title>
> <authors>
> <venue/arxiv, date>
> <URL>

References: If the paper has supplementary materials (code, datasets), create a source materials reference file in the generated skill's references/ directory linking to them
Verify the skill name does not conflict with existing packages in manifest.yaml

Output

The complete skill package at skills/<name>/:

SKILL.md with attribution and paper-derived workflow
evals/cases.yaml with assertions generated by the co-evolutionary loop
references/ with extraction patterns and source materials
evals/evolution-log.yaml from the test-engineer's refinement process

Error Handling

Error	Resolution
Paper has no clear algorithm	Extract the methodology from the experiments section
Paper is purely theoretical	Report: no actionable methodology; suggest literature-review instead
PDF conversion fails	Try alternative: fetch HTML version or request user paste text
Paper methodology requires data/compute	Note in skill's prerequisites; skill may be a workflow template only
test-engineer budget exhausted	Return best-scoring iteration with manual review warning

Limitations

Cannot extract visual methodologies (circuit diagrams, neural architecture figures) — works on textual algorithm descriptions only
Papers with multiple interdependent contributions may produce overly complex skills — consider splitting into multiple skills
Non-English papers require translation before processing
The generated skill's quality depends on the paper's clarity of methodology description

Related skills

More from mathews-tom/armory

Installs

Repository

mathews-tom/armory

GitHub Stars

229

First Seen

Apr 8, 2026

Security Audits

Gen Agent Trust HubPass

SocketWarn

SnykWarn