skills/emi-dm/research-mcp/arxiv-research

arxiv-research

SKILL.md

arXiv Research Assistant

Expert workflow for conducting structured research on arXiv papers with optimized information extraction and analysis.

Prerequisites

This skill requires the arXiv MCP server to be running. Ensure the server is started before use.

Core Workflows

1. Finding Relevant Papers

When to use: User wants papers on a specific topic, methodology, or author.

Process:

  1. Use arxiv_search_papers with clear keywords
  2. Default searches Software Engineering (cs.SE) category
  3. Review results and identify most relevant papers
  4. For broad topics: start with recent papers (sort_by: "submitted_date")
  5. For established topics: use relevance sorting

Key parameters:

  • query: Be specific ("test generation neural networks" vs "testing")
  • max_results: Start with 5-10, expand if needed
  • categories: Empty [] to search beyond SE

2. Deep Paper Analysis

When to use: User needs detailed understanding of a specific paper.

Structured extraction approach:

## Paper Analysis: [TITLE]

### 1. Core Contribution
- Main problem addressed
- Novel approach/solution
- Key insight or innovation

### 2. Methodology
- Research method (empirical, theoretical, survey, etc.)
- Datasets or subjects used
- Experimental design
- Evaluation metrics

### 3. Key Findings
- Primary results (with numbers/statistics)
- Performance comparisons
- Limitations acknowledged by authors

### 4. Implications
- Impact on the field
- Practical applications
- Future research directions

### 5. Related Work Context
- How it builds on prior work
- Key papers cited
- Position in research landscape

Tools sequence:

  1. arxiv_get_paper - Get metadata first (authors, abstract, year)
  2. arxiv_query_paper - Ask specific questions about content
  3. arxiv_convert_to_markdown - Only if full text reading needed

3. Comparative Analysis

When to use: User wants to compare multiple papers or approaches.

Process:

  1. Search and identify papers (2-5 papers max for deep comparison)
  2. Extract comparable dimensions for each:
    • Problem scope
    • Methodology
    • Datasets/evaluation
    • Results
    • Year and citations context
  3. Create structured comparison table
  4. Synthesize insights: commonalities, differences, evolution

Output format:

## Comparison: [TOPIC]

| Dimension | Paper 1 | Paper 2 | Paper 3 |
|-----------|---------|---------|---------|
| Approach | ... | ... | ... |
| Dataset | ... | ... | ... |
| Key Metric | ... | ... | ... |
| Results | ... | ... | ... |
| Year | ... | ... | ... |

### Key Insights
- **Trend:** ...
- **Best for X:** ...
- **Evolution:** ...

4. Literature Review

When to use: User needs comprehensive overview of a research area.

Systematic approach:

  1. Scope definition - Clarify time range, subtopics, inclusion criteria
  2. Initial search - Broad query, recent papers (last 2-3 years)
  3. Seed paper selection - Identify 3-5 highly relevant papers
  4. Iterative expansion - Search related terms, check citations
  5. Synthesis - Group by themes, identify trends, gaps

Output structure:

## Literature Review: [TOPIC]

### Research Landscape
- Number of papers found: X
- Time range: YYYY-YYYY
- Key categories: ...

### Major Themes
1. **Theme 1**: Description
   - Representative papers: [ID1, ID2]
   - Key findings: ...

2. **Theme 2**: Description
   - Representative papers: [ID3, ID4]
   - Key findings: ...

### Methodological Trends
- Dominant approaches: ...
- Evolution over time: ...

### Research Gaps
- Understudied areas: ...
- Contradictory findings: ...
- Future directions: ...

### Recommended Starting Papers
1. [Title] (ID) - Why: ...
2. [Title] (ID) - Why: ...

Question Templates

Use these optimized questions with arxiv_query_paper:

Methodology extraction:

  • "What specific methodology do the authors use? Include datasets, experimental setup, and evaluation metrics."
  • "Describe the technical approach in detail. What algorithms or techniques are employed?"

Results and findings:

  • "What are the quantitative results? Include specific metrics and comparisons to baselines."
  • "What are the main findings and their statistical significance?"

Critical analysis:

  • "What limitations do the authors acknowledge? What threats to validity are discussed?"
  • "How does this work compare to previous approaches? What improvements are demonstrated?"

Practical application:

  • "What are the practical implications of this work? How could it be applied in industry?"
  • "What tools, code, or datasets did the authors release?"

Research context:

  • "What future work do the authors suggest? What open questions remain?"
  • "What are the key papers cited in related work? What gap does this paper fill?"

Best Practices

Information Extraction

  1. Start broad, then narrow - Get abstract first, then query specific sections
  2. Be specific in queries - "What evaluation metrics..." vs "Tell me about the paper"
  3. Cross-reference - Verify important claims across abstract, methodology, and results
  4. Note limitations - Always extract acknowledged limitations and threats to validity

Efficiency Tips

  1. Use metadata first - Abstract often answers high-level questions
  2. Batch similar papers - Search once, analyze multiple results
  3. 100-page limit - For long papers, query specific sections instead of full conversion
  4. Cache key info - Summarize important papers to avoid re-querying

Critical Reading

  1. Evaluate methodology - Is the evaluation rigorous? Are datasets appropriate?
  2. Check reproducibility - Are methods described clearly? Is code/data available?
  3. Assess impact - How many citations? Recent or dated?
  4. Context matters - Consider publication venue and year

Common Patterns

Pattern: Research Question → Papers

User: "What are recent advances in LLM testing?"

1. arxiv_search_papers("LLM testing evaluation", sort_by="submitted_date")
2. Review top 5 papers
3. For 2-3 most relevant:
   - arxiv_query_paper: "What testing techniques are proposed?"
   - arxiv_query_paper: "What are the main results and limitations?"
4. Synthesize: summarize approaches, compare effectiveness, note trends

Pattern: Paper → Understanding

User: "Explain paper 2301.12345"

1. arxiv_get_paper(arxiv_id)
2. Read abstract → identify key aspects
3. arxiv_query_paper: "What is the main contribution and methodology?"
4. arxiv_query_paper: "What are the quantitative results?"
5. arxiv_query_paper: "What limitations do authors discuss?"
6. Synthesize into structured analysis

Pattern: Comparative Evaluation

User: "Compare mutation testing approaches in Papers X, Y, Z"

1. For each paper:
   - arxiv_get_paper → metadata
   - arxiv_query_paper: "What mutation operators are used?"
   - arxiv_query_paper: "What are the effectiveness metrics?"
2. Create comparison table
3. Synthesize: which approach for which context?

Output Quality Standards

Always Include

  • Paper IDs - Full arXiv IDs for reference
  • Specifics - Numbers, metrics, exact claims (not vague descriptions)
  • Context - Year, venue, citation count when relevant
  • Limitations - What the paper doesn't address
  • Actionability - Practical takeaways or implications

Avoid

  • Vague summaries without specifics
  • Uncritical acceptance of claims
  • Mixing papers without clear attribution
  • Overconfidence about papers not fully analyzed

Troubleshooting

"Too many results" → Add more specific keywords, narrow time range, use categories filter

"Paper too long to convert" → Use arxiv_query_paper with specific questions instead of full conversion

"Can't find specific methodology details" → Query: "Describe the technical implementation and algorithms in detail"

"Need newer/older papers" → Use sort_by: "submitted_date" or add year to query ("software testing 2023")

"Rate limit error" → Wait 3-5 seconds between requests, batch queries when possible

Weekly Installs
6
First Seen
Feb 4, 2026
Installed on
codex6
opencode5
gemini-cli4
github-copilot4
kimi-cli4
amp4