Research Idea Brainstorming

Structured frameworks for discovering your next research idea. This skill provides ten complementary ideation lenses that help researchers move from vague curiosity to concrete, defensible research proposals. Each framework targets a different cognitive mode; use them individually or combine them for comprehensive exploration.

When to Use This Skill

  • Starting a new research direction and need structured exploration
  • Feeling stuck on a current project and want fresh angles
  • Evaluating whether a half-formed idea has real potential
  • Preparing for a brainstorming session with collaborators
  • Transitioning between research areas and seeking high-leverage entry points
  • Reviewing a field and looking for underexplored gaps

Do NOT use this skill when:

  • You already have a well-defined research question and need execution guidance
  • You need help with experimental design or methodology (use domain-specific skills)
  • You want a literature review (use scientific-skills:literature-review)

Core Ideation Frameworks

1. Problem-First vs. Solution-First Thinking

Research ideas originate from two distinct modes. Knowing which mode you are in prevents two common failures: building solutions that lack real problems, and chasing problems without feasible approaches.

Problem-First (pain point → method):

  • Start with a concrete failure, bottleneck, or unmet need
  • Naturally yields impactful work because the motivation is intrinsic
  • Risk: may converge on incremental fixes rather than paradigm shifts

Solution-First (new capability → application):

  • Start with a new tool, insight, or technique seeking application
  • Often drives breakthroughs by unlocking previously impossible approaches
  • Risk: "hammer looking for a nail"—solution may lack genuine demand

Workflow:

  1. Write down your idea in one sentence
  2. Classify it: Is this problem-first or solution-first?
  3. If problem-first → verify the problem matters (who suffers? how much?)
  4. If solution-first → identify at least two genuine problems it addresses
  5. For either mode, articulate the gap: what cannot be done today that this enables?

Self-Check:

  • Can I name a specific person or community who needs this?
  • Is the problem I am solving actually unsolved (not just under-marketed)?
  • If solution-first, does the solution create a new capability or just replicate existing ones?

2. The Abstraction Ladder

Every research problem sits at a particular level of abstraction. Deliberately moving up or down the ladder reveals ideas invisible at your current level.

| Direction | Action | Outcome |
| --- | --- | --- |
| Move Up (generalize) | Turn a specific result into a broader principle | Framework papers, theoretical contributions |
| Move Down (instantiate) | Test a general paradigm under concrete constraints | Empirical papers, surprising failure analyses |
| Move Sideways (analogize) | Apply same abstraction level to adjacent domain | Cross-pollination, transfer papers |

Workflow:

  1. State your current research focus in one sentence
  2. Move UP: What is the general principle behind this? What class of problems does this belong to?
  3. Move DOWN: What is the most specific, constrained instance of this? What happens at the extreme?
  4. Move SIDEWAYS: Where else does this pattern appear in a different field?
  5. For each new level, ask: Is this a publishable contribution on its own?

Example:

  • Current: "Improving retrieval accuracy for RAG systems"
  • Up: "What makes context selection effective for any augmented generation system?"
  • Down: "How does retrieval accuracy degrade when documents are adversarially perturbed?"
  • Sideways: "Database query optimization uses similar relevance ranking—what can we borrow?"

3. Tension and Contradiction Hunting

Breakthroughs often come from resolving tensions between widely accepted but seemingly conflicting goals. These contradictions are not bugs—they are the research opportunity.

Common Research Tensions:

| Tension Pair | Research Opportunity |
| --- | --- |
| Performance ↔ Efficiency | Can we match SOTA with 10x less compute? |
| Privacy ↔ Utility | Can federated/encrypted methods close the accuracy gap? |
| Generality ↔ Specialization | When does fine-tuning beat prompting, and why? |
| Safety ↔ Capability | Can alignment improve rather than tax capability? |
| Interpretability ↔ Performance | Do mechanistic insights enable better architectures? |
| Scale ↔ Accessibility | Can small models replicate emergent behaviors? |

Workflow:

  1. Pick your research area
  2. List the top 3-5 desiderata (things everyone wants)
  3. Identify pairs that are commonly treated as trade-offs
  4. For each pair, ask: Is this trade-off fundamental or an artifact of current methods?
  5. If artifact → the reconciliation IS your research contribution
  6. If fundamental → characterizing the Pareto frontier is itself valuable

Self-Check:

  • Have I confirmed this tension is real (not just assumed)?
  • Can I point to papers that optimize for each side independently?
  • Is my proposed reconciliation technically plausible, not just aspirational?

4. Cross-Pollination (Analogy Transfer)

Borrowing structural ideas from other disciplines is one of the most generative research heuristics. Many foundational techniques emerged this way—attention mechanisms draw from cognitive science, genetic algorithms from biology, adversarial training from game theory.

Requirements for a Valid Analogy:

  • Structural fidelity: The mapping must hold at the level of underlying mechanisms, not just surface similarity
  • Non-obvious connection: If the link is well-known, the novelty is gone
  • Testable predictions: The analogy should generate concrete hypotheses

High-Yield Source Fields for ML Research:

| Source Field | Transferable Concepts |
| --- | --- |
| Neuroscience | Attention, memory consolidation, hierarchical processing |
| Physics | Energy-based models, phase transitions, renormalization |
| Economics | Mechanism design, auction theory, incentive alignment |
| Ecology | Population dynamics, niche competition, co-evolution |
| Linguistics | Compositionality, pragmatics, grammatical induction |
| Control Theory | Feedback loops, stability, adaptive regulation |

Workflow:

  1. Describe your problem in domain-agnostic language (strip the jargon)
  2. Ask: What other field solves a structurally similar problem?
  3. Study that field's solution at the mechanism level
  4. Map the solution back to your domain, preserving structural relationships
  5. Generate testable predictions from the analogy
  6. Validate: Does the borrowed idea actually improve outcomes?

5. The "What Changed?" Principle

Strong ideas often come from revisiting old problems under new conditions. Advances in hardware, scale, data availability, or regulations can invalidate prior assumptions and make previously impractical approaches viable.

Categories of Change to Monitor:

| Change Type | Example | Research Implication |
| --- | --- | --- |
| Compute | GPUs 10x faster | Methods dismissed as too expensive become feasible |
| Scale | Trillion-token datasets | Statistical arguments that failed at small scale may now hold |
| Regulation | EU AI Act, GDPR | Creates demand for compliant alternatives |
| Tooling | New frameworks, APIs | Reduces implementation barrier for complex methods |
| Failure | High-profile system failures | Exposes gaps in existing approaches |
| Cultural | New user behaviors | Shifts what problems matter most |

Workflow:

  1. Pick a well-known negative result or abandoned approach (3-10 years old)
  2. List the assumptions that led to its rejection
  3. For each assumption, ask: Is this still true today?
  4. If any assumption has been invalidated → re-run the idea under new conditions
  5. Frame the contribution: "X was previously impractical because Y, but Z has changed"

6. Failure Analysis and Boundary Probing

Understanding where a method breaks is often as valuable as showing where it works. Boundary probing systematically exposes the conditions under which accepted techniques fail.

Types of Boundaries to Probe:

  • Distributional: What happens with out-of-distribution inputs?
  • Scale: Does the method degrade at 10x or 0.1x the typical scale?
  • Adversarial: Can the method be deliberately broken?
  • Compositional: Does performance hold when combining multiple capabilities?
  • Temporal: Does the method degrade over time (concept drift)?

Workflow:

  1. Select a widely used method with strong reported results
  2. Identify the implicit assumptions in its evaluation (dataset, scale, domain)
  3. Systematically violate each assumption
  4. Document where and how the method breaks
  5. Diagnose the root cause of each failure
  6. Propose a fix or explain why the failure is fundamental

Self-Check:

  • Am I probing genuine boundaries, not just confirming known limitations?
  • Can I explain WHY the method fails, not just THAT it fails?
  • Does my analysis suggest a constructive path forward?

7. The Simplicity Test

Before accepting complexity, ask whether a simpler approach suffices. Fields sometimes over-index on elaborate solutions when a streamlined baseline performs competitively.

Warning Signs of Unnecessary Complexity:

  • The method has many hyperparameters with narrow optimal ranges
  • Ablations show most components contribute marginally
  • A simple baseline was never properly tuned or evaluated
  • The improvement over baselines is within noise on most benchmarks

Workflow:

  1. Identify the current SOTA method for your problem
  2. Strip it to its simplest possible core (what is the one key idea?)
  3. Build that minimal version with careful engineering
  4. Compare fairly: same compute budget, same tuning effort
  5. If the gap is small → the contribution is the simplicity itself
  6. If the gap is large → you now understand what the complexity buys

Contribution Framing:

  • "We show that [simple method] with [one modification] matches [complex SOTA]"
  • "We identify [specific component] as the critical driver, not [other components]"

8. Stakeholder Rotation

Viewing a system from multiple perspectives reveals distinct classes of research questions. Each stakeholder sees different friction, risk, and opportunity.

Stakeholder Perspectives:

| Stakeholder | Key Questions |
| --- | --- |
| End User | Is this usable? What errors are unacceptable? What is the latency tolerance? |
| Developer | Is this debuggable? What is the maintenance burden? How does it compose? |
| Theorist | Why does this work? What are the formal guarantees? Where are the gaps? |
| Adversary | How can this be exploited? What are the attack surfaces? |
| Ethicist | Who is harmed? What biases are embedded? Who is excluded? |
| Regulator | Is this auditable? Can decisions be explained? Is there accountability? |
| Operator | What is the cost? How does it scale? What is the failure mode? |

Workflow:

  1. Describe your system or method in one paragraph
  2. Assume each stakeholder perspective in turn (spend 5 minutes per role)
  3. For each perspective, list the top 3 concerns or questions
  4. Identify which concerns are unaddressed by existing work
  5. The unaddressed concern with the broadest impact is your research question

9. Composition and Decomposition

Novelty often emerges from recombination or modularization. Innovation frequently lies not in new primitives, but in how components are arranged or separated.

Composition (combining existing techniques):

  • Identify two methods that solve complementary subproblems
  • Ask: What emergent capability arises from combining them?
  • Example: RAG + Chain-of-Thought → retrieval-augmented reasoning

Decomposition (breaking apart monolithic systems):

  • Identify a complex system with entangled components
  • Ask: Which component is the actual bottleneck?
  • Example: Decomposing "fine-tuning" into data selection, optimization, and regularization reveals that data selection often matters most

Workflow:

  1. List the 5-10 key components or techniques in your area
  2. Compose: Pick pairs and ask what happens when you combine them
  3. Decompose: Pick a complex method and isolate each component's contribution
  4. For compositions: Does the combination create emergent capabilities?
  5. For decompositions: Does isolation reveal a dominant or redundant component?

10. The "Explain It to Someone" Test

A strong research idea should be defensible in two sentences to a smart non-expert. This test enforces clarity of purpose and sharpens the value proposition.

The Two-Sentence Template:

  • Sentence 1 (Problem): "[Domain] currently struggles with [specific problem], which matters because [concrete consequence]."
  • Sentence 2 (Insight): "We [approach] by [key mechanism], which works because [reason]."
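Where an agent applies this test programmatically, the template reduces to a handful of required slots. A minimal Python sketch, assuming hypothetical names (`TwoSentencePitch` and its methods are illustrative, not part of this skill's interface):

```python
from dataclasses import dataclass

@dataclass
class TwoSentencePitch:
    """Slots of the two-sentence template; every field must be filled concretely."""
    domain: str       # e.g. "long-context retrieval"
    problem: str      # the specific struggle
    consequence: str  # why it matters
    approach: str     # what we do
    mechanism: str    # how we do it
    reason: str       # why it works

    def render(self) -> str:
        return (
            f"{self.domain} currently struggles with {self.problem}, "
            f"which matters because {self.consequence}. "
            f"We {self.approach} by {self.mechanism}, "
            f"which works because {self.reason}."
        )

    def missing_slots(self) -> list[str]:
        # Empty slots point back to a framework to revisit (see the list below).
        return [name for name, value in vars(self).items() if not value.strip()]
```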

If You Cannot Fill This Template:

  • The problem may not be well-defined yet → return to Framework 1
  • The insight may not be clear yet → return to Framework 7 (simplify)
  • The significance may not be established → return to Framework 3 (find the tension)

Calibration Questions:

  • Would a smart colleague outside your subfield understand why this matters?
  • Does the explanation stand without jargon?
  • Can you predict what a skeptic's first objection would be?

Integrated Brainstorming Workflow

Use this end-to-end workflow to go from blank page to ranked research ideas.

Phase 1: Diverge (Generate Candidates)

Goal: Produce 10-20 candidate ideas without filtering.

  1. Scan for tensions (Framework 3): List 5 trade-offs in your field
  2. Check what changed (Framework 5): List 3 recent shifts (compute, data, regulation)
  3. Probe boundaries (Framework 6): Pick 2 popular methods and find where they break
  4. Cross-pollinate (Framework 4): Pick 1 idea from an adjacent field
  5. Compose/decompose (Framework 9): Combine 2 existing techniques or split 1 apart
  6. Climb the abstraction ladder (Framework 2): For each candidate, generate up/down/sideways variants

Phase 2: Converge (Filter and Rank)

Goal: Narrow to 3-5 strongest ideas.

Apply these filters to each candidate:

| Filter | Question | Kill Criterion |
| --- | --- | --- |
| Explain-It Test (F10) | Can I state this in two sentences? | If no → idea is not yet clear |
| Problem-First Check (F1) | Is the problem genuine and important? | If no one suffers from this → drop it |
| Simplicity Test (F7) | Is the complexity justified? | If a simpler approach works → simplify or drop |
| Stakeholder Check (F8) | Who benefits? Who might object? | If no clear beneficiary → drop it |
| Feasibility | Can I execute this with available resources? | If clearly infeasible → park it for later |
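For agents tracking candidates through this phase, the filters reduce to explicit yes/no judgments recorded per idea. A minimal Python sketch, with the caveat that `Candidate`, `KILL_FILTERS`, and `converge` are illustrative names rather than part of this skill:

```python
from dataclasses import dataclass, field

@dataclass
class Candidate:
    """One raw idea from the Diverge phase, plus recorded filter judgments."""
    pitch: str                    # one-sentence statement of the idea
    source_framework: str         # e.g. "F3 tension hunting"
    filter_results: dict[str, bool] = field(default_factory=dict)

# The researcher (not the agent) supplies each yes/no judgment.
KILL_FILTERS = [
    "explain_it_test",       # F10: statable in two sentences?
    "problem_first_check",   # F1: does anyone genuinely suffer from this?
    "simplicity_test",       # F7: is the complexity justified?
    "stakeholder_check",     # F8: is there a clear beneficiary?
]

def converge(candidates: list[Candidate], keep: int = 5):
    """Drop ideas that fail a kill criterion; park infeasible ones for later."""
    kept, parked = [], []
    for c in candidates:
        if not all(c.filter_results.get(f, False) for f in KILL_FILTERS):
            continue  # failed a kill criterion -> drop
        if c.filter_results.get("feasibility", False):
            kept.append(c)
        else:
            parked.append(c)  # sound idea, not executable right now
    return kept[:keep], parked  # researcher ranks survivors before the cut
```

Keeping `parked` explicit matches the principle of retaining rejected candidates: they may recombine later.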

Phase 3: Refine (Sharpen the Winner)

Goal: Turn the top idea into a concrete research plan.

  1. Write the two-sentence pitch (Framework 10)
  2. Identify the core tension being resolved (Framework 3)
  3. Specify the abstraction level (Framework 2)
  4. List 3 concrete experiments that would validate the idea
  5. Anticipate the strongest objection and prepare a response
  6. Define a 2-week pilot that would provide signal on feasibility

Completion Checklist:

  • Two-sentence pitch is clear and compelling
  • Problem is genuine (problem-first check passed)
  • Approach is justified (simplicity test passed)
  • At least one stakeholder clearly benefits
  • Core experiments are specified
  • Feasibility pilot is defined
  • Strongest objection has a response

Framework Selection Guide

Not sure which framework to start with? Use this decision guide:

| Your Situation | Start With |
| --- | --- |
| "I don't know what area to work in" | Tension Hunting (F3) → What Changed (F5) |
| "I have a vague area but no specific idea" | Abstraction Ladder (F2) → Failure Analysis (F6) |
| "I have an idea but I'm not sure it's good" | Explain-It Test (F10) → Simplicity Test (F7) |
| "I have a good idea but need a fresh angle" | Cross-Pollination (F4) → Stakeholder Rotation (F8) |
| "I want to combine existing work into something new" | Composition/Decomposition (F9) |
| "I found a cool technique and want to apply it" | Problem-First Check (F1) → Stakeholder Rotation (F8) |
| "I want to challenge conventional wisdom" | Failure Analysis (F6) → Simplicity Test (F7) |

Common Pitfalls in Research Ideation

| Pitfall | Symptom | Fix |
| --- | --- | --- |
| Novelty without impact | "No one has done X" but no one needs X | Apply Problem-First Check (F1) |
| Incremental by default | Idea is +2% on a benchmark | Climb the Abstraction Ladder (F2) |
| Complexity worship | Method has 8 components, each helping marginally | Apply Simplicity Test (F7) |
| Echo chamber | All ideas come from reading the same 10 papers | Use Cross-Pollination (F4) |
| Stale assumptions | "This was tried and didn't work" (5 years ago) | Apply What Changed (F5) |
| Single-perspective bias | Only considering the ML engineer's view | Use Stakeholder Rotation (F8) |
| Premature convergence | Committed to first idea without exploring alternatives | Run full Diverge phase |

Usage Instructions for Agents

When a researcher asks for help brainstorming research ideas:

  1. Identify their starting point: Are they exploring a new area, stuck on a current project, or evaluating an existing idea?
  2. Select appropriate frameworks: Use the Framework Selection Guide to pick 2-3 relevant lenses
  3. Walk through frameworks interactively: Apply each framework step-by-step, asking the researcher for domain-specific inputs
  4. Generate candidates: Aim for 10-20 raw ideas across frameworks
  5. Filter and rank: Apply the Converge phase filters to narrow to top 3-5
  6. Refine the winner: Help articulate the two-sentence pitch and define concrete next steps

Key Principles:

  • Push for specificity—vague ideas ("improve efficiency") are not actionable
  • Challenge assumptions—ask "why?" at least three times
  • Maintain a written list of all candidates, even rejected ones (they may recombine later)
  • The researcher makes the final call on which ideas to pursue; the agent facilitates structured thinking