research-idea-validator
Research Idea Validator
Purpose
Help the user pressure-test a research idea before they sink weeks into it. The workflow is grounded in the New Researcher Handbook section on Research Idea Generation: ideas come from connections, should be evaluated with the FIVE+C framework, and should be validated through fast feedback rather than private overthinking.
The goal is not to declare an idea "good" or "bad." The goal is to produce a concrete decision: prototype, revise, park, or kill.
When to Use
- User has a new project idea, paper idea, or thesis direction
- User is comparing multiple possible directions
- User wants to prepare an advisor pitch
- User has read papers and sees a potential gap
- User is worried an idea is too small, too broad, too late, or hard to evaluate
Workflow
Stage 1: Capture the Idea in One Sentence
Ask the user to state the idea in one sentence:
I want to [method/action] for [problem/domain] because [gap/failure mode], evaluated by [metric/task].
If the user cannot fill this in, help them rewrite it. Do not move to scoring until the one-sentence version is clear.
Stage 2: Identify the Source of the Idea
Ask where the idea came from:
- Literature gap: which papers and which assumptions/future-work lines?
- Senior student or advisor signal: who mentioned the problem and why?
- Cross-pollination: which method is being moved to which domain?
- Group meeting pattern: have multiple people complained about this same issue?
- Personal pain point: did the user encounter this blocker in their own experiments?
Source matters because it changes the confidence level. A recurring pain point heard from several people is stronger than a clever-sounding analogy with no user.
Stage 3: FIVE+C Evaluation
Score each criterion as Strong, Unclear, or Weak. Ask only for the missing information needed to score honestly.
| Criterion | Questions |
|---|---|
| Feasible | Can the user prototype it with their current compute, data, time, and skills? |
| Interesting | Would the target community care if it worked? Who specifically? |
| Novel | How is it different from the closest papers from the last 2 years? |
| Valuable | What field-level capability or understanding would improve? |
| Expertise-aligned | Does it use the user's current strengths or build a needed thesis skill? |
| Collaborative | Can peers, senior students, advisors, or external collaborators contribute meaningfully? |
Stage 4: Red-Flag Check
Call out any red flags directly:
- Too incremental: the contribution sounds like a small parameter, architecture, or dataset swap.
- Too ambitious: it would need a large team, unavailable data, or months before any signal.
- Already solved: the user has not checked recent top venues or arXiv.
- No evaluation metric: success cannot be measured cleanly.
- Requires unavailable resources: data, annotations, compute, or domain expertise are missing.
- No obvious audience: it is unclear who would cite or use the result.
For each red flag, propose one narrowing or reframing move.
Stage 5: Two-Week Validation Sprint
If the idea survives, design a two-week sprint:
- Minimal baseline to reproduce or implement
- One decisive experiment or toy setup
- One expected failure mode to check
- Three people to ask for feedback
- Recent-paper search target
- Stop condition: what evidence would make the user park the idea?
Keep the sprint small. If it cannot produce any signal in two weeks, shrink the idea.
Stage 6: Produce the Artifact
Save to ~/phd-log/ideas/YYYY-MM-DD-[short-topic].md.
# Research Idea Validation — [Short Topic]
## One-sentence idea
[Clear sentence]
## Source of the idea
- Origin:
- Evidence that this is a real problem:
- Closest papers / systems:
## FIVE+C score
| Criterion | Rating | Notes / missing evidence |
|---|---|---|
| Feasible | | |
| Interesting | | |
| Novel | | |
| Valuable | | |
| Expertise-aligned | | |
| Collaborative | | |
## Red flags
- [flag] -> [reframe or mitigation]
## Two-week validation sprint
- Baseline:
- Decisive test:
- Feedback targets:
- Literature check:
- Stop condition:
## Decision
[Prototype / revise / park / kill]
## Next action
- [ ] [small concrete next step]
Tone
Be rigorous but not dismissive. Most early ideas are under-specified, not worthless. Help the user make them testable.
What Not to Do
- Do not encourage a full implementation before the nearest related work is checked.
- Do not accept "interesting" without naming the audience.
- Do not let the user skip evaluation design.
- Do not overfit to novelty. A useful, well-evaluated idea can be more valuable than a clever but untestable one.
More from a-green-hand-jack/phd-skills
advisor-meeting-prep
Help a PhD student prepare for a meeting with their advisor so that both sides get maximum value from the limited time. Use this skill whenever the user has an upcoming advisor meeting, lab meeting presentation, or committee meeting, and needs help structuring what to bring. Trigger on phrases like "meeting with my advisor", "advisor meeting tomorrow", "what do I show my PI", "prepare for lab meeting", "committee meeting prep", "I meet my advisor in", or whenever the user expresses anxiety about an upcoming research check-in. Also trigger when the user is unsure how to communicate a research problem or a setback to their advisor.
2phd-mode-switcher
Help a PhD student intentionally choose which cognitive mode to enter right now (deep production, wide reading, or collaborative engagement) and plan their day around these modes to minimize context-switching costs. Use this skill whenever the user is at the start of a day or work block and unsure what to focus on, feels scattered across too many activities, asks "what should I do right now", wants to plan their day, or feels frustrated by constant context-switching. Trigger on phrases like "plan my day", "what should I work on now", "I feel scattered", "context switching", "deep work", "can't focus", "I have X hours", or whenever the user is trying to decide between substantively different kinds of work (writing vs reading vs meetings).
2phd-quarterly-planner
Help a PhD student set and revise a realistic 3-month research plan that connects to their longer-term goals. Use this skill whenever the user wants to plan a new quarter, review how the last quarter went, set research goals for the next few months, think about what papers to aim for, or reorient after things drift off course. Trigger on phrases like "quarterly plan", "next 3 months", "plan my quarter", "what should I work on this quarter", "review last quarter", "my research goals", or whenever the user talks about mid-range planning (longer than a week, shorter than a year). Also trigger if the user is feeling directionless about what to focus on next.
2research-mental-check
Offer a structured but non-clinical space for a PhD student or researcher to check in on their mental and emotional state, especially around imposter syndrome, guilt about rest, chronic over-promising, and burnout signals. Use this skill when the user expresses feelings of inadequacy, constant comparison to peers, fear of disappointing their advisor, guilt about taking time off, or exhaustion that isn't just physical. Trigger on phrases like "I feel behind", "everyone is smarter than me", "I can't rest", "I'm burned out", "imposter syndrome", "I'm not good enough", "I'm afraid of disappointing", "I should be working", or whenever the tone of the user's message suggests emotional strain rather than a technical question. Also trigger gently if these signals appear incidentally in a task-focused conversation.
2figure-results-review
Review experimental results, plots, tables, and figures before they are shown in a meeting, paper, report, or presentation. Use this skill whenever the user wants to present results, check a figure, interpret experiment plots, prepare result slides, validate captions, audit axes/legends/error bars, or make sure results are connected to hypotheses and experimental setup.
2phd-weekly-review
Guide a PhD student through a structured weekly review of their research progress. Use this skill whenever the user wants to do a weekly check-in, prepare a progress update for their advisor, reflect on the past week's research, or plan the upcoming week. Trigger on phrases like "weekly review", "this week's progress", "advisor update", "reflect on my week", "plan next week", "how did my week go", or whenever the user mentions wanting to take stock of their recent research work. Also trigger when the user seems to be venting about the week without structure — help them channel it into a productive review.
2