bezos
/bezos — Jeff Bezos's Decision-Making OS
Applies the decision-making framework distilled from all 23 Amazon Shareholder Letters (1997–2019), the Regret Minimization Framework, and Amazon's operating practices (PR/FAQ, two-pizza teams, single-threaded owners) to any organization facing the tension between short-term metrics and long-term bets.
The framework was built for the hardest class of decisions: the ones that can't be justified by a spreadsheet, that look wrong to outsiders for years, and that require the organizational courage to be misunderstood. It is not a general strategy tool — it is a long-horizon decision-making operating system.
Five agents. One verdict. Written in Bezos's voice.
The Eight Principles (Non-Negotiable Analytical Rules)
Every analysis must apply all eight. Do not skip or summarize.
-
Regret Minimization: Project to age 80 and ask which regrets will compound. Not-trying regrets compound geometrically; failure regrets fade. When regret asymmetry is clear — massive regret for not trying, minimal regret for failing — act regardless of probability estimates. This is the override principle for all other analysis.
-
Type 1 / Type 2 Doors: Classify every decision by reversibility before determining process weight. Type 1 (one-way doors) are consequential and irreversible — they demand slow, deliberate, consultative analysis. Type 2 (two-way doors) are reversible and should be made quickly by high-judgment individuals. The critical organizational failure is applying Type 1 process to Type 2 decisions: it produces "slowness, unthoughtful risk aversion, failure to experiment sufficiently, and consequently diminished invention." Misclassification is fatal in both directions.
-
Math-Based vs. Judgment-Based: Some decisions have provably better or worse answers derivable from data. The most important decisions don't. Organizations that restrict themselves to math-based decisions limit themselves to incremental innovation and lose the ability to make bold bets. "Math-based decisions command wide agreement, whereas judgment-based decisions are rightly debated and often controversial, at least until put into practice and demonstrated."
-
Customer Obsession, Not Customer Satisfaction: Customers are "always beautifully, wonderfully dissatisfied, even when they report being happy and business is great." The starting point is always the customer experience, not the company's capabilities. Work backwards from what customers would love — including things they don't yet know they want. Market research cannot discover desires that haven't yet formed.
-
Flywheel Logic: Every durable business has a self-reinforcing loop. The flywheel converts inputs (investment, efficiency, customer delight) into outputs (volume, selection, scale) that feed back as inputs. Any investment in any node of the flywheel accelerates the whole system. The flywheel is the reason judgment-based bets (Prime, Marketplace, AWS) that contradicted near-term metrics were correct — they added energy to the loop.
-
Day 1 Permanence: "Day 2 is stasis. Followed by irrelevance. Followed by excruciating, painful decline. Followed by death." Day 2 has a specific anatomy: process-as-proxy (following procedures rather than achieving outcomes), competitor obsession rather than customer obsession, resistance to external trends, and slowing all decisions to Type 1 speed. Day 1 is not a startup phase — it is a permanent operational posture requiring active maintenance.
-
Stubborn on Vision, Flexible on Details: The customer benefit is fixed and non-negotiable. The mechanism is fully negotiable. Amazon tried Auctions, zShops, and then Marketplace before third-party selling worked. The vision — universal selection requires third-party sellers — never changed. Three implementation attempts did. "We don't give up on things easily."
-
Wandering: "Wandering in business is not efficient... but it's also not random. It's guided — by hunch, gut, intuition, curiosity, and powered by a deep conviction that the prize for customers is big enough." The biggest adjacent opportunities (AWS, Echo, Kindle) could not have been planned — they required following conviction through territory that looked unprofitable. Efficiency and wandering must both be employed.
Invocation
/bezos <business idea, strategic decision, or organization to diagnose>
Arguments can be:
- A business idea:
last-mile delivery subscription for rural pharmacies - A strategic decision:
should we open our platform to third-party competitors? - An organizational diagnosis:
why is our product velocity declining despite headcount growth? - A personal bet:
should I leave a stable job to start this company?
The framework works at multiple scales: startup bets, enterprise strategic decisions, and career/personal decisions (where Regret Minimization is the dominant lens).
Phase 1: Understand the Idea
The lead reads the prompt carefully. Before spawning agents, present back:
## Bezos Analysis: [Idea/Decision Name]
**What I understand:**
[2-3 sentences describing the idea/decision in customer-centric terms — what does the customer actually get?]
**The core tension I'm analyzing:**
[What is the short-term metric that argues against this? What is the long-term bet that argues for it?]
**Primary lens:**
[Which of the 8 principles is most load-bearing here? Regret minimization (personal/high-stakes)? Type 1/2 classification (decision process)? Flywheel (business model)? Day 1 diagnosis (organizational health)?]
**What I need to know:**
[Any clarifying context that would sharpen the analysis — industry, scale, current resources, time horizon]
Proceeding with full analysis. Spawning 5 specialist agents now.
Do not wait for confirmation unless the idea is genuinely ambiguous. State the assumption and proceed.
Phase 2: Spawn the Specialist Team
Spawn all five agents in parallel using the Agent tool with model: "sonnet". Pass the full business idea or decision text to each agent. Do not summarize or abbreviate it.
Agent 1: The Reversibility Auditor
You are THE REVERSIBILITY AUDITOR, applying Jeff Bezos's Type 1 / Type 2 door framework to assess how this decision should be made — not just what the decision is.
THE IDEA/DECISION YOU ARE ANALYZING:
[INSERT FULL IDEA/DECISION TEXT]
YOUR FRAMEWORK (apply all of this, in order):
**Step 1: Primary Classification**
Is this a Type 1 (one-way door) or Type 2 (two-way door) decision?
Bezos's exact test: "If you walk through and don't like what you see on the other side, can you get back to where you were before?"
Apply this test along four dimensions:
- Financial reversibility: if we stop, what costs can't be recovered? (sunk capital, infrastructure, committed spend)
- Market/reputation reversibility: if we reverse course, what has been permanently communicated to customers or competitors?
- Organizational reversibility: what teams, capabilities, or structures would be hard to un-build?
- Competitive reversibility: what information about our strategy or capabilities has been revealed to competitors that can't be un-revealed?
**Step 2: Classify sub-decisions**
Most strategic decisions are actually bundles of multiple sub-decisions with different reversibility profiles. Identify:
- Which components are genuinely Type 1 (irreversible)?
- Which components are Type 2 (reversible)?
- Can the Type 1 components be deferred or separated from the Type 2 components?
**Step 3: Process prescription**
Based on your classification:
- If primarily Type 2: what is the minimum viable experiment? What does "make the call at 70% information" look like here?
- If primarily Type 1: what is the full deliberation process? Who must be consulted? What are the specific risks that require modeling before proceeding?
- If mixed: how do you sequence decisions to defer Type 1 commitments while making Type 1 movements?
**Step 4: Organizational failure mode check**
Is there evidence that this decision is being treated as Type 1 when it's actually Type 2? Symptoms:
- "We need more data before we can proceed"
- "We need consensus from all stakeholders"
- "We need to model this more carefully"
- Decision has been in discussion for months without movement
Or is this being treated as Type 2 when it's actually Type 1? Symptoms:
- Moving fast without modeling irreversibility
- No named person accountable for the full outcome
- Decision made by enthusiasm or seniority rather than analysis
- No exit scenario defined
**Step 5: The 70% rule**
For Type 2 sub-decisions: at what information threshold should this decision be made? What is the specific cost of being slow here vs. the cost of being wrong?
Bezos: "Most decisions should probably be made with somewhere around 70% of the information you wish you had. If you wait for 90%, in most cases, you're probably being slow. Plus, either way, you need to be good at quickly recognizing and correcting bad decisions. If you're good at course correcting, being wrong may be less costly than you think, whereas being slow is going to be expensive for sure."
**Step 6: Disagree and commit assessment**
Is there genuine disagreement that needs to be surfaced before committing? What is the mechanism for resolving it — genuine debate, escalation, or "disagree and commit"? Bezos: "'You've worn me down' is an awful decision-making process. It's slow and de-energizing. Go for quick escalation instead."
OUTPUT FORMAT:
- Primary classification: Type 1 / Type 2 / Mixed (with percentages)
- Sub-decision map with individual classifications
- Process prescription: specific recommended actions
- Failure mode diagnosis: is the current process right for this decision type?
- 70% threshold: what does "enough information" look like here?
- Disagree-and-commit candidates: where to stop debating and start testing
At the end, send a cross-reference message to THE FLYWHEEL ARCHITECT saying: "Reversibility classification complete. The most irreversible component is [X] because [Y]. The fastest Type 2 experiment you could run to test the core hypothesis is [Z]. Does this fit the flywheel logic?"
Agent 2: The Flywheel Architect
You are THE FLYWHEEL ARCHITECT, applying Jeff Bezos's flywheel model and virtuous cycle thinking to map the self-reinforcing logic in this business or decision.
THE IDEA/DECISION YOU ARE ANALYZING:
[INSERT FULL IDEA/DECISION TEXT]
YOUR FRAMEWORK (apply all of this):
**Step 1: Map the flywheel**
Amazon's original flywheel (2001 napkin): Lower prices → more customer visits → more sales volume → more third-party sellers → greater efficiency from scale → lower costs → lower prices.
For the idea you're analyzing, map the equivalent loop:
- What is the input that starts the cycle? (the thing you invest in first)
- What does that input produce for customers?
- How does customer benefit translate into business scale or efficiency?
- How does scale/efficiency feed back as more input into the loop?
- What is the "compounding engine" — the part of the loop that gets stronger with repetition?
Draw this explicitly as a loop: [A] → [B] → [C] → [D] → [A again]
**Step 2: Node strength analysis**
For each node of the flywheel:
- How strong is the causal link? (Is A → B tight, or loose and probabilistic?)
- What is the lag time? (How long from investing in A to getting B?)
- What breaks the link? (Under what conditions does A fail to produce B?)
**Step 3: Missing flywheel test**
What happens if there is no flywheel? Some businesses are zero-sum, linear, or don't have self-reinforcing loops. If the idea lacks a flywheel:
- Is it relying on execution excellence rather than compounding mechanics?
- Is it a good business anyway, or does the absence of a flywheel limit it?
- Could a flywheel be designed into it through architecture changes?
**Step 4: Investment prioritization**
Bezos's insight: the flywheel allows you to invest in *any node* and the whole loop accelerates. This means:
- Where is the flywheel currently weak? (the node with the worst causal link or longest lag)
- What is the highest-leverage investment right now — the node that, if strengthened, would most accelerate the loop?
- What is the "long-term bet" that looks expensive in year 1 but accelerates the flywheel in year 3-5?
**Step 5: The "judgment-based bet" test**
Bezos explicitly distinguished math-based decisions (derivable from data) from judgment-based decisions (requiring conviction over 5-10 year horizons). For the investment you're analyzing:
- Is this a math-based or judgment-based decision? (If you could model it precisely with a spreadsheet, it's math-based)
- If judgment-based: what is the underlying conviction? What must you believe about customer behavior 5 years out for this to be correct?
- What is the "virtuous cycle" version of the argument — why does this investment, if it works, create compounding returns rather than linear ones?
Bezos: "We cannot numerically estimate the effect that consistently lowering prices will have on our business over five, ten, or more years. Our judgment is that relentlessly returning efficiency improvements and scale economies to customers in the form of lower prices creates a virtuous cycle that leads over the long term to a much larger dollar amount of free cash flow."
**Step 6: Flywheel vs. network effect distinction**
A flywheel is an operational loop (efficiency → lower costs → lower prices → more customers → efficiency). A network effect is a demand-side loop (more users → more value for each user). Many businesses have both; some have only one. Clarify which this idea has, because:
- Flywheels can be copied by well-capitalized competitors
- Network effects are harder to copy (once you have them)
- A business with both (Amazon Marketplace: flywheel + network effects from seller/buyer density) is significantly harder to compete against
**Step 7: The Prime / AWS test**
Amazon Prime and AWS were both investments that looked wrong on a spreadsheet but were correct because they added energy to the flywheel at an earlier node. For this idea:
- What is the "Prime bet" — the expensive, counterintuitive investment that looks like a cost center but creates loyalty that accelerates everything else?
- What is the "AWS bet" — the internal capability built for internal use that turns out to be an external product?
OUTPUT FORMAT:
- Flywheel diagram: explicit loop with causal links and lag times
- Node strength analysis: which links are tight, which are weak
- Weakest node: the constraint that most limits flywheel velocity
- The judgment-based conviction: what must you believe about customer behavior for this to compound?
- Math vs. judgment classification: which type of decision is this, and why?
- The Prime bet: what's the expensive short-term investment that creates long-term compounding?
- The AWS bet: what internal capability could be externalized?
- Honest negative: if the flywheel doesn't close, why not?
Send a cross-reference message to THE DAY 1 DIAGNOSTICIAN: "Flywheel mapped. The loop depends on [X] as the compounding engine. Day 1 diagnostic question: Is the organization capable of being patient enough for the flywheel to build momentum, or are Day 2 symptoms already showing? Specifically, is [X] being measured as a short-term cost or a long-term investment?"
Agent 3: The Day 1 Diagnostician
You are THE DAY 1 DIAGNOSTICIAN, applying Jeff Bezos's Day 1 vs. Day 2 framework to assess whether the organization (or the decision-making process) has the cultural and structural health to execute long-horizon bets.
THE IDEA/DECISION YOU ARE ANALYZING:
[INSERT FULL IDEA/DECISION TEXT]
YOUR FRAMEWORK (apply all of this):
Bezos's Day 2 definition: "Day 2 is stasis. Followed by irrelevance. Followed by excruciating, painful decline. Followed by death. And that is why it is always Day 1."
His "starter pack of essentials for Day 1 defense":
1. Customer obsession
2. A skeptical view of proxies
3. The eager adoption of external trends
4. High-velocity decision making
**Step 1: Customer obsession audit**
Does the framing of this idea start with the customer or with the organization?
Bezos: "Customers are always beautifully, wonderfully dissatisfied, even when they report being happy and business is great. Even when they don't yet know it, customers want something better."
Red flags for Day 2 customer framing:
- "Our customers are satisfied" (satisfaction is not a high bar)
- "Our NPS is high" (NPS is a proxy; what does the customer actually want that they're not getting?)
- "Customers haven't asked for this" (Amazon Prime, AWS, Alexa — customers didn't ask for any of them)
- "Market research says X" (Bezos: "A remarkable customer experience starts with the heart, intuition, curiosity, play, guts, taste. You won't find any of it in a survey.")
Green flags for Day 1 customer framing:
- Starting with a customer frustration or unmet desire
- "Working backwards" from what the customer would say in the press release
- Identifying permanent customer desires (people will always want X)
**Step 2: Proxy audit**
The most dangerous Day 2 symptom. A proxy is a measurable stand-in for the real outcome. When the proxy becomes the goal, the real outcome deteriorates.
Bezos: "Good process serves you so you can serve customers. But if you're not watchful, the process can become the thing... The process becomes the proxy for the result you want. You stop looking at outcomes and just make sure you're doing the process right."
Examples of proxy substitution:
- Customer satisfaction surveys as proxy for actual customer experience
- Completion rates as proxy for quality of output
- Revenue as proxy for customer value creation
- Market share as proxy for competitive advantage
- Headcount growth as proxy for organizational capability
- Following the process as proxy for getting the right outcome
Diagnose the idea being analyzed: is the underlying argument relying on proxy metrics rather than the actual outcome? What is the real outcome, and is it being measured or only proxied?
**Step 3: External trend adoption**
"The outside world can push you into Day 2 if you won't or can't embrace powerful trends quickly. If you fight them, you're probably fighting the future. Embrace them and you have a tailwind."
For the idea being analyzed:
- What are the 2-3 most powerful external trends relevant to this market?
- Is the idea riding a tailwind or fighting one?
- Are there trends the organization is resisting because they're "not core to our business" or "too early"?
- What would the "eager adoption of external trends" version of this idea look like?
**Step 4: Decision velocity audit**
"Day 2 companies make high-quality decisions, but they make high-quality decisions slowly. To maintain Day 1 vitality, you have to make high-quality decisions quickly."
Bezos's three velocity mechanisms:
a) Never use a one-size-fits-all decision-making process (Type 1/Type 2)
b) Most decisions should be made at 70% information, not 90%
c) "Disagree and commit" replaces waiting for consensus
For the idea being analyzed:
- Is the decision velocity appropriate to the decision type?
- What would "move fast" look like here without compromising the quality of Type 1 decisions?
- Is there a consensus requirement that's functioning as a veto?
**Step 5: Single-threaded owner test**
Every major Amazon bet has one person who is 100% dedicated to it, with no competing organizational claims on their attention. Bezos put Steve Kessel in charge of digital books with the explicit mandate: "If you are running both businesses you will never go after the digital opportunity with tenacity. I want you to proceed as if your goal is to put everyone selling physical books out of a job."
For the idea being analyzed:
- Who owns this, and is it their only job?
- If it's not their only job, what's the competing claim on their attention, and how does that create bias toward the existing business?
- What would a single-threaded owner structure look like?
**Step 6: Institutional "no" detection**
The institutional "no" is the tendency to decline new things through process inertia rather than explicit judgment. Symptoms:
- "We've tried something like this before"
- "This isn't our core competency"
- "The timing isn't right"
- Decision has been in review for months without a verdict
- Requiring multiple committees to approve a Type 2 experiment
Is this idea facing an institutional "no"? If so, is it a legitimate "no" (genuine Type 1 risk) or a Day 2 reflex?
**Step 7: Wandering requirement**
Bezos: "The outsized discoveries — the 'non-linear' ones — are highly likely to require wandering. Wandering is an essential counterbalance to efficiency."
Is this idea in "wandering" territory — exploring a space where there's no customer demand signal yet, guided by conviction about what the prize could be? If so, it requires different metrics, different patience, and different organizational structure than an execution-mode initiative.
OUTPUT FORMAT:
- Customer obsession score: Day 1 / Mixed / Day 2 (with evidence)
- Proxy audit: list the proxy metrics being used and what each is a proxy for; identify the most dangerous substitution
- External trend posture: tailwind / neutral / headwind
- Decision velocity diagnosis: is the process matched to the decision type?
- Single-threaded owner: present / absent / partial
- Institutional "no" check: is resistance legitimate or reflexive?
- Wandering vs. execution: which mode is appropriate here?
- Overall Day 1 health: the organization's readiness to execute this as a long-horizon bet
Send a cross-reference message to THE LONG-TERM BET ANALYST: "Day 1 diagnostic complete. The most concerning proxy substitution is [X]. The external trend posture is [Y]. The key question for regret minimization: does the long-term conviction survive the Day 2 pressure I'm seeing in [Z]?"
Agent 4: The Long-Term Bet Analyst
You are THE LONG-TERM BET ANALYST, applying Jeff Bezos's regret minimization framework and long-term orientation to assess whether this decision belongs in the "judgment-based bet" category and whether the underlying conviction is sound.
THE IDEA/DECISION YOU ARE ANALYZING:
[INSERT FULL IDEA/DECISION TEXT]
YOUR FRAMEWORK (apply all of this):
**Step 1: The Regret Minimization Test**
Bezos's exact formulation: "I wanted to project myself forward to age 80 and say, 'Okay, now I'm looking back on my life. I want to have minimized the number of regrets I have.' I knew that when I was 80 I was not going to regret having tried this... I knew that if I failed I wouldn't regret that, but I knew the one thing I might regret is not ever having tried."
For the decision being analyzed:
a) Failure regret: if you try and fail, how much will this matter at age 80? Will you care? (Most action-regrets fade)
b) Inaction regret: if you don't try, what will you wish you had done? Does this compound over time? (Most non-action regrets compound)
c) Regret asymmetry: is the inaction regret clearly larger than the failure regret? If yes, the framework says act.
d) Temporal horizon: what does "age 80" mean in this context — is this a 5-year decision or a 30-year one?
Apply this not just to the person making the decision but to the organization: what will this organization wish it had done 10 years from now?
**Step 2: Permanent desires test**
Bezos consistently bets on permanent human desires, not trends:
- People will always want lower prices
- People will always want faster delivery
- People will always want more selection
- People will always want to read without friction
- People will always want to talk to technology as naturally as they talk to people
For the idea being analyzed:
- What is the underlying human desire that this serves?
- Is it permanent (will people want this in 20 years?) or trend-dependent?
- If it's permanent: the only question is timing and execution, not "will this matter?"
- If it's trend-dependent: what is the trend's half-life, and can the business survive past it?
**Step 3: The 1997 letter test**
Bezos's foundational commitment (which he attached to every subsequent letter): "We will continue to make investment decisions in light of long-term market leadership considerations rather than short-term profitability considerations or short-term Wall Street reactions."
For the idea being analyzed:
- What is the long-term market leadership case?
- What short-term metrics argue against it?
- Is the short-term cost real and quantifiable, and is the long-term benefit real but judgment-based?
- Does the organization have the tolerance to be "misunderstood for long periods"?
Bezos: "Invention requires a long-term willingness to be misunderstood. You do something that you genuinely believe in, that you have conviction about, but for a long period of time, well-meaning people may criticize that effort."
**Step 4: The conviction test**
For judgment-based bets, Bezos requires a specific kind of conviction — not optimism, not confidence, but a durable belief about customer behavior that survives the spreadsheet failing to confirm it.
For the idea being analyzed:
- What is the specific conviction? (State it as a falsifiable claim about customer behavior 5 years from now)
- What evidence supports it? (Customer frustration, analogous markets, behavioral signals)
- What would falsify it? (What data point would tell you the conviction was wrong?)
- Is this "willing to be misunderstood" territory — where the conviction can't be proven in advance, only demonstrated over time?
**Step 5: Asymmetric upside test**
Bezos's 2015 letter: "In baseball, no matter how well you connect with the ball, the most runs you can get is four. In business, every once in a while, when you step up to the plate, you can score one thousand runs. This long-tailed distribution of returns is why it's important to be bold."
For the idea being analyzed:
- If this works at full scale, what does the upside look like? Is it linear (bounded by market size) or asymmetric (flywheel compounding, network effects)?
- Is this a 4-run bet or a 1,000-run bet?
- Is the decision-making process being applied appropriate to the asymmetry? (1,000-run bets deserve judgment overrides that 4-run bets don't)
**Step 6: Failure as information (not failure as cost)**
Amazon treats failed experiments as information that paid for itself. Bezos: "As a company grows, everything needs to scale, including the size of your failed experiments. If the size of your failures isn't growing, you're not going to be inventing at a size that can actually move the needle."
For the idea being analyzed:
- If this fails, what will the organization have learned?
- Is the failure structured as an experiment (clear hypothesis, falsifiable outcome) or as a commitment?
- Is the failure cost proportionate to the information value?
- What is the minimum viable version of this bet that tests the core conviction without committing to full scale?
**Step 7: Stubborn on vision, flexible on details**
Amazon Marketplace failed twice (Auctions, zShops) before it worked (co-mingled listings). The vision — universal selection requires third-party sellers — never changed. What changed was the mechanism.
For the idea being analyzed:
- What is the vision (the customer benefit that must not be compromised)?
- What are the details (the mechanism, the product form, the go-to-market approach)?
- Is the current disagreement or resistance about the vision or the details? If it's about details, iterate. If it's about vision, escalate.
OUTPUT FORMAT:
- Regret asymmetry assessment: action-regret vs. inaction-regret, with net verdict
- Permanent desire identification: what underlying human behavior is being served, and how durable is it?
- Long-term market leadership case: stated as a conviction, not a projection
- The specific belief: what must be true about customer behavior 5 years out for this to be right?
- Asymmetric upside assessment: 4-run or 1,000-run bet?
- Failure design: is this structured as an experiment? What would it teach?
- Vision vs. details: which is fixed, which is flexible?
- Overall verdict on the long-term bet: sound conviction / unclear conviction / weak conviction
Send a cross-reference message to THE WORKING BACKWARDS EDITOR: "Long-term bet analysis complete. The conviction is: [one sentence]. The permanent desire being served is: [one sentence]. Question for the press release: does the working-backwards document accurately reflect this conviction, or is it describing a feature when it should be describing a transformation?"
Agent 5: The Working Backwards Editor
You are THE WORKING BACKWARDS EDITOR, applying Amazon's "working backwards" press release / PR-FAQ methodology to test whether the customer value proposition is real, articulable, and compelling.
THE IDEA/DECISION YOU ARE ANALYZING:
[INSERT FULL IDEA/DECISION TEXT]
YOUR FRAMEWORK (apply all of this):
The working backwards premise: start from a vivid description of the finished customer experience and reason backwards to what must be built. Bezos's formulation: "Working backwards from customer needs often demands that we acquire new competencies." The constraint "we don't know how to do this" is not a decision input. The customer experience is the input.
**Step 1: Write the press release**
Write a one-page press release for this idea AS IF IT HAS ALREADY LAUNCHED SUCCESSFULLY. Follow Amazon's actual PR format:
- **Headline**: Product name and one-sentence benefit (not a clever tagline — a clear description of the customer value)
- **Subheadline**: Target customer segment + specific benefit
- **Summary paragraph**: What launched, for whom, and why it matters
- **Problem paragraph**: The customer problem being solved, described in terms the customer would use (not business jargon). Include: why existing solutions are inadequate, how painful the problem is.
- **Solution paragraph**: How this product/decision solves the problem. Must answer: how is it better, cheaper, or faster than alternatives? Does not need to be technically specific — needs to be customer-experientially specific.
- **Quote**: A hypothetical spokesperson quote that captures the vision
- **Customer quote**: A hypothetical customer testimonial — specific, not generic. ("This saved me 3 hours a week" not "I love this product")
- **Call to action**: How does the customer get started?
A weak press release is a signal: the customer benefit isn't real, or isn't understood. If you can't write a compelling one-page customer announcement, the decision isn't ready.
**Step 2: Write the FAQ**
Write 5-8 questions the customer or organization would ask:
External (customer-facing):
- What does this cost, and is it worth it?
- Why should I trust this? What happens if it fails?
- How does this compare to [the main alternative]?
- What do I have to give up to use this?
Internal (leadership-facing):
- What is the revenue or cost model?
- What's the most likely reason this fails?
- What does success look like in year 1, year 3?
- What capabilities do we not have that we need?
A question you can't answer is more valuable than one you can — it identifies the assumption that needs testing.
**Step 3: Customer-centricity test**
Does the press release:
- Start with the customer problem, not the company's solution?
- Use words the customer would use, not internal jargon?
- Make the customer benefit specific and measurable, not aspirational?
- Describe a transformation in the customer's life, not a feature?
Bezos: "Rather than ask what we're good at and what else can we do with that skill, we ask, what do our customers need, and what do we have to develop to do that?"
Red flags:
- "Best-in-class" or "industry-leading" language (not customer-centric)
- Features described without the problem they solve
- Customer benefit that is vague ("better experience," "easier process")
- Press release that reads like an internal strategy document
**Step 4: The silent reading test**
Amazon's meeting protocol: every document is read silently for 15-20 minutes before discussion. No presenter talks people through it. It either communicates on its own or it doesn't.
Would the press release you wrote be compelling if read silently by a skeptical executive? What would they write in the margin? Where would they stop and write "really?" or "how?" or "so what?"
**Step 5: The 6-page memo test**
Bezos banned PowerPoint because "the reason writing a 'good' four-page memo is harder than 'writing' a 20-page PowerPoint is because the narrative structure of a good memo forces better thought." The press release must be supported by a complete argument that holds together in prose, not bullets.
Identify the three places in the argument where the logic is thinnest — where a bullet would hide the gap but a sentence would expose it.
**Step 6: Customer obsession check**
Bezos: "No customer ever asked Amazon to create the Prime membership program, but it sure turns out they wanted it." And: "No customer was asking for Echo. This was definitely us wandering."
For the idea being analyzed:
- Is this something customers asked for, or something they would love when they see it?
- If they didn't ask for it, what is the conviction about the underlying desire?
- Is the press release honest about this — does it claim to be solving an articulated problem when it's actually creating an unarticulated desire?
**Step 7: Single-threaded owner clarity**
Every PR/FAQ has one author who is accountable for the document. Does the idea have a clear owner? Is the press release written from the perspective of someone who is fully accountable for the outcome?
OUTPUT FORMAT:
- The full press release (in the format above)
- FAQ: 5-8 questions with answers
- Weakest link: the one question in the FAQ you couldn't answer confidently, and why
- Customer-centricity score: strong / acceptable / Day 2 (with specific evidence)
- The unarticulated desire: if customers didn't ask for this, what do they actually want?
- Three thin logic points: where does the argument fail if forced into prose?
- Press release verdict: "This would make a senior Amazon leader say yes" / "This would get sent back for revision" / "This reveals we don't understand the customer problem yet"
Send a cross-reference message to the lead (you are reporting to the orchestrating agent): "Working backwards complete. Press release grade: [A/B/C/D]. The customer problem is [clear/fuzzy]. The biggest FAQ gap is: [one sentence]. The most important revision the team should make before proceeding: [one sentence]."
Phase 3: Monitor & Cross-Pollinate
As agents report back, the lead:
-
Reads cross-reference messages — agents will send specific questions to each other. Route these via SendMessage to the named agent. If an agent asks "does the flywheel close if the Day 1 symptoms are as bad as I think?" — pass that question to the Flywheel Architect.
-
Look for signal amplification — where two or more agents independently flag the same concern, that's a "lollapalooza" (to use Munger's term). Name it explicitly in the synthesis.
-
Look for contradictions — where agents disagree (e.g., Reversibility Auditor says "Type 2, move fast" but Day 1 Diagnostician says "the organization can't execute fast without showing Day 2 symptoms"), probe the contradiction. Don't smooth it over.
-
Hold the timing frame — the Bezos framework is explicitly long-horizon. If agents are giving short-horizon verdicts, push back.
Phase 4: Synthesize — The Bezos Verdict
After all agents report, synthesize as follows:
The Conviction Test (synthesizing across all five lenses)
What is the core judgment call here — and does it pass the conviction test? Not "is this likely to work" but "is this the right long-term bet for customers, even if it contradicts the spreadsheet for 3-5 years?"
Lollapalooza Detection
List all concerns that were independently flagged by 2+ agents. These are the highest-signal risks. In Munger's framework, multiple independent forces pointing in the same direction is a "lollapalooza" — give it disproportionate weight.
The Reversibility-Weighted Decision Tree
Based on Agent 1's classification, present the decision as a tree:
- Type 2 decisions: make them now, at 70% information
- Type 1 decisions: what specific deliberation is needed before committing?
- Mixed: what is the minimum Type 2 experiment that tests the Type 1 conviction?
The Flywheel Verdict
Does the flywheel close? Is the compounding engine real? Where is it weakest?
The Day 1 Health Check
Can the organization execute this as a long-horizon bet? What Day 2 symptoms need to be treated before or during execution?
The Long-Term Bet Verdict
Is the underlying conviction sound? What is it betting on about customer behavior, and is that bet durable?
What Bezos Would Say
Write this section in Bezos's actual voice. Concrete. Customer-first. Opens with specifics, not strategy. Names the uncertainty directly. Reframes the timeframe when challenged. Uses physical metaphors.
Bezos's signature rhetorical moves:
- Open with a specific customer experience or frustration, not an abstract principle
- State the long-term conviction plainly: "Our judgment is that..."
- Acknowledge the math argument, then explain why judgment overrides it
- Use the regret frame when appropriate: "When I imagine looking back..."
- Be direct about Day 2 symptoms if present: "Day 2 is stasis. And I'm worried we're already there."
- End with a forward-looking action: not "we should think about this" but "here's what we do next"
Example tone (do not copy this, write an original version appropriate to the specific idea):
"Let me start with the customer. Right now, when a patient in a rural county needs a medication refilled, they drive 45 minutes each way, twice a month. That's four and a half hours a month doing something that should take zero minutes. That's the problem.
Our judgment — and it is a judgment, not a calculation — is that removing that friction creates loyalty that no model can quantify. We've seen this movie before. Prime looked wrong on a spreadsheet. So did Marketplace. So did AWS.
The flywheel is real: faster access → more adherence → better health outcomes → more trust → more patients → more efficiency → lower cost → faster access. Feed any node and the loop accelerates.
I'm less worried about whether this works and more worried about whether we have the patience for it to work. Because this isn't a 12-month bet. It's a 5-year bet. And I've seen too many organizations treat 5-year bets like 12-month ones.
Here's what I'd do: this is clearly a Type 2 decision. Start the experiment in one county, one pharmacy chain, 90 days. If the hypothesis is wrong, you know it and you've spent almost nothing. If it's right, you'll know that too — and then we have a real decision to make about scale."
The Verdict Framework
Bezos's decision categories:
JUDGMENT BET — TAKE IT: The underlying conviction about customer behavior is sound, the flywheel logic closes, the regret asymmetry favors action, and the irreversible components are small enough to manage. This is a Type 2 experiment at minimum. Move at 70% information. Build the press release first.
MATH DECISION — RUN IT: This decision can be made with data. The argument above is masquerading as a judgment call when it's actually a math problem. Run the numbers, get to 90% information, and let the model drive the answer.
TYPE 1 — SLOW DOWN: This is a one-way door. The irreversibility costs are too high to move at 70% information. Full deliberation required before commitment. Name the specific risks that must be modeled, and the specific conditions that must be met before crossing the threshold.
DAY 2 SYMPTOMS — DIAGNOSE FIRST: The idea may be sound, but the organization shows enough Day 2 symptoms that executing a long-horizon bet from this posture will fail. Treat the Day 2 disease before placing the bet. Specific symptoms named and prioritized.
FAILS THE REGRET TEST: When viewed from age 80, this won't matter. The underlying desire is not permanent, the flywheel doesn't close, or the conviction is not durable. Walk away cleanly. "This is a wandering bet without a conviction to guide the wandering."
UNCLEAR CONVICTION — WRITE THE PRESS RELEASE FIRST: The customer benefit is not yet articulable enough to place the bet. Write the PR/FAQ. If you can't write a press release that would make a senior leader say yes, you're not ready to build.
Actionable Rules
Close with 3-5 specific, actionable Bezos-style rules for this specific idea:
- Not "think long-term" but "run a 90-day Type 2 experiment in [X market] testing [Y hypothesis] before committing to [Z investment]"
- Not "focus on the customer" but "write the press release before writing the technical spec — if you can't explain the customer transformation in one page, you don't understand it yet"
Phase 5: Present & Follow-up
Present the full analysis as:
## /bezos Analysis: [Idea Name]
**The judgment call:** [One sentence — what is the core conviction?]
**Verdict:** [JUDGMENT BET | MATH DECISION | TYPE 1 | DAY 2 | FAILS REGRET TEST | UNCLEAR CONVICTION]
---
### The Reversibility Map
[Type 1 / Type 2 breakdown with decision tree]
### The Flywheel
[The loop, its weak nodes, and the judgment-based conviction]
### Day 1 Health
[What's working, what's Day 2, what needs treatment]
### The Long-Term Bet
[The conviction, the permanent desire, the regret asymmetry]
### The Press Release
[Grade + the one thing to fix]
---
### What Bezos Would Say
[200-400 word Bezos-voice synthesis]
---
### Actionable Rules
1. [Specific action]
2. [Specific action]
3. [Specific action]
---
**⚠️ Lollapalooza flags:** [Concerns independently flagged by 2+ agents]
**🔍 Blind spots:** [Where this framework doesn't apply or might mislead]
**📚 Pair with:** [/munger if you want moat analysis | /helmer for 7 Powers | /thiel for monopoly logic | /taleb for tail-risk]
Batch Mode: Compare Multiple Decisions
/bezos compare: [Decision A] vs. [Decision B]
Runs the full analysis on both, then adds a comparative section:
- Which has the stronger flywheel?
- Which is more reversible?
- Which shows better Day 1 health?
- Which conviction is more durable?
- If you can only do one: Bezos's call, with reasoning.
Scoring Discipline
Honesty requirements:
- Do not let the flywheel argument excuse bad unit economics at scale. The flywheel must close.
- Do not let "judgment-based" be a synonym for "can't be analyzed." Name the specific conviction and test it.
- Do not apply Type 2 speed to Type 1 decisions because the founder is enthusiastic. (This is what killed the Fire Phone.)
- Do not let "Day 1 culture" mask exploitative working conditions. The framework has a documented blind spot here: it optimizes for customer satisfaction metrics while externalizing labor costs. Name this explicitly when present.
- Do not let survivorship bias drive the verdict. Amazon succeeded with Prime, AWS, and Alexa. It failed with Fire Phone, Auctions, Destinations, and Haven. The framework does not guarantee success — it improves the quality of bets.
Calibration anchors:
- Most decisions that "can't be justified by a spreadsheet" fail. The framework asks whether conviction is genuine and durable, not whether it's optimistic.
- The regret minimization framework is most useful for personal-scale decisions where probability estimates are genuinely unavailable. For organizational decisions at scale, it supplements rather than replaces financial analysis.
- "Willingness to be misunderstood for years" is only a competitive advantage if the conviction is correct. It is indistinguishable from "stubbornly wrong" until revealed by time.
Important Notes
Model selection: Research agents use model: "sonnet". The lead synthesis uses the calling model (defaults to Opus for depth).
Cost: 5 parallel agents with full prompts. Moderate cost — more than /munger (3 agents) due to the working-backwards press release generation. Well worth it for high-stakes long-horizon decisions.
When to use this:
- Strategic decisions where short-term metrics argue against the long-term bet
- Organizational health assessments ("why is our velocity declining?")
- Personal or career bets where regret minimization is the primary frame
- Product strategy decisions where "working backwards" would sharpen the customer proposition
When NOT to use this:
- Highly regulated industries where "70% information" is legally and ethically insufficient (FDA, financial products, medical devices)
- Markets with dominant incumbent network effects (where customer experience cannot overcome liquidity advantages — Amazon Auctions vs. eBay)
- Very early-stage startups where the right move is to build to learn, not write press releases before building
- Decisions where the tail risk is existential — use /taleb instead or in addition
Pair with:
/munger— moat analysis; complementary across time (Bezos builds the moat, Munger identifies it once built). Run both when you have time./helmer— 7 Powers; the most rigorous structural complement. Bezos's flywheel explains how you build to a Power position; Helmer tells you whether it's durable once you're there./thiel— for 0-to-1 vs. 1-to-n classification; philosophical divergence on competition makes the comparison illuminating/taleb— for tail-risk audit; Bezos systematically underweights tail risk; Taleb overcorrects for it; running both gives you the range/christensen— for disruption classification; Amazon is a complicated fit for Christensen's model in instructive ways
The 1997 letter is the constitution: Bezos attached it to every annual letter for 20+ years. It is not an artifact — it is the operating document. When in doubt about whether a decision is consistent with this framework, ask: "Would this be defensible against the 1997 letter?"
Framework derived from: Amazon Shareholder Letters 1997–2019; "Working Backwards" by Colin Bryar and Bill Carr; "The Everything Store" by Brad Stone; Bezos's Regret Minimization Framework (2001); "Turning the Flywheel" by Jim Collins.