mlb-opponent-profiler
MLB Opponent Profiler
Table of Contents
Example
Scenario: Monday morning of Week 5. Refresh profile for Springfield Isotopes (manager Nikolay) after observing another high-activity week (7 moves, $14 spent on adds, 4-1 matchup win).
Inputs:
team_name: "Springfield Isotopes"
yahoo_session: <authenticated chrome context>
prior_profile: # from context/opponents/springfield-isotopes.md (Week 4)
archetype: balanced
archetype_confidence: 0.50
posterior: {balanced: 0.42, stars_and_scrubs: 0.18, punt_sv: 0.02, punt_sb: 0.04,
punt_wins_qs: 0.03, hitter_heavy: 0.12, pitcher_heavy: 0.08,
inactive: 0.00, frustrated_active: 0.11, unknown: 0.00}
Step 1 -- Yahoo scrape (4 pages): teams index, team page for team_id=8, draftresults, transactions filtered by tid=8. URLs in methodology.md.
Step 2 -- MLB feature extraction yields:
sp_roster_share: 0.35; closer_count: 2; sb_speed_count: 4; power_bat_count: 6
moves_per_week: 3.3; faab_spent_pct: 0.32; faab_avg_bid: 2.5
lineup_set_daily: true; trade_offers_sent: 1; trade_offers_received: 0
record_last_2_weeks: "8-2-0"; days_since_last_login: 0
Step 3 -- Invoke opponent-archetype-classifier:
archetype_taxonomy: # see resources/template.md -- 10 archetypes
balanced: {...}
stars_and_scrubs: {...}
...
observed_features: <from step 2>
archetype_prior: <prior_profile.posterior> # sequential: last week's posterior is this week's prior
observation_weight: 0.65 # Week 5 -- see methodology.md observation_weight_calibration
correlated_feature_pairs:
- [moves_per_week, faab_spent_pct] # active managers do both
- [sp_roster_share, closer_count] # they trade off
Step 4 -- Classifier returns:
posterior:
balanced: 0.58 # up from 0.42 -- evidence accumulating
stars_and_scrubs: 0.12
hitter_heavy: 0.10
frustrated_active: 0.08 # record is winning, so frustrated_active down
pitcher_heavy: 0.07
punt_sb: 0.03
punt_sv: 0.01
punt_wins_qs: 0.01
inactive: 0.00
unknown: 0.00
map_archetype: balanced
classification_confidence: 37.7 # 0.58 * 0.65 * 100 -- still just under 40
best_response_hints:
- "Match cat-for-cat; decided on execution"
- "Include in N-estimate for every common-value FAAB target"
- "Active manager -- probe for fair consolidation trades"
Step 5 -- Read existing context/opponents/springfield-isotopes.md, preserve manual notes, update machine-generated sections:
The "Summary", "Apparent weaknesses", "Best response" and "Open questions" sections contain hand-authored user notes -- keep verbatim. Only these get refreshed:
- YAML frontmatter:
last_updated,confidence,source_urls - Section 2 (Archetype): archetype, archetype_confidence, archetype_evidence
- Section 3 (Category strength): cat_strength, presumed_punts, likely_pushes
- Section 4 (Behavioral): activity_level, last_active, waiver_aggression, faab_remaining, faab_avg_bid_pct, trade_propensity
Step 6 -- Emit weekly signal at signals/wk05-opponent-profiles.md:
---
type: opponent_profile_summary
date: 2026-04-20
week: 5
emitted_by: mlb-opponent-profiler
confidence: 0.65
source_urls: [<11 team pages + transactions + teams>]
---
## Archetype shifts since Week 4
- Springfield Isotopes: balanced 0.42 -> 0.58 (confidence 0.50 -> 0.66). Nikolay continues active pattern, record confirms execution.
- Jennys Team: inactive 0.78 -> 0.84 (confidence 0.80 -> 0.85). Still zero moves. Profile hardening.
- ...9 more
## New reactivity triggers fired
- Team 7 (Kenyi) -- outbid us on Wade Miley at $14; shift `faab_avg_bid_pct` up
Workflow
Copy this checklist and track progress:
Opponent Profiler Progress (per team, x 11 for all_opponents):
- [ ] Step 1: Resolve team identity (team_name -> team_id, manager_alias)
- [ ] Step 2: Scrape 4 Yahoo pages for this team
- [ ] Step 3: Extract MLB-specific features (9 features; see template.md)
- [ ] Step 4: Load prior_profile.posterior as archetype_prior (sequential update)
- [ ] Step 5: Invoke opponent-archetype-classifier with 10-archetype MLB taxonomy
- [ ] Step 6: Compute cat_strength (0-100 per cat) from roster composition
- [ ] Step 7: Read existing context/opponents/<slug>.md; identify manual-notes sections
- [ ] Step 8: Write updated file atomically (preserve manual notes)
- [ ] Step 9: (at end of batch) Emit signals/wkNN-opponent-profiles.md summary
Step 1: Resolve team identity
Input is either a single team_name (case-insensitive, fuzzy match against canonical Yahoo names) or all_opponents: true (iterate over team_ids 1-12, skipping user's own team_id). Map to team_id and team-slug (lowercased, hyphenated).
- If
team_namedoes not match any Yahoo team, return error -- do not guess -
team-slugmust match the filename on disk atcontext/opponents/<slug>.md(or create new file if absent)
Step 2: Scrape 4 Yahoo pages
Exact URL flow documented in resources/methodology.md. Pages needed:
-
https://baseball.fantasysports.yahoo.com/b1/23756/teams-- manager, last-login, W-L -
https://baseball.fantasysports.yahoo.com/b1/23756/<team_id>-- roster + FAAB remaining -
https://baseball.fantasysports.yahoo.com/b1/23756/<team_id>/draftresults-- draft pick distribution -
https://baseball.fantasysports.yahoo.com/b1/23756/transactions?tid=<team_id>-- adds/drops/bids/trades - Each scrape records the exact URL in
source_urls(for citation) - If any page returns 4xx/5xx or auth failure, degrade gracefully (see Guardrails #3)
Step 3: Extract MLB features
Computed from scraped pages. See resources/methodology.md for exact formulas.
-
sp_roster_share,closer_count,sb_speed_count,power_bat_count(roster composition) -
moves_per_week,faab_spent_pct,faab_avg_bid(waiver activity) -
lineup_set_daily(bool -- any benched-but-MLB-starting players in last 7 days?) -
trade_offers_sent,trade_offers_received -
record_last_2_weeks,days_since_last_login
Step 4: Sequential-update prior
Read prior_profile.posterior (if supplied) and pass as archetype_prior to the classifier. This is the key sequential-Bayes move: last week's posterior becomes this week's prior.
- If no
prior_profile, use taxonomy priors (Week 1 only) - If
prior_profile.posteriorsums to != 1.0, normalize and flag inassumptions_flagged - Observation weight rises with week number: Wk1-2 = 0.25, Wk3-4 = 0.45, Wk5-7 = 0.65, Wk8-11 = 0.80, Wk12+ = 0.90
Step 5: Invoke opponent-archetype-classifier
Pass the 10-archetype MLB taxonomy from resources/template.md along with the observed features, prior, and observation_weight. Do not re-implement Bayesian math in this skill. The classifier returns posterior, map_archetype, classification_confidence, best_response_hints, feature_contribution_breakdown, assumptions_flagged.
- Pass
correlated_feature_pairs(see Guardrails #1) - If classifier returns
map_archetype: "inconclusive", write archetype asunknownwith confidence as returned - Do not override or reinterpret classifier output -- store it as-is
Step 6: Compute cat_strength (0-100 per cat)
Classifier returns the archetype; the 10 per-cat strength scores come from a separate roster-based estimator (this is MLB-specific and stays in this skill). Method in resources/methodology.md.
- Sum projected season totals for each cat across roster
- Normalize to 0-100 where 50 is league average
- Derive
presumed_punts: cats where score < 35 - Derive
likely_pushes: cats where score > 65
Step 7: Read existing file, identify manual-notes sections
Critical: this skill never overwrites manually-authored notes. See resources/methodology.md.
- Open
context/opponents/<slug>.md; if not present, emit new file from template - Preserve Sections 1 (Summary), 5 (Apparent weaknesses / surpluses), 6 (Best response), 8 (Reactivity triggers), 9 (Open questions) verbatim
- Refresh only: frontmatter (last_updated, confidence, source_urls), Section 2 (Archetype), Section 3 (Category strength), Section 4 (Behavioral), Section 7 (Matchup history -- if we played them)
Step 8: Write file atomically
- Write to temp file
<slug>.md.tmp, fsync, rename - Validate output against opponent-profile-schema.md frontmatter + section order before rename
Step 9: Emit signal file
After all 11 updates complete (for all_opponents: true mode), emit signals/wkNN-opponent-profiles.md. Format in resources/template.md.
- Include every archetype posterior that shifted by > 0.05 since last week
- Include every confidence change > 0.10
- List any reactivity-trigger events fired this week
Common Patterns
Pattern 1: inactive (dormant manager -- e.g. Jenny's Team)
- Signals:
moves_per_week < 0.5,faab_spent_pct < 0.05,days_since_last_login > 3orlineup_set_daily = false - Best-response: Don't include in N-bidder FAAB estimates. Send consolidation trade offers. Expect roster atrophy -- exploit lineup-neglect advantage week-over-week.
- Confidence grows fast: inactivity is unambiguous; expect
archetype_confidenceabove 0.80 by Week 4.
Pattern 2: punt_wins_qs (Marmol strategy -- all-hitter + RP-only staff)
- Signals:
sp_roster_share < 0.25,closer_count >= 3,moves_per_week > 3(cycling for hitter production) - Best-response: Concede K and QS (two free cat losses); lock 6 of remaining 8 cats. Do not stream SPs against them.
Pattern 3: punt_sv (no closers, elite SP + hitting)
- Signals:
closer_count = 0,sp_roster_share > 0.40, high-end SP drafted in round 1-3 - Best-response: Concede SV; push all 5 hitting + K + QS + ERA + WHIP. Do not chase closers as streamers.
Pattern 4: frustrated_active (active but losing -- motivated trader)
- Signals:
moves_per_week > 4ANDrecord_last_2_weeksW% < 0.35 - Best-response: Prime trade partner. They'll respond to offers. Target their cooling stars (sell-high for them, buy-low for us).
Pattern 5: balanced (the default)
- Signals: No strong punts; roster composition within 1 std of league average across all features
- Best-response: Match cat-for-cat; matchup decided on weekly execution. Use variance-seeking when we're the underdog (
matchup_win_probability < 0.40).
Guardrails
-
Classifier delegation purity. This skill MUST NOT implement Bayesian math. If you find yourself computing posteriors, likelihoods, or normalizations, stop -- those belong in
opponent-archetype-classifier. This skill contributes the taxonomy (MLB-specific), feature extraction (Yahoo-specific), and output formatting. Any math beyond "sum projected season totals for cat_strength" is a smell. -
Manual-notes preservation is non-negotiable. Users hand-author Sections 1, 5, 6, 8, 9 with strategic notes this skill cannot reproduce (e.g. "Jennifer's email bounced, suggest probe via DM"). Blind-overwriting those sections destroys user work. Always read-modify-write. Validate diff before commit: only frontmatter + Sections 2, 3, 4, 7 should change.
-
Graceful scrape failure. If a Yahoo page returns 4xx/5xx or the session is unauthenticated: do NOT guess. Mark the corresponding features as
null, record the failed URL in ascrape_errors:block in frontmatter, and dropconfidenceby 0.2. The profile should still emit, just with lowered confidence. A partial profile is better than a missing one. -
Sequential-update discipline. Last week's posterior becomes this week's prior -- always. Do NOT restart from taxonomy priors each week (that throws away 1-4 weeks of evidence). The one exception: if
prior_profile.confidence < 0.20, restart (the prior profile is essentially garbage). -
Correlated-feature handling.
moves_per_weekandfaab_spent_pctare strongly correlated (active managers do both). Passcorrelated_feature_pairsto the classifier so it down-weights. Same forsp_roster_shareandcloser_count(they trade off by roster-slot constraint). -
Do not invent cat_strength from the archetype.
cat_strengthis a separate estimator based on actual roster composition, NOT derived fromarchetype. Abalancedteam with a thin OF has below-average SB; don't paper over that by inheriting "balanced = 50s across the board". Compute from roster. -
Archetype
unknownis valid. When classification_confidence < 40, writearchetype: unknownwitharchetype_confidencereflecting the classifier output. Do not force a MAP on thin evidence. Downstream agents handleunknownby falling back to broad heuristics. -
Citation requirement. Every emitted profile lists exact Yahoo URLs in
source_urls. The weekly signal file lists all URLs across all 11 teams (dedupe the teams-page URL). -
One-shot vs all-opponents mode. Single-team refreshes (on trade-offer arrival, FAAB competition observation) emit only the profile file -- no weekly summary signal. Weekly summary signal is emitted only after all 11 refreshes complete.
-
Team slugs stable. Slugs are lowercased + hyphenated team name, stable across the season (e.g.,
jennys-team,springfield-isotopes). If a manager renames their team mid-season, keep the original slug and store the newteam_namein frontmatter -- do not rename the file (breaks downstream agent references).
Quick Reference
Pipeline one-liner:
scrape_yahoo(4_pages) -> extract_mlb_features -> invoke(opponent-archetype-classifier,
taxonomy=mlb_10_archetype, prior=last_week_posterior, observation_weight=f(week))
-> compute_cat_strength(roster) -> read_existing_md -> merge_preserve_notes
-> atomic_write -> [if all_opponents] emit_weekly_signal
MLB 10-archetype taxonomy (baked in):
| Archetype | Core signal | Prior (Wk 1) |
|---|---|---|
balanced |
no punts, pushes all cats | 0.25 |
stars_and_scrubs |
4-5 elite + bench scrubs | 0.10 |
punt_sv |
0 closers, loaded elsewhere | 0.10 |
punt_sb |
power-only offense | 0.10 |
punt_wins_qs |
SP thin, RP-heavy (Marmol) | 0.05 |
hitter_heavy |
sp_roster_share < 0.30 | 0.10 |
pitcher_heavy |
sp_roster_share > 0.45 | 0.10 |
inactive |
zero moves, stale lineup | 0.10 |
frustrated_active |
high moves + losing record | 0.05 |
unknown |
inconclusive threshold | 0.05 |
Priors sum to 1.00. Informed (not uniform) per guardrail on 12-team base rates.
Observation weight schedule:
| Week | Weight | Rationale |
|---|---|---|
| 1-2 | 0.25 | Draft-only evidence; noisy |
| 3-4 | 0.45 | First waivers visible |
| 5-7 | 0.65 | Patterns stabilizing |
| 8-11 | 0.80 | Behavior well-characterized |
| 12+ | 0.90 | Any remaining doubt is true ambiguity |
Key resources:
- resources/template.md: 10-archetype MLB taxonomy YAML (passed to classifier), per-opponent
.mdoutput template (matchesopponent-profile-schema.md), weekly summary signal template. - resources/methodology.md: Yahoo scrape URL flow, feature-extraction formulas, cat_strength estimator, read-modify-write protocol, sequential-update protocol, graceful-degradation handling.
- resources/evaluators/rubric_mlb_opponent_profiler.json: 10 quality criteria (Taxonomy Completeness, Feature Extraction Correctness, Yahoo Scrape Coverage, Classifier Delegation Purity, Output Schema Conformance, Preserve Manual Notes, Sequential-Update Correctness, Signal File Emission, Graceful Scrape Failure, Citations).
Inputs required:
team_name(string, single team) ORall_opponents: trueyahoo_session(authenticated Chrome context)prior_profile(optional; if missing, bootstrap from taxonomy priors)
Outputs produced:
- Updated
context/opponents/<team-slug>.md(one per refreshed team) -- manual notes preserved signals/wkNN-opponent-profiles.md(only inall_opponentsmode) -- summary of archetype shifts + confidence changes + reactivity events
Downstream consumers (5 agents):
mlb-lineup-optimizer-- reads §3 cat_strength + §6 best_response forleverage_vs_opponentmlb-category-state-analyzer-- reads §2 archetype + §3 to anticipate opponent push/puntmlb-faab-sizer-- reads §4 (waiver_aggression, faab_remaining) across all 11 for N-bidder estimatemlb-trade-analyzer-- reads §4 (trade_propensity, trade_cooperation_score) + §5 weaknessesmlb-fantasy-coach-- reads §6 for the week's opponent for morning brief