youtube-transcript-analysis-api-skill
YouTube Transcript Analysis API Skill
📖 Brief
This skill provides an end-to-end YouTube video transcript extraction and deep content analysis service. By extracting video transcripts and then systematically analyzing them, users can understand competitors' core value propositions, target audience profiles, pain point strategies, and content gaps — all without manually watching hours of video.
This skill works in two phases:
- Phase 1 — Transcript Extraction: Uses BrowserAct API to extract raw transcript data (supports single video and batch modes).
- Phase 2 — Deep Analysis: The Agent performs structured 8-dimension analysis on the extracted transcripts.
✨ Features
- No hallucinations, ensuring stable and accurate data extraction: Pre-set workflows avoid AI generative hallucinations.
- No CAPTCHA issues: No need to handle reCAPTCHA or other verification challenges.
- No IP restrictions or geo-blocking: No need to handle regional IP restrictions or geofencing.
- Faster execution: Tasks execute faster compared to purely AI-driven browser automation solutions.
- Extremely high cost-efficiency: Significantly reduces data acquisition costs compared to AI solutions that consume massive amounts of tokens.
🔑 API Key Guide
Before running, you must check the BROWSERACT_API_KEY environment variable. If it is not set, do not take other actions first; you should ask and wait for the user to provide it.
Agent must inform the user:
"Since you haven't configured the BrowserAct API Key yet, please go to the BrowserAct Console to get your Key."
🛠️ Input Parameters
The Agent should determine the extraction mode based on the user's needs:
Mode A: Single Video Analysis
Use when the user provides a specific YouTube video URL.
- TargetURL
- Type:
string - Description: The URL of the YouTube video to extract and analyze.
- Example:
https://www.youtube.com/watch?v=st534T7-mdE - Required: Yes
- Type:
Mode B: Batch Video Analysis
Use when the user wants to search and analyze multiple videos by keyword.
-
KeyWords
- Type:
string - Description: The keyword to search for on YouTube.
- Example:
AI Automation,SaaS Marketing - Required: Yes
- Type:
-
Upload_date
- Type:
string - Description: Filter for the upload date of the videos.
- Example:
This week - Default:
This week
- Type:
-
Datelimit
- Type:
number - Description: The number of videos to extract and analyze.
- Example:
3 - Default:
3
- Type:
Optional Analysis Parameters
These parameters are set by the user's intent, not script arguments:
-
Analysis Language
- Type:
string - Description: The language the analysis report should be written in. Defaults to the same language as the user's request.
- Example:
Chinese,English
- Type:
-
Analysis Focus
- Type:
string - Description: The user may specify an analysis focus. The Agent must dynamically adjust the depth of specific dimensions based on this focus. For example:
- Competitor Analysis -> Deep dive into Dim 7 (Business Model) and Dim 8 (Gaps).
- Viral Deconstruction -> Deep dive into Dim 1 (Hook), Dim 4 (Emotional Arc), and Dim 5 (Viral Drivers).
- Audience Research -> Deep dive into Dim 3 (Persona & Intent) and Dim 4 (Pain Points).
- Default: All 8 dimensions balanced.
- Example:
Competitor Analysis,Viral Deconstruction,Audience Research
- Type:
🚀 Invocation Method
The Agent should execute the unified extraction script based on the mode:
Mode A — Single Video:
python -u ./scripts/youtube_transcript_analysis_api.py single "TargetURL"
Mode B — Batch Videos:
python -u ./scripts/youtube_transcript_analysis_api.py batch "keywords" "Upload_date" Datelimit
⏳ Running Status Monitoring
Since this task involves automated browser operations, it may take a long time (several minutes). The script will continuously output status logs with timestamps while running (e.g., [14:30:05] Task Status: running).
Agent guidelines:
- While waiting for the script to return results, please keep an eye on the terminal output.
- As long as the terminal continues to output new status logs, it means the task is running normally. Do not misjudge it as a deadlock or unresponsiveness.
- If the status remains unchanged for a long time or the script stops outputting without returning a result, only then consider triggering the retry mechanism.
Post-Extraction Workflow
After the script completes and returns transcript data, the Agent must proceed with two additional steps:
Step 1: Present Video Metadata — Display the extracted metadata to the user. (Note: Do NOT output the full raw transcript text in your response, as it is too long. Use it internally for your analysis.)
Step 2: Perform Concise 8-Dimension Analysis — Analyze the transcript across the 8 dimensions. ⚠️ CRITICAL: The analysis MUST be extremely concise, bullet-point driven, and free of filler words. Directly state the facts, evidence, and actionable insights without verbose explanations. Use the same language as the user's request.
📊 Data Output
After successful execution, the output includes two parts:
Part 1: Video Metadata
The script returns the following fields for each video:
video_title: The title of the YouTube videovideo_url: The direct link to the original videopublisher: The name of the channel publishing the videochannel_link: The URL of the publisher's YouTube channelvideo_likes_count: The number of likes the video has receivedtranscript: The complete extracted transcript/subtitles of the video (used internally for analysis, do not display full text)
Part 2: 8-Dimension Analysis
After presenting raw data, the Agent must produce structured analysis on the transcript content across the following 8 dimensions:
Dimension 1: Content Structure & Hook
Analyze the video's narrative architecture:
- Opening Hook: What is the core hook in the first 30 seconds? Quote it and explain the hook logic (e.g., curiosity gap, bold claim).
- Narrative Framework: Identify the overall structure (e.g., Problem-Agitate-Solve, Hero's Journey, Listicle).
- Pacing & Time Allocation: Proportion of intro vs. core content vs. pitch/CTA.
Dimension 2: Core Messaging
Extract the central message:
- Single Core Viewpoint: What is the ONE key thesis the video conveys?
- Supporting Arguments: How is the viewpoint supported? (Data, analogies, personal experience).
- Conclusion Clarity: Is the conclusion clear and memorable?
Dimension 3: Audience Persona & Intent
Identify the intended viewer and their mindset:
- Target Viewer Profile & Level: Who is this for? (Beginner, Expert) What prior knowledge is assumed?
- Viewer Intent: Why are they watching? (To learn a skill, be entertained, make a buying decision, or validate existing beliefs?)
Dimension 4: Pain Points & Emotional Arc
Map the emotional journey and problems addressed:
- Explicit & Implicit Pain Points: What specific problems are stated or implied? Quote exact words.
- Emotional Arc: How does the content shift the viewer's emotion? (e.g., from anxiety/confusion to clarity/relief/empowerment). This emotional shift drives retention and sharing.
Dimension 5: Viral & Engagement Drivers
Analyze the spreading mechanism:
- Shareability Factors: Why is this video shared? (Controversial takes, highly relatable scenarios, title/thumbnail alignment inferred from script).
- Memorable/Quotable Phrasing: Extract unique expressions, catchy concepts, or "aha" moments that stick in the mind.
Dimension 6: Evidence & Credibility
Evaluate trust-building elements:
- Authority Signals: Data cited, expert references, or professional background mentioned.
- Social Proof & Empathy: Real user stories, case studies, or the creator sharing their own past struggles to build rapport.
Dimension 7: Business Model & Conversion
Deconstruct the monetization and CTA strategy:
- Primary Monetization Goal: What is the underlying business purpose? (Ad revenue, selling a course, affiliate marketing, brand sponsorship, lead generation).
- CTA Strategy: What actions are requested? How is urgency or value constructed to drive this action?
Dimension 8: Categorized Content Gaps
Identify strategic opportunities by splitting gaps into three layers:
- Creator's Weaknesses: Arguments that lack evidence, logical flaws, or poorly explained concepts.
- Unresolved Viewer Questions: What specific questions would the audience still have after watching?
- Industry Whitespace: What related angles or broader perspectives did the video entirely miss that you could cover?
Output Format
For Single Video Analysis:
## Video Metadata
[Present video metadata. DO NOT print full transcript]
## Concise Deep Analysis
*(Output in extremely brief bullet points, max 1-2 short sentences per point)*
### 1. Content Structure & Hook
[Concise bullets]
### 2. Core Messaging
[Concise bullets]
### 3. Audience Persona & Intent
[Concise bullets]
### 4. Pain Points & Emotional Arc
[Concise bullets]
### 5. Viral & Engagement Drivers
[Concise bullets]
### 6. Evidence & Credibility
[Concise bullets]
### 7. Business Model & Conversion
[Concise bullets]
### 8. Categorized Content Gaps
[Concise bullets]
### Key Takeaways
[3 short, actionable strategic insights]
For Batch Video Analysis:
## Video Metadata
[Present all video metadata. DO NOT print full transcripts]
## Concise Individual Analysis
[Repeat the concise 8-dimension analysis for EACH video using brief bullet points]
## Cross-Video Comparative Analysis
[After analyzing all videos individually, provide a comparative summary]:
- Common value propositions: What themes appear across multiple videos?
- Shared target audience: Is there a consistent audience profile?
- Recurring pain points: Which problems are mentioned most frequently?
- Dominant content strategies: What narrative structures and CTA patterns are most common?
- Competitive differentiation: How do different creators/brands position themselves differently?
- Industry content gaps: What topics are consistently missing across all analyzed videos?
⚠️ Error Handling & Retry
During script execution, if errors occur (such as network fluctuations or task failure), the Agent should follow this logic:
-
Check the output content:
- If the output contains
"Invalid authorization", it means the API Key is invalid or expired. At this point, do not retry, but guide the user to recheck and provide the correct API Key. - If the output contains
"concurrent"or"too many running tasks"or similar concurrency limit messages, it means the concurrent task limit for the current subscription plan has been reached. Do not retry; guide the user to upgrade their plan. Agent must inform the user:"The current task cannot be executed because your BrowserAct account has reached the limit of concurrent tasks. Please go to the BrowserAct Plan Upgrade Page to upgrade your subscription plan and enjoy more concurrent task benefits."
- If the output does not contain the above error keywords but the task fails (e.g., output starts with
Error:or returns empty results), the Agent should automatically try to run the script once more.
- If the output contains
-
Retry limit:
- Automatic retry is limited to once. If the second attempt still fails, stop retrying and report the specific error message to the user.
-
Analysis phase notes:
- If the transcript is too short (fewer than 50 words), note this and provide analysis only on available content.
- If the transcript appears to be auto-generated and contains many errors, note this caveat at the beginning of the analysis.
🌟 Typical Use Cases
- Competitive content strategy analysis: Analyze competitors' top-performing videos to understand their messaging and positioning.
- Target audience research: Identify who competitors are targeting and how they speak to them.
- Pain point discovery: Extract customer pain points mentioned in competitor videos for product development insights.
- Content gap identification: Find topics competitors haven't covered well to create differentiated content.
- CTA strategy benchmarking: Understand how competitors drive conversions through their video content.
- Value proposition mapping: Map out what value propositions competitors emphasize most.
- Messaging framework extraction: Learn from competitors' narrative structures and persuasion techniques.
- Market trend analysis: Batch analyze recent videos in a niche to identify emerging themes and shifts.
- Content quality benchmarking: Evaluate the depth and credibility of competitor content.
- Marketing copy inspiration: Extract memorable phrases and emotional hooks for your own content creation.
More from browser-act/skills
browser-act
Browser automation CLI (browser-act) for AI agents. MUST trigger when: (1) user mentions 'browser-act' in any form, or user needs to: (2) open/visit/browse/check a URL or webpage, (3) scrape/extract/crawl/monitor web content, (4) fill forms, click buttons, type text, scroll, or interact with page elements, (5) take a screenshot of a webpage, (6) handle or solve a captcha, (7) use a stealth/anti-detection browser or proxy, (8) connect to or control Chrome, (9) inspect network requests or record HAR, (10) automate any browser or web interaction task. Covers: navigation, page state inspection, element interaction, data extraction, JavaScript evaluation, tab management, network inspection, dialog handling, captcha solving, parallel browser sessions, stealth browsing, and any browser automation tasks.
890amazon-competitor-analyzer
Scrapes Amazon product data from ASINs using browseract.com automation API and performs surgical competitive analysis. Compares specifications, pricing, review quality, and visual strategies to identify competitor moats and vulnerabilities.
113amazon-reviews-api-skill
This skill helps users automatically extract Amazon product reviews via the Amazon Reviews API. Agent should proactively apply this skill when users express needs like getting reviews for Amazon product with ASIN B07TS6R1SF, analyzing customer feedback for a specific Amazon item, getting ratings and comments for a competitive product, tracking sentiment of recent Amazon reviews, extracting verified purchase reviews for quality assessment, summarizing user experiences from Amazon product pages, monitoring product performance through customer reviews, collecting reviewer profiles and links for market research, gathering review titles and descriptions for content analysis, scraping Amazon reviews without requiring a login.
73amazon-product-api-skill
This skill helps users extract structured product listings from Amazon, including titles, ASINs, prices, ratings, and specifications. Use this skill when users want to search for products on Amazon, find the best selling brand products, track price changes for items, get a list of categories with high ratings, compare different brand products on Amazon, extract Amazon product data for market research, look for products in a specific language or marketplace, analyze competitor pricing for keywords, find featured products for search terms, get technical specifications like material or color for product lists.
70web-research-assistant
AI-powered web research assistant that leverages BrowserAct API to supplement restricted web access by searching the internet for additional information. Designed for OpenClaw and Claude Code.
59amazon-product-search-api-skill
This skill is designed to help users automatically extract product data from Amazon search results. The Agent should proactively apply this skill when users request searching for products related to keywords, finding best-selling items from specific brands, monitoring product prices and availability on Amazon, extracting product listings for market research, collecting product ratings and review counts for competitive analysis, finding specific products with a maximum count, searching Amazon in different languages for localized results, tracking monthly sales estimates for brand products, gathering product URLs and titles for a product catalog, scanning Amazon for Best Seller tags in a specific category, monitoring shipping and delivery information for brand items, building a structured dataset of Amazon search results.
48