youtube-transcript-extractor-api-skill
YouTube Transcript Extractor API Skill
📖 Introduction
This skill provides a one-stop video transcript extraction service using BrowserAct's YouTube Transcript Extractor API template. It can directly extract full video transcripts and metadata from any YouTube video. By simply providing the TargetURL, you can get clean, ready-to-use transcript and metadata.
✨ Features
- No hallucinations, ensuring stable and accurate data extraction: Pre-set workflows avoid generative AI hallucinations.
- No CAPTCHA issues: No need to handle reCAPTCHA or other verification challenges.
- No IP access restrictions or geofencing: No need to deal with regional IP limits.
- Faster execution: Compared to pure AI-driven browser automation solutions, task execution is much faster.
- High cost-effectiveness: Significantly reduces data acquisition costs compared to AI solutions that consume large amounts of tokens.
🔑 API Key Setup
Before running, you must check the BROWSERACT_API_KEY environment variable. If it is not set, do not take any other actions; you must request and wait for the user to provide it.
The Agent must inform the user at this point:
"Since you haven't configured the BrowserAct API Key yet, please go to the BrowserAct Console to get your Key first."
🛠️ Input Parameters
The Agent should configure the following parameter based on the user's needs when calling the script:
- TargetURL (Target URL)
- Type:
string - Description: The URL of the YouTube video you want to extract the transcript and metadata from.
- Example:
https://www.youtube.com/watch?v=st534T7-mdE
- Type:
🚀 Usage (Recommended)
The Agent should execute the following independent script to achieve "one command, get results":
# Example Call
python -u ./scripts/youtube_transcript_extractor_api.py "TargetURL"
⏳ Running Status Monitoring
Since this task involves automated browser operations, it may take a long time (several minutes). While running, the script will continuously output status logs with timestamps (e.g., [14:30:05] Task Status: running).
Agent Instructions:
- While waiting for the script to return results, please keep an eye on the terminal output.
- As long as the terminal continues to output new status logs, it means the task is running normally. Do not misjudge it as a deadlock or unresponsiveness.
- Only if the status remains unchanged for a long time or the script stops outputting without returning a result, should you consider triggering the retry mechanism.
📊 Data Output Description
After successful execution, the script will parse and print the results directly from the API response. The results include:
video_title: The title of the YouTube videovideo_url: The direct link to the original videopublisher: The name of the channel publishing the videochannel_link: The URL of the publisher's YouTube channelvideo_likes_count: The number of likes the video has receivedtranscript: The complete extracted transcript/subtitles of the video
⚠️ Error Handling & Retry
During script execution, if an error occurs (such as network fluctuation or task failure), the Agent should follow this logic:
-
Check output content:
- If the output contains
"Invalid authorization", it means the API Key is invalid or expired. In this case, do not retry, and guide the user to check and provide the correct API Key. - If the output does not contain
"Invalid authorization"but the task execution fails (for example, the output starts withError:or returns an empty result), the Agent should automatically try to execute the script one more time.
- If the output contains
-
Retry limits:
- Automatic retry is limited to only once. If the second attempt still fails, stop retrying and report the specific error message to the user.
More from browser-act/skills
browser-act
Browser automation CLI (browser-act) for AI agents. MUST trigger when: (1) user mentions 'browser-act' in any form, or user needs to: (2) open/visit/browse/check a URL or webpage, (3) scrape/extract/crawl/monitor web content, (4) fill forms, click buttons, type text, scroll, or interact with page elements, (5) take a screenshot of a webpage, (6) handle or solve a captcha, (7) use a stealth/anti-detection browser or proxy, (8) connect to or control Chrome, (9) inspect network requests or record HAR, (10) automate any browser or web interaction task. Covers: navigation, page state inspection, element interaction, data extraction, JavaScript evaluation, tab management, network inspection, dialog handling, captcha solving, parallel browser sessions, stealth browsing, and any browser automation tasks.
887amazon-competitor-analyzer
Scrapes Amazon product data from ASINs using browseract.com automation API and performs surgical competitive analysis. Compares specifications, pricing, review quality, and visual strategies to identify competitor moats and vulnerabilities.
113amazon-reviews-api-skill
This skill helps users automatically extract Amazon product reviews via the Amazon Reviews API. Agent should proactively apply this skill when users express needs like getting reviews for Amazon product with ASIN B07TS6R1SF, analyzing customer feedback for a specific Amazon item, getting ratings and comments for a competitive product, tracking sentiment of recent Amazon reviews, extracting verified purchase reviews for quality assessment, summarizing user experiences from Amazon product pages, monitoring product performance through customer reviews, collecting reviewer profiles and links for market research, gathering review titles and descriptions for content analysis, scraping Amazon reviews without requiring a login.
73amazon-product-api-skill
This skill helps users extract structured product listings from Amazon, including titles, ASINs, prices, ratings, and specifications. Use this skill when users want to search for products on Amazon, find the best selling brand products, track price changes for items, get a list of categories with high ratings, compare different brand products on Amazon, extract Amazon product data for market research, look for products in a specific language or marketplace, analyze competitor pricing for keywords, find featured products for search terms, get technical specifications like material or color for product lists.
70web-research-assistant
AI-powered web research assistant that leverages BrowserAct API to supplement restricted web access by searching the internet for additional information. Designed for OpenClaw and Claude Code.
59amazon-product-search-api-skill
This skill is designed to help users automatically extract product data from Amazon search results. The Agent should proactively apply this skill when users request searching for products related to keywords, finding best-selling items from specific brands, monitoring product prices and availability on Amazon, extracting product listings for market research, collecting product ratings and review counts for competitive analysis, finding specific products with a maximum count, searching Amazon in different languages for localized results, tracking monthly sales estimates for brand products, gathering product URLs and titles for a product catalog, scanning Amazon for Best Seller tags in a specific category, monitoring shipping and delivery information for brand items, building a structured dataset of Amazon search results.
48