youtube-batch-transcript-extractor-api-skill
YouTube Batch Transcript Extractor API Skill
📖 Introduction
This skill uses the BrowserAct YouTube Batch Transcript Extractor API template to provide users with an automated service for extracting YouTube video transcripts and metadata in batch. Simply by providing search keywords and filters, you can batch extract full video transcripts, likes, and channel metadata without writing crawler scripts.
✨ Features
- No hallucinations, ensuring stable and accurate data extraction: Pre-set workflows avoid generative AI hallucinations.
- No CAPTCHA issues: No need to handle reCAPTCHA or other verification challenges.
- No IP access restrictions or geofencing: No need to deal with regional IP restrictions.
- Faster execution: Tasks execute faster compared to pure AI-driven browser automation solutions.
- High cost-effectiveness: Significantly reduces data acquisition costs compared to AI solutions that consume a large number of tokens.
🔑 API Key Guide Process
Before running, you must check the BROWSERACT_API_KEY environment variable. If it is not set, do not take any other actions; you should request and wait for the user to provide it collaboratively.
The Agent must inform the user at this time:
"Since you have not yet configured the BrowserAct API Key, please go to the BrowserAct Console to get your Key first."
🛠️ Input Parameters
When calling the script, the Agent should flexibly configure the following parameters based on user needs:
-
KeyWords
- Type:
string - Description: The keyword to search for on YouTube.
- Example:
OpenClaw,AI Automation
- Type:
-
Upload_date
- Type:
string - Description: Filter for the upload date of the videos.
- Optional values:
Today,This week,This month,This year. - Default value:
This week
- Type:
-
Datelimit
- Type:
number - Description: The number of videos to extract. Adjust as needed.
- Default value:
5 - Recommendation: Set a smaller value (1-5) for quick tests and a larger value for bulk extraction.
- Type:
🚀 Invocation Method (Recommended)
The Agent should execute the following independent script to achieve "one-line command to get results":
# Example invocation
python -u ./scripts/youtube_batch_transcript_extractor_api.py "keywords" "Upload_date" Datelimit
⏳ Execution Status Monitoring
Since this task involves automated browser operations, it may take a long time (several minutes). The script will continuously output timestamped status logs (e.g., [14:30:05] Task Status: running) while running.
Agent Notes:
- While waiting for the script to return results, please keep an eye on the terminal output.
- As long as the terminal is still outputting new status logs, it means the task is running normally. Please do not mistakenly judge it as a deadlock or unresponsiveness.
- Only consider triggering the retry mechanism if the status remains unchanged for a long time or the script stops outputting without returning a result.
📊 Data Output Description
After successful execution, the script will parse and print the results directly from the API response. The results include:
Video title: The title of the YouTube video.Video URL: The direct link to the original video.Publisher: The name of the channel publishing the video.Channel link: The URL of the publisher's YouTube channel.Video likes count: The number of likes the video has received.Subtitles: The complete extracted transcript/subtitles of the videos.
⚠️ Error Handling & Retry
If an error is encountered during the execution of the script (such as network fluctuation or task failure), the Agent should follow the logic below:
-
Check the output content:
- If the output contains
"Invalid authorization", it indicates that the API Key is invalid or expired. Do not retry at this time. You should guide the user to recheck and provide the correct API Key. - If the output does not contain
"Invalid authorization"but the task execution fails (for example, the output starts withError:or returns an empty result), the Agent should automatically try to execute the script once more.
- If the output contains
-
Retry limits:
- Automatic retry is limited to once. If the second attempt still fails, stop retrying and report the specific error message to the user.
More from browser-act/skills
browser-act
Browser automation CLI (browser-act) for AI agents. MUST trigger when: (1) user mentions 'browser-act' in any form, or user needs to: (2) open/visit/browse/check a URL or webpage, (3) scrape/extract/crawl/monitor web content, (4) fill forms, click buttons, type text, scroll, or interact with page elements, (5) take a screenshot of a webpage, (6) handle or solve a captcha, (7) use a stealth/anti-detection browser or proxy, (8) connect to or control Chrome, (9) inspect network requests or record HAR, (10) automate any browser or web interaction task. Covers: navigation, page state inspection, element interaction, data extraction, JavaScript evaluation, tab management, network inspection, dialog handling, captcha solving, parallel browser sessions, stealth browsing, and any browser automation tasks.
887amazon-competitor-analyzer
Scrapes Amazon product data from ASINs using browseract.com automation API and performs surgical competitive analysis. Compares specifications, pricing, review quality, and visual strategies to identify competitor moats and vulnerabilities.
113amazon-reviews-api-skill
This skill helps users automatically extract Amazon product reviews via the Amazon Reviews API. Agent should proactively apply this skill when users express needs like getting reviews for Amazon product with ASIN B07TS6R1SF, analyzing customer feedback for a specific Amazon item, getting ratings and comments for a competitive product, tracking sentiment of recent Amazon reviews, extracting verified purchase reviews for quality assessment, summarizing user experiences from Amazon product pages, monitoring product performance through customer reviews, collecting reviewer profiles and links for market research, gathering review titles and descriptions for content analysis, scraping Amazon reviews without requiring a login.
73amazon-product-api-skill
This skill helps users extract structured product listings from Amazon, including titles, ASINs, prices, ratings, and specifications. Use this skill when users want to search for products on Amazon, find the best selling brand products, track price changes for items, get a list of categories with high ratings, compare different brand products on Amazon, extract Amazon product data for market research, look for products in a specific language or marketplace, analyze competitor pricing for keywords, find featured products for search terms, get technical specifications like material or color for product lists.
70web-research-assistant
AI-powered web research assistant that leverages BrowserAct API to supplement restricted web access by searching the internet for additional information. Designed for OpenClaw and Claude Code.
59amazon-product-search-api-skill
This skill is designed to help users automatically extract product data from Amazon search results. The Agent should proactively apply this skill when users request searching for products related to keywords, finding best-selling items from specific brands, monitoring product prices and availability on Amazon, extracting product listings for market research, collecting product ratings and review counts for competitive analysis, finding specific products with a maximum count, searching Amazon in different languages for localized results, tracking monthly sales estimates for brand products, gathering product URLs and titles for a product catalog, scanning Amazon for Best Seller tags in a specific category, monitoring shipping and delivery information for brand items, building a structured dataset of Amazon search results.
48