Landing Page Intel

Extract go-to-market (GTM) intelligence from any company's landing page by scraping its HTML source.

Quick Start

The only dependency is the requests library (pip install requests). No API key is needed.

# Basic scan of a single URL
python3 skills/landing-page-intel/scripts/scrape_landing_page.py \
  --url "https://example.com"

# Scan multiple pages of the same site
python3 skills/landing-page-intel/scripts/scrape_landing_page.py \
  --url "https://example.com" --pages "/,/pricing,/about"

# Output as summary table instead of JSON
python3 skills/landing-page-intel/scripts/scrape_landing_page.py \
  --url "https://example.com" --output summary

# Save full report to file
python3 skills/landing-page-intel/scripts/scrape_landing_page.py \
  --url "https://example.com" --output json > report.json

What It Extracts

| Category | Details |
|---|---|
| Tech Stack | Analytics (GA4, Mixpanel, Amplitude, PostHog, Heap), marketing automation (HubSpot, Marketo, Pardot), chat widgets (Intercom, Drift, Crisp, Zendesk), A/B testing (Optimizely, VWO, LaunchDarkly), session recording (Hotjar, FullStory, LogRocket), CDPs (Segment, Clearbit, 6sense) |
| Ad Pixels | Meta Pixel, Google Ads, LinkedIn Insight Tag, TikTok pixel, Twitter pixel |
| Customer Logos | Image URLs from "trusted by" / logo carousel sections, grouped by directory |
| SEO Metadata | Title, meta description, Open Graph tags, Twitter Cards, canonical URL, structured data (JSON-LD), hreflang tags |
| CTAs & Sales Motion | All CTA button text and links, which reveal a PLG vs sales-led motion |
| Social Proof | Testimonials, customer counts, case study links, badge images |
| Integrations | Links to integration/partner pages, embedded third-party widgets |
| Hidden Elements | Content in display:none, hidden, or HTML comments that may reveal upcoming features |
| Infrastructure | CMS or site framework (Webflow, WordPress, Next.js, etc.), detected from HTML signatures |
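
To illustrate what detection "from HTML signatures" can look like, here is a minimal sketch that matches known script and asset URLs against the page source. The `SIGNATURES` table and the `detect_tools` helper are hypothetical and chosen for illustration; the patterns the skill's script actually uses are not documented here and may differ.

```python
import re

# Hypothetical signature table: illustrative patterns only, not the ones
# the skill's script necessarily uses.
SIGNATURES = {
    "Google Analytics 4": re.compile(r"googletagmanager\.com/gtag/js\?id=G-", re.I),
    "HubSpot": re.compile(r"js\.hs-scripts\.com", re.I),
    "Intercom": re.compile(r"widget\.intercom\.io", re.I),
    "Hotjar": re.compile(r"static\.hotjar\.com", re.I),
    "Meta Pixel": re.compile(r"connect\.facebook\.net/[^\"']*/fbevents\.js", re.I),
    "WordPress": re.compile(r"/wp-content/", re.I),
}


def detect_tools(html: str) -> list[str]:
    """Return the sorted names of all tools whose signature appears in the HTML."""
    return sorted(name for name, pattern in SIGNATURES.items() if pattern.search(html))


sample = """
<script async src="https://www.googletagmanager.com/gtag/js?id=G-ABC123"></script>
<script src="//js.hs-scripts.com/1234567.js"></script>
<link rel="stylesheet" href="/wp-content/themes/acme/style.css">
"""
print(detect_tools(sample))  # ['Google Analytics 4', 'HubSpot', 'WordPress']
```

Matching on raw source keeps the approach dependency-free, but it only sees tools referenced in the initial HTML; anything injected later by a tag manager would likely be missed.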

CLI Reference

| Flag | Default | Description |
|---|---|---|
| `--url` | required | Target website URL |
| `--pages` | `/` | Comma-separated paths to scan (e.g., `/,/pricing,/about`) |
| `--output` | `json` | Output format: `json` or `summary` |
| `--timeout` | `15` | Request timeout in seconds |
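
The flags above could be declared with `argparse` roughly as follows. This is a sketch reconstructed from the table, not the script's actual source: `build_parser` and `page_urls` are hypothetical names, the integer type for `--timeout` is an assumption, and joining each path onto the base URL with `urljoin` is one plausible way to expand `--pages`.

```python
import argparse
from urllib.parse import urljoin


def build_parser() -> argparse.ArgumentParser:
    """Declare the four documented flags with their documented defaults."""
    parser = argparse.ArgumentParser(description="Scan a landing page (sketch)")
    parser.add_argument("--url", required=True, help="Target website URL")
    parser.add_argument("--pages", default="/", help="Comma-separated paths to scan")
    parser.add_argument("--output", default="json", choices=["json", "summary"],
                        help="Output format")
    # Assumption: the timeout is parsed as a whole number of seconds.
    parser.add_argument("--timeout", type=int, default=15,
                        help="Request timeout in seconds")
    return parser


def page_urls(base_url: str, pages: str) -> list[str]:
    """Expand a comma-separated path list into absolute URLs."""
    return [urljoin(base_url, path.strip()) for path in pages.split(",") if path.strip()]


args = build_parser().parse_args(
    ["--url", "https://example.com", "--pages", "/,/pricing,/about"]
)
print(page_urls(args.url, args.pages))
# ['https://example.com/', 'https://example.com/pricing', 'https://example.com/about']
```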

GTM Use Cases

  • Competitive intel: See what tools competitors use, how they position, who their customers are
  • Prospect research: Before a sales call, scan a prospect's site to understand their stack and maturity
  • Market mapping: Scan multiple competitors to compare positioning, customer segments, and GTM motions
  • Customer discovery: Extract competitor customer logos as potential prospects for your own product
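
For the market-mapping use case, the scans can be scripted. The sketch below shells out to the script once per competitor using only the documented flags and collects the JSON reports. The helper names, the placeholder URLs, and the `market_map.json` output file are hypothetical, and the code assumes the script writes nothing but the JSON report to stdout. It makes no assumptions about the report's internal fields.

```python
import json
import subprocess

SCRIPT = "skills/landing-page-intel/scripts/scrape_landing_page.py"


def build_command(url: str, pages: str = "/") -> list[str]:
    """Assemble the CLI invocation from the documented flags only."""
    return ["python3", SCRIPT, "--url", url, "--pages", pages, "--output", "json"]


def scan(url: str, pages: str = "/"):
    """Run one scan and parse its stdout as JSON (report schema not assumed)."""
    result = subprocess.run(
        build_command(url, pages), capture_output=True, text=True, check=True
    )
    return json.loads(result.stdout)


def main() -> None:
    # Placeholder URLs: replace with the competitors you want to compare.
    competitors = ["https://example.com", "https://example.org"]
    reports = {url: scan(url, "/,/pricing") for url in competitors}
    with open("market_map.json", "w") as fh:
        json.dump(reports, fh, indent=2)
```

Calling `main()` from the repository root would write one combined file keyed by URL, which can then be compared side by side.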

Cost

Free. No API keys required. Uses only HTTP requests to fetch public HTML.
