figshare-data-download
Figshare Data Download
Use this skill to fetch Figshare dataset files behind anti-bot checks.
Workflow
- Normalize inputs:
item_or_file_url: DOI URL, item page URL, orndownloaderURLoutput_path: final local pathexpected_name: expected browser filename (optional)
- Resolve the file download link in browser:
- Open the DOI/item page with browser tooling.
- Find and click the dataset "Download" / "Download file" element.
- If already given an
ndownloaderURL, open it directly in browser.
-
Complete any browser challenge/verification and trigger the real file download.
-
Copy the downloaded file from
~/Downloadstooutput_path. -
Verify output:
- Ensure file size is non-zero.
- Ensure extension/type matches expectation (
.xlsx,.csv, etc.).
Decision Rules
- Do not use
web_fetchfor this skill. - Do not run curl probing as part of this skill workflow.
- Always use browser path first and keep it end-to-end.
Output Contract
- Always produce one final file at
output_path. - Keep the downloaded source file in
~/Downloadsunchanged. - Report:
- final path
- file size
- that the transfer used browser-only workflow
Script
scripts/figshare_data_download.py(browser-only helper)- supports semi-automatic flow:
--open-browser+ wait/poll in~/Downloads+ copy/verify - does not perform curl/web_fetch probing
- supports semi-automatic flow:
More from fadeloo/skills
email-imap-fetch
Listen for one or more IMAP inboxes with the IDLE command, fetch unread email metadata plus text previews, and forward each message to OpenClaw webhooks. Use when tasks need near-real-time mailbox monitoring, multi-account inbox ingestion via environment variables, and automatic trigger delivery into OpenClaw automation.
8ai-tech-fulltext-fetch
Fetch and persist article full text for RSS entries already stored in SQLite by ai-tech-rss-fetch. Use when backfilling or incrementally syncing body text from entries.url or entries.canonical_url into a companion table for downstream indexing, retrieval, or summarization.
8ai-tech-summary
Retrieve time-windowed RSS evidence from SQLite and let the agent produce final summaries using RAG over selected records and fields. Use when generating daily, weekly, monthly, or custom-range AI tech digests directly in agent responses instead of fixed template reports.
7email-smtp-send
Send emails through SMTP with optional local attachments and optional IMAP APPEND sync to Sent mailbox. Use when tasks need reliable outbound email delivery, attachment sending, SMTP connectivity checks, or cross-client sent-mail visibility (for example appending to "Sent Items" after SMTP send).
7ai-tech-rss-fetch
Subscribe to AI and tech RSS feeds and persist normalized metadata into SQLite using mature Python tooling (feedparser + sqlite3). Use when adding feed URLs/OPML sources, running incremental sync with deduplication, and storing entry metadata without full-text extraction or summarization.
7sustainability-rss-fetch
Ingest all sustainability journal RSS entries into a dedicated RSS SQLite database first, keyed by DOI, then mark relevance and prune non-relevant rows to DOI-only. Use when building a DOI-first ingestion pipeline with mandatory full ingestion before topic filtering.
7