video-transcription
Video Transcription
Follow shared release-shell rules in:
postplus-sharedrelease-shell rules
Use this skill when the input is video and the main problem is:
- spoken transcript extraction
- subtitle timing
- edit-prep timestamps
- multilingual caption generation
This skill is not a substitute for semantic video analysis.
Hosted Endpoint
First-version hosted transcription endpoint:
- hosted transcription capability
transcription-whisper-with-video
Use enableTimestamps=true by default when the result will drive subtitles or edit decisions.
Poll Behavior
Hosted video transcription is asynchronous. The script polls the prediction result URL
until status is completed or failed. Default poll window: 150 attempts × 2 s = 5 minutes.
Real speech video typically completes within 30–60 s. Silent or very short videos may complete in under 10 s. If a job exceeds 5 minutes, the hosted provider is likely overloaded — retry rather than increasing the timeout further.
Output Contract
Persist:
request.jsonresponse.jsonmanifest.json- provider output artifacts under
outputs/
Scripts
scripts/transcribe_video.mjs
Polling support is shared with:
skills/40-creative/audio-transcription/scripts/poll_transcription.mjs
Read These References
references/tool-contracts.mdreferences/normalized-transcript-schema.md
Release-Shell Execution Contract
- keep transcription requests, provider responses, normalized transcripts, and
downloaded subtitle artifacts under
<work-folder>/.postplus/video-transcription/ - keep only final user-facing transcript exports outside
.postplus/ - start with a bounded first pass, usually one source file before larger batches
- if hosted transcription capability is unavailable, unauthorized, or returns a stable network error, stop immediately instead of switching to ad hoc shell glue
More from postplusai/postplus-skills
audio-transcription
Transcribe local or remote audio into durable text and timestamp artifacts using hosted Whisper models. Use this when the job is speech-to-text from audio files and you need request/response persistence, optional timestamps, and subtitle-ready outputs.
82google-trends-research
Research Google Trends search-intent signals for topic discovery, keyword momentum, regional interest, and rising queries without treating search trends as the same thing as platform content heat or marketplace demand.
77seedance-submitter
Use when preparing, submitting, polling, or debugging Seedance 2.0 video generation jobs from product images, storyboard images, UGC scripts, voiceover copy, or promptPlan request JSON. Use for splitting scripts into render segments, uploading references, creating request JSON, submitting jobs through the hosted capability, polling predictions, and handing off local render paths.
75social-media-publisher
Prepare and, after explicit approval, publish social posts through the PostPlus platform-owned Postiz workspace.
75facebook-research
Research Facebook pages, public follower or following surfaces, and public posts using hosted collection capability. Use this when the user wants Facebook account research, follower-surface sampling, or public post metrics.
75x-tools
Local execution tools for X/Twitter hosted collection workflows, including actor runs, dataset normalization, tweet ranking, account ranking, audience graph construction, and language clustering.
74