local-whisper

Installation
SKILL.md

🎙️ Local Whisper Transcription

100% private voice transcription using OpenAI Whisper. Audio never leaves your machine.

⚠️ Important: OpenClaw Configuration Required

This skill must be configured in OpenClaw's tools.media.audio to handle incoming voice messages. Without this config, voice messages may cause token overflow errors or be sent to cloud APIs.

Add to ~/.openclaw/openclaw.json:

{
  tools: {
    media: {
      audio: {
        enabled: true,
        models: [
          {
            type: "cli",
            command: "node",
            args: ["<skill-path>/transcribe.js", "{{MediaPath}}"]
          }
        ]
      }
    }
  }
}

Replace <skill-path> with the actual path to this skill (e.g., /home/user/skills/local-whisper).

Quick Start

# Install dependencies
pip install openai-whisper

# Transcribe audio
node transcribe.js voice.ogg

CLI Options

--model <tiny|base|small|medium|large>  Model size (default: small)
--language <lang>                        Language code (default: auto)
--output-dir <dir>                       Output directory
--smart-model                           Auto-select model by file size
--check                                 Verify dependencies

Model Sizes

Model Size Speed RAM
tiny 39 MB ⚡⚡⚡⚡ ~1GB
base 74 MB ⚡⚡⚡ ~1GB
small 244 MB ⚡⚡ ~2GB
medium 769 MB ~5GB
large 1550 MB 🐢 ~10GB

Documentation

Weekly Installs
1
First Seen
Mar 27, 2026