Skill
Transcribe Telegram voice messages (.ogg) to text using local OpenAI Whisper. No API keys, fully offline. Use when the user sends a voice recording via Telegram.
Playbook (mirrored from disk)
TG Voice Whisper
Transcribes Telegram voice messages to text using local Whisper (small model — good Hebrew accuracy; tiny for speed, large-v3-turbo for max accuracy — all cached). Offline, private, no API keys.
Requirements
ffmpeg— audio conversionwhisper— transcription (openai-whisper pip package)
Installation
sudo apt-get install -y ffmpeg
pip3 install openai-whisper --break-system-packagesUsage
When a voice message arrives, it lands in ~/.openclaw/media/inbound/ as .ogg:
whisper /path/to/file.ogg --model small --language auto --output_format txt --output_dir /tmp/whisper
cat /tmp/whisper/*.txtThen reply with the transcribed text.
Notes
- First run: ~15s model download (~72MB)
- After cache: <1s on 1vCPU
- Auto-detect language works well for Hebrew + English
- For better accuracy: use
--model baseor--model small