Whisper Claude Code Interactive

Voice interface for Claude Code. Talk to Claude using push-to-talk, type text, or combine both.

Views0
PublishedJan 14, 2026

Loading actions...

5 minBeginnerpromptSingle file

Skill content

Main instructions and any bundled files for this skill.

markdown

Whisper Claude Code Interactive

Voice interface for Claude Code. Talk to Claude using push-to-talk, type text, or combine both.

Requirements

  • Python 3.10+
  • Linux (requires sudo for global keyboard capture)
  • Microphone
  • Claude Code CLI installed and configured

Installation

./install.sh

During installation, you'll be asked to select a language for speech recognition:

#LanguageCode
1Englishen
2Españoles
3中文 (Chinese)zh
4हिन्दी (Hindi)hi
5Françaisfr
6العربية (Arabic)ar
7Portuguêspt
8Русский (Russian)ru
9日本語 (Japanese)ja
10Auto-detectNone

This creates a virtual environment and installs dependencies (Whisper, keyboard, colorama, etc.)

Usage

./run.sh

Controls

ActionDescription
Ctrl+Shift (hold)Record audio, release to send
Type + EnterSend text only (no audio)
Type + Ctrl+ShiftCombine typed text with voice
Ctrl+CExit

Voice confirmations

When Claude asks for confirmation, respond with:

  • "sí" / "yes" - Confirm once
  • "siempre" / "always" - Confirm and remember for this session
  • "no" - Cancel

Permission escalation

When a command requires additional permissions, Claude will ask. Say "sí" to allow once, or "siempre" to allow that command type permanently.

Configuration

Edit config.py to customize:

# Push-to-talk keys
PTT_KEYS = {'ctrl', 'shift'}

# Whisper model (tiny, base, small, medium, large-v3)
WHISPER_MODEL = "medium"

# Language for speech recognition
WHISPER_LANGUAGE = "es"  # "es" for Spanish, "en" for English

# Device (auto, cuda, cpu)
WHISPER_DEVICE = "auto"

Changing language after installation

To change the speech recognition language, edit config.py:

WHISPER_LANGUAGE = "en"   # English
WHISPER_LANGUAGE = "es"   # Español
WHISPER_LANGUAGE = "zh"   # Chinese
WHISPER_LANGUAGE = "hi"   # Hindi
WHISPER_LANGUAGE = "fr"   # Français
WHISPER_LANGUAGE = "ar"   # Arabic
WHISPER_LANGUAGE = "pt"   # Português
WHISPER_LANGUAGE = "ru"   # Russian
WHISPER_LANGUAGE = "ja"   # Japanese
WHISPER_LANGUAGE = None   # Auto-detect

Allowed tools

Default allowed tools (no confirmation needed):

  • Read, Glob, Grep
  • git status, log, diff, add, commit
  • ls, cat, head, tail, find, pwd, whoami, etc.

Additional tools can be permanently allowed by saying "siempre" when prompted.

Files

  • main.py - Main application
  • config.py - Configuration
  • audio_capture.py - Push-to-talk recording
  • transcriber.py - Whisper transcription
  • claude_interface.py - Claude Code integration
  • run.sh - Launch script
  • install.sh - Installation script
  • uninstall.sh - Uninstall script

Uninstall

./uninstall.sh

Notes

  • Runs with sudo for keyboard access, but Claude commands execute as original user
  • Session persists in ~/.whisper-claude-workspace/
  • Workspace directory stores session data and user-approved tools
Share: