Voice Control
Source: src/lib/gesture/voice-engine.ts, src/lib/gesture/whisper-voice-engine.ts, src/lib/gesture/tts-engine.ts
Overview
CatGo's voice control system provides speech-to-text (via Whisper) and text-to-speech for hands-free interaction with the application.
Architecture
VoiceEngine
Base voice engine interface for speech recognition.
WhisperVoiceEngine
Local Whisper model for privacy-preserving speech-to-text. Model downloaded on first use.
TTSEngine
Text-to-speech engine for voice feedback on executed commands.
Voice Commands
Structure Manipulation
- Rotation, zoom, pan commands
- Show/hide bonds, labels, axes
- Reset view
Atom Art
- Place atoms by element name
- Build molecular fragments
Analysis
- Trigger computations by voice
Configuration
- Microphone selection
- Voice activation sensitivity
- Language setting
- TTS voice selection