First Run Guide¶
This guide walks you through your first recording with Voice Transcriber.
Prerequisites¶
Before starting, ensure you've completed the Installation Guide:
- ✅ Bun runtime installed
- ✅ System dependencies (arecord, xsel) installed
- ✅ Voice Transcriber installed (make setup)
- ✅ Configuration file created with OpenAI API key
Step 1: Start the Application¶
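The Quick Reference Card at the end of this guide lists two ways to launch the application: make run from the project directory, or voice-transcriber if installed globally. The sketch below picks between them. The launch_command helper is hypothetical and only prints the command; it does not start the application.

```shell
# Hypothetical helper: print the launch command that applies to this machine.
# Prefers a global install; otherwise falls back to the project Makefile target.
launch_command() {
  if command -v voice-transcriber >/dev/null 2>&1; then
    echo "voice-transcriber"
  else
    echo "make run"   # run this from the project directory
  fi
}

echo "Start the application with: $(launch_command)"
```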
Step 2: Configuration Wizard¶
If this is your first run, you'll see the Configuration Wizard:
🎤 Voice Transcriber - First Run Setup
No configuration found. Let's set up your OpenAI API key.
Get your API key at: https://platform.openai.com/api-keys
Enter your OpenAI API key: _
Enter your API key when prompted. The wizard will:
- Validate your API key
- Create the configuration file at ~/.config/voice-transcriber/config.json
- Set default language to English
- Enable text formatting by default
Setup Complete
Once configured, you'll see: "✅ Configuration saved successfully!"
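To confirm that the wizard wrote the file, a quick existence check is enough. This is a minimal sketch: the config_status helper is hypothetical, and the default path is the one given in the wizard description above.

```shell
# Hypothetical helper: report whether the wizard's config file exists.
# Takes an optional path so it can be pointed at another location.
config_status() {
  config_path="${1:-$HOME/.config/voice-transcriber/config.json}"
  if [ -f "$config_path" ]; then
    echo "found: $config_path"
  else
    echo "missing: $config_path"
  fi
}

config_status
```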
Step 3: Locate the System Tray Icon¶
Look for the green circle icon in your system tray:
- IDLE (Green): Ready to record. Click to start recording.
- RECORDING (Red): Currently recording audio. Click to stop.
- PROCESSING (Purple): Transcribing audio. Wait for completion.
System Tray Location
- GNOME: Top-right corner (may need "AppIndicator Support" extension)
- KDE Plasma: Bottom-right panel
- XFCE/MATE: Top or bottom panel (near clock)
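If you are unsure which desktop environment is running, the standard XDG_CURRENT_DESKTOP environment variable usually names it. The sketch below maps that value to the locations listed above. The tray_hint helper is hypothetical, and the pattern matching is an assumption about how your session reports its name.

```shell
# Hypothetical helper: map a desktop name to the tray location listed above.
# XDG_CURRENT_DESKTOP may hold several colon-separated names (e.g. "ubuntu:GNOME").
tray_hint() {
  case "$1" in
    *GNOME*)       echo 'Top-right corner (may need "AppIndicator Support" extension)' ;;
    *KDE*)         echo "Bottom-right panel" ;;
    *XFCE*|*MATE*) echo "Top or bottom panel (near clock)" ;;
    *)             echo "Desktop not listed in this guide" ;;
  esac
}

tray_hint "${XDG_CURRENT_DESKTOP:-}"
```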
Step 4: Your First Recording¶
Record Audio¶
- Click the system tray icon (or select "Start Recording" from menu)
- Icon changes to red circle 🔴
- Speak into your microphone
- Click again to stop recording
Example Recording
"Hello, this is my first test recording with Voice Transcriber."
Processing¶
- Icon changes to purple circle 🟣
- Audio is transcribed by OpenAI Whisper
- Text is formatted by GPT (if enabled)
- Result is copied to your clipboard automatically
Paste Result¶
- Open any text editor (e.g., gedit, VS Code, browser)
- Paste (Ctrl+V) the transcribed text
Expected Result
"Hello, this is my first test recording with Voice Transcriber."
Step 5: Test Different Languages (Optional)¶
Configure French¶
Edit your configuration file (for example, nano ~/.config/voice-transcriber/config.json) and change language to French:
{
"language": "fr",
"formatterEnabled": true,
"transcription": {
"backend": "openai",
"openai": {
"apiKey": "sk-..."
}
}
}
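The language change can also be made from the shell instead of an editor. This is a minimal sketch, assuming GNU sed and a config file that already contains a single "language" entry. The set_language helper is hypothetical and not part of Voice Transcriber.

```shell
# Hypothetical helper: rewrite the "language" value in a config file in place.
# Assumes GNU sed (-i) and a single "language": "xx" entry in the file.
set_language() {
  lang_code="$1"
  config_path="${2:-$HOME/.config/voice-transcriber/config.json}"
  sed -i "s/\"language\": *\"[^\"]*\"/\"language\": \"$lang_code\"/" "$config_path"
}

# Usage: set_language fr
```

Only the language value is touched; other keys such as formatterEnabled are left as they are.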
Restart the application and try a French recording:
French Recording
"Bonjour, ceci est un test de transcription en français."
Expected paste result: the same sentence, transcribed in French.
Step 6: Explore Menu Options¶
Right-click the system tray icon to see available options:
🎤 Voice Transcriber
├── 🎙️ Start Recording
├── ⏹️ Stop Recording
└── ❌ Exit
Menu Actions¶
- Start Recording: Begin audio capture (disabled while recording)
- Stop Recording: End recording and transcribe (enabled only while recording)
- Exit: Exit the application gracefully
Menu State
The menu items change state automatically:
- When idle: "Start Recording" is enabled, "Stop Recording" is disabled
- When recording: "Start Recording" is disabled, "Stop Recording" is enabled
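The Menu State rule can be written as a small lookup. This sketch is illustrative only: the enabled_item helper and the lowercase state names are hypothetical, and the behavior during processing is not described in this guide, so it is reported as unspecified.

```shell
# Hypothetical helper: print which menu item is enabled for a given state,
# following the Menu State note. Only idle and recording are documented.
enabled_item() {
  case "$1" in
    idle)      echo "Start Recording" ;;
    recording) echo "Stop Recording" ;;
    *)         echo "unspecified" ;;   # e.g. processing: not covered by this guide
  esac
}

enabled_item idle
enabled_item recording
```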
Common First-Run Issues¶
Icon Not Visible¶
System Tray Not Showing
GNOME Users: Install "AppIndicator Support" extension
Audio Recording Fails¶
arecord: device not found
Solution: Check that a capture device is detected by listing audio devices with arecord -l.
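A quick way to read the device list is to count the entries it reports. This is a minimal sketch, assuming the usual arecord -l output, where each hardware capture device appears on a line beginning with "card ". The count_capture_devices helper is hypothetical.

```shell
# Hypothetical helper: count capture devices in `arecord -l` output read from stdin.
# Each hardware device is reported on a line that starts with "card ".
count_capture_devices() {
  grep -c '^card ' || true   # grep exits non-zero on zero matches but still prints 0
}

# Usage (requires ALSA's arecord): arecord -l | count_capture_devices
```

A result of 0 suggests that no microphone is visible to ALSA. If at least one device is listed, a short test recording such as arecord -d 3 -f cd test.wav can confirm that audio is actually captured.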
API Key Invalid¶
OpenAI API Error: Invalid API key
Solutions:
- Verify your API key at OpenAI Platform
- Check for extra spaces in config file
- Ensure key starts with sk-
- Verify API key has Whisper API access
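Two of the checks above (stray spaces and the sk- prefix) can be done locally before contacting OpenAI. The sketch below is a format check only and does not prove that the key is valid or has API access. The check_key_format helper is hypothetical.

```shell
# Hypothetical helper: basic format check for an API key string.
# Covers two items from the list above: the "sk-" prefix and stray whitespace.
check_key_format() {
  case "$1" in
    *[[:space:]]*) echo "invalid: contains whitespace" ;;
    sk-?*)         echo "ok" ;;
    *)             echo "invalid: does not start with sk-" ;;
  esac
}

check_key_format "sk-example"
```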
Transcription in Wrong Language¶
French recorded but English transcribed
Solution: Set the language explicitly in the config file, for example "language": "fr" for French.
Restart the application after config changes.
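Before restarting, it can help to confirm which language the file actually contains. This is a minimal sketch using grep and sed, assuming a single "language" entry in the file. The configured_language helper is hypothetical.

```shell
# Hypothetical helper: print the "language" value found in a config file.
# Uses grep and sed only; assumes one "language": "xx" entry.
configured_language() {
  config_path="${1:-$HOME/.config/voice-transcriber/config.json}"
  grep -o '"language": *"[^"]*"' "$config_path" | sed 's/.*"\([^"]*\)"$/\1/'
}

# Usage: configured_language
```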
Debug Mode¶
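Enable debug mode for detailed logging by passing --debug: run make run ARGS="--debug" from the project directory, or voice-transcriber --debug if installed globally.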
Debug output example:
2025-10-11T10:30:15.123Z [DEBUG] WAV file size: 2.45 MB
2025-10-11T10:30:15.234Z [DEBUG] MP3 compression: 74.7% reduction
2025-10-11T10:30:16.789Z [INFO] OpenAI transcription completed in 1.55s
2025-10-11T10:30:16.789Z [DEBUG] └─ Transcription: 142 characters
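The compression figure in the sample log can be checked by hand: a 74.7% reduction of a 2.45 MB WAV file leaves 2.45 * (1 - 0.747), or about 0.62 MB, to upload. The sketch below does that arithmetic with awk. The compressed_size_mb helper is hypothetical, and it assumes the percentage is measured relative to the WAV size.

```shell
# Hypothetical helper: size left after a percentage reduction, in the same unit.
# Arguments: original size, reduction percentage.
compressed_size_mb() {
  awk -v size="$1" -v pct="$2" 'BEGIN { printf "%.2f\n", size * (1 - pct / 100) }'
}

compressed_size_mb 2.45 74.7   # prints 0.62
```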
Next Steps¶
Now that you've completed your first recording, explore more features:
- Basic Usage - Learn all recording options
- Language Support - Multilingual transcription
- Configuration - Advanced settings and custom prompts
- Self-Hosted Setup - Run 100% offline
Quick Reference Card¶
Essential Commands¶
# Run application
make run # From project directory
voice-transcriber # If installed globally
# Enable debug mode
make run ARGS="--debug" # From project directory
voice-transcriber --debug # If installed globally
# Edit configuration
nano ~/.config/voice-transcriber/config.json
# Check system dependencies
make check-system-deps
# Test audio devices
arecord -l
# View all available commands
make help
Recording Workflow¶
1. Click tray icon → 🟢 to 🔴
2. Speak into microphone
3. Click again → 🔴 to 🟣
4. Wait for processing
5. Paste result (Ctrl+V)
Need Help?