First Run Guide¶

This guide walks you through your first recording with Voice Transcriber.

Prerequisites¶

Before starting, ensure you've completed the Installation Guide:

✅ Bun runtime installed
✅ System dependencies (arecord, xsel) installed
✅ Voice Transcriber installed (make setup)
✅ Configuration file created with OpenAI API key

Quick Setup Reminder

make setup  # One command to install everything

Step 1: Start the Application¶

Global InstallationLocal InstallationDebug Mode (Recommended for First Run)

voice-transcriber

cd voice-transcriber
make run

# Global
voice-transcriber --debug

# Local
make run ARGS="--debug"

Step 2: Configuration Wizard¶

If this is your first run, you'll see the Configuration Wizard:

🎤 Voice Transcriber - First Run Setup

No configuration found. Let's set up your OpenAI API key.

📝 Get your API key at: https://platform.openai.com/api-keys

Enter your OpenAI API key: _

Enter your API key when prompted. The wizard will:

Validate your API key
Create configuration file at ~/.config/voice-transcriber/config.json
Set default language to English
Enable text formatting by default

Setup Complete

Once configured, you'll see: "✅ Configuration saved successfully!"

Step 3: Locate the System Tray Icon¶

Look for the green circle icon in your system tray:

IDLE (Green)

Ready to record. Click to start recording.
RECORDING (Red)

Currently recording audio. Click to stop.
PROCESSING (Purple)

Transcribing audio. Wait for completion.

System Tray Location

GNOME: Top-right corner (may need "AppIndicator Support" extension)
KDE Plasma: Bottom-right panel
XFCE/MATE: Top or bottom panel (near clock)

Step 4: Your First Recording¶

Record Audio¶

Click the system tray icon (or select "Start Recording" from menu)
Icon changes to red circle 🔴
Speak into your microphone
Click again to stop recording

Example Recording

"Hello, this is my first test recording with Voice Transcriber."

Processing¶

Icon changes to purple circle 🟣
Audio is transcribed by OpenAI Whisper
Text is formatted by GPT (if enabled)
Result is copied to your clipboard automatically

Paste Result¶

Open any text editor (e.g., gedit, VS Code, browser)
Paste (Ctrl+V) the transcribed text

Expected Result

"Hello, this is my first test recording with Voice Transcriber."

Step 5: Test Different Languages (Optional)¶

Configure French¶

Edit your configuration:

nano ~/.config/voice-transcriber/config.json

Change language to French:

{
  "language": "fr",
  "formatterEnabled": true,
  "transcription": {
    "backend": "openai",
    "openai": {
      "apiKey": "sk-..."
    }
  }
}

Restart the application and try a French recording:

French Recording

"Bonjour, ceci est un test de transcription en français."

Expected paste result:

Bonjour, ceci est un test de transcription en français.

Right-click the system tray icon to see available options:

🎤 Voice Transcriber
├── 🎙️ Start Recording
├── ⏹️ Stop Recording
└── ❌ Exit

Start Recording: Begin audio capture (disabled while recording)
Stop Recording: End recording and transcribe (enabled only while recording)
Exit: Exit the application gracefully

Menu State

The menu items change state automatically: - When idle: "Start Recording" is enabled, "Stop Recording" is disabled - When recording: "Start Recording" is disabled, "Stop Recording" is enabled

Common First-Run Issues¶

Icon Not Visible¶

System Tray Not Showing

GNOME Users: Install "AppIndicator Support" extension

# Install extension
sudo apt-get install gnome-shell-extension-appindicator

# Restart GNOME Shell
# Press Alt+F2, type 'r', press Enter

Audio Recording Fails¶

arecord: device not found

Solution: Check audio devices

# List audio devices
arecord -l

# Test recording manually
arecord -d 5 test.wav
play test.wav

API Key Invalid¶

OpenAI API Error: Invalid API key

Solutions:

Verify your API key at OpenAI Platform
Check for extra spaces in config file
Ensure key starts with sk-
Verify API key has Whisper API access

Transcription in Wrong Language¶

French recorded but English transcribed

Solution: Set language explicitly in config

{
  "language": "fr"
}

Restart the application after config changes.

Debug Mode¶

Enable debug mode for detailed logging:

# Global installation
voice-transcriber --debug

# Local installation
make run ARGS="--debug"

Debug output example:

2025-10-11T10:30:15.123Z [DEBUG] WAV file size: 2.45 MB
2025-10-11T10:30:15.234Z [DEBUG] MP3 compression: 74.7% reduction
2025-10-11T10:30:16.789Z [INFO] OpenAI transcription completed in 1.55s
2025-10-11T10:30:16.789Z [DEBUG]   └─ Transcription: 142 characters

Next Steps¶

Now that you've completed your first recording, explore more features:

Basic Usage - Learn all recording options
Language Support - Multilingual transcription
Configuration - Advanced settings and custom prompts
Self-Hosted Setup - Run 100% offline

Quick Reference Card¶

Essential Commands¶

# Run application
make run                       # From project directory
voice-transcriber              # If installed globally

# Enable debug mode
make run ARGS="--debug"        # From project directory
voice-transcriber --debug      # If installed globally

# Edit configuration
nano ~/.config/voice-transcriber/config.json

# Check system dependencies
make check-system-deps

# Test audio devices
arecord -l

# View all available commands
make help

Recording Workflow¶

1. Click tray icon → 🟢 to 🔴
2. Speak into microphone
3. Click again → 🔴 to 🟣
4. Wait for processing
5. Paste result (Ctrl+V)

Need Help?