Skip to content

First Run Guide

This guide walks you through your first recording with Voice Transcriber.

Prerequisites

Before starting, ensure you've completed the Installation Guide:

  • βœ… Bun runtime installed
  • βœ… System dependencies (arecord, xsel) installed
  • βœ… Voice Transcriber installed (make setup)
  • βœ… Configuration file created with OpenAI API key

Quick Setup Reminder

make setup  # One command to install everything

Step 1: Start the Application

voice-transcriber
cd voice-transcriber
make run
# Global
voice-transcriber --debug

# Local
make run ARGS="--debug"

Step 2: Configuration Wizard

If this is your first run, you'll see the Configuration Wizard:

🎀 Voice Transcriber - First Run Setup

No configuration found. Let's set up your OpenAI API key.

πŸ“ Get your API key at: https://platform.openai.com/api-keys

Enter your OpenAI API key: _

Enter your API key when prompted. The wizard will:

  1. Validate your API key
  2. Create configuration file at ~/.config/voice-transcriber/config.json
  3. Set default language to English
  4. Enable text formatting by default

Setup Complete

Once configured, you'll see: "βœ… Configuration saved successfully!"

Step 3: Locate the System Tray Icon

Look for the green circle icon in your system tray:

  • IDLE (Green)


    Ready to record. Click to start recording.

  • RECORDING (Red)


    Currently recording audio. Click to stop.

  • PROCESSING (Purple)


    Transcribing audio. Wait for completion.

System Tray Location

  • GNOME: Top-right corner (may need "AppIndicator Support" extension)
  • KDE Plasma: Bottom-right panel
  • XFCE/MATE: Top or bottom panel (near clock)

Step 4: Your First Recording

Record Audio

  1. Click the system tray icon (or select "Start Recording" from menu)
  2. Icon changes to red circle πŸ”΄
  3. Speak into your microphone
  4. Click again to stop recording

Example Recording

"Hello, this is my first test recording with Voice Transcriber."

Processing

  1. Icon changes to purple circle 🟣
  2. Audio is transcribed by OpenAI Whisper
  3. Text is formatted by GPT (if enabled)
  4. Result is copied to your clipboard automatically

Paste Result

  1. Open any text editor (e.g., gedit, VS Code, browser)
  2. Paste (Ctrl+V) the transcribed text

Expected Result

"Hello, this is my first test recording with Voice Transcriber."

Step 5: Test Different Languages (Optional)

Configure French

Edit your configuration:

nano ~/.config/voice-transcriber/config.json

Change language to French:

{
  "language": "fr",
  "formatterEnabled": true,
  "transcription": {
    "backend": "openai",
    "openai": {
      "apiKey": "sk-..."
    }
  }
}

Restart the application and try a French recording:

French Recording

"Bonjour, ceci est un test de transcription en franΓ§ais."

Expected paste result:

Bonjour, ceci est un test de transcription en franΓ§ais.

Step 6: Explore Menu Options

Right-click the system tray icon to see available options:

🎀 Voice Transcriber
β”œβ”€β”€ πŸŽ™οΈ Start Recording
β”œβ”€β”€ ⏹️ Stop Recording
└── ❌ Exit
  • Start Recording: Begin audio capture (disabled while recording)
  • Stop Recording: End recording and transcribe (enabled only while recording)
  • Exit: Exit the application gracefully

Menu State

The menu items change state automatically: - When idle: "Start Recording" is enabled, "Stop Recording" is disabled - When recording: "Start Recording" is disabled, "Stop Recording" is enabled

Common First-Run Issues

Icon Not Visible

System Tray Not Showing

GNOME Users: Install "AppIndicator Support" extension

# Install extension
sudo apt-get install gnome-shell-extension-appindicator

# Restart GNOME Shell
# Press Alt+F2, type 'r', press Enter

Audio Recording Fails

arecord: device not found

Solution: Check audio devices

# List audio devices
arecord -l

# Test recording manually
arecord -d 5 test.wav
play test.wav

API Key Invalid

OpenAI API Error: Invalid API key

Solutions:

  1. Verify your API key at OpenAI Platform
  2. Check for extra spaces in config file
  3. Ensure key starts with sk-
  4. Verify API key has Whisper API access

Transcription in Wrong Language

French recorded but English transcribed

Solution: Set language explicitly in config

{
  "language": "fr"
}

Restart the application after config changes.

Debug Mode

Enable debug mode for detailed logging:

# Global installation
voice-transcriber --debug

# Local installation
make run ARGS="--debug"

Debug output example:

2025-10-11T10:30:15.123Z [DEBUG] WAV file size: 2.45 MB
2025-10-11T10:30:15.234Z [DEBUG] MP3 compression: 74.7% reduction
2025-10-11T10:30:16.789Z [INFO] OpenAI transcription completed in 1.55s
2025-10-11T10:30:16.789Z [DEBUG]   └─ Transcription: 142 characters

Next Steps

Now that you've completed your first recording, explore more features:

Quick Reference Card

Essential Commands

# Run application
make run                       # From project directory
voice-transcriber              # If installed globally

# Enable debug mode
make run ARGS="--debug"        # From project directory
voice-transcriber --debug      # If installed globally

# Edit configuration
nano ~/.config/voice-transcriber/config.json

# Check system dependencies
make check-system-deps

# Test audio devices
arecord -l

# View all available commands
make help

Recording Workflow

1. Click tray icon β†’ 🟒 to πŸ”΄
2. Speak into microphone
3. Click again β†’ πŸ”΄ to 🟣
4. Wait for processing
5. Paste result (Ctrl+V)