Language Support¶
Voice Transcriber supports multiple languages with strong language enforcement to prevent unwanted language switching during transcription.
Supported Languages¶
Language | Code | Quality | Notes |
---|---|---|---|
English | en | ⭐⭐⭐⭐⭐ | Default, excellent accuracy |
French | fr | ⭐⭐⭐⭐⭐ | Excellent, strong enforcement |
Spanish | es | ⭐⭐⭐⭐⭐ | Excellent support |
German | de | ⭐⭐⭐⭐⭐ | Excellent support |
Italian | it | ⭐⭐⭐⭐⭐ | Excellent support |
Configuring Language¶
Edit your configuration file:
Set the language
field:
{
"language": "fr",
"formatterEnabled": true,
"transcription": {
"backend": "openai",
"openai": {
"apiKey": "sk-..."
}
}
}
Restart the application for changes to take effect.
Language Enforcement¶
Voice Transcriber uses strong language-specific prompts to prevent Whisper from switching languages mid-transcription.
How It Works¶
When you set a language (e.g., French):
-
Transcription prompt explicitly instructs Whisper:
-
Formatting prompt maintains the language:
This dual-layer enforcement ensures your transcriptions stay in the configured language.
Examples by Language¶
English¶
Recording: "Hello, this is a test of the transcription system"
Result: "Hello, this is a test of the transcription system."
French¶
Recording: "Bonjour, ceci est un test du système de transcription"
Result: "Bonjour, ceci est un test du système de transcription."
Spanish¶
Recording: "Hola, esta es una prueba del sistema de transcripción"
Result: "Hola, esta es una prueba del sistema de transcripción."
Mixed-Language Content¶
For recordings with technical English terms in French sentences (common in tech contexts):
Configuration:
{
"language": "fr",
"transcriptionPrompt": "Transcribe this French audio. Keep technical English terms but preserve French grammar.",
"transcription": {
"backend": "openai",
"openai": {
"apiKey": "sk-..."
}
}
}
Recording: "J'utilise React avec TypeScript pour développer l'application"
Result: "J'utilise React avec TypeScript pour développer l'application."
Custom Prompts
See Configuration Guide for advanced mixed-language scenarios.
Troubleshooting Language Issues¶
French Transcription Switches to English¶
Problem: Long French recordings switch to English mid-way
Solution: Ensure language is set explicitly:
Restart the application after config changes.
Technical Terms Not Recognized¶
Problem: Technical English terms in French speech are poorly transcribed
Solution: Use custom transcription prompt:
{
"language": "fr",
"transcriptionPrompt": "Transcribe this French technical audio. Keep common English technical terms as-is.",
"transcription": {
"backend": "openai",
"openai": {
"apiKey": "sk-..."
}
}
}
Accent Not Recognized Well¶
Problem: Strong accent reduces accuracy
Solutions: - Speak slightly slower and more clearly - Reduce background noise for better recognition - Try self-hosted Speaches with different models (see Speaches Integration)
Language-Specific Tips¶
French¶
- Accents (é, è, à, ç) are generally recognized well
- Homophones (ou/où, a/à) may need manual correction
- Long compound sentences benefit from formatting
Spanish¶
- Accents and ñ are well recognized
- Regional variations (Spain vs Latin America) work well
- Inverted question marks may need manual addition
German¶
- Umlauts (ä, ö, ü, ß) are recognized accurately
- Compound words are usually transcribed correctly
- Formal/informal you (Sie/du) preserved correctly
Italian¶
- Accents (à, è, ì, ò, ù) are well recognized
- Double consonants are usually correct
- Regional variations work well
Performance by Language¶
All supported languages have similar processing times:
Language | Avg Processing Time | Accuracy |
---|---|---|
English | 1.5-2.5s per 30s | 95-98% |
French | 1.5-2.5s per 30s | 93-97% |
Spanish | 1.5-2.5s per 30s | 93-97% |
German | 1.5-2.5s per 30s | 92-96% |
Italian | 1.5-2.5s per 30s | 93-97% |
Adding More Languages¶
While only 5 languages have strong enforcement prompts, Whisper supports many more:
Other supported codes: ja
, zh
, pt
, ru
, ko
, ar
, nl
, pl
, tr
, etc.
To use unsupported language:
{
"language": "pt",
"transcriptionPrompt": "Transcribe this Portuguese audio completely in Portuguese. Do not switch languages.",
"transcription": {
"backend": "openai",
"openai": {
"apiKey": "sk-..."
}
}
}
Community Contributions
If you use Voice Transcriber in other languages and develop effective prompts, please contribute them via GitHub Pull Request!
Next Steps¶
- Basic Usage - Learn recording workflow
- Configuration Guide - Custom prompts and settings
- Transcription Backends - OpenAI vs Speaches
Need Help?