lyrcs.ai supports 48 languages across two tiers. Indian languages have additional quality layers not present for global languages.
Indian languages — enhanced accuracy
All 30 Indian languages benefit from:
- Language-specific prompt layers — additional context and instructions tuned for each language’s linguistic characteristics, script conventions, and common lyric patterns
- Correction dictionaries — post-processing filters for common transcription errors specific to each language
- Transliteration standards — romanisation follows consistent, widely-understood conventions (e.g. ITRANS-style for Hindi/Sanskrit, ISO 15919 influenced for Dravidian languages)
Indian languages include all 22 of India’s officially scheduled languages plus additional widely-spoken regional languages.
Global languages
The 18 global languages use the base transcription pipeline without language-specific correction layers. Quality is high for well-resourced languages (English, Spanish, French, Mandarin Chinese) and good for others.
Full language table
Indian Languages — 30
| Language | Family | Script |
|---|
| Hindi | Indo-Aryan | Devanagari |
| Bengali | Indo-Aryan | Bengali |
| Marathi | Indo-Aryan | Devanagari |
| Telugu | Dravidian | Telugu |
| Tamil | Dravidian | Tamil |
| Gujarati | Indo-Aryan | Gujarati |
| Urdu | Indo-Aryan | Nastaliq (Perso-Arabic) |
| Kannada | Dravidian | Kannada |
| Odia | Indo-Aryan | Odia |
| Malayalam | Dravidian | Malayalam |
| Punjabi | Indo-Aryan | Gurmukhi |
| Assamese | Indo-Aryan | Assamese (Bengali variant) |
| Maithili | Indo-Aryan | Devanagari / Tirhuta |
| Santali | Austroasiatic | Ol Chiki |
| Kashmiri | Indo-Aryan | Perso-Arabic / Devanagari |
| Nepali | Indo-Aryan | Devanagari |
| Sindhi | Indo-Aryan | Perso-Arabic / Devanagari |
| Konkani | Indo-Aryan | Devanagari / Latin |
| Dogri | Indo-Aryan | Devanagari |
| Manipuri | Sino-Tibetan | Meitei Mayek / Bengali |
| Bodo | Sino-Tibetan | Devanagari |
| Sanskrit | Indo-Aryan | Devanagari |
| Haryanvi | Indo-Aryan | Devanagari |
| Bhojpuri | Indo-Aryan | Devanagari |
| Magahi | Indo-Aryan | Devanagari |
| Chhattisgarhi | Indo-Aryan | Devanagari |
| Rajasthani | Indo-Aryan | Devanagari |
| Tulu | Dravidian | Kannada / Tigalari |
| Gondi | Dravidian | Devanagari / Gondi |
| Saurashtra | Indo-Aryan | Saurashtra / Latin |
Global Languages — 18
| Language | Family |
|---|
| English | Germanic |
| Spanish | Romance |
| Mandarin Chinese | Sino-Tibetan |
| French | Romance |
| Arabic | Semitic |
| Portuguese | Romance |
| Russian | Slavic |
| Japanese | Japonic |
| German | Germanic |
| Korean | Koreanic |
| Italian | Romance |
| Turkish | Turkic |
| Dutch | Germanic |
| Indonesian | Austronesian |
| Thai | Tai-Kadai |
| Yoruba | Niger-Congo |
| Swahili | Niger-Congo |
| Zulu | Niger-Congo |
The script field is not returned by GET /api/v1/languages — only name and family. The script column above is for reference only. The transcript output uses whatever script the audio contains, as recognised by the model.
Using language values
Always use the exact name strings from GET /api/v1/languages. Values are case-sensitive.
# Correct
"language": "Mandarin Chinese"
# Wrong — will return 400
"language": "mandarin chinese"
"language": "Chinese"
"language": "zh"
Known limitations
- WAV/FLAC duration reporting: Audio submitted as WAV or FLAC may show
duration_seconds: 0 in job responses. This is a metadata extraction issue and does not affect transcript or alignment quality.
- Script variation: For languages with multiple scripts (e.g. Kashmiri in Perso-Arabic vs Devanagari), the output script is determined by what the model detects in the audio and context. If you need a specific script, contact support@lyrcs.ai.