> ## Documentation Index
> Fetch the complete documentation index at: https://docs.lyrcs.ai/llms.txt
> Use this file to discover all available pages before exploring further.

# Languages

> Full language reference and quality notes

lyrcs.ai supports 48 languages across two tiers. Indian languages have additional quality layers not present for global languages.

## Indian languages — enhanced accuracy

All 30 Indian languages benefit from:

* **Language-specific prompt layers** — additional context and instructions tuned for each language's linguistic characteristics, script conventions, and common lyric patterns
* **Correction dictionaries** — post-processing filters for common transcription errors specific to each language
* **Transliteration standards** — romanisation follows consistent, widely-understood conventions (e.g. ITRANS-style for Hindi/Sanskrit, ISO 15919 influenced for Dravidian languages)

Indian languages include all 22 of India's officially scheduled languages plus additional widely-spoken regional languages.

## Global languages

The 18 global languages use the base transcription pipeline without language-specific correction layers. Quality is high for well-resourced languages (English, Spanish, French, Mandarin Chinese) and good for others.

## Full language table

### Indian Languages — 30

| Language      | Family        | Script                     |
| ------------- | ------------- | -------------------------- |
| Hindi         | Indo-Aryan    | Devanagari                 |
| Bengali       | Indo-Aryan    | Bengali                    |
| Marathi       | Indo-Aryan    | Devanagari                 |
| Telugu        | Dravidian     | Telugu                     |
| Tamil         | Dravidian     | Tamil                      |
| Gujarati      | Indo-Aryan    | Gujarati                   |
| Urdu          | Indo-Aryan    | Nastaliq (Perso-Arabic)    |
| Kannada       | Dravidian     | Kannada                    |
| Odia          | Indo-Aryan    | Odia                       |
| Malayalam     | Dravidian     | Malayalam                  |
| Punjabi       | Indo-Aryan    | Gurmukhi                   |
| Assamese      | Indo-Aryan    | Assamese (Bengali variant) |
| Maithili      | Indo-Aryan    | Devanagari / Tirhuta       |
| Santali       | Austroasiatic | Ol Chiki                   |
| Kashmiri      | Indo-Aryan    | Perso-Arabic / Devanagari  |
| Nepali        | Indo-Aryan    | Devanagari                 |
| Sindhi        | Indo-Aryan    | Perso-Arabic / Devanagari  |
| Konkani       | Indo-Aryan    | Devanagari / Latin         |
| Dogri         | Indo-Aryan    | Devanagari                 |
| Manipuri      | Sino-Tibetan  | Meitei Mayek / Bengali     |
| Bodo          | Sino-Tibetan  | Devanagari                 |
| Sanskrit      | Indo-Aryan    | Devanagari                 |
| Haryanvi      | Indo-Aryan    | Devanagari                 |
| Bhojpuri      | Indo-Aryan    | Devanagari                 |
| Magahi        | Indo-Aryan    | Devanagari                 |
| Chhattisgarhi | Indo-Aryan    | Devanagari                 |
| Rajasthani    | Indo-Aryan    | Devanagari                 |
| Tulu          | Dravidian     | Kannada / Tigalari         |
| Gondi         | Dravidian     | Devanagari / Gondi         |
| Saurashtra    | Indo-Aryan    | Saurashtra / Latin         |

### Global Languages — 18

| Language         | Family       |
| ---------------- | ------------ |
| English          | Germanic     |
| Spanish          | Romance      |
| Mandarin Chinese | Sino-Tibetan |
| French           | Romance      |
| Arabic           | Semitic      |
| Portuguese       | Romance      |
| Russian          | Slavic       |
| Japanese         | Japonic      |
| German           | Germanic     |
| Korean           | Koreanic     |
| Italian          | Romance      |
| Turkish          | Turkic       |
| Dutch            | Germanic     |
| Indonesian       | Austronesian |
| Thai             | Tai-Kadai    |
| Yoruba           | Niger-Congo  |
| Swahili          | Niger-Congo  |
| Zulu             | Niger-Congo  |

<Note>
  The `script` field is **not** returned by `GET /api/v1/languages` — only `name` and `family`. The script column above is for reference only. The transcript output uses whatever script the audio contains, as recognised by the model.
</Note>

## Using language values

Always use the exact `name` strings from `GET /api/v1/languages`. Values are case-sensitive.

```bash theme={null}
# Correct
"language": "Mandarin Chinese"

# Wrong — will return 400
"language": "mandarin chinese"
"language": "Chinese"
"language": "zh"
```

## Known limitations

* **WAV/FLAC duration reporting:** Audio submitted as WAV or FLAC may show `duration_seconds: 0` in job responses. This is a metadata extraction issue and does not affect transcript or alignment quality.
* **Script variation:** For languages with multiple scripts (e.g. Kashmiri in Perso-Arabic vs Devanagari), the output script is determined by what the model detects in the audio and context. If you need a specific script, contact [support@lyrcs.ai](mailto:support@lyrcs.ai).
