diff --git a/sdk/cpp/README.md b/sdk/cpp/README.md index 8d96083d..eaf90cd4 100644 --- a/sdk/cpp/README.md +++ b/sdk/cpp/README.md @@ -278,6 +278,42 @@ audio.TranscribeAudioStreaming(R"(C:\path\to\audio.wav)", [](const AudioCreateTr }); ``` +#### Multilingual Language Codes + +When using a multilingual ASR model (e.g. `nvidia-nemotron-3.5-asr-streaming-multilingual-0.6b`), set the language to a BCP-47 language code to hint the expected language. + +| Code | Language | Code | Language | +|------|----------|------|----------| +| `en-US` | English (US) | `en-GB` | English (UK) | +| `zh-CN` | Chinese (Simplified) | `ja-JP` | Japanese | +| `ko-KR` | Korean | `de-DE` | German | +| `fr-FR` | French | `fr-CA` | French (Canada) | +| `es-ES` | Spanish (Spain) | `es-US` | Spanish (US) | +| `it-IT` | Italian | `pt-BR` | Portuguese (Brazil) | +| `pt-PT` | Portuguese (Portugal) | `ru-RU` | Russian | +| `ar-AR` | Arabic | `hi-IN` | Hindi | +| `nl-NL` | Dutch | `pl-PL` | Polish | +| `sv-SE` | Swedish | `da-DK` | Danish | +| `fi-FI` | Finnish | `cs-CZ` | Czech | +| `ro-RO` | Romanian | `hu-HU` | Hungarian | +| `sk-SK` | Slovak | `bg-BG` | Bulgarian | +| `el-GR` | Greek | `et-EE` | Estonian | +| `hr-HR` | Croatian | `lt-LT` | Lithuanian | +| `lv-LV` | Latvian | `sl-SL` | Slovenian | +| `uk-UA` | Ukrainian | `he-IL` | Hebrew | +| `nb-NO` | Norwegian (Bokmål) | `nn-NO` | Norwegian (Nynorsk) | +| `th-TH` | Thai | `tr-TR` | Turkish | +| `vi-VN` | Vietnamese | `auto` | Auto-detect | + +Use `"auto"` to let the model detect the spoken language automatically. + +Set the language on a live audio transcription session: + +```cpp +session->Settings().language = "zh-CN"; // Chinese (Simplified) +// session->Settings().language = "auto"; // Auto-detect language +``` + ### Tool Calling See `sample/main.cpp` (Example 5) for a full tool-calling walkthrough. diff --git a/sdk/cs/README.md b/sdk/cs/README.md index 9493eea0..1ee872c5 100644 --- a/sdk/cs/README.md +++ b/sdk/cs/README.md @@ -300,6 +300,41 @@ audioClient.Settings.Language = "en"; audioClient.Settings.Temperature = 0.0f; ``` +#### Multilingual Language Codes + +When using a multilingual ASR model (e.g. `nvidia-nemotron-3.5-asr-streaming-multilingual-0.6b`), set `Language` to a BCP-47 language code to hint the expected language. This applies to both `OpenAIAudioClient.Settings.Language` and `LiveAudioTranscriptionSession.Settings.Language`. + +| Code | Language | Code | Language | +|------|----------|------|----------| +| `en-US` | English (US) | `en-GB` | English (UK) | +| `zh-CN` | Chinese (Simplified) | `ja-JP` | Japanese | +| `ko-KR` | Korean | `de-DE` | German | +| `fr-FR` | French | `fr-CA` | French (Canada) | +| `es-ES` | Spanish (Spain) | `es-US` | Spanish (US) | +| `it-IT` | Italian | `pt-BR` | Portuguese (Brazil) | +| `pt-PT` | Portuguese (Portugal) | `ru-RU` | Russian | +| `ar-AR` | Arabic | `hi-IN` | Hindi | +| `nl-NL` | Dutch | `pl-PL` | Polish | +| `sv-SE` | Swedish | `da-DK` | Danish | +| `fi-FI` | Finnish | `cs-CZ` | Czech | +| `ro-RO` | Romanian | `hu-HU` | Hungarian | +| `sk-SK` | Slovak | `bg-BG` | Bulgarian | +| `el-GR` | Greek | `et-EE` | Estonian | +| `hr-HR` | Croatian | `lt-LT` | Lithuanian | +| `lv-LV` | Latvian | `sl-SL` | Slovenian | +| `uk-UA` | Ukrainian | `he-IL` | Hebrew | +| `nb-NO` | Norwegian (Bokmål) | `nn-NO` | Norwegian (Nynorsk) | +| `th-TH` | Thai | `tr-TR` | Turkish | +| `vi-VN` | Vietnamese | `auto` | Auto-detect | + +Use `"auto"` to let the model detect the spoken language automatically. + +```csharp +// Multilingual example +audioClient.Settings.Language = "zh-CN"; // Chinese +session.Settings.Language = "auto"; // Auto-detect +``` + ### Live Audio Transcription (Real-Time Streaming) For real-time microphone-to-text transcription, use `CreateLiveTranscriptionSession()`. Audio is pushed as raw PCM chunks and transcription results stream back as an `IAsyncEnumerable`. diff --git a/sdk/js/README.md b/sdk/js/README.md index 2eebae01..5aa84e3c 100644 --- a/sdk/js/README.md +++ b/sdk/js/README.md @@ -265,6 +265,39 @@ for await (const chunk of audioClient.transcribeStreaming('/path/to/audio.wav')) } ``` +#### Multilingual Language Codes + +When using a multilingual ASR model (e.g. `nvidia-nemotron-3.5-asr-streaming-multilingual-0.6b`), set `language` to a BCP-47 language code to hint the expected language. + +| Code | Language | Code | Language | +|------|----------|------|----------| +| `en-US` | English (US) | `en-GB` | English (UK) | +| `zh-CN` | Chinese (Simplified) | `ja-JP` | Japanese | +| `ko-KR` | Korean | `de-DE` | German | +| `fr-FR` | French | `fr-CA` | French (Canada) | +| `es-ES` | Spanish (Spain) | `es-US` | Spanish (US) | +| `it-IT` | Italian | `pt-BR` | Portuguese (Brazil) | +| `pt-PT` | Portuguese (Portugal) | `ru-RU` | Russian | +| `ar-AR` | Arabic | `hi-IN` | Hindi | +| `nl-NL` | Dutch | `pl-PL` | Polish | +| `sv-SE` | Swedish | `da-DK` | Danish | +| `fi-FI` | Finnish | `cs-CZ` | Czech | +| `ro-RO` | Romanian | `hu-HU` | Hungarian | +| `sk-SK` | Slovak | `bg-BG` | Bulgarian | +| `el-GR` | Greek | `et-EE` | Estonian | +| `hr-HR` | Croatian | `lt-LT` | Lithuanian | +| `lv-LV` | Latvian | `sl-SL` | Slovenian | +| `uk-UA` | Ukrainian | `he-IL` | Hebrew | +| `nb-NO` | Norwegian (Bokmål) | `nn-NO` | Norwegian (Nynorsk) | +| `th-TH` | Thai | `tr-TR` | Turkish | +| `vi-VN` | Vietnamese | `auto` | Auto-detect | + +```typescript +// Multilingual example +audioClient.settings.language = 'zh-CN'; // Chinese +audioClient.settings.language = 'auto'; // Auto-detect +``` + ### Embedded Web Service Start a local HTTP server that exposes an OpenAI-compatible API: diff --git a/sdk/rust/README.md b/sdk/rust/README.md index dbd9906a..ded4d5f4 100644 --- a/sdk/rust/README.md +++ b/sdk/rust/README.md @@ -441,6 +441,39 @@ while let Some(chunk) = stream.next().await { } ``` +#### Multilingual Language Codes + +When using a multilingual ASR model (e.g. `nvidia-nemotron-3.5-asr-streaming-multilingual-0.6b`), set `language` to a BCP-47 language code to hint the expected language. + +| Code | Language | Code | Language | +|------|----------|------|----------| +| `en-US` | English (US) | `en-GB` | English (UK) | +| `zh-CN` | Chinese (Simplified) | `ja-JP` | Japanese | +| `ko-KR` | Korean | `de-DE` | German | +| `fr-FR` | French | `fr-CA` | French (Canada) | +| `es-ES` | Spanish (Spain) | `es-US` | Spanish (US) | +| `it-IT` | Italian | `pt-BR` | Portuguese (Brazil) | +| `pt-PT` | Portuguese (Portugal) | `ru-RU` | Russian | +| `ar-AR` | Arabic | `hi-IN` | Hindi | +| `nl-NL` | Dutch | `pl-PL` | Polish | +| `sv-SE` | Swedish | `da-DK` | Danish | +| `fi-FI` | Finnish | `cs-CZ` | Czech | +| `ro-RO` | Romanian | `hu-HU` | Hungarian | +| `sk-SK` | Slovak | `bg-BG` | Bulgarian | +| `el-GR` | Greek | `et-EE` | Estonian | +| `hr-HR` | Croatian | `lt-LT` | Lithuanian | +| `lv-LV` | Latvian | `sl-SL` | Slovenian | +| `uk-UA` | Ukrainian | `he-IL` | Hebrew | +| `nb-NO` | Norwegian (Bokmål) | `nn-NO` | Norwegian (Nynorsk) | +| `th-TH` | Thai | `tr-TR` | Turkish | +| `vi-VN` | Vietnamese | `auto` | Auto-detect | + +```rust +// Multilingual example +let audio_client = model.create_audio_client() + .language("zh-CN"); // Chinese +``` + ### Embedded Web Service Start a local HTTP server that exposes an OpenAI-compatible REST API: