Voice
Active

Nova-3 General

Supports streaming and batch transcription, automatic language detection, speaker diarization, and advanced audio intelligence for multilingual workflows.

Nova-3 General is a multilingual speech-to-text model by Deepgram that offers automatic language detection and high-accuracy transcription across 30+ languages.

Deepgram Nova-3 General extends the Nova-3 architecture with multilingual capabilities, supporting 30+ languages with automatic language detection. It is designed for global voice applications, multilingual contact centers, international content pipelines, and cross-language analytics, and it does not require the input language to be specified in advance.

Technical Specifications

Performance Benchmarks

  • Supports 30+ languages with competitive per-language word error rates.
  • Automatic language detection runs in parallel with transcription.
  • Sub-second latency in streaming mode.
  • Handles code-switching in select language pairs.
  • Consistent accuracy across diverse accents and recording conditions.

Architecture Breakdown

Nova-3 General shares the same end-to-end deep learning foundation as Nova-3, extended with a multilingual language head trained on diverse language corpora. Language identification is embedded in the transcription pipeline, removing the need for a separate detection step.
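
Because language identification is described as part of the transcription pipeline, a client could in principle read language labels straight from the transcript rather than calling a separate detector. The sketch below shows one way to consume such labels. The payload shape (a list of word dictionaries with `word` and `language` keys) and the helper name `language_runs` are assumptions made for illustration; this page does not document the actual response format.

```python
from itertools import groupby


def language_runs(words):
    """Collapse consecutive words that share a language tag into (language, text) runs."""
    runs = []
    # groupby only merges adjacent items, which preserves the order of code-switches.
    for lang, group in groupby(words, key=lambda w: w.get("language", "unknown")):
        runs.append((lang, " ".join(w["word"] for w in group)))
    return runs


# Hypothetical word list for illustration; this is not real model output.
sample_words = [
    {"word": "hello", "language": "en"},
    {"word": "everyone", "language": "en"},
    {"word": "buenos", "language": "es"},
    {"word": "días", "language": "es"},
    {"word": "thanks", "language": "en"},
]

runs = language_runs(sample_words)
for lang, text in runs:
    print(f"[{lang}] {text}")
```

Grouping only adjacent words keeps each switch between languages visible as its own run, which is usually what a code-switching review needs.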

Pricing

  • $0.01001 / min
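
As a rough budgeting aid, the per-minute rate above can be turned into a cost estimate. This is a minimal sketch that assumes usage is prorated by the second with no minimum charge or rounding up to whole minutes; the page does not state the billing granularity, so treat the function `estimate_cost` and its rounding as hypothetical.

```python
from decimal import Decimal, ROUND_HALF_UP

# Listed rate in USD per minute of audio.
PRICE_PER_MINUTE = Decimal("0.01001")


def estimate_cost(audio_seconds):
    """Estimate USD cost for a whole number of audio seconds (assumes per-second proration)."""
    minutes = Decimal(audio_seconds) / Decimal(60)
    # Round to four decimal places for display; actual invoices may round differently.
    return (minutes * PRICE_PER_MINUTE).quantize(Decimal("0.0001"), rounding=ROUND_HALF_UP)


one_hour = estimate_cost(3600)
print(f"1 hour of audio: ${one_hour}")
```

`Decimal` is used instead of floats so that the five-decimal rate does not pick up binary rounding noise when multiplied across large volumes.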

Core Features & Capabilities

  • Multilingual Streaming: Real-time transcription across 30+ languages via WebSocket.
  • Auto Language Detection: Dynamically identifies spoken language — no pre-configuration needed.
  • Speaker Diarization: Labels individual speakers across multilingual audio sessions.
  • Smart Formatting: Locale-aware number, date, and punctuation formatting.
  • Intent & Topic Detection: Custom and model-detected intents and topics across languages.
  • Entity Detection: Extracts key entities from multilingual audio content.
  • Custom Vocabulary (keyterm): Add domain-specific terms per language to improve accuracy.
  • Utterance Segmentation: Segments multilingual streams into labeled speech units.
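
The features listed above are typically switched on per request. The following sketch builds a query string that enables several of them. The parameter names (`model`, `language`, `diarize`, `smart_format`, `utterances`, `keyterm`) and the values `nova-3` and `multi` are modeled on Deepgram's public query options and are assumptions here; the endpoint that serves this model may expect different names, so check its API reference before relying on them. The helper `build_query` is hypothetical.

```python
from urllib.parse import urlencode


def build_query(keyterms=(), diarize=True, smart_format=True):
    """Build a transcription query string; all parameter names are assumed, not confirmed."""
    params = [
        ("model", "nova-3"),      # assumed model identifier
        ("language", "multi"),    # assumed value for multilingual mode
        ("diarize", str(diarize).lower()),
        ("smart_format", str(smart_format).lower()),
        ("utterances", "true"),
    ]
    # Repeat the keyterm parameter once per domain-specific term.
    params.extend(("keyterm", term) for term in keyterms)
    return urlencode(params)


query = build_query(keyterms=["Nova-3", "diarization"])
print(query)
```

Passing a list of tuples to `urlencode` keeps the parameter order stable and allows the same key to appear more than once, which a plain dictionary would not.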

Comparison with Other Models

  • VS Deepgram Nova-3: Nova-3 General adds multilingual support at the same price; Nova-3 offers maximum accuracy for English-only use cases.
  • VS Deepgram Nova-3 Medical: Nova-3 General is the general-purpose multilingual model; Nova-3 Medical is specialized for healthcare audio at a lower price per minute.

