Name: Nova-3 Medical API
Brand: Deepgram

Nova-3 Medical

Nova-3 Medical: a specialized speech-to-text model by Deepgram, fine-tuned for clinical terminology, healthcare audio, and medical transcription workflows.

Deepgram Nova-3 Medical is a domain-specialized variant of Nova-3, fine-tuned on clinical and healthcare audio datasets. It targets higher accuracy than general-purpose models on medical terminology (prescription names, diagnoses, procedures, anatomical terms), which makes it well suited to clinical documentation, telehealth platforms, EHR integrations, and medical dictation tools.

Technical Specifications

Performance Benchmarks

Significantly lower word error rate on medical terminology vs. general-purpose models.
Accurately recognizes drug names, ICD-10 diagnoses, and procedural terms.
Real-time streaming with sub-300 ms latency for live clinical workflows.
Handles noisy clinical environments: background equipment sounds, phone audio.
Designed to reduce manual review burden in medical documentation.
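
Word error rate (WER) is conventionally computed as the word-level edit distance between a reference transcript and the model output, divided by the number of reference words. The sketch below is one way to check the accuracy claims above against your own audio. The function name and sample sentences are illustrative, and the lowercase, whitespace-based tokenization is a simplifying assumption, not Deepgram's evaluation method.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,                 # deletion
                curr[j - 1] + 1,             # insertion
                prev[j - 1] + substitution,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)


# Hypothetical example: a drug name split into two wrong words counts as
# one substitution plus one insertion against four reference words.
reference = "patient takes metoprolol daily"
hypothesis = "patient takes metro pole daily"
print(word_error_rate(reference, hypothesis))
```

Running the same reference transcripts through a general-purpose model and through Nova-3 Medical, then comparing the two scores, gives a like-for-like view of the difference on your own recordings.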

Architecture Breakdown

Nova-3 Medical is built on the same end-to-end deep learning engine as Nova-3, with an additional fine-tuning stage on curated medical audio corpora. The model's vocabulary and language model are biased toward clinical terminology, enabling reliable transcription of complex medical speech without custom vocabulary configuration.

Pricing

$0.00559 / min
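
To put the per-minute rate in context, the sketch below estimates spend for a given amount of audio. It assumes billing is strictly proportional to duration, with no rounding up to whole minutes, minimum charge, or volume discount; none of those details are stated here, so treat the output as a rough estimate. The function name is illustrative.

```python
from decimal import Decimal

# Listed rate, in US dollars per minute of audio.
PRICE_PER_MINUTE_USD = Decimal("0.00559")


def estimate_cost_usd(audio_seconds: float) -> Decimal:
    """Estimate cost assuming purely proportional, per-second billing."""
    if audio_seconds < 0:
        raise ValueError("audio_seconds must be non-negative")
    minutes = Decimal(str(audio_seconds)) / Decimal(60)
    # Round to five decimal places, the precision of the listed rate.
    return (minutes * PRICE_PER_MINUTE_USD).quantize(Decimal("0.00001"))


print(estimate_cost_usd(15 * 60))         # one 15-minute visit
print(estimate_cost_usd(1000 * 60 * 60))  # 1,000 hours of dictation
```

Under these assumptions a 15-minute visit costs under a tenth of a dollar, and 1,000 hours of audio comes to roughly $335.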

Core Features & Capabilities

Medical Terminology Recognition: Accurate transcription of drug names, diagnoses, symptoms, and procedures.
Streaming Transcription: Real-time clinical dictation via WebSocket.
Speaker Diarization: Labels clinician and patient voices separately.
Smart Formatting: Handles medical number formats, dosages, and dates.
Entity Detection: Extracts medical entities from audio for downstream processing.
Custom Vocabulary (keyterm): Add institution-specific or rare drug names to further boost accuracy.
Dictation Mode: Optimized for voice-driven clinical note-taking workflows.
Filler Word Detection: Identifies hesitation markers ("um", "uh") common in dictation speech so they can be kept out of the final note.
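
Several of the features above are typically switched on through query parameters on the transcription request. The sketch below only builds the request URL and sends nothing. The endpoint, the model identifier `nova-3-medical`, and the parameter names `smart_format` and `diarize` are assumptions drawn from Deepgram's public API conventions rather than from this page, and `build_listen_url` is a hypothetical helper; confirm each name against the current API reference before use.

```python
from urllib.parse import urlencode

# Assumed pre-recorded transcription endpoint (not stated on this page).
BASE_URL = "https://api.deepgram.com/v1/listen"


def build_listen_url(keyterms=(), *, diarize=True, smart_format=True):
    """Build a request URL that enables the features described above."""
    params = [
        ("model", "nova-3-medical"),  # assumed model identifier
        ("smart_format", str(smart_format).lower()),
        ("diarize", str(diarize).lower()),
    ]
    # Repeat the keyterm parameter once per term, for example for
    # institution-specific or rare drug names.
    params.extend(("keyterm", term) for term in keyterms)
    return f"{BASE_URL}?{urlencode(params)}"


# Hypothetical key terms, for illustration only.
print(build_listen_url(["tirzepatide 5 mg", "Ozempic"]))
```

The resulting URL would be paired with an authenticated POST carrying the audio. For live dictation the same parameters would apply to a WebSocket connection, which is also an assumption here.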

Comparison with Other Models

VS Deepgram Nova-3: Nova-3 Medical is fine-tuned for clinical audio and priced lower per minute; Nova-3 is the general-purpose English model for non-medical use cases.
VS Deepgram Nova-3 General: Nova-3 Medical is domain-specialized for healthcare; Nova-3 General covers 30+ languages for general multilingual workflows.
VS AssemblyAI Slam-1: Nova-3 Medical focuses on clinical ASR accuracy; Slam-1 provides semantic understanding and prompt-based customization for enterprise transcription workflows.

