Try Live STT Models
Compare all STTTest Deepgram Nova-2 Live
Upload your own audio file (not huge for now) and get an instant transcript from Deepgram Nova-2. No login required.
Try Deepgram Nova-2 on your audio
Drop a file below. We have pre-selected Deepgram Nova-2 for you.
Input Source
Click or drag audio file here
Supports MP3, M4A, WAV, OGG
Max file size: 100MB
Configuration
Advanced Options
Enables raw parameter access for Deepgram Nova-2. Disables universal options.
Provides a hint for the minimum and maximum number of expected speakers to improve diarization accuracy.
Boosts the recognition probability of specific words or phrases, such as proper nouns or domain-specific terms. Provide one phrase per line.
Note: Files and transcripts are not stored on our servers and are used only to complete your request. More features are coming.
Technical Specifications
Configurable Parameters
These universal options are mapped to provider-specific features.
languagestringLanguage
Primary language of the audio
Capabilities
- Diarization
- Profanity_filter
- Punctuation
- Smart_formatting
- Word_boost
Native Configuration
These are the provider's native API parameters — shown exactly as exposed by the vendor.
diarizeRecognize speaker changes. Each word will be assigned a speaker number starting at 0.
punctuateAdd punctuation and capitalization to the transcript.
paragraphsSplits audio into paragraphs to improve transcript readability.
utterancesSegments speech into meaningful semantic units with speaker attribution.
smart_formatApply formatting to improve readability (dates, times, currency, phone numbers).
filler_wordsTranscribe interruptions in your audio like 'uh' and 'um'.
numeralsConvert numbers from written format to numerical format (e.g., 'twenty' to '20').
measurementsConvert spoken measurements to their corresponding abbreviations (e.g., 'meters' to 'm').
dictationFormat transcript with dictated speech (e.g., 'comma', 'period', 'new paragraph').
profanity_filterConvert profanity to the nearest non-profane word or remove it completely.
redactRedact sensitive information from transcripts. Select common entity groups.
replaceSearch for terms/phrases and replace them. Format: 'find:replace' (e.g., 'Acme:Company').
searchSearch for specific terms or phrases in the audio. Results include timestamps and confidence scores.
keywordsBoost accuracy for specific words or phrases. Provide comma-separated list.
detect_entitiesIdentify and extract key entities like names, dates, locations, organizations.
sentimentAnalyze sentiment throughout the transcript (positive, negative, neutral).
summarizeGenerate a summary of the audio content.
topicsDetect topics throughout the transcript.
intentsRecognize speaker intent throughout the transcript.
multichannelTranscribe each audio channel independently.
detect_languageAutomatically detect the dominant language spoken in the audio.
utt_splitDefault: 0.8Seconds to wait before detecting a pause between words (default: 0.8).
tagLabel your requests for identification during usage reporting.
About Deepgram Nova-2
Deepgram Nova-2 model for fast, accurate speech-to-text with diarization and audio intelligence options
Pricing
A detailed pricing breakdown will be available here shortly. For now, please refer to the provider's official website.