Test Google Chirp 2 Live

Upload your own audio file (up to 100MB for now) and get an instant transcript from Google Chirp 2. No login required.

Try Google Chirp 2 on your audio

Drop a file below. We have pre-selected Google Chirp 2 for you.

Input Source

Click or drag audio file here

Supports MP3, M4A, WAV, OGG

Max file size: 100MB
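The format and size limits above can be checked before a file is uploaded. This is a minimal sketch, not the page's own code: the function name `check_upload` is hypothetical, and reading "100MB" as 100 * 1024 * 1024 bytes is an assumption.

```python
from pathlib import Path

# Limits taken from the upload widget. Treating "MB" as 1024 * 1024 bytes
# is an assumption; the page does not say which unit it uses.
SUPPORTED_EXTENSIONS = {".mp3", ".m4a", ".wav", ".ogg"}
MAX_FILE_SIZE_BYTES = 100 * 1024 * 1024


def check_upload(filename: str, size_bytes: int) -> list[str]:
    """Return a list of problems; an empty list means the file looks acceptable."""
    problems = []
    extension = Path(filename).suffix.lower()
    if extension not in SUPPORTED_EXTENSIONS:
        problems.append(f"unsupported format: {extension or 'no extension'}")
    if size_bytes > MAX_FILE_SIZE_BYTES:
        problems.append("file exceeds the 100MB limit")
    return problems
```

The check looks only at the file extension, so a mislabeled file would still pass; inspecting the actual container format would need a separate step.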

Configuration

Advanced Options

Enables raw parameter access for Google Chirp 2. Disables universal options.

Provides a hint for the minimum and maximum number of expected speakers to improve diarization accuracy.

Boosts the recognition probability of specific words or phrases, such as proper nouns or domain-specific terms. Provide one phrase per line.

Not supported by all selected models
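The speaker hint and phrase boost inputs described above could be normalized roughly as follows. This is a sketch under assumptions: the helper names and the `min_speakers` / `max_speakers` keys are hypothetical, and dropping duplicate phrases is a choice made here, not documented behavior.

```python
def speaker_hint(min_speakers: int, max_speakers: int) -> dict:
    """Validate the expected speaker range and return it as a plain dict."""
    if min_speakers < 1 or max_speakers < min_speakers:
        raise ValueError("expected 1 <= min_speakers <= max_speakers")
    # Key names are hypothetical; the page does not show the payload shape.
    return {"min_speakers": min_speakers, "max_speakers": max_speakers}


def parse_boost_phrases(text: str) -> list[str]:
    """Split text-box input into phrases: one per line, trimmed, blanks dropped."""
    phrases = []
    seen = set()
    for line in text.splitlines():
        phrase = line.strip()
        key = phrase.casefold()  # case-insensitive duplicate check (an assumption)
        if phrase and key not in seen:
            seen.add(key)
            phrases.append(phrase)
    return phrases
```

For example, `parse_boost_phrases("Chirp\n\n  Kubernetes \nchirp\n")` keeps the first spelling of each phrase and ignores the blank line.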

Note: Files and transcripts are not stored on our servers and are used only to complete your request. More features are coming.

Technical Specifications

Configurable Parameters

These universal options are mapped to provider-specific features.

language (string)

Language

Primary language of the audio

Capabilities

  • Diarization
  • Diarization_config
  • Profanity_filter
  • Punctuation
  • Word_boost
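One way to picture the mapping from universal options to provider-specific parameters is a lookup table. The sketch below is illustrative only: the pairing of each universal option with a native parameter name is an assumption based on the names listed on this page, and `to_native` is a hypothetical helper. The diarization options are left unmapped because no native diarization parameter is listed here.

```python
# Hypothetical mapping from universal option names to native parameter names
# that appear on this page. The pairing is an assumption, not the tool's code.
UNIVERSAL_TO_NATIVE = {
    "punctuation": "enable_automatic_punctuation",
    "profanity_filter": "profanity_filter",
    "word_boost": "adaptation_phrase_sets",
}


def to_native(universal: dict) -> tuple[dict, list[str]]:
    """Translate universal options; return (native_params, unmapped_option_names)."""
    native, unmapped = {}, []
    for name, value in universal.items():
        target = UNIVERSAL_TO_NATIVE.get(name)
        if target is None:
            unmapped.append(name)  # no known native counterpart
        else:
            native[target] = value
    return native, unmapped
```

Returning the unmapped names instead of silently dropping them makes it visible when an option has no native equivalent.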

Native Configuration

These are the provider's native API parameters — shown exactly as exposed by the vendor.

encoding (Default: LINEAR16)

The encoding of the audio data sent in the request.

sample_rate_hertz (Default: 16000)

Sample rate in Hertz of the audio data sent.

audio_channel_count (Default: 1)

The number of channels in the input audio data.

enable_separate_recognition_per_channel

If true, each audio channel will be recognized separately.

max_alternatives (Default: 1)

Maximum number of recognition hypotheses to be returned.

enable_word_time_offsets (Default: true)

If true, the top result includes a list of words and the start and end time offsets.

enable_automatic_punctuation

If true, adds punctuation to recognition result hypotheses.

enable_spoken_punctuation

If true, replaces spoken punctuation with the corresponding symbols.

enable_spoken_emojis

If true, replaces spoken emojis with Unicode characters.

profanity_filter

If set to true, the server will attempt to filter out profanities.

model (Default: chirp_2)

Which model to select for the given request.

adaptation_phrase_sets

Inline phrase lists (one phrase per item) or references to existing PhraseSet resources (projects/{project}/locations/{location}/phraseSets/{id}). CustomClasses are supported by the API but not exposed in the UI by default.
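The native parameters above can be collected into a single configuration with the listed defaults, and the two kinds of `adaptation_phrase_sets` entries can be told apart by their shape. This is a sketch using plain dictionaries rather than the vendor's client library; `build_native_config` and `split_phrase_sets` are hypothetical helpers, and the regular expression only checks the resource-name pattern quoted above.

```python
import re

# Defaults exactly as listed on this page.
NATIVE_DEFAULTS = {
    "encoding": "LINEAR16",
    "sample_rate_hertz": 16000,
    "audio_channel_count": 1,
    "max_alternatives": 1,
    "enable_word_time_offsets": True,
    "model": "chirp_2",
}

# Parameters listed without a default; they are only sent when set explicitly.
OPTIONAL_PARAMS = {
    "enable_separate_recognition_per_channel",
    "enable_automatic_punctuation",
    "enable_spoken_punctuation",
    "enable_spoken_emojis",
    "profanity_filter",
    "adaptation_phrase_sets",
}

# Shape of a PhraseSet resource reference, as quoted in the description above.
PHRASE_SET_REF = re.compile(r"projects/[^/]+/locations/[^/]+/phraseSets/[^/]+")


def build_native_config(overrides: dict) -> dict:
    """Merge overrides onto the listed defaults, rejecting unknown names."""
    unknown = set(overrides) - set(NATIVE_DEFAULTS) - OPTIONAL_PARAMS
    if unknown:
        raise ValueError(f"unknown parameters: {sorted(unknown)}")
    return {**NATIVE_DEFAULTS, **overrides}


def split_phrase_sets(entries: list[str]) -> tuple[list[str], list[str]]:
    """Separate inline phrases from PhraseSet resource references."""
    inline, references = [], []
    for entry in entries:
        (references if PHRASE_SET_REF.fullmatch(entry) else inline).append(entry)
    return inline, references
```

Rejecting unknown names up front catches typos such as `sample_rate_hz` before a request is sent.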

About Google Chirp 2

Google Chirp 2 model: modern conversational and short-form transcription

Pricing

A detailed pricing breakdown will be available here shortly. For now, please refer to the provider's official website.

View Google Chirp 2 official documentation
