chrome.ttsEngine

Description

Use the chrome.ttsEngine API to implement a text-to-speech (TTS) engine using an extension. If your extension registers using this API, it will receive events containing an utterance to be spoken and other parameters when any extension or Chrome App uses the tts API to generate speech. Your extension can then use any available web technology to synthesize and output the speech, and send events back to the calling function to report the status.

Permissions

ttsEngine

Concepts and usage

An extension can register itself as a speech engine. By doing so, it can intercept some or all calls to functions such as tts.speak() and tts.stop() and provide an alternate implementation. Extensions are free to use any available web technology to provide speech, including streaming audio from a server or HTML5 audio. An extension could even do something different with the utterances, like display closed captions in a popup or send them as log messages to a remote server.

To implement a TTS engine, an extension must declare the "ttsEngine" permission and then declare all voices it provides in the extension manifest, like this:

{
  "name": "My TTS Engine",
  "version": "1.0",
  "permissions": ["ttsEngine"],
  "tts_engine": {
    "voices": [
      {
        "voice_name": "Alice",
        "lang": "en-US",
        "event_types": ["start", "marker", "end"]
      },
      {
        "voice_name": "Pat",
        "lang": "en-US",
        "event_types": ["end"]
      }
    ]
  },
  "background": {
    "page": "background.html",
    "persistent": false
  }
}

An extension can specify any number of voices.

The voice_name parameter is required. The name should be descriptive enough that it identifies the name of the voice and the engine used. In the unlikely event that two extensions register voices with the same name, a client can specify the ID of the extension that should do the synthesis.

The lang parameter is optional, but highly recommended. Almost always, a voice can synthesize speech in just a single language. When an engine supports more than one language, it can easily register a separate voice for each language. Under rare circumstances where a single voice can handle more than one language, it's easiest to just list two separate voices and handle them using the same logic internally. However, if you want to create a voice that will handle utterances in any language, leave out the lang parameter from your extension's manifest.

Finally, the event_types parameter is required if the engine can send events to update the client on the progress of speech synthesis. At a minimum, supporting the 'end' event type to indicate when speech is finished is highly recommended; otherwise, Chrome cannot schedule queued utterances.

Once loaded, an extension can replace the list of declared voices by calling chrome.ttsEngine.updateVoices. (Note that the parameters used in the programmatic call to updateVoices are in camel case, e.g., voiceName, unlike the manifest file, which uses voice_name.)
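
The naming difference can be bridged with a small mapping step. The following is a minimal sketch, assuming a hypothetical helper named toTtsVoice that converts a manifest-style entry into the camel-case shape; the guard around the chrome call exists only so the snippet also loads outside an extension.

```javascript
// Hypothetical helper (not part of the API): convert a manifest-style voice
// entry (snake case) into the camel-case shape passed to updateVoices().
function toTtsVoice(manifestVoice) {
  const voice = { voiceName: manifestVoice.voice_name };
  if (manifestVoice.lang !== undefined) {
    voice.lang = manifestVoice.lang;
  }
  if (manifestVoice.event_types !== undefined) {
    voice.eventTypes = [...manifestVoice.event_types];
  }
  return voice;
}

const manifestVoices = [
  { voice_name: 'Alice', lang: 'en-US', event_types: ['start', 'marker', 'end'] },
  { voice_name: 'Pat', lang: 'en-US', event_types: ['end'] },
];
const voices = manifestVoices.map(toTtsVoice);

// Only call the API when running inside an extension.
if (typeof chrome !== 'undefined' && chrome.ttsEngine) {
  chrome.ttsEngine.updateVoices(voices);
}
```

Because the list passed to updateVoices replaces the declared voices, the array should contain every voice the engine wants to expose, not only the changed ones.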

Note: If your TTS engine does not support the 'end' event type, Chrome cannot queue utterances because it has no way of knowing when your utterance has finished. To help mitigate this, Chrome passes an additional boolean enqueue option to your engine's onSpeak handler, giving you the option of implementing your own queueing. This is discouraged because then clients are unable to queue utterances that should get spoken by different speech engines.

The possible event types that you can send correspond to the event types that the speak() method receives:

'start': The engine has started speaking the utterance.
'word': A word boundary was reached. Use event.charIndex to determine the current speech position.
'sentence': A sentence boundary was reached. Use event.charIndex to determine the current speech position.
'marker': An SSML marker was reached. Use event.charIndex to determine the current speech position.
'end': The engine has finished speaking the utterance.
'error': An engine-specific error occurred and this utterance cannot be spoken. Pass more information in event.errorMessage.

The 'interrupted' and 'cancelled' events are not sent by the speech engine; they are generated automatically by Chrome.
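
To illustrate how charIndex relates to the utterance text, here is a rough sketch that builds the event objects for one utterance. The helper names wordEvents and eventSequence are hypothetical, and splitting on whitespace is an assumption; a real engine would take word boundaries from its synthesizer.

```javascript
// Hypothetical helper: build one 'word' event per whitespace-separated word,
// with charIndex set to the offset where that word begins.
function wordEvents(utterance) {
  const events = [];
  const wordPattern = /\S+/g;
  let match;
  while ((match = wordPattern.exec(utterance)) !== null) {
    events.push({ type: 'word', charIndex: match.index });
  }
  return events;
}

// Event objects for one utterance: start, each word boundary, then end.
function eventSequence(utterance) {
  return [
    { type: 'start', charIndex: 0 },
    ...wordEvents(utterance),
    { type: 'end', charIndex: utterance.length },
  ];
}
```

In an onSpeak listener, each of these objects would be passed to sendTtsEvent as playback reaches the corresponding position, rather than all at once.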

Text-to-speech clients can get the voice information from your extension's manifest by calling tts.getVoices, assuming you've registered speech event listeners as described below.

Handle speech events

To generate speech at the request of clients, your extension must register listeners for both onSpeak and onStop, like this:

const speakListener = (utterance, options, sendTtsEvent) => {
  sendTtsEvent({type: 'start', charIndex: 0});
  // (start speaking)
  sendTtsEvent({type: 'end', charIndex: utterance.length});
};

const stopListener = () => {
  // (stop all speech)
};

chrome.ttsEngine.onSpeak.addListener(speakListener);
chrome.ttsEngine.onStop.addListener(stopListener);

Caution: If your extension does not register listeners for both onSpeak and onStop, it will not intercept any speech calls, regardless of what is in the manifest.

The decision of whether or not to send a given speech request to an extension is based solely on whether the extension supports the given voice parameters in its manifest and has registered listeners for onSpeak and onStop. In other words, there's no way for an extension to receive a speech request and dynamically decide whether to handle it.

Types

AudioBuffer

Chrome 92+

Parameters containing an audio buffer and associated data.

Properties

audioBuffer
ArrayBuffer
The audio buffer from the text-to-speech engine. It should have a length of exactly audioStreamOptions.bufferSize and be encoded as mono, at audioStreamOptions.sampleRate, as linear PCM with 32-bit signed floats, i.e. the Float32Array type in JavaScript.
charIndex
number optional
The character index associated with this audio buffer.
isLastBuffer
boolean optional
True if this audio buffer is the last for the text being spoken.
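
As a sketch of how these properties fit together, the hypothetical helper below splits already-synthesized mono samples into objects of this shape. Zero-padding the final chunk so that every buffer holds exactly bufferSize samples is an assumption made for this example, not documented behavior.

```javascript
// Hypothetical helper: split synthesized mono samples (a Float32Array) into
// AudioBuffer-shaped objects holding exactly bufferSize samples each.
// Assumption: the final chunk is padded with zeros (silence).
function toAudioBuffers(samples, bufferSize) {
  const buffers = [];
  const count = Math.max(1, Math.ceil(samples.length / bufferSize));
  for (let i = 0; i < count; i++) {
    const chunk = new Float32Array(bufferSize); // zero-filled by default
    chunk.set(samples.subarray(i * bufferSize, (i + 1) * bufferSize));
    buffers.push({
      audioBuffer: chunk.buffer,
      isLastBuffer: i === count - 1,
    });
  }
  return buffers;
}
```

In an onSpeakWithAudioStream listener, each object could then be handed to sendTtsAudio in order, with bufferSize taken from the audioStreamOptions argument.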

AudioStreamOptions

Chrome 92+

Contains the audio stream format expected to be produced by an engine.

Properties

bufferSize
number
The number of samples within an audio buffer.
sampleRate
number
The sample rate expected in an audio buffer.

LanguageInstallStatus

Chrome 132+

The install status of a voice.

Enum

"notInstalled"

"installing"

"installed"

"failed"

LanguageStatus

Chrome 132+

Install status of a language.

Properties

error
string optional
Detail about installation failures. Optionally populated if the language failed to install.
installStatus
LanguageInstallStatus
Installation status.
lang
string
Language string in the form of language code-region code, where the region may be omitted. Examples are en, en-AU, zh-CH.

SpeakOptions

Options specified to the tts.speak() method.

Properties

gender
VoiceGender optional
Deprecated since Chrome 92
Gender is deprecated and will be ignored.
Gender of voice for synthesized speech.
lang
string optional
The language to be used for synthesis, in the form language-region. Examples: 'en', 'en-US', 'en-GB', 'zh-CN'.
pitch
number optional
Speaking pitch between 0 and 2 inclusive, with 0 being lowest and 2 being highest. 1.0 corresponds to this voice's default pitch.
rate
number optional
Speaking rate relative to the default rate for this voice. 1.0 is the default rate, normally around 180 to 220 words per minute. 2.0 is twice as fast, and 0.5 is half as fast. This value is guaranteed to be between 0.1 and 10.0, inclusive. When a voice does not support this full range of rates, don't return an error. Instead, clip the rate to the range the voice supports.
voiceName
string optional
The name of the voice to use for synthesis.
volume
number optional
Speaking volume between 0 and 1 inclusive, with 0 being lowest and 1 being highest, with a default of 1.0.
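
The rate guidance above amounts to clamping the requested value. A minimal sketch, assuming a hypothetical engine whose voice only supports rates from 0.5 to 3.0:

```javascript
// Hypothetical limits for a voice that can only speak at 0.5x to 3x.
const MIN_RATE = 0.5;
const MAX_RATE = 3.0;

// Clip the requested rate to the supported range instead of reporting an error.
function effectiveRate(options) {
  const requested = options.rate ?? 1.0; // 1.0 is the default rate
  return Math.min(MAX_RATE, Math.max(MIN_RATE, requested));
}
```

With these limits, a request for a rate of 10.0 would be spoken at 3.0, and a request without a rate would use 1.0.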

TtsClient

Chrome 131+

Identifier for the client requesting status.

Properties

id
string
Client making a language management request. For an extension, this is the unique extension ID. For Chrome features, this is the human-readable name of the feature.
source
TtsClientSource
Type of requestor.

TtsClientSource

Chrome 131+

Type of requestor.

Enum

"chromefeature"

"extension"

VoiceGender

Chrome 54+ Deprecated since Chrome 70

Gender is deprecated and will be ignored.

Enum

"male"

"female"

Methods

updateLanguage()

Chrome 132+

chrome.ttsEngine.updateLanguage(
  status: LanguageStatus,
): void

Called by an engine when a language install is attempted, and when a language is uninstalled. Also called in response to a status request from a client. When a voice is installed or uninstalled, the engine should also call ttsEngine.updateVoices to register the voice.

Parameters

status
LanguageStatus
The install status of the language.

updateVoices()

Chrome 66+

chrome.ttsEngine.updateVoices(
  voices: TtsVoice[],
): void

Called by an engine to update its list of voices. This list overrides any voices declared in this extension's manifest.

Parameters

voices
TtsVoice[]
Array of tts.TtsVoice objects representing the available voices for speech synthesis.

Events

onInstallLanguageRequest

Chrome 131+

chrome.ttsEngine.onInstallLanguageRequest.addListener(
  callback: function,
)

Fired when a TTS client requests to install a new language. The engine should attempt to download and install the language, and call ttsEngine.updateLanguage with the result. On success, the engine should also call ttsEngine.updateVoices to register the newly available voices.

Parameters

callback
function
The callback parameter looks like:
```
(requestor: TtsClient, lang: string) => void
```
- requestor
  TtsClient
- lang
  string
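
The install flow described for this event could be sketched as follows. This is not a definitive implementation: makeInstallHandler and downloadLanguage are hypothetical names, and engine stands in for chrome.ttsEngine so that the logic can be exercised outside an extension.

```javascript
// Sketch of an install handler. `engine` stands in for chrome.ttsEngine.
// `downloadLanguage` is a hypothetical async function supplied by the engine;
// it is assumed to resolve to the engine's full, updated list of voices.
function makeInstallHandler(engine, downloadLanguage) {
  return async (requestor, lang) => {
    engine.updateLanguage({ lang, installStatus: 'installing' });
    try {
      const voices = await downloadLanguage(lang);
      engine.updateLanguage({ lang, installStatus: 'installed' });
      engine.updateVoices(voices); // register the newly available voices
    } catch (err) {
      engine.updateLanguage({
        lang,
        installStatus: 'failed',
        error: String(err && err.message ? err.message : err),
      });
    }
  };
}
```

Inside an extension, the returned function would be passed to chrome.ttsEngine.onInstallLanguageRequest.addListener, with chrome.ttsEngine as the engine argument.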

onLanguageStatusRequest

Chrome 132+

chrome.ttsEngine.onLanguageStatusRequest.addListener(
  callback: function,
)

Fired when a TTS client requests the install status of a language.

Parameters

callback
function
The callback parameter looks like:
```
(requestor: TtsClient, lang: string) => void
```
- requestor
  TtsClient
- lang
  string
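
Since updateLanguage is also called in response to a status request, a listener for this event might look like the sketch below. The installState map and the statusFor helper are hypothetical; a real engine would track install state in its own storage.

```javascript
// Hypothetical in-memory record of per-language install state.
const installState = new Map([['en-US', 'installed']]);

// Build the LanguageStatus to report for a language. Languages the engine
// has no record of are reported as not installed.
function statusFor(lang) {
  return { lang, installStatus: installState.get(lang) ?? 'notInstalled' };
}

// Only register the listener when running inside an extension.
if (typeof chrome !== 'undefined' && chrome.ttsEngine) {
  chrome.ttsEngine.onLanguageStatusRequest?.addListener((requestor, lang) => {
    chrome.ttsEngine.updateLanguage(statusFor(lang));
  });
}
```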

onPause

chrome.ttsEngine.onPause.addListener(
  callback: function,
)

Optional: if an engine supports the pause event, it should pause the current utterance being spoken, if any, until it receives a resume event or stop event. Note that a stop event should also clear the paused state.

Parameters

callback
function
The callback parameter looks like:
```
() => void
```

onResume

chrome.ttsEngine.onResume.addListener(
  callback: function,
)

Optional: if an engine supports the pause event, it should also support the resume event, to continue speaking the current utterance, if any. Note that a stop event should also clear the paused state.

Parameters

callback
function
The callback parameter looks like:
```
() => void
```
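
The pause, resume, and stop rules can be sketched as a small state tracker. The class and its state names are illustrative only; actual audio control is left out.

```javascript
// Minimal playback state tracker. State names are illustrative only.
class PlaybackState {
  constructor() {
    this.state = 'idle';
  }
  speak() {
    this.state = 'speaking';
  }
  // Pausing only has an effect while an utterance is being spoken.
  pause() {
    if (this.state === 'speaking') this.state = 'paused';
  }
  // Resuming only has an effect while paused.
  resume() {
    if (this.state === 'paused') this.state = 'speaking';
  }
  // Stop always returns to idle, which also clears a paused state.
  stop() {
    this.state = 'idle';
  }
}

const playback = new PlaybackState();

// Only register listeners when running inside an extension.
if (typeof chrome !== 'undefined' && chrome.ttsEngine) {
  chrome.ttsEngine.onPause.addListener(() => playback.pause());
  chrome.ttsEngine.onResume.addListener(() => playback.resume());
  chrome.ttsEngine.onStop.addListener(() => playback.stop());
}
```

Keeping stop unconditional means a stop received while already idle does nothing harmful, and a resume received after a stop is ignored.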

onSpeak

chrome.ttsEngine.onSpeak.addListener(
  callback: function,
)

Called when the user makes a call to tts.speak() and one of the voices from this extension's manifest is the first to match the options object.

Parameters

callback
function
The callback parameter looks like:
```
(utterance: string, options: SpeakOptions, sendTtsEvent: function) => void
```
- utterance
  string
- options
  SpeakOptions
- sendTtsEvent
  function
  The sendTtsEvent parameter looks like:
```
(event: tts.TtsEvent) => void
```
  - event
    tts.TtsEvent
    The event from the text-to-speech engine indicating the status of this utterance.

onSpeakWithAudioStream

Chrome 92+

chrome.ttsEngine.onSpeakWithAudioStream.addListener(
  callback: function,
)

Called when the user makes a call to tts.speak() and one of the voices from this extension's manifest is the first to match the options object. Differs from ttsEngine.onSpeak in that Chrome provides audio playback services and handles dispatching tts events.

Parameters

callback
function
The callback parameter looks like:
```
(utterance: string, options: SpeakOptions, audioStreamOptions: AudioStreamOptions, sendTtsAudio: function, sendError: function) => void
```
- utterance
  string
- options
  SpeakOptions
- audioStreamOptions
  AudioStreamOptions
- sendTtsAudio
  function
  The sendTtsAudio parameter looks like:
```
(audioBufferParams: AudioBuffer) => void
```
  - audioBufferParams
    AudioBuffer
    Parameters containing an audio buffer and associated data.
- sendError
  function
  Chrome 94+
  The sendError parameter looks like:
```
(errorMessage?: string) => void
```
  - errorMessage
    string optional
    A string describing the error.

onStop

chrome.ttsEngine.onStop.addListener(
  callback: function,
)

Fired when a call is made to tts.stop and this extension may be in the middle of speaking. If an extension receives a call to onStop and speech is already stopped, it should do nothing (not raise an error). If speech is in the paused state, this should cancel the paused state.

Parameters

callback
function
The callback parameter looks like:
```
() => void
```

onUninstallLanguageRequest

Chrome 132+

chrome.ttsEngine.onUninstallLanguageRequest.addListener(
  callback: function,
)

Fired when a TTS client indicates a language is no longer needed.

Parameters

callback
function
The callback parameter looks like:
```
(requestor: TtsClient, lang: string, uninstallOptions: LanguageUninstallOptions) => void
```
- requestor
  TtsClient
- lang
  string
- uninstallOptions
  LanguageUninstallOptions

Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.

Last updated 2025-08-11 UTC.
