Cloud Speech-to-Text API

Converts audio to text by applying powerful neural network models.

Service: speech.googleapis.com

The Service namespeech.googleapis.com is needed to create RPC client stubs.

google.cloud.location.Locations

Methods
GetLocation Gets information about a location.
ListLocations Lists information about the supported locations for this service.

google.cloud.speech.v1.Adaptation

Methods
CreateCustomClass Create a custom class.
CreatePhraseSet Create a set of phrase hints.
DeleteCustomClass Delete a custom class.
DeletePhraseSet Delete a phrase set.
GetCustomClass Get a custom class.
GetPhraseSet Get a phrase set.
ListCustomClasses List custom classes.
ListPhraseSet List phrase sets.
UpdateCustomClass Update a custom class.
UpdatePhraseSet Update a phrase set.

google.cloud.speech.v1.Speech

Methods
LongRunningRecognize Performs asynchronous speech recognition: receive results via the google.longrunning.Operations interface.
Recognize Performs synchronous speech recognition: receive results after all audio has been sent and processed.
StreamingRecognize Performs bidirectional streaming speech recognition: receive results while sending audio.

google.cloud.speech.v1p1beta1.Adaptation

Methods
CreateCustomClass Create a custom class.
CreatePhraseSet Create a set of phrase hints.
DeleteCustomClass Delete a custom class.
DeletePhraseSet Delete a phrase set.
GetCustomClass Get a custom class.
GetPhraseSet Get a phrase set.
ListCustomClasses List custom classes.
ListPhraseSet List phrase sets.
UpdateCustomClass Update a custom class.
UpdatePhraseSet Update a phrase set.

google.cloud.speech.v1p1beta1.Speech

Methods
LongRunningRecognize Performs asynchronous speech recognition: receive results via the google.longrunning.Operations interface.
Recognize Performs synchronous speech recognition: receive results after all audio has been sent and processed.
StreamingRecognize Performs bidirectional streaming speech recognition: receive results while sending audio.

google.cloud.speech.v2.Speech

Methods
BatchRecognize Performs batch asynchronous speech recognition: send a request with N audio files and receive a long running operation that can be polled to see when the transcriptions are finished.
CreateCustomClass Creates aCustomClass.
CreatePhraseSet Creates aPhraseSet.
CreateRecognizer Creates aRecognizer.
DeleteCustomClass Deletes theCustomClass.
DeletePhraseSet Deletes thePhraseSet.
DeleteRecognizer Deletes theRecognizer.
GetConfig Returns the requestedConfig.
GetCustomClass Returns the requestedCustomClass.
GetPhraseSet Returns the requestedPhraseSet.
GetRecognizer Returns the requestedRecognizer.
ListCustomClasses Lists CustomClasses.
ListPhraseSets Lists PhraseSets.
ListRecognizers Lists Recognizers.
Recognize Performs synchronous Speech recognition: receive results after all audio has been sent and processed.
StreamingRecognize Performs bidirectional streaming speech recognition: receive results while sending audio.
UndeleteCustomClass Undeletes theCustomClass.
UndeletePhraseSet Undeletes thePhraseSet.
UndeleteRecognizer Undeletes theRecognizer.
UpdateConfig Updates theConfig.
UpdateCustomClass Updates theCustomClass.
UpdatePhraseSet Updates thePhraseSet.
UpdateRecognizer Updates theRecognizer.

google.longrunning.Operations

Methods
CancelOperation Starts asynchronous cancellation on a long-running operation.
DeleteOperation Deletes a long-running operation.
GetOperation Gets the latest state of a long-running operation.
ListOperations Lists operations that match the specified filter in the request.
WaitOperation Waits until the specified long-running operation is done or reaches at most a specified timeout, returning the latest state.

Except as otherwise noted, the content of this page is licensed under theCreative Commons Attribution 4.0 License, and code samples are licensed under theApache 2.0 License. For details, see theGoogle Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.

Last updated 2025-10-23 UTC.