Python Client for Cloud Speech API

imageimageimage

TheCloud Speech API enables developers to convert audio to text by applyingpowerful neural network models. The API recognizes over 80 languages andvariants, to support your global user base.

Quick Start

In order to use this library, you first need to go through the following steps:

  1. Select or create a Cloud Platform project.

  2. Enable billing for your project.

  3. Enable the Cloud Speech API.

  4. Setup Authentication.

Installation

Install this library in avirtualenv using pip.virtualenv is a tool tocreate isolated Python environments. The basic problem it addresses is one ofdependencies and versions, and indirectly permissions.

Withvirtualenv, it’s possible to install this library without needing systeminstall permissions, and without clashing with the installed systemdependencies.

Supported Python Versions

Python >= 3.5

Deprecated Python Versions

Python == 2.7

Mac/Linux

pip install virtualenvvirtualenv <your-env>source <your-env>/bin/activate<your-env>/bin/pip install google-cloud-speech

Windows

pip install virtualenvvirtualenv <your-env><your-env>\Scripts\activate<your-env>\Scripts\pip.exe install google-cloud-speech

Example Usage

from google.cloud import speech_v1from google.cloud.speech_v1 import enumsclient = speech_v1.SpeechClient()encoding = enums.RecognitionConfig.AudioEncoding.FLACsample_rate_hertz = 44100language_code = 'en-US'config = {'encoding': encoding, 'sample_rate_hertz': sample_rate_hertz, 'language_code': language_code}uri = 'gs://bucket_name/file_name.flac'audio = {'uri': uri}response = client.recognize(config, audio)

Next Steps

Except as otherwise noted, the content of this page is licensed under theCreative Commons Attribution 4.0 License, and code samples are licensed under theApache 2.0 License. For details, see theGoogle Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.

Last updated 2026-01-12 UTC.