Python Client for Cloud Speech API

TheCloud Speech API enables developers to convert audio to text by applyingpowerful neural network models. The API recognizes over 80 languages andvariants, to support your global user base.

Quick Start

In order to use this library, you first need to go through the following steps:

Installation

Install this library in avirtualenv using pip.virtualenv is a tool tocreate isolated Python environments. The basic problem it addresses is one ofdependencies and versions, and indirectly permissions.

Withvirtualenv, it’s possible to install this library without needing systeminstall permissions, and without clashing with the installed systemdependencies.

Supported Python Versions

Python >= 3.5

Deprecated Python Versions

Python == 2.7

Mac/Linux

pip install virtualenvvirtualenv <your-env>source <your-env>/bin/activate<your-env>/bin/pip install google-cloud-speech

Windows

pip install virtualenvvirtualenv <your-env><your-env>\Scripts\activate<your-env>\Scripts\pip.exe install google-cloud-speech

Example Usage

from google.cloud import speech_v1from google.cloud.speech_v1 import enumsclient = speech_v1.SpeechClient()encoding = enums.RecognitionConfig.AudioEncoding.FLACsample_rate_hertz = 44100language_code = 'en-US'config = {'encoding': encoding, 'sample_rate_hertz': sample_rate_hertz, 'language_code': language_code}uri = 'gs://bucket_name/file_name.flac'audio = {'uri': uri}response = client.recognize(config, audio)

Next Steps

Read theClient Library Documentation for Cloud Speech APIAPI to see other available methods on the client.
Read theProduct documentation to learnmore about the product and see How-to Guides.APIs that we cover.

Except as otherwise noted, the content of this page is licensed under theCreative Commons Attribution 4.0 License, and code samples are licensed under theApache 2.0 License. For details, see theGoogle Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.

Last updated 2026-01-12 UTC.

Movatterモバイル変換