
Speech Recognition & Synthesis

From Wikipedia, the free encyclopedia
Screen reader application by Google
Developer: Google
Initial release: 10 October 2013
Stable release: 20251007.02/p0 (Build 816430913) / 28 October 2025[1][2]
Operating system: Android 8+
Type: Screen reader

Speech Recognition & Synthesis, formerly known as Speech Services,[3] is a screen reader application developed by Google for its Android operating system. It lets applications read aloud (speak) the text on the screen, with support for many languages. Text-to-Speech may be used by apps such as Google Play Books for reading books aloud, Google Translate for reading translations aloud to demonstrate the pronunciation of words, Google TalkBack, and other spoken-feedback accessibility applications, as well as by third-party apps. Users must install voice data for each language.

Supported languages

  • Afrikaans (South Africa)
  • Albanian (Albania)
  • Amharic (Ethiopia)
  • Arabic (Saudi Arabia)
  • Assamese (India)
  • Basque (Spain)
  • Bengali (Bangladesh)
  • Bengali (India)
  • Bodo (India)
  • Bosnian (Bosnia and Herzegovina)
  • Bulgarian (Bulgaria)
  • Burmese (Myanmar)
  • Cantonese (Hong Kong)
  • Catalan (Spain)
  • Chinese (China)
  • Chinese (Taiwan)
  • Croatian (Croatia)
  • Czech (Czech Republic)
  • Danish (Denmark)
  • Dogri (India)
  • Dutch (Belgium)
  • Dutch (Netherlands)
  • English (Australia)
  • English (Nigeria)
  • English (India)
  • English (United Kingdom)
  • English (United States)
  • Estonian (Estonia)
  • Filipino (Philippines)
  • Finnish (Finland)
  • French (Canada)
  • French (France)
  • Galician (Spain)
  • German (Germany)
  • Greek (Greece)
  • Gujarati (India)
  • Hausa (Nigeria)
  • Hebrew (Israel)
  • Hindi (India)
  • Hungarian (Hungary)
  • Icelandic (Iceland)
  • Indonesian (Indonesia)
  • Italian (Italy)
  • Japanese (Japan)
  • Javanese (Indonesia)
  • Kannada (India)
  • Kashmiri (India)
  • Khmer (Cambodia)
  • Konkani (India)
  • Korean (South Korea)
  • Latin (Vatican City)
  • Latvian (Latvia)
  • Lithuanian (Lithuania)
  • Maithili (India)
  • Malay (Malaysia)
  • Malayalam (India)
  • Manipuri (India)
  • Marathi (India)
  • Nepali (Nepal)
  • Norwegian (Norway)
  • Odia (India)
  • Polish (Poland)
  • Portuguese (Brazil)
  • Portuguese (Portugal)
  • Punjabi (India)
  • Romanian (Romania)
  • Russian (Russia)
  • Sanskrit (India)
  • Santali (India)
  • Serbian (Serbia)
  • Sindhi (India)
  • Sinhala (Sri Lanka)
  • Slovak (Slovakia)
  • Slovenian (Slovenia)
  • Spanish (Spain)
  • Spanish (United States)
  • Sundanese (Indonesia)
  • Swahili (Kenya)
  • Swedish (Sweden)
  • Tamil (India)
  • Telugu (India)
  • Thai (Thailand)
  • Turkish (Turkey)
  • Ukrainian (Ukraine)
  • Urdu (Pakistan)
  • Urdu (India)
  • Vietnamese (Vietnam)
  • Welsh (United Kingdom)

History


Some developers, such as Hyundai in 2015, have adapted their Android Auto apps to include Text-to-Speech.[4] Apps such as textPlus and WhatsApp use Text-to-Speech to read notifications aloud and provide voice-reply functionality.

Google Cloud Text-to-Speech is powered by WaveNet,[5] software created by Google's UK-based AI subsidiary DeepMind, which Google bought in 2014.[6] Google positions the service as distinct from those of its competitors, Amazon and Microsoft.[7]

Most voice synthesizers (including Apple's Siri) use concatenative synthesis,[5] in which a program stores individual phonemes and then pieces them together to form words and sentences. WaveNet synthesizes speech with human-like emphasis and inflection on syllables, phonemes, and words. Unlike most other text-to-speech systems, a WaveNet model creates raw audio waveforms from scratch. The model uses a neural network that has been trained using a large volume of speech samples. During training, the network extracts the underlying structure of the speech, such as which tones follow each other and what a realistic speech waveform looks like. When given a text input, the trained WaveNet model can generate the corresponding speech waveforms from scratch, one sample at a time, with up to 24,000 samples per second and smooth transitions between the individual sounds.[5]
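
The contrast described above can be illustrated with a minimal Python sketch. It is a toy under stated assumptions, not WaveNet or any Google implementation: the function names, the unit bank, and the 440 Hz test tone are all hypothetical. In particular, toy_next_sample stands in for the trained neural network with a fixed two-tap recurrence, purely to show the "one sample at a time" loop running at the 24,000 samples per second quoted in the text.

```python
import math

SAMPLE_RATE = 24_000  # samples per second, the figure quoted in the article


def concatenative(phonemes, unit_bank):
    """Concatenative synthesis: look up a stored unit per phoneme and join them."""
    audio = []
    for phoneme in phonemes:
        audio.extend(unit_bank[phoneme])  # pre-recorded samples, pieced together
    return audio


def toy_next_sample(history, freq_hz=440.0):
    """Hypothetical stand-in for a trained network.

    Predicts the next sample from the two previous ones using the identity
    sin((k+1)w) = 2*cos(w)*sin(k*w) - sin((k-1)w). A real WaveNet would use a
    deep neural network conditioned on the input text instead.
    """
    w = 2 * math.pi * freq_hz / SAMPLE_RATE
    return 2 * math.cos(w) * history[-1] - history[-2]


def autoregressive(duration_s, predict=toy_next_sample, freq_hz=440.0):
    """Generate a waveform one sample at a time, feeding each output back in."""
    n = round(duration_s * SAMPLE_RATE)
    w = 2 * math.pi * freq_hz / SAMPLE_RATE
    samples = [0.0, math.sin(w)]  # two seed samples to start the loop
    while len(samples) < n:
        samples.append(predict(samples))
    return samples[:n]


if __name__ == "__main__":
    bank = {"h": [0.1, 0.2], "i": [0.3]}  # made-up units for illustration
    print(len(concatenative(["h", "i"], bank)), "samples stitched from stored units")
    print(len(autoregressive(1.0)), "samples generated for one second of audio")
```

The concatenative path only copies stored audio, so its output quality is bounded by the recorded units and the joins between them. The autoregressive path computes every sample from the samples before it, which is why one second of output at this rate requires 24,000 sequential prediction steps.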

The service was renamed Speech Recognition & Synthesis in 2023.[citation needed]


References

  1. "Speech Recognition & Synthesis". Google Play. Retrieved 2025-11-12.
  2. "Speech Recognition & Synthesis googletts.google-speech-apk_20251007.02_p0.816430913". APKMirror. 2025-10-28. Retrieved 2025-11-12.
  3. Wang, Jules (November 8, 2021). "You'll never guess the latest Google app to cross 10 billion installs (seriously)". Android Police. Archived from the original on November 8, 2021. Retrieved November 18, 2021.
  4. "Google, Hyundai show off new third-party Android Auto apps". CNET. CBS Interactive. Retrieved 17 January 2015.
  5. "WaveNet". www.deepmind.com. Retrieved 2023-06-22.
  6. Gibbs, Samuel (2014-01-27). "Google buys UK artificial intelligence startup Deepmind for £400m". The Guardian. ISSN 0261-3077. Retrieved 2023-06-22.
  7. "Text-to-Speech AI: Lifelike Speech Synthesis". Google Cloud. Retrieved 2023-06-22.
