Movatterモバイル変換


[0]ホーム

URL:


  1. Web
  2. Web APIs
  3. SpeechRecognition

SpeechRecognition

Limited availability

This feature is not Baseline because it does not work in some of the most widely-used browsers.

TheSpeechRecognition interface of theWeb Speech API is the controller interface for the recognition service; this also handles theSpeechRecognitionEvent sent from the recognition service.

Note:On some browsers, like Chrome, using Speech Recognition on a web page involves a server-based recognition engine. Your audio is sent to a web service for recognition processing, so it won't work offline.

EventTarget SpeechRecognition

Constructor

SpeechRecognition()

Creates a newSpeechRecognition object.

Instance properties

SpeechRecognition also inherits properties from its parent interface,EventTarget.

SpeechRecognition.lang

Returns and sets the language of the currentSpeechRecognition. If not specified, this defaults to the HTMLlang attribute value, or the user agent's language setting if that isn't set either.

SpeechRecognition.continuous

Controls whether continuous results are returned for each recognition, or only a single result. Defaults to single (false.)

SpeechRecognition.interimResults

Controls whether interim results should be returned (true) or not (false.) Interim results are results that are not yet final (e.g., theSpeechRecognitionResult.isFinal property isfalse.)

SpeechRecognition.maxAlternatives

Sets the maximum number ofSpeechRecognitionAlternatives provided per result. The default value is 1.

SpeechRecognition.phrasesExperimental

Sets an array ofSpeechRecognitionPhrase objects to be used forcontextual biasing.

SpeechRecognition.processLocallyExperimental

Specifies whether speech recognition must be performed locally on the user's device.

Deprecated properties

The concept of grammar has been removed from the Web Speech API. Related features remain in the specification and are still recognized by supporting browsers for backwards compatibility, but they have no effect on speech recognition services.

SpeechRecognition.grammars

Returns and sets a collection ofSpeechGrammar objects that represent the grammars understood by the currentSpeechRecognition.

Static methods

SpeechRecognition.available()Experimental

Checks whether the specified languages are available for speech recognition.

SpeechRecognition.install()Experimental

Installs the required language packs for on-device speech recognition in the specified languages.

Instance methods

SpeechRecognition also inherits methods from its parent interface,EventTarget.

SpeechRecognition.abort()

Stops the speech recognition service from listening to incoming audio, and doesn't attempt to return aSpeechRecognitionResult.

SpeechRecognition.start()

Starts the speech recognition service to listen for incoming audio (from a microphone or an audio track) and returns the results of that recognition.

SpeechRecognition.stop()

Stops the speech recognition service from listening for incoming audio and attempts to return aSpeechRecognitionResult based on the results captured so far.

Events

Listen to these events usingaddEventListener() or by assigning an event listener to theoneventname property of this interface.

audiostart

Fired when the user agent has started to capture audio.

audioend

Fired when the user agent has finished capturing audio.

end

Fired when the speech recognition service has disconnected.

error

Fired when a speech recognition error occurs.

nomatch

Fired when the speech recognition service returns a final result with no significant recognition. This may involve some degree of recognition, which doesn't meet or exceed theconfidence threshold.

result

Fired when the speech recognition service returns a result — a word or phrase has been positively recognized and this has been communicated back to the app.

soundstart

Fired when any sound — recognizable speech or not — has been detected.

soundend

Fired when any sound — recognizable speech or not — has stopped being detected.

speechstart

Fired when sound that is recognized by the speech recognition service as speech has been detected.

speechend

Fired when speech recognized by the speech recognition service has stopped being detected.

start

Fired when the speech recognition service begins listening for audio to recognize.

Examples

In ourSpeech color changer example, we create a newSpeechRecognition object instance using theSpeechRecognition() constructor.

After some other values have been defined, we then set it so that the recognition service starts when a button is clicked (seeSpeechRecognition.start()). When a result has been successfully recognized, theresult event fires, we extract the color that was spoken from the event object, and then set the background color of the<html> element to that color.

js
const recognition = new SpeechRecognition();recognition.continuous = false;recognition.lang = "en-US";recognition.interimResults = false;recognition.maxAlternatives = 1;const diagnostic = document.querySelector(".output");const bg = document.querySelector("html");const startBtn = document.querySelector("button");startBtn.onclick = () => {  recognition.start();  console.log("Ready to receive a color command.");};recognition.onresult = (event) => {  const color = event.results[0][0].transcript;  diagnostic.textContent = `Result received: ${color}`;  bg.style.backgroundColor = color;};

Specifications

Specification
Web Speech API
# speechreco-section

Browser compatibility

See also

Help improve MDN

Learn how to contribute

This page was last modified on byMDN contributors.


[8]ページ先頭

©2009-2025 Movatter.jp