US20010041978A1 - Search optimization for continuous speech recognition - Google Patents


Info

Publication number
US20010041978A1
Authority
US
United States
Prior art keywords
words
continuous speech
csr
providing
salient
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
US09/185,529
Other versions
US6397179B2 (en)
Inventor
Jean-Francois Crespo
Peter R. Stubley
Serge Robillard
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Benhov GmbH LLC
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US09/119,621 (external priority; patent US6092045A)
Priority to US09/185,529 (patent US6397179B2)
Application filed by Individual
Assigned to NORTHERN TELECOM LIMITED. Reassignment: ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: CRESPO, JEAN-FRANCOIS, ROBILLARS, SERGE, SSTUBLEY, PETER R.
Priority to EP99305530A (patent EP0977174B1)
Priority to DE69908254T (patent DE69908254T2)
Assigned to NORTHERN TELECOM LIMITED. Reassignment: RE-RECORD TO CORRECT THE SURNAME OF INVENTORS PREVIOUSLY RECORDED AT REEL/FRAME 9815/0728. Assignors: CRESPO. JEAN-FRANCOIS, ROBILLARD, SERGE, STUBLEY, PETER R.
Assigned to NORTEL NETWORKS CORPORATION. Reassignment: CHANGE OF NAME (SEE DOCUMENT FOR DETAILS). Assignors: NORTHERN TELECOM LIMITED
Assigned to NORTEL NETWORKS LIMITED. Reassignment: CHANGE OF NAME (SEE DOCUMENT FOR DETAILS). Assignors: NORTEL NETWORKS CORPORATION
Publication of US20010041978A1
Publication of US6397179B2
Application granted
Assigned to INNOVATION MANAGEMENT SCIENCES, LLC. Reassignment: ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: NORTEL NETWORKS LIMITED
Assigned to POPKIN FAMILY ASSETS, L.L.C. Reassignment: ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: INNOVATION MANAGEMENT SCIENCES LLC
Anticipated expiration
Expired - Fee Related (current legal status)

Abstract

A system and method for continuous speech recognition (CSR) is optimized to reduce processing time for connected word grammars bounded by semantically null words. The savings, which reduce processing time during both the forward and the backward passes of the search, as well as during rescoring, are achieved by performing only the minimal amount of computation required to produce an exact N-best list of semantically meaningful words (N-best list of salient words). This departs from standard Spoken Language System modeling, in which any notion of meaning is handled by the Natural Language Understanding (NLU) component. Expanding the task of the recognizer component from a simple acoustic match to allow semantic information to be fed to the recognizer yields significant processing time savings and makes it possible to run an increased number of speech recognition channels in parallel for improved performance, which may enhance users' perception of value and quality of service.
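To make the term "N-best list of salient words" concrete, the following minimal Python sketch shows what such a list looks like when derived from an ordinary N-best list. It is only an illustration of the target output: the abstract describes obtaining this result during the search itself, whereas this sketch filters after the fact. The function name `salient_n_best`, the example utterances, the scores, and the set of null words are all hypothetical.

```python
from typing import Dict, List, Set, Tuple

# A hypothesis is a word sequence with a log score (higher is better).
Hypothesis = Tuple[List[str], float]


def salient_n_best(hypotheses: List[Hypothesis],
                   null_words: Set[str],
                   n: int) -> List[Hypothesis]:
    """Strip semantically null words and keep the N best distinct
    salient word sequences (illustrative post-hoc filter only)."""
    best: Dict[Tuple[str, ...], float] = {}
    for words, score in hypotheses:
        # Two hypotheses that differ only in null words share one key.
        key = tuple(w for w in words if w not in null_words)
        if key not in best or score > best[key]:
            best[key] = score
    ranked = sorted(best.items(), key=lambda item: item[1], reverse=True)
    return [(list(key), score) for key, score in ranked[:n]]


if __name__ == "__main__":
    hyps = [
        (["um", "call", "bob", "please"], -10.0),
        (["call", "bob"], -11.0),
        (["uh", "call", "rob", "please"], -12.5),
        (["call", "rob"], -13.0),
    ]
    for words, score in salient_n_best(hyps, {"um", "uh", "please"}, 2):
        print(words, score)
```

In this toy input, four full hypotheses reduce to two salient sequences, which suggests why rescoring a salient list can involve less work than rescoring a true N-best list.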

Claims (18)

What is claimed is:
1. A method for continuous speech recognition comprising:
incorporating semantic information during searching by a continuous speech recognizer.
2. A method for continuous speech recognition according to claim 1, comprising searching using semantic information to identify semantically-null words and thereby generate a list of N-best salient words.
3. A method for continuous speech recognition comprising:
providing speech input to a continuous speech recognizer,
providing to the continuous speech recognizer an acoustic model comprising a set of Hidden Markov Models, and a language model comprising both grammar and semantic information,
performing recognition of speech input using semantic information to eliminate semantically null words from the N-best list of words and restrict searching to an N-best list of salient words,
and performing word matching to output from the speech recognizer the N-best salient word sequences.
4. A method for a continuous speech recognition process according to claim 3, wherein the step of performing recognition comprises:
detecting connected word grammars bounded by semantically null words;
collapsing each list of semantically null words into a unique single-input single-output acoustic network;
and identifying stop nodes in the acoustic network.
5. A method according to claim 4, comprising:
during a forward pass of a search detecting forward stop nodes and signalling the search to stop forward scoring along a path currently being followed, and
during a backward pass of the search detecting backwards stop nodes and signalling the search to stop backward scoring along a path currently being followed.
6. A method according to claim 5, wherein right-most semantically null networks are not computed.
7. A method according to claim 5, wherein some semantically salient words are not backward-scored.
8. A method according to claim 5, wherein an N-best list of only salient words is rescored instead of a true N-best list.
9. A method according to claim 8, wherein scoring comprises Viterbi scoring.
10. Software on a machine readable medium for performing a method of continuous speech recognition comprising:
incorporating semantic information during searching by a continuous speech recognizer.
11. Software for performing a method of continuous speech recognition according to claim 10, wherein the method comprises searching using semantic information to generate a list of N-best salient words.
12. Software on a machine readable medium for performing a method for continuous speech recognition comprising:
providing speech input to a continuous speech recognizer,
providing to the continuous speech recognizer an acoustic model comprising a set of Hidden Markov Models, and a language model comprising both grammar and semantic information,
performing recognition of speech input using semantic information to eliminate semantically null words from the N-best list of words and restrict searching to an N-best list of salient words,
13. A system for continuous speech recognition comprising:
means for incorporating semantic information during searching by a continuous speech recognizer.
14. A system for continuous speech recognition according to claim 1, comprising means for searching using semantic information to generate a list of N-best salient words.
15. A system for continuous speech recognition
comprising a continuous speech recognizer,
input means for providing speech input to the continuous speech recognizer,
means for providing to the continuous speech recognizer an acoustic model comprising a set of Hidden Markov Models, and a language model comprising both grammar and semantic information,
the continuous speech recognizer comprising means for performing recognition of speech input using the semantic information for eliminating semantically null words from the N-best list of words and thereby restricting searching to an N-best list of salient words, and performing word matching to output the N-best salient word sequences.
16. A system according to claim 15, wherein the means for performing recognition of speech input using the semantic information comprises:
means for detecting connected word grammars bounded by semantically null words;
means for collapsing each list of semantically null words into a unique single-input single-output acoustic network;
and means for identifying stop nodes in the acoustic network.
17. A spoken language processing system for speech recognition comprising:
a continuous speech recognition component (CSR),
a natural language understanding component (NLU),
means for providing speech input to the CSR,
means for providing acoustic-phonetic knowledge to the CSR comprising a set of Hidden Markov Models;
means for providing language knowledge comprising grammar and statistical models to the CSR, and means for providing semantic knowledge to the NLU, and
means for providing semantic knowledge to the CSR,
the CSR being operable for searching using the semantic knowledge to constrain the search to an N-best list of salient words, and perform word matching to output N-best list of salient words to the NLU for interpretation of meaning.
18. A method for continuous speech recognition using a spoken language system comprising a continuous speech recognition component (CSR) linked to a natural language understanding component (NLU), the method comprising:
providing speech input to the CSR;
providing acoustic-phonetic knowledge to the CSR comprising a set of Hidden Markov Models;
providing language knowledge comprising grammar and statistical models to the CSR;
providing semantic knowledge to the CSR;
performing searching with the CSR using the semantic knowledge to constrain the search to an N-best list of salient words comprising semantically meaningful words of the N-best list of words,
and performing word matching to output the N-best salient word sequences to the NLU.
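Claims 3 and 9 refer to Hidden Markov Models and Viterbi scoring without spelling the scoring out. The minimal Python sketch below shows generic log-domain Viterbi scoring over a small discrete HMM, for orientation only. It is the textbook algorithm, not the claimed search with stop nodes, and the function name `viterbi`, the state names, and all probabilities are hypothetical values chosen for illustration.

```python
import math
from typing import Dict, List, Tuple

LogTable = Dict[str, Dict[str, float]]


def viterbi(observations: List[str],
            states: List[str],
            log_start: Dict[str, float],
            log_trans: LogTable,
            log_emit: LogTable) -> Tuple[List[str], float]:
    """Return the best state path and its log score for a discrete HMM."""
    # scores[s] is the best log score of any path ending in state s.
    scores = {s: log_start[s] + log_emit[s][observations[0]] for s in states}
    backpointers: List[Dict[str, str]] = []
    for obs in observations[1:]:
        new_scores: Dict[str, float] = {}
        pointers: Dict[str, str] = {}
        for s in states:
            # Pick the predecessor that maximizes the path score into s.
            prev = max(states, key=lambda p: scores[p] + log_trans[p][s])
            new_scores[s] = scores[prev] + log_trans[prev][s] + log_emit[s][obs]
            pointers[s] = prev
        scores = new_scores
        backpointers.append(pointers)
    # Trace back from the best final state.
    last = max(states, key=lambda s: scores[s])
    path = [last]
    for pointers in reversed(backpointers):
        path.append(pointers[path[-1]])
    path.reverse()
    return path, scores[last]


def to_log(table: Dict[str, Dict[str, float]]) -> LogTable:
    """Convert a nested probability table to log probabilities."""
    return {a: {b: math.log(p) for b, p in row.items()} for a, row in table.items()}


if __name__ == "__main__":
    states = ["sil", "speech"]  # hypothetical two-state model
    start = {"sil": math.log(0.8), "speech": math.log(0.2)}
    trans = to_log({"sil": {"sil": 0.6, "speech": 0.4},
                    "speech": {"sil": 0.3, "speech": 0.7}})
    emit = to_log({"sil": {"lo": 0.9, "hi": 0.1},
                   "speech": {"lo": 0.2, "hi": 0.8}})
    print(viterbi(["lo", "hi", "hi"], states, start, trans, emit))
```

Working in log probabilities avoids numerical underflow on long observation sequences, which is the usual reason recognizers score this way.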
US09/185,529 | Priority date 1997-12-24 | Filing date 1998-11-04 | Search optimization system and method for continuous speech recognition | Expired - Fee Related | US6397179B2 (en)

Priority Applications (3)

Application Number | Priority Date | Filing Date | Title
US09/185,529, US6397179B2 (en) | 1997-12-24 | 1998-11-04 | Search optimization system and method for continuous speech recognition
EP99305530A, EP0977174B1 (en) | 1998-07-21 | 1999-07-13 | Search optimization system and method for continuous speech recognition
DE69908254T, DE69908254T2 (en) | 1998-07-21 | 1999-07-13 | Search optimization system and method for continuous speech recognition

Applications Claiming Priority (4)

Application Number | Priority Date | Filing Date | Title
US99782497A | 1997-12-24 | 1997-12-24 |
US08997824 | 1997-12-24
US09/119,621, US6092045A (en) | 1997-09-19 | 1998-07-21 | Method and apparatus for speech recognition
US09/185,529, US6397179B2 (en) | 1997-12-24 | 1998-11-04 | Search optimization system and method for continuous speech recognition

Publications (2)

Publication Number | Publication Date
US20010041978A1 (en) | 2001-11-15
US6397179B2 (en) | 2002-05-28

Family

ID=26817516

Family Applications (1)

Application Number | Title | Priority Date | Filing Date
US09/185,529, US6397179B2 (en), Expired - Fee Related | Search optimization system and method for continuous speech recognition | 1997-12-24 | 1998-11-04

Country Status (3)

Country | Link
US (1) | US6397179B2 (en)
EP (1) | EP0977174B1 (en)
DE (1) | DE69908254T2 (en)

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
JP2768561B2 (en) * | 1990-12-19 | 1998-06-25 | Fujitsu Ltd. (富士通株式会社) | Network transformation device and creation device
US5388183A (en) * | 1991-09-30 | 1995-02-07 | Kurzwell Applied Intelligence, Inc. | Speech recognition providing multiple outputs
US5621859A (en) | 1994-01-19 | 1997-04-15 | Bbn Corporation | Single tree method for grammar directed, very large vocabulary speech recognizer
JP3265864B2 (en) | 1994-10-28 | 2002-03-18 | Mitsubishi Electric Corp. (三菱電機株式会社) | Voice recognition device
US5799276A (en) * | 1995-11-07 | 1998-08-25 | Accent Incorporated | Knowledge-based speech recognition system and methods having frame length computed based upon estimated pitch period of vocalic intervals
US5819220A (en) * | 1996-09-30 | 1998-10-06 | Hewlett-Packard Company | Web triggered word set boosting for speech interfaces to the world wide web
US5797123A (en) * | 1996-10-01 | 1998-08-18 | Lucent Technologies Inc. | Method of key-phase detection and verification for flexible speech understanding
US6016470A (en) * | 1997-11-12 | 2000-01-18 | Gte Internetworking Incorporated | Rejection grammar using selected phonemes for speech recognition system

Also Published As

Publication number | Publication date
EP0977174A2 (en) | 2000-02-02
DE69908254T2 (en) | 2003-11-27
EP0977174A3 (en) | 2001-02-14
US6397179B2 (en) | 2002-05-28
EP0977174B1 (en) | 2003-05-28
DE69908254D1 (en) | 2003-07-03

Similar Documents

Publication | Publication Date | Title
US6397179B2 (en) | Search optimization system and method for continuous speech recognition
US10176802B1 (en) | Lattice encoding using recurrent neural networks
US10134388B1 (en) | Word generation for speech recognition
US10453117B1 (en) | Determining domains for natural language understanding
US5797123A (en) | Method of key-phase detection and verification for flexible speech understanding
US6292779B1 (en) | System and method for modeless large vocabulary speech recognition
US10170107B1 (en) | Extendable label recognition of linguistic input
Ward | Extracting information in spontaneous speech.
Ljolje et al. | Efficient general lattice generation and rescoring.
US10381000B1 (en) | Compressed finite state transducers for automatic speech recognition
WO2001022400A1 (en) | Iterative speech recognition from multiple feature vectors
KR20070047579A (en) | Interactive Speech Recognition Apparatus and Method Using Subject Area Detection
JP2001517816A (en) | A speech recognition system for recognizing continuous and separated speech
JP2001249684A (en) | Device and method for recognizing speech, and recording medium
US6980954B1 (en) | Search method based on single triphone tree for large vocabulary continuous speech recognizer
JPH08505957A (en) | Voice recognition system
MODELTROPE
JP4528540B2 (en) | Voice recognition method and apparatus, voice recognition program, and storage medium storing voice recognition program
JP2938865B1 (en) | Voice recognition device
JPH09134192A (en) | Statistical language model forming device and speech recognition device
Smaïli et al. | An hybrid language model for a continuous dictation prototype.
Bai et al. | A multi-phase approach for fast spotting of large vocabulary Chinese keywords from Mandarin speech using prosodic information
JP2731133B2 (en) | Continuous speech recognition device
JPH10254480A (en) | Voice recognition method
Fu et al. | Combination of multiple predictors to improve confidence measure based on local posterior probabilities

Legal Events

Date | Code | Title | Description
AS | Assignment

Owner name: NORTHERN TELECOM LIMITED, CANADA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:CRESPO, JEAN-FRANCOIS;SSTUBLEY, PETER R.;ROBILLARS, SERGE;REEL/FRAME:009815/0728

Effective date: 19990222

AS | Assignment

Owner name: NORTHERN TELECOM LIMITED, CANADA

Free format text: RE-RECORD TO CORRECT THE SURNAME OF INVENTORS PREVIOUSLY RECORDED AT REEL/FRAME 9815/0728.;ASSIGNORS:CRESPO. JEAN-FRANCOIS;STUBLEY, PETER R.;ROBILLARD, SERGE;REEL/FRAME:010244/0661

Effective date: 19990222

AS | Assignment

Owner name: NORTEL NETWORKS CORPORATION, CANADA

Free format text: CHANGE OF NAME;ASSIGNOR:NORTHERN TELECOM LIMITED;REEL/FRAME:010567/0001

Effective date: 19990429

AS | Assignment

Owner name: NORTEL NETWORKS LIMITED, CANADA

Free format text: CHANGE OF NAME;ASSIGNOR:NORTEL NETWORKS CORPORATION;REEL/FRAME:011195/0706

Effective date: 20000830

REMI | Maintenance fee reminder mailed
FPAY | Fee payment

Year of fee payment: 4

SULP | Surcharge for late payment
AS | Assignment

Owner name: INNOVATION MANAGEMENT SCIENCES, LLC, CALIFORNIA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:NORTEL NETWORKS LIMITED;REEL/FRAME:019215/0788

Effective date: 20070424

AS | Assignment

Owner name: POPKIN FAMILY ASSETS, L.L.C., DELAWARE

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:INNOVATION MANAGEMENT SCIENCES LLC;REEL/FRAME:019605/0022

Effective date: 20070427

FEPP | Fee payment procedure

Free format text: PAYER NUMBER DE-ASSIGNED (ORIGINAL EVENT CODE: RMPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

Free format text: PAYOR NUMBER ASSIGNED (ORIGINAL EVENT CODE: ASPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

FPAY | Fee payment

Year of fee payment: 8

REMI | Maintenance fee reminder mailed
LAPS | Lapse for failure to pay maintenance fees
STCH | Information on status: patent discontinuation

Free format text: PATENT EXPIRED DUE TO NONPAYMENT OF MAINTENANCE FEES UNDER 37 CFR 1.362

FP | Lapsed due to failure to pay maintenance fee

Effective date: 20140528

