Movatterモバイル変換


[0]ホーム

URL:


US20060085194A1 - Speech synthesis apparatus and method, and storage medium - Google Patents

Speech synthesis apparatus and method, and storage medium
Download PDF

Info

Publication number
US20060085194A1
US20060085194A1US11/295,653US29565305AUS2006085194A1US 20060085194 A1US20060085194 A1US 20060085194A1US 29565305 AUS29565305 AUS 29565305AUS 2006085194 A1US2006085194 A1US 2006085194A1
Authority
US
United States
Prior art keywords
distortion
synthesis
unit
modification
obtaining
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US11/295,653
Inventor
Yasuo Okutani
Yasuhiro Komori
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Canon Inc
Original Assignee
Canon Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from JP2000099420Aexternal-prioritypatent/JP4454780B2/en
Priority claimed from US09/818,581external-prioritypatent/US6980955B2/en
Application filed by Canon IncfiledCriticalCanon Inc
Priority to US11/295,653priorityCriticalpatent/US20060085194A1/en
Publication of US20060085194A1publicationCriticalpatent/US20060085194A1/en
Abandonedlegal-statusCriticalCurrent

Links

Images

Classifications

Definitions

Landscapes

Abstract

Input text data undergoes language analysis to generate prosody, and a speech database is searched for a synthesis unit on the basis of the prosody. A modification distortion of the found synthesis unit, and concatenation distortions upon connecting that synthesis unit to those in the preceding phoneme are computed, and a distortion determination unit weights the modification and concatenation distortions to determine the total distortion. An Nbest determination unit obtains N best paths that can minimize the distortion using the A* search algorithm, and a registration unit determination unit selects a synthesis unit to be registered in a synthesis unit inventory on the basis of the N best paths in the order of frequencies of occurrence, and registers it in the synthesis unit inventory.

Description

Claims (23)

1. A synthesis unit selection apparatus comprising:
n-best obtaining means for obtaining one or more best sequences of synthesis units corresponding to a phonetic string on the basis of a distortion obtained by concatenating synthesis units;
obtaining means for obtaining a plurality of sequences by applying said n-best obtaining means to a corpus including a plurality of phonetic strings; and
selection means for selecting synthesis units on the basis of the plurality of sequences obtained by said obtaining means.
2. The apparatus according toclaim 1, wherein the distortion comprises at least one of a concatenation distortion and a modification distortion, and the modification distortion is a distortion between a synthesis unit before and after modification.
3. The apparatus according toclaim 1, further comprising:
text reception means for receiving text data,
wherein the plurality of phonetic strings are included in the text data received by said text reception means.
4. The apparatus according toclaim 1, further comprising:
registration means for registering the synthesis units selected by said selection means in a synthesis unit inventory in a memory.
5. The apparatus according toclaim 2, wherein said selection means selects a synthesis units on the basis of a weighted sum of the concatenation and modification distortions.
6. (canceled)
7. (canceled)
8. The apparatus according toclaim 2, wherein said obtaining means determines the modification distortion by looking up a table that stores the modification distortion.
9. The apparatus according toclaim 2, wherein said obtaining means determines the concatenation distortion by looking up a table that stores the concatenation distortion.
10. (canceled)
11. A synthesis unit selection method comprising:
an n-best obtaining step of obtaining one or more best sequences of synthesis units corresponding to a phonetic string on the basis of a distortion obtained by concatenating synthesis units;
an obtaining step of obtaining a plurality of sequences by applying said n-best obtaining step to a corpus including a plurality of phonetic strings; and
a selection step of selecting synthesis units on the basis of the plurality of sequences obtained in said obtaining step.
12. The method according toclaim 11, wherein the distortion comprises at least one of a concatenation distortion and a modification distortion, and the modification distortion is a distortion between a synthesis unit before and after modification.
13. The method according toclaim 11, further comprising the step of:
receiving text data,
wherein the plurality of phonetic strings are included in the text data received in said receiving step.
14. The method according toclaim 11, further comprising the step of:
registering the synthesis units selected in said selection step in a synthesis unit inventory.
15. The method according toclaim 12, wherein in said selection step, a synthesis unit is selected on the basis of a weighted sum of the concatenation and modification distortions.
16. (canceled)
17. (canceled)
18. The method according toclaim 12, wherein in said obtaining step, the modification distortion is determined by looking up a table that stores the modification distortion.
19. The method according toclaim 12, wherein in said obtaining step, the concatenation distortion is determined by looking up a table that stores the concatenation distortion.
20. (canceled)
21. A computer readable storage medium storing a program that implements the method recited inclaim 11.
22. A synthesis unit selection apparatus comprising:
an n-best obtaining unit configured to obtain one or more best sequences of synthesis units corresponding to a phonetic string on the basis of a distortion obtained by concatenating synthesis units;
an obtaining unit configured to obtain a plurality of sequences by applying said n-best obtaining unit to a corpus including a plurality of phonetic strings; and
a selection unit configured to select synthesis units on the basis of the plurality of sequences obtained by said obtaining unit.
23. A program for implementing a synthesis unit selection method comprising:
an n-best obtaining step module for obtaining one or more best sequences of synthesis units corresponding to a phonetic string on the basis of a distortion obtained by concatenating synthesis units;
an obtaining step module for obtaining a plurality of sequences by applying said n-best obtaining step module to a corpus including a plurality of phonetic strings; and
a selection step module for selecting synthesis units on the basis of the plurality of sequences obtained by said obtaining step module.
US11/295,6532000-03-312005-12-07Speech synthesis apparatus and method, and storage mediumAbandonedUS20060085194A1 (en)

Priority Applications (1)

Application NumberPriority DateFiling DateTitle
US11/295,653US20060085194A1 (en)2000-03-312005-12-07Speech synthesis apparatus and method, and storage medium

Applications Claiming Priority (5)

Application NumberPriority DateFiling DateTitle
JP2000099420AJP4454780B2 (en)2000-03-312000-03-31 Audio information processing apparatus, method and storage medium
JP2000-0994202000-03-31
US09/818,581US6980955B2 (en)2000-03-312001-03-28Synthesis unit selection apparatus and method, and storage medium
US10/928,114US7039588B2 (en)2000-03-312004-08-30Synthesis unit selection apparatus and method, and storage medium
US11/295,653US20060085194A1 (en)2000-03-312005-12-07Speech synthesis apparatus and method, and storage medium

Related Parent Applications (1)

Application NumberTitlePriority DateFiling Date
US10/928,114DivisionUS7039588B2 (en)2000-03-312004-08-30Synthesis unit selection apparatus and method, and storage medium

Publications (1)

Publication NumberPublication Date
US20060085194A1true US20060085194A1 (en)2006-04-20

Family

ID=34106103

Family Applications (2)

Application NumberTitlePriority DateFiling Date
US10/928,114Expired - Fee RelatedUS7039588B2 (en)2000-03-312004-08-30Synthesis unit selection apparatus and method, and storage medium
US11/295,653AbandonedUS20060085194A1 (en)2000-03-312005-12-07Speech synthesis apparatus and method, and storage medium

Family Applications Before (1)

Application NumberTitlePriority DateFiling Date
US10/928,114Expired - Fee RelatedUS7039588B2 (en)2000-03-312004-08-30Synthesis unit selection apparatus and method, and storage medium

Country Status (1)

CountryLink
US (2)US7039588B2 (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
EP1857924A1 (en)*2006-05-182007-11-21Kabushiki Kaisha ToshibaSpeech synthesis apparatus and method
US20080154605A1 (en)*2006-12-212008-06-26International Business Machines CorporationAdaptive quality adjustments for speech synthesis in a real-time speech processing system based upon load
US20090018836A1 (en)*2007-03-292009-01-15Kabushiki Kaisha ToshibaSpeech synthesis system and speech synthesis method
US20130268275A1 (en)*2007-09-072013-10-10Nuance Communications, Inc.Speech synthesis system, speech synthesis program product, and speech synthesis method

Families Citing this family (133)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US8645137B2 (en)2000-03-162014-02-04Apple Inc.Fast, language-independent method for user authentication by voice
US7401020B2 (en)*2002-11-292008-07-15International Business Machines CorporationApplication of emotion-based intonation and prosody to speech in text-to-speech systems
US7577568B2 (en)*2003-06-102009-08-18At&T Intellctual Property Ii, L.P.Methods and system for creating voice files using a VoiceXML application
CN1914666B (en)*2004-01-272012-04-04松下电器产业株式会社 sound synthesis device
KR100571835B1 (en)*2004-03-042006-04-17삼성전자주식회사 Method and apparatus for generating recorded sentences for building voice corpus
CN1842702B (en)*2004-10-132010-05-05松下电器产业株式会社Speech synthesis device and speech synthesis method
US20080177548A1 (en)*2005-05-312008-07-24Canon Kabushiki KaishaSpeech Synthesis Method and Apparatus
US8677377B2 (en)2005-09-082014-03-18Apple Inc.Method and apparatus for building an intelligent automated assistant
US20070124148A1 (en)*2005-11-282007-05-31Canon Kabushiki KaishaSpeech processing apparatus and speech processing method
US7924986B2 (en)*2006-01-272011-04-12Accenture Global Services LimitedIVR system manager
US9318108B2 (en)2010-01-182016-04-19Apple Inc.Intelligent automated assistant
JP2008225254A (en)*2007-03-142008-09-25Canon Inc Speech synthesis apparatus and method, and program
US8977255B2 (en)2007-04-032015-03-10Apple Inc.Method and system for operating a multi-function portable electronic device using voice-activation
US9330720B2 (en)2008-01-032016-05-03Apple Inc.Methods and apparatus for altering audio output signals
US8996376B2 (en)2008-04-052015-03-31Apple Inc.Intelligent text-to-speech conversion
US10496753B2 (en)2010-01-182019-12-03Apple Inc.Automatically adapting user interfaces for hands-free interaction
US20100030549A1 (en)2008-07-312010-02-04Lee Michael MMobile device having human language translation capability with positional feedback
US8712776B2 (en)2008-09-292014-04-29Apple Inc.Systems and methods for selective text to speech synthesis
US8352272B2 (en)2008-09-292013-01-08Apple Inc.Systems and methods for text to speech synthesis
US8396714B2 (en)2008-09-292013-03-12Apple Inc.Systems and methods for concatenation of words in text to speech synthesis
US8352268B2 (en)2008-09-292013-01-08Apple Inc.Systems and methods for selective rate of speech and speech preferences for text to speech synthesis
WO2010067118A1 (en)2008-12-112010-06-17Novauris Technologies LimitedSpeech recognition involving a mobile device
US8380507B2 (en)2009-03-092013-02-19Apple Inc.Systems and methods for determining the language to use for speech generated by a text to speech engine
US20120309363A1 (en)2011-06-032012-12-06Apple Inc.Triggering notifications associated with tasks items that represent tasks to perform
US10241752B2 (en)2011-09-302019-03-26Apple Inc.Interface for a virtual digital assistant
US9858925B2 (en)2009-06-052018-01-02Apple Inc.Using context information to facilitate processing of commands in a virtual assistant
US10241644B2 (en)2011-06-032019-03-26Apple Inc.Actionable reminder entries
US9431006B2 (en)2009-07-022016-08-30Apple Inc.Methods and apparatuses for automatic speech recognition
US10679605B2 (en)2010-01-182020-06-09Apple Inc.Hands-free list-reading by intelligent automated assistant
US10553209B2 (en)2010-01-182020-02-04Apple Inc.Systems and methods for hands-free notification summaries
US10276170B2 (en)2010-01-182019-04-30Apple Inc.Intelligent automated assistant
US10705794B2 (en)2010-01-182020-07-07Apple Inc.Automatically adapting user interfaces for hands-free interaction
DE112011100329T5 (en)2010-01-252012-10-31Andrew Peter Nelson Jerram Apparatus, methods and systems for a digital conversation management platform
US8682667B2 (en)2010-02-252014-03-25Apple Inc.User profiling for selecting user specific voice input processing information
US10762293B2 (en)2010-12-222020-09-01Apple Inc.Using parts-of-speech tagging and named entity recognition for spelling correction
US9262612B2 (en)2011-03-212016-02-16Apple Inc.Device access using voice authentication
US10057736B2 (en)2011-06-032018-08-21Apple Inc.Active transport based notifications
US8994660B2 (en)2011-08-292015-03-31Apple Inc.Text correction processing
US10134385B2 (en)2012-03-022018-11-20Apple Inc.Systems and methods for name pronunciation
US9483461B2 (en)2012-03-062016-11-01Apple Inc.Handling speech synthesis of content for multiple languages
US9280610B2 (en)2012-05-142016-03-08Apple Inc.Crowd sourcing information to fulfill user requests
US9721563B2 (en)2012-06-082017-08-01Apple Inc.Name recognition system
US9495129B2 (en)2012-06-292016-11-15Apple Inc.Device, method, and user interface for voice-activated navigation and browsing of a document
FR2993088B1 (en)*2012-07-062014-07-18Continental Automotive France METHOD AND SYSTEM FOR VOICE SYNTHESIS
US9576574B2 (en)2012-09-102017-02-21Apple Inc.Context-sensitive handling of interruptions by intelligent digital assistant
US9547647B2 (en)2012-09-192017-01-17Apple Inc.Voice-based media searching
DE212014000045U1 (en)2013-02-072015-09-24Apple Inc. Voice trigger for a digital assistant
US9368114B2 (en)2013-03-142016-06-14Apple Inc.Context-sensitive handling of interruptions
WO2014144579A1 (en)2013-03-152014-09-18Apple Inc.System and method for updating an adaptive speech recognition model
AU2014233517B2 (en)2013-03-152017-05-25Apple Inc.Training an at least partial voice command system
US9582608B2 (en)2013-06-072017-02-28Apple Inc.Unified ranking with entropy-weighted information for phrase-based semantic auto-completion
WO2014197334A2 (en)2013-06-072014-12-11Apple Inc.System and method for user-specified pronunciation of words for speech synthesis and recognition
WO2014197336A1 (en)2013-06-072014-12-11Apple Inc.System and method for detecting errors in interactions with a voice-based digital assistant
WO2014197335A1 (en)2013-06-082014-12-11Apple Inc.Interpreting and acting upon commands that involve sharing information with remote devices
US10176167B2 (en)2013-06-092019-01-08Apple Inc.System and method for inferring user intent from speech inputs
DE112014002747T5 (en)2013-06-092016-03-03Apple Inc. Apparatus, method and graphical user interface for enabling conversation persistence over two or more instances of a digital assistant
AU2014278595B2 (en)2013-06-132017-04-06Apple Inc.System and method for emergency calls initiated by voice command
DE112014003653B4 (en)2013-08-062024-04-18Apple Inc. Automatically activate intelligent responses based on activities from remote devices
US9620105B2 (en)2014-05-152017-04-11Apple Inc.Analyzing audio input for efficient speech and music recognition
US10592095B2 (en)2014-05-232020-03-17Apple Inc.Instantaneous speaking of content on touch devices
US9502031B2 (en)2014-05-272016-11-22Apple Inc.Method for supporting dynamic grammars in WFST-based ASR
US10078631B2 (en)2014-05-302018-09-18Apple Inc.Entropy-guided text prediction using combined word and character n-gram language models
US10170123B2 (en)2014-05-302019-01-01Apple Inc.Intelligent assistant for home automation
US9785630B2 (en)2014-05-302017-10-10Apple Inc.Text prediction using combined word N-gram and unigram language models
US9715875B2 (en)2014-05-302017-07-25Apple Inc.Reducing the need for manual start/end-pointing and trigger phrases
US9734193B2 (en)2014-05-302017-08-15Apple Inc.Determining domain salience ranking from ambiguous words in natural speech
US9760559B2 (en)2014-05-302017-09-12Apple Inc.Predictive text input
US9430463B2 (en)2014-05-302016-08-30Apple Inc.Exemplar-based natural language processing
CN110797019B (en)2014-05-302023-08-29苹果公司Multi-command single speech input method
US9633004B2 (en)2014-05-302017-04-25Apple Inc.Better resolution when referencing to concepts
US10289433B2 (en)2014-05-302019-05-14Apple Inc.Domain specific language for encoding assistant dialog
US9842101B2 (en)2014-05-302017-12-12Apple Inc.Predictive conversion of language input
US9338493B2 (en)2014-06-302016-05-10Apple Inc.Intelligent automated assistant for TV user interactions
US10659851B2 (en)2014-06-302020-05-19Apple Inc.Real-time digital assistant knowledge updates
US10446141B2 (en)2014-08-282019-10-15Apple Inc.Automatic speech recognition based on user feedback
US9818400B2 (en)2014-09-112017-11-14Apple Inc.Method and apparatus for discovering trending terms in speech requests
US10789041B2 (en)2014-09-122020-09-29Apple Inc.Dynamic thresholds for always listening speech trigger
US9606986B2 (en)2014-09-292017-03-28Apple Inc.Integrated word N-gram and class M-gram language models
US9886432B2 (en)2014-09-302018-02-06Apple Inc.Parsimonious handling of word inflection via categorical stem + suffix N-gram language models
US9646609B2 (en)2014-09-302017-05-09Apple Inc.Caching apparatus for serving phonetic pronunciations
US10127911B2 (en)2014-09-302018-11-13Apple Inc.Speaker identification and unsupervised speaker adaptation techniques
US10074360B2 (en)2014-09-302018-09-11Apple Inc.Providing an indication of the suitability of speech recognition
US9668121B2 (en)2014-09-302017-05-30Apple Inc.Social reminders
JP6415929B2 (en)*2014-10-302018-10-31株式会社東芝 Speech synthesis apparatus, speech synthesis method and program
US10552013B2 (en)2014-12-022020-02-04Apple Inc.Data detection
US9711141B2 (en)2014-12-092017-07-18Apple Inc.Disambiguating heteronyms in speech synthesis
US9865280B2 (en)2015-03-062018-01-09Apple Inc.Structured dictation using intelligent automated assistants
US9721566B2 (en)2015-03-082017-08-01Apple Inc.Competing devices responding to voice triggers
US10567477B2 (en)2015-03-082020-02-18Apple Inc.Virtual assistant continuity
US9886953B2 (en)2015-03-082018-02-06Apple Inc.Virtual assistant activation
US9899019B2 (en)2015-03-182018-02-20Apple Inc.Systems and methods for structured stem and suffix language models
US10726197B2 (en)*2015-03-262020-07-28Lenovo (Singapore) Pte. Ltd.Text correction using a second input
US9842105B2 (en)2015-04-162017-12-12Apple Inc.Parsimonious continuous-space phrase representations for natural language processing
US10083688B2 (en)2015-05-272018-09-25Apple Inc.Device voice control for selecting a displayed affordance
US10127220B2 (en)2015-06-042018-11-13Apple Inc.Language identification from short strings
US10101822B2 (en)2015-06-052018-10-16Apple Inc.Language input correction
US9578173B2 (en)2015-06-052017-02-21Apple Inc.Virtual assistant aided communication with 3rd party service in a communication session
US10186254B2 (en)2015-06-072019-01-22Apple Inc.Context-based endpoint detection
US11025565B2 (en)2015-06-072021-06-01Apple Inc.Personalized prediction of responses for instant messaging
US10255907B2 (en)2015-06-072019-04-09Apple Inc.Automatic accent detection using acoustic models
US10671428B2 (en)2015-09-082020-06-02Apple Inc.Distributed personal assistant
US10747498B2 (en)2015-09-082020-08-18Apple Inc.Zero latency digital assistant
CN107924678B (en)*2015-09-162021-12-17株式会社东芝Speech synthesis device, speech synthesis method, and storage medium
US9697820B2 (en)2015-09-242017-07-04Apple Inc.Unit-selection text-to-speech synthesis using concatenation-sensitive neural networks
US11010550B2 (en)2015-09-292021-05-18Apple Inc.Unified language modeling framework for word prediction, auto-completion and auto-correction
US10366158B2 (en)2015-09-292019-07-30Apple Inc.Efficient word encoding for recurrent neural network language models
US11587559B2 (en)2015-09-302023-02-21Apple Inc.Intelligent device identification
US10691473B2 (en)2015-11-062020-06-23Apple Inc.Intelligent automated assistant in a messaging environment
US10049668B2 (en)2015-12-022018-08-14Apple Inc.Applying neural network language models to weighted finite state transducers for automatic speech recognition
US10223066B2 (en)2015-12-232019-03-05Apple Inc.Proactive assistance based on dialog communication between devices
US10446143B2 (en)2016-03-142019-10-15Apple Inc.Identification of voice inputs providing credentials
US9934775B2 (en)2016-05-262018-04-03Apple Inc.Unit-selection text-to-speech synthesis based on predicted concatenation parameters
US9972304B2 (en)2016-06-032018-05-15Apple Inc.Privacy preserving distributed evaluation framework for embedded personalized systems
US10249300B2 (en)2016-06-062019-04-02Apple Inc.Intelligent list reading
US10049663B2 (en)2016-06-082018-08-14Apple, Inc.Intelligent automated assistant for media exploration
DK179309B1 (en)2016-06-092018-04-23Apple IncIntelligent automated assistant in a home environment
US10586535B2 (en)2016-06-102020-03-10Apple Inc.Intelligent digital assistant in a multi-tasking environment
US10192552B2 (en)2016-06-102019-01-29Apple Inc.Digital assistant providing whispered speech
US10490187B2 (en)2016-06-102019-11-26Apple Inc.Digital assistant providing automated status report
US10509862B2 (en)2016-06-102019-12-17Apple Inc.Dynamic phrase expansion of language input
US10067938B2 (en)2016-06-102018-09-04Apple Inc.Multilingual word prediction
DK179049B1 (en)2016-06-112017-09-18Apple IncData driven natural language event detection and classification
DK201670540A1 (en)2016-06-112018-01-08Apple IncApplication integration with a digital assistant
DK179343B1 (en)2016-06-112018-05-14Apple IncIntelligent task discovery
DK179415B1 (en)2016-06-112018-06-14Apple IncIntelligent device arbitration and control
US10043516B2 (en)2016-09-232018-08-07Apple Inc.Intelligent automated assistant
US10593346B2 (en)2016-12-222020-03-17Apple Inc.Rank-reduced token representation for automatic speech recognition
DK201770439A1 (en)2017-05-112018-12-13Apple Inc.Offline personal assistant
DK179745B1 (en)2017-05-122019-05-01Apple Inc. SYNCHRONIZATION AND TASK DELEGATION OF A DIGITAL ASSISTANT
DK179496B1 (en)2017-05-122019-01-15Apple Inc. USER-SPECIFIC Acoustic Models
DK201770431A1 (en)2017-05-152018-12-20Apple Inc.Optimizing dialogue policy decisions for digital assistants using implicit feedback
DK201770432A1 (en)2017-05-152018-12-21Apple Inc.Hierarchical belief states for digital assistants
DK179549B1 (en)2017-05-162019-02-12Apple Inc.Far-field extension for digital assistant services

Citations (12)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US5633984A (en)*1991-09-111997-05-27Canon Kabushiki KaishaMethod and apparatus for speech processing
US5924067A (en)*1996-03-251999-07-13Canon Kabushiki KaishaSpeech recognition method and apparatus, a computer-readable storage medium, and a computer- readable program for obtaining the mean of the time of speech and non-speech portions of input speech in the cepstrum dimension
US6236962B1 (en)*1997-03-132001-05-22Canon Kabushiki KaishaSpeech processing apparatus and method and computer readable medium encoded with a program for recognizing input speech by performing searches based on a normalized current feature parameter
US6240384B1 (en)*1995-12-042001-05-29Kabushiki Kaisha ToshibaSpeech synthesis method
US20010032079A1 (en)*2000-03-312001-10-18Yasuo OkutaniSpeech signal processing apparatus and method, and storage medium
US20020051955A1 (en)*2000-03-312002-05-02Yasuo OkutaniSpeech signal processing apparatus and method, and storage medium
US6546367B2 (en)*1998-03-102003-04-08Canon Kabushiki KaishaSynthesizing phoneme string of predetermined duration by adjusting initial phoneme duration on values from multiple regression by adding values based on their standard deviations
US6662159B2 (en)*1995-11-012003-12-09Canon Kabushiki KaishaRecognizing speech data using a state transition model
US6665641B1 (en)*1998-11-132003-12-16Scansoft, Inc.Speech synthesis using concatenation of speech waveforms
US6697780B1 (en)*1999-04-302004-02-24At&T Corp.Method and apparatus for rapid acoustic unit selection from a large speech corpus
US7013278B1 (en)*2000-07-052006-03-14At&T Corp.Synthesis-based pre-selection of suitable units for concatenative speech
US7082396B1 (en)*1999-04-302006-07-25At&T CorpMethods and apparatus for rapid acoustic unit selection from a large speech corpus

Family Cites Families (13)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
JP3397372B2 (en)*1993-06-162003-04-14キヤノン株式会社 Speech recognition method and apparatus
JP3450411B2 (en)*1994-03-222003-09-22キヤノン株式会社 Voice information processing method and apparatus
JP3530591B2 (en)*1994-09-142004-05-24キヤノン株式会社 Speech recognition apparatus, information processing apparatus using the same, and methods thereof
JP3581401B2 (en)*1994-10-072004-10-27キヤノン株式会社 Voice recognition method
JP3453456B2 (en)*1995-06-192003-10-06キヤノン株式会社 State sharing model design method and apparatus, and speech recognition method and apparatus using the state sharing model
JPH09258771A (en)*1996-03-251997-10-03Canon Inc Audio processing method and apparatus
US5913193A (en)*1996-04-301999-06-15Microsoft CorporationMethod and system of runtime acoustic unit selection for speech synthesis
US6366883B1 (en)*1996-05-152002-04-02Atr Interpreting TelecommunicationsConcatenation of speech segments by use of a speech synthesizer
JPH1097276A (en)*1996-09-201998-04-14Canon Inc Voice recognition method and apparatus, and storage medium
JPH10161692A (en)*1996-12-031998-06-19Canon Inc Voice recognition device and voice recognition method
JPH10187195A (en)*1996-12-261998-07-14Canon Inc Voice synthesis method and apparatus
US6163769A (en)*1997-10-022000-12-19Microsoft CorporationText-to-speech using clustered context-dependent phoneme-based units
JP3180764B2 (en)*1998-06-052001-06-25日本電気株式会社 Speech synthesizer

Patent Citations (12)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US5633984A (en)*1991-09-111997-05-27Canon Kabushiki KaishaMethod and apparatus for speech processing
US6662159B2 (en)*1995-11-012003-12-09Canon Kabushiki KaishaRecognizing speech data using a state transition model
US6240384B1 (en)*1995-12-042001-05-29Kabushiki Kaisha ToshibaSpeech synthesis method
US5924067A (en)*1996-03-251999-07-13Canon Kabushiki KaishaSpeech recognition method and apparatus, a computer-readable storage medium, and a computer- readable program for obtaining the mean of the time of speech and non-speech portions of input speech in the cepstrum dimension
US6236962B1 (en)*1997-03-132001-05-22Canon Kabushiki KaishaSpeech processing apparatus and method and computer readable medium encoded with a program for recognizing input speech by performing searches based on a normalized current feature parameter
US6546367B2 (en)*1998-03-102003-04-08Canon Kabushiki KaishaSynthesizing phoneme string of predetermined duration by adjusting initial phoneme duration on values from multiple regression by adding values based on their standard deviations
US6665641B1 (en)*1998-11-132003-12-16Scansoft, Inc.Speech synthesis using concatenation of speech waveforms
US6697780B1 (en)*1999-04-302004-02-24At&T Corp.Method and apparatus for rapid acoustic unit selection from a large speech corpus
US7082396B1 (en)*1999-04-302006-07-25At&T CorpMethods and apparatus for rapid acoustic unit selection from a large speech corpus
US20010032079A1 (en)*2000-03-312001-10-18Yasuo OkutaniSpeech signal processing apparatus and method, and storage medium
US20020051955A1 (en)*2000-03-312002-05-02Yasuo OkutaniSpeech signal processing apparatus and method, and storage medium
US7013278B1 (en)*2000-07-052006-03-14At&T Corp.Synthesis-based pre-selection of suitable units for concatenative speech

Cited By (10)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
EP1857924A1 (en)*2006-05-182007-11-21Kabushiki Kaisha ToshibaSpeech synthesis apparatus and method
US20070271099A1 (en)*2006-05-182007-11-22Kabushiki Kaisha ToshibaSpeech synthesis apparatus and method
US8468020B2 (en)2006-05-182013-06-18Kabushiki Kaisha ToshibaSpeech synthesis apparatus and method wherein more than one speech unit is acquired from continuous memory region by one access
US8731933B2 (en)2006-05-182014-05-20Kabushiki Kaisha ToshibaSpeech synthesis apparatus and method utilizing acquisition of at least two speech unit waveforms acquired from a continuous memory region by one access
US9666179B2 (en)2006-05-182017-05-30Kabushiki Kaisha ToshibaSpeech synthesis apparatus and method utilizing acquisition of at least two speech unit waveforms acquired from a continuous memory region by one access
US20080154605A1 (en)*2006-12-212008-06-26International Business Machines CorporationAdaptive quality adjustments for speech synthesis in a real-time speech processing system based upon load
US20090018836A1 (en)*2007-03-292009-01-15Kabushiki Kaisha ToshibaSpeech synthesis system and speech synthesis method
US8108216B2 (en)*2007-03-292012-01-31Kabushiki Kaisha ToshibaSpeech synthesis system and speech synthesis method
US20130268275A1 (en)*2007-09-072013-10-10Nuance Communications, Inc.Speech synthesis system, speech synthesis program product, and speech synthesis method
US9275631B2 (en)*2007-09-072016-03-01Nuance Communications, Inc.Speech synthesis system, speech synthesis program product, and speech synthesis method

Also Published As

Publication numberPublication date
US7039588B2 (en)2006-05-02
US20050027532A1 (en)2005-02-03

Similar Documents

PublicationPublication DateTitle
US7039588B2 (en)Synthesis unit selection apparatus and method, and storage medium
US6980955B2 (en)Synthesis unit selection apparatus and method, and storage medium
US6778960B2 (en)Speech information processing method and apparatus and storage medium
US7124083B2 (en)Method and system for preselection of suitable units for concatenative speech
CA2181000C (en)System and method for determining pitch contours
US20010032078A1 (en)Speech information processing method and apparatus and storage medium
US20060259303A1 (en)Systems and methods for pitch smoothing for text-to-speech synthesis
US20020051955A1 (en)Speech signal processing apparatus and method, and storage medium
US20010032079A1 (en)Speech signal processing apparatus and method, and storage medium
EP0942409B1 (en)Phoneme-based speech synthesis
US20060229877A1 (en)Memory usage in a text-to-speech system
JP2003295880A (en) Speech synthesis system that connects recorded speech and synthesized speech
US8478595B2 (en)Fundamental frequency pattern generation apparatus and fundamental frequency pattern generation method
US6832192B2 (en)Speech synthesizing method and apparatus
JP4454780B2 (en) Audio information processing apparatus, method and storage medium
JP2853731B2 (en) Voice recognition device
US6202048B1 (en)Phonemic unit dictionary based on shifted portions of source codebook vectors, for text-to-speech synthesis
JP4533255B2 (en) Speech synthesis apparatus, speech synthesis method, speech synthesis program, and recording medium therefor
US9230536B2 (en)Voice synthesizer
JPH06318094A (en) Speech rule synthesizer
JP2004354644A (en) Speech synthesis method and apparatus, computer program thereof, and information storage medium storing the same
JP2005091747A (en) Speech synthesizer
JP3576792B2 (en) Voice information processing method
JP3423276B2 (en) Voice synthesis method
KR100759172B1 (en)Sound synthesizing device, sound synthesizing method, and storage medium storing sound synthesizing program therein

Legal Events

DateCodeTitleDescription
STCBInformation on status: application discontinuation

Free format text:ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION


[8]ページ先頭

©2009-2025 Movatter.jp