US20100324895A1 - Synchronization for document narration - Google Patents

Synchronization for document narration

Publication number
US20100324895A1
Authority
US
United States
Prior art keywords
text
portions
expected
recognized
elapsed time
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US12/687,240
Inventor
Raymond C. Kurzweil
Paul Albrecht
Peter Chapman
Lucy Gibson
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
EM ACQUISITION CORP Inc
K-NFB HOLDING TECHNOLOGY Inc
Original Assignee
K NFB READING Tech Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2009-01-15
Filing date
2010-01-14
Publication date
2010-12-23
Application filed by K NFB READING Tech Inc
Priority to US12/687,240 (US20100324895A1)
Priority to PCT/US2010/021104 (WO2010083354A1)
Assigned to K-NFB READING TECHNOLOGY, INC.: assignment of assignors interest (see document for details). Assignors: ALBRECHT, PAUL; CHAPMAN, PETER; KURZWEIL, RAYMOND C.; GIBSON, LUCY
Publication of US20100324895A1
Assigned to K-NFB HOLDING TECHNOLOGY, INC.: change of name (see document for details). Assignors: K-NFB READING TECHNOLOGY, INC.
Assigned to K-NFB READING TECHNOLOGY, INC.: assignment of assignors interest (see document for details). Assignors: K-NFB HOLDING TECHNOLOGY, INC.
Assigned to FISH & RICHARDSON P.C.: lien (see document for details). Assignors: K-NFB HOLDING TECHNOLOGY, IMC.
Assigned to DIMENSIONAL STACK ASSETS LLC: assignment of assignors interest (see document for details). Assignors: K-NFB READING TECHNOLOGY, INC.
Assigned to EM ACQUISITION CORP., INC.: assignment of assignors interest (see document for details). Assignors: DIMENSIONAL STACK ASSETS, LLC
Assigned to DIMENSIONAL STACK ASSETS LLC: release by secured party (see document for details). Assignors: FISH & RICHARDSON P.C.
Legal status: Abandoned (current)

Abstract

Disclosed are techniques and systems for synchronizing an audio file with a sequence of words displayed on a user interface.

Claims (19)

1. A computer implemented method comprising:
applying speech recognition by one or more computer systems to an audio recording to generate a text version of recognized portions of text;
determining by the one or more computer systems an elapsed time period from a reference time in the audio recording to each recognized portion in the audio recording;
comparing by the one or more computer systems the recognized portions of text to expected portions of text; and
generating by the one or more computer systems a timing file that is stored on a computer-readable storage medium, the timing file comprising the elapsed time information for each expected portion of text by:
storing the elapsed time information for a recognized portion into the timing file if the recognized portion matches the corresponding expected portion of text; and otherwise
computing the elapsed time information for the expected portion of text and storing the computed elapsed time information into the timing file if the recognized portion does not match the corresponding expected portion of text.
8. A computer program product residing on a computer readable medium, the computer program product comprising instructions for causing a processor to:
apply speech recognition to an audio recording to generate a text version of recognized portions of text;
determine an elapsed time period from a reference time in the audio recording to each recognized portion in the audio recording;
generate a timing file that is stored on a computer-readable storage medium, the timing file comprising the elapsed time information for each expected portion of text by:
storing the elapsed time information for a recognized portion into the timing file if the recognized portion matches the corresponding expected portion of text; and otherwise
computing the elapsed time information for the expected portion of text and storing the computed elapsed time information into the timing file if the recognized portion does not match the expected portion of text.
14. A system comprising:
a memory; and
a computing device configured to:
apply speech recognition to an audio recording to generate a text version of recognized portions of text;
determine an elapsed time period from a reference time in the audio recording to each recognized portion in the audio recording;
generate a timing file that is stored on a computer-readable storage medium, the timing file comprising the elapsed time information for each expected portion of text by:
storing the elapsed time information for a recognized portion into the timing file if the recognized portion matches the corresponding expected portion of text; and otherwise
computing the elapsed time information for the expected portion of text and storing the computed elapsed time information into the timing file if the recognized portion does not match the expected portion of text.
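
The timing-file generation recited in the claims can be illustrated with a minimal Python sketch. It is not the applicants' implementation. Several details are assumptions made only for illustration: the function names `build_timing` and `write_timing_file` are hypothetical, the comparison of recognized and expected words uses `difflib.SequenceMatcher` from the standard library, the "computed" elapsed time for an unmatched expected word is a linear interpolation between the nearest matched neighbors, and the timing file is written as JSON. The claims do not specify any of these choices.

```python
import json
from difflib import SequenceMatcher


def build_timing(expected, recognized):
    """Return a list of (expected_word, elapsed_seconds) pairs.

    expected   -- list of words the narration is supposed to contain
    recognized -- list of (word, elapsed_seconds) pairs from a speech
                  recognizer, measured from a reference time in the audio
    """
    exp_words = [w.lower() for w in expected]
    rec_words = [w.lower() for w, _ in recognized]
    times = [None] * len(expected)

    # Matching case: copy the recognizer's elapsed time for every
    # expected word that aligns with an identical recognized word.
    matcher = SequenceMatcher(a=exp_words, b=rec_words, autojunk=False)
    for block in matcher.get_matching_blocks():
        for k in range(block.size):
            times[block.a + k] = recognized[block.b + k][1]

    known = [i for i, t in enumerate(times) if t is not None]
    if not known:
        raise ValueError("no expected word matched the recognized text")

    # Non-matching case (assumed strategy): interpolate linearly between
    # the nearest matched neighbors, or reuse the single available
    # neighbor's time when the gap is at the start or end of the text.
    for i in range(len(times)):
        if times[i] is not None:
            continue
        prev = max((k for k in known if k < i), default=None)
        nxt = min((k for k in known if k > i), default=None)
        if prev is None:
            times[i] = times[nxt]
        elif nxt is None:
            times[i] = times[prev]
        else:
            frac = (i - prev) / (nxt - prev)
            times[i] = times[prev] + frac * (times[nxt] - times[prev])

    return list(zip(expected, times))


def write_timing_file(path, timing):
    """Store the timing entries as JSON (the file format is an assumption)."""
    entries = [{"word": w, "elapsed": t} for w, t in timing]
    with open(path, "w", encoding="utf-8") as handle:
        json.dump(entries, handle, indent=2)
```

For example, if the expected text is "the quick brown fox" and the recognizer mishears "brown" as "crown", the three matching words keep their recognized times and "brown" receives a time interpolated from "quick" and "fox". A production system would likely need a more robust alignment and a better estimate for unmatched words than this sketch provides.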
US12/687,240 | priority 2009-01-15 | filed 2010-01-14 | Synchronization for document narration | Abandoned | US20100324895A1 (en)

Priority Applications (2)

Application Number | Publication | Priority Date | Filing Date | Title
US12/687,240 | US20100324895A1 (en) | 2009-01-15 | 2010-01-14 | Synchronization for document narration
PCT/US2010/021104 | WO2010083354A1 (en) | 2009-01-15 | 2010-01-15 | Systems and methods for multiple voice document narration

Applications Claiming Priority (3)

Application Number | Publication | Priority Date | Filing Date | Title
US14494709P | - | 2009-01-15 | 2009-01-15 | -
US16596309P | - | 2009-04-02 | 2009-04-02 | -
US12/687,240 | US20100324895A1 (en) | 2009-01-15 | 2010-01-14 | Synchronization for document narration

Publications (1)

Publication Number | Publication Date
US20100324895A1 (en) | 2010-12-23

Family

Family ID: 43125169

Family Applications (7)

Application Number | Status | Publication | Priority Date | Filing Date | Title
US12/687,231 | Expired - Fee Related | US8498867B2 (en) | 2009-01-15 | 2010-01-14 | Systems and methods for selection and use of multiple characters for document narration
US12/687,213 | Expired - Fee Related | US8954328B2 (en) | 2009-01-15 | 2010-01-14 | Systems and methods for document narration with multiple characters having multiple moods
US12/687,271 | Expired - Fee Related | US8364488B2 (en) | 2009-01-15 | 2010-01-14 | Voice models for document narration
US12/687,208 | Expired - Fee Related | US8352269B2 (en) | 2009-01-15 | 2010-01-14 | Systems and methods for processing indicia for document narration
US12/687,240 | Abandoned | US20100324895A1 (en) | 2009-01-15 | 2010-01-14 | Synchronization for document narration
US12/687,202 | Expired - Fee Related | US8359202B2 (en) | 2009-01-15 | 2010-01-14 | Character models for document narration
US12/687,220 | Expired - Fee Related | US8498866B2 (en) | 2009-01-15 | 2010-01-14 | Systems and methods for multiple language document narration

Country Status (1)

Country | Link
US (7) | US8498867B2 (en)


CN111048062B (en)*2018-10-102022-10-04华为技术有限公司 Speech synthesis method and device
CN110399461A (en)*2019-07-192019-11-01腾讯科技(深圳)有限公司Data processing method, device, server and storage medium
US11394799B2 (en)*2020-05-072022-07-19Freeman Augustus JacksonMethods, systems, apparatuses, and devices for facilitating for generation of an interactive story based on non-interactive data
US11875797B2 (en)*2020-07-232024-01-16Pozotron Inc.Systems and methods for scripted audio production
CN115881145A (en)*2021-09-302023-03-31华为技术有限公司Voice processing and training method and electronic equipment
WO2024215857A1 (en)*2023-04-142024-10-17Apple Inc.Digital assistant for providing and modifying an output of an electronic document

Citations (109)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
US4397635A (en)*1982-02-191983-08-09Samuels Curtis AReading teaching system
US4636173A (en)*1985-12-121987-01-13Robert MossmanMethod for teaching reading
US4913539A (en)*1988-04-041990-04-03New York Institute Of TechnologyApparatus and method for lip-synching animation
US4965727A (en)*1984-09-131990-10-23Halamka John DComputer card
US5278943A (en)*1990-03-231994-01-11Bright Star Technology, Inc.Speech animation and inflection system
US5649060A (en)*1993-10-181997-07-15International Business Machines CorporationAutomatic indexing and aligning of audio and text using speech recognition
US5721827A (en)*1996-10-021998-02-24James LoganSystem for electrically distributing personalized information
US5732216A (en)*1996-10-021998-03-24Internet Angles, Inc.Audio message exchange system
US5737725A (en)*1996-01-091998-04-07U S West Marketing Resources Group, Inc.Method and system for automatically generating new voice files corresponding to new text from a script
US5786814A (en)*1995-11-031998-07-28Xerox CorporationComputer controlled display system activities using correlated graphical and timeline interfaces for controlling replay of temporal data representing collaborative activities
US5860064A (en)*1993-05-131999-01-12Apple Computer, Inc.Method and apparatus for automatic generation of vocal emotion in a synthetic text-to-speech system
US5953005A (en)*1996-06-281999-09-14Sun Microsystems, Inc.System and method for on-line multimedia access
US6017219A (en)*1997-06-182000-01-25International Business Machines CorporationSystem and method for interactive reading and language instruction
US6064957A (en)*1997-08-152000-05-16General Electric CompanyImproving speech recognition through text-based linguistic post-processing
US6068487A (en)*1998-10-202000-05-30Lernout & Hauspie Speech Products N.V.Speller for reading system
US6076059A (en)*1997-08-292000-06-13Digital Equipment CorporationMethod for aligning text with audio signals
US6081780A (en)*1998-04-282000-06-27International Business Machines CorporationTTS and prosody based authoring system
US6151576A (en)*1998-08-112000-11-21Adobe Systems IncorporatedMixing digitized speech and text using reliability indices
US6199076B1 (en)*1996-10-022001-03-06James LoganAudio program player including a dynamic program selection controller
US6226615B1 (en)*1997-08-062001-05-01British Broadcasting CorporationSpoken text display method and apparatus, for use in generating television signals
US6260011B1 (en)*2000-03-202001-07-10Microsoft CorporationMethods and apparatus for automatically synchronizing electronic audio files with electronic text files
US6263308B1 (en)*2000-03-202001-07-17Microsoft CorporationMethods and apparatus for performing speech recognition using acoustic models which are improved through an interactive process
US6324511B1 (en)*1998-10-012001-11-27Mindmaker, Inc.Method of and apparatus for multi-modal information presentation to computer users with dyslexia, reading disabilities or visual impairment
US20020080179A1 (en)*2000-12-252002-06-27Toshihiko OkabeData transfer method and data transfer device
US20020099552A1 (en)*2001-01-252002-07-25Darryl RubinAnnotating electronic information with audio clips
US6442518B1 (en)*1999-07-142002-08-27Compaq Information Technologies Group, L.P.Method for refining time alignments of closed captions
US6446041B1 (en)*1999-10-272002-09-03Microsoft CorporationMethod and system for providing audio playback of a multi-source document
US20020143534A1 (en)*2001-03-292002-10-03Koninklijke Philips Electronics N.V.Editing during synchronous playback
US6490557B1 (en)*1998-03-052002-12-03John C. JeppesenMethod and apparatus for training an ultra-large vocabulary, continuous speech, speaker independent, automatic speech recognition system and consequential database
US6505153B1 (en)*2000-05-222003-01-07Compaq Information Technologies Group, L.P.Efficient method for producing off-line closed captions
US20030014252A1 (en)*2001-05-102003-01-16Utaha ShizukaInformation processing apparatus, information processing method, recording medium, and program
US20030013073A1 (en)*2001-04-092003-01-16International Business Machines CorporationElectronic book with multimode I/O
US20030018663A1 (en)*2001-05-302003-01-23Cornette Ranjita K.Method and system for creating a multimedia electronic book
US20030028380A1 (en)*2000-02-022003-02-06Freeland Warwick PeterSpeech system
US6633741B1 (en)*2000-07-192003-10-14John G. PosaRecap, summary, and auxiliary information generation for electronic books
US20030212559A1 (en)*2002-05-092003-11-13Jianlei XieText-to-speech (TTS) for hand-held devices
US20030219706A1 (en)*2002-05-222003-11-27Nijim Yousef WasefTalking E-book
US20040135814A1 (en)*2003-01-152004-07-15Vendelin George DavidReading tool and method
US20040138881A1 (en)*2002-11-222004-07-15Olivier DivayAutomatic insertion of non-verbalized punctuation
US6792409B2 (en)*1999-12-202004-09-14Koninklijke Philips Electronics N.V.Synchronous reproduction in a speech recognition system
US20050021343A1 (en)*2003-07-242005-01-27Spencer Julian A.Q.Method and apparatus for highlighting during presentations
US20050096909A1 (en)*2003-10-292005-05-05Raimo BakisSystems and methods for expressive text-to-speech
US20050137867A1 (en)*2003-12-172005-06-23Miller Mark R.Method for electronically generating a synchronized textual transcript of an audio recording
US20050203750A1 (en)*2004-03-122005-09-15International Business Machines CorporationDisplaying text of speech in synchronization with the speech
US6947896B2 (en)*1998-09-022005-09-20International Business Machines CorporationText marking for deferred correction
US6961700B2 (en)*1996-09-242005-11-01Allvoice Computing PlcMethod and apparatus for processing the output of a speech recognition engine
US6961895B1 (en)*2000-08-102005-11-01Recording For The Blind & Dyslexic, IncorporatedMethod and apparatus for synchronization of text and audio data
US20060074659A1 (en)*2004-09-102006-04-06Adams Marilyn JAssessing fluency based on elapsed time
US20060111902A1 (en)*2004-11-222006-05-25Bravobrava L.L.C.System and method for assisting language learning
US20060149558A1 (en)*2001-07-172006-07-06Jonathan KahnSynchronized pattern recognition source data processed by manual or automatic means for creation of shared speaker-dependent speech user profile
US20060190249A1 (en)*2002-06-262006-08-24Jonathan KahnMethod for comparing a transcribed text file with a previously created file
US7110945B2 (en)*1999-07-162006-09-19Dreamations LlcInteractive book
US20060242595A1 (en)*2003-03-072006-10-26Hirokazu KizumiScroll display control
US7174295B1 (en)*1999-09-062007-02-06Nokia CorporationUser interface for text to speech conversion
US7191117B2 (en)*2000-06-092007-03-13British Broadcasting CorporationGeneration of subtitles or captions for moving pictures
US7194411B2 (en)*2001-02-262007-03-20Benjamin SlotznickMethod of displaying web pages to enable user access to text information that the user has difficulty reading
US7194693B2 (en)*2002-10-292007-03-20International Business Machines CorporationApparatus and method for automatically highlighting text in an electronic document
US20070106508A1 (en)*2003-04-292007-05-10Jonathan KahnMethods and systems for creating a second generation session file
US20070118378A1 (en)*2005-11-222007-05-24International Business Machines CorporationDynamically Changing Voice Attributes During Speech Synthesis Based upon Parameter Differentiation for Dialog Contexts
US20070174060A1 (en)*2001-12-202007-07-26Canon Kabushiki KaishaControl apparatus
US20070171189A1 (en)*2006-01-202007-07-26Primax Electronics Ltd.Auxiliary reading system of handheld electronic device
US20070271104A1 (en)*2006-05-192007-11-22Mckay MartinStreaming speech with synchronized highlighting generated by a server
US20080027726A1 (en)*2006-07-282008-01-31Eric Louis HansenText to audio mapping, and animation of the text
US7346506B2 (en)*2003-10-082008-03-18Agfa Inc.System and method for synchronized text display and audio playback
US7366671B2 (en)*2004-09-292008-04-29Inventec CorporationSpeech displaying system and method
US7366714B2 (en)*2000-03-232008-04-29Albert KrachmanMethod and system for providing electronic discovery on computer databases and archives using statement analysis to detect false statements and recover relevant data
US7376560B2 (en)*2001-10-122008-05-20Koninklijke Philips Electronics N.V.Speech recognition device to mark parts of a recognized text
US20080133219A1 (en)*2006-02-102008-06-05Spinvox LimitedMass-Scale, User-Independent, Device-Independent Voice Messaging System
US20080140412A1 (en)*2006-12-072008-06-12Jonathan Travis MillmanInteractive tutoring
US20080140313A1 (en)*2005-03-222008-06-12Searete Llc, A Limited Liability Corporation Of The State Of DelawareMap-based guide system and method
US20080140413A1 (en)*2006-12-072008-06-12Jonathan Travis MillmanSynchronization of audio to reading
US20080140652A1 (en)*2006-12-072008-06-12Jonathan Travis MillmanAuthoring tool
US7412643B1 (en)*1999-11-232008-08-12International Business Machines CorporationMethod and apparatus for linking representation and realization data
US20080195370A1 (en)*2005-08-262008-08-14Koninklijke Philips Electronics, N.V.System and Method For Synchronizing Sound and Manually Transcribed Text
US20080255837A1 (en)*2004-11-302008-10-16Jonathan KahnMethod for locating an audio segment within an audio file
US20080291325A1 (en)*2007-05-242008-11-27Microsoft CorporationPersonality-Based Device
US20090019389A1 (en)*2004-07-292009-01-15Andreas Matthias AustSystem and method for providing visual markers in electronic documents
US7483834B2 (en)*2001-07-182009-01-27Panasonic CorporationMethod and apparatus for audio navigation of an information appliance
US7487086B2 (en)*2002-05-102009-02-03Nexidia Inc.Transcript alignment
US7490040B2 (en)*2002-06-282009-02-10International Business Machines CorporationMethod and apparatus for preparing a document to be read by a text-to-speech reader
US20090048832A1 (en)*2005-11-082009-02-19Nec CorporationSpeech-to-text system, speech-to-text method, and speech-to-text program
US20090202226A1 (en)*2005-06-062009-08-13Texthelp Systems, Ltd.System and method for converting electronic text to a digital multimedia electronic book
US20100023330A1 (en)*2008-07-282010-01-28International Business Machines CorporationSpeed podcasting
US20100031142A1 (en)*2006-10-232010-02-04Nec CorporationContent summarizing system, method, and program
US7669111B1 (en)*1997-01-292010-02-23Philip R KrauseElectronic text reading environment enhancement method and apparatus
US20100057461A1 (en)*2007-02-062010-03-04Andreas NeubacherMethod and system for creating or updating entries in a speech recognition lexicon
US7693717B2 (en)*2006-04-122010-04-06Custom Speech Usa, Inc.Session file modification with annotation using speech recognition or text to speech
US20100094632A1 (en)*2005-09-272010-04-15At&T Corp,System and Method of Developing A TTS Voice
US20100169092A1 (en)*2008-11-262010-07-01Backes Steven JVoice interface ocx
US20100182325A1 (en)*2002-01-222010-07-22Gizmoz Israel 2002 Ltd.Apparatus and method for efficient animation of believable speaking 3d characters in real time
US20100216108A1 (en)*2009-02-202010-08-26Jackson Fish Market, LLCAudiovisual record of a user reading a book aloud for playback with a virtual book
US7809572B2 (en)*2005-07-202010-10-05Panasonic CorporationVoice quality change portion locating apparatus
US20100278453A1 (en)*2006-09-152010-11-04King Martin TCapture and display of annotations in paper and electronic documents
US20100281365A1 (en)*2006-10-192010-11-04Tae Hyeon KimEncoding method and apparatus and decoding method and apparatus
US20100299131A1 (en)*2009-05-212010-11-25Nexidia Inc.Transcript alignment
US20110054901A1 (en)*2009-08-282011-03-03International Business Machines CorporationMethod and apparatus for aligning texts
US7987244B1 (en)*2004-12-302011-07-26At&T Intellectual Property Ii, L.P.Network repository for voice fonts
US7996218B2 (en)*2005-03-072011-08-09Samsung Electronics Co., Ltd.User adaptive speech recognition method and apparatus
US8009966B2 (en)*2002-11-012011-08-30Synchro Arts LimitedMethods and apparatus for use in sound replacement with automatic synchronization to images
US20110213613A1 (en)*2006-04-032011-09-01Google Inc., a CA corporationAutomatic Language Model Update
US8036894B2 (en)*2006-02-162011-10-11Apple Inc.Multi-unit approach to text-to-speech synthesis
US8065142B2 (en)*2007-06-282011-11-22Nuance Communications, Inc.Synchronization of an input text of a speech with a recording of the speech
US20110288861A1 (en)*2010-05-182011-11-24K-NFB Technology, Inc.Audio Synchronization For Document Narration with User-Selected Playback
US8073694B2 (en)*2005-09-272011-12-06At&T Intellectual Property Ii, L.P.System and method for testing a TTS voice
US20110320189A1 (en)*2006-02-272011-12-29Dictaphone CorporationSystems and methods for filtering dictated and non-dictated sections of documents
US8103507B2 (en)*2005-12-302012-01-24Cisco Technology, Inc.Searchable multimedia stream
US8117034B2 (en)*2001-03-292012-02-14Nuance Communications Austria GmbhSynchronise an audio cursor and a text cursor during editing
US8131552B1 (en)*2000-11-212012-03-06At&T Intellectual Property Ii, L.P.System and method for automated multimedia content indexing and retrieval
US8131545B1 (en)*2008-09-252012-03-06Google Inc.Aligning a transcript to audio data

Family Cites Families (47)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
WO1993007562A1 (en)*1991-09-301993-04-15Riverrun TechnologyMethod and apparatus for managing information
US8073695B1 (en)*1992-12-092011-12-06Adrea, LLCElectronic book with voice emulation features
CA2119397C (en)*1993-03-192007-10-02Kim E.A. SilvermanImproved automated voice synthesis employing enhanced prosodic treatment of text, spelling of text and rate of annunciation
US6442523B1 (en)*1994-07-222002-08-27Steven H. SiegelMethod for the auditory navigation of text
JPH08328590A (en)*1995-05-291996-12-13Sanyo Electric Co LtdVoice synthesizer
US6282511B1 (en)*1996-12-042001-08-28At&TVoiced interface with hyperlinked information
US6052663A (en)*1997-06-272000-04-18Kurzweil Educational Systems, Inc.Reading system which reads aloud from an image representation of a document
JP3224760B2 (en)*1997-07-102001-11-05インターナショナル・ビジネス・マシーンズ・コーポレーション Voice mail system, voice synthesizing apparatus, and methods thereof
US6549750B1 (en)*1997-08-202003-04-15Ithaca Media CorporationPrinted book augmented with an electronically stored glossary
US7364068B1 (en)*1998-03-112008-04-29West CorporationMethods and apparatus for intelligent selection of goods and services offered to conferees
US6144938A (en)*1998-05-012000-11-07Sun Microsystems, Inc.Voice user interface with personality
US6199042B1 (en)*1998-06-192001-03-06L&H Applications Usa, Inc.Reading system
JP3703082B2 (en)*1998-10-022005-10-05インターナショナル・ビジネス・マシーンズ・コーポレーション Conversational computing with interactive virtual machines
JP2001034282A (en)*1999-07-212001-02-09Konami Co LtdVoice synthesizing method, dictionary constructing method for voice synthesis, voice synthesizer and computer readable medium recorded with voice synthesis program
JP3720230B2 (en)*2000-02-182005-11-24シャープ株式会社 Expression data control system, expression data control apparatus constituting the same, and recording medium on which the program is recorded
WO2001091109A1 (en)*2000-05-242001-11-29Stars 1-To-1Interactive voice communication method and system for information and entertainment
US6933928B1 (en)*2000-07-182005-08-23Scott E. LilienthalElectronic book player with audio synchronization
JP2002149560A (en)*2000-08-282002-05-24Sharp Corp Electronic mail device and electronic mail system
US6985913B2 (en)*2000-12-282006-01-10Casio Computer Co. Ltd.Electronic book data delivery apparatus, electronic book device and recording medium
US6970820B2 (en)*2001-02-262005-11-29Matsushita Electric Industrial Co., Ltd.Voice personalization of speech synthesizer
US7020663B2 (en)*2001-05-302006-03-28George M. HaySystem and method for the delivery of electronic books
JP2002358092A (en)*2001-06-012002-12-13Sony CorpVoice synthesizing system
US20030028377A1 (en)*2001-07-312003-02-06Noyes Albert W.Method and device for synthesizing and distributing voice types for voice-enabled devices
US6810378B2 (en)*2001-08-222004-10-26Lucent Technologies Inc.Method and apparatus for controlling a speech synthesis system to provide multiple styles of speech
US20060069567A1 (en)*2001-12-102006-03-30Tischer Steven NMethods, systems, and products for translating text to speech
US7483832B2 (en)*2001-12-102009-01-27At&T Intellectual Property I, L.P.Method and system for customizing voice translation of text to speech
JP2003334986A (en)*2002-05-222003-11-25Dainippon Printing Co Ltd Printing system
US20040024582A1 (en)*2002-07-032004-02-05Scott ShepardSystems and methods for aiding human translation
AU2002950502A0 (en)*2002-07-312002-09-12E-Clips Intelligent Agent Technologies Pty LtdAnimated messaging
US20040054694A1 (en)*2002-09-122004-03-18Piccionelli Gregory A.Remote personalization method
US20060074672A1 (en)*2002-10-042006-04-06Koninklijke Philips Electroinics N.V.Speech synthesis apparatus with personalized speech segments
DE102004012208A1 (en)*2004-03-122005-09-29Siemens Ag Individualization of speech output by adapting a synthesis voice to a target voice
US8666746B2 (en)*2004-05-132014-03-04At&T Intellectual Property Ii, L.P.System and method for generating customized text-to-speech voices
US7693719B2 (en)*2004-10-292010-04-06Microsoft CorporationProviding personalized voice font for text-to-speech applications
WO2006067744A2 (en)*2004-12-222006-06-29Koninklijke Philips Electronics N.V.Portable audio playback device and method for operation thereof
US7412389B2 (en)*2005-03-022008-08-12Yang George LDocument animation system
US8073697B2 (en)*2006-09-122011-12-06International Business Machines CorporationEstablishing a multimodal personality for a multimodal application
JP2008145234A (en)*2006-12-082008-06-26Denso CorpNavigation apparatus and program
US8438032B2 (en)*2007-01-092013-05-07Nuance Communications, Inc.System for tuning synthesized speech
US8886537B2 (en)*2007-03-202014-11-11Nuance Communications, Inc.Method and system for text-to-speech synthesis with personalized voice
KR20090047159A (en)*2007-11-072009-05-12삼성전자주식회사 Audio-book playback method and device
US8224652B2 (en)*2008-09-262012-07-17Microsoft CorporationSpeech and text driven HMM-based body animation synthesis
US8863212B2 (en)*2008-10-162014-10-14At&T Intellectual Property I, LpPresentation of an adaptive avatar
US8498867B2 (en)*2009-01-152013-07-30K-Nfb Reading Technology, Inc.Systems and methods for selection and use of multiple characters for document narration
US8346557B2 (en)2009-01-152013-01-01K-Nfb Reading Technology, Inc.Systems and methods document narration
US8150695B1 (en)*2009-06-182012-04-03Amazon Technologies, Inc.Presentation of written works based on character identities and attributes
US9218680B2 (en)*2010-09-012015-12-22K-Nfb Reading Technology, Inc.Systems and methods for rendering graphical content and glyphs

Patent Citations (116)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
US4397635A (en)*1982-02-191983-08-09Samuels Curtis AReading teaching system
US4965727A (en)*1984-09-131990-10-23Halamka John DComputer card
US4636173A (en)*1985-12-121987-01-13Robert MossmanMethod for teaching reading
US4913539A (en)*1988-04-041990-04-03New York Institute Of TechnologyApparatus and method for lip-synching animation
US5278943A (en)*1990-03-231994-01-11Bright Star Technology, Inc.Speech animation and inflection system
US5860064A (en)*1993-05-131999-01-12Apple Computer, Inc.Method and apparatus for automatic generation of vocal emotion in a synthetic text-to-speech system
US5649060A (en)*1993-10-181997-07-15International Business Machines CorporationAutomatic indexing and aligning of audio and text using speech recognition
US5786814A (en)*1995-11-031998-07-28Xerox CorporationComputer controlled display system activities using correlated graphical and timeline interfaces for controlling replay of temporal data representing collaborative activities
US5737725A (en)*1996-01-091998-04-07U S West Marketing Resources Group, Inc.Method and system for automatically generating new voice files corresponding to new text from a script
US5953005A (en)*1996-06-281999-09-14Sun Microsystems, Inc.System and method for on-line multimedia access
US6961700B2 (en)*1996-09-242005-11-01Allvoice Computing PlcMethod and apparatus for processing the output of a speech recognition engine
US6199076B1 (en)*1996-10-022001-03-06James LoganAudio program player including a dynamic program selection controller
US5721827A (en)*1996-10-021998-02-24James LoganSystem for electrically distributing personalized information
US5732216A (en)*1996-10-021998-03-24Internet Angles, Inc.Audio message exchange system
US7669111B1 (en)*1997-01-292010-02-23Philip R KrauseElectronic text reading environment enhancement method and apparatus
US6017219A (en)*1997-06-182000-01-25International Business Machines CorporationSystem and method for interactive reading and language instruction
US6226615B1 (en)*1997-08-062001-05-01British Broadcasting CorporationSpoken text display method and apparatus, for use in generating television signals
US6064957A (en)*1997-08-152000-05-16General Electric CompanyImproving speech recognition through text-based linguistic post-processing
US6076059A (en)*1997-08-292000-06-13Digital Equipment CorporationMethod for aligning text with audio signals
US6490557B1 (en)*1998-03-052002-12-03John C. JeppesenMethod and apparatus for training an ultra-large vocabulary, continuous speech, speaker independent, automatic speech recognition system and consequential database
US6081780A (en)*1998-04-282000-06-27International Business Machines CorporationTTS and prosody based authoring system
US6151576A (en)*1998-08-112000-11-21Adobe Systems IncorporatedMixing digitized speech and text using reliability indices
US6947896B2 (en)*1998-09-022005-09-20International Business Machines CorporationText marking for deferred correction
US6324511B1 (en)*1998-10-012001-11-27Mindmaker, Inc.Method of and apparatus for multi-modal information presentation to computer users with dyslexia, reading disabilities or visual impairment
US6068487A (en)*1998-10-202000-05-30Lernout & Hauspie Speech Products N.V.Speller for reading system
US6442518B1 (en)*1999-07-142002-08-27Compaq Information Technologies Group, L.P.Method for refining time alignments of closed captions
US7110945B2 (en)*1999-07-162006-09-19Dreamations LlcInteractive book
US20070011011A1 (en)*1999-07-162007-01-11Cogliano Mary AInteractive book
US7174295B1 (en)*1999-09-062007-02-06Nokia CorporationUser interface for text to speech conversion
US6446041B1 (en)*1999-10-272002-09-03Microsoft CorporationMethod and system for providing audio playback of a multi-source document
US7412643B1 (en)*1999-11-232008-08-12International Business Machines CorporationMethod and apparatus for linking representation and realization data
US6792409B2 (en)*1999-12-202004-09-14Koninklijke Philips Electronics N.V.Synchronous reproduction in a speech recognition system
US20030028380A1 (en)*2000-02-022003-02-06Freeland Warwick PeterSpeech system
US6260011B1 (en)*2000-03-202001-07-10Microsoft CorporationMethods and apparatus for automatically synchronizing electronic audio files with electronic text files
US6263308B1 (en)*2000-03-202001-07-17Microsoft CorporationMethods and apparatus for performing speech recognition using acoustic models which are improved through an interactive process
US7366714B2 (en)*2000-03-232008-04-29Albert KrachmanMethod and system for providing electronic discovery on computer databases and archives using statement analysis to detect false statements and recover relevant data
US6505153B1 (en)*2000-05-222003-01-07Compaq Information Technologies Group, L.P.Efficient method for producing off-line closed captions
US7191117B2 (en)*2000-06-092007-03-13British Broadcasting CorporationGeneration of subtitles or captions for moving pictures
US6633741B1 (en)*2000-07-192003-10-14John G. PosaRecap, summary, and auxiliary information generation for electronic books
US6961895B1 (en)*2000-08-102005-11-01Recording For The Blind & Dyslexic, IncorporatedMethod and apparatus for synchronization of text and audio data
US8131552B1 (en)*2000-11-212012-03-06At&T Intellectual Property Ii, L.P.System and method for automated multimedia content indexing and retrieval
US20020080179A1 (en)*2000-12-252002-06-27Toshihiko OkabeData transfer method and data transfer device
US20020099552A1 (en)*2001-01-252002-07-25Darryl RubinAnnotating electronic information with audio clips
US7194411B2 (en)*2001-02-262007-03-20Benjamin SlotznickMethod of displaying web pages to enable user access to text information that the user has difficulty reading
US8117034B2 (en)*2001-03-292012-02-14Nuance Communications Austria GmbhSynchronise an audio cursor and a text cursor during editing
US20020143534A1 (en)*2001-03-292002-10-03Koninklijke Philips Electronics N.V.Editing during synchronous playback
US20030013073A1 (en)*2001-04-092003-01-16International Business Machines CorporationElectronic book with multimode I/O
US20030014252A1 (en)*2001-05-102003-01-16Utaha ShizukaInformation processing apparatus, information processing method, recording medium, and program
US20030018663A1 (en)*2001-05-302003-01-23Cornette Ranjita K.Method and system for creating a multimedia electronic book
US20060149558A1 (en)*2001-07-172006-07-06Jonathan KahnSynchronized pattern recognition source data processed by manual or automatic means for creation of shared speaker-dependent speech user profile
US7483834B2 (en)*2001-07-182009-01-27Panasonic CorporationMethod and apparatus for audio navigation of an information appliance
US7376560B2 (en)*2001-10-122008-05-20Koninklijke Philips Electronics N.V.Speech recognition device to mark parts of a recognized text
US20070174060A1 (en)*2001-12-202007-07-26Canon Kabushiki KaishaControl apparatus
US20100182325A1 (en)*2002-01-222010-07-22Gizmoz Israel 2002 Ltd.Apparatus and method for efficient animation of believable speaking 3d characters in real time
US20030212559A1 (en)*2002-05-092003-11-13Jianlei XieText-to-speech (TTS) for hand-held devices
US7487086B2 (en)*2002-05-102009-02-03Nexidia Inc.Transcript alignment
US20030219706A1 (en)*2002-05-222003-11-27Nijim Yousef WasefTalking E-book
US20060190249A1 (en)*2002-06-262006-08-24Jonathan KahnMethod for comparing a transcribed text file with a previously created file
US7490040B2 (en)*2002-06-282009-02-10International Business Machines CorporationMethod and apparatus for preparing a document to be read by a text-to-speech reader
US7953601B2 (en)*2002-06-282011-05-31Nuance Communications, Inc.Method and apparatus for preparing a document to be read by text-to-speech reader
US7194693B2 (en)*2002-10-292007-03-20International Business Machines CorporationApparatus and method for automatically highlighting text in an electronic document
US20070124672A1 (en)*2002-10-292007-05-31International Business Machines CorporationApparatus and method for automatically highlighting text in an electronic document
US8009966B2 (en)*2002-11-012011-08-30Synchro Arts LimitedMethods and apparatus for use in sound replacement with automatic synchronization to images
US20040138881A1 (en)*2002-11-222004-07-15Olivier DivayAutomatic insertion of non-verbalized punctuation
US20040135814A1 (en)*2003-01-152004-07-15Vendelin George DavidReading tool and method
US20060242595A1 (en)*2003-03-072006-10-26Hirokazu KizumiScroll display control
US20070106508A1 (en)*2003-04-292007-05-10Jonathan KahnMethods and systems for creating a second generation session file
US7979281B2 (en)*2003-04-292011-07-12Custom Speech Usa, Inc.Methods and systems for creating a second generation session file
US20050021343A1 (en)*2003-07-242005-01-27Spencer Julian A.Q.Method and apparatus for highlighting during presentations
US7346506B2 (en)*2003-10-082008-03-18Agfa Inc.System and method for synchronized text display and audio playback
US20050096909A1 (en)*2003-10-292005-05-05Raimo BakisSystems and methods for expressive text-to-speech
US20050137867A1 (en)*2003-12-172005-06-23Miller Mark R.Method for electronically generating a synchronized textual transcript of an audio recording
US20050203750A1 (en)*2004-03-122005-09-15International Business Machines CorporationDisplaying text of speech in synchronization with the speech
US20090019389A1 (en)*2004-07-292009-01-15Andreas Matthias AustSystem and method for providing visual markers in electronic documents
US20060074659A1 (en)*2004-09-102006-04-06Adams Marilyn JAssessing fluency based on elapsed time
US7366671B2 (en)*2004-09-292008-04-29Inventec CorporationSpeech displaying system and method
US20060111902A1 (en)*2004-11-222006-05-25Bravobrava L.L.C.System and method for assisting language learning
US20080255837A1 (en)*2004-11-302008-10-16Jonathan KahnMethod for locating an audio segment within an audio file
US7987244B1 (en)*2004-12-302011-07-26At&T Intellectual Property Ii, L.P.Network repository for voice fonts
US7996218B2 (en)*2005-03-072011-08-09Samsung Electronics Co., Ltd.User adaptive speech recognition method and apparatus
US20080140313A1 (en)*2005-03-222008-06-12Searete Llc, A Limited Liability Corporation Of The State Of DelawareMap-based guide system and method
US20090202226A1 (en)*2005-06-062009-08-13Texthelp Systems, Ltd.System and method for converting electronic text to a digital multimedia electronic book
US7809572B2 (en)*2005-07-202010-10-05Panasonic CorporationVoice quality change portion locating apparatus
US20080195370A1 (en)*2005-08-262008-08-14Koninklijke Philips Electronics, N.V.System and Method For Synchronizing Sound and Manually Transcribed Text
US8073694B2 (en)*2005-09-272011-12-06At&T Intellectual Property Ii, L.P.System and method for testing a TTS voice
US20100094632A1 (en)*2005-09-272010-04-15At&T Corp,System and Method of Developing A TTS Voice
US20090048832A1 (en)*2005-11-082009-02-19Nec CorporationSpeech-to-text system, speech-to-text method, and speech-to-text program
US8155958B2 (en)*2005-11-082012-04-10Nec CorporationSpeech-to-text system, speech-to-text method, and speech-to-text program
US20070118378A1 (en)*2005-11-222007-05-24International Business Machines CorporationDynamically Changing Voice Attributes During Speech Synthesis Based upon Parameter Differentiation for Dialog Contexts
US8103507B2 (en)*2005-12-302012-01-24Cisco Technology, Inc.Searchable multimedia stream
US20070171189A1 (en)*2006-01-202007-07-26Primax Electronics Ltd.Auxiliary reading system of handheld electronic device
US20080133219A1 (en)*2006-02-102008-06-05Spinvox LimitedMass-Scale, User-Independent, Device-Independent Voice Messaging System
US8036894B2 (en)*2006-02-162011-10-11Apple Inc.Multi-unit approach to text-to-speech synthesis
US20110320189A1 (en)*2006-02-272011-12-29Dictaphone CorporationSystems and methods for filtering dictated and non-dictated sections of documents
US20110213613A1 (en)*2006-04-032011-09-01Google Inc., a CA corporationAutomatic Language Model Update
US7693717B2 (en)*2006-04-122010-04-06Custom Speech Usa, Inc.Session file modification with annotation using speech recognition or text to speech
US20070271104A1 (en)*2006-05-192007-11-22Mckay MartinStreaming speech with synchronized highlighting generated by a server
US20080027726A1 (en)*2006-07-282008-01-31Eric Louis HansenText to audio mapping, and animation of the text
US20100278453A1 (en)*2006-09-152010-11-04King Martin TCapture and display of annotations in paper and electronic documents
US20100281365A1 (en)*2006-10-192010-11-04Tae Hyeon KimEncoding method and apparatus and decoding method and apparatus
US20100031142A1 (en)*2006-10-232010-02-04Nec CorporationContent summarizing system, method, and program
US20080140412A1 (en)*2006-12-072008-06-12Jonathan Travis MillmanInteractive tutoring
US20080140413A1 (en)*2006-12-072008-06-12Jonathan Travis MillmanSynchronization of audio to reading
US20080140652A1 (en)*2006-12-072008-06-12Jonathan Travis MillmanAuthoring tool
US20100057461A1 (en)*2007-02-062010-03-04Andreas NeubacherMethod and system for creating or updating entries in a speech recognition lexicon
US20080291325A1 (en)*2007-05-242008-11-27Microsoft CorporationPersonality-Based Device
US8065142B2 (en)*2007-06-282011-11-22Nuance Communications, Inc.Synchronization of an input text of a speech with a recording of the speech
US20120041758A1 (en)*2007-06-282012-02-16Nuance Communications, Inc.Synchronization of an input text of a speech with a recording of the speech
US20100023330A1 (en)*2008-07-282010-01-28International Business Machines CorporationSpeed podcasting
US8131545B1 (en)*2008-09-252012-03-06Google Inc.Aligning a transcript to audio data
US20100169092A1 (en)*2008-11-262010-07-01Backes Steven JVoice interface ocx
US20100216108A1 (en)*2009-02-202010-08-26Jackson Fish Market, LLCAudiovisual record of a user reading a book aloud for playback with a virtual book
US20100299131A1 (en)*2009-05-212010-11-25Nexidia Inc.Transcript alignment
US20110054901A1 (en)*2009-08-282011-03-03International Business Machines CorporationMethod and apparatus for aligning texts
US20110288861A1 (en)*2010-05-182011-11-24K-NFB Technology, Inc.Audio Synchronization For Document Narration with User-Selected Playback
US8392186B2 (en)*2010-05-182013-03-05K-Nfb Reading Technology, Inc.Audio synchronization for document narration with user-selected playback

Non-Patent Citations (11)

* Cited by examiner, † Cited by third party
Title
Alexander Haubold, John R. Kender, "Alignment of Speech to Highly Imperfect Text Transcriptions" ICME 2007: 224-227.*
Biatov. "Large Text and Audio Data Alignment for Multimedia Applications" 2003.*
Cardinal et al. "Segmentation of Recordings Based on Partial Transcriptions" 2005.*
Fisher, W.M.; Fiscus, J.G. "Better alignment procedures for speech recognition evaluation", 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-93), Vol. 2, pp. 59-62, 27-30 April 1993.*
Hazen. "Automatic Alignment and Error Correction of Human Generated Transcripts for Long Speech Recordings" 2006.*
J. Picone, G.R. Doddington, and D.S. Pallett, "Phone-Mediated Word Alignment for Speech Recognition Evaluation", IEEE Trans. ASSP, Vol. 38, No. 3, March 1990, pp. 559-562.*
J. Picone, K.M. Goudie-Marshall, G.R. Doddington, and W. Fisher, "Automatic Text Alignment for Speech System Evaluation", IEEE Trans. ASSP, Vol. ASSP-34, No. 4, August 1986, pp. 780-784.*
Lynn Wilcox, John S. Boreczky: Annotation and Segmentation for Multimedia Indexing and Retrieval. HICSS (2) 1998: 259-266.*
Mohamed El-Helaly, Aishy Amer. Synchronization of Processed Audio-Video Signals using Time-Stamps. IEEE International Conference on Image Processing. San Antonio, TX: IEEE, 2007, pp. 193-196.*
Moreno, P.J. et al., "A Recursive Algorithm for the Forced Alignment of Very Long Audio Segments," in Proceedings, ICSLP, 1998.*
Vignoli et al. "A Segmental Time-Alignment Technique for Text-Speech Synchronization" 1999.*

Cited By (289)

* Cited by examiner, † Cited by third party
Publication number, Priority date, Publication date, Assignee, Title
US9646614B2 (en)2000-03-162017-05-09Apple Inc.Fast, language-independent method for user authentication by voice
US11928604B2 (en)2005-09-082024-03-12Apple Inc.Method and apparatus for building an intelligent automated assistant
US10318871B2 (en)2005-09-082019-06-11Apple Inc.Method and apparatus for building an intelligent automated assistant
US10568032B2 (en)2007-04-032020-02-18Apple Inc.Method and system for operating a multi-function portable electronic device using voice-activation
US11012942B2 (en)2007-04-032021-05-18Apple Inc.Method and system for operating a multi-function portable electronic device using voice-activation
US11023513B2 (en)2007-12-202021-06-01Apple Inc.Method and apparatus for searching using an active ontology
US10381016B2 (en)2008-01-032019-08-13Apple Inc.Methods and apparatus for altering audio output signals
US9330720B2 (en)2008-01-032016-05-03Apple Inc.Methods and apparatus for altering audio output signals
US9865248B2 (en)2008-04-052018-01-09Apple Inc.Intelligent text-to-speech conversion
US9626955B2 (en)2008-04-052017-04-18Apple Inc.Intelligent text-to-speech conversion
US10140082B2 (en)2008-07-042018-11-27Booktrack Holdings LimitedMethod and system for making and playing soundtracks
US10095465B2 (en)2008-07-042018-10-09Booktrack Holdings LimitedMethod and system for making and playing soundtracks
US20110153047A1 (en)*2008-07-042011-06-23Booktrack Holdings LimitedMethod and System for Making and Playing Soundtracks
US10255028B2 (en)2008-07-042019-04-09Booktrack Holdings LimitedMethod and system for making and playing soundtracks
US10095466B2 (en)2008-07-042018-10-09Booktrack Holdings LimitedMethod and system for making and playing soundtracks
US9223864B2 (en)2008-07-042015-12-29Booktrack Holdings LimitedMethod and system for making and playing soundtracks
US9135333B2 (en)2008-07-042015-09-15Booktrack Holdings LimitedMethod and system for making and playing soundtracks
US10108612B2 (en)2008-07-312018-10-23Apple Inc.Mobile device having human language translation capability with positional feedback
US9535906B2 (en)2008-07-312017-01-03Apple Inc.Mobile device having human language translation capability with positional feedback
US10643611B2 (en)2008-10-022020-05-05Apple Inc.Electronic devices with voice command and contextual data processing capabilities
US11348582B2 (en)2008-10-022022-05-31Apple Inc.Electronic devices with voice command and contextual data processing capabilities
US8498866B2 (en)*2009-01-152013-07-30K-Nfb Reading Technology, Inc.Systems and methods for multiple language document narration
US20100318364A1 (en)*2009-01-152010-12-16K-Nfb Reading Technology, Inc.Systems and methods for selection and use of multiple characters for document narration
US20100324903A1 (en)*2009-01-152010-12-23K-Nfb Reading Technology, Inc.Systems and methods for document narration with multiple characters having multiple moods
US10088976B2 (en)2009-01-152018-10-02Em Acquisition Corp., Inc.Systems and methods for multiple voice document narration
US8954328B2 (en)2009-01-152015-02-10K-Nfb Reading Technology, Inc.Systems and methods for document narration with multiple characters having multiple moods
US20100324904A1 (en)*2009-01-152010-12-23K-Nfb Reading Technology, Inc.Systems and methods for multiple language document narration
US8498867B2 (en)*2009-01-152013-07-30K-Nfb Reading Technology, Inc.Systems and methods for selection and use of multiple characters for document narration
US11080012B2 (en)2009-06-052021-08-03Apple Inc.Interface for a virtual digital assistant
US10795541B2 (en)2009-06-052020-10-06Apple Inc.Intelligent organization of tasks items
US10475446B2 (en)2009-06-052019-11-12Apple Inc.Using context information to facilitate processing of commands in a virtual assistant
US9858925B2 (en)2009-06-052018-01-02Apple Inc.Using context information to facilitate processing of commands in a virtual assistant
US10283110B2 (en)2009-07-022019-05-07Apple Inc.Methods and apparatuses for automatic speech recognition
US20130219322A1 (en)*2010-01-112013-08-22Apple Inc.Electronic text manipulation and display
US10824322B2 (en)2010-01-112020-11-03Apple Inc.Electronic text manipulation and display
US20240272788A1 (en)*2010-01-112024-08-15Apple Inc.Electronic text manipulation and display
US10706841B2 (en)2010-01-182020-07-07Apple Inc.Task flow identification based on user intent
US10496753B2 (en)2010-01-182019-12-03Apple Inc.Automatically adapting user interfaces for hands-free interaction
US11423886B2 (en)2010-01-182022-08-23Apple Inc.Task flow identification based on user intent
US10705794B2 (en)2010-01-182020-07-07Apple Inc.Automatically adapting user interfaces for hands-free interaction
US9548050B2 (en)2010-01-182017-01-17Apple Inc.Intelligent automated assistant
US12087308B2 (en)2010-01-182024-09-10Apple Inc.Intelligent automated assistant
US10679605B2 (en)2010-01-182020-06-09Apple Inc.Hands-free list-reading by intelligent automated assistant
US10741185B2 (en)2010-01-182020-08-11Apple Inc.Intelligent automated assistant
US9318108B2 (en)2010-01-182016-04-19Apple Inc.Intelligent automated assistant
US10553209B2 (en)2010-01-182020-02-04Apple Inc.Systems and methods for hands-free notification summaries
US10276170B2 (en)2010-01-182019-04-30Apple Inc.Intelligent automated assistant
US10692504B2 (en)2010-02-252020-06-23Apple Inc.User profiling for voice input processing
US10049675B2 (en)2010-02-252018-08-14Apple Inc.User profiling for voice input processing
US9633660B2 (en)2010-02-252017-04-25Apple Inc.User profiling for voice input processing
US9478219B2 (en)2010-05-182016-10-25K-Nfb Reading Technology, Inc.Audio synchronization for document narration with user-selected playback
US8903723B2 (en)2010-05-182014-12-02K-Nfb Reading Technology, Inc.Audio synchronization for document narration with user-selected playback
US9645986B2 (en)2011-02-242017-05-09Google Inc.Method, medium, and system for creating an electronic book with an umbrella policy
US9501461B2 (en)2011-02-242016-11-22Google Inc.Systems and methods for manipulating user annotations in electronic books
US9063641B2 (en)2011-02-242015-06-23Google Inc.Systems and methods for remote collaborative studying using electronic books
US10067922B2 (en)2011-02-242018-09-04Google LlcAutomated study guide generation for electronic books
US8520025B2 (en)2011-02-242013-08-27Google Inc.Systems and methods for manipulating user annotations in electronic books
US8543941B2 (en)2011-02-242013-09-24Google Inc.Electronic book contextual menu systems and methods
US9262612B2 (en)2011-03-212016-02-16Apple Inc.Device access using voice authentication
US10102359B2 (en)2011-03-212018-10-16Apple Inc.Device access using voice authentication
US10417405B2 (en)2011-03-212019-09-17Apple Inc.Device access using voice authentication
US10057736B2 (en)2011-06-032018-08-21Apple Inc.Active transport based notifications
US11120372B2 (en)2011-06-032021-09-14Apple Inc.Performing actions associated with task items that represent tasks to perform
US10672399B2 (en)2011-06-032020-06-02Apple Inc.Switching between text data and audio data based on a mapping
WO2012167276A1 (en)*2011-06-032012-12-06Apple Inc.Automatically creating a mapping between text data and audio data
CN103703431A (en)*2011-06-032014-04-02苹果公司Automatically creating a mapping between text data and audio data
US10706373B2 (en)2011-06-032020-07-07Apple Inc.Performing actions associated with task items that represent tasks to perform
US10241644B2 (en)2011-06-032019-03-26Apple Inc.Actionable reminder entries
US11350253B2 (en)2011-06-032022-05-31Apple Inc.Active transport based notifications
US9666227B2 (en)2011-07-262017-05-30Booktrack Holdings LimitedSoundtrack for electronic text
US9613653B2 (en)2011-07-262017-04-04Booktrack Holdings LimitedSoundtrack for electronic text
US9613654B2 (en)2011-07-262017-04-04Booktrack Holdings LimitedSoundtrack for electronic text
US9798393B2 (en)2011-08-292017-10-24Apple Inc.Text correction processing
US10241752B2 (en)2011-09-302019-03-26Apple Inc.Interface for a virtual digital assistant
US9141404B2 (en)2011-10-242015-09-22Google Inc.Extensible framework for ereader tools
US9678634B2 (en)2011-10-242017-06-13Google Inc.Extensible framework for ereader tools
US9031493B2 (en)2011-11-182015-05-12Google Inc.Custom narration of electronic books
US20130191125A1 (en)*2012-01-252013-07-25Kabushiki Kaisha ToshibaTranscription supporting system and transcription supporting method
US11069336B2 (en)2012-03-022021-07-20Apple Inc.Systems and methods for name pronunciation
US9483461B2 (en)2012-03-062016-11-01Apple Inc.Handling speech synthesis of content for multiple languages
WO2013151610A1 (en)*2012-04-062013-10-10Google Inc.Synchronizing progress in audio and text versions of electronic books
US9953088B2 (en)2012-05-142018-04-24Apple Inc.Crowd sourcing information to fulfill user requests
US10102187B2 (en)2012-05-152018-10-16Google LlcExtensible framework for ereader tools, including named entity information
US11269678B2 (en)2012-05-152022-03-08Apple Inc.Systems and methods for integrating third party services with a digital assistant
US9069744B2 (en)2012-05-152015-06-30Google Inc.Extensible framework for ereader tools, including named entity information
US10079014B2 (en)2012-06-082018-09-18Apple Inc.Name recognition system
US9495129B2 (en)2012-06-292016-11-15Apple Inc.Device, method, and user interface for voice-activated navigation and browsing of a document
US9047356B2 (en)2012-09-052015-06-02Google Inc.Synchronizing multiple reading positions in electronic books
US9971774B2 (en)2012-09-192018-05-15Apple Inc.Voice-based media searching
US10714117B2 (en)2013-02-072020-07-14Apple Inc.Voice trigger for a digital assistant
US10978090B2 (en)2013-02-072021-04-13Apple Inc.Voice trigger for a digital assistant
WO2014137074A1 (en)*2013-03-052014-09-12Lg Electronics Inc.Mobile terminal and method of controlling the mobile terminal
KR101952179B1 (en)2013-03-052019-05-22엘지전자 주식회사Mobile terminal and control method for the mobile terminal
KR20140109167A (en)*2013-03-052014-09-15엘지전자 주식회사Mobile terminal and control method for the mobile terminal
US10241743B2 (en)2013-03-052019-03-26Lg Electronics Inc.Mobile terminal for matching displayed text with recorded external audio and method of controlling the mobile terminal
US11388291B2 (en)2013-03-142022-07-12Apple Inc.System and method for processing voicemail
US11798547B2 (en)2013-03-152023-10-24Apple Inc.Voice activated device for use with a voice-based digital assistant
US9323733B1 (en)2013-06-052016-04-26Google Inc.Indexed electronic book annotations
US9582608B2 (en)2013-06-072017-02-28Apple Inc.Unified ranking with entropy-weighted information for phrase-based semantic auto-completion
US9633674B2 (en)2013-06-072017-04-25Apple Inc.System and method for detecting errors in interactions with a voice-based digital assistant
US9966060B2 (en)2013-06-072018-05-08Apple Inc.System and method for user-specified pronunciation of words for speech synthesis and recognition
US9620104B2 (en)2013-06-072017-04-11Apple Inc.System and method for user-specified pronunciation of words for speech synthesis and recognition
US10657961B2 (en)2013-06-082020-05-19Apple Inc.Interpreting and acting upon commands that involve sharing information with remote devices
US9966068B2 (en)2013-06-082018-05-08Apple Inc.Interpreting and acting upon commands that involve sharing information with remote devices
US10176167B2 (en)2013-06-092019-01-08Apple Inc.System and method for inferring user intent from speech inputs
US11048473B2 (en)2013-06-092021-06-29Apple Inc.Device, method, and graphical user interface for enabling conversation persistence across two or more instances of a digital assistant
US10185542B2 (en)2013-06-092019-01-22Apple Inc.Device, method, and graphical user interface for enabling conversation persistence across two or more instances of a digital assistant
US10769385B2 (en)2013-06-092020-09-08Apple Inc.System and method for inferring user intent from speech inputs
US11727219B2 (en)2013-06-092023-08-15Apple Inc.System and method for inferring user intent from speech inputs
US12010262B2 (en)2013-08-062024-06-11Apple Inc.Auto-activating smart responses based on activities from remote devices
US9898077B2 (en)*2013-09-182018-02-20Booktrack Holdings LimitedPlayback system for synchronised soundtracks for electronic media content
US11314370B2 (en)2013-12-062022-04-26Apple Inc.Method for extracting salient dialog usage from live data
US10083690B2 (en)2014-05-302018-09-25Apple Inc.Better resolution when referencing to concepts
US10657966B2 (en)2014-05-302020-05-19Apple Inc.Better resolution when referencing to concepts
US10699717B2 (en)2014-05-302020-06-30Apple Inc.Intelligent assistant for home automation
US9842101B2 (en)2014-05-302017-12-12Apple Inc.Predictive conversion of language input
US9760559B2 (en)2014-05-302017-09-12Apple Inc.Predictive text input
US10714095B2 (en)2014-05-302020-07-14Apple Inc.Intelligent assistant for home automation
US11257504B2 (en)2014-05-302022-02-22Apple Inc.Intelligent assistant for home automation
US9715875B2 (en)2014-05-302017-07-25Apple Inc.Reducing the need for manual start/end-pointing and trigger phrases
US9966065B2 (en)2014-05-302018-05-08Apple Inc.Multi-command single utterance input method
US11133008B2 (en)2014-05-302021-09-28Apple Inc.Reducing the need for manual start/end-pointing and trigger phrases
US10169329B2 (en)2014-05-302019-01-01Apple Inc.Exemplar-based natural language processing
US10878809B2 (en)2014-05-302020-12-29Apple Inc.Multi-command single utterance input method
US10497365B2 (en)2014-05-302019-12-03Apple Inc.Multi-command single utterance input method
US10078631B2 (en)2014-05-302018-09-18Apple Inc.Entropy-guided text prediction using combined word and character n-gram language models
US9785630B2 (en)2014-05-302017-10-10Apple Inc.Text prediction using combined word N-gram and unigram language models
US10417344B2 (en)2014-05-302019-09-17Apple Inc.Exemplar-based natural language processing
US10904611B2 (en)2014-06-302021-01-26Apple Inc.Intelligent automated assistant for TV user interactions
US10659851B2 (en)2014-06-302020-05-19Apple Inc.Real-time digital assistant knowledge updates
US9338493B2 (en)2014-06-302016-05-10Apple Inc.Intelligent automated assistant for TV user interactions
US9668024B2 (en)2014-06-302017-05-30Apple Inc.Intelligent automated assistant for TV user interactions
US10446141B2 (en)2014-08-282019-10-15Apple Inc.Automatic speech recognition based on user feedback
US10431204B2 (en)2014-09-112019-10-01Apple Inc.Method and apparatus for discovering trending terms in speech requests
US9818400B2 (en)2014-09-112017-11-14Apple Inc.Method and apparatus for discovering trending terms in speech requests
US10789041B2 (en)2014-09-122020-09-29Apple Inc.Dynamic thresholds for always listening speech trigger
US10074360B2 (en)2014-09-302018-09-11Apple Inc.Providing an indication of the suitability of speech recognition
US10453443B2 (en)2014-09-302019-10-22Apple Inc.Providing an indication of the suitability of speech recognition
US9886432B2 (en)2014-09-302018-02-06Apple Inc.Parsimonious handling of word inflection via categorical stem + suffix N-gram language models
US10127911B2 (en)2014-09-302018-11-13Apple Inc.Speaker identification and unsupervised speaker adaptation techniques
US9646609B2 (en)2014-09-302017-05-09Apple Inc.Caching apparatus for serving phonetic pronunciations
US10390213B2 (en)2014-09-302019-08-20Apple Inc.Social reminders
US10438595B2 (en)2014-09-302019-10-08Apple Inc.Speaker identification and unsupervised speaker adaptation techniques
US9986419B2 (en)2014-09-302018-05-29Apple Inc.Social reminders
US9668121B2 (en)2014-09-302017-05-30Apple Inc.Social reminders
US11556230B2 (en)2014-12-022023-01-17Apple Inc.Data detection
US10552013B2 (en)2014-12-022020-02-04Apple Inc.Data detection
US9865280B2 (en)2015-03-062018-01-09Apple Inc.Structured dictation using intelligent automated assistants
US11231904B2 (en)2015-03-062022-01-25Apple Inc.Reducing response latency of intelligent automated assistants
US9886953B2 (en)2015-03-082018-02-06Apple Inc.Virtual assistant activation
US10529332B2 (en)2015-03-082020-01-07Apple Inc.Virtual assistant activation
US10930282B2 (en)2015-03-082021-02-23Apple Inc.Competing devices responding to voice triggers
US10567477B2 (en)2015-03-082020-02-18Apple Inc.Virtual assistant continuity
US11087759B2 (en)2015-03-082021-08-10Apple Inc.Virtual assistant activation
US9721566B2 (en)2015-03-082017-08-01Apple Inc.Competing devices responding to voice triggers
US10311871B2 (en)2015-03-082019-06-04Apple Inc.Competing devices responding to voice triggers
US9899019B2 (en)2015-03-182018-02-20Apple Inc.Systems and methods for structured stem and suffix language models
US9842105B2 (en)2015-04-162017-12-12Apple Inc.Parsimonious continuous-space phrase representations for natural language processing
US11468282B2 (en)2015-05-152022-10-11Apple Inc.Virtual assistant in a communication session
US10083688B2 (en)2015-05-272018-09-25Apple Inc.Device voice control for selecting a displayed affordance
US11127397B2 (en)2015-05-272021-09-21Apple Inc.Device voice control
US10127220B2 (en)2015-06-042018-11-13Apple Inc.Language identification from short strings
US10356243B2 (en)2015-06-052019-07-16Apple Inc.Virtual assistant aided communication with 3rd party service in a communication session
US10681212B2 (en)2015-06-052020-06-09Apple Inc.Virtual assistant aided communication with 3rd party service in a communication session
US10101822B2 (en)2015-06-052018-10-16Apple Inc.Language input correction
US10186254B2 (en)2015-06-072019-01-22Apple Inc.Context-based endpoint detection
US11025565B2 (en)2015-06-072021-06-01Apple Inc.Personalized prediction of responses for instant messaging
US10255907B2 (en)2015-06-072019-04-09Apple Inc.Automatic accent detection using acoustic models
US11010127B2 (en)2015-06-292021-05-18Apple Inc.Virtual assistant for media playback
US10747498B2 (en)2015-09-082020-08-18Apple Inc.Zero latency digital assistant
US10671428B2 (en)2015-09-082020-06-02Apple Inc.Distributed personal assistant
US11500672B2 (en)2015-09-082022-11-15Apple Inc.Distributed personal assistant
US11126400B2 (en)2015-09-082021-09-21Apple Inc.Zero latency digital assistant
US9697820B2 (en)2015-09-242017-07-04Apple Inc.Unit-selection text-to-speech synthesis using concatenation-sensitive neural networks
US11010550B2 (en)2015-09-292021-05-18Apple Inc.Unified language modeling framework for word prediction, auto-completion and auto-correction
US10366158B2 (en)2015-09-292019-07-30Apple Inc.Efficient word encoding for recurrent neural network language models
US11587559B2 (en)2015-09-302023-02-21Apple Inc.Intelligent device identification
US10691473B2 (en)2015-11-062020-06-23Apple Inc.Intelligent automated assistant in a messaging environment
US11526368B2 (en)2015-11-062022-12-13Apple Inc.Intelligent automated assistant in a messaging environment
US10049668B2 (en)2015-12-022018-08-14Apple Inc.Applying neural network language models to weighted finite state transducers for automatic speech recognition
US10354652B2 (en)2015-12-022019-07-16Apple Inc.Applying neural network language models to weighted finite state transducers for automatic speech recognition
US10223066B2 (en)2015-12-232019-03-05Apple Inc.Proactive assistance based on dialog communication between devices
US10942703B2 (en)2015-12-232021-03-09Apple Inc.Proactive assistance based on dialog communication between devices
US10446143B2 (en)2016-03-142019-10-15Apple Inc.Identification of voice inputs providing credentials
US9934775B2 (en)2016-05-262018-04-03Apple Inc.Unit-selection text-to-speech synthesis based on predicted concatenation parameters
US9972304B2 (en)2016-06-032018-05-15Apple Inc.Privacy preserving distributed evaluation framework for embedded personalized systems
US10249300B2 (en)2016-06-062019-04-02Apple Inc.Intelligent list reading
US11227589B2 (en)2016-06-062022-01-18Apple Inc.Intelligent list reading
US10049663B2 (en)2016-06-082018-08-14Apple, Inc.Intelligent automated assistant for media exploration
US11069347B2 (en)2016-06-082021-07-20Apple Inc.Intelligent automated assistant for media exploration
US10354011B2 (en)2016-06-092019-07-16Apple Inc.Intelligent automated assistant in a home environment
US10490187B2 (en)2016-06-102019-11-26Apple Inc.Digital assistant providing automated status report
US10067938B2 (en)2016-06-102018-09-04Apple Inc.Multilingual word prediction
US10733993B2 (en)2016-06-102020-08-04Apple Inc.Intelligent digital assistant in a multi-tasking environment
US10509862B2 (en)2016-06-102019-12-17Apple Inc.Dynamic phrase expansion of language input
US11037565B2 (en)2016-06-102021-06-15Apple Inc.Intelligent digital assistant in a multi-tasking environment
US10192552B2 (en)2016-06-102019-01-29Apple Inc.Digital assistant providing whispered speech
US11152002B2 (en)2016-06-112021-10-19Apple Inc.Application integration with a digital assistant
US10089072B2 (en)2016-06-112018-10-02Apple Inc.Intelligent device arbitration and control
US10580409B2 (en)2016-06-112020-03-03Apple Inc.Application integration with a digital assistant
US10297253B2 (en)2016-06-112019-05-21Apple Inc.Application integration with a digital assistant
US10521466B2 (en)2016-06-112019-12-31Apple Inc.Data driven natural language event detection and classification
US10269345B2 (en)2016-06-112019-04-23Apple Inc.Intelligent task discovery
US10942702B2 (en)2016-06-112021-03-09Apple Inc.Intelligent device arbitration and control
US10698951B2 (en)*2016-07-292020-06-30Booktrack Holdings LimitedSystems and methods for automatic-creation of soundtracks for speech audio
US10474753B2 (en)2016-09-072019-11-12Apple Inc.Language identification using recurrent neural networks
US10553215B2 (en)2016-09-232020-02-04Apple Inc.Intelligent automated assistant
US10043516B2 (en)2016-09-232018-08-07Apple Inc.Intelligent automated assistant
US11281993B2 (en)2016-12-052022-03-22Apple Inc.Model and ensemble compression for metric learning
US10593346B2 (en)2016-12-222020-03-17Apple Inc.Rank-reduced token representation for automatic speech recognition
US11204787B2 (en)2017-01-092021-12-21Apple Inc.Application integration with a digital assistant
US11656884B2 (en)2017-01-092023-05-23Apple Inc.Application integration with a digital assistant
US10417266B2 (en)2017-05-092019-09-17Apple Inc.Context-aware ranking of intelligent response suggestions
US10332518B2 (en)2017-05-092019-06-25Apple Inc.User interface for correcting recognition errors
US10741181B2 (en)2017-05-092020-08-11Apple Inc.User interface for correcting recognition errors
US11599331B2 (en)2017-05-112023-03-07Apple Inc.Maintaining privacy of personal information
US10755703B2 (en)2017-05-112020-08-25Apple Inc.Offline personal assistant
US10395654B2 (en)2017-05-112019-08-27Apple Inc.Text normalization based on a data-driven learning network
US10847142B2 (en)2017-05-112020-11-24Apple Inc.Maintaining privacy of personal information
US10726832B2 (en)2017-05-112020-07-28Apple Inc.Maintaining privacy of personal information
US10791176B2 (en)2017-05-122020-09-29Apple Inc.Synchronization and task delegation of a digital assistant
US10789945B2 (en)2017-05-122020-09-29Apple Inc.Low-latency intelligent automated assistant
US11301477B2 (en)2017-05-122022-04-12Apple Inc.Feedback analysis of a digital assistant
US11405466B2 (en)2017-05-122022-08-02Apple Inc.Synchronization and task delegation of a digital assistant
US11380310B2 (en)2017-05-122022-07-05Apple Inc.Low-latency intelligent automated assistant
US10410637B2 (en)2017-05-122019-09-10Apple Inc.User-specific acoustic models
US10482874B2 (en)2017-05-152019-11-19Apple Inc.Hierarchical belief states for digital assistants
US10810274B2 (en)2017-05-152020-10-20Apple Inc.Optimizing dialogue policy decisions for digital assistants using implicit feedback
US10909171B2 (en)2017-05-162021-02-02Apple Inc.Intelligent automated assistant for media exploration
US11217255B2 (en)2017-05-162022-01-04Apple Inc.Far-field extension for digital assistant services
US10311144B2 (en)2017-05-162019-06-04Apple Inc.Emoji word sense disambiguation
US11532306B2 (en)2017-05-162022-12-20Apple Inc.Detecting a trigger of a digital assistant
US10403278B2 (en)2017-05-162019-09-03Apple Inc.Methods and systems for phonetic matching in digital assistant services
US10748546B2 (en)2017-05-162020-08-18Apple Inc.Digital assistant services based on device capabilities
US10303715B2 (en)2017-05-162019-05-28Apple Inc.Intelligent automated assistant for media exploration
US10657328B2 (en)2017-06-022020-05-19Apple Inc.Multi-task recurrent neural network architecture for efficient morphology handling in neural language modeling
US10445429B2 (en)2017-09-212019-10-15Apple Inc.Natural language understanding using vocabularies with compressed serialized tries
US10755051B2 (en)2017-09-292020-08-25Apple Inc.Rule-based natural language processing
US10636424B2 (en)2017-11-302020-04-28Apple Inc.Multi-turn canned dialog
US11657725B2 (en)2017-12-222023-05-23Fathom Technologies, LLCE-reader interface system with audio and highlighting synchronization for digital books
US10671251B2 (en)2017-12-222020-06-02Arbordale Publishing, LLCInteractive eReader interface generation based on synchronization of textual and audial descriptors
US11443646B2 (en)*2017-12-222022-09-13Fathom Technologies, LLCE-Reader interface system with audio and highlighting synchronization for digital books
US10733982B2 (en)2018-01-082020-08-04Apple Inc.Multi-directional dialog
US10733375B2 (en)2018-01-312020-08-04Apple Inc.Knowledge-based framework for improving natural language understanding
US10789959B2 (en)2018-03-022020-09-29Apple Inc.Training speaker recognition models for digital assistants
US10592604B2 (en)2018-03-122020-03-17Apple Inc.Inverse text normalization for automatic speech recognition
US11710482B2 (en)2018-03-262023-07-25Apple Inc.Natural assistant interaction
US10818288B2 (en)2018-03-262020-10-27Apple Inc.Natural assistant interaction
US10909331B2 (en)2018-03-302021-02-02Apple Inc.Implicit identification of translation payload with neural machine translation
US10928918B2 (en)2018-05-072021-02-23Apple Inc.Raise to speak
US11854539B2 (en)2018-05-072023-12-26Apple Inc.Intelligent automated assistant for delivering content from user experiences
US11169616B2 (en)2018-05-072021-11-09Apple Inc.Raise to speak
US11145294B2 (en)2018-05-072021-10-12Apple Inc.Intelligent automated assistant for delivering content from user experiences
US10984780B2 (en)2018-05-212021-04-20Apple Inc.Global semantic word embeddings using bi-directional recurrent neural networks
US11009970B2 (en)2018-06-012021-05-18Apple Inc.Attention aware virtual assistant dismissal
US10720160B2 (en)2018-06-012020-07-21Apple Inc.Voice interaction at a primary device to access call functionality of a companion device
US10684703B2 (en)2018-06-012020-06-16Apple Inc.Attention aware virtual assistant dismissal
US11386266B2 (en)2018-06-012022-07-12Apple Inc.Text correction
US10892996B2 (en)2018-06-012021-01-12Apple Inc.Variable latency device coordination
US10984798B2 (en)2018-06-012021-04-20Apple Inc.Voice interaction at a primary device to access call functionality of a companion device
US11495218B2 (en)2018-06-012022-11-08Apple Inc.Virtual assistant operation in multi-device environments
US11431642B2 (en)2018-06-012022-08-30Apple Inc.Variable latency device coordination
US10403283B1 (en)2018-06-012019-09-03Apple Inc.Voice interaction at a primary device to access call functionality of a companion device
US10504518B1 (en)2018-06-032019-12-10Apple Inc.Accelerated task performance
US10944859B2 (en)2018-06-032021-03-09Apple Inc.Accelerated task performance
US10496705B1 (en)2018-06-032019-12-03Apple Inc.Accelerated task performance
US11010561B2 (en)2018-09-272021-05-18Apple Inc.Sentiment prediction from textual data
US11170166B2 (en)2018-09-282021-11-09Apple Inc.Neural typographical error modeling via generative adversarial networks
US10839159B2 (en)2018-09-282020-11-17Apple Inc.Named entity normalization in a spoken dialog system
US11462215B2 (en)2018-09-282022-10-04Apple Inc.Multi-modal inputs for voice commands
US11475898B2 (en)2018-10-262022-10-18Apple Inc.Low-latency multi-speaker speech recognition
US11638059B2 (en)2019-01-042023-04-25Apple Inc.Content playback on multiple devices
US11348573B2 (en)2019-03-182022-05-31Apple Inc.Multimodality in digital assistant systems
US11423908B2 (en)2019-05-062022-08-23Apple Inc.Interpreting spoken requests
US11307752B2 (en)2019-05-062022-04-19Apple Inc.User configurable task triggers
US11475884B2 (en)2019-05-062022-10-18Apple Inc.Reducing digital assistant latency when a language is incorrectly determined
US11217251B2 (en)2019-05-062022-01-04Apple Inc.Spoken notifications
US11140099B2 (en)2019-05-212021-10-05Apple Inc.Providing message response suggestions
US11289073B2 (en)2019-05-312022-03-29Apple Inc.Device text to speech
US11360739B2 (en)2019-05-312022-06-14Apple Inc.User activity shortcut suggestions
US11237797B2 (en)2019-05-312022-02-01Apple Inc.User activity shortcut suggestions
US11496600B2 (en)2019-05-312022-11-08Apple Inc.Remote execution of machine-learned models
US11657813B2 (en)2019-05-312023-05-23Apple Inc.Voice identification in digital assistant systems
US11360641B2 (en)2019-06-012022-06-14Apple Inc.Increasing the relevance of new available information
US11488406B2 (en)2019-09-252022-11-01Apple Inc.Text detection using global geometry estimators
US10805665B1 (en)2019-12-132020-10-13Bank Of America CorporationSynchronizing text-to-audio with interactive videos in the video framework
US11064244B2 (en)2019-12-132021-07-13Bank Of America CorporationSynchronizing text-to-audio with interactive videos in the video framework
US11350185B2 (en)2019-12-132022-05-31Bank Of America CorporationText-to-audio for interactive videos using a markup language
US11755276B2 (en)2020-05-122023-09-12Apple Inc.Reducing description length based on confidence

Also Published As

Publication number | Publication date
US8352269B2 (en) | 2013-01-08
US20100324905A1 (en) | 2010-12-23
US20100318364A1 (en) | 2010-12-16
US20100299149A1 (en) | 2010-11-25
US8498867B2 (en) | 2013-07-30
US8359202B2 (en) | 2013-01-22
US20100324903A1 (en) | 2010-12-23
US20100318363A1 (en) | 2010-12-16
US8364488B2 (en) | 2013-01-29
US8954328B2 (en) | 2015-02-10
US20100324904A1 (en) | 2010-12-23
US8498866B2 (en) | 2013-07-30

Similar Documents

Publication | Publication Date | Title
US20190196666A1 (en) | | Systems and Methods Document Narration
US8793133B2 (en) | | Systems and methods document narration
US8498867B2 (en) | | Systems and methods for selection and use of multiple characters for document narration
US9478219B2 (en) | | Audio synchronization for document narration with user-selected playback
US9330657B2 (en) | | Text-to-speech for digital literature
US20080027726A1 (en) | | Text to audio mapping, and animation of the text
KR20250033180A (en) | | Method and system for generating synthesis voice using style tag represented by natural language
JP2003295882A (en) | | Text structure for speech synthesis, speech synthesis method, speech synthesis apparatus, and computer program therefor
KR20220165666A (en) | | Method and system for generating synthesis voice using style tag represented by natural language
US20230377607A1 (en) | | Methods for dubbing audio-video media files
JP3936351B2 (en) | | Voice response service equipment
KR102585031B1 (en) | | Real-time foreign language pronunciation evaluation system and method
JP2009020264A (en) | | Speech synthesis apparatus, speech synthesis method, and program
WO2010083354A1 (en) | | Systems and methods for multiple voice document narration
JP6957069B1 (en) | | Learning support system
JP3760420B2 (en) | | Voice response service equipment
CN117475991A (en) | | Method and device for converting text into audio and computer equipment
Patil et al. | | Text Echo Personalized TTS System
CN119626203A (en) | | Character dubbing method, device, electronic device and storage medium

Legal Events

Date | Code | Title | Description

AS | Assignment
Owner name: K-NFB READING TECHNOLOGY, INC., MASSACHUSETTS
Free format text: ASSIGNMENT OF ASSIGNORS INTEREST; ASSIGNORS: KURZWEIL, RAYMOND C.; ALBRECHT, PAUL; CHAPMAN, PETER; AND OTHERS; SIGNING DATES FROM 20100329 TO 20100819; REEL/FRAME: 024921/0307

AS | Assignment
Owner name: K-NFB READING TECHNOLOGY, INC., MASSACHUSETTS
Free format text: ASSIGNMENT OF ASSIGNORS INTEREST; ASSIGNOR: K-NFB HOLDING TECHNOLOGY, INC.; REEL/FRAME: 030059/0351
Effective date: 20130315

Owner name: K-NFB HOLDING TECHNOLOGY, INC., MASSACHUSETTS
Free format text: CHANGE OF NAME; ASSIGNOR: K-NFB READING TECHNOLOGY, INC.; REEL/FRAME: 030058/0669
Effective date: 20130315

AS | Assignment
Owner name: FISH & RICHARDSON P.C., MINNESOTA
Free format text: LIEN; ASSIGNOR: K-NFB HOLDING TECHNOLOGY, IMC.; REEL/FRAME: 034599/0860
Effective date: 20141230

AS | Assignment
Owner name: DIMENSIONAL STACK ASSETS LLC, NEW YORK
Free format text: ASSIGNMENT OF ASSIGNORS INTEREST; ASSIGNOR: K-NFB READING TECHNOLOGY, INC.; REEL/FRAME: 035546/0205
Effective date: 20150302

AS | Assignment
Owner name: EM ACQUISITION CORP., INC., NEW YORK
Free format text: ASSIGNMENT OF ASSIGNORS INTEREST; ASSIGNOR: DIMENSIONAL STACK ASSETS, LLC; REEL/FRAME: 036593/0328
Effective date: 20150910

Owner name: DIMENSIONAL STACK ASSETS LLC, NEW YORK
Free format text: RELEASE BY SECURED PARTY; ASSIGNOR: FISH & RICHARDSON P.C.; REEL/FRAME: 036629/0762
Effective date: 20150830

STCB | Information on status: application discontinuation
Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION

