



도 1은 일반적인 단일 언어 TTS 처리 장치의 구성도.1 is a block diagram of a general single language TTS processing apparatus;
도 2는 종래의 다중언어 TTS 처리 절차를 보인 플루우챠트.2 is a flow chart showing a conventional multilingual TTS processing procedure.
도 3은 본 발명에 따른 이동통신단말기에서 다중 언어 TTS 처리 회로의 블록 구성도.3 is a block diagram of a multi-language TTS processing circuit in a mobile communication terminal according to the present invention.
도 4는 본 발명에 따른 이동통신단말기에서 다중 언어 TTS 처리 절차를 설명하기 위한 플로우챠트.4 is a flowchart illustrating a multi-language TTS processing procedure in a mobile communication terminal according to the present invention.
** 도면의 주요 부분에 대한 부호의 설명 **** Description of symbols for the main parts of the drawing **
100 : 다중언어 처리부100: multi-language processing unit
110 : TTS 엔진부110: TTS engine unit
120 : 메모리부120: memory
130 : 오디오 처리부130: audio processor
140 : 스피커140: speaker
본 발명은 이동통신단말기에서 다중 언어 TTS 처리 방법에 관한 것으로서, 특히 다중 언어를 TTS 처리 하는 경우, 텍스트 언어가 바뀌는 경우에 지연시간을 최소화하여 연속적으로 음성정보를 제공하기에 적당하도록 한 이동통신단말기에서 다중 언어 TTS 처리 방법에 관한 것이다.The present invention relates to a multi-language TTS processing method in a mobile communication terminal. Particularly, in the case of multi-language TTS processing, a mobile communication terminal suitable for continuously providing voice information by minimizing delay time when a text language is changed is provided. The present invention relates to a multi-language TTS processing method.
도 1은 일반적인 단일 언어 TTS 처리 장치의 구성도이다. 도 1을 참조하면, 소정의 언어로 입력된 문장은 단말기에 구비된 TTS 엔진(11)에 의해 오디오 웨이브 데이터(Audio Wave Data)로 변환된다. 이어, TTS 엔진(11)에 의해 변환된 오디오 웨이브 데이터는 오디오 처리부(12)에 의해 아날로그 음성 신호로 변환된다. 이어, 오디오 처리부(12)에 의해 변환된 아날로그 음성 신호는 스피커(120)를 통해 음성으로 내보내진다.1 is a block diagram of a general monolingual TTS processing apparatus. Referring to FIG. 1, a sentence input in a predetermined language is converted into audio wave data by the
이상에서 설명한 일반적인 단일 언어 TTS 처리 장치는 한 가지 종류의 언어(즉, 한국어 또는 영어 또는 일본어 등)로만 이루어진 문장에 대해서는 적절한 음성을 생성할 수 있으나, 여러 종류의 언어가 혼합되어 있는 문장, 즉 다중언어의 문장에 대해서는 적절한 음성을 생성하지 못하였다.The general single-language TTS processing apparatus described above may generate an appropriate voice for a sentence composed of only one kind of language (ie, Korean, English, Japanese, etc.), but a sentence in which several kinds of languages are mixed, that is, multiple We couldn't generate proper speech for sentences in language.
이러한 단점을 개선하기 위하여 기존의 TTS 처리 장치에 다수의 언어를 TTS 처리하는 다중언어 처리부와 복수의 TTS 엔진을 추가로 구비시키는 방안이 제시되었다.In order to improve this disadvantage, a method of additionally providing a multi-language processor and a plurality of TTS engines for TTS processing a plurality of languages in the existing TTS processing apparatus has been proposed.
도 2는 종래의 이동통신단말기에서 다중언어 TTS 처리 절차를 보인 플루우챠트이다.2 is a flow chart showing a multi-language TTS processing procedure in a conventional mobile communication terminal.
도 2를 참조하면, 이동통신단말기에 A,B,C 3개의 언어를 처리할 수 있는 TTS 엔진과 이에 상응하는 데이터 베이스가 각각 구비된 상태에서, 먼저 A 언어가 입력되면(S21), 단말기의 제어부는 A TTS 엔진을 구동시켜(S22), 입력된 A 언어를 이용하여 PCM 데이터를 생성한다(S23). 이어, 제어부는 생성된 PCM 데이터를 재생한다(S24). PCM 데이터의 생성이 완료되고, PCM 데이터의 재생이 완료된 경우(S25), 제어부는 B 언어 텍스트가 입력되는지를 판단한다(S26). S26 단계의 판단결과, B 언어가 입력되는 경우, B TTS 엔진을 구동하여(S27), S23 단계에서 S25 단계를 순차적으로 실행하고, B 언어가 입력되지 않는 경우 C 언어가 입력되는지를 판단한다(S28).Referring to FIG. 2, when a mobile communication terminal is provided with a TTS engine capable of processing three languages A, B, and C, and a corresponding database, respectively, A language is first input (S21). The control unit drives the A TTS engine (S22) to generate PCM data using the input A language (S23). Subsequently, the controller reproduces the generated PCM data (S24). When the generation of the PCM data is completed and the reproduction of the PCM data is completed (S25), the controller determines whether the B language text is input (S26). As a result of the determination in step S26, when the B language is input, the B TTS engine is driven (S27), and step S25 is sequentially executed in step S23, and when the B language is not input, it is determined whether the C language is input ( S28).
S28 단계의 판단결과, C 언어가 입력되는 경우, C TTS 엔진을 구동하여(S29), S23 단계에서 S25 단계를 순차적으로 실행하고, C 언어가 입력되지 않는 경우 다시 A 언어가 입력되는지를 판단하여(S30), 앞서 설명한 다중언어 TTS 처리 동작을 실행한다.As a result of the determination in step S28, when the C language is input, the C TTS engine is driven (S29), and step S25 is sequentially executed in step S23, and when the C language is not input, it is determined whether the A language is input again. (S30), the above-described multilingual TTS processing operation is executed.
그러나, 이와 같은 종래의 이동통신단말기에서 다중언어 TTS 처리 절차에서는 텍스트에 들어가 있는 언어가 바뀔 때 마다 언어에 맞는 TTS 엔진으로 교체해서 TTS 처리를 수행하기 때문에 TTS 엔진을 교체하는 시간동안 사용자가 기다려야하는 번거로움이 있었다. 즉, 텍스트에 들어가 있는 언어가 바뀌는 경우, 현재 구동중인 TTS 엔진의 동작을 중단하고, 새로운 텍스트에 들어가 있는 언어에 맞는 TTS 엔진을 구동하여 새로운 텍스트에 상응하는 PCM 데이터를 생성 및 재생하여야 하는데, 이 시간이 사용자들에겐 다소 지루한 시간이 될 수도 있는 것이다.However, in the conventional mobile communication terminal, the multi-language TTS processing procedure requires the user to wait for the time to replace the TTS engine because the TTS processing is performed by replacing the TTS engine with the language whenever the language in the text is changed. There was a hassle. In other words, when the language of the text is changed, the currently operating TTS engine must be stopped and the TTS engine corresponding to the language of the new text must be operated to generate and play PCM data corresponding to the new text. Time can be a bit tedious for users.
본 발명은 이상에서 설명한 종래의 기술을 감안하여 창출되어진 것으로서, 본 발명의 목적은 이동통신단말기에서 다중언어 TTS 처리하는 경우, 사용자가 바뀐 언어에 상응하는 음성 정보를 대기하는 시간을 최소화할 수 있는 이동통신단말기에서 다중 언어 TTS 처리 방법을 제공하기 위한 것이다.The present invention has been made in view of the conventional technology described above, and an object of the present invention is to minimize the time for a user to wait for voice information corresponding to a changed language when a multi-language TTS process is performed in a mobile communication terminal. It is to provide a multi-language TTS processing method in a mobile communication terminal.
본 발명의 다른 목적은 이동통신단말기에서 다중언어 TTS 처리하는 경우, 단말기 시스템을, 입력 텍스트에 상응하는 언어에 맞는 TTS 엔진을 구동하여 PCM 데이터를 생성 처리하는 TTS 테스크와, 생성된 PCM 데이터를 재생 처리하는 오디오 테스크로 구분하여 독립적으로 운용할 수 있는 이동통신단말기에서 다중 언어 TTS 처리 방법을 제공하기 위한 것이다.Another object of the present invention is to perform a multi-language TTS processing in a mobile communication terminal, the terminal system, the TTS task for generating and processing PCM data by running a TTS engine for a language corresponding to the input text, and reproduces the generated PCM data It is to provide a multi-language TTS processing method in a mobile communication terminal that can be independently operated by processing audio tasks.
상기한 목적을 달성하기 위해, 본 발명에 따른 이동통신단말기에서 다중 언어 TTS 처리 방법은, 입력된 텍스트를 언어의 종류에 따라 구분하여 해당 이동통신단말기에 구비된 TTS 엔진에 전달하는 제 1 단계; 해당 TTS 엔진을 구동시키고, 입력된 텍스트를 PCM 데이터로 변환하여 이동통신단말기에 구비된 복수개의 저장 매체 중 첫 번째 매체부터 차례로 저장하는 제 2 단계; 상기 입력된 텍스트를 PCM 데이터로 변환하는 동작이 완료된 경우, 다른 언어의 텍스트가 입력되었는지를 판단하는 제 3 단계; 및 상기 판단 단계의 결과, 새로운 언어의 텍스트가 입력된 경우, 기존의 구동중인 TTS 엔진을 정지시키고, 상기 새로운 언어의 TTS 엔진을 구동하여 새로운 언어의 텍스트를 PCM 데이터로 변환하는 제 4 단계로 이루어져, 상기 첫 번째 매체에 상기 PCM 데이터의 저장이 완료되면, 상기 복수개의 매체에 차례로 저장 된 PCM 데이터를 차례로 재생한다.In order to achieve the above object, a multi-language TTS processing method in a mobile communication terminal according to the present invention comprises: a first step of classifying the input text according to the type of language and delivering it to the TTS engine provided in the corresponding mobile communication terminal; A second step of driving the corresponding TTS engine and converting the input text into PCM data and sequentially storing the first one of a plurality of storage media provided in the mobile communication terminal; A third step of determining whether text of another language is input when the operation of converting the input text into PCM data is completed; And a fourth step of converting the text of the new language into PCM data by stopping the existing driving TTS engine when the text of the new language is input as a result of the determination step. When the storing of the PCM data is completed on the first medium, the PCM data stored in the plurality of media are sequentially played.
여기서, 본 발명에 따른 이동통신단말기에서 다중 언어 TTS 처리 방법은, 상기 복수개의 저장 매체에 재생해야할 PCM 데이터가 남아있는지를 판단하는 제 5 단계; 상기 재생해야할 PCM 데이터가 남아 있으면, 상기 PCM 데이터를 저장하고 있는 저장 매체의 개수가 미리 설정된 최대치(n개) 이상인지를 판단하는 제 6 단계와; 상기 판단 결과, 상기 저장 매체의 개수가 미리 설정된 최대치(n개) 이상이면, 상기 TTS 엔진의 동작을 중지할 것을 요청하는 제 7 단계를 더 포함한다.Here, the multi-language TTS processing method in the mobile communication terminal according to the present invention comprises: a fifth step of determining whether PCM data to be reproduced in the plurality of storage media remains; A sixth step of determining whether the number of storage media storing the PCM data is equal to or larger than a preset maximum value (n) if the PCM data to be reproduced remains; The determination may further include a seventh step of requesting to stop the operation of the TTS engine when the number of the storage media is greater than or equal to a preset maximum value (n).
여기서, 본 발명에 따른 이동통신단말기에서 다중 언어 TTS 처리 방법은, 상기 저장 매체의 개수가 미리 설정된 최대치(n개) 이상이 아니면, 재생하지 않은 저장 매체의 수가 미리 설정된 최소치(m개) 이하인지를 판단하는 제 8 단계; 재생하지 않은 저장 매체의 개수가 미리 설정된 최소치(m개) 이하인 경우, 상기 해당 TTS 엔진에게 입력된 텍스트를 PCM 데이터로 변환하여 비어있는 저장 매체에 저장하도록 요청하는 제 9 단계를 더 포함한다.Here, in the mobile communication terminal according to the present invention, in the multi-language TTS processing method, if the number of the storage media is not more than the preset maximum value (n), the number of storage media that has not been played is less than the preset minimum value (m). Determining an eighth step; If the number of non-reproducing storage media is less than or equal to a preset minimum value (m), a ninth step of requesting the corresponding TTS engine to convert the input text into PCM data and store it in an empty storage medium.
이상에서 설명한 본 발명의 특징에 따르면, 다중언어 TTS 처리하는 이동 단말기의 시스템을, 입력 텍스트에 상응하는 언어에 맞는 TTS 엔진을 구동하여 PCM 데이터를 생성 처리하는 TTS 테스크와, 생성된 PCM 데이터를 재생 처리하는 오디오 테스크로 구분하여 독립적으로 운용한다.According to the characteristics of the present invention described above, the system of the mobile terminal for multi-language TTS processing, the TTS task for generating and processing the PCM data by running the TTS engine for the language corresponding to the input text, and reproduces the generated PCM data It is divided into audio tasks to be processed and operated independently.
따라서, 사용자는 단말기 시스템이 다중 언어를 TTS 처리하는 경우라도 연속적으로 음성정보를 들을 수 있는 이점이 있다.Therefore, the user has the advantage of being able to continuously listen to the voice information even when the terminal system processes the multi-language TTS.
이하, 첨부되어진 도면을 참조하여 본 발명의 이동통신단말기에서 다중 언 어 TTS 처리 절차에 따른 실시 예를 구체적으로 설명한다.Hereinafter, embodiments of the multi-language TTS processing procedure in the mobile communication terminal of the present invention will be described in detail with reference to the accompanying drawings.
도 3은 본 발명에 따른 이동통신단말기에서 다중 언어 TTS 처리 회로의 블록 구성도 이다.3 is a block diagram of a multi-language TTS processing circuit in the mobile communication terminal according to the present invention.
도 3을 참조하면, 다중언어 처리부(100)는 이동통신단말기에 입력되는 소정 언어 텍스트를 수신하고, 입력된 언어 텍스트를 언어의 종류에 따라 구분하여 해당 TTS 엔진에 전달한다.Referring to FIG. 3, the
이러한, 다중언어처리부는 처리하는 언어는 해당 언어의 TTS 처리가 가능한 TTS 엔진과 이에 따른 데이터 베이스가 지원되면 추가할 수 있다.Such a language processing unit may add a language to be processed if a TTS engine capable of processing the TTS of the corresponding language and a database thereof are supported.
TTS 엔진부(110)는 다수개의 TTS 엔진(예를 들어, 영어 TTS 엔진, 일어 TTS 엔진, 중국어 TTS 엔진 등)으로 이루어져, 다중언어 처리부(100)에서 구분된 텍스트를 PCM 데이터로 변환한다.The
여기서, 각각의 TTS 엔진은 (1)초기화 동작, (2)언어 데이터 베이스 셋팅, (3)텍스트 입력 (4) 데이터 변환의 단계를 통하여 입력 텍스트를 PCM 데이터로 변환하는 것이다.Here, each TTS engine converts the input text into PCM data through the steps of (1) initialization operation, (2) language database setting, and (3) text input and (4) data conversion.
메모리부(120)는 각각의 TTS 엔진에 용어를 제공하기 위한 데이터 베이스부와, PCM 데이터를 저장하기 위한 복수개의 버퍼로 이루어져, TTS 엔진부에서 변환된 PCM 데이터를 버퍼의 크기에 맞게 나누어서 일정 시간 단위로 버퍼에 저장한다.The memory unit 120 includes a database unit for providing a term to each TTS engine and a plurality of buffers for storing PCM data. The memory unit 120 divides the PCM data converted by the TTS engine unit according to the size of the buffer for a predetermined time. Store in buffers in units.
오디오 처리부(130)는 TTS 엔진부(110)에서 변환된 PCM 데이터를 아날로그 음성 신호로 변환하여 출력한다. 이러한 오디오 처리부(130)는 일반적으로 소프트웨어 모듈로서 오디오 드라이버와 하드웨어 블럭으로서 오디오 카드를 포함하여 구 성된다.The
또한, 스피커(140)는 오디오 처리부(130)에서 변환된 아날로그 음성 신호를 사용자가 들을 수 있는 음성으로 출력한다.In addition, the
이하에서, 첨부된 도면을 참조하여 본 발명에 따른 이동통신단말기에서 다중 언어 TTS 처리 절차를 설명한다.Hereinafter, a multi-language TTS processing procedure in a mobile communication terminal according to the present invention will be described with reference to the accompanying drawings.
도 4는 본 발명에 따른 이동통신단말기에서 다중 언어 TTS 처리 절차를 설명하기 위한 플로우챠트 이다.4 is a flowchart illustrating a multi-language TTS processing procedure in a mobile communication terminal according to the present invention.
도 4를 참조하면, 다중 텍스트가 이동통신단말기의 다중언어 처리부(100)로 입력되면, 다중언어 처리부(100)는 입력된 텍스트를 언어의 종류에 따라 구분하여 해당 TTS 엔진에 전달한다(S41),Referring to FIG. 4, when the multi-text is input to the
TTS 엔진부(110)는 해당 TTS 엔진을 구동시키고(S42), 입력된 텍스트를 메모리부의 해당 데이터 베이스를 이용하여 PCM 데이터로 변환하여 메모리부(120)의 첫 번째 버퍼부터 차례로 저장한다(S43). TTS 엔진부(110)는 이동통신단말기의 제어부의 제어에 따라 데이터 베이스의 갱신과 삭제 등을 제어하며, 버퍼의 초기화와 저장 순서 등도 제어한다.The
TTS 엔진부(110)는 입력된 텍스트를 수신하여 PCM 데이터로 변환하였는지를 판단한다(S44).The
S44 판단결과, 입력된 텍스트를 모두 PCM 데이터로 변환한 경우, 다른 언어의 텍스트가 입력되었는지를 판단한다(S45).As a result of S44 determination, when all the input text is converted into PCM data, it is determined whether text of another language is input (S45).
S45 판단결과, 입력된 새로운 언어의 텍스트가 있는 경우, TTS 엔진부는 기존 의 구동중인 TTS 엔진을 정지시키고, 새로운 언어의 TTS 엔진을 구동하고, 새로운 언어의 TTS 엔진을 구동하여 새로운 언어의 텍스트를 PCM 데이터로 변환하고(S47), S44단계를 실행한다 .As a result of S45 determination, if there is text of a new language input, the TTS engine unit stops the existing TTS engine, drives the TTS engine of the new language, and drives the TTS engine of the new language to display the text of the new language. Convert to data (S47), and execute step S44.
한편, 오디오 처리부(130)는 메모리부(120)의 첫 번째 버퍼에 PCM 데이터의 저장이 완료되면, 바로 저장된 PCM 데이터를 가지고 와서 재생한다(S50).Meanwhile, when the PCM data is completely stored in the first buffer of the memory unit 120, the
이어, 오디오 처리부(130)는 메모리부(120)의 버퍼에 재생해야할 PCM 데이터가 남아있는지를 판단하여(S51), 재생해야할 PCM 데이터가 남아 있으면 버퍼의 개수가 미리 설정된 최대치(n개) 이상인지를 판단한다(S52).Subsequently, the
오디오 처리부(130)는 재생하지 않은 버퍼의 개수가 미리 설정된 최대치(n개) 이상이면, TTS 엔진부(110)의 동작을 중지할 것을 요청한다(S53).The
S52단계의 판단결과, 재생하지 않은 버퍼의 개수가 미리 설정된 최대치(n개) 이상이 아닌 경우, 오디오 처리부(130)는 재생하지 않은 버퍼의 개수가 미리 설정된 최소치(m개) 이하인지를 판단한다(S54).As a result of the determination in step S52, when the number of unplayed buffers is not more than the preset maximum value (n), the
S54단계의 판단 결과, 재생하지 않은 버퍼의 개수가 미리 설정된 최소치(m개) 이하인 경우, 오디오 처리부(130)는 TTS 엔진부(110)에게 입력된 텍스트를 메모리부(120)의 해당 데이터 베이스를 이용하여 PCM 데이터로 변환하여 메모리부(120)의 비어있는 버퍼부터 다시 차례로 저장하도록 요청한다(S55).As a result of the determination in step S54, when the number of unplayed buffers is equal to or less than the preset minimum value (m), the
반면, S54단계의 판단 결과, 재생하지 않은 버퍼의 개수가 미리 설정된 최소치(m개) 이상인 경우, 오디오 처리부(130)는 S50 단계를 실행하도록 제어한다.On the other hand, when the determination result in step S54, if the number of the buffers that are not reproduced is more than the predetermined minimum value (m), the
또한, S51 단계에서 재생해야할 PCM 데이터가 없는 경우로 판단되면, 오디오 태스크 일시 정지 상태를 유지한다(S56).If it is determined that there is no PCM data to be reproduced in step S51, the audio task pause state is maintained (S56).
이상에서 설명한 바와 같이, 본 발명에 따른 이동통신단말기에서 다중 언어 TTS 처리 방법에 의하면, 다중언어 TTS 처리하는 이동 단말기의 시스템을, 입력 텍스트에 상응하는 언어에 맞는 TTS 엔진을 구동하여 PCM 데이터를 생성 처리하는 TTS 테스크와, 생성된 PCM 데이터를 재생 처리하는 오디오 테스크로 구분하여 독립적으로 운용한다.As described above, according to the multi-language TTS processing method in the mobile communication terminal according to the present invention, the PCM data is generated by driving a system of a mobile terminal that processes the multi-language TTS by driving a TTS engine for a language corresponding to the input text. The TTS task to be processed and the audio task to reproduce the generated PCM data are divided and operated independently.
따라서, TTS 테스크는 입력 텍스트에 따라 상응하는 TTS 엔진을 구동하여 PCM 데이터를 연속적으로 생성하여 별도의 저장매체에 순차적으로 저장하고, 오디오 테스크는 생성된 PCM 데이터를 순차적으로 재생하기 때문에, 다중언어가 포함된 텍스트가 단말기에 입력되더라도 사용자의 입장에서는 대기 시간이나 끊김이 없이 연속적으로 음성정보를 들을 수 있는 효과를 제공한다.Therefore, the TTS task drives the corresponding TTS engine according to the input text to continuously generate the PCM data, and sequentially stores the data on separate storage media. The audio task reproduces the generated PCM data sequentially. Even if the included text is input to the terminal, it provides the effect that the user can continuously hear the voice information without waiting time or interruption.
한편, 본 발명은 상술한 실시예로만 한정되는 것이 아니라 본 발명의 요지를 벗어나지 않는 범위 내에서 수정 및 변형하여 실시할 수 있고, 이러한 수정 및 변경 등은 이하의 특허 청구의 범위에 속하는 것으로 보아야 할 것이다.On the other hand, the present invention is not limited to the above-described embodiment, but can be modified and modified within the scope not departing from the gist of the present invention, such modifications and changes should be regarded as belonging to the following claims. will be.
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| KR1020050135350AKR20070071675A (en) | 2005-12-30 | 2005-12-30 | Multilingual TTS Processing Method in Mobile Communication Terminal |
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| KR1020050135350AKR20070071675A (en) | 2005-12-30 | 2005-12-30 | Multilingual TTS Processing Method in Mobile Communication Terminal |
| Publication Number | Publication Date |
|---|---|
| KR20070071675Atrue KR20070071675A (en) | 2007-07-04 |
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| KR1020050135350AWithdrawnKR20070071675A (en) | 2005-12-30 | 2005-12-30 | Multilingual TTS Processing Method in Mobile Communication Terminal |
| Country | Link |
|---|---|
| KR (1) | KR20070071675A (en) |
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US8892446B2 (en) | 2010-01-18 | 2014-11-18 | Apple Inc. | Service orchestration for intelligent automated assistant |
| US9262612B2 (en) | 2011-03-21 | 2016-02-16 | Apple Inc. | Device access using voice authentication |
| US9300784B2 (en) | 2013-06-13 | 2016-03-29 | Apple Inc. | System and method for emergency calls initiated by voice command |
| US9330720B2 (en) | 2008-01-03 | 2016-05-03 | Apple Inc. | Methods and apparatus for altering audio output signals |
| US9338493B2 (en) | 2014-06-30 | 2016-05-10 | Apple Inc. | Intelligent automated assistant for TV user interactions |
| US9368114B2 (en) | 2013-03-14 | 2016-06-14 | Apple Inc. | Context-sensitive handling of interruptions |
| US9430463B2 (en) | 2014-05-30 | 2016-08-30 | Apple Inc. | Exemplar-based natural language processing |
| US9483461B2 (en) | 2012-03-06 | 2016-11-01 | Apple Inc. | Handling speech synthesis of content for multiple languages |
| US9495129B2 (en) | 2012-06-29 | 2016-11-15 | Apple Inc. | Device, method, and user interface for voice-activated navigation and browsing of a document |
| US9502031B2 (en) | 2014-05-27 | 2016-11-22 | Apple Inc. | Method for supporting dynamic grammars in WFST-based ASR |
| US9535906B2 (en) | 2008-07-31 | 2017-01-03 | Apple Inc. | Mobile device having human language translation capability with positional feedback |
| US9576574B2 (en) | 2012-09-10 | 2017-02-21 | Apple Inc. | Context-sensitive handling of interruptions by intelligent digital assistant |
| US9582608B2 (en) | 2013-06-07 | 2017-02-28 | Apple Inc. | Unified ranking with entropy-weighted information for phrase-based semantic auto-completion |
| US9620104B2 (en) | 2013-06-07 | 2017-04-11 | Apple Inc. | System and method for user-specified pronunciation of words for speech synthesis and recognition |
| US9620105B2 (en) | 2014-05-15 | 2017-04-11 | Apple Inc. | Analyzing audio input for efficient speech and music recognition |
| US9626955B2 (en) | 2008-04-05 | 2017-04-18 | Apple Inc. | Intelligent text-to-speech conversion |
| US9633004B2 (en) | 2014-05-30 | 2017-04-25 | Apple Inc. | Better resolution when referencing to concepts |
| US9633674B2 (en) | 2013-06-07 | 2017-04-25 | Apple Inc. | System and method for detecting errors in interactions with a voice-based digital assistant |
| US9633660B2 (en) | 2010-02-25 | 2017-04-25 | Apple Inc. | User profiling for voice input processing |
| US9646614B2 (en) | 2000-03-16 | 2017-05-09 | Apple Inc. | Fast, language-independent method for user authentication by voice |
| US9646609B2 (en) | 2014-09-30 | 2017-05-09 | Apple Inc. | Caching apparatus for serving phonetic pronunciations |
| US9668121B2 (en) | 2014-09-30 | 2017-05-30 | Apple Inc. | Social reminders |
| US9697820B2 (en) | 2015-09-24 | 2017-07-04 | Apple Inc. | Unit-selection text-to-speech synthesis using concatenation-sensitive neural networks |
| US9697822B1 (en) | 2013-03-15 | 2017-07-04 | Apple Inc. | System and method for updating an adaptive speech recognition model |
| US9711141B2 (en) | 2014-12-09 | 2017-07-18 | Apple Inc. | Disambiguating heteronyms in speech synthesis |
| US9715875B2 (en) | 2014-05-30 | 2017-07-25 | Apple Inc. | Reducing the need for manual start/end-pointing and trigger phrases |
| US9721566B2 (en) | 2015-03-08 | 2017-08-01 | Apple Inc. | Competing devices responding to voice triggers |
| US9734193B2 (en) | 2014-05-30 | 2017-08-15 | Apple Inc. | Determining domain salience ranking from ambiguous words in natural speech |
| US9760559B2 (en) | 2014-05-30 | 2017-09-12 | Apple Inc. | Predictive text input |
| US9785630B2 (en) | 2014-05-30 | 2017-10-10 | Apple Inc. | Text prediction using combined word N-gram and unigram language models |
| US9798393B2 (en) | 2011-08-29 | 2017-10-24 | Apple Inc. | Text correction processing |
| US9818400B2 (en) | 2014-09-11 | 2017-11-14 | Apple Inc. | Method and apparatus for discovering trending terms in speech requests |
| US9842101B2 (en) | 2014-05-30 | 2017-12-12 | Apple Inc. | Predictive conversion of language input |
| US9842105B2 (en) | 2015-04-16 | 2017-12-12 | Apple Inc. | Parsimonious continuous-space phrase representations for natural language processing |
| US9858925B2 (en) | 2009-06-05 | 2018-01-02 | Apple Inc. | Using context information to facilitate processing of commands in a virtual assistant |
| US9865280B2 (en) | 2015-03-06 | 2018-01-09 | Apple Inc. | Structured dictation using intelligent automated assistants |
| US9886432B2 (en) | 2014-09-30 | 2018-02-06 | Apple Inc. | Parsimonious handling of word inflection via categorical stem + suffix N-gram language models |
| US9886953B2 (en) | 2015-03-08 | 2018-02-06 | Apple Inc. | Virtual assistant activation |
| US9899019B2 (en) | 2015-03-18 | 2018-02-20 | Apple Inc. | Systems and methods for structured stem and suffix language models |
| US9922642B2 (en) | 2013-03-15 | 2018-03-20 | Apple Inc. | Training an at least partial voice command system |
| US9934775B2 (en) | 2016-05-26 | 2018-04-03 | Apple Inc. | Unit-selection text-to-speech synthesis based on predicted concatenation parameters |
| US9953088B2 (en) | 2012-05-14 | 2018-04-24 | Apple Inc. | Crowd sourcing information to fulfill user requests |
| US9959870B2 (en) | 2008-12-11 | 2018-05-01 | Apple Inc. | Speech recognition involving a mobile device |
| US9966065B2 (en) | 2014-05-30 | 2018-05-08 | Apple Inc. | Multi-command single utterance input method |
| US9966068B2 (en) | 2013-06-08 | 2018-05-08 | Apple Inc. | Interpreting and acting upon commands that involve sharing information with remote devices |
| US9971774B2 (en) | 2012-09-19 | 2018-05-15 | Apple Inc. | Voice-based media searching |
| US9972304B2 (en) | 2016-06-03 | 2018-05-15 | Apple Inc. | Privacy preserving distributed evaluation framework for embedded personalized systems |
| KR20180079759A (en)* | 2017-01-02 | 2018-07-11 | 삼성전자주식회사 | Method and terminal for recognizing a text |
| US10043516B2 (en) | 2016-09-23 | 2018-08-07 | Apple Inc. | Intelligent automated assistant |
| US10049668B2 (en) | 2015-12-02 | 2018-08-14 | Apple Inc. | Applying neural network language models to weighted finite state transducers for automatic speech recognition |
| US10049663B2 (en) | 2016-06-08 | 2018-08-14 | Apple, Inc. | Intelligent automated assistant for media exploration |
| US10057736B2 (en) | 2011-06-03 | 2018-08-21 | Apple Inc. | Active transport based notifications |
| US10067938B2 (en) | 2016-06-10 | 2018-09-04 | Apple Inc. | Multilingual word prediction |
| US10074360B2 (en) | 2014-09-30 | 2018-09-11 | Apple Inc. | Providing an indication of the suitability of speech recognition |
| US10078631B2 (en) | 2014-05-30 | 2018-09-18 | Apple Inc. | Entropy-guided text prediction using combined word and character n-gram language models |
| US10079014B2 (en) | 2012-06-08 | 2018-09-18 | Apple Inc. | Name recognition system |
| US10083688B2 (en) | 2015-05-27 | 2018-09-25 | Apple Inc. | Device voice control for selecting a displayed affordance |
| US10089072B2 (en) | 2016-06-11 | 2018-10-02 | Apple Inc. | Intelligent device arbitration and control |
| US10101822B2 (en) | 2015-06-05 | 2018-10-16 | Apple Inc. | Language input correction |
| US10127911B2 (en) | 2014-09-30 | 2018-11-13 | Apple Inc. | Speaker identification and unsupervised speaker adaptation techniques |
| US10127220B2 (en) | 2015-06-04 | 2018-11-13 | Apple Inc. | Language identification from short strings |
| US10134385B2 (en) | 2012-03-02 | 2018-11-20 | Apple Inc. | Systems and methods for name pronunciation |
| US10170123B2 (en) | 2014-05-30 | 2019-01-01 | Apple Inc. | Intelligent assistant for home automation |
| US10176167B2 (en) | 2013-06-09 | 2019-01-08 | Apple Inc. | System and method for inferring user intent from speech inputs |
| US10185542B2 (en) | 2013-06-09 | 2019-01-22 | Apple Inc. | Device, method, and graphical user interface for enabling conversation persistence across two or more instances of a digital assistant |
| US10186254B2 (en) | 2015-06-07 | 2019-01-22 | Apple Inc. | Context-based endpoint detection |
| US10192552B2 (en) | 2016-06-10 | 2019-01-29 | Apple Inc. | Digital assistant providing whispered speech |
| US10199051B2 (en) | 2013-02-07 | 2019-02-05 | Apple Inc. | Voice trigger for a digital assistant |
| US10223066B2 (en) | 2015-12-23 | 2019-03-05 | Apple Inc. | Proactive assistance based on dialog communication between devices |
| US10241752B2 (en) | 2011-09-30 | 2019-03-26 | Apple Inc. | Interface for a virtual digital assistant |
| US10241644B2 (en) | 2011-06-03 | 2019-03-26 | Apple Inc. | Actionable reminder entries |
| US10249300B2 (en) | 2016-06-06 | 2019-04-02 | Apple Inc. | Intelligent list reading |
| US10255907B2 (en) | 2015-06-07 | 2019-04-09 | Apple Inc. | Automatic accent detection using acoustic models |
| US10269345B2 (en) | 2016-06-11 | 2019-04-23 | Apple Inc. | Intelligent task discovery |
| US10276170B2 (en) | 2010-01-18 | 2019-04-30 | Apple Inc. | Intelligent automated assistant |
| US10283110B2 (en) | 2009-07-02 | 2019-05-07 | Apple Inc. | Methods and apparatuses for automatic speech recognition |
| US10289433B2 (en) | 2014-05-30 | 2019-05-14 | Apple Inc. | Domain specific language for encoding assistant dialog |
| US10297253B2 (en) | 2016-06-11 | 2019-05-21 | Apple Inc. | Application integration with a digital assistant |
| US10318871B2 (en) | 2005-09-08 | 2019-06-11 | Apple Inc. | Method and apparatus for building an intelligent automated assistant |
| US10332518B2 (en) | 2017-05-09 | 2019-06-25 | Apple Inc. | User interface for correcting recognition errors |
| US10356243B2 (en) | 2015-06-05 | 2019-07-16 | Apple Inc. | Virtual assistant aided communication with 3rd party service in a communication session |
| US10354011B2 (en) | 2016-06-09 | 2019-07-16 | Apple Inc. | Intelligent automated assistant in a home environment |
| US10366158B2 (en) | 2015-09-29 | 2019-07-30 | Apple Inc. | Efficient word encoding for recurrent neural network language models |
| US10410637B2 (en) | 2017-05-12 | 2019-09-10 | Apple Inc. | User-specific acoustic models |
| US10446143B2 (en) | 2016-03-14 | 2019-10-15 | Apple Inc. | Identification of voice inputs providing credentials |
| US10446141B2 (en) | 2014-08-28 | 2019-10-15 | Apple Inc. | Automatic speech recognition based on user feedback |
| US10482874B2 (en) | 2017-05-15 | 2019-11-19 | Apple Inc. | Hierarchical belief states for digital assistants |
| US10490187B2 (en) | 2016-06-10 | 2019-11-26 | Apple Inc. | Digital assistant providing automated status report |
| US10496753B2 (en) | 2010-01-18 | 2019-12-03 | Apple Inc. | Automatically adapting user interfaces for hands-free interaction |
| US10509862B2 (en) | 2016-06-10 | 2019-12-17 | Apple Inc. | Dynamic phrase expansion of language input |
| US10521466B2 (en) | 2016-06-11 | 2019-12-31 | Apple Inc. | Data driven natural language event detection and classification |
| US10553209B2 (en) | 2010-01-18 | 2020-02-04 | Apple Inc. | Systems and methods for hands-free notification summaries |
| US10552013B2 (en) | 2014-12-02 | 2020-02-04 | Apple Inc. | Data detection |
| US10568032B2 (en) | 2007-04-03 | 2020-02-18 | Apple Inc. | Method and system for operating a multi-function portable electronic device using voice-activation |
| US10567477B2 (en) | 2015-03-08 | 2020-02-18 | Apple Inc. | Virtual assistant continuity |
| US10592095B2 (en) | 2014-05-23 | 2020-03-17 | Apple Inc. | Instantaneous speaking of content on touch devices |
| US10593346B2 (en) | 2016-12-22 | 2020-03-17 | Apple Inc. | Rank-reduced token representation for automatic speech recognition |
| US10607141B2 (en) | 2010-01-25 | 2020-03-31 | Newvaluexchange Ltd. | Apparatuses, methods and systems for a digital conversation management platform |
| US10659851B2 (en) | 2014-06-30 | 2020-05-19 | Apple Inc. | Real-time digital assistant knowledge updates |
| US10671428B2 (en) | 2015-09-08 | 2020-06-02 | Apple Inc. | Distributed personal assistant |
| US10679605B2 (en) | 2010-01-18 | 2020-06-09 | Apple Inc. | Hands-free list-reading by intelligent automated assistant |
| US10691473B2 (en) | 2015-11-06 | 2020-06-23 | Apple Inc. | Intelligent automated assistant in a messaging environment |
| US10706373B2 (en) | 2011-06-03 | 2020-07-07 | Apple Inc. | Performing actions associated with task items that represent tasks to perform |
| US10705794B2 (en) | 2010-01-18 | 2020-07-07 | Apple Inc. | Automatically adapting user interfaces for hands-free interaction |
| US10733993B2 (en) | 2016-06-10 | 2020-08-04 | Apple Inc. | Intelligent digital assistant in a multi-tasking environment |
| US10747498B2 (en) | 2015-09-08 | 2020-08-18 | Apple Inc. | Zero latency digital assistant |
| US10755703B2 (en) | 2017-05-11 | 2020-08-25 | Apple Inc. | Offline personal assistant |
| US10762293B2 (en) | 2010-12-22 | 2020-09-01 | Apple Inc. | Using parts-of-speech tagging and named entity recognition for spelling correction |
| US10789945B2 (en) | 2017-05-12 | 2020-09-29 | Apple Inc. | Low-latency intelligent automated assistant |
| US10789041B2 (en) | 2014-09-12 | 2020-09-29 | Apple Inc. | Dynamic thresholds for always listening speech trigger |
| US10791176B2 (en) | 2017-05-12 | 2020-09-29 | Apple Inc. | Synchronization and task delegation of a digital assistant |
| US10791216B2 (en) | 2013-08-06 | 2020-09-29 | Apple Inc. | Auto-activating smart responses based on activities from remote devices |
| US10810274B2 (en) | 2017-05-15 | 2020-10-20 | Apple Inc. | Optimizing dialogue policy decisions for digital assistants using implicit feedback |
| CN112652294A (en)* | 2020-12-25 | 2021-04-13 | 深圳追一科技有限公司 | Speech synthesis method, apparatus, computer device and storage medium |
| US11010550B2 (en) | 2015-09-29 | 2021-05-18 | Apple Inc. | Unified language modeling framework for word prediction, auto-completion and auto-correction |
| US11025565B2 (en) | 2015-06-07 | 2021-06-01 | Apple Inc. | Personalized prediction of responses for instant messaging |
| US11217255B2 (en) | 2017-05-16 | 2022-01-04 | Apple Inc. | Far-field extension for digital assistant services |
| US11281993B2 (en) | 2016-12-05 | 2022-03-22 | Apple Inc. | Model and ensemble compression for metric learning |
| US11587559B2 (en) | 2015-09-30 | 2023-02-21 | Apple Inc. | Intelligent device identification |
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US9646614B2 (en) | 2000-03-16 | 2017-05-09 | Apple Inc. | Fast, language-independent method for user authentication by voice |
| US10318871B2 (en) | 2005-09-08 | 2019-06-11 | Apple Inc. | Method and apparatus for building an intelligent automated assistant |
| US9117447B2 (en) | 2006-09-08 | 2015-08-25 | Apple Inc. | Using event alert text as input to an automated assistant |
| US8942986B2 (en) | 2006-09-08 | 2015-01-27 | Apple Inc. | Determining user intent based on ontologies of domains |
| US8930191B2 (en) | 2006-09-08 | 2015-01-06 | Apple Inc. | Paraphrasing of user requests and results by automated digital assistant |
| US10568032B2 (en) | 2007-04-03 | 2020-02-18 | Apple Inc. | Method and system for operating a multi-function portable electronic device using voice-activation |
| US10381016B2 (en) | 2008-01-03 | 2019-08-13 | Apple Inc. | Methods and apparatus for altering audio output signals |
| US9330720B2 (en) | 2008-01-03 | 2016-05-03 | Apple Inc. | Methods and apparatus for altering audio output signals |
| US9626955B2 (en) | 2008-04-05 | 2017-04-18 | Apple Inc. | Intelligent text-to-speech conversion |
| US9865248B2 (en) | 2008-04-05 | 2018-01-09 | Apple Inc. | Intelligent text-to-speech conversion |
| US10108612B2 (en) | 2008-07-31 | 2018-10-23 | Apple Inc. | Mobile device having human language translation capability with positional feedback |
| US9535906B2 (en) | 2008-07-31 | 2017-01-03 | Apple Inc. | Mobile device having human language translation capability with positional feedback |
| US9959870B2 (en) | 2008-12-11 | 2018-05-01 | Apple Inc. | Speech recognition involving a mobile device |
| US11080012B2 (en) | 2009-06-05 | 2021-08-03 | Apple Inc. | Interface for a virtual digital assistant |
| US10795541B2 (en) | 2009-06-05 | 2020-10-06 | Apple Inc. | Intelligent organization of tasks items |
| US10475446B2 (en) | 2009-06-05 | 2019-11-12 | Apple Inc. | Using context information to facilitate processing of commands in a virtual assistant |
| US9858925B2 (en) | 2009-06-05 | 2018-01-02 | Apple Inc. | Using context information to facilitate processing of commands in a virtual assistant |
| US10283110B2 (en) | 2009-07-02 | 2019-05-07 | Apple Inc. | Methods and apparatuses for automatic speech recognition |
| US10553209B2 (en) | 2010-01-18 | 2020-02-04 | Apple Inc. | Systems and methods for hands-free notification summaries |
| US11423886B2 (en) | 2010-01-18 | 2022-08-23 | Apple Inc. | Task flow identification based on user intent |
| US9318108B2 (en) | 2010-01-18 | 2016-04-19 | Apple Inc. | Intelligent automated assistant |
| US8892446B2 (en) | 2010-01-18 | 2014-11-18 | Apple Inc. | Service orchestration for intelligent automated assistant |
| US10496753B2 (en) | 2010-01-18 | 2019-12-03 | Apple Inc. | Automatically adapting user interfaces for hands-free interaction |
| US10705794B2 (en) | 2010-01-18 | 2020-07-07 | Apple Inc. | Automatically adapting user interfaces for hands-free interaction |
| US10276170B2 (en) | 2010-01-18 | 2019-04-30 | Apple Inc. | Intelligent automated assistant |
| US10679605B2 (en) | 2010-01-18 | 2020-06-09 | Apple Inc. | Hands-free list-reading by intelligent automated assistant |
| US10706841B2 (en) | 2010-01-18 | 2020-07-07 | Apple Inc. | Task flow identification based on user intent |
| US9548050B2 (en) | 2010-01-18 | 2017-01-17 | Apple Inc. | Intelligent automated assistant |
| US12087308B2 (en) | 2010-01-18 | 2024-09-10 | Apple Inc. | Intelligent automated assistant |
| US8903716B2 (en) | 2010-01-18 | 2014-12-02 | Apple Inc. | Personalized vocabulary for digital assistant |
| US10607140B2 (en) | 2010-01-25 | 2020-03-31 | Newvaluexchange Ltd. | Apparatuses, methods and systems for a digital conversation management platform |
| US12307383B2 (en) | 2010-01-25 | 2025-05-20 | Newvaluexchange Global Ai Llp | Apparatuses, methods and systems for a digital conversation management platform |
| US10607141B2 (en) | 2010-01-25 | 2020-03-31 | Newvaluexchange Ltd. | Apparatuses, methods and systems for a digital conversation management platform |
| US10984327B2 (en) | 2010-01-25 | 2021-04-20 | New Valuexchange Ltd. | Apparatuses, methods and systems for a digital conversation management platform |
| US11410053B2 (en) | 2010-01-25 | 2022-08-09 | Newvaluexchange Ltd. | Apparatuses, methods and systems for a digital conversation management platform |
| US10984326B2 (en) | 2010-01-25 | 2021-04-20 | Newvaluexchange Ltd. | Apparatuses, methods and systems for a digital conversation management platform |
| US10049675B2 (en) | 2010-02-25 | 2018-08-14 | Apple Inc. | User profiling for voice input processing |
| US9633660B2 (en) | 2010-02-25 | 2017-04-25 | Apple Inc. | User profiling for voice input processing |
| US10762293B2 (en) | 2010-12-22 | 2020-09-01 | Apple Inc. | Using parts-of-speech tagging and named entity recognition for spelling correction |
| US9262612B2 (en) | 2011-03-21 | 2016-02-16 | Apple Inc. | Device access using voice authentication |
| US10102359B2 (en) | 2011-03-21 | 2018-10-16 | Apple Inc. | Device access using voice authentication |
| US10241644B2 (en) | 2011-06-03 | 2019-03-26 | Apple Inc. | Actionable reminder entries |
| US10706373B2 (en) | 2011-06-03 | 2020-07-07 | Apple Inc. | Performing actions associated with task items that represent tasks to perform |
| US10057736B2 (en) | 2011-06-03 | 2018-08-21 | Apple Inc. | Active transport based notifications |
| US11120372B2 (en) | 2011-06-03 | 2021-09-14 | Apple Inc. | Performing actions associated with task items that represent tasks to perform |
| US9798393B2 (en) | 2011-08-29 | 2017-10-24 | Apple Inc. | Text correction processing |
| US10241752B2 (en) | 2011-09-30 | 2019-03-26 | Apple Inc. | Interface for a virtual digital assistant |
| US10134385B2 (en) | 2012-03-02 | 2018-11-20 | Apple Inc. | Systems and methods for name pronunciation |
| US9483461B2 (en) | 2012-03-06 | 2016-11-01 | Apple Inc. | Handling speech synthesis of content for multiple languages |
| US9953088B2 (en) | 2012-05-14 | 2018-04-24 | Apple Inc. | Crowd sourcing information to fulfill user requests |
| US10079014B2 (en) | 2012-06-08 | 2018-09-18 | Apple Inc. | Name recognition system |
| US9495129B2 (en) | 2012-06-29 | 2016-11-15 | Apple Inc. | Device, method, and user interface for voice-activated navigation and browsing of a document |
| US9576574B2 (en) | 2012-09-10 | 2017-02-21 | Apple Inc. | Context-sensitive handling of interruptions by intelligent digital assistant |
| US9971774B2 (en) | 2012-09-19 | 2018-05-15 | Apple Inc. | Voice-based media searching |
| US10978090B2 (en) | 2013-02-07 | 2021-04-13 | Apple Inc. | Voice trigger for a digital assistant |
| US10199051B2 (en) | 2013-02-07 | 2019-02-05 | Apple Inc. | Voice trigger for a digital assistant |
| US9368114B2 (en) | 2013-03-14 | 2016-06-14 | Apple Inc. | Context-sensitive handling of interruptions |
| US9697822B1 (en) | 2013-03-15 | 2017-07-04 | Apple Inc. | System and method for updating an adaptive speech recognition model |
| US9922642B2 (en) | 2013-03-15 | 2018-03-20 | Apple Inc. | Training an at least partial voice command system |
| US9620104B2 (en) | 2013-06-07 | 2017-04-11 | Apple Inc. | System and method for user-specified pronunciation of words for speech synthesis and recognition |
| US9633674B2 (en) | 2013-06-07 | 2017-04-25 | Apple Inc. | System and method for detecting errors in interactions with a voice-based digital assistant |
| US9966060B2 (en) | 2013-06-07 | 2018-05-08 | Apple Inc. | System and method for user-specified pronunciation of words for speech synthesis and recognition |
| US9582608B2 (en) | 2013-06-07 | 2017-02-28 | Apple Inc. | Unified ranking with entropy-weighted information for phrase-based semantic auto-completion |
| US9966068B2 (en) | 2013-06-08 | 2018-05-08 | Apple Inc. | Interpreting and acting upon commands that involve sharing information with remote devices |
| US10657961B2 (en) | 2013-06-08 | 2020-05-19 | Apple Inc. | Interpreting and acting upon commands that involve sharing information with remote devices |
| US10185542B2 (en) | 2013-06-09 | 2019-01-22 | Apple Inc. | Device, method, and graphical user interface for enabling conversation persistence across two or more instances of a digital assistant |
| US10176167B2 (en) | 2013-06-09 | 2019-01-08 | Apple Inc. | System and method for inferring user intent from speech inputs |
| US9300784B2 (en) | 2013-06-13 | 2016-03-29 | Apple Inc. | System and method for emergency calls initiated by voice command |
| US10791216B2 (en) | 2013-08-06 | 2020-09-29 | Apple Inc. | Auto-activating smart responses based on activities from remote devices |
| US9620105B2 (en) | 2014-05-15 | 2017-04-11 | Apple Inc. | Analyzing audio input for efficient speech and music recognition |
| US10592095B2 (en) | 2014-05-23 | 2020-03-17 | Apple Inc. | Instantaneous speaking of content on touch devices |
| US9502031B2 (en) | 2014-05-27 | 2016-11-22 | Apple Inc. | Method for supporting dynamic grammars in WFST-based ASR |
| US10170123B2 (en) | 2014-05-30 | 2019-01-01 | Apple Inc. | Intelligent assistant for home automation |
| US9842101B2 (en) | 2014-05-30 | 2017-12-12 | Apple Inc. | Predictive conversion of language input |
| US9715875B2 (en) | 2014-05-30 | 2017-07-25 | Apple Inc. | Reducing the need for manual start/end-pointing and trigger phrases |
| US11257504B2 (en) | 2014-05-30 | 2022-02-22 | Apple Inc. | Intelligent assistant for home automation |
| US10169329B2 (en) | 2014-05-30 | 2019-01-01 | Apple Inc. | Exemplar-based natural language processing |
| US10289433B2 (en) | 2014-05-30 | 2019-05-14 | Apple Inc. | Domain specific language for encoding assistant dialog |
| US10083690B2 (en) | 2014-05-30 | 2018-09-25 | Apple Inc. | Better resolution when referencing to concepts |
| US10078631B2 (en) | 2014-05-30 | 2018-09-18 | Apple Inc. | Entropy-guided text prediction using combined word and character n-gram language models |
| US10497365B2 (en) | 2014-05-30 | 2019-12-03 | Apple Inc. | Multi-command single utterance input method |
| US9633004B2 (en) | 2014-05-30 | 2017-04-25 | Apple Inc. | Better resolution when referencing to concepts |
| US9734193B2 (en) | 2014-05-30 | 2017-08-15 | Apple Inc. | Determining domain salience ranking from ambiguous words in natural speech |
| US9760559B2 (en) | 2014-05-30 | 2017-09-12 | Apple Inc. | Predictive text input |
| US9966065B2 (en) | 2014-05-30 | 2018-05-08 | Apple Inc. | Multi-command single utterance input method |
| US9430463B2 (en) | 2014-05-30 | 2016-08-30 | Apple Inc. | Exemplar-based natural language processing |
| US11133008B2 (en) | 2014-05-30 | 2021-09-28 | Apple Inc. | Reducing the need for manual start/end-pointing and trigger phrases |
| US9785630B2 (en) | 2014-05-30 | 2017-10-10 | Apple Inc. | Text prediction using combined word N-gram and unigram language models |
| US10904611B2 (en) | 2014-06-30 | 2021-01-26 | Apple Inc. | Intelligent automated assistant for TV user interactions |
| US9668024B2 (en) | 2014-06-30 | 2017-05-30 | Apple Inc. | Intelligent automated assistant for TV user interactions |
| US9338493B2 (en) | 2014-06-30 | 2016-05-10 | Apple Inc. | Intelligent automated assistant for TV user interactions |
| US10659851B2 (en) | 2014-06-30 | 2020-05-19 | Apple Inc. | Real-time digital assistant knowledge updates |
| US10446141B2 (en) | 2014-08-28 | 2019-10-15 | Apple Inc. | Automatic speech recognition based on user feedback |
| US9818400B2 (en) | 2014-09-11 | 2017-11-14 | Apple Inc. | Method and apparatus for discovering trending terms in speech requests |
| US10431204B2 (en) | 2014-09-11 | 2019-10-01 | Apple Inc. | Method and apparatus for discovering trending terms in speech requests |
| US10789041B2 (en) | 2014-09-12 | 2020-09-29 | Apple Inc. | Dynamic thresholds for always listening speech trigger |
| US10127911B2 (en) | 2014-09-30 | 2018-11-13 | Apple Inc. | Speaker identification and unsupervised speaker adaptation techniques |
| US9646609B2 (en) | 2014-09-30 | 2017-05-09 | Apple Inc. | Caching apparatus for serving phonetic pronunciations |
| US10074360B2 (en) | 2014-09-30 | 2018-09-11 | Apple Inc. | Providing an indication of the suitability of speech recognition |
| US9886432B2 (en) | 2014-09-30 | 2018-02-06 | Apple Inc. | Parsimonious handling of word inflection via categorical stem + suffix N-gram language models |
| US9986419B2 (en) | 2014-09-30 | 2018-05-29 | Apple Inc. | Social reminders |
| US9668121B2 (en) | 2014-09-30 | 2017-05-30 | Apple Inc. | Social reminders |
| US11556230B2 (en) | 2014-12-02 | 2023-01-17 | Apple Inc. | Data detection |
| US10552013B2 (en) | 2014-12-02 | 2020-02-04 | Apple Inc. | Data detection |
| US9711141B2 (en) | 2014-12-09 | 2017-07-18 | Apple Inc. | Disambiguating heteronyms in speech synthesis |
| US9865280B2 (en) | 2015-03-06 | 2018-01-09 | Apple Inc. | Structured dictation using intelligent automated assistants |
| US10311871B2 (en) | 2015-03-08 | 2019-06-04 | Apple Inc. | Competing devices responding to voice triggers |
| US10567477B2 (en) | 2015-03-08 | 2020-02-18 | Apple Inc. | Virtual assistant continuity |
| US9721566B2 (en) | 2015-03-08 | 2017-08-01 | Apple Inc. | Competing devices responding to voice triggers |
| US9886953B2 (en) | 2015-03-08 | 2018-02-06 | Apple Inc. | Virtual assistant activation |
| US11087759B2 (en) | 2015-03-08 | 2021-08-10 | Apple Inc. | Virtual assistant activation |
| US9899019B2 (en) | 2015-03-18 | 2018-02-20 | Apple Inc. | Systems and methods for structured stem and suffix language models |
| US9842105B2 (en) | 2015-04-16 | 2017-12-12 | Apple Inc. | Parsimonious continuous-space phrase representations for natural language processing |
| US10083688B2 (en) | 2015-05-27 | 2018-09-25 | Apple Inc. | Device voice control for selecting a displayed affordance |
| US10127220B2 (en) | 2015-06-04 | 2018-11-13 | Apple Inc. | Language identification from short strings |
| US10101822B2 (en) | 2015-06-05 | 2018-10-16 | Apple Inc. | Language input correction |
| US10356243B2 (en) | 2015-06-05 | 2019-07-16 | Apple Inc. | Virtual assistant aided communication with 3rd party service in a communication session |
| US10186254B2 (en) | 2015-06-07 | 2019-01-22 | Apple Inc. | Context-based endpoint detection |
| US11025565B2 (en) | 2015-06-07 | 2021-06-01 | Apple Inc. | Personalized prediction of responses for instant messaging |
| US10255907B2 (en) | 2015-06-07 | 2019-04-09 | Apple Inc. | Automatic accent detection using acoustic models |
| US10671428B2 (en) | 2015-09-08 | 2020-06-02 | Apple Inc. | Distributed personal assistant |
| US10747498B2 (en) | 2015-09-08 | 2020-08-18 | Apple Inc. | Zero latency digital assistant |
| US11500672B2 (en) | 2015-09-08 | 2022-11-15 | Apple Inc. | Distributed personal assistant |
| US9697820B2 (en) | 2015-09-24 | 2017-07-04 | Apple Inc. | Unit-selection text-to-speech synthesis using concatenation-sensitive neural networks |
| US11010550B2 (en) | 2015-09-29 | 2021-05-18 | Apple Inc. | Unified language modeling framework for word prediction, auto-completion and auto-correction |
| US10366158B2 (en) | 2015-09-29 | 2019-07-30 | Apple Inc. | Efficient word encoding for recurrent neural network language models |
| US11587559B2 (en) | 2015-09-30 | 2023-02-21 | Apple Inc. | Intelligent device identification |
| US10691473B2 (en) | 2015-11-06 | 2020-06-23 | Apple Inc. | Intelligent automated assistant in a messaging environment |
| US11526368B2 (en) | 2015-11-06 | 2022-12-13 | Apple Inc. | Intelligent automated assistant in a messaging environment |
| US10049668B2 (en) | 2015-12-02 | 2018-08-14 | Apple Inc. | Applying neural network language models to weighted finite state transducers for automatic speech recognition |
| US10223066B2 (en) | 2015-12-23 | 2019-03-05 | Apple Inc. | Proactive assistance based on dialog communication between devices |
| US10446143B2 (en) | 2016-03-14 | 2019-10-15 | Apple Inc. | Identification of voice inputs providing credentials |
| US9934775B2 (en) | 2016-05-26 | 2018-04-03 | Apple Inc. | Unit-selection text-to-speech synthesis based on predicted concatenation parameters |
| US9972304B2 (en) | 2016-06-03 | 2018-05-15 | Apple Inc. | Privacy preserving distributed evaluation framework for embedded personalized systems |
| US10249300B2 (en) | 2016-06-06 | 2019-04-02 | Apple Inc. | Intelligent list reading |
| US11069347B2 (en) | 2016-06-08 | 2021-07-20 | Apple Inc. | Intelligent automated assistant for media exploration |
| US10049663B2 (en) | 2016-06-08 | 2018-08-14 | Apple, Inc. | Intelligent automated assistant for media exploration |
| US10354011B2 (en) | 2016-06-09 | 2019-07-16 | Apple Inc. | Intelligent automated assistant in a home environment |
| US10192552B2 (en) | 2016-06-10 | 2019-01-29 | Apple Inc. | Digital assistant providing whispered speech |
| US10509862B2 (en) | 2016-06-10 | 2019-12-17 | Apple Inc. | Dynamic phrase expansion of language input |
| US11037565B2 (en) | 2016-06-10 | 2021-06-15 | Apple Inc. | Intelligent digital assistant in a multi-tasking environment |
| US10733993B2 (en) | 2016-06-10 | 2020-08-04 | Apple Inc. | Intelligent digital assistant in a multi-tasking environment |
| US10067938B2 (en) | 2016-06-10 | 2018-09-04 | Apple Inc. | Multilingual word prediction |
| US10490187B2 (en) | 2016-06-10 | 2019-11-26 | Apple Inc. | Digital assistant providing automated status report |
| US10297253B2 (en) | 2016-06-11 | 2019-05-21 | Apple Inc. | Application integration with a digital assistant |
| US10269345B2 (en) | 2016-06-11 | 2019-04-23 | Apple Inc. | Intelligent task discovery |
| US11152002B2 (en) | 2016-06-11 | 2021-10-19 | Apple Inc. | Application integration with a digital assistant |
| US10521466B2 (en) | 2016-06-11 | 2019-12-31 | Apple Inc. | Data driven natural language event detection and classification |
| US10089072B2 (en) | 2016-06-11 | 2018-10-02 | Apple Inc. | Intelligent device arbitration and control |
| US10553215B2 (en) | 2016-09-23 | 2020-02-04 | Apple Inc. | Intelligent automated assistant |
| US10043516B2 (en) | 2016-09-23 | 2018-08-07 | Apple Inc. | Intelligent automated assistant |
| US11281993B2 (en) | 2016-12-05 | 2022-03-22 | Apple Inc. | Model and ensemble compression for metric learning |
| US10593346B2 (en) | 2016-12-22 | 2020-03-17 | Apple Inc. | Rank-reduced token representation for automatic speech recognition |
| KR20180079759A (en)* | 2017-01-02 | 2018-07-11 | 삼성전자주식회사 | Method and terminal for recognizing a text |
| US10332518B2 (en) | 2017-05-09 | 2019-06-25 | Apple Inc. | User interface for correcting recognition errors |
| US10755703B2 (en) | 2017-05-11 | 2020-08-25 | Apple Inc. | Offline personal assistant |
| US10789945B2 (en) | 2017-05-12 | 2020-09-29 | Apple Inc. | Low-latency intelligent automated assistant |
| US11405466B2 (en) | 2017-05-12 | 2022-08-02 | Apple Inc. | Synchronization and task delegation of a digital assistant |
| US10791176B2 (en) | 2017-05-12 | 2020-09-29 | Apple Inc. | Synchronization and task delegation of a digital assistant |
| US10410637B2 (en) | 2017-05-12 | 2019-09-10 | Apple Inc. | User-specific acoustic models |
| US10810274B2 (en) | 2017-05-15 | 2020-10-20 | Apple Inc. | Optimizing dialogue policy decisions for digital assistants using implicit feedback |
| US10482874B2 (en) | 2017-05-15 | 2019-11-19 | Apple Inc. | Hierarchical belief states for digital assistants |
| US11217255B2 (en) | 2017-05-16 | 2022-01-04 | Apple Inc. | Far-field extension for digital assistant services |
| CN112652294B (en)* | 2020-12-25 | 2023-10-24 | 深圳追一科技有限公司 | Speech synthesis method, device, computer equipment and storage medium |
| CN112652294A (en)* | 2020-12-25 | 2021-04-13 | 深圳追一科技有限公司 | Speech synthesis method, apparatus, computer device and storage medium |
| Publication | Publication Date | Title |
|---|---|---|
| KR20070071675A (en) | Multilingual TTS Processing Method in Mobile Communication Terminal | |
| JP5320064B2 (en) | Voice-controlled wireless communication device / system | |
| US20030200858A1 (en) | Mixing MP3 audio and T T P for enhanced E-book application | |
| US8606560B2 (en) | Automatic simultaneous interpertation system | |
| US8340797B2 (en) | Method and system for generating and processing digital content based on text-to-speech conversion | |
| JP7230145B2 (en) | Context Denormalization for Automatic Speech Recognition | |
| JPH0830287A (en) | Text-speech converting system | |
| JPH08339198A (en) | Presentation device | |
| US20140019132A1 (en) | Information processing apparatus, information processing method, display control apparatus, and display control method | |
| CN100514384C (en) | Talking e-book | |
| JP2006119534A (en) | Computer system, method for supporting correction work, and program | |
| US20080243510A1 (en) | Overlapping screen reading of non-sequential text | |
| CN100487788C (en) | Method for realizing text-to-speech function | |
| US11250837B2 (en) | Speech synthesis system, method and non-transitory computer readable medium with language option selection and acoustic models | |
| KR20030079497A (en) | service method of language study | |
| KR102479023B1 (en) | Apparatus, method and program for providing foreign language learning service | |
| JP2016012315A (en) | Spaced-wording unit dividing program for text data, and electronic book reproduction device | |
| JP2000148176A (en) | Information processing device and method, serving medium, speech recognision system, speech synthesizing system, translation device and method, and translation system | |
| JP2005326811A (en) | Speech synthesis apparatus and speech synthesis method | |
| JP2000293187A (en) | Device and method for synthesizing data voice | |
| Shabtay et al. | Spoken question answering for visual queries | |
| CN120068887A (en) | Translation method, video conference method and electronic equipment | |
| WO2023132140A1 (en) | Program, file generation method, information processing device, and information processing system | |
| JP2006047866A (en) | Electronic dictionary device and control method thereof | |
| KR20110054218A (en) | Audio playback device and history method including history storage module |
| Date | Code | Title | Description |
|---|---|---|---|
| PA0109 | Patent application | Patent event code:PA01091R01D Comment text:Patent Application Patent event date:20051230 | |
| PG1501 | Laying open of application | ||
| PC1203 | Withdrawal of no request for examination | ||
| WITN | Application deemed withdrawn, e.g. because no request for examination was filed or no examination fee was paid |