KR20070071675A - Multilingual TTS Processing Method in Mobile Communication Terminal - Google Patents

Multilingual TTS Processing Method in Mobile Communication Terminal

Info

Publication number
KR20070071675A
Authority
KR
South Korea
Prior art keywords
language
pcm data
text
tts
mobile communication
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
KR1020050135350A
Other languages
Korean (ko)
Inventor
양진우
Original Assignee
주식회사 팬택
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 주식회사 팬택
Priority to KR1020050135350A
Publication of KR20070071675A
Legal status: Withdrawn (current)

Abstract

Translated from Korean

The present invention relates to a multi-language TTS processing method in a mobile communication terminal that minimizes the time a user waits for the voice output corresponding to a changed language when the terminal performs multi-language TTS processing. The method comprises: classifying input text by language and passing it to the corresponding TTS engine provided in the mobile communication terminal; driving that TTS engine, converting the input text into PCM data, and storing the PCM data sequentially, starting from the first of a plurality of storage media provided in the terminal; determining, once conversion of the input text into PCM data is complete, whether text in another language has been input; and, if text in a new language has been input, stopping the currently running TTS engine and driving the TTS engine for the new language to convert the new-language text into PCM data. Once the PCM data has been stored in the first medium, the PCM data stored sequentially in the plurality of media is played back in order.

Description

Translated from Korean
Multi-Language TTS Processing Method in a Mobile Communication Terminal {METHOD FOR PERFORMING MULTIPLE LANGUAGE TTS PROCESS IN MOBILE TERMINAL}

FIG. 1 is a block diagram of a general single-language TTS processing apparatus.

FIG. 2 is a flowchart showing a conventional multi-language TTS processing procedure.

FIG. 3 is a block diagram of a multi-language TTS processing circuit in a mobile communication terminal according to the present invention.

FIG. 4 is a flowchart illustrating a multi-language TTS processing procedure in a mobile communication terminal according to the present invention.

** Description of reference numerals for the main parts of the drawings **

100: multi-language processing unit

110: TTS engine unit

120: memory unit

130: audio processing unit

140: speaker

The present invention relates to a multi-language TTS processing method in a mobile communication terminal and, more particularly, to a method that minimizes the delay incurred when the language of the text changes during multi-language TTS processing, so that voice information can be provided continuously.

FIG. 1 is a block diagram of a general single-language TTS processing apparatus. Referring to FIG. 1, a sentence input in a given language is converted into audio wave data by the TTS engine (11) provided in the terminal. The audio wave data is then converted into an analog voice signal by the audio processing unit (12), and the analog voice signal is output as sound through the speaker (120).

The general single-language TTS processing apparatus described above can generate appropriate speech for a sentence written in only one language (for example, Korean, English, or Japanese), but cannot generate appropriate speech for a sentence in which several languages are mixed, that is, a multi-language sentence.

To remedy this shortcoming, it has been proposed to add to the existing TTS processing apparatus a multi-language processing unit that handles TTS processing of multiple languages, together with a plurality of TTS engines.

FIG. 2 is a flowchart showing a multi-language TTS processing procedure in a conventional mobile communication terminal.

Referring to FIG. 2, assume the mobile communication terminal is provided with TTS engines capable of processing three languages, A, B, and C, each with its corresponding database. When language A text is input (S21), the control unit of the terminal drives the A TTS engine (S22) and generates PCM data from the input language A text (S23). The control unit then plays back the generated PCM data (S24). When generation and playback of the PCM data are both complete (S25), the control unit determines whether language B text is input (S26). If language B text is input, the control unit drives the B TTS engine (S27) and performs steps S23 through S25 in sequence; if not, it determines whether language C text is input (S28).

If step S28 determines that language C text is input, the control unit drives the C TTS engine (S29) and performs steps S23 through S25 in sequence; if not, it determines again whether language A text is input (S30) and carries out the multi-language TTS processing described above.

However, in this conventional multi-language TTS processing procedure, every time the language of the text changes, the TTS engine is swapped for the one matching the new language before TTS processing continues, so the user has to wait while the engine is swapped. That is, when the language of the text changes, the currently running TTS engine must be stopped and the TTS engine for the new language must be driven to generate and play back the PCM data for the new text, and this interval can feel tedious to the user.

The present invention has been devised in view of the conventional art described above. An object of the present invention is to provide a multi-language TTS processing method in a mobile communication terminal that minimizes the time a user waits for the voice information corresponding to a changed language during multi-language TTS processing.

Another object of the present invention is to provide a multi-language TTS processing method in a mobile communication terminal in which, during multi-language TTS processing, the terminal system is divided into a TTS task, which drives the TTS engine matching the language of the input text to generate PCM data, and an audio task, which plays back the generated PCM data, and the two tasks are operated independently.

To achieve the above objects, a multi-language TTS processing method in a mobile communication terminal according to the present invention comprises: a first step of classifying input text by language and passing it to the corresponding TTS engine provided in the mobile communication terminal; a second step of driving that TTS engine, converting the input text into PCM data, and storing the PCM data sequentially, starting from the first of a plurality of storage media provided in the terminal; a third step of determining, once conversion of the input text into PCM data is complete, whether text in another language has been input; and a fourth step of, if the determination shows that text in a new language has been input, stopping the currently running TTS engine and driving the TTS engine for the new language to convert the new-language text into PCM data. Once the PCM data has been stored in the first medium, the PCM data stored sequentially in the plurality of media is played back in order.

The method further comprises: a fifth step of determining whether PCM data to be played back remains in the plurality of storage media; a sixth step of determining, if such PCM data remains, whether the number of storage media holding PCM data is equal to or greater than a preset maximum (n); and a seventh step of requesting that the TTS engine stop operating if that number is equal to or greater than the preset maximum (n).

The method further comprises: an eighth step of determining, if the number of storage media is not equal to or greater than the preset maximum (n), whether the number of unplayed storage media is equal to or less than a preset minimum (m); and a ninth step of requesting, if the number of unplayed storage media is equal to or less than the preset minimum (m), that the corresponding TTS engine convert the input text into PCM data and store it in the empty storage media.

According to the features of the present invention described above, the system of a mobile terminal that performs multi-language TTS processing is divided into a TTS task, which drives the TTS engine matching the language of the input text to generate PCM data, and an audio task, which plays back the generated PCM data, and the two tasks operate independently.

Accordingly, the user can listen to voice information continuously even when the terminal system performs TTS processing on multiple languages.

Hereinafter, an embodiment of the multi-language TTS processing procedure in a mobile communication terminal according to the present invention is described in detail with reference to the accompanying drawings.

FIG. 3 is a block diagram of a multi-language TTS processing circuit in a mobile communication terminal according to the present invention.

Referring to FIG. 3, the multi-language processing unit (100) receives text input to the mobile communication terminal, classifies it by language, and passes each portion to the corresponding TTS engine.

A language can be added to those handled by the multi-language processing unit provided that a TTS engine capable of TTS processing for that language, and its associated database, are supported.
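The patent does not say how the multi-language processing unit tells languages apart. One simple possibility, shown here as a sketch under stated assumptions, is to look at the Unicode script of each character and group consecutive characters of the same script into runs. The function names `script_of` and `split_by_language` and the coarse script-to-language mapping are hypothetical and are not taken from the patent.

```python
import unicodedata
from typing import List, Optional, Tuple


def script_of(ch: str) -> Optional[str]:
    """Return a coarse language tag for one character, or None if it is neutral."""
    if not ch.isalpha():
        return None  # digits, spaces and punctuation stay with the current run
    name = unicodedata.name(ch, "")
    if name.startswith("HANGUL"):
        return "ko"
    if name.startswith(("HIRAGANA", "KATAKANA")):
        return "ja"
    if name.startswith("CJK"):
        return "zh"  # assumption: bare ideographs are treated as Chinese
    if name.startswith("LATIN"):
        return "en"  # assumption: Latin letters are treated as English
    return None


def split_by_language(text: str) -> List[Tuple[str, str]]:
    """Group consecutive characters into (language, text) runs."""
    runs: List[Tuple[str, str]] = []
    current_lang: Optional[str] = None
    current: List[str] = []
    for ch in text:
        lang = script_of(ch)
        # A character of a different script closes the run collected so far.
        if lang is not None and current_lang is not None and lang != current_lang:
            runs.append((current_lang, "".join(current)))
            current = []
        if lang is not None:
            current_lang = lang
        current.append(ch)
    if current:
        # Text with no letters at all falls back to "en" (an arbitrary default).
        runs.append((current_lang or "en", "".join(current)))
    return runs
```

Each run can then be handed to the engine registered for its language tag. Script detection alone cannot separate languages that share a script, so a real terminal would need a stronger classifier.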

The TTS engine unit (110) comprises a plurality of TTS engines (for example, an English TTS engine, a Japanese TTS engine, and a Chinese TTS engine) and converts the text classified by the multi-language processing unit (100) into PCM data.

Each TTS engine converts input text into PCM data through four steps: (1) initialization, (2) language database setup, (3) text input, and (4) data conversion.
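The four engine steps can be pictured as an ordered lifecycle. The sketch below is a toy stand-in, not a real synthesizer: the class name `ToyTtsEngine`, its method names, and the audio parameters are all assumptions made for illustration, and the "PCM data" it returns is silence of a plausible length.

```python
class ToyTtsEngine:
    """Hypothetical engine that enforces the four-step order; it emits silence, not speech."""

    BYTES_PER_SAMPLE = 2     # assumed: 16-bit mono PCM
    SAMPLES_PER_CHAR = 160   # assumed: 20 ms per character at 8 kHz

    def __init__(self, language):
        self.language = language
        self._phase = 0      # 0 = created, 1 = initialized, 2 = database set, 3 = text loaded
        self._database = None
        self._text = ""

    def initialize(self):                        # step (1)
        self._phase = 1

    def set_language_database(self, database):   # step (2)
        if self._phase < 1:
            raise RuntimeError("initialize() must be called first")
        self._database = database
        self._phase = 2

    def input_text(self, text):                  # step (3)
        if self._phase < 2:
            raise RuntimeError("set_language_database() must be called first")
        self._text = text
        self._phase = 3

    def convert(self):                           # step (4)
        if self._phase < 3:
            raise RuntimeError("input_text() must be called first")
        n_bytes = len(self._text) * self.SAMPLES_PER_CHAR * self.BYTES_PER_SAMPLE
        self._phase = 2                          # ready for the next input_text()
        return bytes(n_bytes)                    # zero-filled bytes stand in for audio
```

Steps (1) and (2) are the part that makes switching engines costly in the conventional flow, since they must be repeated for the new language before any audio can be produced.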

The memory unit (120) comprises a database section that supplies vocabulary to each TTS engine and a plurality of buffers for storing PCM data. The PCM data converted by the TTS engine unit is divided to fit the buffer size and stored in the buffers in fixed time units.
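Dividing PCM data to fit the buffer size amounts to slicing a byte string into fixed-size chunks, each of which corresponds to a fixed playback time. The following sketch shows one way to do that; the function names and the default audio format (8 kHz, 16-bit mono) are assumptions, since the patent gives no buffer size or sample format.

```python
def split_into_buffers(pcm, buffer_size):
    """Split PCM bytes into consecutive chunks of at most buffer_size bytes."""
    if buffer_size <= 0:
        raise ValueError("buffer_size must be positive")
    return [pcm[i:i + buffer_size] for i in range(0, len(pcm), buffer_size)]


def buffer_duration_ms(buffer_size, sample_rate=8000, bytes_per_sample=2):
    """Playback time of one full buffer, in milliseconds (mono audio assumed)."""
    return 1000 * buffer_size / (sample_rate * bytes_per_sample)
```

With these assumed defaults, a 3200-byte buffer holds 200 ms of audio, so the audio task can start playing after the first 200 ms of speech has been synthesized rather than after the whole sentence.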

The audio processing unit (130) converts the PCM data produced by the TTS engine unit (110) into an analog voice signal and outputs it. The audio processing unit (130) generally comprises an audio driver as a software module and an audio card as a hardware block.

The speaker (140) outputs the analog voice signal converted by the audio processing unit (130) as sound audible to the user.

The multi-language TTS processing procedure in a mobile communication terminal according to the present invention is described below with reference to the accompanying drawings.

FIG. 4 is a flowchart illustrating a multi-language TTS processing procedure in a mobile communication terminal according to the present invention.

Referring to FIG. 4, when multi-language text is input to the multi-language processing unit (100) of the mobile communication terminal, the multi-language processing unit (100) classifies the input text by language and passes it to the corresponding TTS engine (S41).

The TTS engine unit (110) drives the corresponding TTS engine (S42), converts the input text into PCM data using the corresponding database in the memory unit, and stores the PCM data sequentially, starting from the first buffer of the memory unit (120) (S43). Under the control of the terminal's control unit, the TTS engine unit (110) manages updating and deletion of the databases as well as buffer initialization and storage order.

The TTS engine unit (110) determines whether all of the received input text has been converted into PCM data (S44).

If step S44 determines that all of the input text has been converted into PCM data, the TTS engine unit determines whether text in another language has been input (S45).

If step S45 determines that text in a new language has been input, the TTS engine unit stops the currently running TTS engine, drives the TTS engine for the new language to convert the new-language text into PCM data (S47), and then performs step S44.
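The TTS-side loop of steps S41 to S47 can be sketched as follows. This is a minimal illustration, assuming the text has already been split into (language, text) runs; `StubEngine` and `run_tts_task` are hypothetical names, and the stub returns UTF-8 bytes as placeholder "PCM data" rather than real audio.

```python
class StubEngine:
    """Hypothetical stand-in for one per-language TTS engine."""

    def __init__(self, lang):
        self.lang = lang
        self.running = False

    def start(self):
        self.running = True

    def stop(self):
        self.running = False

    def convert(self, text):
        # Placeholder "PCM": the UTF-8 bytes of the text, not real audio.
        return text.encode("utf-8")


def run_tts_task(runs, engines, buffers):
    """Convert each run in order, switching engines only when the language changes."""
    active = None
    for lang, text in runs:               # S41: text already classified by language
        if lang != active:                # S45: text in a different language arrived
            if active is not None:
                engines[active].stop()    # stop the engine currently in use
            engines[lang].start()         # S42 for the first run, S47 afterwards
            active = lang
        buffers.append((lang, engines[lang].convert(text)))  # S43: store in order
    if active is not None:
        engines[active].stop()
    return buffers
```

Note that the loop never waits for playback. It only appends converted data to the buffers, which is what lets the engine switch overlap with audio that is already playing.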

Meanwhile, as soon as PCM data has been stored in the first buffer of the memory unit (120), the audio processing unit (130) fetches the stored PCM data and plays it back (S50).

The audio processing unit (130) then determines whether PCM data to be played back remains in the buffers of the memory unit (120) (S51) and, if so, determines whether the number of unplayed buffers is equal to or greater than a preset maximum (n) (S52).

If the number of unplayed buffers is equal to or greater than the preset maximum (n), the audio processing unit (130) requests that the TTS engine unit (110) stop operating (S53).

If step S52 determines that the number of unplayed buffers is less than the preset maximum (n), the audio processing unit (130) determines whether the number of unplayed buffers is equal to or less than a preset minimum (m) (S54).

If step S54 determines that the number of unplayed buffers is equal to or less than the preset minimum (m), the audio processing unit (130) requests that the TTS engine unit (110) convert the input text into PCM data using the corresponding database in the memory unit (120) and store it sequentially again, starting from the empty buffers of the memory unit (120) (S55).

If, on the other hand, step S54 determines that the number of unplayed buffers is greater than the preset minimum (m), the audio processing unit (130) returns control to step S50.

If step S51 determines that no PCM data remains to be played back, the audio task remains paused (S56).
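The checks in steps S51 to S56 form a small decision table driven by the number of unplayed buffers and the two thresholds n and m. The sketch below encodes that table as a pure function; the function name and the returned labels are hypothetical, and it assumes m is smaller than n, which the patent implies but does not state.

```python
def audio_task_decision(unplayed, n_max, m_min):
    """Map the unplayed-buffer count to the action taken by the audio task."""
    if unplayed == 0:
        return "pause_audio"    # S51 -> S56: nothing left to play
    if unplayed >= n_max:
        return "stop_tts"       # S52 -> S53: buffers are full enough, halt conversion
    if unplayed <= m_min:
        return "request_more"   # S54 -> S55: running low, ask the TTS engine to refill
    return "keep_playing"       # S54 -> S50: between the thresholds, just play on
```

For example, with n = 4 and m = 1, a count of 4 stops the TTS engine, a count of 1 asks it to resume, and counts of 2 or 3 change nothing. The gap between the two thresholds keeps the engine from being started and stopped on every buffer.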

As described above, according to the multi-language TTS processing method in a mobile communication terminal of the present invention, the system of a mobile terminal that performs multi-language TTS processing is divided into a TTS task, which drives the TTS engine matching the language of the input text to generate PCM data, and an audio task, which plays back the generated PCM data, and the two tasks operate independently.

The TTS task therefore drives the TTS engine matching the input text, generates PCM data continuously, and stores it sequentially in separate storage media, while the audio task plays back the generated PCM data sequentially. As a result, even when text containing multiple languages is input to the terminal, the user hears the voice information continuously, without waiting or interruption.
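The split into an independent TTS task and audio task resembles a producer and consumer sharing a pool of buffers. The sketch below models this with two Python threads and a `threading.Condition`; the class `PcmBufferPool`, the functions `tts_task` and `audio_task`, and the use of UTF-8 bytes as placeholder PCM data are assumptions for illustration, and a handset would use its own RTOS task primitives instead.

```python
import threading
from collections import deque


class PcmBufferPool:
    """Shared buffers with a high mark n (pause the producer) and a low mark m (resume it)."""

    def __init__(self, high_n, low_m):
        self.high_n = high_n
        self.low_m = low_m
        self._buffers = deque()
        self._cond = threading.Condition()
        self._tts_paused = False
        self._done = False

    def put(self, chunk):
        """Called by the TTS task; blocks while the TTS side is paused."""
        with self._cond:
            while self._tts_paused:
                self._cond.wait()
            self._buffers.append(chunk)
            if len(self._buffers) >= self.high_n:
                self._tts_paused = True        # corresponds to S53
            self._cond.notify_all()

    def close(self):
        """Called by the TTS task when all text has been converted."""
        with self._cond:
            self._done = True
            self._cond.notify_all()

    def get(self):
        """Called by the audio task; returns None once everything has been played."""
        with self._cond:
            while not self._buffers and not self._done:
                self._cond.wait()              # corresponds to S56
            if not self._buffers:
                return None
            chunk = self._buffers.popleft()
            if len(self._buffers) <= self.low_m:
                self._tts_paused = False       # corresponds to S55
                self._cond.notify_all()
            return chunk


def tts_task(runs, pool, chunk_size=4):
    for lang, text in runs:
        pcm = text.encode("utf-8")             # placeholder for real engine output
        for i in range(0, len(pcm), chunk_size):
            pool.put((lang, pcm[i:i + chunk_size]))
    pool.close()


def audio_task(pool, played):
    while True:
        chunk = pool.get()
        if chunk is None:
            break
        played.append(chunk)                   # a real task would send this to the audio driver
```

Running `tts_task` and `audio_task` in separate `threading.Thread` objects against one `PcmBufferPool` plays the chunks in the order they were produced, while the engine for the next language can start converting before the previous language has finished playing.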

The present invention is not limited to the embodiment described above and may be modified and varied without departing from its gist; such modifications and variations should be regarded as falling within the scope of the following claims.

Claims (3)

Translated from Korean

1. A multi-language TTS processing method in a mobile communication terminal, comprising: a first step of classifying input text by language and passing it to the corresponding TTS engine provided in the mobile communication terminal; a second step of driving that TTS engine, converting the input text into PCM data, and storing the PCM data sequentially, starting from the first of a plurality of storage media provided in the mobile communication terminal; a third step of determining, once conversion of the input text into PCM data is complete, whether text in another language has been input; and a fourth step of, if the determination shows that text in a new language has been input, stopping the currently running TTS engine and driving the TTS engine for the new language to convert the new-language text into PCM data, wherein, once the PCM data has been stored in the first medium, the PCM data stored sequentially in the plurality of media is played back in order.

2. The method of claim 1, further comprising: a fifth step of determining whether PCM data to be played back remains in the plurality of storage media; a sixth step of determining, if the PCM data to be played back remains, whether the number of storage media storing the PCM data is equal to or greater than a preset maximum (n); and a seventh step of requesting that the TTS engine stop operating if the determination shows that the number of storage media is equal to or greater than the preset maximum (n).

3. The method of claim 2, further comprising: an eighth step of determining, if the number of storage media is not equal to or greater than the preset maximum (n), whether the number of unplayed storage media is equal to or less than a preset minimum (m); and a ninth step of requesting, if the number of unplayed storage media is equal to or less than the preset minimum (m), that the corresponding TTS engine convert the input text into PCM data and store it in an empty storage medium.
KR1020050135350A | Priority date: 2005-12-30 | Filing date: 2005-12-30 | Multilingual TTS Processing Method in Mobile Communication Terminal | Withdrawn | KR20070071675A (en)

Priority Applications (1)

Application Number | Priority Date | Filing Date | Title
KR1020050135350A (KR20070071675A, en) | 2005-12-30 | 2005-12-30 | Multilingual TTS Processing Method in Mobile Communication Terminal

Applications Claiming Priority (1)

Application Number | Priority Date | Filing Date | Title
KR1020050135350A (KR20070071675A, en) | 2005-12-30 | 2005-12-30 | Multilingual TTS Processing Method in Mobile Communication Terminal

Publications (1)

Publication Number | Publication Date
KR20070071675A (en) | 2007-07-04

Family

ID=38506783

Family Applications (1)

Application Number | Title | Priority Date | Filing Date
KR1020050135350A (Withdrawn; KR20070071675A, en) | Multilingual TTS Processing Method in Mobile Communication Terminal | 2005-12-30 | 2005-12-30

Country Status (1)

Country | Link
KR (1) | KR20070071675A (en)

US10134385B2 (en)2012-03-022018-11-20Apple Inc.Systems and methods for name pronunciation
US10170123B2 (en)2014-05-302019-01-01Apple Inc.Intelligent assistant for home automation
US10176167B2 (en)2013-06-092019-01-08Apple Inc.System and method for inferring user intent from speech inputs
US10185542B2 (en)2013-06-092019-01-22Apple Inc.Device, method, and graphical user interface for enabling conversation persistence across two or more instances of a digital assistant
US10186254B2 (en)2015-06-072019-01-22Apple Inc.Context-based endpoint detection
US10192552B2 (en)2016-06-102019-01-29Apple Inc.Digital assistant providing whispered speech
US10199051B2 (en)2013-02-072019-02-05Apple Inc.Voice trigger for a digital assistant
US10223066B2 (en)2015-12-232019-03-05Apple Inc.Proactive assistance based on dialog communication between devices
US10241752B2 (en)2011-09-302019-03-26Apple Inc.Interface for a virtual digital assistant
US10241644B2 (en)2011-06-032019-03-26Apple Inc.Actionable reminder entries
US10249300B2 (en) | 2016-06-06 | 2019-04-02 | Apple Inc. | Intelligent list reading
US10255907B2 (en) | 2015-06-07 | 2019-04-09 | Apple Inc. | Automatic accent detection using acoustic models
US10269345B2 (en) | 2016-06-11 | 2019-04-23 | Apple Inc. | Intelligent task discovery
US10276170B2 (en) | 2010-01-18 | 2019-04-30 | Apple Inc. | Intelligent automated assistant
US10283110B2 (en) | 2009-07-02 | 2019-05-07 | Apple Inc. | Methods and apparatuses for automatic speech recognition
US10289433B2 (en) | 2014-05-30 | 2019-05-14 | Apple Inc. | Domain specific language for encoding assistant dialog
US10297253B2 (en) | 2016-06-11 | 2019-05-21 | Apple Inc. | Application integration with a digital assistant
US10318871B2 (en) | 2005-09-08 | 2019-06-11 | Apple Inc. | Method and apparatus for building an intelligent automated assistant
US10332518B2 (en) | 2017-05-09 | 2019-06-25 | Apple Inc. | User interface for correcting recognition errors
US10356243B2 (en) | 2015-06-05 | 2019-07-16 | Apple Inc. | Virtual assistant aided communication with 3rd party service in a communication session
US10354011B2 (en) | 2016-06-09 | 2019-07-16 | Apple Inc. | Intelligent automated assistant in a home environment
US10366158B2 (en) | 2015-09-29 | 2019-07-30 | Apple Inc. | Efficient word encoding for recurrent neural network language models
US10410637B2 (en) | 2017-05-12 | 2019-09-10 | Apple Inc. | User-specific acoustic models
US10446143B2 (en) | 2016-03-14 | 2019-10-15 | Apple Inc. | Identification of voice inputs providing credentials
US10446141B2 (en) | 2014-08-28 | 2019-10-15 | Apple Inc. | Automatic speech recognition based on user feedback
US10482874B2 (en) | 2017-05-15 | 2019-11-19 | Apple Inc. | Hierarchical belief states for digital assistants
US10490187B2 (en) | 2016-06-10 | 2019-11-26 | Apple Inc. | Digital assistant providing automated status report
US10496753B2 (en) | 2010-01-18 | 2019-12-03 | Apple Inc. | Automatically adapting user interfaces for hands-free interaction
US10509862B2 (en) | 2016-06-10 | 2019-12-17 | Apple Inc. | Dynamic phrase expansion of language input
US10521466B2 (en) | 2016-06-11 | 2019-12-31 | Apple Inc. | Data driven natural language event detection and classification
US10553209B2 (en) | 2010-01-18 | 2020-02-04 | Apple Inc. | Systems and methods for hands-free notification summaries
US10552013B2 (en) | 2014-12-02 | 2020-02-04 | Apple Inc. | Data detection
US10568032B2 (en) | 2007-04-03 | 2020-02-18 | Apple Inc. | Method and system for operating a multi-function portable electronic device using voice-activation
US10567477B2 (en) | 2015-03-08 | 2020-02-18 | Apple Inc. | Virtual assistant continuity
US10592095B2 (en) | 2014-05-23 | 2020-03-17 | Apple Inc. | Instantaneous speaking of content on touch devices
US10593346B2 (en) | 2016-12-22 | 2020-03-17 | Apple Inc. | Rank-reduced token representation for automatic speech recognition
US10607141B2 (en) | 2010-01-25 | 2020-03-31 | Newvaluexchange Ltd. | Apparatuses, methods and systems for a digital conversation management platform
US10659851B2 (en) | 2014-06-30 | 2020-05-19 | Apple Inc. | Real-time digital assistant knowledge updates
US10671428B2 (en) | 2015-09-08 | 2020-06-02 | Apple Inc. | Distributed personal assistant
US10679605B2 (en) | 2010-01-18 | 2020-06-09 | Apple Inc. | Hands-free list-reading by intelligent automated assistant
US10691473B2 (en) | 2015-11-06 | 2020-06-23 | Apple Inc. | Intelligent automated assistant in a messaging environment
US10706373B2 (en) | 2011-06-03 | 2020-07-07 | Apple Inc. | Performing actions associated with task items that represent tasks to perform
US10705794B2 (en) | 2010-01-18 | 2020-07-07 | Apple Inc. | Automatically adapting user interfaces for hands-free interaction
US10733993B2 (en) | 2016-06-10 | 2020-08-04 | Apple Inc. | Intelligent digital assistant in a multi-tasking environment
US10747498B2 (en) | 2015-09-08 | 2020-08-18 | Apple Inc. | Zero latency digital assistant
US10755703B2 (en) | 2017-05-11 | 2020-08-25 | Apple Inc. | Offline personal assistant
US10762293B2 (en) | 2010-12-22 | 2020-09-01 | Apple Inc. | Using parts-of-speech tagging and named entity recognition for spelling correction
US10789945B2 (en) | 2017-05-12 | 2020-09-29 | Apple Inc. | Low-latency intelligent automated assistant
US10789041B2 (en) | 2014-09-12 | 2020-09-29 | Apple Inc. | Dynamic thresholds for always listening speech trigger
US10791176B2 (en) | 2017-05-12 | 2020-09-29 | Apple Inc. | Synchronization and task delegation of a digital assistant
US10791216B2 (en) | 2013-08-06 | 2020-09-29 | Apple Inc. | Auto-activating smart responses based on activities from remote devices
US10810274B2 (en) | 2017-05-15 | 2020-10-20 | Apple Inc. | Optimizing dialogue policy decisions for digital assistants using implicit feedback
CN112652294A (en) * | 2020-12-25 | 2021-04-13 | 深圳追一科技有限公司 | Speech synthesis method, apparatus, computer device and storage medium
US11010550B2 (en) | 2015-09-29 | 2021-05-18 | Apple Inc. | Unified language modeling framework for word prediction, auto-completion and auto-correction
US11025565B2 (en) | 2015-06-07 | 2021-06-01 | Apple Inc. | Personalized prediction of responses for instant messaging
US11217255B2 (en) | 2017-05-16 | 2022-01-04 | Apple Inc. | Far-field extension for digital assistant services
US11281993B2 (en) | 2016-12-05 | 2022-03-22 | Apple Inc. | Model and ensemble compression for metric learning
US11587559B2 (en) | 2015-09-30 | 2023-02-21 | Apple Inc. | Intelligent device identification

Cited By (165)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
US9646614B2 (en) | 2000-03-16 | 2017-05-09 | Apple Inc. | Fast, language-independent method for user authentication by voice
US10318871B2 (en) | 2005-09-08 | 2019-06-11 | Apple Inc. | Method and apparatus for building an intelligent automated assistant
US9117447B2 (en) | 2006-09-08 | 2015-08-25 | Apple Inc. | Using event alert text as input to an automated assistant
US8942986B2 (en) | 2006-09-08 | 2015-01-27 | Apple Inc. | Determining user intent based on ontologies of domains
US8930191B2 (en) | 2006-09-08 | 2015-01-06 | Apple Inc. | Paraphrasing of user requests and results by automated digital assistant
US10568032B2 (en) | 2007-04-03 | 2020-02-18 | Apple Inc. | Method and system for operating a multi-function portable electronic device using voice-activation
US10381016B2 (en) | 2008-01-03 | 2019-08-13 | Apple Inc. | Methods and apparatus for altering audio output signals
US9330720B2 (en) | 2008-01-03 | 2016-05-03 | Apple Inc. | Methods and apparatus for altering audio output signals
US9626955B2 (en) | 2008-04-05 | 2017-04-18 | Apple Inc. | Intelligent text-to-speech conversion
US9865248B2 (en) | 2008-04-05 | 2018-01-09 | Apple Inc. | Intelligent text-to-speech conversion
US10108612B2 (en) | 2008-07-31 | 2018-10-23 | Apple Inc. | Mobile device having human language translation capability with positional feedback
US9535906B2 (en) | 2008-07-31 | 2017-01-03 | Apple Inc. | Mobile device having human language translation capability with positional feedback
US9959870B2 (en) | 2008-12-11 | 2018-05-01 | Apple Inc. | Speech recognition involving a mobile device
US11080012B2 (en) | 2009-06-05 | 2021-08-03 | Apple Inc. | Interface for a virtual digital assistant
US10795541B2 (en) | 2009-06-05 | 2020-10-06 | Apple Inc. | Intelligent organization of tasks items
US10475446B2 (en) | 2009-06-05 | 2019-11-12 | Apple Inc. | Using context information to facilitate processing of commands in a virtual assistant
US9858925B2 (en) | 2009-06-05 | 2018-01-02 | Apple Inc. | Using context information to facilitate processing of commands in a virtual assistant
US10283110B2 (en) | 2009-07-02 | 2019-05-07 | Apple Inc. | Methods and apparatuses for automatic speech recognition
US10553209B2 (en) | 2010-01-18 | 2020-02-04 | Apple Inc. | Systems and methods for hands-free notification summaries
US11423886B2 (en) | 2010-01-18 | 2022-08-23 | Apple Inc. | Task flow identification based on user intent
US9318108B2 (en) | 2010-01-18 | 2016-04-19 | Apple Inc. | Intelligent automated assistant
US8892446B2 (en) | 2010-01-18 | 2014-11-18 | Apple Inc. | Service orchestration for intelligent automated assistant
US10496753B2 (en) | 2010-01-18 | 2019-12-03 | Apple Inc. | Automatically adapting user interfaces for hands-free interaction
US10705794B2 (en) | 2010-01-18 | 2020-07-07 | Apple Inc. | Automatically adapting user interfaces for hands-free interaction
US10276170B2 (en) | 2010-01-18 | 2019-04-30 | Apple Inc. | Intelligent automated assistant
US10679605B2 (en) | 2010-01-18 | 2020-06-09 | Apple Inc. | Hands-free list-reading by intelligent automated assistant
US10706841B2 (en) | 2010-01-18 | 2020-07-07 | Apple Inc. | Task flow identification based on user intent
US9548050B2 (en) | 2010-01-18 | 2017-01-17 | Apple Inc. | Intelligent automated assistant
US12087308B2 (en) | 2010-01-18 | 2024-09-10 | Apple Inc. | Intelligent automated assistant
US8903716B2 (en) | 2010-01-18 | 2014-12-02 | Apple Inc. | Personalized vocabulary for digital assistant
US10607140B2 (en) | 2010-01-25 | 2020-03-31 | Newvaluexchange Ltd. | Apparatuses, methods and systems for a digital conversation management platform
US12307383B2 (en) | 2010-01-25 | 2025-05-20 | Newvaluexchange Global Ai Llp | Apparatuses, methods and systems for a digital conversation management platform
US10607141B2 (en) | 2010-01-25 | 2020-03-31 | Newvaluexchange Ltd. | Apparatuses, methods and systems for a digital conversation management platform
US10984327B2 (en) | 2010-01-25 | 2021-04-20 | New Valuexchange Ltd. | Apparatuses, methods and systems for a digital conversation management platform
US11410053B2 (en) | 2010-01-25 | 2022-08-09 | Newvaluexchange Ltd. | Apparatuses, methods and systems for a digital conversation management platform
US10984326B2 (en) | 2010-01-25 | 2021-04-20 | Newvaluexchange Ltd. | Apparatuses, methods and systems for a digital conversation management platform
US10049675B2 (en) | 2010-02-25 | 2018-08-14 | Apple Inc. | User profiling for voice input processing
US9633660B2 (en) | 2010-02-25 | 2017-04-25 | Apple Inc. | User profiling for voice input processing
US10762293B2 (en) | 2010-12-22 | 2020-09-01 | Apple Inc. | Using parts-of-speech tagging and named entity recognition for spelling correction
US9262612B2 (en) | 2011-03-21 | 2016-02-16 | Apple Inc. | Device access using voice authentication
US10102359B2 (en) | 2011-03-21 | 2018-10-16 | Apple Inc. | Device access using voice authentication
US10241644B2 (en) | 2011-06-03 | 2019-03-26 | Apple Inc. | Actionable reminder entries
US10706373B2 (en) | 2011-06-03 | 2020-07-07 | Apple Inc. | Performing actions associated with task items that represent tasks to perform
US10057736B2 (en) | 2011-06-03 | 2018-08-21 | Apple Inc. | Active transport based notifications
US11120372B2 (en) | 2011-06-03 | 2021-09-14 | Apple Inc. | Performing actions associated with task items that represent tasks to perform
US9798393B2 (en) | 2011-08-29 | 2017-10-24 | Apple Inc. | Text correction processing
US10241752B2 (en) | 2011-09-30 | 2019-03-26 | Apple Inc. | Interface for a virtual digital assistant
US10134385B2 (en) | 2012-03-02 | 2018-11-20 | Apple Inc. | Systems and methods for name pronunciation
US9483461B2 (en) | 2012-03-06 | 2016-11-01 | Apple Inc. | Handling speech synthesis of content for multiple languages
US9953088B2 (en) | 2012-05-14 | 2018-04-24 | Apple Inc. | Crowd sourcing information to fulfill user requests
US10079014B2 (en) | 2012-06-08 | 2018-09-18 | Apple Inc. | Name recognition system
US9495129B2 (en) | 2012-06-29 | 2016-11-15 | Apple Inc. | Device, method, and user interface for voice-activated navigation and browsing of a document
US9576574B2 (en) | 2012-09-10 | 2017-02-21 | Apple Inc. | Context-sensitive handling of interruptions by intelligent digital assistant
US9971774B2 (en) | 2012-09-19 | 2018-05-15 | Apple Inc. | Voice-based media searching
US10978090B2 (en) | 2013-02-07 | 2021-04-13 | Apple Inc. | Voice trigger for a digital assistant
US10199051B2 (en) | 2013-02-07 | 2019-02-05 | Apple Inc. | Voice trigger for a digital assistant
US9368114B2 (en) | 2013-03-14 | 2016-06-14 | Apple Inc. | Context-sensitive handling of interruptions
US9697822B1 (en) | 2013-03-15 | 2017-07-04 | Apple Inc. | System and method for updating an adaptive speech recognition model
US9922642B2 (en) | 2013-03-15 | 2018-03-20 | Apple Inc. | Training an at least partial voice command system
US9620104B2 (en) | 2013-06-07 | 2017-04-11 | Apple Inc. | System and method for user-specified pronunciation of words for speech synthesis and recognition
US9633674B2 (en) | 2013-06-07 | 2017-04-25 | Apple Inc. | System and method for detecting errors in interactions with a voice-based digital assistant
US9966060B2 (en) | 2013-06-07 | 2018-05-08 | Apple Inc. | System and method for user-specified pronunciation of words for speech synthesis and recognition
US9582608B2 (en) | 2013-06-07 | 2017-02-28 | Apple Inc. | Unified ranking with entropy-weighted information for phrase-based semantic auto-completion
US9966068B2 (en) | 2013-06-08 | 2018-05-08 | Apple Inc. | Interpreting and acting upon commands that involve sharing information with remote devices
US10657961B2 (en) | 2013-06-08 | 2020-05-19 | Apple Inc. | Interpreting and acting upon commands that involve sharing information with remote devices
US10185542B2 (en) | 2013-06-09 | 2019-01-22 | Apple Inc. | Device, method, and graphical user interface for enabling conversation persistence across two or more instances of a digital assistant
US10176167B2 (en) | 2013-06-09 | 2019-01-08 | Apple Inc. | System and method for inferring user intent from speech inputs
US9300784B2 (en) | 2013-06-13 | 2016-03-29 | Apple Inc. | System and method for emergency calls initiated by voice command
US10791216B2 (en) | 2013-08-06 | 2020-09-29 | Apple Inc. | Auto-activating smart responses based on activities from remote devices
US9620105B2 (en) | 2014-05-15 | 2017-04-11 | Apple Inc. | Analyzing audio input for efficient speech and music recognition
US10592095B2 (en) | 2014-05-23 | 2020-03-17 | Apple Inc. | Instantaneous speaking of content on touch devices
US9502031B2 (en) | 2014-05-27 | 2016-11-22 | Apple Inc. | Method for supporting dynamic grammars in WFST-based ASR
US10170123B2 (en) | 2014-05-30 | 2019-01-01 | Apple Inc. | Intelligent assistant for home automation
US9842101B2 (en) | 2014-05-30 | 2017-12-12 | Apple Inc. | Predictive conversion of language input
US9715875B2 (en) | 2014-05-30 | 2017-07-25 | Apple Inc. | Reducing the need for manual start/end-pointing and trigger phrases
US11257504B2 (en) | 2014-05-30 | 2022-02-22 | Apple Inc. | Intelligent assistant for home automation
US10169329B2 (en) | 2014-05-30 | 2019-01-01 | Apple Inc. | Exemplar-based natural language processing
US10289433B2 (en) | 2014-05-30 | 2019-05-14 | Apple Inc. | Domain specific language for encoding assistant dialog
US10083690B2 (en) | 2014-05-30 | 2018-09-25 | Apple Inc. | Better resolution when referencing to concepts
US10078631B2 (en) | 2014-05-30 | 2018-09-18 | Apple Inc. | Entropy-guided text prediction using combined word and character n-gram language models
US10497365B2 (en) | 2014-05-30 | 2019-12-03 | Apple Inc. | Multi-command single utterance input method
US9633004B2 (en) | 2014-05-30 | 2017-04-25 | Apple Inc. | Better resolution when referencing to concepts
US9734193B2 (en) | 2014-05-30 | 2017-08-15 | Apple Inc. | Determining domain salience ranking from ambiguous words in natural speech
US9760559B2 (en) | 2014-05-30 | 2017-09-12 | Apple Inc. | Predictive text input
US9966065B2 (en) | 2014-05-30 | 2018-05-08 | Apple Inc. | Multi-command single utterance input method
US9430463B2 (en) | 2014-05-30 | 2016-08-30 | Apple Inc. | Exemplar-based natural language processing
US11133008B2 (en) | 2014-05-30 | 2021-09-28 | Apple Inc. | Reducing the need for manual start/end-pointing and trigger phrases
US9785630B2 (en) | 2014-05-30 | 2017-10-10 | Apple Inc. | Text prediction using combined word N-gram and unigram language models
US10904611B2 (en) | 2014-06-30 | 2021-01-26 | Apple Inc. | Intelligent automated assistant for TV user interactions
US9668024B2 (en) | 2014-06-30 | 2017-05-30 | Apple Inc. | Intelligent automated assistant for TV user interactions
US9338493B2 (en) | 2014-06-30 | 2016-05-10 | Apple Inc. | Intelligent automated assistant for TV user interactions
US10659851B2 (en) | 2014-06-30 | 2020-05-19 | Apple Inc. | Real-time digital assistant knowledge updates
US10446141B2 (en) | 2014-08-28 | 2019-10-15 | Apple Inc. | Automatic speech recognition based on user feedback
US9818400B2 (en) | 2014-09-11 | 2017-11-14 | Apple Inc. | Method and apparatus for discovering trending terms in speech requests
US10431204B2 (en) | 2014-09-11 | 2019-10-01 | Apple Inc. | Method and apparatus for discovering trending terms in speech requests
US10789041B2 (en) | 2014-09-12 | 2020-09-29 | Apple Inc. | Dynamic thresholds for always listening speech trigger
US10127911B2 (en) | 2014-09-30 | 2018-11-13 | Apple Inc. | Speaker identification and unsupervised speaker adaptation techniques
US9646609B2 (en) | 2014-09-30 | 2017-05-09 | Apple Inc. | Caching apparatus for serving phonetic pronunciations
US10074360B2 (en) | 2014-09-30 | 2018-09-11 | Apple Inc. | Providing an indication of the suitability of speech recognition
US9886432B2 (en) | 2014-09-30 | 2018-02-06 | Apple Inc. | Parsimonious handling of word inflection via categorical stem + suffix N-gram language models
US9986419B2 (en) | 2014-09-30 | 2018-05-29 | Apple Inc. | Social reminders
US9668121B2 (en) | 2014-09-30 | 2017-05-30 | Apple Inc. | Social reminders
US11556230B2 (en) | 2014-12-02 | 2023-01-17 | Apple Inc. | Data detection
US10552013B2 (en) | 2014-12-02 | 2020-02-04 | Apple Inc. | Data detection
US9711141B2 (en) | 2014-12-09 | 2017-07-18 | Apple Inc. | Disambiguating heteronyms in speech synthesis
US9865280B2 (en) | 2015-03-06 | 2018-01-09 | Apple Inc. | Structured dictation using intelligent automated assistants
US10311871B2 (en) | 2015-03-08 | 2019-06-04 | Apple Inc. | Competing devices responding to voice triggers
US10567477B2 (en) | 2015-03-08 | 2020-02-18 | Apple Inc. | Virtual assistant continuity
US9721566B2 (en) | 2015-03-08 | 2017-08-01 | Apple Inc. | Competing devices responding to voice triggers
US9886953B2 (en) | 2015-03-08 | 2018-02-06 | Apple Inc. | Virtual assistant activation
US11087759B2 (en) | 2015-03-08 | 2021-08-10 | Apple Inc. | Virtual assistant activation
US9899019B2 (en) | 2015-03-18 | 2018-02-20 | Apple Inc. | Systems and methods for structured stem and suffix language models
US9842105B2 (en) | 2015-04-16 | 2017-12-12 | Apple Inc. | Parsimonious continuous-space phrase representations for natural language processing
US10083688B2 (en) | 2015-05-27 | 2018-09-25 | Apple Inc. | Device voice control for selecting a displayed affordance
US10127220B2 (en) | 2015-06-04 | 2018-11-13 | Apple Inc. | Language identification from short strings
US10101822B2 (en) | 2015-06-05 | 2018-10-16 | Apple Inc. | Language input correction
US10356243B2 (en) | 2015-06-05 | 2019-07-16 | Apple Inc. | Virtual assistant aided communication with 3rd party service in a communication session
US10186254B2 (en) | 2015-06-07 | 2019-01-22 | Apple Inc. | Context-based endpoint detection
US11025565B2 (en) | 2015-06-07 | 2021-06-01 | Apple Inc. | Personalized prediction of responses for instant messaging
US10255907B2 (en) | 2015-06-07 | 2019-04-09 | Apple Inc. | Automatic accent detection using acoustic models
US10671428B2 (en) | 2015-09-08 | 2020-06-02 | Apple Inc. | Distributed personal assistant
US10747498B2 (en) | 2015-09-08 | 2020-08-18 | Apple Inc. | Zero latency digital assistant
US11500672B2 (en) | 2015-09-08 | 2022-11-15 | Apple Inc. | Distributed personal assistant
US9697820B2 (en) | 2015-09-24 | 2017-07-04 | Apple Inc. | Unit-selection text-to-speech synthesis using concatenation-sensitive neural networks
US11010550B2 (en) | 2015-09-29 | 2021-05-18 | Apple Inc. | Unified language modeling framework for word prediction, auto-completion and auto-correction
US10366158B2 (en) | 2015-09-29 | 2019-07-30 | Apple Inc. | Efficient word encoding for recurrent neural network language models
US11587559B2 (en) | 2015-09-30 | 2023-02-21 | Apple Inc. | Intelligent device identification
US10691473B2 (en) | 2015-11-06 | 2020-06-23 | Apple Inc. | Intelligent automated assistant in a messaging environment
US11526368B2 (en) | 2015-11-06 | 2022-12-13 | Apple Inc. | Intelligent automated assistant in a messaging environment
US10049668B2 (en) | 2015-12-02 | 2018-08-14 | Apple Inc. | Applying neural network language models to weighted finite state transducers for automatic speech recognition
US10223066B2 (en) | 2015-12-23 | 2019-03-05 | Apple Inc. | Proactive assistance based on dialog communication between devices
US10446143B2 (en) | 2016-03-14 | 2019-10-15 | Apple Inc. | Identification of voice inputs providing credentials
US9934775B2 (en) | 2016-05-26 | 2018-04-03 | Apple Inc. | Unit-selection text-to-speech synthesis based on predicted concatenation parameters
US9972304B2 (en) | 2016-06-03 | 2018-05-15 | Apple Inc. | Privacy preserving distributed evaluation framework for embedded personalized systems
US10249300B2 (en) | 2016-06-06 | 2019-04-02 | Apple Inc. | Intelligent list reading
US11069347B2 (en) | 2016-06-08 | 2021-07-20 | Apple Inc. | Intelligent automated assistant for media exploration
US10049663B2 (en) | 2016-06-08 | 2018-08-14 | Apple, Inc. | Intelligent automated assistant for media exploration
US10354011B2 (en) | 2016-06-09 | 2019-07-16 | Apple Inc. | Intelligent automated assistant in a home environment
US10192552B2 (en) | 2016-06-10 | 2019-01-29 | Apple Inc. | Digital assistant providing whispered speech
US10509862B2 (en) | 2016-06-10 | 2019-12-17 | Apple Inc. | Dynamic phrase expansion of language input
US11037565B2 (en) | 2016-06-10 | 2021-06-15 | Apple Inc. | Intelligent digital assistant in a multi-tasking environment
US10733993B2 (en) | 2016-06-10 | 2020-08-04 | Apple Inc. | Intelligent digital assistant in a multi-tasking environment
US10067938B2 (en) | 2016-06-10 | 2018-09-04 | Apple Inc. | Multilingual word prediction
US10490187B2 (en) | 2016-06-10 | 2019-11-26 | Apple Inc. | Digital assistant providing automated status report
US10297253B2 (en) | 2016-06-11 | 2019-05-21 | Apple Inc. | Application integration with a digital assistant
US10269345B2 (en) | 2016-06-11 | 2019-04-23 | Apple Inc. | Intelligent task discovery
US11152002B2 (en) | 2016-06-11 | 2021-10-19 | Apple Inc. | Application integration with a digital assistant
US10521466B2 (en) | 2016-06-11 | 2019-12-31 | Apple Inc. | Data driven natural language event detection and classification
US10089072B2 (en) | 2016-06-11 | 2018-10-02 | Apple Inc. | Intelligent device arbitration and control
US10553215B2 (en) | 2016-09-23 | 2020-02-04 | Apple Inc. | Intelligent automated assistant
US10043516B2 (en) | 2016-09-23 | 2018-08-07 | Apple Inc. | Intelligent automated assistant
US11281993B2 (en) | 2016-12-05 | 2022-03-22 | Apple Inc. | Model and ensemble compression for metric learning
US10593346B2 (en) | 2016-12-22 | 2020-03-17 | Apple Inc. | Rank-reduced token representation for automatic speech recognition
KR20180079759A (en) * | 2017-01-02 | 2018-07-11 | 삼성전자주식회사 | Method and terminal for recognizing a text
US10332518B2 (en) | 2017-05-09 | 2019-06-25 | Apple Inc. | User interface for correcting recognition errors
US10755703B2 (en) | 2017-05-11 | 2020-08-25 | Apple Inc. | Offline personal assistant
US10789945B2 (en) | 2017-05-12 | 2020-09-29 | Apple Inc. | Low-latency intelligent automated assistant
US11405466B2 (en) | 2017-05-12 | 2022-08-02 | Apple Inc. | Synchronization and task delegation of a digital assistant
US10791176B2 (en) | 2017-05-12 | 2020-09-29 | Apple Inc. | Synchronization and task delegation of a digital assistant
US10410637B2 (en) | 2017-05-12 | 2019-09-10 | Apple Inc. | User-specific acoustic models
US10810274B2 (en) | 2017-05-15 | 2020-10-20 | Apple Inc. | Optimizing dialogue policy decisions for digital assistants using implicit feedback
US10482874B2 (en) | 2017-05-15 | 2019-11-19 | Apple Inc. | Hierarchical belief states for digital assistants
US11217255B2 (en) | 2017-05-16 | 2022-01-04 | Apple Inc. | Far-field extension for digital assistant services
CN112652294B (en) * | 2020-12-25 | 2023-10-24 | 深圳追一科技有限公司 | Speech synthesis method, device, computer equipment and storage medium
CN112652294A (en) * | 2020-12-25 | 2021-04-13 | 深圳追一科技有限公司 | Speech synthesis method, apparatus, computer device and storage medium

Similar Documents

Publication | Publication Date | Title
KR20070071675A (en) | | Multilingual TTS Processing Method in Mobile Communication Terminal
JP5320064B2 (en) | | Voice-controlled wireless communication device / system
US20030200858A1 (en) | | Mixing MP3 audio and T T P for enhanced E-book application
US8606560B2 (en) | | Automatic simultaneous interpertation system
US8340797B2 (en) | | Method and system for generating and processing digital content based on text-to-speech conversion
JP7230145B2 (en) | | Context Denormalization for Automatic Speech Recognition
JPH0830287A (en) | | Text-speech converting system
JPH08339198A (en) | | Presentation device
US20140019132A1 (en) | | Information processing apparatus, information processing method, display control apparatus, and display control method
CN100514384C (en) | | Talking e-book
JP2006119534A (en) | | Computer system, method for supporting correction work, and program
US20080243510A1 (en) | | Overlapping screen reading of non-sequential text
CN100487788C (en) | | Method for realizing text-to-speech function
US11250837B2 (en) | | Speech synthesis system, method and non-transitory computer readable medium with language option selection and acoustic models
KR20030079497A (en) | | service method of language study
KR102479023B1 (en) | | Apparatus, method and program for providing foreign language learning service
JP2016012315A (en) | | Spaced-wording unit dividing program for text data, and electronic book reproduction device
JP2000148176A (en) | | Information processing device and method, serving medium, speech recognision system, speech synthesizing system, translation device and method, and translation system
JP2005326811A (en) | | Speech synthesis apparatus and speech synthesis method
JP2000293187A (en) | | Device and method for synthesizing data voice
Shabtay et al. | | Spoken question answering for visual queries
CN120068887A (en) | | Translation method, video conference method and electronic equipment
WO2023132140A1 (en) | | Program, file generation method, information processing device, and information processing system
JP2006047866A (en) | | Electronic dictionary device and control method thereof
KR20110054218A (en) | | Audio playback device and history method including history storage module

Legal Events

Date | Code | Title | Description
 | PA0109 | Patent application | Patent event code: PA01091R01D; Comment text: Patent Application; Patent event date: 20051230
 | PG1501 | Laying open of application |
 | PC1203 | Withdrawal of no request for examination |
 | WITN | Application deemed withdrawn, e.g. because no request for examination was filed or no examination fee was paid |
