Movatterモバイル変換


[0]ホーム

URL:


CN101201892A - Voice coding talking book and pickup main body - Google Patents

Voice coding talking book and pickup main body
Download PDF

Info

Publication number
CN101201892A
CN101201892ACNA2006102012899ACN200610201289ACN101201892ACN 101201892 ACN101201892 ACN 101201892ACN A2006102012899 ACNA2006102012899 ACN A2006102012899ACN 200610201289 ACN200610201289 ACN 200610201289ACN 101201892 ACN101201892 ACN 101201892A
Authority
CN
China
Prior art keywords
sentence
aforementioned
pronunciation
word
language
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
CNA2006102012899A
Other languages
Chinese (zh)
Inventor
游旭
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by IndividualfiledCriticalIndividual
Priority to CNA2006102012899ApriorityCriticalpatent/CN101201892A/en
Publication of CN101201892ApublicationCriticalpatent/CN101201892A/en
Withdrawnlegal-statusCriticalCurrent

Links

Images

Landscapes

Abstract

Translated fromChinese

本发明涉及一种使用拼合信息码的语音编码有声读物及拾音器,在语音编码有声读物的页面上有码字,一般是每个或几个印刷的句子(语句)旁边印一个码字,码字的值一般是与数个表示如何用音码拼合成语句发音的拼合信息的值有关的,由音码来完成拼合,用作拼合的音码可以是词组合,单词(词),词根或音标等,所涉及的语种既可以是英语,法语,德语,俄语等,也可以是汉语,韩语等。拾音器读取这种码字,并解码播出语句发音来。拾音器的读码头可以使用条码读码头或导电码读码头,而相应的码字是条码或导电码等。The present invention relates to a voice-coded audio book and a pickup using a splicing information code. There are code words on the pages of the voice-coded audio book, generally one code word is printed next to each or several printed sentences (statements), and the value of the code word is generally related to the values of several splicing information indicating how to use the phonetic code to splice the sentence pronunciation. The splicing is completed by the phonetic code, and the phonetic code used for splicing can be a word combination, a word (word), a root or a phonetic symbol, etc. The language involved can be English, French, German, Russian, etc., or Chinese, Korean, etc. The pickup reads the code word and decodes and broadcasts the sentence pronunciation. The reading terminal of the pickup can use a bar code reading terminal or a conductive code reading terminal, and the corresponding code word is a bar code or a conductive code, etc.

Description

Translated fromChinese
一种语音编码有声读物及拾音器主体A speech coding audiobook and a pickup body

技术领域:Technical field:

本发明涉及一种使用音码来拼合语句发音的有声读物及拾音器的解决方案。The invention relates to an audio book and a sound pick-up solution which uses phonetic codes to spell out the pronunciation of sentences.

背景技术:Background technique:

在现有技术中,有声读物的解决形式一般采取发音器带各种读码头,根据读得的码的内容对应发声。如中国专利号为:91207427.2  的专利,其读码方式是卡片读取器,这类解决方式是读得的码的内容为对应的语音的顺序编号或磁带的地址,然后寻号发声,其缺点是语音通用性差,如更换有声读物,就需更换语音存储器内容或磁带,麻烦而且成本也高。In the prior art, audiobooks are generally solved in the form of a pronunciation device with various reading terminals, and corresponding sounds are made according to the content of the read codes. For example, the Chinese patent number is: 91207427.2 patent, its code reading method is a card reader, this kind of solution is that the content of the read code is the sequence number of the corresponding voice or the address of the tape, and then find the number and make a sound, its disadvantages It is poor voice versatility. If you change audiobooks, you need to change the content of the voice memory or the tape, which is troublesome and expensive.

发明内容:Invention content:

本发明的任务是提供一种结构简单,成本低,使用方便的语音编码有声读物系统,包括语音编码有声读物及拾音器主体(还有可能包括数据卡或部件式数据卡)。The task of the present invention is to provide a simple structure, low cost, and easy-to-use voice-coded audiobook system, including the voice-coded audiobook and a pickup body (may also include a data card or a component data card).

本发明的主要内容:对某语种(比如英语或汉语),将此语种中能表示发音或能转换成发音的比如元辅音,单词(词)等作为音码,对此语种一语句,可用多个音码来拼合成此语句的发音,对此语种另一语句,可用另外的多个音码来拼合成此另一语句的发音,照此对此语种的多个语句中每个语句,都可用多个音码来拼合成此语句的发音,(以上5句话中提到的‘多’互相不一定是相同数目)。Main content of the present invention: to certain language (such as English or Chinese), can represent pronunciation or can be converted into pronunciation in this language such as vowel consonant, word (word) etc. phonetic codes to form the pronunciation of this sentence, another sentence of this language can be combined with other multiple phonetic codes to form the pronunciation of this other sentence, as such, each sentence in the multiple sentences of this language can be The pronunciation of this sentence can be put together by a plurality of phonetic codes, (' many ' mentioned in the above 5 sentences are not necessarily the same number each other).

此语音编码有声读物系统在使用时主要涉及:This speech coding audiobook system mainly involves in use:

一是语音编码有声读物,其主体是读物,在读物的整页面上形成有诸个码字(诸个码字中的每个码字的值是共与数个表示如何用音码拼合成语句发音的拼合信息的值有关的,可称诸个码字中的各码字为拼合信息码,一般一个拼合信息码的码值有一个表示语句发音的拼合信息的值),各码字的码种可以是条形码,导电码等;一本语音编码有声读物的整页面上可以只有一个语种的码字,也可以有多个语种的码字。The one, the phonetic coding audiobook, its main body is reading material, forms all codewords (the value of each codeword in all codewords is the total and several representation how to splice into sentence with phonetic sign indicating number) on the whole page of reading material The value of the combined information of pronunciation is relevant, each code word in each code word can be said to be combined information code, generally the code value of a combined information code has a value representing the combined information of sentence pronunciation), the code word of each code word The type can be a barcode, a conductive code, etc.; the entire page of a speech-encoded audiobook can have code words of only one language, or code words of multiple languages.

二是拾音器,拾音器用于与语音编码有声读物配合使用,使用时,使拾音器的读码头与语音编码有声读物的整页面上的拼合信息码接触或接近,由拾音器利用读码头读取语音编码有声读物的整页面上的拼合信息码,并根据拼合信息码的码值来播出语句发音。The second is the pickup. The pickup is used in conjunction with the speech-coded audiobook. When in use, the reading dock of the pickup is in contact with or close to the spliced information code on the entire page of the speech-coded audiobook. The pickup uses the reading dock to read the speech coded audio. The combined information code on the entire page of the reading material, and the pronunciation of the sentence is played according to the code value of the combined information code.

拾音器可以是非插卡方式拾音器,此种拾音器就是非插卡方式拾音器主体本身;拾音器还可以是插数据卡方式拾音器,相应的插数据卡方式拾音器主体应有卡的插槽,使用时,把数个(一个或多个)数据卡插入插数据卡方式拾音器主体使用,这时这数个数据卡和此拾音器主体合为一个插数据卡方式拾音器,这数个数据卡上一般存储有对语音编码有声读物的整页面上的拼合信息码涉及的拼合信息的值进行解码的解码程序及解码数据(如拾音器是采用直接产音方式或既采用直接产音方式又采用TTS产音方式的拾音器,那数据卡上一般还应存储有诸音码的发音数据);各数据卡可与语音编码有声读物配套或与插数据卡方式拾音器主体配套;拾音器还可以是插部件式数据卡方式拾音器。The pickup can be a non-card pickup, which is the main body of the non-card pickup; the pickup can also be a data card pickup, and the corresponding data card pickup body should have a card slot. One (one or more) data cards are inserted into the main body of the pickup in the way of inserting the data card for use. At this moment, these several data cards and the main body of the pickup are combined into a pickup in the way of inserting the data card. Generally, a pair of speech codes are stored on the several data cards. The decoding program and decoding data for decoding the value of the combined information code involved in the combined information code on the entire page of the audiobook (if the pickup is a pickup that adopts a direct sound production method or both a direct sound production method and a TTS sound production method, then Generally, the pronunciation data of all sound codes should also be stored on the data card); each data card can be matched with the speech coding audiobook or be matched with the main body of the pickup of the mode of inserting the data card;

一般情况,是将一个数据卡插入一个插数据卡方式拾音器主体使此数据卡与此插数据卡方式拾音器主体合为一个插数据卡方式拾音器;一般情况,是将一个部件式数据卡插入一个插部件式数据卡方式拾音器主体使此部件式数据卡与此插部件式数据卡方式拾音器主体合为一个插部件式数据卡方式拾音器。In general, a data card is inserted into the main body of a data card-inserting pickup so that the data card and the data-card-plugging pickup body are combined into a data card-inserting pickup; in general, a component data card is inserted into a plug-in The main body of the pickup in the form of a component data card makes the main body of the pickup in the form of a plug-in data card combined with the main body of the pickup in the form of a plug-in data card to form a pickup in the form of a plug-in data card.

非插卡方式拾音器主体或插数据卡方式拾音器主体的硬件一般包括一个读码头,一个单片机芯片,一个发音器件,存储器,按键模块等。The hardware of the main body of the non-card mode pickup or the data card mode pickup generally includes a reading terminal, a single-chip microcomputer chip, a sounding device, a memory, a key module, and the like.

是使用编码规则来编码生成语音编码有声读物的整页面上的码字涉及的表示语句发音的拼合信息的值。表示语句发音的拼合信息的值是用编码规则中有关此语句的语种的编码细则来编码生成的。It is the value of the splicing information representing the pronunciation of the sentence involved in the code words on the entire page of the generated phonetic coded audiobook by using the coding rules. The value of the combined information representing the pronunciation of the sentence is coded and generated using the coding rules related to the language of the sentence in the coding rules.

对于印欧语系,一种直播方式的以只适用于英语为例的编码规则包括如下编码细则:编码语句集范围完整定义,用于解码时按直接发音方式播放语句发音的音码语音的处理的方式定义,用于编码的音码的种类选择,对音码赋编号,单词数据库建立,词类的建立,各种词段编码方式,时态词形编码处理,陌生单词编码处理及附加头段信息等。具体为:For the Indo-European languages, a live encoding rule that only applies to English as an example includes the following encoding rules: complete definition of the scope of the encoding sentence set, and the processing of the phonetic code voice used to play the pronunciation of the sentence in the direct pronunciation mode during decoding Method definition, selection of phonetic codes used for coding, numbering of phonetic codes, establishment of word database, establishment of part of speech, various word segment coding methods, tense word form coding processing, unfamiliar word coding processing and additional header information wait. Specifically:

完整定义的编码语句集的范围是任意英语语句,也即编码语句集包括任意英语语句。The scope of the fully defined coded sentence set is any English sentence, that is, the coded sentence set includes any English sentence.

用于解码时按直接发音方式播放语句发音的音码语音的处理方式,可以是将各个元音,辅音的发音录下来。When being used for decoding, the processing method of the phonetic code voice of playing the pronunciation of the sentence in a direct pronunciation mode can be that each vowel and the pronunciation of the consonants are recorded.

主要以单词及单词组合作为音码来编码。Mainly encode with word and word combination as phonetic sign indicating number.

将平常使用的英语单词及单词组合按其在日常生活中的使用频率排序,如is,and这类单词常用就排前面,排前面的常用单词或单词组合用短的编号表示,后面不常用的单词或单词组合用长编号表示;生成单词数据库,单词数据库里记录这些平常使用的单词及单词组合,也记录它们的编号;对单词数据库里的每个单词及单词组合可记录其发音信息数据(词形音标),要发单词或单词组合的音时,可以按其词形音标里的音标的组合情况而将音标(元音,辅音)发音拼接起来发音;然后可再按各不同单词的性质而把各单词归到不同的类中,比如run,go,jump等表示动作的词就归在一起,he,she,you等名词归在一起,然后对于不同的单词,建立它与这些类的关联信息,各单词的归类信息以及各单词与各类的关联信息都可记录在单词数据库里。Sort the commonly used English words and word combinations according to their frequency of use in daily life, such as is, and, these words are commonly used in the front, the commonly used words or word combinations in the front are represented by short numbers, and the less commonly used words in the back are represented by short numbers Word or word combination is represented with long numbering; Generate word database, record these commonly used words and word combination in word database, also record their numbering; Can record its pronunciation information data to each word in word database and word combination ( word form phonetic symbol), when wanting to send the pronunciation of word or word combination, can splicing pronunciation of phonetic symbol (vowel, consonant) pronunciation according to the combination situation of the phonetic symbol in its word form phonetic symbol; Then can press the character of each different word again And classify each word into different classes, such as run, go, jump and other words that represent actions are grouped together, and nouns such as he, she, and you are grouped together, and then for different words, establish it and these classes The related information, the classification information of each word and the related information of each word and each type can be recorded in the word database.

对单词段的编码主要采取的几种词段编码方式:There are several ways to encode word segments:

1。频率排序方式:  将语句里的每个单词及单词组合替换成其在单词数据表里的频率编号,每个频率编号前还带有此频率编号的长度信息,将语句里单词及单词组合的个数信息和这些频率编号长度信息和频率编号按顺序排组起来,即为单词段的值。1. Frequency sorting method: Replace each word and word combination in the sentence with its frequency number in the word data table, each frequency number is preceded by the length information of the frequency number, and the individual words and word combinations in the sentence The number information and these frequency number length information and frequency numbers are arranged in sequence, which is the value of the word segment.

2。频率关联混合方式:将语句里的单词及单词组合转成频率编号(阶梯编号)后,再将各频率编号按顺序排组起来并与分节信息一起组成单词段的值,这种方式利用各单词的关联性,以达到在单词段中省略频率编号的长度信息字节的目的。2. Frequency-associated hybrid method: convert the words and word combinations in the sentence into frequency numbers (ladder numbers), and then arrange the frequency numbers in order and form the value of the word segment together with the section information. This method uses each The associativity of words, to achieve the purpose of omitting the length information byte of the frequency number in the word segment.

3。关联方式:通过语句里的单词间的关联信息,对单词分层,并进行压缩编码。3. Association method: through the association information between words in the sentence, the words are layered and compressed.

4。整句差别方式:存储一些常用范句,然后将当前需编码语句与某常用范句的区别信息,作为单词段的值。4. Sentence difference method: store some commonly used model sentences, and then use the difference information between the current sentence to be encoded and a commonly used model sentence as the value of the word segment.

对单词及单词组合进行编码产生单词段的值后,还需对各单词及单词组合记录其时态词型选择信息。After encoding the words and word combinations to generate the value of the word segment, it is also necessary to record the tense word type selection information for each word and word combination.

具有对陌生单词的处理能力(包括陌生人名,地名的处理)。Have the ability to deal with unfamiliar words (including the handling of stranger names and place names).

编码生成的一个处理单位包括:头段,单词段,时态词形选择段,陌生单词段等。A processing unit generated by encoding includes: head segment, word segment, tense and word form selection segment, unfamiliar word segment, etc.

非印欧语系语种(可以是汉语或韩语或日语等)的编码处理可以是与印欧语系语种的编码处理类似的。汉韩日语里一般没有词根的概念,但对于汉语其带调音节(单字拼音)相当于英语的词根,汉语的所有不同的字发音(不同的带调音节)可归纳限制到约1300个(对于韩语,可把韩语的所有的单个音节分别当成词根;对于日语,可把日语的所有的假名分别当成词根),汉韩日语一般没有单复数,词的时态变化;除了这些区别造成一些处理不同外,总的来说,非印欧语系语种与印欧语系语种的编码处理一般是类似的。The encoding process of the non-Indo-European languages (which may be Chinese, Korean, or Japanese, etc.) may be similar to the encoding process of the Indo-European languages. There is generally no concept of root in Chinese, Korean and Japanese, but for Chinese, its toned syllable (single-character pinyin) is equivalent to the root of English. All the different pronunciations of Chinese characters (different toned syllables) can be summarized and limited to about 1300 (for For Korean, all single syllables in Korean can be regarded as roots; for Japanese, all Japanese pseudonyms can be regarded as roots respectively), Chinese, Korean and Japanese generally do not have singular and plural, and tense changes of words; except for these differences, some processing is different In addition, in general, the encoding process of non-Indo-European languages is generally similar to that of Indo-European languages.

本说明书对表示语句发音的拼合信息的值的编解码是以二进制为例的。如果使用其它进制(如八进制)或其它码制进行编解码,应是二进制的等同替换。In this specification, the encoding and decoding of the value of the combination information representing the pronunciation of a sentence is based on binary as an example. If other bases (such as octal) or other coding systems are used for encoding and decoding, they should be replaced by binary equivalents.

语音编码有声读物的整页面上的码字,可以是一维条形码或二维条形码,也可以是导电码,也可以是图形码,磁码或本说明书没提及的其它码种的码字;一本语音编码有声读物的整页面上可有一种(比如只有导电码)或多种(比如有条形码和导电码两种)码种的码字。一般情况下一个码字的值有一个表示语句发音的拼合信息的值。The codewords on the entire page of the voice-coded audiobook can be one-dimensional barcodes or two-dimensional barcodes, conductive codes, graphic codes, magnetic codes or other codewords not mentioned in this manual; The entire page of a voice-coded audio book can have code words of one type (such as only conductive code) or multiple (such as two types of bar code and conductive code). Generally, the value of a codeword has a value representing the combined information of the pronunciation of the sentence.

对于包含码值较少的码,其一个码的码值一般有一个表示短的语句发音的拼合信息的值,而如使用包含码值较多的如二维条形码如pdf417码,一个码的码值一般可以有表示一整页面课文的所有语句发音的拼合信息的值。For codes that contain fewer code values, the code value of one code generally has a value that represents the combined information of the pronunciation of a short sentence, and if a code that contains more code values such as a two-dimensional barcode such as a pdf417 code is used, a code value of one code The value may generally have a value representing the stitched information of all the sentence pronunciations of a whole page of text.

对于某一语种,是用解码规则来对表示此语种语句发音的拼合信息的值进行解码的,且可以是用某解码规则中有关此语种的解码细则来解码的。For a certain language, the decoding rule is used to decode the value of the combined information representing the pronunciation of the sentence in this language, and it can be decoded by using the decoding rules related to the language in a certain decoding rule.

对于印欧语系,一种直播方式的以只适用于英语为例的解码规则(此解码规则是与前述的直播方式的以只适用于英语为例的编码规则对应的),包括如下一些解码细则:解码语句集范围完整定义,用于解码时按直接发音方式播放语句发音的音码语音的处理的方式定义,用于解码的音码的种类选择,对音码赋编号,单词数据库建立,词类的建立,各种词段解码方式,时态词形解码处理,陌生单词解码处理,头段信息读取及用发音形式播放语句发音等For the Indo-European language family, a decoding rule that only applies to English in a live broadcast method (this decoding rule corresponds to the encoding rule that only applies to English in the live broadcast method above), including the following decoding rules : The complete definition of the scope of the decoding statement set, the definition of the processing method of the sound code speech of the pronunciation of the sentence in the direct pronunciation mode when decoding, the selection of the type of the sound code used for decoding, the numbering of the sound code, the establishment of the word database, and the part of speech The establishment of various word segment decoding methods, tense word form decoding processing, unfamiliar word decoding processing, header information reading and pronunciation playback of sentences, etc.

非印欧语系语种(可以是汉语或韩语或日语等)的解码处理可以是与印欧语系语种的解码处理类似的。The decoding process of the non-Indo-European languages (which may be Chinese, Korean, or Japanese, etc.) may be similar to the decoding process of the Indo-European languages.

对印欧语系或非印欧语系里的各语种,一般情况,编码规则是只适用于一个语种的编码规则,且与此编码规则有关此语种对应的解码规则一般是只适用于此语种的解码规则。For languages in the Indo-European or non-Indo-European languages, in general, the coding rules are only applicable to one language, and the decoding rules related to this coding rule are generally only applicable to the decoding of this language. rule.

在与拾音器有关的存储器集里存储有用于拼合信息的值解码的解码程序及解码数据,此解码程序可以是按适用于某语种的某解码规则中有关此语种的解码细则来对拾音器读码得到的语音编码有声读物的整页面上的码字涉及的此语种的拼合信息的值进行解码的(此解码规则一般是与适用于此语种的某编码规则有关此语种对应的),如拾音器是采用直接产音方式或既采用直接产音方式又采用TTS产音方式的拾音器,那在与拾音器有关的存储器集里一般还需存储有诸音码的发音数据;其读码头可以是光笔等光电式读码头以读取语音编码有声读物的整页面上的条形码或导电码,也可以是本发明人申请的另一专利里的机电式导电码读码头以读取语音编码有声读物的整页面上的导电码。The decoding program and decoding data for the value decoding of the combined information are stored in the memory set related to the pickup. This decoding program can be obtained by reading the pickup according to the decoding rules for this language in a certain decoding rule applicable to a certain language. The value of the mosaic information of this language involved in the code words on the entire page of the speech coded audiobook (this decoding rule is generally related to a certain coding rule applicable to this language and corresponds to this language), such as the pickup is adopted Direct sound production mode or pickups that adopt both direct sound production mode and TTS sound production mode, generally also need to store the pronunciation data of various sound codes in the memory set related to the pickup; its reading terminal can be a photoelectric type such as a light pen Read the dock to read the barcode or conductive code on the entire page of the voice-coded audiobook, or the electromechanical conductive code reading dock in another patent applied by the inventor to read the entire page of the voice-coded audiobook conduction code.

在解码时播出语句发音的可以是MCU或MCU外接的语音芯片或TTS器件等。It can be the MCU or the voice chip or TTS device connected to the MCU that broadcasts the pronunciation of the sentence during decoding.

本编码方式及其硬件实现使用数据压缩,及语音合成技术,达到不需换数据卡,成本低,使用方便等效果;平均每100个二进制位的码字,可以析解出至少20个英语单词的语句(大学听力难度英语)。This encoding method and its hardware use data compression and speech synthesis technology to achieve effects such as no need to change data cards, low cost, and convenient use; on average, at least 20 English words can be parsed out of every 100 binary bit codewords Sentences (University Listening Difficulty English).

附图说明:Description of drawings:

图1:使用光笔作为读码头的拾音器主体的一个实施例的电路图(此拾音器主体即拾音器)Fig. 1: use light pen as the circuit diagram of an embodiment of the pickup main body of reading dock (this pickup main body is pickup)

图2:使用TTS芯片来发音的拾音器主体的一个实施例的电路图(此拾音器主体即拾音器)Fig. 2: use TTS chip to come the circuit diagram of an embodiment of the pickup main body of pronunciation (this pickup main body is pickup)

图3:其整页面上印有导电码的语音编码有声读物的实例图Figure 3: Illustration of an example of a speech-encoded audiobook with conductive codes printed across its entire page

图4:单排码线单码段甲导电码的示例Figure 4: An example of a conductive code with a single code line and a single code segment

图5:机电式导电码读码头的结构示意图及电流流动示意图Figure 5: Schematic diagram of the structure and current flow of the electromechanical conductive code reading dock

图6:具有可读取单排码线单码段甲导电码的机电式导电码读码头的插数据卡方式拾音器的一个实施例的电路图Fig. 6: have the circuit diagram of an embodiment of the data card mode pickup of the electromechanical conductive code reading dock that can read the single code line single code segment A conductive code

图7:其整页面上印有条形码的语音编码有声读物的实例图Figure 7: Illustration of an example of a speech-encoded audiobook with a barcode printed on its entire page

图8:大机体连读码头形式的拾音器主体(机体的外型适于让语音编码有声读物放入机体里使用)外型实例和语音编码有声读物的实例Figure 8: The main body of the pickup in the form of a large-body continuous reading dock (the shape of the body is suitable for allowing speech-coded audiobooks to be put into the body for use) and an example of the appearance of the speech-coded audiobook

图9:大机体连读码头形式的拾音器主体(机体的外型适于让语音编码有声读物放在机体上使用)外型实例和语音编码有声读物的实例Figure 9: The main body of the pickup in the form of a large body continuous reading dock (the shape of the body is suitable for allowing speech-coded audiobooks to be used on the body) and an example of the appearance of the speech-coded audiobook

图10:手持式拾音器主体外型实例和页面上印有单排码线单码段甲导电码的语音编码有声读物的实例Figure 10: An example of the appearance of the main body of a handheld pickup and an example of a voice-coded audiobook with a single row of code lines and a single code segment A conductive code printed on the page

图11:小壳体连读码头形式的拾音器主体外型实例和语音编码有声读物的实例Figure 11: An example of the appearance of the main body of the pickup in the form of a small shell continuous reading dock and an example of a voice-coded audiobook

1:甲读码头1: A reading dock

3:MCU3: MCU

4:光笔4: light pen

5:按键模块5: Button module

6:外接数据存储器6: External data memory

7:导电胶条(斑马条)7: Conductive adhesive strip (zebra strip)

10:双排码线多码段甲导电码10: Double-row code line multi-code segment A conductive code

11:条形码11: barcode

12:甲码盘12: A code disc

13:甲码线区13: A code line area

14:甲感应排14: A induction row

15:甲扫描排15: A scanning row

16:甲电阻排16: A resistance row

20:单排码线单码段甲导电码20: Single row code line single code segment A conductive code

44:语音编码有声读物的整页面上的文字部分44: Speech-encoding the text portion on a full page of an audiobook

58:数据卡58: Data card

具体实施方式:Detailed ways:

本发明涉及两个方面,一是语音编码有声读物,其主体是读物,在读物的整页面上形成有诸个码字(诸个码字中的每个码字的值是共与数个表示如何用音码拼合成语句发音的拼合信息的值有关的,可称诸个码字中的各码字为拼合信息码,一般一个拼合信息码的码值有一个表示语句发音的拼合信息的值),诸个码字中的各码字的码种可以是条形码,导电码等,一般是以印刷或其它方式在作为有声读物主体的读物的整页面上形成码字的。本发明文件各处提及的语音编码有声读物,一般是把作为语音编码有声读物主体的读物的页作为码字载体。二是拾音器,拾音器用于与语音编码有声读物配合使用,使用时,由一个拾音器利用读码头读取语音编码有声读物的整页面上的诸个码字中的部诸码字中的码字,并根据读得的码字的码值使发音器件播出语句发音(拾音器需对码字涉及的拼合信息的值执行解码以使发音器件播出语句发音);拾音器可以是非插卡方式拾音器或插数据卡方式拾音器或插部件式数据卡方式拾音器。The present invention relates to two aspects, the one, the speech coding audio book, its main body is reading material, is formed with codewords (the value of each codeword in all codewords is total and several representations) on the whole page of reading material How to use phonetic sign indicating number to splice into the value of the splicing information of sentence pronunciation is relevant, can claim that each code word in all code words is splicing information code, generally the code value of a splicing information code has the value of the splicing information that represents sentence pronunciation ), the code type of each code word in the various code words can be a bar code, a conductive code, etc., and generally forms a code word on the entire page of the reading material as the main body of the audiobook by printing or other means. The speech-encoded audiobooks mentioned in the document of the present invention generally use the page of the reading material as the main body of the speech-encoded audiobook as the codeword carrier. The second is a pickup, and the pickup is used to cooperate with the audio-coded audiobook. When in use, a pickup uses the reading dock to read some of the codewords in the codewords on the entire page of the audio-coded audiobook. And make pronunciation device broadcast sentence pronunciation according to the code value of the code word that gets read (pickup needs to carry out decoding to the value of the splicing information that code word relates to so that pronunciation device broadcasts sentence pronunciation); Data card mode pickup or plug-in data card mode pickup.

本发明文件所说的拼合信息码是指其码值是共与数个表示如何用音码拼合成语句发音的拼合信息的值有关的码字。The said splicing information code of the document of the present invention refers to that its code value is the code word relevant with the value of the splicing information of how to splice into sentence pronunciation with several representations altogether.

在本发明文件的权利要求书和说明书中,各处提及的‘用音码拼合成语句发音’的含义,是默认拼合由拾音器来执行,即由拾音器用一些音码来作处理(一般是作组合)而得到语句的发音;拾音器在做拼合时一般要用音码生成语句的预备发音数据(但也可能不是生成语句的预备发音数据而是采用其它形式来做拼合,本说明书以要用音码生成语句的预备发音数据为例),再可将语句的预备发音数据转换成语句的发音;根据拾音器在对单个的拼合信息的值进行解码时播放语句发音采用的不同的发音方式,语句的预备发音数据的形式也相应不同;如解码时采用的发音方式为整步直接发音方式,在解码时,MCU就可按拼合信息的值,把各音码的发音信息数据(比如词形音标或词形拼音)按各音码的发音信息数据对应的发音在语句发音中的顺序排组起来成预备发音数据,并且可把预备发音数据中的各音码的发音信息数据按它们在预备发音数据中的排列顺序依次转换成其对应的发音(比如由主MCU以播放ADPCM格式的录音数据的方式来播音)连续播放出来,即为语句的全部发音(整步直接发音方式在本说明书里有反复叙述);如解码时采用的发音方式为分步直接发音方式,可看成是由数个部分预备发音数据组成语句的预备发音数据的,各个部分预备发音数据由音码的发音信息数据组成,在解码时,MCU每次得到一个发音接续的部分预备发音数据就可把组成此部分预备发音数据的各音码的发音信息数据转换成其对应的发音(比如由主MCU以播放ADPCM格式的录音数据的方式来播音)播放出来,依此依次分次播放完组成语句的预备发音数据的所有的部分预备发音数据对应的发音即形成语句的全部发音;如解码时采用的发音方式为整步TTS发音方式,在解码时MCU可按拼合信息的值,把各音码的拼法数据(其中也可能有音码的发音信息数据)按这些数据对应的发音在语句发音中的顺序排组起来成预备发音数据(即语句的拼法格式数据),并且可由MCU将预备发音数据传给TTS器件发音,即为语句的全部发音;如解码时采用的发音方式为分步TTS发音方式,可看成是由数个部分预备发音数据组成语句的预备发音数据的,各个部分预备发音数据由数个音码的拼法数据(也可能有音码的发音信息数据)组成,MCU每次得到一个发音接续的部分预备发音数据就可把此部分预备发音数据传给TTS器件发音,依此依次分次播放完组成语句的预备发音数据的所有的部分预备发音数据即形成语句的全部发音。(如解码时涉及的发音形式是用混合发音方式,那拾音器对不同的拼合信息的值进行解码时,可能采用整步直接发音方式,分步直接发音方式,整步TTS发音方式,分步TTS发音方式等发音方式中的不同的发音方式来发音,显然对这4种发音方式中任一种的发音处理都如本段内前面所述)In the claims and the description of the present invention document, the meaning of 'spliced into sentence pronunciation with phonetic codes' mentioned everywhere is that the default combination is performed by the pickup, that is, the pickup uses some phonetic codes for processing (generally combination) to obtain the pronunciation of the sentence; the pickup generally needs to use the sound code to generate the prepared pronunciation data of the sentence when doing the combination (but it may not be the prepared pronunciation data of the generated sentence but adopts other forms to do the combination. This specification will use Sound sign indicating number generates the prepared pronunciation data of sentence as an example), then the prepared pronunciation data of sentence can be converted into the pronunciation of sentence; The form of the prepared pronunciation data of each phonetic sign indicating number is also correspondingly different; As the pronunciation mode that adopts when decoding is the whole step direct pronunciation mode, when decoding, MCU just can press the value of putting together information, the pronunciation information data of each phonetic sign indicating number (such as word form phonetic symbol Or word form pinyin) according to the pronunciation corresponding to the pronunciation information data of each phonetic sign indicating number in the sentence pronunciation, arrange and arrange into the preliminary pronunciation data, and the pronunciation information data of each phonetic code in the preliminary pronunciation data can be pronounced according to them The arrangement order in the data is converted into its corresponding pronunciation in sequence (for example, the main MCU plays the audio in the form of recording data in ADPCM format) and is played continuously, that is, the entire pronunciation of the sentence (the whole-step direct pronunciation method is described in this manual. repeatedly narrate); as the pronunciation mode that adopts during decoding is the step-by-step direct pronunciation mode, can be regarded as the preparation pronunciation data that is made up of sentence by several parts preparation pronunciation data, and each part preparation pronunciation data is made up of the pronunciation information data of sound sign indicating number , when decoding, the MCU can convert the pronunciation information data of each phonetic code that forms this part of the prepared pronunciation data into its corresponding pronunciation (such as by the main MCU to play the ADPCM format by the main MCU at every turn when obtaining a pronunciation continuation part of the prepared pronunciation data) The mode of recording data comes broadcasting) to play out, and the corresponding pronunciation of all parts of the prepared pronunciation data that forms the sentence is played successively in turn, and the pronunciation corresponding to the prepared pronunciation data forms the whole pronunciation of the sentence; In the TTS pronunciation mode, the MCU can arrange the spelling data of each phonetic code (the pronunciation information data of the phonetic code) according to the pronunciation corresponding to these data in the order of the sentence pronunciation according to the value of the combined information when decoding. into the prepared pronunciation data (that is, the spelling format data of the sentence), and the MCU can transmit the prepared pronunciation data to the TTS device for pronunciation, that is, the entire pronunciation of the sentence; if the pronunciation method used in decoding is the step-by-step TTS pronunciation method, see The prepared pronunciation data of a sentence is composed of several parts of the prepared pronunciation data, and each part of the prepared pronunciation data is composed of spelling data of several phonetic codes (there may also be pronunciation information data of the phonetic code), and the MCU gets a pronunciation every time Continuing part of the preparation pronunciation data just can pass this part preparation pronunciation data to TTS device pronunciation, plays all parts preparation pronunciation data of the preparation pronunciation data that forms sentence successively accordingly and promptly forms the whole pronunciation of sentence. (As the pronunciation form involved in decoding is mixed pronunciation mode, when the pickup decodes the value of different combination information, it may adopt the whole-step direct pronunciation mode, the step-by-step direct pronunciation mode, the whole-step TTS pronunciation mode, and the step-by-step TTS Pronunciation mode and other pronunciation modes to pronounce, obviously, the pronunciation processing of any one of these 4 pronunciation modes is as described earlier in this paragraph)

对于上段述的4种发音方式中的任一种发音方式下的预备发音数据,都可能有更多的形式(比如发音方式是整步直接发音方式时,可把各音码的发音信息数据按某种约定的顺序排组起来成预备发音数据);对于上段述的4种发音方式中的任一种发音方式,在此种发音方式下,其它形式的预备发音数据应是上段述的此种发音方式下的预备发音数据形式的等同替换。(本发明文件引入‘预备发音数据’概念仅是为了叙述的方便,本发明文件各处提及的其前面没用‘部分’来修饰的‘预备发音数据’就是指一个语句的预备发音数据)For the preparation pronunciation data under any kind of pronunciation mode in the above-mentioned 4 kinds of pronunciation modes, there may be more forms (such as when the pronunciation mode is the direct pronunciation mode of the whole step, the pronunciation information data of each phonetic sign indicating number can be pressed Arranged in an agreed order to form the preparatory pronunciation data); for any one of the 4 pronunciation methods mentioned in the previous paragraph, in this pronunciation method, other forms of preparatory pronunciation data should be the above paragraph. An equivalent replacement for the prepared pronunciation data form in the pronunciation mode. (the document of the present invention introduces the concept of 'prepared pronunciation data' only for the convenience of narration, and the 'prepared pronunciation data' that is not modified by 'part' in front of it mentioned in the document of the present invention refers to the prepared pronunciation data of a sentence)

拼合信息的值由编码规则来编码生成,并且拼合信息的值可被拾音器解码播出语句发音;拼合信息的值的格式,可以是本说明书反复提及的连续二进制格式(即一连串的二进制BIT都是属于拼合信息的值,而没有穿插其它的信息的BIT)。(本发明文件各处所称的‘拼合信息的值’和‘表示语句发音的拼合信息的值’和‘表示如何用音码拼合成语句发音的拼合信息的值’是相同所指,这里说的语句是指某一语句)。本说明书后面在述及印欧语系语种或非印欧语系语种的编解码规则时,各处是以一个拼合信息的值就是一个处理单位的值为例的,当然一个拼合信息的值还可以是其它形式。本发明文件各处所说的用音码拼合语句发音,涉及到音码的定义,拼合信息的值的编码及解码处理,以下所述的有关音码的定义和拼合信息的值的编码方案及解码方案是用音码拼合语句发音技术的实施方案。The value of the combined information is coded and generated by the encoding rules, and the value of the combined information can be decoded by the pickup to broadcast the pronunciation of the sentence; the format of the value of the combined information can be the continuous binary format repeatedly mentioned in this manual (that is, a series of binary BITs are all It is a value that belongs to the merged information, and there is no BIT interspersed with other information). (" the value of the combined information " and " the value of the combined information representing the pronunciation of the sentence" and " the value of the combined information of how to spell the pronunciation of the sentence expressed " in the documents of the present invention are the same reference, said here statement refers to a statement). When referring to the encoding and decoding rules of Indo-European languages or non-Indo-European languages later in this specification, the value of a combined information is the value of a processing unit as an example everywhere. Of course, the value of a combined information can also be other forms. Said everywhere in the documents of the present invention, spell sentence pronunciation with phonetic sign indicating number, relate to the definition of sound sign indicating number, the coding of the value of putting together information and decoding process, the definition of relevant sound sign indicating number and the encoding scheme and the decoding of the value of putting together information below The scheme is an implementation scheme of spelling sentence pronunciation technology with phonetic codes.

本发明文件涉及的语句可以属于各语种。对各语种的拼合信息的值的编解码处理的实质是基本相同的,但会涉及到一些语言特性的差别,本发明文件对世界各语种中可能用于本发明的语音编码有声读物及拾音器的语种定义了印欧语系和非印欧语系,并分别阐述印欧语系语种的相关内容和非印欧语系语种的相关内容(这里说的相关内容是指表示语种的语句的发音的拼合信息的值的编解码处理)。The sentences involved in the document of the present invention may belong to various languages. The essence of the encoding and decoding processing of the value of the combined information in each language is basically the same, but it will involve some differences in language characteristics. Language defines the Indo-European language family and non-Indo-European language family, and respectively expounds the relevant content of the Indo-European language family and the non-Indo-European language family (the relevant content mentioned here refers to the value of the combined information of the pronunciation of the statement representing the language type codec processing).

本发明文件对世界范围内语系的划分采用常规的划法,本发明文件各处所称的印欧语系,包括欧洲,印度,西亚各国的语种,比如英语,法语,德语,俄语,意大利语,西班牙语,希腊语,印度语等都属于本发明文件所说的印欧语系,印欧语系里的各语种的语法风格发音规则比较类似,词有词形变化。The document of the present invention adopts a conventional method for dividing the language families in the world. The Indo-European language family referred to in the document of the present invention includes the languages of Europe, India, and West Asian countries, such as English, French, German, Russian, Italian, and Spanish. Chinese, Greek, Indian language etc. all belong to the said Indo-European language family of the present invention document, and the grammatical style pronunciation rule of each language in the Indo-European language family is more similar, and word has morphological changes.

本发明文件各处所称的非印欧语系包括除了印欧语系语种外的世界其它地区的其语法风格发音规则类似于汉藏语系语种的所有语种,主要包括汉藏,南亚,南岛等语系的语种,也包括阿尔泰语系的语种,以及非洲,美洲的语系的语种等。汉语属于汉藏语系,韩语,日语属于阿尔泰语系,这三个语种的语法风格发音规则比较类似,名词无单复数变化,词一般没有词形变化(或可处理成不需考虑词形变化),汉语,韩语,日语都属于本发明文件所说的非印欧语系。对于印欧语系外的其它语种,如其语法风格发音规则与汉藏语系的语种显著不同,则应视为不属于本发明所称的非印欧语系,这是显然的。The non-Indo-European languages referred to in the document of the present invention include all languages whose grammatical styles and pronunciation rules are similar to the Sino-Tibetan languages in other parts of the world except the Indo-European languages, mainly including Sino-Tibetan, South Asia, Austronesia and other language families languages, including Altaic languages, and African and American languages. Chinese belongs to the Sino-Tibetan language family, Korean, and Japanese belong to the Altaic language family. The grammatical style and pronunciation rules of these three languages are relatively similar. Nouns have no singular and plural changes, and words generally have no inflection (or it can be processed without considering the inflection) , Chinese, Korean, Japanese all belong to the said non-Indo-European language family of the present invention document. For other languages outside the Indo-European language family, if its grammatical style and pronunciation rules are significantly different from those of the Sino-Tibetan language family, it should be regarded as not belonging to the non-Indo-European language family referred to in the present invention, which is obvious.

本来韩语,日语有些时态变化是在动词后面附不同音节(韩语是附音节,音节类似于汉语的字,日语一般是附假名),有点类似于英语的词形变化即词后缀变化,但韩语或日语这动词后面附的音节可以当成平常音节(字)或词处理且这样一来就可把此音节前面的动词当成没有时态变化的词了(对于其它词性的词比如日语的形容词,也有类似动词这样的在词后附不同音节来作变化,这也可象动词这样处理成不需考虑词形变化),并且韩语,日语的语法风格发音规则更接近于汉藏语系(比如日语里有些汉字的发音就是模仿汉语里对应的汉字的发音的),所以本发明文件将韩语,日语归入非印欧语系。Originally, some tense changes in Korean and Japanese are to attach different syllables after verbs (Korean has syllables, syllables are similar to Chinese characters, and Japanese generally have kana), which is somewhat similar to English inflections, that is, word suffix changes, but Korean Or the syllable attached after this verb in Japanese can be treated as ordinary syllables (words) or words and in this way, the verb before the syllable can be regarded as a word without tense change (for other words of speech such as Japanese adjectives, there are also Similar to verbs, different syllables are attached to the words to make changes, which can also be treated like verbs without considering the inflection), and the grammatical style and pronunciation rules of Korean and Japanese are closer to those of the Sino-Tibetan language family (such as Japanese The pronunciation of some Chinese characters imitates the pronunciation of corresponding Chinese characters in Chinese), so the document of the present invention puts Korean and Japanese into the non-Indo-European language family.

先述有关编码的技术说明,解码及拾音器的说明在后面。The technical description of the encoding is described first, and the description of the decoding and pickup is followed.

本发明文件所使用的语句概念,一个语句可以是一个完整句子,或者是一个非完整句子(即一个完整句子的部分,比如可以是完整句子的前半部分或后半部分),或者是含有多个完整句子,或者是含有多个非完整句子等情况;本发明文件各处涉及的句子的概念定义是与常识定义相同的,一个完整的句子(完整句子)一般有一个主谓结构,还可能有宾语及作修饰用的定补状结构;如提及时无特别指明有他义,本发明文件的说明书和权利要求书中各处提及的‘句子’默认是指完整句子。语句的难度可从幼儿园,小学,中学直至大学程度。本说明书后面在阐述印欧语系语种或非印欧语系语种可使用的编解码规则举例时,各处是以定义一个处理单位对应一个语句为例的,即以定义表示语句发音的拼合信息的值就是一个处理单位的值为例的(这里的语句是指一个语句;本说明书后面在阐述印欧语系语种或非印欧语系语种可使用的编解码规则时,对于某一语句,各处提及的‘表示语句发音的拼合信息的值’与‘拼合信息的值’与‘语句的编码值’与‘语句编码值’是相同所指;本说明书各处提及的‘一个处理单位对应一个语句’的意思显然是指把对语句进行编码得到的数据作为一个处理单位的值,对于本说明书各处的与此类似的表达比如‘一个处理单位对应一个句子’等,显然也应按这样类推理解)。本说明书的编解码举例中的语句一般是含有一个中间无逗号的句子。(本段说的处理单位的具体定义见后面,如使用本说明书所述及的编解码规则举例之外的编解码规则,那处理单位的格式可不同,甚至可把表示语句发音的拼合信息的值不称为处理单位)The sentence concept used in the documents of the present invention, a sentence can be a complete sentence, or an incomplete sentence (that is, a part of a complete sentence, such as the first half or the second half of a complete sentence), or contain multiple Complete sentences, or situations such as containing multiple non-complete sentences; Objects and fixed-complement structures used for modification; if there is no special meaning when mentioned, the 'sentence' mentioned in the specification and claims of the present invention document refers to a complete sentence by default. The difficulty of sentences can range from kindergarten, elementary school, middle school to university level. When explaining the examples of codec rules that can be used by Indo-European languages or non-Indo-European languages later in this manual, the examples everywhere are to define a processing unit corresponding to a sentence, that is, to define the value of the combined information that represents the pronunciation of a sentence It is the value of a processing unit as an example (the sentence here refers to a sentence; when explaining the encoding and decoding rules available for Indo-European languages or non-Indo-European languages later in this manual, for a certain sentence, it is mentioned everywhere "The value of the combined information representing the pronunciation of the sentence" and "the value of the combined information" are the same as the "coded value of the sentence" and the "coded value of the sentence"; The meaning of 'obviously refers to the value of the data obtained by encoding the sentence as a processing unit. For similar expressions in this specification such as 'a processing unit corresponds to a sentence', etc., it should be understood by analogy ). The sentences in the encoding and decoding examples in this manual generally contain a sentence without a comma in the middle. (See later for the specific definition of the processing unit mentioned in this paragraph. If a codec rule other than the codec rule mentioned in this specification is used, the format of the processing unit can be different, and even the combined information representing the pronunciation of the sentence can be used. value is not called a processing unit)

本发明文件各处所提及的句子,除可以是中间无逗号的句子外,也可以是中间有逗号的句子,由逗号将句子分隔成多个分句或主从句,本段以上所称的句子本身是指完整句子,如取其部分而成非完整句子后,显然非完整句子的中间仍是可无逗号或可有逗号的。The sentences mentioned in the documents of the present invention can be sentences without commas in the middle, or sentences with commas in the middle, and the sentences are separated into multiple clauses or main clauses by commas. The term referred to above in this paragraph The sentence itself refers to a complete sentence. If a part of it is taken to form an incomplete sentence, it is obvious that there can be no comma or a comma in the middle of the incomplete sentence.

有关本发明文件各处所说的语句:对于一本要作为有声读物主体的读物,读物内容的各语句的划分(比如一个语句包括哪些完整句子或非完整句子)一般是根据拼合信息的值的编码处理的方便或读物内容的需要来划分的,比如读物的一页面上印有两个紧挨的英语的完整句子,如需这两个句子分别发音,那就可把这两个句子分别定义成两个语句,分别编码生成两个拼合信息的值,然后可用两个码字,每个码字放一个拼合信息的值,可让读物的此页面上印刷有这两个码字;而如需一次发这两个句子的音,那可把这两个句子定义成一个语句,编码生成一个表示此语句发音的拼合信息的值,然后可用一个码字,在此码字里放这个拼合信息的值(或把此拼合信息的值分散放在多个码字里),可让读物的此页面上印刷有这个(或多个)码字;如需一次发这两个句子的音,那还可把这两个句子分别定义成两个语句,分别编码生成两个拼合信息的值,然后可用一个码字,在此码字里放这两个拼合信息的值,可让读物的此页面上印刷有这个码字。Regarding the sentences mentioned in the documents of the present invention: for a reading to be used as the main body of the audiobook, the division of each sentence of the content of the reading (such as which complete sentences or incomplete sentences a sentence includes) is generally based on the encoding of the value of the combined information It can be divided according to the convenience of processing or the needs of the content of the reading material. For example, there are two complete sentences in English printed next to each other on one page of the reading material. If these two sentences need to be pronounced separately, then these two sentences can be defined as Two sentences are coded to generate two values of combined information respectively, and then two code words can be used, and each code word can be used to put a value of combined information, so that these two code words can be printed on this page of the reading material; and if required If the sounds of these two sentences are pronounced at once, then these two sentences can be defined as a sentence, and the encoding generates a value representing the combined information of the pronunciation of the sentence, and then a code word can be used to put the combined information in this code word Value (or the value of this splicing information is dispersed in a plurality of codewords), can allow this (or a plurality of) codewords to be printed on this page of reading material; These two sentences can be defined as two sentences respectively, and the values of the two combined information can be generated by encoding respectively, and then a code word can be used to put the values of the two combined information in this code word, which can make the page of the reading material This code word is printed.

如提及时无特别指明有他义,本发明文件的说明书和权利要求书中各处提及的‘语句’默认是指一个语句(或单个语句)。如提及时无特别指明有他义,本说明书各处所提及的语句,其含有的各句体(本发明文件各处提及的一个句体可以是一个完整句子或一个非完整句子)默认都是同一非复合语种的(即默认此语句是非复合语种的语句),如提及时无特别指明有他义,本说明书各处所提及的拼合信息的值涉及的语句,默认其含有的各句体都是同一非复合语种的(即默认此语句是非复合语种的语句)。If no other meaning is specified when mentioned, the 'statement' mentioned everywhere in the specification and claims of the present invention document refers to a sentence (or a single sentence) by default. If there is no special indication of other meanings when mentioned, the sentences mentioned in this specification, and the sentences contained in it (a sentence mentioned in the document of the present invention can be a complete sentence or an incomplete sentence) are defaulted. are all in the same non-compound language (that is, the default statement is a statement in a non-compound language), if there is no special meaning when it is mentioned, the statements involved in the value of the combined information mentioned throughout this manual will default to the various sentences contained in it. The sentence bodies are all in the same non-compound language (that is, the default statement is a statement in a non-compound language).

在本发明文件中的各处提及的‘多个’中的‘多’显然是指等于或大于2的一个数值。在本发明文件中的各处提及的‘多种’中的‘多’显然也是指等于或大于2的一个数值。如提及时无特别指明有他义,在本发明文件中的各处提及的作为数词的‘几’默认是指大于一且小于十的数值中的一个数值,这是常识。在本发明文件中的各处提及的‘数个’中的‘数’是指等于或大于1的一个数值。在本发明文件中的各处提及的‘数种’中的‘数’是指等于或大于1的一个数值。在本发明文件中其它各处提到的作为数词的‘多’显然是指等于或大于2的一个数值。在本发明文件中的其它各处提及的作为数词的‘数’是指等于或大于1的一个数值。The 'multiple' mentioned in various places in the present document obviously refers to a numerical value equal to or greater than 2. The 'many' in 'multiple' mentioned in various places in the present document obviously also refers to a numerical value equal to or greater than 2. If there is no other meaning when it is mentioned, the 'several' mentioned as a numeral in various places in the present invention documents defaults to one of the values greater than one and less than ten, which is common sense. The 'number' in 'several' mentioned in various places in this document refers to a numerical value equal to or greater than 1. The 'number' in 'several' mentioned in various places in the present document refers to a numerical value equal to or greater than 1. References to 'many' as a numeral elsewhere in this patent document clearly refer to a numerical value equal to or greater than 2. References to 'number' as a numeral elsewhere in this patent document mean a numerical value equal to or greater than one.

本说明书涉及的印欧语系语种的拼合信息的值的编解码及非印欧语系语种的拼合信息的值的编解码是以二进制码为例的,相应的拾音器的与解码有关的处理(包括直播方式和TTS方式的)也是以二进制码为例的。如果使用其它进制(如八进制)或其它码制进行编解码,应是二进制的等同替换。The coding and decoding of the values of the combined information of the Indo-European languages and the coding and decoding of the values of the combined information of the non-Indo-European languages involved in this manual are based on the binary code as an example, and the corresponding decoding-related processing of the pickup (including live broadcasting) mode and TTS mode) also take binary code as an example. If other bases (such as octal) or other coding systems are used for encoding and decoding, they should be replaced by binary equivalents.

因为不方便打音标(考虑到电子档文件的兼容问题,因本申请是电子申请),所以本说明书有个别涉及音标的地方是用英语字母模拟表示音标的。Because it is inconvenient to type phonetic symbols (considering the compatibility of electronic files, because this application is an electronic application), some places in this manual that involve phonetic symbols use English letters to simulate phonetic symbols.

本发明文件在阐述英语的拼合信息的值的直播方式的编解码处理时有如下约定:如提及时无特别指明有他义,本发明文件各处提及的‘音标’默认包括国际音标(包括48个元音和辅音音标)以及重读和次重读的符号(这里的重读和次重读是指针对单词内音节的);由于国际音标的符号特殊,在单片机系统中(比如存储器里)无法直接表示,所以可对这些各音标顺序编号成音标编号(重读和次重读符号也与国际音标一起编音标编号),即用音标编号来表示音标(音标编号一般可为6BIT长),音标编号可直接使用在单片机系统中;一个单词(或其变化词形)的发音可看成是由一个或多个音标编号组成的,并把用于组成一个单词(或其变化词形)的发音的一个或多个音标编号的组合(只含一个音标的单词发音也可看成是由一个音标编号组合成的)称为词形音标,并且也把用于组成一个单词组合(或其变化词形)的发音的多个音标编号的组合也称为词形音标。在阐述英语的拼合信息的值的TTS方式的编解码处理时,提及的‘音标’仍是包括国际音标以及重读和次重读的符号,TTS方式编解码处理时,是用音标字符来表示音标(包括国际音标,重读和次重读的符号)的,此音标字符应是TTS器件能识别的表示音标的字符(一般是ASCII字符),TTS方式编解码处理时的一个单词(或其变化词形)的词形音标由一个或多个音标字符组成,TTS方式编解码处理时的一个单词组合(或其变化词形)的词形音标由多个音标字符组成(此句的‘多个’与上一句的‘多个’不是相同所指)。The document of the present invention has the following agreement when explaining the encoding and decoding process of the live mode of the value of the combined information in English: if there is no special indication when referring to other meanings, the 'phonetic symbols' mentioned in the documents of the present invention include the International Phonetic Alphabet (including 48 vowel and consonant phonetic symbols) and symbols of stress and sub-stress (the stress and sub-stress here refer to syllables within a word); due to the special symbols of the International Phonetic Alphabet, they cannot be directly represented in a single-chip system (such as in a memory) , so these phonetic symbols can be sequentially numbered into phonetic symbols (accented and sub-accented symbols are also coded with the international phonetic symbols). In the single-chip microcomputer system; the pronunciation of a word (or its variant form) can be regarded as being composed of one or more phonetic symbols, and one or more phonetic symbols used to form the pronunciation of a word (or its variant form) The combination of two phonetic symbols (the pronunciation of a word containing only one phonetic symbol can also be regarded as being composed of a phonetic symbol number) is called a phonetic symbol, and the pronunciation used to form a word combination (or its change form) is also used. The combination of a plurality of phonetic symbols is also called word-form phonetic symbols. When explaining the TTS encoding and decoding process of the value of English splicing information, the "phonetic symbols" mentioned still include the international phonetic symbols and symbols of stress and sub-stress. (including international phonetic symbols, accented and sub-stressed symbols), this phonetic character should be a character (generally an ASCII character) that can be recognized by the TTS device to represent the phonetic symbol, a word (or its change form) when TTS mode codec processing ) is made up of one or more phonetic symbols, and the morphological phonetic symbols of a word combination (or its variation word form) during TTS mode encoding and decoding processing is made up of a plurality of phonetic symbols ('multiple' of this sentence and The 'plurality' in the previous sentence does not refer to the same).

本发明文件在阐述汉语的拼合信息的值的直播方式的编解码处理时有如下约定:如提及时无特别指明有他义,本发明文件各处提及的‘拼音’(这里说的‘拼音’,其前面是没加‘词形’两字的)默认就是指平常意义的带调音节(也被称为单字拼音),一个拼音是由一个或多个拼音字符来组成表示的(只含一个拼音字符的拼音可看成是由一个拼音字符组成的;拼音字符使用ASCII格式字符,包括A-Z的字母和1-5这五个表示声调的数字;而如遇到‘绿’这样其拼音字符比较特殊的字,其拼音就可表示成LV2,这是按常识惯例的;另,除了可用ASCII格式字符作为拼音字符外,显然也可选用别的格式种类的字符作为拼音字符),表示一个拼音的这些拼音字符的组成情况也被称为拼音字符拼法(比如‘等’字,其拼音表示成‘DENG3’,‘DENG3’也是‘等’字拼音的拼音字符拼法,其中‘D’,‘E’,‘N’,‘G’,‘3’是5个ASCII字符,也是5个拼音字符,其中的‘3’是表示声调);一个词的发音可看成是由一个或多个拼音组成的,并把用于组成一个词的发音的一个或多个拼音的组合(只含一个单字拼音的词的发音也可看成是由一个拼音组合成的)称为词形拼音,并且也把用于组成一个词组合的发音的多个拼音的组合也称为词形拼音(此句的‘多个’与上一句的‘多个’不是相同所指)。在阐述汉语的拼合信息的值的TTS方式的编解码处理时,提及的‘拼音’仍是指平常意义的带调音节,TTS方式编解码处理时,是用拼音字符来组成表示拼音的,表示一个拼音的这些拼音字符的组成情况也被称为拼音字符拼法,此拼音字符应是TTS器件能识别的表示拼音的字符(一般是ASCII字符,拼音字符里一般仍有表示拼音声调的字符),TTS方式编解码处理时的一个词或词组合的词形拼音由一个或多个拼音组成。(本段有提及‘词形拼音由一个或多个拼音组成’,而拼音又是由拼音字符组成表示的,所以词形拼音也是由拼音字符组成表示的)The document of the present invention has the following agreement when explaining the codec processing of the live mode of the value of the combined information of Chinese: if there is no special indication when referring to other meanings, the 'Pinyin' mentioned in the document of the present invention (the 'Pinyin' mentioned here ', the word "form" is not added in front of it) the default is to refer to the tone syllable in the usual sense (also known as single-character pinyin), and a pinyin is represented by one or more pinyin characters (only contains The pinyin of a pinyin character can be regarded as composed of one pinyin character; the pinyin character uses ASCII format characters, including the letters of A-Z and the five numbers representing the tone of 1-5; and the pinyin characters such as 'green' For special characters, their pinyin can be represented as LV2, which is based on common sense conventions; in addition, in addition to using ASCII format characters as pinyin characters, obviously, characters in other formats can also be used as pinyin characters), to represent a pinyin The composition of these pinyin characters is also called pinyin spelling (such as 'etc.' word, its pinyin is represented as ' DENG3 ', and ' DENG3 ' is also the pinyin spelling of ' wait ' word pinyin, wherein ' D ', 'E', 'N', 'G', '3' are 5 ASCII characters, and also 5 pinyin characters, where '3' represents tone); the pronunciation of a word can be regarded as composed of one or more Composed of pinyin, and the combination of one or more pinyin used to form the pronunciation of a word (the pronunciation of a word containing only one single-character pinyin can also be regarded as being composed of one pinyin) is called morphological pinyin, and The combination of multiple pinyins used to form the pronunciation of a word combination is also referred to as morphological pinyin (' multiple' in this sentence is not the same reference as ' multiple' in the previous sentence). When explaining the encoding and decoding process of the TTS mode of the value of Chinese splicing information, the "Pinyin" mentioned still refers to the toned syllable in the ordinary meaning. When encoding and decoding in the TTS mode, Pinyin characters are used to represent Pinyin. The composition of these pinyin characters representing a pinyin is also called the spelling of pinyin characters. This pinyin character should be a character representing pinyin that can be recognized by the TTS device (usually ASCII characters, and there are generally characters representing pinyin tones in pinyin characters ), the morphological pinyin of a word or word combination during TTS mode codec processing is made up of one or more pinyin. (It is mentioned in this paragraph that 'word form pinyin is composed of one or more pinyin characters', and pinyin is represented by pinyin characters, so word form pinyin is also represented by pinyin characters)

以下在叙述印欧语系语种或非印欧语系语种的编码规则时提及到的整步直接发音方式的概念和分步直接发音方式的概念,在后面的解码规则阐述时有述。The concepts of the full-step direct pronunciation method and the step-by-step direct pronunciation method mentioned below when describing the encoding rules of the Indo-European languages or non-Indo-European languages will be described in the following decoding rules.

先述以英语为代表的印欧语系语种的编码处理:The encoding processing of Indo-European languages represented by English is described first:

以下叙述印欧语系语种的编码处理时,涉及到举例时是以英语为例的。When describing the encoding processing of Indo-European languages in the following, English is used as an example when it comes to examples.

音码的定义:Definition of sound code:

这里音码是按照印欧语系语种的发音规则来划分的。对于印欧语系任一语种,可被当成音码的,可以是表示发音的元辅音或元音辅音组合,也可以是可以转换成发音的词根,单词,单词组合或范句,(这里的‘表示发音’和‘转换成发音’所指的发音,也是在语句发音中音码所对应的发音,这在本发明文件里有反复叙述),既可以由元辅音,元音辅音组合和单词中的任一种音码来拼合语句集发音,也可以由元辅音,元音辅音组合,词根,单词和单词组合中的多种音码来拼合语句集发音,也可以由元辅音,元音辅音组合和单词中的任一种音码与范句音码来拼合语句集发音,也可以由元辅音,元音辅音组合,词根,单词和单词组合中的多种音码与范句音码来拼合语句集发音。(为了叙述简洁,本发明文件中各处提及的印欧语系语种的词根概念,既包括平常意义的词根,也包括前缀和后缀;并且本发明文件把平常意义的词根称为词基;比如英语,平常意义的词根即词基有sit,fuse等,前缀有dis,un等,后缀有able,er等,这些都归入本发明文件的词根概念。另注:本段提的‘元辅音,元音辅音组合,词根,单词和单词组合中的多种音码’,是指这里提及的5种音码中的任意2种或任意3种或任意4种或全部5种音码,即可以是两种种数的任意音码组合情况或是两种以上且种数不大于最大种数的任意种数的任意音码组合情况,最大种数是指比如前面说的5种音码中的最大种数就是5种,这也是常识;本发明文件其它地方述及印欧语系某语种用音码来拼合语句集发音时的其它与此类似的提法涉及音码种数的问题也按这样类推理解。)Here the phonetic codes are divided according to the pronunciation rules of the Indo-European languages. For any language of the Indo-European language family, what can be regarded as a phonetic code can be a vowel and consonant or a combination of vowels and consonants representing pronunciation, or a root, word, word combination or model sentence that can be converted into pronunciation, (here' Express pronunciation ' and ' be converted into the pronunciation indicated by pronunciation ', also be the pronunciation corresponding to the sound sign indicating number in the sentence pronunciation, this has repeatedly stated in the document of the present invention), both can be by vowel consonant, vowel consonant combination and word Any kind of phonetic sign indicating number to spell out the sentence set pronunciation, also can be by vowel-consonant, vowel-consonant combination, word root, word and the multiple phonetic codes in word combination to come together sentence-set pronunciation, also can be by vowel-consonant, vowel-consonant Combination and any kind of phonetic sign indicating number and model sentence phonetic sign indicating number in combination and word come together sentence set pronunciation, also can come by vowel and consonant, vowel and consonant combination, word root, multiple phonetic sign indicating number and model sentence sound sign indicating number in word and word combination Flatten the pronunciation of the sentence set. (for succinct description, the root concept of the Indo-European languages mentioned everywhere in the document of the present invention, both includes the root of common sense, also comprises prefix and suffix; And the root of common sense is called word base in the document of the present invention; English, the root of common meaning is that word base has sit, fuse etc., and prefix has dis, un etc., and suffix has able, er etc., and these all come into the root concept of the present invention's document.Another note: the ' vowel consonant of this paragraph mentions , the combination of vowels and consonants, the root, the multiple phonetic codes in words and word combinations', refers to any 2 or any 3 or any 4 or all 5 phonetic codes in the 5 kinds of phonetic codes mentioned here, Promptly can be the arbitrary phonetic code combination situation of two kinds of numbers or more than two kinds and the arbitrary phonetic code combination situation of the arbitrary number of kind number that is not greater than maximum kind number, and maximum kind number refers to such as in the 5 kinds of phonetic codes that say above The maximum number of kinds is exactly 5 kinds, and this is also common sense; Other places of the document of the present invention mention that a certain language of the Indo-European language family uses phonetic codes to assemble the pronunciation of the sentence collection, and other similar formulations relate to the problem of the number of phonetic codes according to understand by analogy.)

对于印欧语系任一语种,除了用以上一些音码的使用组合情况(用一种或多种音码)来拼合语句集发音外,实际上根据不同的需要,还可能使用一些别的音码使用组合情况来作拼合,比如对于英语(印欧语系别的语种也可),有时要编码的语句涉及的单词很有限,就可能只使用一些词根就可单独完成语句发音的拼合(即只由词根这一种音码来拼合语句集发音)。具体的,可以由词根和单词组合中的任一种音码来拼合语句集发音,也可以由词根和单词组合中的任一种音码与范句音码来拼合语句集发音;这些别的音码使用组合情况可视为上段述的那些音码使用组合情况的等同替换。For any language of the Indo-European language family, in addition to using the combination of the above phonetic codes (using one or more phonetic codes) to assemble the pronunciation of the sentence set, in fact, according to different needs, some other phonetic codes may also be used Use combination situation to do splicing, such as for English (other languages of the Indo-European language family are also available), sometimes the words involved in the sentences to be encoded are very limited, and it is possible to only use some roots to complete the splicing of the pronunciation of the sentence (that is, only by This a kind of phonetic sign indicating number of word root is put together the statement collection pronunciation). Concrete, can put together the sentence collection pronunciation by any kind of phonetic code in the root and word combination, also can put together the sentence collection pronunciation by any kind of phonetic code and model sentence phonetic code in the root and word combination; These other The use combination of phonetic codes can be regarded as the equal replacement of those phonetic codes mentioned in the previous paragraph.

前面两段有述及的印欧语系任一语种的由一种或多种音码来拼合语句集发音,实际是指先把这一种或多种音码看成一个音码集合(这句话这么理解:如是把一种音码当成一个音码集合,如用来拼合的这种音码有用N个音码,那音码集合里就只包括这N个音码,比如把英语的元辅音音码当成一个音码集合,用来拼合的元辅音音码约有用几十个,那这个音码集合就包括这几十个元辅音音码;如是把多种音码当成一个音码集合,那就是把这多种音码都合起来成一个音码集合,比如多种音码包括两种音码,用来拼合的第一种音码有用N个音码,用来拼合的第二种音码有用M个音码,那合起来的音码集合就包括这N个第一种音码和这M个第二种音码,比如多种音码包括两种音码,用来拼合的第一种音码英语的元辅音音码约有用几十个,用来拼合的第二种音码英语的词根音码约有用几千个,那合起来的音码集合就包括这几十个元辅音音码和这几千个词根音码;用于语句集拼合的某种音码有用多少个音码及用哪些音码是根据需要来定的,元辅音音码数量很少所以常规语言领域本来有多少个元辅音音码那拼合一般就有用多少个,象单词,词根这种在常规语言领域数量很大的音码,那一般是根据需要选一些来用于语句集拼合,比如一般难度的英语拼合用的词根约有用1两千个,而且根据不同需要可能会选一些不同音码),然后把一个应用范围内(比如教给幼儿班小孩的所有此语种语句)的此语种的所有语句当成一个语句集,此语句集中的任一个语句,都可由此音码集合里的部分音码来拼合成此语句发音,并且此音码集合里的任一个音码可用于此语句集里不同语句的发音的拼合;在本发明文件的权利要求书中述的以及本说明书后面述的印欧语系任一语种的有关‘由一种或多种音码来拼合语句集发音’的提法以及类似提法的意思也如本段前面述;本发明文件述及印欧语系时各处所说的把一个应用范围内的某语种的所有语句当成一个语句集,其中相同的语句是只留一个的,即语句集里的语句互不相同;本段所述是显然的。(本段说的‘应用范围’可以是有限的,比如把教给幼儿班小孩的所有英语语句定义为一个应用范围,相应的把此应用范围内的所有英语语句当成一个语句集,此语句集包括的语句个数显然一般是有限多个;本段说的应用范围还可以是无限的,比如定义一个应用范围,任意一种英语读物都属于此应用范围,相应的把此应用范围内的英语的所有语句当成一个语句集,此语句集包括的语句数量很大可当成语句个数是无限的)The preceding two paragraphs have mentioned that any language of the Indo-European language family is composed of one or more phonetic codes to assemble the pronunciation of the sentence set, which actually means that this one or more phonetic codes are regarded as a phonetic code set (this sentence Understand like this: a kind of phonetic sign indicating number is regarded as a phonetic sign indicating number set like this, useful N phonetic sign indicating number as this phonetic sign indicating number that is used for splicing together, just only comprise these N sound sign indicating numbers in that phonetic sign indicating number set, such as the vowel consonant of English Sound code is regarded as a sound code collection, and the vowel-consonant sound code that is used for splicing together has about useful dozens, and then this sound code collection just comprises these dozens of vowel-consonant sound codes; That is to combine these multiple phonetic codes into a phonetic code collection, such as multiple phonetic codes comprising two kinds of phonetic codes, the first kind of phonetic codes used to put together useful N phonetic codes, the second kind of phonetic codes used to put together There are M phonetic codes useful for sign indicating number, and the phonetic code collection that puts together just includes these N first kind of phonetic codes and these M second kind of phonetic codes, and comprises two kinds of phonetic codes such as multiple phonetic codes, and is used for the first kind of phonetic codes that are put together There are dozens of vowel and consonant codes in one kind of phonetic code English, and thousands of root phonetic codes in the second phonetic code English used for splicing together. The combined phonetic code set includes these tens of yuan Consonant phonetic codes and these thousands of root phonetic codes; how many phonetic codes and which phonetic codes are useful for a certain phonetic code used for sentence set combination are determined according to needs, and the number of vowels and consonant phonetic codes is seldom There are as many vowel and consonant phonetic codes as there are generally used for spelling, such as words and roots, which have a large number of phonetic codes in the field of conventional languages, and some are generally selected for sentence set spelling according to needs, such as general difficulty There are about 1,000 or 2,000 roots used in English spelling, and some different phonetic codes may be selected according to different needs), and then all the words of this language within an application range (such as teaching all sentences of this language to children in kindergartens) Sentence is regarded as a statement set, and any statement in this statement set can be combined into the pronunciation of this sentence by part of the phonetic codes in the phonetic code set, and any phonetic code in this phonetic code set can be used for different in this sentence set. The spelling of the pronunciation of the sentence; the formulation of "putting together the pronunciation of the sentence set by one or more phonetic codes" and The meaning of similar formulations is also as mentioned above in this paragraph; when the document of the present invention refers to the Indo-European language family, it is said that all the sentences of a certain language within an application range are regarded as a set of sentences, and only one of the same sentences is left , that is, the statements in the statement set are different from each other; what is stated in this paragraph is obvious. (The 'scope of application' mentioned in this paragraph can be limited, such as defining all English sentences taught to children in kindergarten classes as a scope of application, and correspondingly treating all English sentences in this scope of application as a set of sentences, this set of sentences The number of sentences included is obviously generally limited; the scope of application mentioned in this paragraph can also be unlimited, such as defining a scope of application, any kind of English reading material belongs to this scope of application, correspondingly the English within this scope of application All the statements of the statement are regarded as a statement set, and the number of statements included in this statement set is very large, which can be regarded as the number of statements is unlimited)

在本发明文件中各处提及的对印欧语系任一语种的语句进行编码,对此语种是对应定义了一个语句集的,此语句集里语句个数可以是有限或无限的,这里说的编码就是对此语句集里语句进行编码;在本发明文件中各处提及的对表示印欧语系任一语种的语句发音的拼合信息的值进行解码,对此语种是对应定义了一个语句集的,此语句集里语句个数可以是有限或无限的,这里说的解码就是指对表示此语句集里语句发音的拼合信息的值进行解码。本发明文件各处在述及对印欧语系任一语种的语句进行编码时,提及的语句集默认是指某编码规则中有关此语种的编码细则定义的此语种的编码语句集且默认是按照此编码规则中有关此语种的编码细则来编码的;本发明文件各处在述及对表示印欧语系任一语种的语句的发音的拼合信息的值进行解码时,提及的语句集默认是指某解码规则中有关此语种的解码细则定义的此语种的解码语句集且默认是按照此解码规则中有关此语种的解码细则来解码的。在以下直播方式的以只适用于英语为例的编码规则里,以及TTS方式的以只适用于英语为例的编码规则里,都是以编码规则定义(这里说的定义与在这些编码规则里涉及的编码语句集范围的完整定义是等价的)的英语的编码语句集是包括任意英语语句的也即编码语句集里语句个数是无限的为例的(因为有陌生单词处理功能,所以任一英语语句都可编解码)。在以下直播方式的以只适用于英语为例的解码规则里,以及TTS方式的以只适用于英语为例的解码规则里,都是以解码规则定义(这里说的定义与在这些解码规则里涉及的解码语句集范围的完整定义是等价的)的英语的解码语句集是包括任意英语语句的也即解码语句集里语句个数是无限的为例的(因为有陌生单词处理功能,所以任一英语语句都可编解码)。In the documents of the present invention, the sentences of any language of the Indo-European language family mentioned everywhere are encoded, and a sentence set is correspondingly defined for this language. The number of sentences in this sentence set can be limited or unlimited. Here it is said The encoding is exactly to encode the sentences in this sentence set; in the documents of the present invention, the value of the combined information to represent the pronunciation of sentences in any language of the Indo-European language family is decoded, and a sentence is correspondingly defined for this language The number of sentences in this sentence set can be limited or unlimited. The decoding mentioned here refers to decoding the value of the combined information representing the pronunciation of sentences in this sentence set. When referring to the encoding of sentences of any language of the Indo-European language family in the documents of the present invention, the sentence set mentioned defaults to the coded sentence set of this language defined in the coding rules for this language in a certain coding rule and defaults to Encoded according to the coding rules about this language in this coding rule; when the document of the present invention refers to decoding the value of the mosaic information representing the pronunciation of any language in the Indo-European language family, the statement set mentioned defaults It refers to the decoding statement set of this language defined in the decoding rules of this language in a certain decoding rule, and is decoded by default according to the decoding rules of this language in this decoding rule. In the following encoding rules that are only applicable to English in the live broadcast mode, and in the encoding rules that are only applicable to English in the TTS mode, they are all defined by encoding rules (the definitions mentioned here are the same as those in these encoding rules The complete definition of the scope of the coded sentence set involved is equivalent) The coded sentence set of English includes any English sentence, that is, the number of sentences in the coded sentence set is unlimited as an example (because there is a strange word processing function, so Any English sentence can be coded). In the decoding rules that only apply to English in the following live broadcast mode, and in the decoding rules that only apply to English in the TTS mode, they are all defined by decoding rules (the definitions mentioned here are the same as those in these decoding rules. The complete definition of the scope of the decoding sentence set involved is equivalent) the English decoding sentence set includes any English sentence, that is, the number of sentences in the decoding sentence set is unlimited as an example (because there is a strange word processing function, so Any English sentence can be coded).

在本发明文件中各处提及的有关印欧语系任一语种的类似‘用音码拼合成语句发音’这样的叙述(提及的‘语句’的后面没有‘集’这个词),是指由音码(这里说的音码是指属于一种或多种音码中的多个音码)来拼合成一个具体的语句的发音。(本段有提及的‘多种’里的‘多’与本段有提及的‘多个’里的‘多’不一定是相同数目)The narration (not having the word "collection" behind the "sentence" mentioned) about any language of the Indo-European language family mentioned everywhere in the document of the present invention is to mean The pronunciation of a concrete sentence is spliced together by the phonetic code (the phonetic code mentioned here refers to a plurality of phonetic codes belonging to one or more phonetic codes). (The "many" in the "multiple" mentioned in this paragraph and the "many" in the "multiple" mentioned in this paragraph are not necessarily the same number)

本发明文件各处提及的印欧语系任一语种的‘元辅音’的意思是表示将此语种的全部单个元音和全部单个辅音,集合起来当成一种音码‘元辅音’,因元音和辅音在常规情况下是同时使用的,一般不会单独用元音或单独用辅音来构成语句发音;另如把元音和辅音分开当成两种音码并同时使用这两种音码来拼合语句集发音也可,但描述起来比较麻烦(并且分开成两种音码应是把元音和辅音当成一种音码的等同替换),所以本发明文件是将全部单个元音加全部单个辅音当成一种音码‘元辅音’了;因音标是专作为元辅音的书面表示形式的,所以‘元辅音’音码也可被称为‘音标’音码。The meaning of the 'yuan consonant' of any language of the Indo-European language family mentioned in the documents of the present invention means that all single vowels and all single consonants of this language are assembled as a kind of phonetic code 'yuan consonant', because yuan Sound and consonant are used at the same time under normal circumstances, generally do not use vowel alone or use consonant alone to form sentence pronunciation; It is also possible to piece together the pronunciation of the sentence set, but it is cumbersome to describe (and being separated into two kinds of phonetic codes should be to treat vowels and consonants as an equivalent replacement of a kind of phonetic code), so the document of the present invention is to add all single vowels to all single The consonant is regarded as a kind of phonetic code 'yuan consonant'; because the phonetic symbol is specially used as the written expression form of the vowel consonant, so the 'yuan consonant' phonetic code can also be called the 'phonetic symbol' phonetic code.

元音辅音组合,是指对语句的分解不是按单词或词根的概念,比如allow of这样一个短语,如将其分解成a和llow of这样两部分,其表示发音的元辅音也相应分成两段(e)(lauov)(本说明书这里因不方便打音标所以用字母来模拟音标),可把这两段发音(e),(lauov)分别当成两个元音辅音组合音码,对这两段分别赋予编号如001010011,001110011;这两个编号001010011和001110011可以分别转换成(e)和(lau ov)这样的音标信息,将这两个编号组合起来后可转换成(elau ov)这样的音标信息,所以将这样的元音辅音组合音码组合起来也是可以组成语句发音的。Combination of vowels and consonants refers to the concept that the sentence is not decomposed by words or roots. For example, a phrase such as allow of, if it is decomposed into two parts such as a and allow of, the vowels and consonants that represent pronunciation are also divided into two parts. (e) (lauov) (this specification is here because it is inconvenient to make phonetic symbols so use letters to simulate phonetic symbols), these two sections of pronunciation (e), (lauov) can be regarded as two vowel and consonant combination phonetic codes respectively, for these two The segments are assigned numbers such as 001010011 and 001110011; these two numbers 001010011 and 001110011 can be converted into phonetic information such as (e) and (lau ov) respectively, and can be converted into (elau ov) after combining these two numbers Phonetic symbol information, so the combination of such vowel and consonant combination phonetic codes can also form a sentence pronunciation.

对于印欧语系任一语种,语音编码有声读物的整页面上的码字涉及的表示此语种语句发音的拼合信息的值是按照某种编码规则中有关此语种的编码细则来编码生成的,此编码规则可以是只适用于此语种或适用于包括此语种在内的多个语种的,此编码规则中有关此语种的编码细则可以是直播方式的。For any language of the Indo-European language family, the value of the combined information representing the pronunciation of sentences in this language involved in the codewords on the entire page of the phonetic coded audiobook is encoded and generated according to the coding rules related to this language in a certain coding rule. The coding rules can be applicable only to this language or to multiple languages including this language, and the coding rules related to this language in this coding rule can be live broadcast.

使用编码规则的目的是为了编码生成表示语句发音的拼合信息的值;只适用于印欧语系任一语种或可适用于包括数个印欧语系语种在内的多个语种的编码规则可以有很多种(只适用于一个语种的编码规则就可有多种),编码规则的种数实际上是没法穷举的。权利要求3,12,20或23中提及的其有关某语种的编码细则含有压缩编码处理的编码规则(此编码规则可以是只适用于此一个语种或适用于包括此一个语种在内的多个语种的)可以是各种具体的编码规则。The purpose of using encoding rules is to encode the value that generates the combined information representing the pronunciation of the sentence; there can be many encoding rules that are only applicable to any Indo-European language or can be applied to multiple languages including several Indo-European languages There are many types of coding rules applicable to only one language, and the number of coding rules cannot be exhaustive in fact. Its coding rules about a certain language mentioned inclaim 3, 12, 20 or 23 contain coding rules for compression coding processing (this coding rule can be only applicable to this one language or be applicable to many including this one language) language) can be various specific coding rules.

一种只适用于印欧语系某语种的简单的编码规则,可以是用音码作简单的叠加来拼合语句集发音;举例:对于印欧语系某一语种(比如英语),把单词当成音码,将单词词典里的各单词(各单词的不同变形也分别当成不同的单词,比如A,AN就当成2个不同的单词)都赋予不同的二进制编号(各单词的二进制编号长度互相相同,比如14BIT;象A,AN都当成不同的单词分别赋予不同的二进制编号),生成一个单词数据库,记录单词词典里的各单词的拼法(ASCII字符)及单词的二进制编号及单词的发音信息数据(比如词形音标,是用音标编号来组成词形音标的);对于需编码的语句中的各单词,分别替换成其在单词数据库里的二进制编号,将各单词的二进制编号按原来各单词在语句中的顺序排组起来即成语句编码值。与此编码规则相对应的只适用于此语种的解码规则可以是:解码时,可按语句编码值中的各二进制编号,到单词数据库里查找其对应的单词的记录项,并取出单词的发音信息数据(比如词形音标),并可依各二进制编号的排列顺序将各二进制编号对应的单词的发音信息数据(比如词形音标)转成声音发出,即为语句发音。可把本段述的语句编码值看成是表示如何用音码拼合成语句发音的拼合信息的值,可把本段述的编码时的这些手法看成是此编码规则的编码细则,这些编码细则也是此编码规则中有关此语种的编码细则,可把本段述的解码时的这些手法看成是此解码规则的解码细则,这些解码细则也是此解码规则中有关此语种的解码细则;并且本段述的对各单词都用二进制编号来表示(一般为14BIT),而不是用数个ASCII码来表示单词(如用ASCII码表示一个5字母的单词需5*7=35BIT),所以用二进制编号来表示单词是有压缩作用的,所以本段述的编码规则中有关此语种的编码细则是含有压缩编码处理的,而且本段述的解码规则中有关此语种的解码细则是含有压缩解码处理的。权利要求3,12,20或23中提到的拼合信息的值可以是本段说的拼合信息的值,权利要求3,12,20或23中提到的编码规则可以是本段说的编码规则,权利要求3,12,20或23中提到的解码规则可以是本段说的解码规则。A simple encoding rule that is only applicable to a certain language of the Indo-European language family, which can be used to make simple superpositions of phonetic codes to assemble the pronunciation of the sentence set; for example: for a certain language of the Indo-European language family (such as English), words are regarded as phonetic codes , each word in the word dictionary (the different deformations of each word are also regarded as different words, such as A, AN is regarded as two different words) are all given different binary numbers (the lengths of the binary numbers of each word are the same, such as 14BIT; like A, AN are all treated as different words and given different binary codes respectively), generate a word database, record the spelling (ASCII characters) of each word in the word dictionary and the binary code of the word and the pronunciation information data of the word ( Such as word shape phonetic symbol, is to form word shape phonetic symbol with phonetic symbol numbering); The sequences in the statement are grouped together to form the statement coded value. The decoding rule corresponding to this encoding rule that is only applicable to this language can be: when decoding, you can search for the record item of the corresponding word in the word database according to each binary number in the sentence encoding value, and take out the pronunciation of the word Information data (such as word-form phonetic symbols), and the pronunciation information data (such as word-form phonetic symbols) of the words corresponding to each binary number can be converted into sound according to the arrangement order of each binary number, which is the sentence pronunciation. The sentence encoding value described in this paragraph can be regarded as the value of how to use the phonetic sign indicating number to stitch together the combined information of sentence pronunciation, and these techniques during the coding described in this paragraph can be regarded as the coding rules of this coding rule. The detailed rules are also the coding rules related to this language in this coding rule. The decoding methods described in this paragraph can be regarded as the decoding rules of this decoding rule, and these decoding rules are also the decoding rules related to this language in this decoding rule; and Each word described in this paragraph is represented by a binary number (generally 14BIT), rather than using several ASCII codes to represent a word (such as expressing a 5-letter word with an ASCII code that requires 5*7=35BIT), so use Binary numbers are used to represent words with compression, so the encoding details of this language in the encoding rules described in this paragraph include compression encoding processing, and the decoding details of this language in the decoding rules described in this paragraph include compression decoding processed. The value of the combination information mentioned inclaim 3, 12, 20 or 23 may be the value of the combination information mentioned in this paragraph, and the encoding rule mentioned inclaim 3, 12, 20 or 23 may be the encoding described in this paragraph Rules, the decoding rules mentioned inclaim 3, 12, 20 or 23 may be the decoding rules mentioned in this paragraph.

以下是以英语作为印欧语系语种的例子来阐述印欧语系语种的编码技术的。The following uses English as an example of the Indo-European language to illustrate the encoding technology of the Indo-European language.

以下述的主要是对一种编码规则及其编码实例的阐述,此编码规则包括:编码语句集范围完整定义,用于解码时按直接发音方式播放语句发音的音码语音的处理的方式定义,用于编码的音码的种类选择,对音码赋编号,单词数据库建立,词类的建立,各种词段编码方式,时态词形编码处理,陌生单词编码处理及附加头段信息等编码细则,并且是以只适用于一个语种即英语为例的(有时附带提及到法语,德语等其它语种的内容时是有注明是这些语种的),即此编码规则中的所有编码细则都属于此编码规则中有关英语语种的编码细则,并且此编码规则中的对音码赋编号一般是有压缩处理的,编码时可供选用的几种词段编码方式(频率排序,频率关联混合,整句差别等编码方式)中除频率排序编码方式外都是含有压缩编码处理的,所以以只适用于英语为例的此编码规则是含有压缩编码处理的。此编码规则是直播方式的。印欧语系其它语种比如法语,德语,俄语等也可采用此编码规则,当然其具体细节与英语有别,需作相应修改。权利要求3,12,20或23中提及的编码规则,可以是本说明书提及或没提及的适用于印欧语系任一语种(英语或法语或德语或俄语等)的其有关此语种的编码细则含有压缩编码处理且是直播方式的的各种编码规则(本句说的‘各种编码规则’中任一编码规则可能是只适用于此语种或适用于包括此语种在内的多个语种)。此直播方式的以只适用于英语为例的编码规则及其编码实例的具体阐述是从下一行起至41页为止的:Mainly be the elaboration to a kind of coding rule and coding example thereof as follows, this coding rule comprises: the complete definition of coded sentence set scope, the definition of the processing mode of the phonetic code speech that plays sentence pronunciation by direct pronunciation mode when being used for decoding, Selection of the type of phonetic code used for coding, numbering of phonetic codes, establishment of word database, establishment of part of speech, various coding methods of word segments, coding processing of tense and word form, coding processing of unfamiliar words and additional head information, etc. , and it is only applicable to one language, that is, English as an example (sometimes the contents of other languages such as French and German are mentioned as these languages), that is, all the coding rules in this coding rule belong to The relevant coding rules of the English language in this coding rule, and the numbering of the phonetic codes in this coding rule generally has compression processing, and several word segment coding methods (frequency sorting, frequency association mixing, whole Sentence difference and other encoding methods) all contain compression encoding processing except the frequency sorting encoding method, so this encoding rule that is only applicable to English as an example contains compression encoding processing. This encoding rule is live. Other Indo-European languages such as French, German, Russian, etc. can also use this encoding rule, of course, the specific details are different from English and need to be modified accordingly. The encoding rule mentioned inclaim 3, 12, 20 or 23 may be any language (English or French or German or Russian) of the Indo-European language family (English or French or German or Russian, etc.) that is mentioned or not mentioned in this description. The encoding rules contain various encoding rules for compression encoding and live broadcasting (any encoding rule in the "various encoding rules" mentioned in this sentence may be only applicable to this language or applicable to multiple languages including this language. languages). The specific elaboration of the coding rules and coding examples of this live broadcast method, which is only applicable to English, starts from the next line and ends on page 41:

完整定义的编码语句集的范围是任意英语语句,也即编码语句集包括任意英语语句。The scope of the fully defined coded sentence set is any English sentence, that is, the coded sentence set includes any English sentence.

用于解码时按直接发音方式播放语句发音的音码语音的处理,有几种方案:(本发明文件各处所称将语音录音生成的数据的格式可以是诸如ADPCM这类格式,这是公知技术通用惯例)When being used for decoding, play the processing of the phonetic sign indicating number pronunciation of sentence pronunciation by direct pronunciation mode, several schemes are arranged: (the format of the data that voice recording is said to generate can be such as ADPCM such as this type of format in document of the present invention everywhere, and this is known technology general practice)

1:对于每个单词及其词型变化,都录音,并将这些录音数据合起来就是诸音码的发音数据(也是诸音码的录音数据);在解码时,由这些单词及其词型变化的发音(用录音数据播音)来组成语句的发音。这种方案的录音数据存储量很大。(本段所说的单词及其词型变化都是音码)1: for each word and its form change, all recordings, and these recording data are combined the pronunciation data of all phonetic codes (also the recording data of all phonetic codes); When decoding, by these words and word forms thereof The pronunciation of the change (broadcasting with recording data) forms the pronunciation of the sentence. The recording data storage capacity of this scheme is very large. (words and word form changes thereof mentioned in this paragraph are all phonetic codes)

2:由于很多单词都是由词根组成的,比如commitment就是由前缀com,词基mit,后缀ment所组成,这样的词根有一两千个,所以可以对每个词根录音(对于有些个词根,如有轻读,次重读,重读之分的,应把这些个词根的轻读,次重读,重读分开录成不同的音,这是常识),并对各个音标(元辅音)也录音(对于有些个元音或辅音,有轻读,次重读,重读之分的,应把这些个元音或辅音的轻读,次重读,重读分开录成不同的音,这是常识),将这些录音数据合起来就是诸音码的发音数据(也是诸音码的录音数据);在解码时,由词根(还可能包括有少数元辅音)的发音来组成单词的发音,再由各单词的发音来组成语句的发音。这种方案的录音数据存储量也较大。(本段所说的词根,元辅音都是音码)2: Since many words are composed of roots, for example, commitment is composed of prefix com, word base mit, and suffix ment. There are one or two thousand such roots, so each root can be recorded (for some roots, such as If there are light reading, secondary stress reading, and stress reading, the light reading, secondary stress reading, and stress reading of these roots should be recorded separately into different sounds, which is common sense), and each phonetic symbol (consonant) is also recorded (for some If a vowel or consonant has light reading, secondary stress, and heavy reading, the light reading, secondary stress, and heavy reading of these vowels or consonants should be recorded separately into different sounds, which is common sense), and these recording data Together is the pronunciation data of all phonetic codes (also the recording data of all phonetic codes); when decoding, the pronunciation of the root (may also include a small number of vowels and consonants) forms the pronunciation of the word, and then is formed by the pronunciation of each word The pronunciation of the sentence. The recording data storage capacity of this scheme is also relatively large. (The root and consonants mentioned in this paragraph are all phonetic codes)

3:只对各个元音,辅音进行录音(对于有些个元音或辅音,有轻读,次重读,重读之分的,应把这些个元音或辅音的轻读,次重读,重读分开录成不同的音,这是常识),将这些录音数据合起来就是诸音码的发音数据(也是诸音码的录音数据);在解码时,由元音,辅音(音标)的发音(用录音数据播音)组成单词的发音,再由各单词的发音组成语句的发音。这种方案录音数据存储量很小。(本段所说的元音,辅音都是音码)3: Only record each vowel and consonant (for some vowels or consonants, there are light reading, secondary stress reading, and stress reading, the light reading, secondary stress reading, and stress reading of these vowels or consonants should be recorded separately into different sounds, which is common sense), and these recording data are combined to be the pronunciation data of all phonetic codes (also the recording data of all phonetic codes); data broadcasting) to form the pronunciation of words, and then form the pronunciation of sentences by the pronunciation of each word. The recording data storage capacity of this scheme is very small. (The vowels and consonants mentioned in this paragraph are all phonetic codes)

这些录音数据可用于整步直接发音方式或分步直接发音方式;一般是把需用到的以上方案1或方案2或方案3的诸音码的录音数据存储在与拾音器有关的存储器集里。These recorded data can be used for full-step direct pronunciation mode or step-by-step direct pronunciation mode; Generally, the recorded data of all sound codes of theabove scheme 1 orscheme 2 orscheme 3 to be used are stored in the memory set relevant to the pickup.

对于单词组合,可以直接记录其音标发音信息数据(词形音标),或记录其由哪些单词组成,由单词的发音来组成单词组合的发音。For word combination, can directly record its phonetic symbol pronunciation information data (word shape phonetic symbol), or record which words it is made up of, form the pronunciation of word combination by the pronunciation of word.

解码时,如采用整步或分步直接发音方式,在将音码发音组合成语句发音时,有些特殊地方需注意(比如如何处理或衔接语音波形使语音流畅和更合乎发音习惯),可参考现成的语音合成技术。When decoding, if you use the whole-step or step-by-step direct pronunciation method, when combining the pronunciation of the phonetic code into a sentence pronunciation, some special points need to be paid attention to (such as how to process or connect the voice waveform to make the voice smooth and more in line with pronunciation habits), please refer to Off-the-shelf speech synthesis technology.

在各音标语音之间,以及各单词发音之间,一般有静音间隔,可用程序调节静音间隔的长度,可实现语句的变速朗读。There is usually a silent interval between the sounds of each phonetic symbol and the pronunciation of each word. The length of the silent interval can be adjusted by the program, and the variable-speed reading of the sentence can be realized.

另如果不录音而是使用其它方式比如用语音波形的矢量技术来模拟发音码的语音,那应该是录音方式的等同替换。本说明书在叙述采用直接发音方式的以只适用于英语为例的编解码规则及英语的编解码举例涉及音码语音的处理时是以录音方式为例的。In addition, if you do not record but use other methods such as using the vector technology of speech waveforms to simulate the speech of the pronunciation code, that should be an equivalent replacement of the recording method. This specification uses the recording method as an example when describing the encoding and decoding rules that are only applicable to English as an example in the direct pronunciation mode and the encoding and decoding examples in English that involve the processing of phonetic code speech.

对音码进行录音的目的是为了在解码时,用这些音码的录音数据来发音以形成语句的发音(即用直接发音方式),但是有点需注意,就是在编解码过程中用于编解码的主要音码与解码时的发音音码不一定是相同种类,比如,如对音码的录音是采取上面述的第1种方式,对每个单词及其词型变化(单词及其词型变化都可看成是单词音码)都录音,而在编解码过程中是主要用单词音码来作编解码的,解码时调各单词或其词型变化的录音数据来发音以组成语句语音,这种情况就是编解码过程中用于编解码的主要音码与解码时的发音音码是相同种类(都是单词音码);而如对音码的录音是采取上面述的第3种方式,只对各个元音,辅音(音码)进行录音,而在编解码过程中是主要用单词音码来作编解码的,解码时调元音,辅音(可由音标表示)的发音的录音数据来发音以组成各单词的发音(比如单词的发音信息数据就是词形音标)并由各单词的发音组成语句语音,这种情况就是编解码过程中用于编解码的主要音码与解码时的发音音码不是相同种类。本段所述是显然的。The purpose of recording the phonetic code is to use the recording data of these phonetic codes to pronounce the pronunciation of the sentence to form the pronunciation of the sentence (that is, to use the direct pronunciation method) when decoding, but one thing to note is that it is used in the codec process. The main phonetic code and the pronunciation phonetic code when decoding are not necessarily the same kind, such as, as the recording of the phonetic code is to take the above-mentioned 1st way, for each word and its word type change (word and its word type) Changes can be regarded as word sound codes) are all recorded, and in the process of encoding and decoding, the word sound codes are mainly used for encoding and decoding. When decoding, the recorded data of each word or its word form changes are transferred to pronounce to form sentence voice , this situation is exactly the same type (all word phonetic codes) for the main phonetic code used for encoding and decoding in the codec process and the pronunciation phonetic code when decoding; Mode, only each vowel, consonant (sound code) is recorded, and in the process of encoding and decoding, the word sound code is mainly used for encoding and decoding, and the recording of pronunciation of vowel and consonant (which can be represented by phonetic symbols) is adjusted during decoding Data to pronounce to form the pronunciation of each word (such as the pronunciation information data of the word is the word form phonetic symbol) and form the sentence voice by the pronunciation of each word. This situation is the main phonetic code and decoding time used for encoding and decoding in the encoding and decoding process The pronunciation phonetic sign indicating number of is not same kind. What is stated in this paragraph is obvious.

有些单词的一些词形形态在不同的语句里,根据其语句环境或前后不同单词情况,可能会出现不同发音情况,即变音的情况(比如出现连音),如采用上述第3种方式对元辅音来录音及在解码时用元辅音的发音来组成单词及语句的发音,那可制定一些有关变音的发音规则(比如记录解码时遇到哪些单词的词形形态靠在一起时要变音等),在解码生成语句的全部(或非全部)预备发音数据时,考察要生成的全部(或非全部)预备发音数据里各单词的词形形态的排列情况(比如在解码过程中考察由各频率编号和时态词形选择段的数据来得到的各单词的词形形态的排列情况),按有关变音的发音规则,来确定哪些单词的词形形态要变音及怎样变音,并在生成全部(或非全部)预备发音数据时把全部(或非全部)预备发音数据中要变音的单词的词形形态的词形音标替换成要实际发音的词形音标,这样就可把语句的全部(或非全部)发音发出来;而编码处理这边不管其是否有变音;本段内容可从常识推理出来,而且如读物中无变音的语句,编解码时就可不考虑变音情况。对于编解码过程中使用单词组合出现变音的情况,对单词组合的处理仍可参照本段以上所述对单词的处理(或可把要变音的单词组合分拆成多个单词来处理)。本段发音处理是按本说明书后面述的解码时用整步直接发音方式为例来说的,分步直接发音方式可用别的方式来实现变音或可不处理变音。Some morphological forms of some words in different sentences may have different pronunciations according to the sentence environment or different words before and after, that is, the situation of inflection (for example, ligatures), such as using the third method above. vowels and consonants to record and use the pronunciation of vowels and consonants to form the pronunciation of words and sentences during decoding, then you can formulate some pronunciation rules about inflection (such as recording which words are encountered when decoding. sound, etc.), when decoding all (or not all) pre-pronunciation data of generated sentences, examine the arrangement of the morphological forms of each word in all (or not all) pre-pronunciation data to be generated (such as examining in the decoding process The arrangement of the morphological forms of each word obtained by the data of each frequency number and tense morphological selection section), according to the pronunciation rules of the relevant pronunciation rules, determine the morphological forms of which words to change the pronunciation and how to change the pronunciation , and when generating all (or not all) pre-pronunciation data, the morphological-phonetic symbol of the morphological form of the word to be changed in all (or not all) pre-pronunciation data is replaced with the morphological-phonetic symbol of the actual pronunciation, like this All (or not all) of the sentence can be pronounced; and the encoding process does not care whether it has an umlaut; the content of this paragraph can be deduced from common sense, and if there is no umlaut in the reading material, it can be encoded and decoded. Consider the inflection case. For the situation that the combination of words is used in the process of encoding and decoding, the processing of the combination of words can still refer to the processing of the words described above in this paragraph (or the combination of words to be changed can be split into multiple words for processing) . The pronunciation processing in this section is based on the full-step direct pronunciation method as an example when decoding described later in this manual. The step-by-step direct pronunciation method can be used to realize the pronunciation change or not to process the pronunciation change.

在本编码规则里就对用于解码时按直接发音方式播放语句发音的音码语音的处理的方式进行定义,是为了便于选择主要用于编码的音码及定义编码用的各数据库里一些数据的格式,比如如解码那边采取上述第1种方式对于每个单词及其词型变化都录音,那么在选择主要用于编码的音码时,一般就不要选元辅音音码,否则解码时得到预备发音数据或部分预备发音数据后要发音时,各元音和辅音就没有用于发音的录音数据(因为仅对每个单词及其词型变化有录音),并且如解码那边采取上述第1种方式对于每个单词及其词型变化都录音,那在单词数据库里各单词对应的记录项里的各词形形态的发音信息数据字段里,一般就不要记录词形音标,而是应该记录此单词的此词形形态的录音数据的编号(或地址)。本编码规则可看成解码那边是以上述第3种方式只对各个元音和辅音(即音标)进行录音为例的,解码时语句的发音是由音标的语音来组成的(比如用直接发音方式时可由主MCU调与拾音器有关的存储器集里音标的录音数据把组成语句发音的各音标播放出语音来),相应的主要是用单词及单词组合这两种音码来作编码,在单词数据库里各单词及单词组合对应的记录项里的各词形形态的发音信息数据字段里,就是记录词形音标。本段以及上几段提及的解码时由单词的发音组成语句的发音或由音标的语音组成语句的发音或类似表达,是适用于整步直接发音方式或分步直接发音方式的(因分步直接发音方式时,虽然是分步来发音,但听觉上还是由单词的发音组成语句的发音或由音标的语音组成语句的发音或是由其它音码的发音组成语句的发音的)。In this coding rule, just define the processing mode of the phonetic sign indicating number speech of playing sentence pronunciation by the direct pronunciation mode when being used for decoding, be for the convenience of selecting the phonetic sign indicating number that is mainly used in coding and define some data in each database of coding usefulness For example, if the decoding side adopts the first method above to record each word and its word type changes, then when selecting the phonetic code mainly used for encoding, generally do not choose the vowel and consonant phonetic code, otherwise when decoding When pronouncing after obtaining the preliminary pronunciation data or part of the preliminary pronunciation data, each vowel and consonant does not have recording data for pronunciation (because there are only recordings for each word and its word form change), and if the decoding side adopts the above The first way is all recordings for each word and its word form change, then in the pronunciation information data field of each word form in the record item corresponding to each word in the word database, generally just do not record the word form phonetic symbol, but The number (or address) of the recording data of this morphological form of this word should be recorded. This encoding rule can be regarded as the decoding over there is the above-mentioned 3rd way only to each vowel and consonant (being phonetic symbol) is recorded as example, the pronunciation of sentence is formed by the pronunciation of phonetic symbol during decoding (for example with direct During the pronunciation mode, the recording data of the phonetic symbols in the memory set relevant to the pickup can be adjusted by the main MCU to play out each phonetic symbol that forms the pronunciation of the sentence), and correspondingly mainly use these two phonetic codes of words and word combinations to do coding. In the pronunciation information data field of each morphological form in the record item corresponding to each word and word combination in the word database, the morphological phonetic symbols are recorded. When this paragraph and last several paragraphs are mentioned, the pronunciation of the sentence formed by the pronunciation of the word or the pronunciation of the sentence formed by the pronunciation of the phonetic symbol or similar expressions are applicable to the whole step direct pronunciation mode or the step-by-step direct pronunciation mode (due to the division During the direct pronunciation mode of the first step, although it is pronounced step by step, the pronunciation of the sentence is formed by the pronunciation of the word or the pronunciation of the sentence is formed by the pronunciation of the phonetic symbol or the pronunciation of the sentence is formed by the pronunciation of other phonetic codes).

以下主要是用单词及单词组合这两种音码来拼合编码语句集发音,即用于编码的音码的种类主要选择单词及单词组合。(因为单词组合是由单词组成的,并且组成各单词组合的单词一般都记录在单词数据库里,所以即使语句中有单词组合,也可用单词来分开表示单词组合,所以,即使对各单词和各单词组合都排序编号并存入单词数据库里,编码时也可只使用单词来编码,而可将单词组合先搁置不用,如使用单词组合,那编码的压缩效率会更高,并且在编码时单词组合的使用方式是与单词的使用方式相似的;在本发明文件的说明书中的直播方式的以只适用于英语为例的编码规则和直播方式的以只适用于英语为例的解码规则的阐述里,是主要选单词音码及单词组合音码来编解码的,并且有对单词音码及单词组合音码进行排序编号,在相应的编解码举例中主要涉及用单词音码,后面的编解码举例也有涉及用单词组合音码)The following mainly use these two kinds of phonetic codes of word and word combination to piece together the pronunciation of coded sentence set, promptly be used for the kind of the phonetic code of coding and mainly select word and word combination. (Because the word combination is made up of words, and the words forming each word combination are generally all recorded in the word database, so even if there are word combinations in the sentence, words can also be used to represent the word combination separately, so even for each word and each The word combinations are sorted and numbered and stored in the word database. When encoding, only words can be used for encoding, and word combinations can be left unused. If word combinations are used, the compression efficiency of encoding will be higher, and words will be encoded when encoding The way of using the combination is similar to the way of using words; in the description of the document of the present invention, the encoding rules of the live broadcast mode are only applicable to English as an example and the decoding rules of the live broadcast mode are only applicable to English as an example Here, word phonetic codes and word combination phonetic codes are mainly selected for encoding and decoding, and the word phonetic codes and word combination phonetic codes are sorted and numbered. In the corresponding encoding and decoding examples, word phonetic codes are mainly used. The following codes The example of decoding also has to involve the phonetic sign indicating number with combination of words)

由于在用单词作为主要音码来建立单词数据库时,既可把各常用单词组合作排序编号并存入单词数据库,也可不存入,即使对各单词和各单词组合都排序编号并存入单词数据库里,在编解码时,仍然可以只使用单词(原因上段有述);本发明文件中在述及印欧语系语种时在各处提及这两个概念时,是用‘单词及单词组合’;本来按常识,‘及’是连词并与‘和’的意思相同,但本发明文件约定在本发明文件中各处提及的‘单词及单词组合’的意思与‘单词和单词组合’的意思是有点不同的,‘单词及单词组合’可在两种具体情况下作两种不同的理解,如编解码时只涉及用单词而不用单词组合,那提及的‘单词及单词组合’就指‘单词’,如编解码时既涉及用单词也涉及用单词组合,那提及的‘单词及单词组合’就指‘单词和单词组合’。Because when using word as main phonetic code to set up word database, both commonly used word combinations can be sorted and numbered and stored in word database, and can not be stored, even if each word and each word combination are all sorted and numbered and stored in word In the database, when encoding and decoding, only words can still be used (the reason is stated in the previous paragraph); when referring to the Indo-European languages in the document of the present invention, when referring to these two concepts everywhere, use 'word and word combination '; Originally, according to common sense, 'and' is a conjunction and has the same meaning as 'and', but the document of the present invention agrees that the meaning of 'word and word combination' mentioned in the document of the present invention is the same as that of 'word and word combination' The meaning is a bit different, 'words and word combinations' can be understood in two different ways in two specific situations, such as encoding and decoding only involves the use of words but not word combinations, the mentioned 'words and word combinations' It refers to 'words'. If both words and word combinations are involved in encoding and decoding, the mentioned 'words and word combinations' refer to 'words and word combinations'.

单词及单词组合的排序及分配编号:Sorting and assigning numbers of words and word combinations:

这里的单词是指如he,man这样的单词,单词组合既包括平时所指的如:The words here refer to words such as he and man, and the combination of words includes what they usually refer to, such as:

take up这类词组,也包括像he is a这样的单词组合。Phrases such as take up also include word combinations like he is a.

而且语句发音可完全看成是由元辅音来组合起来的,所以把各单词及单词组合转成他们的元音辅音组合(空格用停顿音表示),主要用元音辅音组合这种音码来拼合编码语句集发音也是可以的。And the sentence pronunciation can be regarded as being combined by vowel and consonant completely, so each word and word combination are changed into their vowel and consonant combination (the space is expressed with pause sound), mainly come with this phonetic sign indicating number of vowel and consonant combination It is also possible to flatten the pronunciation of the coded sentence set.

本说明书为了叙述方便,采取主要用单词及单词组合这两种音码来拼合编码语句集发音的方式。  (另外,在陌生单词的处理时,可能还会用到元辅音等音码,对陌生单词处理涉及的元辅音等音码的编号等处理在后面陌生单词段的阐述里有叙述)For the convenience of narration, this instruction manual adopts the mode that mainly uses these two phonetic codes of words and word combinations to piece together the pronunciation of the coded sentence set. (In addition, in the processing of unfamiliar words, phonetic codes such as vowels and consonants may also be used. The numbering of phonetic codes such as vowels and consonants involved in the processing of unfamiliar words is described in the explanation of unfamiliar words later)

对单词及单词组合分配的编号一般可以是二进制格式。The numbers assigned to words and combinations of words may generally be in binary format.

对作为音码的单词和单词组合进行编号,这本来就是一种压缩处理,比如一个单词有5个字母,如直接用ASCII字符来表示,那每7个BIT的ASCII字符表示一个字母,5个字母就要占35个BIT,而如采用编号来表示此一个单词,即使是使用后面述的等长编号,其长度也仅12BIT(以单词数据库中的单词总个数为4千来计的),而如使用阶梯编号等来表示一个单词那长度会更短,所以采用编号来表示一个单词是有明显压缩效果的。Numbering words and word combinations as sound codes is originally a kind of compression processing. For example, a word has 5 letters. If it is directly represented by ASCII characters, then every 7 ASCII characters of BIT represent a letter, 5 Letter will account for 35 BITs, and as adopting numbering to represent this word, even if using the equal-length numbering described later, its length is also only 12BIT (based on the total number of words in the word database being 4,000) , and if you use a ladder number to represent a word, the length will be shorter, so using a number to represent a word has a significant compression effect.

给单词和单词组合编号的方法,最简单的就是每个单词或单词组合固定长度编一个二进制码,比如单词数据库中的单词及单词组合总个数为4千个的话,那每个单词或单词组合用12位bit表示即可。本发明文件称此编码法编的号为等长编号。The easiest way to number words and word combinations is to encode a binary code with a fixed length for each word or word combination. For example, if the total number of words and word combinations in the word database is 4,000, then each word or word combination The combination can be represented by 12 bits. The document of the present invention claims that the number that this coding method makes up is equal-length numbering.

如编解码不使用单词组合,就可对单词单独排序,如编解码使用单词组合,就可对单词和单词组合混合排序,即把单词组合也当成单词并与单词统一处理,比如一个排序片段可以是:SHEEP,MAP,TAKE UP,TABLE这样的。If the codec does not use word combinations, words can be sorted separately. If the codec uses word combinations, words and word combinations can be mixed and sorted, that is, word combinations are also treated as words and treated together with words. For example, a sorting segment can be It is: SHEEP, MAP, TAKE UP, TABLE and so on.

按大学四级的大纲要求,有4千多个单词,加上一些常用单词组合,约5,6千数量,可以对这5,6千数量的平常使用的单词及单词组合进行排序,即统计这些单词及单词组合在平时使用的各种教材及读物中的使用频率,并按使用频率排序,使用频率高的,就排在前面,使用频率低的,就排在后面;比如the单词的使用频率高,就排在前面,比如wait的使用频率较低,就排在后面。According to the syllabus requirements of the fourth level of the university, there are more than 4,000 words, plus some commonly used word combinations, about 5,6,000. You can sort these 5,6,000 commonly used words and word combinations, that is, statistics The frequency of use of these words and word combinations in various textbooks and reading materials used in daily life, and sorted by the frequency of use, the high frequency of use will be in the front, and the low frequency of use will be in the back; for example, the use of the word If the frequency is high, it will be in the front. For example, if the use frequency of wait is low, it will be in the back.

对单词及单词组合按使用频率排序的目的是为了节省编号长度,按哈夫曼编号的原理,对于使用频率高的分配短的编号,使用频率低的分配长的编号;所以这里既可以直接使用哈夫曼编号的原理进行编号,也可使用另一种方式,即按使用频率对几千个单词及单词组合先排序,然后,前8个单词及单词组合用3个二进制位表示,接着的64个单词及单词组合用6个二进制位表示,再接着的512个单词及单词组合用9个二进制位表示,余下的4096个单词及单词组合用12个二进制位表示,共可表示8+64+512+4096=4680个单词及单词组合。比如the是第一段的8个单词及单词组合中的第3个,用010表示,say是第二段的64个单词及单词组合中的第18个,用010001表示,wait是第三段的512个单词及单词组合中的第91个,用001011010表示,potato是第4段的4096个单词及单词组合中的第1693个,用011010011100表示,tree也是第4段的4096个单词及单词组合中的第173个,用000010101100表示。本发明文件称此编码法编的号为阶梯编号,阶梯的长度改为别的如4,8,12也可以的。The purpose of sorting words and word combinations according to the frequency of use is to save the length of the number. According to the principle of Huffman numbering, assign short numbers to those with high frequency of use, and assign long numbers to those with low frequency; so here you can directly use The principle of Huffman numbering is used for numbering, and another method can also be used, that is, to sort thousands of words and word combinations according to the frequency of use, and then, the first 8 words and word combinations are represented by 3 binary bits, and then 64 words and word combinations are represented by 6 binary bits, and the next 512 words and word combinations are represented by 9 binary bits, and the remaining 4096 words and word combinations are represented by 12 binary bits, which can represent a total of 8+64 +512+4096=4680 words and word combinations. For example, the is the third of the 8 words and word combinations in the first paragraph, represented by 010, say is the 18th of the 64 words and word combinations in the second paragraph, represented by 010001, and wait is the third paragraph The 91st of the 512 words and word combinations in , represented by 001011010, potato is the 1693rd of the 4096 words and word combinations in the fourth paragraph, represented by 011010011100, and tree is also the 4096 words and words in the fourth paragraph The 173rd in the combination is represented by 000010101100. The document of the present invention claims that the number that this coding method makes up is the step numbering, and the length of the step changes other as 4,8,12 also can.

哈夫曼编号方式,可以采用常规的编法,将单词及单词组合全统计其使用频率值,然后再根据各单词及单词组合的使用频率值,按哈夫曼编码法给每个单词及单词组合赋予号值。这种码前面不需表示码长。The Huffman numbering method can use the conventional coding method to count the frequency values of words and word combinations, and then assign each word and word according to the frequency value of each word and word combination according to the Huffman coding method. Combination assigned number value. There is no need to indicate the code length in front of this code.

另还可采取一些别的编号方式,对使用频率高的单词及单词组合赋短编号,频率低的赋长编号。Also can take some other numbering ways in addition, assign short numbering to the word and word combination with high frequency of use, endow long numbering with low frequency.

为了叙述方便,下面将等长编号,阶梯编号或哈夫曼编号都称为频率编号的一种(等长编号的各单词及单词组合的使用频率假设相同),即频率编号可以是等长编号,阶梯编号或哈夫曼编号等。本说明书涉及的印欧语系语种(以只适用于英语为例)的各编解码举例中各单词及单词组合的频率编号都默认是阶梯编号的(即比如可在以只适用于英语为例的编码规则中和与其对应的以只适用于英语为例的解码规则中以及编解码涉及的各数据库里都默认各单词及单词组合的频率编号是阶梯编号);如各单词及单词组合使用其它比如等长编号,哈夫曼编号等频率编号也是可以的,并且如使用等长编号或哈夫曼编号时,在这些频率编号的前面不用带频率编号的BIT长度信息;并且可在编解码过程中默认只使用等长编号,哈夫曼编号等频率编号的一种,如在编解码过程中有使用几种频率编号,那可以在编解码规则里规定每个处理单位的头段里需有一个或多个BIT来表示本处理单位里的单词及单词组合是使用哪种频率编号,显然,如在编解码过程中有使用等长编号,哈夫曼编号等频率编号,那在编解码涉及的数据库比如单词数据库里应记录有这些频率编号的信息。For the convenience of description, the equal-length numbering, ladder numbering or Huffman numbering are all referred to as a type of frequency numbering (the frequency of use of each word and word combination of the equal-length numbering is assumed to be the same), that is, the frequency numbering can be an equal-length numbering , ladder numbering or Huffman numbering, etc. The frequency numbers of each word and word combination in the codec examples of the Indo-European languages (only applicable to English as an example) involved in this manual are stepwise numbers by default (that is, for example, they can be used only in English as an example) In the encoding rules and the corresponding decoding rules that are only applicable to English as an example and in each database involved in encoding and decoding, the frequency number of each word and word combination is a step number by default); if each word and word combination use other such as Frequency numbers such as equal-length numbers and Huffman numbers are also possible, and when using equal-length numbers or Huffman numbers, there is no BIT length information with frequency numbers in front of these frequency numbers; and it can be used in the encoding and decoding process By default, only equal-length numbers, Huffman numbers and other frequency numbers are used. If several frequency numbers are used in the encoding and decoding process, it can be stipulated in the encoding and decoding rules that there must be one in the header of each processing unit. or multiple BITs to indicate which frequency number is used for the words and word combinations in this processing unit. Obviously, if frequency numbers such as equal-length numbers and Huffman numbers are used in the process of encoding and decoding, then the frequency numbers involved in encoding and decoding Information about these frequency codes should be recorded in a database such as a word database.

对单词及单词组合排序后,将这些单词及单词组合的信息存储在单词数据表里(当然如只使用单词来进行编码,那只需把这些单词的信息存储在单词数据表里而不需存储单词组合的信息),单词数据表的每个记录项里存储一个单词或单词组合的信息,此信息包括比如此单词或单词组合的拼法(比如ASCII字符),频率编号,发音信息数据(比如词形音标)等,在解码时可在单词数据库里提取语句中各单词及单词组合对应的记录项里的发音信息数据(比如单词或单词组合的词形音标),以构成语句的发音。After sorting the words and word combinations, store the information of these words and word combinations in the word data table (of course, if only words are used for encoding, then you only need to store the information of these words in the word data table without storing The information of word combination), the information of a word or word combination is stored in each record item of the word data table, and this information includes such as the spelling (such as ASCII characters) of this word or word combination, frequency number, pronunciation information data (such as morphological phonetic symbols), etc., can extract the pronunciation information data (such as the morphological phonetic symbols of words or word combinations) in the record items corresponding to each word and word combination in the sentence in the word database to form the pronunciation of the sentence.

(另外,因为印欧语系里有的语种,如英语,德语,法语,其单词拼法,即隐含或部分隐含了读音,所以当单词数据库里需同时带单词拼法和读音拼法时,可只记录单词拼法,然后由读音规则从单词拼法得出其读音,如有的单词不符合读音规则,就同时记录单词拼法和读音拼法。)(In addition, because some languages in the Indo-European language family, such as English, German, and French, have word spellings that imply or partially imply pronunciation, so when the word database needs to have both word spelling and pronunciation spelling , you can only record the spelling of the word, and then get its pronunciation from the spelling of the word according to the pronunciation rules. If some words do not conform to the pronunciation rules, the spelling and pronunciation of the word will be recorded at the same time.)

单词的分类:Classification of words:

按单词的属性进行归类,比如可分为:社会,家庭,人,食品,服装,建筑,动作,相貌,表情,运动,艺术,上班,经贸,连词,助词等Classify according to the attributes of words, for example, it can be divided into: society, family, people, food, clothing, architecture, action, appearance, expression, sports, art, work, economy and trade, conjunctions, auxiliary words, etc.

对单词归类,比如:Categorize words, such as:

人:people,man,woman,boy,girl,youth,baby,child,human,personPeople: people, man, woman, boy, girl, youth, baby, child, human, person

动作:open,close,pull,draw,put,place,lay,give,carry,run,eatActions: open, close, pull, draw, put, place, lay, give, carry, run, eat

相貌:beauty,ugly,prettyAppearance: beauty, ugly, pretty

可把每个类里具体包含的单词都记录下来,成为单词分类数据(单词分类数据里一般只需包括单词,不需包括单词组合),可供编解码时使用。每一个类里面包含的单词数量可以固定,也可不固定,在不固定时,可按其单词数量的多少分配类里的每个单词的序号的位长,比如某一个类里面,有32个单词,此类里的每个单词的序号就分配5个二进制位,另一个类有6个单词,此类里的每个单词的序号就分配3个二进制位。The words specifically contained in each class can be recorded and become word classification data (generally only need to include words in the word classification data, and do not need to include word combinations), which can be used during encoding and decoding. The number of words contained in each class can be fixed or not. When it is not fixed, the bit length of the serial number of each word in the class can be allocated according to the number of words. For example, there are 32 words in a class , the serial number of each word in this class is assigned 5 binary bits, and the other class has 6 words, and the serial number of each word in this class is assigned 3 binary bits.

另外,单词的分类也可增加按词性来分,比如除以上的‘社会’,‘家庭’等类外,再增加动词,名词,形容词等类,然后象后面述的建立关联关系也是同样可行的,比如一个单词boy,可以用形容词去修饰此单词,所以在单词数据库里此单词对应的记录项里可以记录此单词与形容词类关联,编解码时就可利用此关联性来达到压缩目的。In addition, the classification of words can also be divided by part of speech. For example, in addition to the above categories of 'society' and 'family', verbs, nouns, adjectives and other categories can be added, and then it is also feasible to establish associations as described later. , For example, a word boy can be modified with an adjective, so the record item corresponding to this word in the word database can record the association between this word and the adjective class, and this association can be used to achieve compression purposes during encoding and decoding.

建立单词的关联:Create word associations:

对于语句中出现的单词,其附近都可能会有单词来修饰它或有单词与它有关联关系,而修饰它或与它有关联关系的单词数量是有限的,然后就可以利用这个原理达到压缩编码的目的;对于名词,一般有形容词,数词去修饰,比如boy,此词前面修饰的词可以是onehandsome,与boy这个词有关联关系的还有动作等类的词,比如eat;对于动词,一般有副词形容词去修饰,比如run,修饰的词可是fast run。本说明书在编解码举例中记录单词的关联信息是采取这种方式的:对于单词数据表里的各个单词,就在单词数据库里此单词对应的记录项里记录下此单词所属词类编号及与此单词关联的各词类的编号,比如boy,与其可能关联的词类可能是:动作,相貌,运动,表情等,然后就在单词数据库里单词boy对应的记录项的一个字段里记录下单词boy所属词类编号并在此记录项的另一个字段里记录下与boy关联的动作,相貌,运动,表情等词类的编号。For a word that appears in a sentence, there may be words nearby to modify it or have a word associated with it, and the number of words that modify it or have an associated relationship with it is limited, and then this principle can be used to achieve compression The purpose of encoding; for nouns, there are generally adjectives and numerals to modify, such as boy, the word modified in front of this word can be onehandsome, and words such as actions are related to the word boy, such as eat; for verbs , there are usually adverbial adjectives to modify, such as run, the modified word is fast run. This manual records the associated information of words in the encoding and decoding examples in this way: for each word in the word data table, record the number of the part of speech to which the word belongs and the number corresponding to the word in the record item corresponding to the word in the word database. The number of each part of speech associated with the word, such as boy, and the part of speech that may be associated with it may be: action, appearance, movement, expression, etc., and then record the part of speech to which the word boy belongs in a field of the record item corresponding to the word boy in the word database Number and record the number of parts of speech such as actions, looks, sports, and expressions associated with boy in another field of this record item.

对于有些单词组合,也可如上法编入类里面,但一般可定义各单词组合不属于任一类且与其它类没有关联关系,在实际编解码中如有需考察单词组合与旁边单词的关联性的时候可用单词组合里包括的单词来考察其与旁边单词是否关联,如有需考察两个单词组合之间的关联性,则可考察分属于两个单词组合的单词之间的关联性而得到单词组合之间的关联性。For some word combinations, it can also be incorporated into the class as above, but generally it can be defined that each word combination does not belong to any class and has no relationship with other classes. In actual encoding and decoding, if it is necessary to examine the association between word combinations and adjacent words When it is time to use the words included in the word combination to check whether it is associated with the next word, if it is necessary to examine the relevance between the two word combinations, you can examine the relevance between the words that belong to the two word combinations. Get the correlation between word combinations.

另外,记录单词的关联信息还可采取另一种方式:在单词数据库里各单词对应的记录项里记录下此单词所属词类编号,而不需记录此单词与哪些词类关联,但另有数据库记录各词类之间的关联关系(比如人这一类的单词与相貌类的单词就有关联,比如社会类的单词与运动类的单词就没有关联),在编解码考察两个单词之间的关联性时,实际是考察这两个单词所属词类的关联性,此种方式比较节约关联信息数据的存储空间。In addition, another method can be used to record the associated information of words: record the number of the part of speech to which the word belongs in the record item corresponding to each word in the word database, without recording which parts of speech this word is associated with, but there is another database record The relationship between the various parts of speech (for example, words such as people are related to words of appearance, such as words of society and words of sports are not related), and the relationship between the two words is examined in the encoding and decoding In fact, it is to examine the relevance of the parts of speech to which the two words belong. This method saves the storage space of the associated information data.

建立单词的关联还可采取另外一种方式:即对应于每个单词,都穷举罗列与其有关联的单词(而不是词类);比如与boy的有关联的词就有(fat,low,good,old,young,eat等),当然这样做的缺点是单词的关联数据会很多。Establishing the association of words can also take another way: that is, corresponding to each word, all the words associated with it are exhaustively listed (rather than part of speech); for example, the words associated with boy have (fat, low, good , old, young, eat, etc.), of course, the disadvantage of this is that there will be a lot of associated data for words.

建立关联关系,是为了在后面述的有些词段编解码方式里作压缩之用。The purpose of establishing an association relationship is to use it for compression in some word segment encoding and decoding methods described later.

这样单词数据表的单词或单词组合对应的记录项的结构(即一个记录项的各字段)就如下:(本发明文件在述及印欧语系语种编解码处理及以英语为例举例时各处所称单词表,单词数据表,单词数据库是相同概念;另本发明文件在述及非印欧语系语种编解码处理及以汉语为例举例时各处所称词表,词数据表与词数据库是相同概念,这是显然的,而且在数据库技术范畴,数据表和数据库通常是一个概念)The structure (that is, each field of a record item) corresponding to the word or word combination of the word data table is just as follows: Said word list, word data table, and word database are identical concepts; Another file of the present invention is said word table everywhere when referring to non-Indo-European languages encoding and decoding processing and taking Chinese as an example, and word data table and word database are identical concept, which is obvious, and in the field of database technology, data tables and databases are usually a concept)

man:(单词)原型的拼法字段man: spelling field of (word) prototype

011100100:频率编号字段(一般为二进制格式)011100100: frequency number field (usually in binary format)

3:所属词类编号字段(man所属词类编号)3: The part of speech number field to which man belongs (the number of the part of speech to which man belongs)

7,8,9,10:关联词类编号字段(man与7,8,9,10几类词关联)7, 8, 9, 10: associated part of speech number field (man is associated with 7, 8, 9, 10 words)

[man]:词形形态1的发音信息数据字段(单词原型man的词形音标,本说明书这里是用英语字母模拟表示音标的)[man]: the pronunciation information data field of morphological form 1 (the word form phonetic symbol of the word prototype man, this manual uses English letters to simulate the phonetic symbol here)

[men]:词形形态2的发音信息数据字段(man的复数men的词形音标,本说明书这里是用英语字母模拟表示音标的)[men]: the pronunciation information data field of morphological form 2 (the word form phonetic symbols of the plural men of man, this manual uses English letters to simulate the phonetic symbols here)

或(再看一个单词组合对应的记录项的各字段,因在编码规则中一般是把单词组合当成与单词一样处理的,所以单词组合对应的记录项的各字段与单词对应的记录项的各字段是基本相同的;当然如单词数据库里没有记录单词组合的信息,那单词数据库里就没有这样的单词组合的记录项):Or (look at each field of the record item corresponding to a word combination again, because generally word combination is treated as the same as word in the encoding rule, so each field of the record item corresponding to the word combination is the same as each field of the record item corresponding to the word The fields are basically the same; of course, if there is no word combination information recorded in the word database, then there is no such word combination record in the word database):

is a:(单词组合)原型的拼法字段is a: (word combination) spelling field of prototype

110011:频率编号字段(一般为二进制格式)110011: frequency number field (usually in binary format)

254:所属词类编号字段(假设254表示本单词或单词组合不属于任何词类)254: Part of speech number field (assuming 254 indicates that the word or combination of words does not belong to any part of speech)

255:关联词类编号字段(假设255表示本单词或单词组合与任何词类无关联关系)255: associated part of speech number field (assuming 255 indicates that this word or word combination has no association with any part of speech)

[is e]:词形形态1的发音信息数据字段(单词组合原型is a的词形音标,本说明书这里是用英语字母模拟表示音标的)[is e]: Pronunciation information data field of morphological form 1 (the phonetic symbol of the word combination prototype is a, this manual uses English letters to simulate the phonetic symbol)

[bi e]:词形形态2的发音信息数据字段(be a的词形音标,本说明书这里是用英语字母模拟表示音标的)[bi e]: Pronunciation information data field of morphological form 2 (the phonetic symbol of the word form of be a, this manual uses English letters to simulate the phonetic symbol)

[waz e]:词形形态3的发音信息数据字段(was a的词形音标,本说明书这里是用英语字母模拟表示音标的)[waz e]: Pronunciation information data field of morphological form 3 (the phonetic symbol of word form of was a, here in this manual, English letters are used to simulate the phonetic symbol)

[is an]:词形形态4的发音信息数据字段(is an的词形音标,本说明书这里是用英语字母模拟表示音标的)[is an]: Pronunciation information data field of word form 4 (the word form phonetic symbol of is an, this manual uses English letters to simulate phonetic symbols here)

[bi an]:词形形态5的发音信息数据字段(be an的词形音标,本说明书这里是用英语字母模拟表示音标的)[bi an]: Pronunciation information data field of morphological form 5 (be an word form phonetic symbol, this manual uses English letters to simulate phonetic symbols here)

[waz an]:词形形态6的发音信息数据字段(was an的词形音标,本说明书这里是用英语字母模拟表示音标的)[waz an]: Pronunciation information data field of morphological form 6 (the word form phonetic symbol of was an, this manual uses English letters to simulate the phonetic symbol here)

这里示的单词数据表的单词或单词组合对应的记录项的结构仅为例子(有些没使用的字段没标出),还可以包括词性或词形形态个数等其它字段;如某单词或单词组合的某词形形态有重读,次重读的音节发音,那在相应记录此单词或单词组合的词形形态的发音信息数据(词形音标)字段里应记录重读或次重读的信息,这是显然的,本发明文件各处涉及用音标编号来记录发音信息数据的地方(比如单词数据库里的各词形形态的发音信息数据字段)是默认单词的音节的重读,次重读符号也属于音标,并分配有音标编号,并与其它元音,辅音等的音标编号一起来记录发音信息数据的,这也是常识。(这里定义的是记录项的数据结构,至于各字段里的数据用什么字符格式不关紧要由编码者自定,比如单词的拼法一般可用ASCII字符,这也是常识,频率编号一般是二进制格式,词形音标则可由多个音标编号组成)The structure of the record item corresponding to the word or word combination of the word data table shown here is only an example (some unused fields are not marked), and can also include other fields such as part of speech or the number of morphological forms; such as a certain word or word A certain word form of combination has stress, the syllable pronunciation of second stress, then should record the information of stress or second stress in the pronunciation information data (word form phonetic symbol) field of the word form of corresponding record this word or word combination, this is Obviously, the document of the present invention relates to the place (such as the pronunciation information data field of each morphological form in the word database) of recording the pronunciation information data with the phonetic symbol numbering everywhere, is the stress of the syllable of the default word, and the secondary stress symbol also belongs to the phonetic symbol, And assign phonetic symbol numbers, and record pronunciation information data together with the phonetic symbol numbers of other vowels, consonants etc., this is also common sense. (The data structure of the record item is defined here. As for the character format of the data in each field, it does not matter. It is up to the coder to decide. For example, the spelling of words can generally use ASCII characters. This is also common sense. Frequency numbers are generally in binary format. , word form phonetic symbols can be composed of multiple phonetic symbol numbers)

就可把大学四级大纲要求的约4千个单词,加上1,2千个常用单词组合,编成单词数据库,即这约4千个单词和1,2千个单词组合,每个都在单词数据库里开一个且只开一个记录项(单词数据库里除这些单词和单词组合的记录项外没别的记录项),记录项的结构见上几段述,单词数据库里不同记录项里的原型的拼法字段的数据一定是不同的,各记录项里的频率编号,所属词类编号,关联词类编号等字段里的数据内容可根据本说明书第19页至第22页所述来得到。The about 4,000 words required by the university's four-level syllabus, plus 1,2,000 commonly used word combinations, can be compiled into a word database, that is, the about 4,000 words and 1,2,000 word combinations, each Open one and only one record item in the word database (there are no other record items in the word database except the record items of these words and word combinations). The structure of the record item is described in the previous paragraphs. The data in the spelling field of the prototype must be different, and the data content in the fields such as the frequency number in each record item, the part of speech number to which it belongs, and the associated part of speech number can be obtained according to the description on pages 19 to 22 of this manual.

英语里一般没有形同音不同的单词(或单词组合)的情况,本说明书有关英语的编解码规则举例也是按单词的各词形形态和单词组合的各词形形态没有形同音不同的这样的情况来作的,(一个单词或单词组合的轻读重读不算形同音不同的情况,本发明文件约定变音情况也不算形同音不同的情况);如万一出现形同音不同的,在编解码时可把形同音不同的都分别当成不同的单词(或单词组合),并在单词数据表里分别开记录项来记录其信息。In English, there are generally no words (or word combinations) with different homophones. The codec rules in this instruction manual for English are also examples in which each morphological form of a word and each morphological form of a word combination do not have a different homonym. (the light reading and stress reading of a word or combination of words does not count as the case of different homonyms, and the document of the present invention stipulates that the situation of changing the pronunciation does not count as the situation of different homophones); as in case homonyms occur Different, when coding and decoding, homonyms and different words can be regarded as different words (or word combinations), and record items are respectively opened in the word data table to record their information.

建立单词的关联的作用,是为了在采用单词段的频率关联混合方式或关联方式编解码时,以提高压缩效率,而如是采用单词段的频率排序方式或整句差别方式编解码,就不使用单词的关联信息。The function of establishing the association of words is to improve the compression efficiency when using the frequency association mixing method or association method of word segments to encode and decode, but if the frequency sorting method of word segments or the whole sentence difference method is used for encoding and decoding, it will not be used word association information.

在同一个单词数据库里,单词组合和用于组成它的各个单词可分别开频率编号,比如is a,is,a三个可共存于同一个单词数据库里,在三个不同的记录项里分别记录,并各自具有不同的频率编号。(本说明书默认各单词组合和用于组成它们的各个单词在单词数据库里都分别对应有各自的记录项,并在各自的记录项里记录有各自的频率编号)In the same word database, the word combination and each word used to form it can be assigned a frequency number separately, such as is a, is, and a can coexist in the same word database, in three different record items respectively records, each with a different frequency number. (This manual defaults that each word combination and each word used to form them has its own record item in the word database, and its frequency number is recorded in the respective record item)

如果为了要处理陌生单词的方便,一般还需建立词根数据库,单词一般可看作是由词根(包括有词基,前缀,后缀)组成的,对于英语来说,常见的词基如sit,put,fuse等,常见的前缀如dis,un等,常见的后缀如able,er等,而单词的发音,可看作是由词根发音(辅以元辅音)组合起来的,所以可建立这样的词根数据库,收录常用的词根和元辅音,并记录它们的发音信息数据(比如由哪些音标编号组成一个词根的发音,各元音辅音各自对应哪个音标编号),以用于表示陌生单词的编码信息,常用词根一般有1,2千个。(本段所说的词根,元辅音显然都是音码)For the convenience of dealing with unfamiliar words, it is generally necessary to establish a root database. Words can generally be regarded as being composed of roots (including bases, prefixes, and suffixes). For English, common bases such as sit and put , fuse, etc., common prefixes such as dis, un, etc., common suffixes such as able, er, etc., and the pronunciation of a word can be regarded as a combination of root pronunciation (supplemented by vowels and consonants), so such a root can be established The database includes commonly used roots and vowel consonants, and records their pronunciation information data (such as which phonetic symbols numbers form the pronunciation of a word root, and which phonetic symbol numbers correspond to each vowel and consonant), so as to be used to represent the coding information of unfamiliar words, Commonly used roots generally have 1,2 thousand. (The roots mentioned in this paragraph, vowels and consonants are obviously phonetic codes)

(欧洲的其它语种也类似,比如德语,词neulich就是由词基neu加后缀lich组合成的,所以欧洲的其它语种编码时的处理也与英语类似,可建立词根数据库。)(other languages in Europe are also similar, such as German, the word neulich is formed by base neu plus suffix lich, so the processing of other languages in Europe is also similar to English when encoding, and a root database can be established.)

编码的数据分成以下顺序排列的段(对编码的数据进行分段,也可认为其是属于本编码规则中的一个编码细则):The coded data is divided into segments arranged in the following order (segmenting the coded data can also be considered as a coding rule in this coding rule):

头段:记录本编码语句所属语种,本编码的单词段所用词段编码方式,本编码语句中的单词及单词组合个数等信息(头段里的各信息不是必需的可以都不记录或记录部分信息,比如单词及单词组合个数是可以不记录的,而在用频率关联混合方式对单词段编码时,如在头段里记录单词及单词组合个数可有助于解码,再比如如果某一本书里的全部码字涉及播放的语音都属于一个语种,在这本书的第一页面有一个特殊码字携带了本书的语种种类的信息,那这本书其余页面上的各码字里的头段就可不带其编码语句所属语种的信息,在读者使用时,要求读者先用拾音器读第一页面的特殊码字取得语种种类信息比如英语,然后其它页面的码字的值就按约定是按英语来对其解码处理,所以其它页面的各码字的头段里就不需记录本编码语句所属语种;甚至如果编解码都默认使用频率排序编码方式,那头段里就不需再记录本编码的单词段用哪种词段编码方式;所以如头段里的各信息都省略的话,编码就可省略头段。在头段里哪些信息需记录哪些信息不需记录,编码规则里需约定好;对于头段里约定好的需记录的每个信息可以各自固定分配一个或多个BIT,本编码规则及编解码举例中约定头段里记录有本编码语句属于什么语种和本处理单位的单词段使用何种词段编码方式这两个信息,每个信息是各自固定分配2个BIT来记录的)。在后面的以英语为例的编解码举例中,对头段用的一些信息是这样定义的:用两个BIT来表示本编码语句所属语种的信息,用00代表英语,用01代表汉语;用两个BIT来表示词段编码方式信息,用00代表频率排序编码方式,用01代表频率关联混合编码方式,用10代表关联编码方式,用11代表整句差别编码方式。Header: record the language to which this encoded sentence belongs, the word segment encoding method used in this encoded word segment, the words in this encoded sentence and the number of word combinations and other information (each information in the header is not necessary and can not be recorded or recorded Some information, such as the number of words and word combinations, may not be recorded. When encoding word segments in a frequency-associated mixed manner, recording the number of words and word combinations in the header can help decoding. For example, if All the codewords in a certain book involve the playback of voices belonging to one language, and there is a special codeword on the first page of the book that carries the information of the language type of the book. The header in the code word may not carry the information of the language of the coded sentence. When the reader uses it, the reader is required to use the pickup to read the special code word on the first page to obtain the language type information such as English, and then the value of the code word on other pages According to the agreement, it is decoded in English, so there is no need to record the language of this coded statement in the header of each codeword on other pages; even if the codec uses the frequency sorting encoding method by default, then the header will be It is not necessary to record which word segment encoding method is used for the word segment of this encoding; so if each information in the header is omitted, the encoding can omit the header. Which information needs to be recorded in the header and which information does not need to be recorded, It needs to be agreed in the coding rules; for each information that needs to be recorded in the header section, one or more BITs can be fixedly allocated respectively. In this coding rule and codec example, it is agreed that the header section records which language the coding statement belongs to. and the word segment encoding method used by the word segment of this processing unit, each information is fixedly assigned 2 BITs to record). In the following encoding and decoding examples taking English as an example, some information used in the header is defined as follows: use two BITs to represent the information of the language to which the coded sentence belongs, use 00 to represent English, and 01 to represent Chinese; A BIT is used to represent the word segment coding method information, 00 is used to represent the frequency sorting coding method, 01 is used to represent the frequency-associated mixed coding method, 10 is used to represent the associated coding method, and 11 is used to represent the whole sentence difference coding method.

单词段:这段的数据是对语句中的全部单词进行编码后的主要数据(对于语句中的单词组合,可当成与单词同等对待处理)。Word section: the data in this section is the main data after encoding all the words in the sentence (for word combinations in the sentence, they can be treated as the same as words).

时态词形选择段:是对语句中的各单词及单词组合的词形选择。Tensile word form selection segment: It is the word form selection for each word and word combination in the sentence.

陌生单词段:是先在单词段里用111111作引导,本段再放置陌生单词的具体的编码信息。Unfamiliar word section: first use 111111 in the word section as a guide, and then place the specific coding information of the unfamiliar word in this section.

前后调用段:是先在单词段里用111作引导,本段再放置往前调用或往后调用的单词的距离等信息。Before and after the calling section: first use 111 in the word section as a guide, and then place information such as the distance of the words called forward or backward in this section.

把这些段的数据集合起来(如按顺序排组起来)即成语句的编码值,可被拾音器解码播出语句发音,可把上述各段集合起来(如顺序排列起来)定义为一个处理单位(包括头段,单词段,时态词形选择段,陌生单词段,前后调用段)。(对一个处理单位,其中的单词段是必需的,其它段可能省略不用,比如某被编码语句的所有各单词及单词组合的词形形态都只有一个,那其处理单位里就没有时态词形选择段,再比如某被编码语句无陌生单词,那其处理单位里就没有陌生单词段,前后调用段也是可能有,可能没有的)。The data of these sections are assembled (as arranged in sequence) to form the coded value of the sentence, which can be decoded by the pickup to broadcast the pronunciation of the sentence, and the above-mentioned sections can be assembled (as arranged in order) and defined as a processing unit ( Including head section, word section, tense word form selection section, unfamiliar word section, front and back call section). (for a processing unit, the word segment wherein is necessary, and other segments may be omitted, such as all words and word combinations of a coded sentence have only one morphological form, then there is no tense word in the processing unit If there is no unfamiliar word in a coded sentence, then there is no unfamiliar word segment in its processing unit, and there may be or may not be in the calling segment before and after).

语句可以是一个句子或一个句子的部分,语句也可以是含有多个句子,一个处理单位对应一个语句,表示语句发音的拼合信息的值就是一个处理单位的值,此处理单位的值可被拾音器解码播出一个语句的发音,一个码字的值可以有一个表示语句发音的拼合信息的值。A sentence can be a sentence or a part of a sentence, and a sentence can also contain multiple sentences. A processing unit corresponds to a sentence, and the value of the combined information representing the pronunciation of a sentence is the value of a processing unit. The decoding broadcasts the pronunciation of a sentence, and the value of a codeword can have a value representing the combined information of the pronunciation of the sentence.

本说明书涉及的英语语种的编解码举例主要是这样的:语句就是一个中间无逗号的句子,一个处理单位对应一个语句,表示语句发音的拼合信息的值就是一个处理单位的值。The encoding and decoding examples of the English language involved in this manual are mainly as follows: a sentence is a sentence without a comma in the middle, a processing unit corresponds to a sentence, and the value of the combined information indicating the pronunciation of the sentence is the value of a processing unit.

词段编码方式主要是用于单词段的编码的,一般有以下几种(其中频率排序方式本身无压缩作用,但其使用的频率编号可以有压缩作用,其它的频率关联混合方式,关联方式,整句差别方式等词段编码方式本身是有压缩作用的):The word segment encoding method is mainly used for the encoding of word segments, and generally has the following types (the frequency sorting method itself has no compression effect, but the frequency number used by it can have a compression effect, other frequency association mixing methods, association methods, The word segment coding methods such as the whole sentence difference method itself have a compression effect):

频率排序方式:Sort by frequency:

编码时,先在单词段里的最前面记录单词段里含有的单词及单词组合的个数信息,语句都是由单词及单词组合顺序组成的,对语句里的单词及单词组合,全部换成其频率编号,并且每个频率编号前都带一个2BIT的长度信息(取值为00,01,10,11,分别对应频率编号的BIT长度为3,6,9,12),以表示其频率编号的BIT长度是3,6,9,12中的哪一个。然后把这些长度信息,频率编号按与其对应的单词及单词组合在语句中的顺序排在单词段里即可。这种频率排序编码方式及其对应的频率排序解码方式本身无压缩作用,但涉及使用的单词及单词组合的频率编号是有压缩作用的(比如用一个频率编号来表示一个含5个字母的单词,需用2BIT的此频率编号的长度信息加12BIT的此频率编号共14个BIT,而如采用常规文本记录方式用5个7BIT长的ASCII字符来表示同一个含5字母的单词需用5*7=35个BIT,显然频率编号有压缩作用)。When encoding, first record the number information of the words and word combinations contained in the word segment at the front of the word segment. The sentences are composed of words and word combination sequences. For the words and word combinations in the sentence, all are replaced with Its frequency number, and each frequency number is preceded by a 2BIT length information (the value is 00, 01, 10, 11, and the BIT length corresponding to the frequency number is 3, 6, 9, 12) to indicate its frequency Which one of 3, 6, 9, 12 is the BIT length of the number. Then arrange the length information and the frequency number in the word segment according to the order in which the words corresponding to it and the words are combined in the sentence. This frequency sorting encoding method and its corresponding frequency sorting decoding method itself have no compression effect, but the frequency numbers involved in the used words and word combinations have a compression effect (such as using a frequency number to represent a word containing 5 letters , it needs to use the length information of the frequency number of 2BIT plus the frequency number of 12BIT for a total of 14 BITs, and if the conventional text recording method is used to use 5 ASCII characters of 7BIT length to represent the same 5-letter word, 5* 7=35 BITs, obviously the frequency code has a compression effect).

举例:语句为:The man is a Chinese。For example: the sentence is: The man is a Chinese.

编码的各BIT如下:The encoded BITs are as follows:

00101:在单词段的最前面记录单词段里的单词及单词组合的个数信息,共5个00101: Record the number of words and word combinations in the word segment at the front of the word segment, a total of 5

再将语句里的各单词及单词组合换成其频率编号,每个频率编号前面附带2个BIT的此频率编号的长度信息:Then replace each word and word combination in the sentence with its frequency number, and each frequency number is preceded by 2 BIT length information of this frequency number:

00:the单词的频率编号的长度信息00: Length information of the frequency number of the word

010:the单词的频率编号010: The frequency number of the word

10:man单词的频率编号的长度信息10: The length information of the frequency number of the man word

011100100:man单词的频率编号011100100: the frequency number of the word man

01:is单词的频率编号的长度信息01: The length information of the frequency number of the is word

010110:is单词的频率编号010110: the frequency number of the word is

00:a单词的频率编号的长度信息00: length information of the frequency number of a word

101:a单词的频率编号101: frequency number of a word

10:Chinese单词的频率编号的长度信息10: The length information of the frequency number of the Chinese word

011001101:Chinese单词的频率编号011001101: Frequency number of Chinese word

再把以上各BIT按顺序合起来,就得到编码值:Then combine the above BITs in order to get the encoded value:

001010001010011100100010101100010110 011001101001010001010011100100010101100010110 011001101

这个编码值是单词段的值,时态词形选择段的编码处理原理在后面再述,本举例没涉及时态词形选择段。为了使举例简洁,本举例也没涉及头段,头段里一般含有语种信息以及词段编码方式选择信息等数据。This encoding value is the value of the word segment, and the encoding processing principle of the tense word form selection segment will be described later. This example does not involve the tense word form selection segment. In order to make the example concise, this example does not involve the header, which generally contains data such as language information and word segment encoding method selection information.

本编码举例涉及的都是单词(音码),对于在需编码的语句中有单词组合(音码)的情况与单词(音码)同等处理。What this coding example relates to is all words (phonetic sign indicating number), for the situation that word combination (phonetic sign indicating number) is arranged in the sentence that needs encoding, handle equally with word (phonetic sign indicating number).

(本词段编码方式里,各单词及单词组合的频率编号除了可以是本例示的阶梯编号外,还可以是等长编号或哈夫曼编号等;如为等长编号,哈夫曼编号那其前面不用带频率编号长度信息。)(in this speech segment encoding mode, the frequency numbering of each word and word combination can also be equal-length numbering or Huffman numbering etc. except the ladder numbering of this example; as for equal-length numbering, the Huffman numbering that It is not preceded by frequency number length information.)

频率关联混合方式:Frequency correlation mixing method:

本词段编码方式及其举例主要以用单词(音码)来阐述的,如有用单词组合(音码)的类似于单词(音码)处理。This speech segment encoding mode and example thereof are mainly set forth with words (sound codes), as the processing similar to words (sound codes) of useful word combinations (sound codes).

这是对频率排序(使用阶梯编号)编码方式的改进,由于完全用频率排序(使用阶梯编号)编码时,每个单词或单词组合的阶梯编号前面需带2个BIT的此阶梯编号的长度信息,如15个单词及单词组合的语句总共就要耗30个BIT,占位较多;可以结合关联的方式进行改进。具体做法为:This is an improvement on the encoding method of frequency sorting (using ladder numbering), because when fully encoding with frequency sorting (using ladder numbering), each word or combination of words needs to be preceded by 2 BITs of the length information of the ladder numbering , For example, a statement with 15 words and word combinations will consume a total of 30 BITs, occupying a lot of space; it can be improved by combining association methods. The specific method is:

本编码方式以下述以语句全由单词组成为例,语句中有单词组合的类似于对单词的处理,先把语句里的所有单词都转成阶梯编号并按原顺序排列,每个阶梯编号前不带此阶梯编号的长度信息,然后假如排列(即组合)起来的值(可命名为阶梯信息值)的二进制最长长度为84位(BIT),就在第21位,第42位,第63位处分别设点分节,在第21位处向后查与最近的单个单词阶梯编号的结束处的距离值(每增加3个位BIT,距离值加1)并记录下来(距离值占2个二进制位),并把分节处挪到这个阶梯编号结束处,对第42位,第63位处如法处理,这三个距离值就是三个分节处的偏移信息;这样就将阶梯信息值分成4节,每节长度大约为21位(当然如阶梯信息值比较短的话,那后面几节可能是空的没有单词),在每节里可能有1个或多个单词,每个单词的阶梯编号的长度可能为3,6,9,12位,但是每节里具体的第一个单词占几位,第二个单词占几位等,还是未知的。但由于分节后每节位数较少,所以可以对其阶梯编号分法进行穷举,对于每种阶梯编号长度可能分法情况都计算,计算这种长度分法可能情况时,本节的各单词之间的关联度;(假设某种分法下各频率编号对应的各单词正好就是原语句在本节位置的各单词时,可称这种分法为本义分法,否则是歧义分法);因为在一个句子中紧排在一起的几个单词或隔1,2个单词的单词,一般是互相有关联的,利用前述的单词关联机制,计算节里的单词间的关联度,然后比较每种分法的关联度,如关联度值最大的那种分法为本义分法,则本节可采用这种编法;如每节(各节BIT长度相同)都可采用这种编法,那本语句的编码就可采用频率关联混合方式;当在编码时发现分节后有某节的歧义分法的关联度值比本义分法的关联度值还大时,可调整基本节长,再试,如仍不行,就改另外的编码方式。This encoding method takes the following sentence as an example consisting of words. If there are word combinations in the sentence, it is similar to the processing of words. First, convert all the words in the sentence into step numbers and arrange them in the original order. Before each step number Without the length information of this ladder number, then if the binary longest length of the arranged (that is, combined) value (which can be named as the ladder information value) is 84 bits (BIT), it is at the 21st, 42nd, and The 63-bit place is respectively set up as a section, and the distance value at the end of the nearest single word ladder number is looked up at the 21st place (every increase of 3 bits BIT, the distance value is increased by 1) and recorded (the distance value accounts for 2 binary digits), and move the section to the end of the ladder number, and do the same for the 42nd and 63rd, these three distance values are the offset information of the three sections; thus Divide the ladder information value into 4 sections, the length of each section is about 21 bits (of course, if the ladder information value is relatively short, the next few sections may be empty without words), and there may be one or more words in each section, The length of the ladder number of each word may be 3, 6, 9, or 12 digits, but how many digits the first word occupies in each section, how many digits the second word occupies, etc., are still unknown. However, due to the small number of digits in each section after subdivision, it is possible to exhaustively enumerate its ladder number division method, and calculate the possible division method of each ladder number length. When calculating the possible length division method of this section, the The degree of association between each word; (assuming that each word corresponding to each frequency number under a certain division method is just in time each word of the original sentence in this section position, this division method can be called the original meaning division method, otherwise it is an ambiguity division method. method); because in a sentence, several words that are closely arranged together or words that are 1 or 2 words apart are generally related to each other, and the aforementioned word association mechanism is used to calculate the degree of association between words in the section. Then compare the correlation degree of each division method, if the division method with the highest correlation degree value is the original division method, then this section can adopt this compilation method; such as each section (each section BIT length is the same) can adopt this method If the coding method of this sentence can adopt the frequency correlation mixing method; when it is found that the correlation value of the ambiguous division method of a certain section is larger than the correlation value of the original meaning division method after the division, the basic division method can be adjusted. Section length, try again, if it still doesn't work, change another encoding method.

每节的基本节长(BIT)除了可以是21外,也可以是其它比如24,18等,在编码值中可放置基本节长的选择信息值(以下的编解码举例是以只有21或24两个基本节长选择值为例)。In addition to 21, the basic section length (BIT) of each section can also be other such as 24, 18, etc., and the selection information value of the basic section length can be placed in the coded value (the following codec example is based on only 21 or 24 Two basic section length selection values as an example).

这种词段编码方式需对每个分节处设2个二进制位的偏移信息,如分4节(每节长约21BIT),有3个分节处,需3*2=6位二进制位。This word segment encoding method needs to set 2 binary bits of offset information for each subsection, such as 4 subsections (each section is about 21BIT long), and there are 3 subsections, 3*2=6 binary bits are required bit.

举例:语句为:The fat man is an engineer。Example: The sentence is: The fat man is an engineer.

顺序排组如下BIT作为编码值:Order the following BITs as coded values:

01100:阶梯信息值的长度是12个单位(3个bit一个单位,本例有3*12个bit)01100: The length of the ladder information value is 12 units (3 bits per unit, this example has 3*12 bits)

0:每节基本节长(BIT)选为21(0:表示每节基本节长为21,1:表示每节基本节长为24)0: The basic section length (BIT) of each section is selected as 21 (0: indicates that the basic section length of each section is 21, 1: indicates that the basic section length of each section is 24)

010000:三个分节处的偏移信息(第21位是阶梯信息值中的is词的频率编号的中间,将此分节处挪到is词的频率编号末,并记录距离值为01,因阶梯信息值的长度不到42,所以后面第42位和第63位处的距离值是00和00,然后01与00与00合并成010000)010000: Offset information at the three sections (the 21st bit is the middle of the frequency number of the word is in the ladder information value, move this section to the end of the frequency number of the word is, and record the distance value as 01, Because the length of the ladder information value is less than 42, the distance values at the 42nd and 63rd bits are 00 and 00, and then 01, 00 and 00 are combined into 010000)

010:the单词的频率编号(也即阶梯编号,本例以下同)010: The frequency number of the word (that is, the ladder number, the same as in this example below)

001011:fat单词的频率编号001011: frequency number of fat word

011100100:man单词的频率编号011100100: the frequency number of the word man

010110:is单词的频率编号010110: the frequency number of the word is

      |\分节处|\section

101:an单词的频率编号101: frequency number of word an

010110010:engineer单词的频率编号010110010: the frequency number of the word engineer

把以上各BIT按顺序合起来,得编码值为(这个编码值是单词段的值,时态词形选择段的编码处理原理在后面述,本举例没涉及时态词形选择段;为了使举例简洁,本举例也没涉及头段,头段里一般含有语种信息以及词段编码方式选择信息等数据):The above BITs are put together in order, and the encoding value is obtained (this encoding value is the value of the word segment, and the encoding processing principle of the tense word form selection segment is described later, and this example does not involve the tense word form selection segment; in order to make The example is concise, and this example does not involve the header, which generally contains data such as language information and word segment encoding method selection information):

011000010000010001011011100100010110101010110010011000010000010001011011100100010110101010110010

至于这样编的编码值是否可以正常解码,要看下面的判断,如不能正常解码,可调整基本节长再试或换别的词段编码方式来对单词段进行编码:As for whether the coded value encoded in this way can be decoded normally, it depends on the following judgments. If it cannot be decoded normally, you can adjust the basic section length and try again or use another word segment encoding method to encode the word segment:

这里把单词段里的阶梯信息值分成两节:010001011011100100010110,101010110010Here, the ladder information value in the word segment is divided into two sections: 010001011011100100010110, 101010110010

如对于后一节101010110010,可分解为7种情况(阶梯编号组合情况):For example, for the latter section 101010110010, it can be decomposed into 7 situations (the combination of ladder numbers):

101,010,110,010(被逗号隔开成4个阶梯编号)101, 010, 110, 010 (separated by commas into 4 ladder numbers)

101,010,110010(被逗号隔开成3个阶梯编号,以下类推)101, 010, 110010 (separated by commas into 3 ladder numbers, and so on)

101,010110010101, 010110010

101,010110,010101, 010110, 010

101010110,010101010110, 010

101010,110,010101010, 110, 010

101010,110010101010, 110010

(本来还有第8种分法101010110010,即这12个BIT组成一个阶梯编号,但因这种情况时这节内只有一个阶梯编号,不能计算这节内各阶梯编号之间的关联度,所以如出现这种某节的所有BIT完全就是实际语句中这个位置的这个单词的阶梯编号的情况,可调整基本节长再试或放弃这种编法)(Originally there is an 8th division method 101010110010, that is, these 12 BITs form a ladder number, but in this case, there is only one ladder number in this section, and the correlation between the ladder numbers in this section cannot be calculated, so If it happens that all BITs in a certain section are exactly the ladder number of the word at this position in the actual sentence, you can adjust the basic section length and try again or give up this compilation method)

分别计算这7种分法的各单词(阶梯编号)间的关联度(本词段编解码方式所称某一节的某分法的各单词间的关联度与某分法的关联度是同一个概念),比如前一单词所属词类信息记录在后一单词对应的记录项(记录项是在单词数据库里的)的关联词类编号字段里(即前一单词与后一单词关联,比如draw属于第7个词类即动作词类,在单词数据库里man对应的记录项的关联词类编号字段里记录有7,所以draw与man关联),这种分法的关联度值就加1,其计算标准自定(一般是对一节的某分法的所以单词之间都要考察关联度,比如某一节的某分法是分成3个单词,可这样计算这节这种分法的关联度值:关联度值起始为0,如单词1与单词2关联,关联度值就加1,如单词1与单词3关联,关联度值再加1,如单词2与单词3关联,关联度值再加1),如关联度值最大的分法是本义分法,本节就可使用这种编法。Calculate the degree of association between each word (ladder numbering) of these 7 kinds of methods respectively a concept), such as the part of speech information of the previous word is recorded in the associated part of speech number field of the record item corresponding to the next word (the record item is in the word database) (that is, the previous word is associated with the next word, such as draw belongs to The 7th part of speech is the action part of speech, and there is 7 recorded in the associated part of speech number field of the record item corresponding to man in the word database, so draw is associated with man), the correlation value of this division method is just added 1, and its calculation standard is from (Generally, it is for a certain division of a section, so the correlation between words must be examined. For example, a certain division of a certain section is divided into 3 words, and the correlation value of this division of this section can be calculated as follows: The correlation value starts at 0. Ifword 1 is associated withword 2, the correlation value will be increased by 1. Ifword 1 is associated withword 3, the correlation value will be increased by 1. Ifword 2 is associated withword 3, the correlation value will be increased by 1. plus 1), if the division method with the largest correlation value is the original division method, this section can use this compilation method.

这里除了可以定量地计算其关联度外,还可以利用排除方法,比如在存储器集里存储一些信息数据库,指示在某些单词后面或前面不会出现某些单词,这样如果某种分法出现前后非法的搭配,即排除这种分法。比如the,jump这样的两单词一般不会在句子中前后紧挨在一起的,如出现the jump这样分法,即可判断为不正确的分法。另外也可记录单词类与单词类非法相邻的信息,比如食品类的单词与建筑类的单词挨在一起是非法等。如果单词数据库的记录项里有词性字段的话,那还可规定词性的非法搭配,比如两个冠词不能紧邻,不能前冠词后动词紧邻,一般对于单一词性的处理比较方便,对于多词性的可不考虑其词性的非法搭配;如果单词数据库的记录项里没有词性字段的话,就不考虑词性的非法搭配。In addition to quantitatively calculating its relevance, exclusion methods can also be used, such as storing some information databases in the memory set, indicating that certain words will not appear after or in front of certain words, so that if a certain division method appears before and after Illegal collocation means excluding this division. For example, two words such as the and jump are generally not close together in a sentence. If there is a division like the jump, it can be judged as an incorrect division. In addition, the information that words are illegally adjacent to each other can also be recorded. For example, it is illegal for words of food and words of construction to be next to each other. If there is a part-of-speech field in the record item of the word database, it can also stipulate the illegal collocation of parts of speech. For example, two articles cannot be adjacent to each other, and the verb after the preceding article cannot be adjacent to each other. Generally, it is more convenient to deal with a single part of speech, and for multi-part of speech Illegal collocations of parts of speech may not be considered; if there is no part of speech field in the record item of the word database, illegal collocations of parts of speech will not be considered.

对于单词段里的阶梯信息值的某节的各种分法,就可先排除有这些非法情况的分法,然后再计算余下的各分法的关联度,如关联度值最大的分法是本义分法即表示本节可采用这种编法。第1节完后再处理第2节直至完,如每节都可采用这种编法,则此语句编码的词段可采用这种编法,如有某一节关联度值最大的分法是歧义分法,可调整基本节长再试或可放弃这种编法(当然调整基本节长后,又需重新考察每节是否都可用这种编法)。(在以上述及频率关联混合编码方式时提及的‘这种编法’,是指频率关联混合方式)。For the various divisions of a certain section of the ladder information value in the word segment, the divisions with these illegal situations can be excluded first, and then the correlations of the remaining divisions are calculated. For example, the division with the largest correlation value is The original meaning of the method means that this section can adopt this method. After the 1st section, the 2nd section is processed until the end. If this method can be used in each section, the word segment encoded by this sentence can adopt this method. It is an ambiguous division method, and the basic section length can be adjusted to try again or this weaving method can be abandoned (of course, after adjusting the basic section length, it is necessary to re-examine whether each section can use this weaving method). ('this coding method' mentioned in the time of above-mentioned and frequency-correlation hybrid coding mode refers to the frequency-correlation hybrid mode).

对于其中有单词组合的,可取其前后单词(可在单词数据库里各单词组合对应的记录项里的拼法字段得到,由单词的拼法就可在单词数据库里找到此单词对应的记录项),再与旁边的单词计算关联度及判断是否非法。For those with word combinations, the words before and after it can be selected (can be obtained in the spelling field in the record item corresponding to each word combination in the word database, and the record item corresponding to this word can be found in the word database by the spelling of the word) , and then calculate the degree of association with the words next to it and judge whether it is illegal.

本词段编码方式,涉及的频率编号是阶梯编号。In this word segment encoding method, the frequency numbers involved are ladder numbers.

在叙述频率关联混合编解码方式时,经常提及单词之间的关联,单词的词性等,实际上是通过阶梯编号到单词数据库等数据库里查此阶梯编号对应的记录项,再在记录项里取出此阶梯编号(即单词)的关联关系,词性等信息。When describing the frequency-associated mixed codec method, the association between words and the part of speech of the word are often mentioned. In fact, the record item corresponding to the ladder number is checked in the database such as the word database through the ladder number, and then in the record item Take out information such as the association relationship and part of speech of this ladder number (ie word).

关联方式:Association:

本词段编码方式及其举例主要以用单词(音码)来阐述的,如有用单词组合(音码)的类似于单词(音码)处理。This speech segment encoding mode and example thereof are mainly set forth with words (sound codes), as the processing similar to words (sound codes) of useful word combinations (sound codes).

这种编码法是利用单词的关联性来进行压缩编码的,比如名词动词之间的搭配,形容词对名词的修饰都属于关联性,一个名词与其搭配的动词数量是有限的,比如tree,搭配的动词可以是stand,但一般不会是say这样的动词,形容词副词的修饰类似,fat可以修饰人,但一般不会修饰book,利用这种有限性可以达到压缩的目的。具体处理方法:This encoding method uses the relevance of words to compress and encode, such as the collocation between nouns and verbs, and the modification of adjectives to nouns are all related. The number of verbs that a noun can match with is limited, such as tree, collocation The verb can be stand, but it is generally not a verb like say. Adjectives and adverbs are similarly modified. Fat can modify people, but generally it does not modify book. This limitation can be used to achieve the purpose of compression. Specific treatment methods:

语句由单词及单词组合组成,(本编码方式以下述以单词为例,有单词组合的类似于对单词的处理),对于语句,先将其单词分成几层,下面一层单词是对其紧邻的上面一层单词的修饰,如某单词其下面没有单词修饰它,它就处于最上层,示例:Sentences are composed of words and word combinations. (This encoding method takes words as an example below. Word combinations are similar to the processing of words). For sentences, the words are first divided into several layers. The modification of the upper layer of words, if there is no word below a word to modify it, it is at the top layer, for example:

有语句:The fat man make his brother angry。There is a sentence: The fat man make his brother angry.

分层为:Stratified as:

Figure A20061020128900571
Figure A20061020128900571

the和fat都是修饰man的,brother和angry有关联关系,his修饰brother,make没有与其它单词的修饰关系,处于最上层。共3层Both the and fat modify man, brother and angry are associated, his modifies brother, make has no modification relationship with other words, and is at the top. 3 floors

编码如下:(这里为了方便注解,采用分行的方式写)The code is as follows: (here, for the convenience of annotation, it is written in a branch way)

00110:语句中的单词个数(这里5个二进制位限制一语句最多32个单词)00110: the number of words in the sentence (here 5 binary bits limit a sentence to a maximum of 32 words)

00010000100100:7个单词,bit为00表示处于第1层,bit为01表示处于第2层,bit为00010000100100: 7 words, the bit is 00 means it is in the first layer, the bit is 01 means it is in the second layer, and the bit is

10表示处于第3层,而且从这14个bit可以看见总语句某个位置的单词是处于某层单词的第几个,以及每层各共有多少单词10 indicates that it is in the third layer, and from these 14 bits, it can be seen that the word at a certain position in the total sentence is in a certain layer of words, and how many words there are in each layer

00:the单词的频率编号的长度信息00: Length information of the frequency number of the word

010:the单词的频率编号010: The frequency number of the word

10:man单词的频率编号的长度信息10: The length information of the frequency number of the man word

011100100:man单词的频率编号011100100: the frequency number of the word man

01:make单词的频率编号的长度信息01: length information of the frequency number of the make word

010111:make单词的频率编号010111: frequency number of the word make

10:angry单词的频率编号的长度信息10: The length information of the frequency number of the angry word

010110011:angry单词的频率编号010110011: frequency number of angry word

01:fat单词是对上层(1层)第2个单词man的修饰01: The word fat is a modification of the second word man in the upper layer (1st layer)

10:fat单词是man单词第2个关联类(相貌)。(这里的bit长度据上层单词man的关联类个数自己变)10: The fat word is the second association class (appearance) of the man word. (The bit length here is changed according to the number of association classes of the upper word man)

0011:fat单词在相貌类中的序号(这里的bit长度也据类里的单词个数自己变)0011: The serial number of the fat word in the appearance class (the bit length here also changes according to the number of words in the class)

11:brother单词是对上层(1层)第4个单词angry的关联,这里使用2位二进制,如果上层单词超4个,就需使用更多位二进制表达11: The word brother is associated with the fourth word angry in the upper layer (layer 1). Here, 2-bit binary is used. If there are more than 4 words in the upper layer, more binary bits need to be used for expression

11:brother单词属于angry单词第3个关联类(人)。(这里的bit长度据上层单词angry的关联类个数自己变)11: The word brother belongs to the third association category (person) of the angry word. (The bit length here is changed according to the number of associated classes of the upper word angry)

0110:brother单词在人类中的序号(这里的bit长度也据类里的单词个数自己变)0110: The serial number of the brother word in humans (the bit length here also changes according to the number of words in the class)

01:his单词是对上层(2层)第2个单词brother的修饰,这里使用2位二进制,如果上层单词超4个,就需使用更多位二进制表达01: The word "his" is a modification of the second word "brother" in the upper layer (layer 2). Here, 2-bit binary is used. If there are more than 4 words in the upper layer, more binary bits need to be used.

010:his单词属于brother单词第2个关联类(物主代词类)。(这里的bit长度据上层单词brothe的关联类个数自己变)010: The word his belongs to the second association category of the brother word (possessive pronoun category). (The bit length here is changed according to the number of association classes of the upper word brothe)

0101:his单词在相貌类中的序号(这里的bit长度也据类里的单词个数自己变)0101: The serial number of his word in the appearance class (the bit length here also changes according to the number of words in the class)

由于在00010000100100中包含了各层个数信息,位置信息,所以不再需附加其它bit,即可正确解读。Since 00010000100100 contains information on the number and location of each layer, it can be interpreted correctly without adding other bits.

把以上各BIT按顺序合起来得编码值为:0011000010000100100000101001110010001010111100101100110110001111110110010100101Combine the above BITs in order to get the coded value: 0011000010000100100000101001110010001010111100101100110110001111110110010100101

(此编码值是单词段的值,时态词形选择段的编码处理原理在后面述,本举例没涉及时态词形选择段;为了使举例简洁,本举例也没涉及头段,头段里一般含有语种信息以及词段编码方式选择信息等数据)(this encoding value is the value of the word segment, and the encoding processing principle of the tense word form selection segment is described later, and this example does not involve the tense word form selection segment; in order to make the example succinct, this example also does not involve the head section, the head section It generally contains data such as language information and word segment coding method selection information)

下层的单词,如果其本身频率编号的长度很短,可以直接放在最高层,比如单词a。The word of the lower layer, if the length of its own frequency number is very short, can be directly placed in the highest layer, such as the word a.

对于包含较多单词的单词组合,由于一般没有与其它单词的关联,直接放在最高层。For word combinations that contain more words, they are placed directly at the highest level because they generally have no association with other words.

(本词段编码方式里,各单词及单词组合的频率编号除了可以是本例示的阶梯编号外,还可以是等长编号或哈夫曼编号等;如为等长编号,哈夫曼编号那其前面不用带频率编号长度信息。)(in this speech segment encoding mode, the frequency numbering of each word and word combination can also be equal-length numbering or Huffman numbering etc. except the ladder numbering of this example; as for equal-length numbering, the Huffman numbering that It is not preceded by frequency number length information.)

整句差别方式:The difference in the whole sentence:

本词段编码方式及其举例主要以用单词(音码)来阐述的,如有用单词组合(音码)的类似于单词(音码)处理。This speech segment encoding mode and example thereof are mainly set forth with words (sound codes), as the processing similar to words (sound codes) of useful word combinations (sound codes).

由于许多的外语教材,涉及的语音及示例,是在举例,所以同样或相似的句型经常出现,比如He is an engineer,应该是经常在初级教材中出现的语句,而与其相似的语句是Since many foreign language teaching materials, the pronunciation and examples involved are examples, so the same or similar sentence patterns often appear. For example, He is an engineer should be a sentence that often appears in elementary textbooks, and the similar sentence is

He is not an engineer,仅多了一个not;所以,可以先统计这些常用的范句(比如He is an engineer),比如有1千句范句,然后把这1千句集合起来,做成数据库(范句库),也放在存储器集里,对在范句库中的每条范句都顺序编号;在此范句库里记录一条范句的方法可以是:先记录范句里单词个数,再把范句里的各单词全换成其在单词数据库里对应的频率编号,然后依范句里各单词原来的顺序把频率编号排起来,(如所用频率编号是阶梯编号,前需有长度信息BIT),然后后面再附带各单词的词形选择信息,总之其记录的范句数据类似于频率排序编码方式。前述的He is an engineer就编入范句库,但He is not anengineer不编入范句库,当在做语句编码时,如遇到He is an engineer时,直接将其在范句库里的顺序号作为编码值即可;而如遇到he is not an engineer时,就先把he is anengineer这句范句的顺序号放前面,然后再在后面附上与其差别的信息,这种差别可能是在某几处加词了,可能是在某几处删词了,举例:假定一句的词数上限为32个,即用5个二进制位可表示各词的位置编号,附差别信息时,先附删词信息,即从原范句第几个词开始删,删几个词;再附增加词信息,即从第几个词位置开始增,增几个,接着附增加的词的频率编号长度信息及频率编号(注:这几句话涉及的‘几’显然可以是10以内或10或10以上)。He is not an engineer, there is only one more not; therefore, you can first count these commonly used model sentences (such as He is an engineer), for example, there are 1,000 model sentences, and then collect these 1,000 sentences to make a database (model sentence storehouse), also be placed in memory set, all sequential numbering to every model sentence in the model sentence storehouse; The method of recording a model sentence in this model sentence storehouse can be: first record word in the model sentence Then replace each word in the model sentence with its corresponding frequency number in the word database, and then arrange the frequency numbers according to the original order of each word in the model sentence, (if the frequency number used is a ladder number, it needs to There is length information BIT), and then the morphological selection information of each word is attached, in a word, the sample sentence data recorded is similar to the frequency sorting coding method. The aforementioned He is an engineer is compiled into the sample sentence database, but He is not an engineer is not compiled into the sample sentence database. When doing sentence coding, if you encounter He is an engineer, directly put it in the sample sentence database. The serial number can be used as the encoding value; and if you encounter he is not an engineer, put the serial number of the model sentence he is an engineer first, and then attach the difference information at the back. This difference may Words are added in some places, and words may be deleted in some places. For example: assuming that the upper limit of the number of words in a sentence is 32, that is, 5 binary bits can be used to represent the position number of each word. When attaching difference information, Add the word deletion information first, that is, delete from the first word in the original model sentence, and delete a few words; then attach the added word information, that is, increase from the first word position, add a few, and then attach the frequency of the added word Number length information and frequency number (Note: The 'several' involved in these few sentences can obviously be less than 10 or 10 or more).

至于某一步删词的位置或增词的位置的具体确定,是要累计前面各步的删词和增词情况的,比如范句是HE IS A MAN,需编码语句是HE IS NOT A FAT MAN,第1步对范句在第3的一个单词位置增一个单词NOT,得HE IS NOT A MAN,第2步确定要增词FAT的位置,就要以第1步增词后得到的语句为准而不是以开始的范句为准,第2步要增的单词是在第1步增词后的语句HE IS NOT A MAN中第5个单词的位置。本段所述对非印欧语系语种的整句差别编解码方式也同样有效。As for the specific determination of the position of deletion or addition of words in a certain step, it is necessary to accumulate the deletion and addition of words in the previous steps. For example, the model sentence is HE IS A MAN, and the sentence to be encoded is HE IS NOT A FAT MAN , the first step is to add a word NOT to the position of the third word in the model sentence to get HE IS NOT A MAN, and the second step is to determine the position of the word FAT to be added, and the statement obtained after the first step is added as It is not based on the sample sentence at the beginning. The word to be added in the second step is the position of the fifth word in the sentence HE IS NOT A MAN after the word is added in the first step. The entire sentence differential encoding and decoding methods of non-Indo-European languages described in this paragraph are also effective.

后面的时态词形选择段记录语句里(单词段里)所有单词的时态词形选择值(当然不需选择的如THE就不用记录),此处处理与其它编码方式同。The tense morphological selection value (certainly not needing to select as THE just need not record) in the tense morphological selection segment record sentence (in the word segment) all words, processing is the same with other coding modes here.

如果是仅对范句库里的某范句中的少量词改了时态,也可以以此方式先直接是范句地址,然后时态词形选择段接在后面(时态词形选择段的编码处理原理见后面述)。If only a small amount of words in a certain model sentence in the model sentence storehouse have changed the tense, it can also be the model sentence address directly in this way first, and then the tense word form selection section is connected at the back (the tense word form selection section The encoding processing principle is described later).

例:需编码句:The young man is not an  engineerExample: Sentence to be coded: The young man is not an engineer

此语句与在范句库里第0000100101句(第37句)相似:The fat man is an engineer。This sentence is similar to sentence 0000100101 (sentence 37) in the sample sentence library: The fat man is an engineer.

第1次删去第2个单词fatDelete the second word fat for the first time

第2次从第2个单词的位置,增一个单词youngFor the second time, from the position of the second word, add a word young

第3次从第5个单词的位置,增一个单词notFor the third time, from the position of the fifth word, add a word not

分行写:Branch write:

0000100101:所用范句在范句库中的顺序号(固定10个bit)0000100101: The sequence number of the sample sentence used in the sample sentence database (fixed 10 bits)

0:删词标志0: delete word flag

00001:从第2个单词开始删(如为11111,就结束编码)00001: Delete from the second word (if it is 11111, end the encoding)

001:删1个单词(一次删增数限制8个内或4个内等)001: Delete 1 word (the number of deletions and additions at a time is limited to 8 or 4, etc.)

1:增词标志1: add word sign

00001:从第2个单词的位置开始增00001: Increment from the position of the second word

001:增1个单词(一次删增数限制8个内或4个内等)001: Add 1 word (the number of deletions and additions at a time is limited to 8 or 4, etc.)

10:增的单词young的频率编号的长度信息10: The length information of the frequency number of the increased word young

010100110:增的单词young的频率编号010100110: The frequency number of the word young added

1:增词标志1: add word sign

00100:从第5个单词的位置开始增00100: Increment from the position of the fifth word

001:增1个单词001: add 1 word

01:增的单词not的频率编号的长度信息01: The length information of the frequency number of the increased word not

001101:增的单词not的频率编号001101: Increased frequency number of the word not

0:遇到0,结束编码0: Encounter 0, end encoding

把以上各BIT按顺序合起来得编码值(此编码值是单词段的值,时态词形选择段的编码处理原理在后面述,本举例没涉及时态词形选择段;为了使举例简洁,本举例也没涉及头段,头段里一般含有语种信息以及词段编码方式选择信息等数据):The above BITs are combined in order to get the coded value (this coded value is the value of the word segment, and the coding processing principle of the tense word form selection segment is described later, and this example does not involve the tense word form selection segment; in order to make examples succinct , this example does not involve the header, which generally contains data such as language information and word segment encoding selection information):

0000100101000001001 100001001100101001101001000010100110100000100101000001001 10000100110010100110100100001010011010

如把含多个句子的语句也录入范句库,那就可以对多句子的语句编码,处理类似于单句子的语句。If the sentence that contains multiple sentences is also entered into the sample sentence database, then the sentence encoding of multiple sentences can be processed similar to the sentence of a single sentence.

(本词段编码方式里,各单词及单词组合的频率编号除了可以是本例示的阶梯编号外,还可以是等长编号或哈夫曼编号等;如为等长编号,哈夫曼编号那其前面不用带频率编号长度信息。)(in this speech segment encoding mode, the frequency numbering of each word and word combination can also be equal-length numbering or Huffman numbering etc. except the ladder numbering of this example; as for equal-length numbering, the Huffman numbering that It is not preceded by frequency number length information.)

显然,在编码生成一个处理单位的单词段时,4种词段编码方式(频率排序方式,频率关联混合方式,关联方式和整句差别方式)是只选其一的。除这4种词段编解码方式(频率排序方式,频率关联混合方式,关联方式和整句差别方式)外,还可以使用本发明文件没提及的其它词段编解码方式来对单词段进行编解码处理。Apparently, when coding and generating a word segment of a processing unit, only one of the four word segment encoding methods (frequency sorting, frequency-association mixing, association and whole sentence difference) is selected. Except these 4 kinds of word segment encoding and decoding modes (frequency sorting mode, frequency association mixed mode, association mode and whole sentence difference mode), can also use other word segment encoding and decoding modes that the document of the present invention does not mention to carry out word segment Codec processing.

前后调用处理:Call processing before and after:

在对单词段进行编码时,前后调用处理是与前几种词段编码方式中的某一种合用的。When word segments are encoded, the forward and backward call processing is used in conjunction with one of the preceding several word segment encoding methods.

对于紧邻的几个句子,里面可能有一些比较偏僻的单词或单词组合被重复调用,而其本身的频率编号较长,这时可以使用前后调用处理,(下面以单词为例,对单词组合的处理是与单词的处理类似的),即,例如当某单词出现在前面句子时,使用频率编号,而当紧邻一句子又出现这单词时,就可以将指针指到前面的单词的位置,而不必用其较长的频率编号;具体可以在频率编号中占用一个111,当出现与前面句子中或后面句子中或本句子前面或本句子后面的某单词相同的单词时,就在单词段里用111代替此当前单词,然后在整个编码的前后调用段里附上往前调或往后调的标志,并附上5个bit来表示当前单词位置与前面或后面被调用单词位置的距离。(为了处理简洁,一般需要求当前单词与被调用的单词的词形时态都相同)For the next few sentences, there may be some relatively remote words or word combinations that are repeatedly called, and their frequency numbers are relatively long. The processing is similar to the processing of words), that is, for example, when a certain word appears in the previous sentence, the frequency number is used, and when the word appears again in the next sentence, the pointer can be pointed to the position of the previous word, and It is not necessary to use its longer frequency number; specifically, a 111 can be used in the frequency number, and when a word identical to a word in the previous sentence or in the following sentence or in the front of the sentence or in the back of the sentence appears, it is in the word segment Replace this current word with 111, then add the sign of forward or backward transfer in the front and rear call segments of the whole encoding, and attach 5 bits to represent the distance between the current word position and the front or back called word position. (In order to deal with brevity, it is generally required that the word form and tense of the current word and the called word are the same)

显然如有使用前后调用处理的条件(即比如语句的前后有相同的单词),但编码时也可不使用前后调用处理。Obviously, if there is a condition for using front and back call processing (that is, for example, the front and back of the statement have the same word), but the front and back call processing may not be used during encoding.

陌生单词的处理:Handling of unfamiliar words:

由于单词数据表里的单词的数量总是有限的,所以随时会遇到陌生单词,所以在编解码规则里一般需处理陌生单词;英语的陌生单词的发音构成一般可看成为几种(其它印欧语系语种与英语是类似的):一种是由元辅音的发音组成的,一种是主要由词根的发音组成的,比如commitment的发音由com和mit和ment这几个词根的发音组成,一种是主要由常用短单词的发音组成的,比如loudspeaker的发音由短单词loud和短单词speaker的发音组成。Because the quantity of the word in the word data table is always limited, so can run into strange word at any time, so generally need to deal with strange word in codec rule; European languages are similar to English): one is composed of the pronunciation of vowels and consonants, and the other is mainly composed of the pronunciation of roots. For example, the pronunciation of commitment is composed of the pronunciations of com, mit and ment. One is mainly composed of the pronunciation of commonly used short words, for example, the pronunciation of loudspeaker is composed of the pronunciation of the short word loud and the short word speaker.

所以陌生单词的处理可为,在单词的频率编号中,留一个频率编号专用于代替陌生单词,比如111111,此编号只出现在单词段中以代替陌生单词(显然在使用频率排序或整句差别等词段编码方式来生成单词段时,都可以在单词段里使用111111这个频率编号来代替语句中的陌生单词,并把陌生单词的具体编码信息附在陌生单词段里),然后陌生单词的具体编码信息附在陌生单词段里,陌生单词的编码信息的格式可分几种方式(以英语为例):Therefore, the processing of unfamiliar words can be, in the frequency number of words, reserve a frequency number dedicated to replace unfamiliar words, such as 111111, this number only appears in word segments to replace unfamiliar words (obviously in the use of frequency sorting or whole sentence difference When a word segment is generated by a word segment encoding method, the frequency number 111111 can be used in the word segment to replace the strange word in the sentence, and the specific coding information of the strange word is attached to the strange word segment), and then the strange word The specific encoding information is attached in the strange word segment, and the format of the encoding information of the unfamiliar word can be divided into several ways (taking English as an example):

1:元辅音作组合:陌生单词的编码信息,完全由元辅音组成,陌生单词的编码信息就是:开始4个bit表示组成陌生单词发音的元辅音个数,接着是组成陌生单词发音的各元辅音的编号(即音标编号),每个元辅音的编号(即音标编号)占6bit,解码时,要发陌生单词的音时,直接把组成陌生单词发音的各元音辅音的语音(也即音标的语音)顺次发出即可。1: Combination of vowels and consonants: the encoding information of unfamiliar words is completely composed of vowels and consonants. The encoding information of unfamiliar words is: the first 4 bits indicate the number of vowels and consonants that make up the pronunciation of unfamiliar words, followed by the elements that make up the pronunciation of unfamiliar words The numbering of the consonants (i.e. the phonetic numbering), the numbering (i.e. the phonetic numbering) of each vowel and consonant accounts for 6 bits, and when decoding, when the sound of an unfamiliar word is to be pronounced, the voice of each vowel and consonant that forms the pronunciation of the unfamiliar word (i.e. The pronunciation of the phonetic symbol) can be issued in sequence.

2:词根元辅音组合:陌生单词的编码信息主要由词根(包括词基,前缀,后缀)和元辅音组成,如此须先建立词根与元辅音的混合库(词根数据库),并对各词根与元辅音进行编号(即赋予顺序号,词根与元辅音可以统一编号),词根数据库里还需记录各词根的发音信息数据(这里的发音信息数据是指比如由哪些音标编号组成一个词根的发音),词根数据库里还需记录各元辅音的发音信息数据(比如其对应的音标编号);这样陌生单词的编码信息就是:开始4个bit表示组成陌生单词发音的词根和元辅音的总个数(词根可能有零个或一个或多个,元辅音也可能有零个或一个或多个),接着是组成陌生单词发音的各词根和元辅音的顺序号,每个词根或元辅音的顺序号占10bit(假设词根加上元辅音的总数为1千个,所以每个词根或元辅音的顺序号分配10个BIT)。解码时,要发陌生单词的音时,直接把组成陌生单词发音的各词根和元辅音的语音顺次发出来即可,其中元辅音的语音就是音标的语音,词根的语音也是由几个音标的语音组成的。2: root vowel and consonant combination: the coding information of unfamiliar words is mainly made up of word root (comprising word base, prefix, suffix) and vowel and consonant, so must establish the mixed library (root database) of word root and vowel consonant earlier, and each root and The vowels and consonants are numbered (promptly given a sequence number, the root and the vowels and consonants can be numbered uniformly), and the pronunciation information data of each root also needs to be recorded in the root database (the pronunciation information data here refers to the pronunciation of a root composed of phonetic symbols, for example) , also need to record the pronunciation information data of each vowel consonant in the root database (such as its corresponding phonetic symbol number); the coding information of such unfamiliar words is: the first 4 bits represent the total number of radicals and vowel consonants that form the pronunciation of unfamiliar words ( There may be zero or one or more roots, and zero or one or more vowels), followed by the sequence number of each root and vowel consonant that make up the pronunciation of the unfamiliar word, and the sequence number of each root or vowel consonant Occupies 10 bits (assuming that the total number of roots plus consonants is 1,000, so each root or sequence number of consonants is assigned 10 BITs). When decoding, when you want to pronounce the sound of an unfamiliar word, you can directly send out the sounds of the roots and vowels and consonants that make up the pronunciation of the unfamiliar word in sequence. composed of sounds.

3:差别式:找与本陌生单词相似读音的单词数据库里的单词,然后记录与其读音差别(删增音);这样陌生单词的编码信息就是:开始12bit是单词数据库里的单词的寻址地址,接着0,表示删,接着是删的起始位置(即从单词的发音信息数据的左起第几个音标编号开始删),删几个音标编号,还可以接着再是0,再删;接着1,开始增,接着是增的起始位置(即从单词的发音信息数据的左起第几个音标编号位置开始增),增几个音标编号,接着是各音标的编号(6bit),还可以接着再是1,再增,最后以0结束。解码时,要发陌生单词的音时,直接把组成陌生单词发音的各音标的语音顺次发出来即可。3: Difference method: find the word in the word database with similar pronunciation to the unfamiliar word, and then record the difference between the pronunciation (delete and add pronunciation); the coding information of the unfamiliar word is: the first 12bit is the address address of the word in the word database , followed by 0, representing deletion, followed by the starting position of deletion (that is, the number of phonetic symbols from the left of the pronunciation information data of the word is deleted), and several phonetic symbols are deleted, and then 0 can be deleted again; Then 1, start to increase, then the initial position of increase (that is, from the left of the pronunciation information data of the word, which phonetic symbol number position starts to increase), increase several phonetic symbol numbers, and then the numbering (6bit) of each phonetic symbol, It can also be followed by 1, increased again, and finally ended with 0. During decoding, when the sound of an unfamiliar word is to be pronounced, the voices of each phonetic symbol forming the pronunciation of the unfamiliar word are directly sent out in sequence.

4:短单词元辅音组合:陌生单词的编码信息由常用短单词和元辅音组成,如此须先建立常用短单词与元辅音的混合库(短单词库),并对各短单词与元辅音进行编号(即赋予顺序号,短单词与元辅音可统一编号),短单词库里还需记录各短单词的发音信息数据(这里的发音信息数据是指比如由哪些音标编号组成一个短单词的发音),短单词库里还需记录各元辅音的发音信息数据(比如其对应的音标编号);这样陌生单词的编码信息就是:开始4个bit表示组成陌生单词发音的短单词和元辅音的总个数,接着是组成陌生单词发音的各短单词和元辅音的顺序号,每个短单词或元辅音的顺序号占10bit(假设短单词加上元辅音的总数为1千个,所以每个短单词或元辅音的顺序号分配10个BIT)。解码时,要发陌生单词的音时,直接把组成陌生单词发音的各短单词和元辅音的语音顺次发出来即可,其中元辅音的语音就是音标的语音,短单词的语音也是由几个音标的语音组成的。4: Combination of short words and vowels and consonants: the coding information of unfamiliar words is composed of common short words and vowels and consonants, so a mixed library (short word library) of commonly used short words and vowels and consonants must be established first, and each short word and vowels and consonants should be Numbering (promptly giving sequence numbers, short words and vowels and consonants can be numbered uniformly), and the pronunciation information data of each short word also needs to be recorded in the short word library (the pronunciation information data here refers to the pronunciation of a short word formed by which phonetic symbols numbers, for example ), the pronunciation information data of each vowel and consonant (such as its corresponding phonetic symbol number) needs to be recorded in the short word library; the encoding information of such a strange word is: the first 4 bits represent the total number of short words and vowel consonants that make up the pronunciation of a strange word number, followed by the sequence numbers of each short word and consonant that make up the pronunciation of unfamiliar words, and the sequence number of each short word or consonant accounts for 10 bits (assuming that the total number of short words plus consonants is 1,000, so each 10 BITs are assigned to the sequence numbers of short words or vowel consonants). When decoding, when the sound of an unfamiliar word is to be pronounced, the voices of each short word and vowel consonant that make up the pronunciation of the unfamiliar word are directly sent out in sequence, wherein the voice of the vowel consonant is exactly the phonetic symbol, and the voice of the short word is also composed of several It is composed of phonetic symbols.

陌生单词的发音信息数据即是用元辅音,词根,单词,短单词(短单词也是单词)这些音码中的一种或多种音码来构成的。Pronunciation information data of unfamiliar word promptly is constituted with one or more phonetic codes in these phonetic codes of vowel and consonant, word root, word, short word (short word is also word).

在陌生单词的编码信息的前面有编码信息的格式选择信息,以下的英语的编解码举例中,涉及陌生单词的编码信息的格式选择时,以00(二进制)代表元辅音作组合,01代表词根元辅音组合,10代表差别式,11代表短单词元辅音组合。In front of the encoding information of unfamiliar words, there is format selection information of encoding information. In the following English encoding and decoding examples, when the format selection of encoding information involving unfamiliar words is selected, 00 (binary) represents the combination of vowels and consonants, and 01 represents the root The combination of vowels and consonants, 10 represents the differential formula, and 11 represents the combination of vowels and consonants of short words.

这样处理陌生单词后,不需再处理此陌生单词的时态词形的变化。After the unfamiliar word is processed in this way, it is not necessary to process the change of tense and word form of the unfamiliar word.

如单词段里没陌生单词不出现111111,那陌生单词段也不占BIT,这是显然的。If there are no unfamiliar words in the word segment and 111111 does not appear, then the unfamiliar word segment does not occupy BIT, which is obvious.

显然在一个单词段中可以有一个或多个陌生单词(一个单词段里对应含有多个111111),如一个单词段中有多个陌生单词,那各个陌生单词的具体编码信息(即比如由哪些元辅音或词根组成等信息)就按各陌生单词在语句里(单词段里)的先后顺序依次放在陌生单词段里即可。本段所述是技术常识。Obviously, there can be one or more unfamiliar words in a word segment (correspondingly containing multiple 111111 in a word segment), if there are multiple unfamiliar words in a word segment, then the specific coding information of each unfamiliar word (that is, for example, which Information such as vowel consonant or word root composition) just be placed in the unfamiliar word segment successively by each unfamiliar word in the sentence (in the word segment) sequentially. This paragraph is technical common sense.

当然在编码规则中也可以不处理陌生单词,比如如在读物中英语语句遇到陌生单词,此语句可不编码且读物的整页面上此语句旁不设码字,或者修改此语句内容,使语句中不再含有陌生单词,再对语句进行编码。Of course, unfamiliar words may not be processed in the coding rules. For example, if an English sentence encounters a strange word in a reading material, this sentence may not be coded and there is no code word next to this sentence on the entire page of the reading material, or modify the content of this sentence so that the sentence No longer contain unfamiliar words, and then encode the sentence.

以上12段所述是以只适用于英语为例的编码规则所采用的陌生单词处理方式,而且涉及到解码时陌生单词的发音处理是以解码规则是采用整步直接发音方式来播放语句发音为例的(也可适用于解码规则是采用分步直接发音方式来播放语句发音的);如解码规则采用分步直接发音方式来播放语句发音,那对于任一个陌生单词,在解码过程中得到的此陌生单词的发音信息数据(有关陌生单词的发音信息数据的具体阐述在后面解码时有述)一般应放在一个部分预备发音数据里(一般不会分开放在几个部分预备发音数据里)。对于印欧语系其它语种的陌生单词处理也可按英语这样处理。The above 12 paragraphs describe the unfamiliar word processing method that is only applicable to the coding rules of English as an example, and when it comes to decoding, the pronunciation processing of unfamiliar words is based on the fact that the decoding rule is to use the full-step direct pronunciation method to play the sentence pronunciation as (also applicable to the decoding rule is to adopt the step-by-step direct pronunciation mode to play the sentence pronunciation); as the decoding rule adopts the step-by-step direct pronunciation mode to play the sentence pronunciation, then for any unfamiliar word, the obtained in the decoding process The pronunciation information data of this unfamiliar word (the specific elaboration of the pronunciation information data of the unfamiliar word will be described later when decoding) should generally be placed in a part of the prepared pronunciation data (generally it will not be placed separately in several parts of the prepared pronunciation data) . It can also be handled in the same way as English for unfamiliar words in other languages of the Indo-European language family.

时态词形变化的处理:Handling of tense inflections:

前已述过,印欧语系各语种,单词是有时态,单复数变化的,各单词变形的个数是各异的,以下是英语的时态词形变化的处理(其它语种类似于英语,并且以下所述是以编解码涉及的全是单词为例的,对于单词组合的处理类似于单词的处理):把各单词的各种词形形态的发音信息数据都记录在单词数据表的这个单词对应的记录项里(并可以在单词数据表的此单词对应的记录项里专门开一个字段来表示本单词有多少个词形形态,也可采取其它方式来表示,其它方式比如每个单词对应的记录项里共开有10个词形形态发音信息数据字段,如某单词有3个词形形态,那就把这3个词形形态的发音信息数据分别记录在3个词形形态发音信息数据字段里,其它7个词形形态发音信息数据字段都空着,解码时,查这个记录项,得知有3个词形形态发音信息数据字段不是空的,就知道此单词有3个词形形态,这些方式都属于常识),编码时,查需编码语句中的各单词在单词数据表对应的记录项,如只有一个词形形态(只有原型而不存在变形)的,此单词的时态词形选择值就不占bit,有2个词形形态的,时态词形选择值就占1个bit,有34个词形形态的,就占2个bit,有58个词形形态的,就占3个bit,依此类推,时态词形选择值就是表示本编码语句的此单词的词形形态是处在单词数据表中此单词对应的记录项中此单词所有词形形态中的顺序第几个词形形态(比如某单词的占3个BIT的时态词形选择值就可表示此单词最多8个词形形态中的从第1至第8的顺序),把各单词的时态词形选择值依语句中各单词的顺序放在单词段后的时态词形选择段即可;As mentioned above, in the various languages of the Indo-European language family, words have tenses, singular and plural changes, and the number of inflections of each word is different. The following is the processing of tense and inflection changes in English (other languages are similar to English, And the following is an example of all words involved in encoding and decoding, and the processing of word combinations is similar to the processing of words): the pronunciation information data of the various morphological forms of each word are all recorded in this word data table In the record item corresponding to the word (and a field can be specially opened in the record item corresponding to this word in the word data table to indicate how many morphological forms the word has, or it can be expressed in other ways, such as each word There are a total of 10 morphological pronunciation information data fields in the corresponding record item. If a word has 3 morphological forms, then the pronunciation information data of these 3 morphological forms are respectively recorded in the 3 morphological pronunciation information fields. In the information data field, the other 7 morphological and pronunciation information data fields are all empty. When decoding, check this record item and know that there are 3 morphological and pronunciation information data fields that are not empty, so you know that this word has 3 Morphology, these methods all belong to common sense), during encoding, look up the record item corresponding to each word in the word data table in the encoding sentence, as only one morphological form (there is only prototype and no deformation), the word's The tense word form selection value does not occupy a bit. If there are 2 word forms, the tense word form selection value will occupy 1 bit. If there are 34 word forms, it will occupy 2 bits. There are 58 word forms Morphological, it occupies 3 bits, and so on, the tense morphological selection value means that the morphological form of this word in this coded sentence is all the morphological forms of this word in the record item corresponding to this word in the word data table The order of the form in the form (for example, the tense form selection value of a word that accounts for 3 BITs can represent the order from the 1st to the 8th in the maximum 8 form forms of this word), put The tense morphological selection value of each word can be placed in the temporal morphological selection segment after the word segment according to the order of each word in the sentence;

解码时,先解出单词段里的各频率编号(单词),再依次查各频率编号(单词)在单词数据表对应的记录项里的词形形态个数(可以在单词数据表的各单词对应的记录项里专门开一个字段来表示本单词有多少个词形形态,也可采取其它方式来表示,这是常识),得出各频率编号(单词)的时态词形选择值各占几位bit(对于象the这样只有一个词形形态而无词形变化的单词,在单词数据库里the词对应的记录项里只记录着the这个词只有一个其本身的词形形态而没有别的词形形态,那象the这样的单词的时态词形选择值就不占BIT,在时态词形选择段里也不占BIT),然后依次在时态词形选择段读取各频率编号(单词)的时态词形选择值(BIT值,象THE这样只有一个词形形态的单词就没有时态词形选择值),再依此读得的各时态词形选择值(BIT值)在单词数据表各频率编号(单词)对应的记录项里得到各频率编号(单词)实际用的词形形态。During decoding, first solve each frequency numbering (word) in the word segment, then look up the number of morphological forms of each frequency numbering (word) in the record item corresponding to the word data table (can be in each word of the word data table) A field is specially opened in the corresponding record item to indicate how many morphological forms the word has, and it can also be expressed in other ways, which is common sense), and the tense morphological selection values of each frequency number (word) are respectively accounted for. Several bits (for a word like the that has only one morphological form but no inflectional change, the word corresponding to the word in the word database only records that the word has only one morphological form of itself and nothing else Morphological form, the tense morphological selection value of the word like the does not account for BIT, and does not account for BIT in the tense morphological selection section), and then reads each frequency number in the tense morphological selection section successively (Word) tense morphological selection value (BIT value, the word that has only one morphological form like THE just does not have tense morphological selection value), and each tense morphological selection value (BIT value) that reads accordingly ) Obtain the actually used word form of each frequency number (word) in the record item corresponding to each frequency number (word) of the word data table.

对于单词组合仍然可以参照单词处理,比如‘IS A’单词组合,其变形是‘BE A’,’WAS A’,‘IS AN’,‘BE AN’,‘WAS AN’,所以这个单词组合在单词数据库对应的记录项里就有6个词形形态;然后其时态词形选择值的长度的确定,其时态词形选择值在时态词形段的排布等处理都与单词的处理相同,其解码时涉及的处理也与单词相同,(实际上就是把单词组合当成与单词同等对待),如在一个语句中含2个单词和1个单词组合,其在语句中的排列是‘单词1单词组合1单词2’,并且这2个单词和1个单词组合都有时态词形选择值,那在时态词形选择段里各时态词形选择值的排列就是:‘单词1的时态词形选择值单词组合1的时态词形选择值单词2的时态词形选择值’。For word combinations, you can still refer to word processing, such as the word combination of 'IS A', its deformation is 'BE A', 'WAS A', 'IS AN', 'BE AN', 'WAS AN', so this word combination is in There are 6 morphological forms in the record item corresponding to the word database; then the determination of the length of its tense morphological selection value, and the processing such as the arrangement of its tense morphological selection value in the tense morphological segment are all related to the word's The processing is the same, and the processing involved in its decoding is also the same as that of words, (in fact, the combination of words is treated as the same as words). For example, if a sentence contains 2 words and 1 word combination, its arrangement in the sentence is 'Word 1word combination 1 word 2', and these 2 words and 1 word combination have tense word form selection value, then the arrangement of each tense word form selection value in the tense word form selection section is: 'word A tense word choice value of 1 A tense word choice value of word combination A tense word choice value of word 2'.

按常理即可推知:对于有些单词有多种词性的,如既可作可数名词有单复数变化,又可作动词有时态变化,仍可用此方式,把此单词各种变化都排在单词数据库里此单词对应的记录项里统一处理,比如某单词,其原型词形形态有一个,作为名词其有1个复数的词形形态,作为动词有过去,将来共2个词形形态,那么在单词数据库的记录项里它的词形形态总数为4个,在时态词形选择段里要用2BIT来表示其时态词形选择值,只管它用什么词形,而不管其词性的改变。(显然,在对单词段编码时,如有使用单词组合的频率编号,比如‘ISA’的频率编号,那相应的在时态词形选择段里是要记录‘IS A’的时态词形选择值,而不是分开记录‘IS’和‘A’各自的时态词形选择值)It can be deduced according to common sense: For some words with multiple parts of speech, if they can be used as countable nouns with singular and plural changes, and as verbs with tense changes, this method can still be used to rank all the changes of this word in the word The records corresponding to this word in the database are processed uniformly. For example, a word has one prototype morphological form, as a noun, it has 1 plural morphological form, and as a verb, it has past and future morphological forms. Then In the record item of the word database, its morphological totality is 4, and 2BIT will be used to represent its tense morphological selection value in the tense morphological selection segment, no matter what morphological it uses, regardless of its part of speech Change. (Obviously, when encoding the word segment, if there is a frequency number using word combinations, such as the frequency number of 'ISA', then the corresponding tense word form of 'IS A' will be recorded in the tense word form selection segment option values instead of recording the respective tense-lemma option values for 'IS' and 'A' separately)

此段举例(这里是对单词举例,单词组合类似于单词的情况):Example of this paragraph (here is an example of words, the combination of words is similar to the situation of words):

如某语句用了单词’men’If a sentence uses the word 'men'

在单词数据库里此单词对应的记录项各字段是:The fields of the record item corresponding to this word in the word database are:

man:(单词)原型的拼法字段man: spelling field of (word) prototype

011100100:频率编号字段(一般为二进制格式)011100100: frequency number field (usually in binary format)

3:所属词类编号字段3: The part of speech number field to which it belongs

7,8,9,10:关联词类编号字段7, 8, 9, 10: associated part of speech number field

[man]:词形形态1的发音信息数据字段(单词原型man的词形音标,本说明书这里是用英语字母模拟表示音标的)[man]: the pronunciation information data field of morphological form 1 (the word form phonetic symbol of the word prototype man, this manual uses English letters to simulate the phonetic symbol here)

[men]:词形形态2的发音信息数据字段(man的复数men的词形音标,本说明书这里是用英语字母模拟表示音标的)[men]: the pronunciation information data field of morphological form 2 (the word form phonetic symbols of the plural men of man, this manual uses English letters to simulate the phonetic symbols here)

在记录项里共有2个词形形态(顺序排):man,menThere are 2 morphological forms (ordered order) in the record item: man, men

所以men的时态词形选择值占1个BIT,而men本身排在第2,所以时态词形选择值BIT就是1(如排在第1,那时态词形选择值BIT就应是0)。So the tense word form selection value of men accounts for 1 BIT, and men itself ranks second, so the tense word form selection value BIT is exactly 1 (as ranking first, then the tense word form selection value BIT should be 0).

在语句中出现的man或men,在单词段中都是使用一个相同的频率编号011100100,并在时态词形选择段里带不同的时态词形选择值来区别到底是这2个词形形态中的哪一个。(对于在一个记录项记录的单词有超过两个词形形态的,依此类推)The man or men that appear in the sentence use the same frequency number 011100100 in the word segment, and have different tense word form selection values in the tense word form selection segment to distinguish these two word forms Which of the. (For words with more than two morphological forms recorded in one entry, and so on)

注:本发明文件在以英语为例的编解码举例时,对单词或单词组合的词性形态的确定是仅供举例参考的,与实际编解码规则可能会有出入;比如对于‘IS A’单词组合,本说明书确定其词性形态有6个‘IS A’,‘BE A’,’WAS A’,‘IS AN’,‘BE AN’,‘WASAN’,而对于‘BEEN A’,‘BEING A’这些不太常见的词形形态,就没有作为‘IS A’的词形形态记录在单词数据库里‘IS A’对应的记录项里(如实际编码时遇到‘BEING A’这样的单词组合,那可以将其分拆成两个单词‘BEING’和‘A’来编码,这是显然的);而在实际编解码规则里也可以把‘BEEN A’,‘BEING A’这些不太常见的词形形态,作为‘ISA’的词形形态添加到单词数据库里‘IS A’单词组合对应的记录项里去。Note: when the document of the present invention takes English as an example of encoding and decoding, the determination of the part-of-speech form of a word or combination of words is only for reference, and may be different from the actual encoding and decoding rules; for example, for the word 'IS A' Combination, this manual determines that its part-of-speech form has 6 'IS A', 'BE A', 'WAS A', 'IS AN', 'BE AN', 'WASAN', and for 'BEEN A', 'BEING A' 'These uncommon morphological forms are not recorded as the morphological forms of 'IS A' in the record item corresponding to 'IS A' in the word database (such as word combinations such as 'BEING A' encountered during actual encoding , then it can be split into two words 'BEING' and 'A' to encode, which is obvious); and in the actual codec rules, 'BEEN A', 'BEING A', which are less common The morphological form of 'ISA' is added to the record item corresponding to the 'IS A' word combination in the word database as the morphological form of 'ISA'.

举一个较完整的英语的编码例:To give a more complete English coding example:

语句为:The handsome man is an engineer。The sentence is: The handsome man is an engineer.

依照上述编码规则,分开编码是:According to the above encoding rules, separate encoding is:

头段:header:

00:本语句是英语语种00: This statement is in English language

00:单词段采用第1种词段编码方式(即频率排序编码方式)00: The word segment adopts the first word segment encoding method (that is, the frequency sorting encoding method)

单词段:word segment:

00110:单词段里的单词及单词组合个数(6个)00110: the number of words and word combinations in the word segment (6)

00:the单词的频率编号的长度信息00: Length information of the frequency number of the word

010:the单词的频率编号010: The frequency number of the word

01:111111的频率编号长度信息01: Frequency number length information of 111111

111111:111111代表陌生单词(handsome),(111111也计入单词及单词组合个数)111111: 111111 represents an unfamiliar word (handsome), (111111 is also included in the number of words and word combinations)

10:man单词的频率编号的长度信息10: The length information of the frequency number of the man word

011100100:man单词的频率编号011100100: the frequency number of the word man

01:is单词的频率编号的长度信息01: The length information of the frequency number of the is word

010110:is单词的频率编号010110: the frequency number of the word is

00:an单词的频率编号的长度信息00: Length information of the frequency number of an word

101:an单词的频率编号101: frequency number of word an

10:engineer单词的频率编号的长度信息10: The length information of the frequency number of the word engineer

010110010:engineer单词的频率编号010110010: the frequency number of the word engineer

时态词形选择段(象the这样没词形变化的单词在此段就不占BIT):Temporal word form selection segment (words without inflection change like the do not occupy BIT in this segment):

0:man单词的时态词形选择值(此单词在单词数据库此词对应的记录项里有man,men共两个词形形态,用1个bit表示时态词形选择值,本语句用的man词是此词两个词形形态man,men中的第1个,所以时态词形选择值是0)0: tense word form selection value of the word man (there are two word form forms of man and men in the record item corresponding to this word in the word database, and 1 bit is used to represent the tense word form selection value. This statement uses The word man is the first of the two morphological forms of this word, man and men, so the tense morphological selection value is 0)

000:is单词的时态词形选择值(此单词有is,be,was,been,being共五个词形形态,用3个bit表示,is是5个词形形态中的第1个,所以时态词形选择值是000)000: tense word form selection value of the word is (this word has five form forms of is, be, was, been, being, represented by 3 bits, is is the first of the five form forms, So tense word form selection value is 000)

1:an单词的时态词形选择值(此单词有a,an共两个词形形态,用1个bit表示,an是2个词形形态中的第2个,所以时态词形选择值是1)1: The tense and word form selection value of the word an (this word has two forms, a and an, represented by 1 bit, and an is the second of the two forms, so the tense form is selected value is 1)

0:engineer单词的时态词形选择值(此单词有engineer,engineers共两个词形形态,用1个bit表示,engineer是2个词形形态的第1个,所以时态词形选择值是0)0: tense word form selection value of the word engineer (this word has engineer and engineers, which are represented by 1 bit, and engineer is the first of the two word forms, so the tense word form selection value is 0)

陌生单词段:Unfamiliar word segment:

01:陌生单词的编码信息的格式选择信息(词根元辅音组合格式)01: Format selection information for encoding information of unfamiliar words (root vowel and consonant combination format)

01:组成陌生单词发音的词根和元辅音的总个数(2个,本例只用了2个词根,没用元辅音;词根包括词基,前缀,后缀)01: The total number of roots and vowels that make up the pronunciation of unfamiliar words (2, only 2 roots are used in this example, no vowels; roots include word bases, prefixes, and suffixes)

0100101100:词根hand在词根数据库里的顺序号0100101100: The sequence number of the root hand in the root database

0010111001:词根some在词根数据库里的顺序号0010111001: the sequence number of the root some in the root database

把以上各BIT按顺序合起来得语句的编码值为(这里的语句的编码值是一个处理单位的值,也是表示语句发音的拼合信息的值):Combine the above BITs in sequence to obtain the coded value of the sentence (the coded value of the sentence here is the value of a processing unit, and is also the value of the combined information representing the pronunciation of the sentence):

00 00 00110000100111111110011100100010101100000 00 001100001001111111100111001000101011000

1011001011001000001001010100101100001011100110110010110010000010010101001011000010111001

(本编码举例中,各单词及单词组合的频率编号除了可以是本例示的阶梯编号外,还可以是等长编号或哈夫曼编号等;如为等长编号,哈夫曼编号那其前面不用带频率编号长度信息。)(in this coding example, the frequency numbering of each word and word combination can also be equal-length numbering or Huffman numbering etc. except the step numbering of this example; No frequency number length information is required.)

再述非印欧语系语种的编码处理:Let's recount the encoding processing of non-Indo-European languages:

非印欧语系语种的编码处理,一般与前述印欧语系语种的处理方式类似,只是词一般没有时态单复数的变化。The encoding processing of non-Indo-European languages is generally similar to the processing method of the aforementioned Indo-European languages, except that words generally do not have tense singular and plural changes.

以下叙述非印欧语系语种的编码处理时,涉及到举例时是以汉语为例的。When describing the encoding processing of non-Indo-European languages below, Chinese is used as an example when it comes to examples.

音码的定义:Definition of sound code:

这里音码是按照非印欧语系语种(汉语或韩语或日语等)的发音规则来划分的。对于非印欧语系任一语种,可被当成音码的,可以是表示发音的元辅音或元音辅音组合,也可以是可以转换成发音的字,词,词组合或范句,(这里的‘表示发音’和‘转换成发音’所指的发音,也是在语句发音中音码所对应的发音,这在本发明文件里已有反复叙述),既可以由元辅音,元音辅音组合,字和词中的任一种音码来拼合语句集发音,也可以由元辅音,元音辅音组合,字,词和词组合中的多种音码来拼合语句集发音,也可以由元辅音,元音辅音组合,字和词中的任一种音码与范句音码来拼合语句集发音,也可以由元辅音,元音辅音组合,字,词和词组合中的多种音码与范句音码来拼合语句集发音。(注:本段提的‘元辅音,元音辅音组合,字,词和词组合中的多种音码’,是指这里提及的5种音码中的任意2种或任意3种或任意4种或全部5种音码,即可以是两种种数的任意音码组合情况或是两种以上且种数不大于最大种数的任意种数的任意音码组合情况,最大种数是指比如前面说的5种音码中的最大种数就是5种,这也是常识;本发明文件其它地方述及非印欧语系某语种用音码来拼合语句集发音时的其它与此类似的提法涉及音码种数的问题也按这样类推理解。)Here the phonetic codes are divided according to the pronunciation rules of non-Indo-European languages (Chinese or Korean or Japanese, etc.). For any language of the non-Indo-European language family, what can be regarded as a phonetic code can be a vowel and consonant or a vowel and consonant combination representing pronunciation, or a word, word, word combination or model sentence that can be converted into pronunciation, (here The pronunciation indicated by 'expressing pronunciation' and 'converting into pronunciation' is also the pronunciation corresponding to the sound code in the sentence pronunciation, which has been repeatedly stated in the document of the present invention), both can be composed of vowel and consonant, vowel and consonant combination, Any one of the phonetic codes in words and words is used to assemble the pronunciation of the sentence set, and it is also possible to combine the pronunciation of the sentence set by vowels and consonants, combinations of vowels and consonants, words, words and word combinations, or by vowels and consonants , the combination of vowels and consonants, any kind of phonetic codes in words and words and the phonetic codes of model sentences to combine the pronunciation of sentence sets, and can also be composed of vowels and consonants, combinations of vowels and consonants, multiple phonetic codes in words, words and word combinations Come together the sentence set pronunciation with model sentence phonetic sign indicating number. (note: the ' vowel and consonant, vowel and consonant combination, word, word and multiple phonetic codes in word combination' mentioned in this paragraph refers to any 2 or any 3 or any of the 5 kinds of phonetic codes mentioned here. Arbitrary 4 kinds or all 5 kinds of phonetic codes, promptly can be the arbitrary phonetic code combination situation of two kinds of numbers or the arbitrary phonetic code combination situation of more than two kinds and kind number is not greater than the arbitrary kind number of maximum kind number, and maximum kind number is Refer to such as the maximum number of kinds in the 5 kinds of phonetic codes mentioned above is exactly 5 kinds, and this is also common sense; Other places of the present invention file mention non-Indo-European language family when a certain language of non-Indo-European language family uses phonetic codes to put together other similar to this The problem that the formulation involves the number of phonetic codes is also understood by analogy.)

对于非印欧语系任一语种,除了用以上一些音码的使用组合情况(用一种或多种音码)来拼合语句集发音外,实际上根据不同的需要,还可能使用一些别的音码使用组合情况来作拼合,比如对于汉语(非印欧语系别的语种也可),有时要编码的语句涉及的词很有限,就可能只使用一些词组合就可单独完成语句发音的拼合(即只使用词组合这一种音码来拼合语句集发音)。具体的,可以由词组合这一种音码来拼合语句集发音,也可以由词组合音码与范句音码来拼合语句集发音;这些别的音码使用组合情况可视为上段述的那些音码使用组合情况的等同替换。For any language of the non-Indo-European language family, in addition to using the combination of the above phonetic codes (using one or more phonetic codes) to piece together the pronunciation of the sentence set, in fact, according to different needs, some other phonetic codes may also be used. The code uses the combination situation to do the splicing, such as for Chinese (other languages of the non-Indo-European language family are also available), sometimes the words involved in the sentence to be encoded are very limited, and it is possible to complete the splicing of the pronunciation of the sentence only by using some word combinations ( Promptly only use this a kind of phonetic sign indicating number of word combination to put together the statement set pronunciation). Concrete, can come pieced together sentence collection pronunciation by this a kind of phonetic code of word combination, also can come pieced together sentence collection pronunciation by word combination phonetic code and model sentence phonetic code; Those phonetic codes use equivalent replacements for the combined cases.

前两段所述的非印欧语系任一语种的由一种或多种音码来拼合语句集发音,实际是指先把这一种或多种音码看成一个音码集合(这句话这么理解:如是把一种音码当成一个音码集合,如用来拼合的这种音码有用N个音码,那音码集合里就只包括这N个音码,比如把汉语的词音码当成一个音码集合,用来拼合的词音码约有用几千个,那这个音码集合就包括这几千个词音码;如是把多种音码当成一个音码集合,那就是把这多种音码都合起来成一个音码集合,比如多种音码包括两种音码,用来拼合的第一种音码有用N个音码,用来拼合的第二种音码有用M个音码,那合起来的音码集合就包括这N个第一种音码和这M个第二种音码,比如多种音码包括两种音码,用来拼合的第一种音码汉语的词音码有用几千个,用来拼合的第二种音码汉语的字音码有用几千个,那合起来的音码集合就包括这几千个词音码和这几千个字音码;用于语句集拼合的某种音码有用多少个音码及用哪些音码是根据需要来定的,如有些种音码数量很少所以在常规语言领域本来有多少个音码那拼合一般就有用多少个,象词,字这种在常规语言领域数量很大的音码,那一般是根据需要选一些来用于语句集拼合,比如一般难度的汉语拼合用的词音码约有用几千个,而且根据不同需要可能会选一些不同的音码),然后把一个应用范围内(比如教给幼儿班小孩的所有此语种语句)的此语种的所有语句当成一个语句集,此语句集中的任一个语句,都可由此音码集合里的部分音码来拼合成此语句发音,并且此音码集合里的任一个音码可用于此语句集里不同语句的发音的拼合;在本发明文件的权利要求书中述的以及本说明书后面述的非印欧语系任一语种的有关‘由一种或多种音码来拼合语句集发音’的提法以及类似提法的意思也如本段前面述;本发明文件述及非印欧语系时各处所说的把一个应用范围内的某语种的所有语句当成一个语句集,其中相同的语句是只留一个的,即语句集里的语句互不相同;本段所述是显然的。(本段说的‘应用范围’可以是有限的,比如把教给幼儿班小孩的所有汉语语句定义为一个应用范围,相应的把此应用范围内所有汉语语句当成一个语句集,此语句集包括的语句显然一般是有限多个;本段说的应用范围还可以是无限的,比如定义一个应用范围,任意一种汉语读物都属于此应用范围,相应的把此应用范围内的汉语的所有语句当成一个语句集,此语句集包括的语句数量很大可看成语句个数是无限的)The non-Indo-European language described in the first two paragraphs is combined with one or more phonetic codes to pronounce the sentence set pronunciation, which actually means that this one or more phonetic codes are regarded as a phonetic code set (this sentence Understand like this: a kind of phonetic sign indicating number is regarded as a phonetic sign indicating number set like this, useful N phonetic sign indicating number as this phonetic sign indicating number that is used for splicing together, just only comprise these N phonetic sign indicating numbers in that phonetic sign indicating number set, such as the word sound of Chinese Code is regarded as a phonetic sign indicating number set, and the word phonetic sign indicating number that is used for splicing together is useful about several thousand, and then this phonetic sign indicating number set just comprises these several thousand word phonetic sign indicating numbers; These multiple phonetic codes are all combined into a phonetic code collection, such as multiple phonetic codes comprising two kinds of phonetic codes, the first kind of phonetic codes that are used to stitch together useful N phonetic codes, and the second kind of phonetic codes that are used to stitch together useful M phonetic code, the combined phonetic code set just includes these N first kind of phonetic codes and these M second kind of phonetic codes, such as multiple phonetic codes comprising two kinds of phonetic codes, the first kind of phonetic codes used for splicing There are thousands of word phonetic codes in sign indicating number Chinese, and there are thousands of word phonetic codes in the second kind of phonetic code Chinese that are used for splicing together. How many phonetic codes and which phonetic codes are used for a certain phonetic code that is used for sentence set combination are determined according to needs, so how many phonetic codes are originally there in the conventional language field as some phonetic codes are seldom How many are generally useful for spelling, such as words and words, which have a large number of phonetic codes in the field of conventional language, and that generally selects some according to needs to be used for sentence set spelling, such as the word phonetic codes used for Chinese spelling of general difficulty There are thousands of them, and some different phonetic codes may be selected according to different needs), then all the sentences of this language in an application range (such as teaching all the sentences of this language for children in kindergartens) are regarded as a sentence set, this Any sentence in the sentence collection can be combined into this sentence pronunciation by part of the phonetic codes in the phonetic code collection, and any phonetic code in this phonetic code collection can be used for the spelling of the pronunciation of different sentences in this sentence collection; In the claims of the present invention document and in any language of the non-Indo-European language family described later in this specification, the meanings of "combining the pronunciation of a sentence set by one or more phonetic codes" and similar formulations are also As mentioned above in this paragraph; when the document of the present invention refers to non-Indo-European languages, it is said that all the sentences of a certain language within an application range are regarded as a sentence set, and only one of the same sentences is left, that is, in the sentence set The sentences are different from each other; what is stated in this paragraph is obvious. (The 'scope of application' mentioned in this paragraph may be limited, such as defining all Chinese sentences taught to children in kindergarten classes as a scope of application, and correspondingly treating all Chinese sentences in this scope of application as a set of sentences. This set of sentences includes Obviously, the number of sentences in this paragraph is generally limited; the scope of application mentioned in this paragraph can also be unlimited, such as defining a scope of application, any kind of Chinese reading material belongs to this scope of application, correspondingly all the sentences in Chinese within this scope of application As a statement set, the statement set includes a large number of statements, which can be regarded as an infinite number of statements)

在本发明文件中各处提及的对非印欧语系任一语种的语句进行编码,对此语种是对应定义了一个语句集的,此语句集里语句个数可以是有限或无限的,这里说的编码就是对此语句集里语句进行编码;在本发明文件中各处提及的对表示非印欧语系任一语种的语句发音的拼合信息的值进行解码,对此语种是对应定义了一个语句集的,此语句集里语句个数可以是有限或无限的,这里说的解码就是指对表示此语句集里一语句发音的拼合信息的值进行解码。本发明文件各处在述及对非印欧语系任一语种的语句进行编码时,提及的语句集默认是指某编码规则中有关此语种的编码细则定义的此语种的编码语句集且默认是按照此编码规则中有关此语种的编码细则来编码的;本发明文件各处在述及对表示非印欧语系任一语种的语句的发音的拼合信息的值进行解码时,提及的语句集默认是指某解码规则中有关此语种的解码细则定义的此语种的解码语句集且默认是按照此解码规则中有关此语种的解码细则来解码的。在以下直播方式的以只适用于汉语为例的编码规则里,以及TTS方式的以只适用于汉语为例的编码规则里,都是以编码规则定义(这里说的定义与在这些编码规则里涉及的编码语句集范围的完整定义是等价的)的汉语的编码语句集是包括任意汉语语句的也即编码语句集里语句个数是无限的为例的(因为有陌生词处理功能,所以任一汉语语句都可编解码)。在以下直播方式的以只适用于汉语为例的解码规则里,以及TTS方式的以只适用于汉语为例的解码规则里,都是以解码规则定义(这里说的定义与在这些解码规则里涉及的解码语句集范围的完整定义是等价的)的汉语的解码语句集是包括任意汉语语句的也即解码语句集里语句个数是无限的为例的(因为有陌生词处理功能,所以任一汉语语句都可编解码)。In the documents of the present invention, the sentences of any language of the non-Indo-European language family are encoded everywhere, and a sentence set is correspondingly defined for this language. The number of sentences in this sentence set can be limited or unlimited. Here Said encoding is exactly to encode the sentences in this sentence set; in the documents of the present invention, the value of the combined information to represent the pronunciation of sentences in any language of the non-Indo-European language family is decoded, and this language is correspondingly defined For a set of sentences, the number of sentences in the set of sentences can be limited or unlimited. The decoding mentioned here refers to decoding the value of the combined information representing the pronunciation of a sentence in the set of sentences. When referring to the encoding of sentences in any language of the non-Indo-European language family in the documents of the present invention, the mentioned sentence set refers to the coded sentence set of this language defined in the coding rules for this language in a certain coding rule by default. It is coded according to the coding rules about this language in this coding rule; when the document of the present invention is referring to the value of the combined information representing the pronunciation of any language of the non-Indo-European language family, the sentence mentioned The default set refers to the decoding sentence set of this language defined in the decoding rules of this language in a certain decoding rule, and is decoded by default according to the decoding rules of this language in this decoding rule. In the following encoding rules that are only applicable to Chinese as an example in the live broadcast mode, and in the encoding rules that are only applicable to Chinese in the TTS mode, they are all defined by encoding rules (the definitions mentioned here are the same as those in these encoding rules The complete definition of the scope of the coded sentence set involved is equivalent) the Chinese coded sentence set includes any Chinese sentence, that is, the number of sentences in the coded sentence set is infinite as an example (because there is an unfamiliar word processing function, so Any Chinese sentence can be coded). In the following decoding rules that are only applicable to Chinese in the live broadcast mode, and in the decoding rules that are only applicable to Chinese in the TTS mode, they are all defined by decoding rules (the definitions mentioned here are the same as those in these decoding rules. The complete definition of the scope of the decoding sentence set involved is equivalent) the Chinese decoding sentence set includes any Chinese sentence, that is, the number of sentences in the decoding sentence set is unlimited as an example (because there is an unfamiliar word processing function, so Any Chinese sentence can be coded).

在本发明文件中各处提及的有关非印欧语系任一语种的类似‘用音码拼合成语句发音’这样的叙述(提及的‘语句’的后面没有‘集’这个词),是指由音码(这里说的音码是指属于一种或多种音码中的多个音码)来拼合成一个具体的语句的发音。(本段有提及的‘多种’里的‘多’与本段有提及的‘多个’里的‘多’不一定是相同数目)In the documents of the present invention, such descriptions (not having the word "collection" behind the "sentence" mentioned) about any language of the non-Indo-European language family that are similar to "using phonetic codes to form a sentence pronunciation" are Refer to the pronunciation that is assembled into a concrete sentence by phonetic sign indicating number (sound sign indicating number that says here refers to belonging to a plurality of sound sign indicating number in one or more phonetic sign indicating numbers). (The "many" in the "multiple" mentioned in this paragraph and the "many" in the "multiple" mentioned in this paragraph are not necessarily the same number)

本发明文件各处提及的非印欧语系任一语种的‘元辅音’的意思是表示将此语种的全部单个元音和全部单个辅音,集合起来当成一种音码‘元辅音’,因元音和辅音在常规情况下是同时使用的,一般不会单独用元音或单独用辅音来构成语句发音;另如把元音和辅音分开当成两种音码并同时使用这两种音码来拼合语句集发音也可,但描述起来比较麻烦(并且分开成两种音码应是把元音和辅音当成一种音码的等同替换),所以本发明文件是将全部单个元音加全部单个辅音当成一种音码‘元辅音’了;对于象韩语,日语这样的语种,可以采用直接用元辅音音码来拼合语句集发音的方式,但汉语一般不宜采用此种方式,因汉语的发音的形式是由元音,辅音组成声母,韵母,(声母,韵母又各具有不同的声调),再由声母,韵母组成字的发音,所以显然不方便直接用元音,辅音来组成字的发音,当然如非要把元音,辅音都细分成用于组成不同声调声母和不同声调韵母的带调元音和带调辅音,由这些带调(带声调)元音和带调辅音来拼合语句集发音也是可以的,只是很麻烦。The meaning of the ' vowel consonant' of any language of the non-Indo-European language family mentioned in the documents of the present invention means that all single vowels and all single consonants of this language are assembled as a kind of phonetic code ' vowel consonant ', so Vowels and consonants are used at the same time under normal circumstances, and generally do not use vowels or consonants alone to form sentence pronunciation; another example is to separate vowels and consonants as two phonetic codes and use these two phonetic codes at the same time It is also possible to put together the pronunciation of the sentence set, but it is cumbersome to describe (and being separated into two kinds of phonetic codes should be the equivalent replacement of vowels and consonants as a kind of phonetic code), so the document of the present invention is to add all single vowels A single consonant has been regarded as a kind of phonetic code 'yuan consonant'; for languages such as Korean and Japanese, the method of directly using the vowel and consonant codes to assemble the pronunciation of the sentence set can be adopted, but Chinese is generally not suitable for adopting this kind of method, because of the The form of pronunciation is composed of initials and finals composed of vowels and consonants (the initials and finals have different tones), and then the pronunciation of words is composed of initials and finals, so it is obviously inconvenient to directly use vowels and consonants to form words Pronunciation, of course, if you have to subdivide vowels and consonants into tonal vowels and tonal consonants that are used to form different tonal initials and different tonal finals, these tonal (with tonal) vowels and tonal consonants It is also possible to combine the pronunciation of the sentence set, but it is very troublesome.

元音辅音组合,是指对语句的分解不是按词或字的概念,比如‘电视机’这样一个词,如将其分解成‘电’和‘视机’这样两部分,其发音信息数据也相应分成两段dian4和shi4ji1(这两段发音信息数据分别是拼音和拼音的组合;拼音是由元音和辅音组成的),可把这两段发音(dian4和shi4ji1)分别当成两个元音辅音组合音码,对这两段分别赋予编号如011010010,011110010;这两个编号011010010,011110010可以分别转换成dian4和shi4ji1这样的拼音或拼音的组合(即发音信息数据),将这两个编号组合起来后可转换成dian4shi4ji1这样的拼音的组合(即发音信息数据);所以将这样的元音辅音组合音码组合起来也是可以组成语句发音的。The combination of vowels and consonants means that the decomposition of sentences is not based on the concept of words or characters. For example, if a word like 'television' is decomposed into two parts such as 'electricity' and 'video machine', its pronunciation information data will also be Correspondingly divided into two sections dian4 and shi4ji1 (these two pieces of pronunciation information data are the combination of pinyin and pinyin respectively; pinyin is composed of vowels and consonants), these two sections of pronunciation (dian4 and shi4ji1) can be regarded as two vowels respectively Consonant combined sound sign indicating number, give numbering such as 011010010, 011110010 respectively to these two paragraphs; These two numbering 011010010, 011110010 can be converted into the pinyin such as dian4 and shi4ji1 respectively or the combination of pinyin (pronunciation information data), these two numbering Combination can be converted into the combination of pinyin such as dian4shi4ji1 (ie pronunciation information data); so the combination of such vowel and consonant combination sound codes can also form the pronunciation of sentences.

对于汉语,由元音和辅音又组成声母(约21个)和韵母(约35个),这些声母和韵母也可各自看成是元音辅音组合音码,而由于各韵母一般有4或5种声调,各声母也可分4或5种声调(另如按简单方式把各声母当成只一种声调也可,本发明文件是按各声母可分4或5种声调来阐述的),所以可把拼法(元音,辅音组合情况)相同而声调不同的韵母当成不同韵母,可把拼法(辅音)相同而声调不同的声母当成不同声母,这样不同的韵母就有1百多个,不同的声母也有约1百个,就可只用这些带调声母(约1百个)和这1百多个带调韵母来拼合语句集发音(这些带调声母和带调韵母都可看成是元音辅音组合音码)。For Chinese, initial consonants (about 21) and final vowels (about 35) are formed by vowels and consonants. A variety of tones, each initial consonant also can be divided into 4 or 5 kinds of tones (in addition, each initial consonant can be regarded as only a kind of tone in a simple way, and the document of the present invention can be divided into 4 or 5 kinds of tones to set forth by each initial consonant), so The finals with the same spelling (vowel and consonant combination) but different tones can be regarded as different finals, and the initials with the same spelling (consonants) but different tones can be regarded as different initials, so that there are more than 100 different finals. There are also about 100 different initials, so you can use only these tonal initials (about 100) and these more than 100 tonal finals to combine the pronunciation of the sentence set (these tonal initials and tonal finals can be regarded as It is the vowel-consonant combined sound sign indicating number).

一种特殊的情况,对于汉语,其常用的字有几千个,其中有些字的发音互相相同,比如‘班’和‘搬’的发音完全相同,可共用一个字的语音,可以统计各个字的不同发音,总共约1300个(把相同元辅音组合情况不同声调的当成不同的字发音),这约1300个字的发音中的每个字的发音(即拼音,也即带调音节)是由带调声母和带调韵母组成的,而带调声母和带调韵母的发音又可看成是由带调元音,带调辅音组成的,所以可把这约1300个字的发音中的各个字的发音看成是元音辅音组合音码;可用这约1300个字的发音(即元音辅音组合音码),来拼合语句集发音。(注:实际上对汉语的字的发音即拼音的分解一般到带调声母韵母这一级即可,再往下分到带调元音辅音这一级已无必要而且很麻烦,本段之所以提及分到带调元音辅音这一级,是为了说明可把这约1300个字的发音中的各个字的发音看成是元音辅音组合音码)In a special case, for Chinese, there are thousands of commonly used characters, and some of them have the same pronunciation. For example, the pronunciation of 'ban' and 'moving' are exactly the same, and the pronunciation of one character can be shared, and each character can be counted There are about 1,300 different pronunciations in total (the same vowel and consonant combination and different tones are regarded as different pronunciations of characters), and the pronunciation of each word in the pronunciation of about 1,300 characters (that is, pinyin, that is, syllables with tones) is It is composed of tonal initials and tonal finals, and the pronunciation of tonal initials and tonal finals can be regarded as composed of tonal vowels and tonal consonants. Therefore, the pronunciation of about 1,300 characters can be The pronunciation of each word is regarded as the vowel-consonant combination sound sign indicating number; The pronunciation (being the vowel-consonant combination sound sign indicating number) of these about 1300 words can be used to put together the sentence set pronunciation. (Note: In fact, the pronunciation of Chinese characters, that is, the decomposition of pinyin, is generally enough to the level of consonants and finals with tonal consonants, and it is unnecessary and troublesome to divide it into the level of vowels and consonants with tones. So it is mentioned that the grade of the tonal vowel and consonant is divided into, in order to illustrate that the pronunciation of each word in the pronunciation of these about 1300 words can be regarded as the combination of vowel and consonant code)

由上两段可看出,对于非印欧语系同一语种,对不同的编解码规则,元音辅音组合音码的集合里可能包括不同形式不同个数的元音辅音组合音码,比如可把汉语的约1百个带调声母(相同辅音而声调不同的当成不同声母)加1百多个带调韵母(元音,辅音组合情况相同而声调不同的当成不同韵母)集合起来成为一个元音辅音组合音码的集合,用这个集合里的音码作各语句发音的拼合;也可把汉语的约1300个带调音节集合起来成为一个元音辅音组合音码的集合,用这个集合里的音码作各语句发音的拼合。It can be seen from the above two paragraphs that for the same non-Indo-European language and different encoding and decoding rules, the set of vowel-consonant combination codes may include different forms and different numbers of vowel-consonant combination codes. For example, the About 100 tonal initials in Chinese (same consonants but different tones are regarded as different initials) plus more than 100 tonal finals (vowels and consonants with the same combination but different tones are regarded as different finals) are combined to form a vowel The collection of consonant combined phonetic codes, use the phonetic codes in this collection to make the spelling of each sentence pronunciation; also the about 1300 band tone syllables of Chinese can be assembled into a collection of vowel and consonant combined phonetic codes, use the phonetic codes in this collection The phonetic sign indicating number is made the splicing of each sentence pronunciation.

对于非印欧语系任一语种,语音编码有声读物的整页面上的码字涉及的表示此语种语句发音的拼合信息的值是按照某种编码规则中有关此语种的编码细则来编码生成的,此编码规则可以是只适用于此语种或适用于包括此语种在内的多个语种的,此编码规则中有关此语种的编码细则可以是直播方式的。For any language of the non-Indo-European language family, the codewords on the entire page of the phonetic coded audiobook involve the value of the combined information representing the pronunciation of sentences in this language, which is coded and generated according to the coding rules for this language in a certain coding rule. This encoding rule can be applicable only to this language or to multiple languages including this language, and the encoding rules related to this language in this encoding rule can be live broadcast.

使用编码规则的目的是为了编码生成表示语句发音的拼合信息的值;只适用于非印欧语系任一语种或可适用于包括数个非印欧语系语种在内的多个语种的编码规则可以有很多种(只适用于一个语种的编码规则就可有多种),编码规则的种数实际上是没法穷举的。权利要求3,12,20或23中提及的其有关某语种的编码细则含有压缩编码处理的编码规则(此编码规则可以是只适用于此一个语种或适用于包括此一个语种在内的多个语种的)可以是各种具体的编码规则。The purpose of using encoding rules is to encode the value that generates the combined information representing the pronunciation of the sentence; encoding rules that are only applicable to any non-Indo-European language or applicable to multiple languages including several non-Indo-European languages can be There are many kinds (there are many kinds of coding rules applicable to only one language), and the number of kinds of coding rules cannot be exhausted in fact. Its coding rules about a certain language mentioned inclaim 3, 12, 20 or 23 contain coding rules for compression coding processing (this coding rule can be only applicable to this one language or be applicable to many including this one language) language) can be various specific coding rules.

一种只适用于非印欧语系某语种的简单的编码规则,可以是用音码作简单的叠加来拼合语句集发音;举例:对于非印欧语系某一语种(比如汉语),把词当成音码,将词典里的各词都赋予不同的二进制编号(各词的二进制编号长度互相相同,比如14BIT),并生成词数据库,词数据库里记录词典里的各词的大五码及词的二进制编号及词的发音信息数据(比如词形拼音),对于需编码的语句中的各词,分别替换成其在词典里的二进制编号,将各词的二进制编号按原来各词在语句中的顺序排组起来即成语句编码值。与此编码规则相对应的只适用于此语种的解码规则可以是:解码时,可按语句编码值中的各二进制编号,到词数据库里查找其对应的词的记录项,并取出词的发音信息数据(比如词形拼音),并可依各二进制编号的排列顺序将各二进制编号对应的词的发音信息数据转成声音发出,即为语句发音。可把本段述的语句编码值看成是表示如何用音码拼合成语句发音的拼合信息的值,可把本段述的编码时的这些手法看成是此编码规则的编码细则,这些编码细则也是此编码规则中有关此语种的编码细则,可把本段述的解码时的这些手法看成是此解码规则的解码细则,这些解码细则也是此解码规则中有关此语种的解码细则;并且本段述的对各词都用二进制编号来表示(一般为14BIT),而不是用数个大五码来表示词(如用大五码表示两个字的词需2*16=32BIT),所以用二进制编号来表示词是有压缩作用的,所以本段述的编码规则中有关此语种的编码细则是含有压缩编码处理的,而且本段述的解码规则中有关此语种的解码细则是含有压缩解码处理的。权利要求3,12,20或23中提到的拼合信息的值可以是本段说的拼合信息的值,权利要求3,12,20或23中提到的编码规则可以是本段说的编码规则,权利要求3,12,20或23中提到的解码规则可以是本段说的解码规则。A simple encoding rule that is only applicable to a certain language of the non-Indo-European language family, which can be used for simple superposition of phonetic codes to combine the pronunciation of the sentence set; for example: for a certain language of the non-Indo-European language family (such as Chinese), the word is regarded Phonetic sign indicating number, each word in the dictionary is all endowed with different binary codes (the binary number lengths of each word are identical to each other, such as 14BIT), and generate a word database, record the big five codes and the words of each word in the dictionary in the word database The pronunciation information data (such as morphological pinyin) of binary numbering and word, for each word in the sentence that needs encoding, replace with its binary numbering in dictionary respectively, the binary numbering of each word is by original each word in sentence Arranged in sequence to form the statement code value. Corresponding to this encoding rule, the decoding rule only applicable to this language can be: when decoding, you can search for the record item of its corresponding word in the word database according to each binary number in the sentence encoding value, and take out the pronunciation of the word Information data (such as word shape pinyin), and the pronunciation information data of the word corresponding to each binary number can be converted into sound according to the arrangement order of each binary number, which is the sentence pronunciation. The sentence encoding value described in this paragraph can be regarded as the value of how to use the phonetic sign indicating number to stitch together the combined information of sentence pronunciation, and these techniques during the coding described in this paragraph can be regarded as the coding rules of this coding rule. The detailed rules are also the coding rules related to this language in this coding rule. The decoding methods described in this paragraph can be regarded as the decoding rules of this decoding rule, and these decoding rules are also the decoding rules related to this language in this decoding rule; and Each word described in this paragraph is represented by a binary number (generally 14BIT), rather than using several big five codes to represent the word (as the words needing 2*16=32BIT to represent two words with the big five codes), Therefore, the use of binary numbers to represent words has a compression effect, so the coding details about this language in the coding rules described in this paragraph contain compression coding processing, and the decoding details about this language in the decoding rules described in this paragraph contain Compressed and decoded. The value of the combination information mentioned inclaim 3, 12, 20 or 23 may be the value of the combination information mentioned in this paragraph, and the encoding rule mentioned inclaim 3, 12, 20 or 23 may be the encoding described in this paragraph Rules, the decoding rules mentioned inclaim 3, 12, 20 or 23 may be the decoding rules mentioned in this paragraph.

以下是以汉语作为非印欧语系语种的例子来阐述非印欧语系语种的编码技术的。The following uses Chinese as an example of a non-Indo-European language to illustrate the encoding technology of a non-Indo-European language.

以下述的主要是对一种编码规则及其实例的阐述,此编码规则包括:编码语句集范围完整定义,用于解码时按直接发音方式播放语句发音的音码语音的处理的方式定义,用于编码的音码的种类选择,对音码赋编号,词数据库建立,词类的建立,各种词段编码方式,陌生词编码处理及附加头段信息等编码细则,并且是以只适用于一个语种即汉语为例的(有时附带提及到韩语,日语的内容时是有注明是韩语或日语语种的),即此编码规则中的所有编码细则都是属于此编码规则中有关汉语语种的编码细则,并且此编码规则中的对音码赋编号一般是有压缩处理的,编码时可供选用的几种词段编码方式(频率排序,频率关联混合,整句差别等编码方式)中除频率排序编码方式外都是含有压缩编码处理的,所以以只适用于汉语为例的此编码规则是含有压缩编码处理的。此编码规则是直播方式的。非印欧语系其它语种比如韩语,日语等也可采用此编码规则,当然其具体细节与汉语有别,需作相应修改。权利要求3,12,20或23中提及的编码规则,可以是本说明书提及或没提及的适用于非印欧语系任一语种(汉语或韩语或日语等)的其有关此语种的编码细则含有压缩编码处理且是直播方式的的各种编码规则(本句说的‘各种编码规则’中任一编码规则可能是只适用于此语种或适用于包括此语种在内的多个语种)。此直播方式的以只适用于汉语为例的编码规则及其编码实例的具体阐述是从下一行起至66页为止的:Mainly be the elaboration to a kind of encoding rule and examples thereof with following, this encoding rule comprises: the complete definition of encoding statement set scope, is used for decoding by the definition of the processing mode of the phonetic sign indicating number pronunciation of statement pronunciation by direct pronunciation mode, uses The selection of the type of coded phonetic codes, the numbering of phonetic codes, the establishment of word databases, the establishment of parts of speech, the coding methods of various word segments, the coding of unfamiliar words and the additional header information and other coding rules, and is only applicable to one The language is Chinese as an example (sometimes Korean is mentioned incidentally, and the content of Japanese is marked as Korean or Japanese), that is, all the coding rules in this coding rule belong to the relevant Chinese language in this coding rule Encoding rules, and the numbering of phonetic codes in this encoding rule generally has compression processing, and can be used in several speech segment encoding modes (frequency sorting, frequency correlation mixing, whole sentence difference etc. encoding modes) during encoding except Except for the frequency sorting encoding method, it contains compression encoding processing, so the encoding rule that is only applicable to Chinese as an example includes compression encoding processing. This encoding rule is live. Other non-Indo-European languages such as Korean, Japanese, etc. can also use this encoding rule, of course, the specific details are different from Chinese, and need to be modified accordingly. The encoding rule mentioned inclaim 3, 12, 20 or 23 may be any language (Chinese or Korean or Japanese, etc.) that is not mentioned in this specification and is applicable to this language. The encoding rules contain various encoding rules for compression encoding processing and live broadcasting (any encoding rule in the "various encoding rules" mentioned in this sentence may be only applicable to this language or applicable to multiple languages including this language. language). The specific elaboration of the coding rules and coding examples of this live broadcast method, which is only applicable to Chinese as an example, starts from the next line and ends on page 66:

完整定义的编码语句集的范围是任意汉语语句,也即编码语句集包括汉语任意语句。The scope of the fully defined coded sentence set is any Chinese sentence, that is, the coded sentence set includes any Chinese sentence.

用于解码时按直接发音方式播放语句发音的音码语音的处理:When being used for decoding, the processing of the phonetic code voice of the pronunciation of the sentence is played by the direct pronunciation mode:

非印欧语系语种的语句一般由词组成,词一般由1-4个字组成,每个字的发音一般由元音和辅音组合而成;对于汉语,是由元音和辅音组成声母,韵母,由声母和韵母来组成字的发音,并且字和声母和韵母是有声调变化的,由字组成词,由词组成语句;对于韩语:语句由词组成,其词由音节组成,其单个的音节(一个音节相当于汉语里的一个字)的发音基本上都是由元音和辅音(韵尾也算在辅音里)构成;对于日语:语句由词组成,其单个词的构成单位是字和假名(日语里的罗马字可对应成假名来处理),其单个字的发音由假名的发音组成,各假名都包括在50音表及少部分浊音表里,各假名的发音主要由元音和辅音组成,各假名的发音又可能有高低调之分,这里说的日语的字是指日语里的汉字,可把日语里的各汉字和各假名看成相当于汉语里的单个字。Sentences in non-Indo-European languages are generally composed of words, words are generally composed of 1-4 characters, and the pronunciation of each character is generally composed of vowels and consonants; for Chinese, initials and finals are composed of vowels and consonants , the pronunciation of a character is composed of initials and finals, and the characters, initials and finals have tone changes, and words are composed of words, sentences are composed of words; for Korean: sentences are composed of words, words are composed of syllables, and single The pronunciation of a syllable (a syllable is equivalent to a word in Chinese) is basically composed of vowels and consonants (the rhyme ending is also counted in the consonants); for Japanese: sentences are composed of words, and the constituent units of a single word are words and Kana (roman characters in Japanese can be handled as kana), the pronunciation of a single character is composed of the pronunciation of kana. Consonant sounds, and the pronunciation of each kana may be divided into high and low tones. The Japanese characters mentioned here refer to Chinese characters in Japanese. The Chinese characters and kana in Japanese can be regarded as equivalent to a single character in Chinese.

音码语音的处理有几种方案:There are several schemes for the processing of phonetic code speech:

1:对于汉语所有的可能的字的发音(也即拼音,也即带调音节)都录音,将这些录音数据合起来就是诸音码的发音数据(也是诸音码的录音数据),而各字的发音(拼音)则是如da4(‘大’),xie3(‘写’),并且有1声,2声,3声,4声,轻声之分,总共约有1300个不同的字发音(也即约1300个不同的带调音节,每个带调音节也可看成是一个单字的拼音);在解码时,用这些字的发音(用录音数据播音)来组成词和语句的发音。这种方案的录音数据存储量较大。(如把这些字的发音也即单字拼音按其拼音字符拼法顺序排列起来,基本上就是新华字典里的拼音检索表的内容)。如建立一个数据库来记录这约1300个单字拼音的信息(每个单字拼音对应一个记录项,每个记录项里记录一个单字拼音的顺序号和拼音字符拼法及其语音的地址等信息),此数据库就可看成是后面述及的单字拼音表。本说明书表示拼音的形式(拼音字符拼法)比如xie3(ASCII字符),其中的数字是声调,1,2,3,4,5分别表示1声,2声,3声,4声,轻声。如‘班’和‘搬’的拼音相同为ban1。1: All recordings for the pronunciation (that is, pinyin, that is, syllables with tone) of all possible words in Chinese, these recording data are combined to be the pronunciation data of all phonetic codes (also the recording data of all phonetic codes), and each The pronunciation (Pinyin) of the characters is like da4 ('big'), xie3 ('write'), and there are 1 tone, 2 tone, 3 tone, 4 tone, and soft tone. There are about 1300 different pronunciations of characters in total. (that is, about 1300 different tone syllables, each tone tone syllable can also be regarded as the pinyin of a single word); when decoding, use the pronunciation of these words (broadcasting with recording data) to form the pronunciation of words and sentences . The recording data storage capacity of this scheme is relatively large. (as the pronunciation of these words also i.e. the single-character pinyin is arranged in order of its pinyin character spelling, it is basically the content of the pinyin retrieval table in the Xinhua dictionary). Such as setting up a database to record the information of these about 1300 single-character phonetic alphabets (each single-character phonetic alphabet corresponds to a record item, and records information such as the sequence number of a single-character phonetic alphabet and the address of the pinyin character spelling and its pronunciation in each record item), This database just can be regarded as the individual character pinyin table mentioned later. This manual indicates the form of pinyin (pinyin character spelling) such as xie3 (ASCII characters), where the numbers are tones, and 1, 2, 3, 4, and 5 represent 1 tone, 2 tone, 3 tone, 4 tone, and soft tone respectively. For example, the pinyin of 'class' and 'moving' is ban1.

(对于韩语:其单个的音节的发音基本上都是由元音和辅音构成,并且韩语一般无声调的变化,而元音,辅音各自只有几十个,所以元音,辅音组合起来后可能的音节总数也是有限的几百个,可以把可能的各音节都录音,一个音节也可被看成相当于汉语里的一个字;在解码时,用这些音节的发音来组成词和语句的发音,将这些录音数据合起来就是诸音码的发音数据也是诸音码的录音数据。对于日语:可对包括在50音表及少部分浊音表里的各假名进行录音,并且各假名的发音又可能有高低调之分,可对各假名的高低调都分别录音,将这些录音数据合起来就是诸音码的发音数据也是诸音码的录音数据;解码时,用这些假名的发音来组成汉字的发音,并由汉字的发音和假名的发音来组成词和语句的发音;日语里的各假名和日语里的各汉字都可分别看成相当于汉语里的字,日语里的汉字约有2000个。)(For Korean: the pronunciation of a single syllable is basically composed of vowels and consonants, and Korean generally has no tone changes, and there are only dozens of vowels and consonants, so it is possible to combine vowels and consonants. The total number of syllables is also limited to a few hundred, and all possible syllables can be recorded, and a syllable can also be regarded as equivalent to a character in Chinese; when decoding, the pronunciation of these syllables is used to form the pronunciation of words and sentences, The pronunciation data of all phonetic codes is also the recording data of all phonetic codes that these recording data are combined.For Japanese: each pseudonym that is included in 50 phonetic tables and a small part of voiced tone table can be recorded, and the pronunciation of each pseudonym is possible again There are high and low key points, and the high and low keys of each pseudonym can be recorded separately, and these recording data are combined to be the pronunciation data of all phonetic codes and also the recording data of all phonetic codes; when decoding, use the pronunciation of these pseudonyms to form Chinese characters Pronunciation, and the pronunciation of Chinese characters and pseudonyms to form the pronunciation of words and sentences; each pseudonym in Japanese and each Chinese character in Japanese can be regarded as equivalent characters in Chinese, and there are about 2,000 Chinese characters in Japanese .)

这种方案录音数据存储量较大。This solution has a large amount of recording data storage.

2:只对于所有的单个元音和所有的单个辅音录音,将这些录音数据合起来就是诸音码的发音数据(也是诸音码的录音数据)。平常人朗读语句时,是顺序读出词,而词的发音由元辅音的发音组合而成;在解码时可以用这些元辅音组成语句发音。这种方案录音数据存储量很小。2: Only for all single vowels and all single consonant recordings, these recording data are combined to be the pronunciation data of all phonetic codes (also the recording data of all phonetic codes). When ordinary people read sentences aloud, they read words in sequence, and the pronunciation of words is formed by the pronunciation of vowels and consonants; these vowels and consonants can be used to form sentence pronunciations during decoding. The recording data storage capacity of this scheme is very small.

(这种方案对韩语是适合的,因其音节无声调变化,可录其元音,辅音的音,韵尾也可当成辅音并录音;这种方案对日语也是适合的,因日语的发音虽然有高低调但处理比较简单;但这种方案对于汉语不太适合,因为汉语的音节有声调的变化,处理起来麻烦,当然如非采用此方案也是可以的,可把元音,辅音都细分成用于组成不同声调声母和不同声调韵母的带调元音和带调辅音,并对这些带调元音和带调辅音录音,解码时由这些带调元音和带调辅音来组成带调声母和带调韵母的发音以及字和词和语句的发音。)(This scheme is suitable for Korean, because its syllable has no tone change, and the sound of vowel and consonant can be recorded, and the rhyme ending can also be regarded as consonant and recorded; this scheme is also suitable for Japanese, because although Japanese pronunciation has High and low tone but the processing is relatively simple; but this solution is not suitable for Chinese, because Chinese syllables have tonal changes, which is troublesome to deal with. Of course, this solution is also possible, and vowels and consonants can be subdivided into Toned vowels and consonants used to form initial consonants of different tones and finals of different tones, and record these tonal vowels and consonants. When decoding, these tonal vowels and tonal consonants are used to form tonal initials And the pronunciation of the finals with the tone and the pronunciation of the characters and words and sentences.)

3:只对用于组成语句发音的元音辅音组合进行录音,比如可对汉语的各声母和各韵母进行录音,并且把拼法(元音,辅音组合情况)相同声调不同的韵母也当成不同韵母(带调韵母)来录音,把拼法(辅音)相同声调不同的声母也当成不同声母(带调声母)来录音,将这些录音数据合起来就是诸音码的发音数据(也是诸音码的录音数据),在解码时可以用这些声母和韵母的发音来组成字和词的发音,再由字和词的发音组成语句发音;(对于韩语,日语也可类似处理);这种方案录音数据存储量较小。3: Only record the combination of vowels and consonants used to form the pronunciation of the sentence. For example, you can record the initials and finals of Chinese, and treat the finals with the same spelling (vowel, consonant combination) and different tones as different Final consonant (consonant with tone) is recorded, and the same initial consonant (consonant) with different tones is also recorded as different initial consonant (consonant with tone), and these recording data are combined to be the pronunciation data of all phonetic codes (also all phonetic codes recording data), the pronunciation of these initials and finals can be used to form the pronunciation of words and words during decoding, and then the pronunciation of words and words is used to form sentence pronunciation; (for Korean, Japanese can also be handled similarly); this program recording Data storage is small.

4:对汉语所有常用的词的发音(一般是几千个)都录音,将这些录音数据合起来就是诸音码的发音数据(也是诸音码的录音数据),在解码时用这些词的发音来组成语句发音;(对于韩语,日语也可类似处理);这种方案的录音数据存储量较大。4: the pronunciation (being generally several thousand) of all commonly used words of Chinese is all recorded, and these recording data are put together and be the pronunciation data of all phonetic codes (also be the recording data of all phonetic codes), use these words when decoding Pronunciation is formed sentence pronunciation; (for Korean, Japanese also can handle similarly); The recording data storage capacity of this scheme is larger.

这些录音数据可用于整步直接发音方式或分步直接发音方式;一般是把需用到的以上方案1或方案2或方案3或方案4的诸音码的录音数据存储在与拾音器有关的存储器集里。These recorded data can be used for full-step direct pronunciation mode or step-by-step direct pronunciation mode; Generally, the recorded data of all sound codes of theabove scheme 1 orscheme 2 orscheme 3 orscheme 4 that need to be used are stored in the memory relevant to the pickup set.

对于词组合,可直接记录其发音信息数据由哪些元辅音组成,或记录其发音信息数据由哪些元音辅音组合组成(对汉语来说比如记录其声母韵母组合情况或词形拼音),也可记录其由哪些词组成,由词的发音来组成词组合的发音。For word combination, can directly record its pronunciation information data to be made up of which vowels and consonants, or record its pronunciation information data to be made up of which vowels and consonants combination (for Chinese such as recording its initial consonant or vowel combination situation or word form pinyin), also can Record which words it consists of, and form the pronunciation of the word combination by the pronunciation of the words.

解码时,如采用整步或分步直接发音方式,在将音码发音组合成语句发音时,有些特殊地方需注意,(比如如何处理或衔接语音波形使语音流畅和更合乎发音习惯),这可参考现成的语音合成技术。During decoding, as adopting whole-step or step-by-step direct pronunciation mode, some special places should be paid attention to when combining the sound code pronunciation into sentence pronunciation, (such as how to process or connect the voice waveform to make the voice smooth and more in line with pronunciation habits), this You can refer to the existing speech synthesis technology.

在各音码(比如元辅音)语音之间,以及各词的发音之间,一般有静音间隔,可用程序调节静音间隔的长度,可实现语句的变速朗读。Between each phonetic code (such as vowel and consonant) voice, and between the pronunciation of each word, generally have silence interval, available program regulates the length of silence interval, can realize the variable-speed reading of sentence.

另如果不录音而是使用其它方式比如用语音波形的矢量技术来模拟发音码的语音,那应该是录音方式的等同替换。本说明书在叙述采用直接发音方式的以只适用于汉语为例的编解码规则及汉语的编解码举例涉及音码语音的处理时是以录音方式为例的。In addition, if you do not record but use other methods such as using the vector technology of speech waveforms to simulate the speech of the pronunciation code, that should be an equivalent replacement of the recording method. This manual uses the recording method as an example when describing the coding and decoding rules that are only applicable to Chinese as an example in the direct pronunciation method and the coding and decoding examples of Chinese that involve the processing of phonetic code speech.

对音码进行录音的目的是为了在解码时,用这些音码的录音数据来发音以形成语句的发音(即用直接发音方式),但是有点需注意,就是在编解码过程中用于编解码的主要音码与解码时的发音音码不一定是相同种类,比如,如对音码的录音是采取上面述的第4种方式,对所有常用的词(音码)都录音,而在编解码过程中是主要用词音码来作编解码的,解码时调各词(音码)的录音数据来发音以组成语句语音,这种情况就是编解码过程中用于编解码的主要音码与解码时的发音音码是相同种类(都是词音码);而如对音码的录音是采取上面述的第1种方式,只对各个可能的字的发音(字的发音即单字拼音,单字拼音可看成是元音辅音组合音码)进行录音,而在编解码过程中是主要用词音码来作编解码的,解码时调字(音码)的发音的录音数据来发音以组成各词的发音并由各词的发音组成语句语音,这种情况就是编解码过程中用于编解码的主要音码与解码时的发音音码不是相同种类。本段所述是显然的。The purpose of recording the phonetic code is to use the recording data of these phonetic codes to pronounce the pronunciation of the sentence to form the pronunciation of the sentence (that is, to use the direct pronunciation method) when decoding, but one thing to note is that it is used in the codec process. The main phonetic code and the pronunciation code when decoding are not necessarily the same kind, such as, as the recording of the phonetic code is to take the above-mentioned 4th way, all commonly used words (phonetic code) are recorded, and in the coding In the decoding process, the word phonetic code is mainly used for encoding and decoding. When decoding, the recording data of each word (phonetic code) is adjusted to pronounce to form the sentence voice. This situation is the main phonetic code used for encoding and decoding in the encoding and decoding process. It is the same kind (all are word phonetic sign indicating number) with the pronunciation sound sign indicating number when decoding; , single-character pinyin can be regarded as a combination of vowel and consonant sound codes) for recording, and in the process of encoding and decoding, the word phonetic codes are mainly used for encoding and decoding. To form the pronunciation of each word and form the sentence voice by the pronunciation of each word, this situation is exactly that the main phonetic code used for encoding and decoding in the encoding and decoding process is not the same type as the pronunciation phonetic code when decoding. What is stated in this paragraph is obvious.

有些词在不同的语句里,根据其语句环境或前后不同词的情况,可能会出现不同发音情况,即变音的情况(比如汉语的连音情况和儿化音情况,对于韩语,日语是类似的),如采用上述第1种方式对带调音节(即拼音)来录音及在解码时用带调音节的发音来组成词及语句的发音,那可制定一些有关变音的发音规则(比如记录解码时遇到哪些词的频率编号靠在一起时要变音等),在解码时在生成语句的全部(或非全部)预备发音数据时,可考察要生成的全部(或非全部)预备发音数据里各词的排列情况(比如在解码过程中考察各词的频率编号的排列情况),按有关变音的发音规则,来确定哪些词要变音及怎样变音,并在生成全部(或非全部)预备发音数据时将全部(或非全部)预备发音数据中组成要变音的词的那些带调音节(拼音)替换成要实际发的带调音节(拼音),再可把语句的全部(或非全部)发音发出来;而编码处理这边不管其是否有变音;本段内容可从常识推理出来,而且如读物中无变音的语句,编解码时就可不考虑变音情况。对于编解码过程中使用词组合出现变音的情况,对词组合的处理仍可参照本段以上所述对词的处理(或者把要变音的词组合分拆成多个词来处理)。本段发音是按整步直接发音方式为例来说的,分步直接发音方式可用别的方式来实现变音或可不处理变音。Some words may have different pronunciations in different sentences, depending on the sentence environment or the situation of different words before and after, that is, the situation of inflection (such as the situation of Chinese ligatures and erhuayin, for Korean, Japanese is similar ), such as adopting the above-mentioned first method to record the syllable with tone (i.e. pinyin) and use the pronunciation of the syllable with tone to form the pronunciation of words and sentences when decoding, then you can formulate some pronunciation rules about inflection (such as When recording the frequency numbers of which words are close to each other when recording decoding, the pronunciation should be changed, etc.), when decoding all (or not all) preparation pronunciation data of the sentence, you can examine all (or not all) preparations to be generated The arrangement of each word in the pronunciation data (such as examining the arrangement of the frequency numbers of each word in the decoding process), according to the pronunciation rules of the relevant pronunciation, determine which words will be changed and how, and generate all ( Or not all) when preparing the pronunciation data, those band tone syllables (pinyin) that form the words to be changed in all (or not all) of the preparation pronunciation data are replaced with the band tone syllables (pinyin) that will actually send out, and then the statement All (or not all) of the pronunciation is pronounced; and the encoding process does not care whether there is an umlaut; the content of this paragraph can be deduced from common sense, and if there is no umlaut in the reading, the umlaut can be ignored when encoding and decoding Condition. For the situation that the combination of words used in the encoding and decoding process has an inflection, the processing of the combination of words can still refer to the processing of the words described above in this paragraph (or split the combination of words to be changed into multiple words for processing). The pronunciation of this section is based on the whole-step direct pronunciation method as an example. The step-by-step direct pronunciation method can be used to realize the inflection or not to deal with the inflection.

在本编码规则里就对用于解码时按直接发音方式播放语句发音的音码语音的处理的方式进行定义,是为了便于选择主要用于编码的音码及定义编码用的各数据库里一些数据的格式,比如如解码那边采取上述第4种方式对于所有常用的词都录音,那么在选择主要用于编码的音码时,一般就不要选字作为音码,否则解码时得到预备发音数据或部分预备发音数据后要发音时,各个字就没有用于发音的录音数据(因为仅对每个词有录音),并且如解码那边采取上述第4种方式对于所有常用的词都录音,那在词数据库里各词对应的记录项里的发音信息数据字段里,一般就不要记录词形拼音,而是应该记录此词的录音数据的编号(或地址)。本编码规则可看成解码那边是以上述第1种方式只对于汉语所有的可能的字的发音(约1300个带调音节,也即约1300个拼音,带调音节是元音辅音组合音码,拼音也是元音辅音组合音码)进行录音为例的,解码时语句的发音是由这些字的发音(即带调音节,也即拼音)来组成的(比如用直接发音方式时可由主MCU调与拾音器有关的存储器集里带调音节的录音数据把组成语句发音的各带调音节播放出语音来),相应的主要是用词及词组合这两种音码来作编码,在词数据库里各词及词组合对应的记录项里的发音信息数据字段里,就是记录词形拼音。本段以及上几段提及的解码时由词的发音组成语句的发音或由字的发音组成语句的发音或类似表达,是适用于整步直接发音方式或分步直接发音方式的(因分步直接发音方式时,虽然是分步来发音,但听觉上还是由词的发音组成语句的发音或由字的发音组成语句的发音或是由其它音码的发音组成语句的发音的)。In this coding rule, just define the processing mode of the phonetic sign indicating number speech of playing sentence pronunciation by the direct pronunciation mode when being used for decoding, be for the convenience of selecting the phonetic sign indicating number that is mainly used in coding and define some data in each database of coding usefulness For example, if the decoding side adopts the above-mentioned fourth method to record all commonly used words, then when selecting the phonetic code mainly used for encoding, generally do not select the word as the phonetic code, otherwise the preparatory pronunciation data will be obtained when decoding Or when part of the prepared pronunciation data is to be pronounced, each word does not have recording data for pronunciation (because there is only a recording for each word), and if the decoding takes the above-mentioned 4th way to record all commonly used words, Then, in the pronunciation information data field in the record item corresponding to each word in the word database, generally just do not record the word form pinyin, but should record the numbering (or address) of the recording data of this word. This encoding rule can be regarded as the decoding over there is the pronunciation (about 1300 band tone syllables, namely about 1300 pinyin, band tone syllables are vowel and consonant combination sounds only for the pronunciation of all possible words of Chinese in the above-mentioned 1st way code, pinyin is also a combination of vowel and consonant sound code) for example, the pronunciation of the sentence is formed by the pronunciation of these words (i.e. the tone syllable, i.e. pinyin) during decoding (such as when using the direct pronunciation mode, it can be determined by the main character MCU adjusts the recording data with tuned syllables in the memory set relevant to the pickup to play out each banded syllable that forms the pronunciation of a sentence), and correspondingly mainly uses these two phonetic codes of words and word combinations to encode, in the word database In the pronunciation information data field in the record item corresponding to each word and word combination in , it is to record the word form pinyin. When this paragraph and last several paragraphs mention, the pronunciation that forms sentence by the pronunciation of word or the pronunciation that forms sentence or similar expression by the pronunciation of word is applicable to whole step direct pronunciation mode or step-by-step direct pronunciation mode (due to the division) During the direct pronunciation mode of the first step, although it is pronounced step by step, the pronunciation of the sentence is formed by the pronunciation of the word or the pronunciation of the sentence is formed by the pronunciation of the word or the pronunciation of the sentence is formed by the pronunciation of other phonetic codes).

以下主要是用词及词组合这两种音码来拼合编码语句集发音,即用于编码的音码的种类主要选择词及词组合。(因为词组合是由词组成的,并且组成各词组合的词一般都记录在词数据库里,所以即使语句中有词组合,也可用词来分开表示词组合,所以,即使对各词和各词组合都排序编号并存入词数据库里,编码时也可只使用词来编码,而可将词组合先搁置不用,如使用词组合,那编码的压缩效率会更高,并且在编码时词组合的使用方式是与词的使用方式相似的;在本发明文件的说明书中的直播方式的以只适用于汉语为例的编码规则和直播方式的以只适用于汉语为例的解码规则的阐述里,是主要选词音码及词组合音码来编解码的,并且有对词音码及词组合音码进行排序编号,在相应的编解码举例中主要涉及用词音码,后面的编解码举例也有涉及用词组合音码)Below mainly be these two kinds of phonetic codes of word and word combination to piece together the coded statement set pronunciation, promptly be used for the kind main selection word and word combination of the phonetic sign indicating number of encoding. (because the word combination is made up of words, and the words forming each word combination are generally all recorded in the word database, so even if there is a word combination in the statement, the word combination can also be represented separately by words, so even if each word and each word combination The word combinations are sorted and numbered and stored in the word database. When encoding, only words can be used for encoding, and word combinations can be left unused. If word combinations are used, the compression efficiency of encoding will be higher, and words will be encoded when encoding. The usage mode of the combination is similar to the usage mode of words; the encoding rules and the decoding rules of the live mode are only applicable to Chinese as an example of the live mode in the description of the document of the present invention Here, word phonetic codes and word combination phonetic codes are mainly selected for encoding and decoding, and word phonetic codes and word combination phonetic codes are sorted and numbered. In the corresponding encoding and decoding examples, word phonetic codes are mainly used. The following codes The example of decoding also has to involve the phonetic sign indicating number combined with words)

由于在用词作为主要音码来建立词数据库时,既可把各常用词组合作排序编号并存入词数据库,也可不存入,即使对各词和各词组合都排序编号并存入词数据库里,在编解码时,仍然可以只使用词(原因上段有述);本发明文件中在述及非印欧语系语种时在各处提及这两个概念时,是用‘词及词组合’;本来按常识,‘及’是连词并与‘和’的意思相同,但本发明文件约定在本发明文件中各处提及的‘词及词组合’的意思与‘词和词组合’的意思是有点不同的,‘词及词组合’可在两种具体情况下作两种不同的理解,如编解码时只涉及用词而不用词组合,那提及的‘词及词组合’就指‘词’,如编解码时既涉及用词也涉及用词组合,那提及的‘词及词组合’就指‘词和词组合’。Because when using words as the main phonetic codes to set up the word database, both commonly used phrases can be co-sorted and numbered and stored in the word database, or not stored, even if each word and each word combination are all sorted and numbered and stored in the word database Here, when encoding and decoding, only words can still be used (the reason is stated in the previous paragraph); when referring to the non-Indo-European languages in the document of the present invention, when referring to these two concepts everywhere, they use 'words and word combinations '; Originally, according to common sense, 'and' is a conjunction and has the same meaning as 'and', but this invention document agrees that the meaning of 'words and word combinations' mentioned in various places in this invention document is the same as that of 'words and word combinations' The meaning is a bit different, 'words and word combinations' can be understood in two different ways in two specific situations, such as encoding and decoding only involves the use of words but not word combinations, then the mentioned 'words and word combinations' It refers to "words". If both words and word combinations are involved in encoding and decoding, the "words and word combinations" mentioned refer to "words and word combinations".

词及词组合的排序及分配编号:Sorting and assigning numbers of words and word combinations:

对于汉语,语句中可用的单个的词,如‘学校’,还可有词的组合,如‘他们是’。For Chinese, a single word that can be used in a sentence, such as 'school', can also have a combination of words, such as 'they are'.

有时词可以是一个单字,比如‘他’。本说明书约定非印欧语系语种的具有完整词性的字包括在词的概念里面(如具有代词词性的‘他’是词,而单个字‘蜘’不是词),这样一来,语句就可看作是由词组成。Sometimes a word can be a single word, such as 'he'. This manual stipulates that the words with complete parts of speech in non-Indo-European languages are included in the concept of words (such as "he" with pronoun part of speech is a word, while the single character "spider" is not a word), so that the sentence can be seen A work is made up of words.

词是指如‘他’,‘是’,‘跑动’,‘一个’这样的词,词组合就是像‘是一个’,‘他是一个’这样的几个词的组合。Words refer to words such as 'he', 'is', 'running', and 'one', and word combinations are exactly like the combination of several words like 'is one' and 'he is one'.

而且语句发音可完全看成是由元辅音来组合起来的,所以把各词及词组合转成他们的元音辅音组合,主要用元音辅音组合这种音码来拼合编码语句集发音也是可以的。And the sentence pronunciation can be regarded as being combined by vowel and consonant completely, so each word and word combination are changed into their vowel and consonant combination, mainly use the vowel and consonant combination this phonetic sign indicating number to piece together coded sentence set pronunciation and also can of.

本说明书为了叙述方便,采取主要用词及词组合这两种音码来拼合编码语句集发音的方式。(另外,在陌生词的处理时,可能还会用到元辅音等音码,对陌生词处理涉及的元辅音等音码的编号等处理在后面陌生词段的阐述里有叙述)This specification sheet is for the convenience of narration, adopts the mode that these two phonetic codes mainly use word and word combination to piece together the pronunciation of coded sentence set. (In addition, during the processing of unfamiliar words, phonetic codes such as vowels and consonants may also be used, and the numbering of phonetic codes such as vowels and consonants involved in the processing of unfamiliar words is described in the explanation of unfamiliar words later)

对词及词组合分配的编号一般可以是二进制格式。The numbers assigned to words and word combinations may generally be in binary format.

对作为音码的词和词组合进行编号,这本来就是一种压缩处理,比如一个词含有2个字,如直接用大五码(大五码是表示繁体字,国标码是表示简体字,这两种码表示一个汉字一般都是占16个BIT,而且表示的汉字的个数及范围基本相同,所以这两种码互相是等同替换,本说明书是以大五码为例来表示汉字)来表示,如表示一个字的大五码占16BIT,那用大五码来表示此词的2个字就要占32个BIT,而如采用编号来表示此一个词,即使是使用后面述的等长编号,其长度也仅12BIT(以词数据库中的词总个数为4千来计的),而如使用阶梯编号来表示一个词那长度会更短,所以采用编号来表示一个词是有明显压缩效果的。Word and word combination as phonetic code are numbered, and this is exactly a kind of compression processing originally, contains 2 words such as a word, as directly using big five yards (big five yards are to represent traditional Chinese characters, and national standard code is to represent simplified characters, this The two codes represent a Chinese character generally occupying 16 BITs, and the number and range of Chinese characters represented are basically the same, so these two codes are equivalent replacements for each other. This manual uses the big five codes as an example to represent Chinese characters) Represent, if represent the big five yards of a word to occupy 16BIT, then will represent 2 words of this word with big five yards and will take 32 BITs, and as adopting numbering to represent this word, even if use the etc. mentioned later The length of the long serial number is only 12BIT (the total number of words in the word database is 4,000), and if a ladder number is used to represent a word, the length will be shorter, so it is useful to use a number to represent a word Obvious compression effect.

给词和词组合编号的方法,最简单的就是每个词或词组合固定长度编一个二进制码,比如词数据库中的词及词组合总个数为4千个的话,那每个词或词组合用12位bit表示即可。本发明文件称此编码法编的号为等长编号。The method for numbering words and word combinations is the simplest to encode a binary code with a fixed length for each word or word combination. If the total number of words and word combinations in the word database is 4,000, each word or word combination The combination can be represented by 12 bits. The document of the present invention claims that the number that this coding method makes up is equal-length numbering.

如编解码不使用词组合,就可对词单独排序,如编解码使用词组合,就可对词和词组合混合排序,即把词组合也当成词与词统一处理,比如一个排序片段可是:‘绵羊’,‘茶杯’,‘是一个’,‘跑步’这样的,(其中的‘是一个’是词组合)。If the codec does not use word combinations, the words can be sorted separately. If the codec uses word combinations, the words and word combinations can be mixed and sorted, that is, the word combinations are also treated as words and words. For example, a sorting fragment can be: 'Sheep', 'teacup', 'is a', 'running', (wherein 'is a' is a combination of words).

排序的方式与前面述的印欧语系语种是类似的。The sorting method is similar to the aforementioned Indo-European languages.

一般可对几千数量的词和词组合进行排序,即统计这些词及词组合在平时使用的各种教材及读物中的使用频率,并按使用频率排序,使用频率高的,就排在前面,使用频率低的,就排在后面;比如‘的’词的使用频率高,就排在前面,比如‘蔬菜’词的使用频率较低,就排在后面。Generally, thousands of words and word combinations can be sorted, that is, the frequency of use of these words and word combinations in various textbooks and reading materials used in daily life can be counted, and they can be sorted according to the frequency of use, and those with high frequency of use will be ranked first , the word with low frequency of use will be ranked at the back; for example, the word "的" with high frequency of use will be ranked at the front, such as the word "vegetable" with low frequency of use, it will be ranked at the back.

这里既可以直接使用哈夫曼编号的原理进行编号,也可使用前面述及的印欧语系语种编解码技术采用的频率编号方式,即按使用频率分成,前8个词及词组合用3个二进制位表示,接着的64个词及词组合用6个二进制位表示,再接着的512个词及词组合用9个二进制位表示,余下的4096个词及词组合用12个二进制位表示,共可表示8+64+512+4096=4680个词及词组合。比如‘他’是第一段的8个词及词组合中的第2个,用001表示,‘说’是第二段的64个词及词组合中的第6个,用000101表示,‘等’是第三段的512个词及词组合中的第151个,用010010110表示,‘土豆’是第4段的4096个词及词组合中的第1692个,用011010011011表示。本发明文件称此编码法编的号为阶梯编号,阶梯的长度改为别的如4,8,12也可以的。Here you can directly use the Huffman numbering principle for numbering, or you can use the frequency numbering method adopted by the Indo-European language coding and decoding technology mentioned above, that is, divide according to the frequency of use, anduse 3 for the first 8 words and word combinations. Binary bit representation, the next 64 words and word combinations are represented by 6 binary bits, the next 512 words and word combinations are represented by 9 binary bits, and the remaining 4096 words and word combinations are represented by 12 binary bits. A total of 8+64+512+4096=4680 words and word combinations can be represented. For example, 'he' is the 2nd in the 8 words and word combinations of the first paragraph, represented by 001, and 'say' is the 6th in the 64 words and word combinations of the second paragraph, represented by 000101,' Etc' is the 151st among the 512 words and word combinations in the third paragraph, represented by 010010110, and 'potato' is the 1692nd among the 4096 words and word combinations in the fourth paragraph, represented by 011010011011. The document of the present invention claims that the number that this coding method makes up is the step numbering, and the length of the step changes other as 4,8,12 also can.

哈夫曼编号方式,可以采用常规的编法,将词及词组合全统计其使用频率值,然后再根据各词及词组合的使用频率值,按哈夫曼编码法给每个词及词组合赋予号值。这种码的前面不需表示码长。The Huffman numbering method can use the conventional coding method to count the use frequency values of words and word combinations, and then assign each word and word combination according to the use frequency value of each word and word combination according to the Huffman coding method. Combination assigned number value. The front of this code does not need to indicate the code length.

另还可采取一些别的编号方式,对使用频率高的词或词组合赋短编号,使用频率低的赋长编号。Also can take some other numbering ways in addition, assign short numbering to word or word combination with high frequency of use, endow long numbering with low frequency of use.

下面将等长编号,阶梯编号或哈夫曼编号都称为频率编号的一种(等长编号的各词及词组合的使用频率假设相同),即频率编号可以是等长编号,阶梯编号或哈夫曼编号等。本说明书涉及的非印欧语系语种(以只适用于汉语为例)的各编解码举例中各词及词组合的频率编号都默认是阶梯编号的(即比如可在以只适用于汉语为例的编码规则中和与其对应的以只适用于汉语为例的解码规则中以及编解码涉及的各数据库里都默认各词及词组合的频率编号是阶梯编号);各词及词组合的频率编号也可以是其它比如等长编号,哈夫曼编号等,并且如各词及词组合的频率编号是等长编号或哈夫曼编号时,在编码时这些频率编号的前面不用带频率编号的BIT长度信息;并且可在编解码过程中默认使用的各词及词组合的频率编号只是等长编号,哈夫曼编号等中的一种,如在编解码过程中有使用几种频率编号,那可以在编解码规则里规定每个处理单位的头段里需有一个或多个BIT来表示本处理单位里的词及词组合的频率编号是哪种,显然,如在编解码过程中有使用等长编号,哈夫曼编号等频率编号,那在编解码涉及的数据库比如词数据库里应记录有这些频率编号的信息。The equal-length numbering, ladder numbering or Huffman numbering are all referred to as a kind of frequency numbering below (the frequency of use of each word and word combination of the equal-length numbering is assumed to be the same), that is, the frequency numbering can be equal-length numbering, step numbering or Huffman No. et al. The frequency numbers of each word and word combination in each codec example of the non-Indo-European languages involved in this manual (only applicable to Chinese as an example) default to step numbering (that is, for example, only applicable to Chinese as an example) In the coding rules and the corresponding decoding rules that are only applicable to Chinese as an example, as well as in the databases involved in encoding and decoding, the frequency numbers of each word and word combination are defaulted as ladder numbers); the frequency numbers of each word and word combination It can also be other such as equal-length numbering, Huffman numbering, etc., and when the frequency numbers of each word and word combination are equal-length numbering or Huffman numbering, these frequency numbers do not need BIT with frequency numbering before encoding Length information; and the frequency number of each word and word combination that can be used by default in the process of encoding and decoding is only one of equal-length numbers, Huffman numbers, etc. If several frequency numbers are used in the process of encoding and decoding, then It can be stipulated in the encoding and decoding rules that one or more BITs are required in the header of each processing unit to indicate the frequency number of the word and word combination in this processing unit. Obviously, if it is used in the encoding and decoding process Frequency numbers such as equal-length numbers, Huffman numbers, etc., information about these frequency numbers should be recorded in databases involved in encoding and decoding, such as word databases.

对词及词组合排序后,将这些词及词组合的信息存储在词数据库里(当然如只使用词来进行编码,那只需把这些词的信息存储在词数据表里而不需存储词组合的信息),词数据库的每个记录项里存储一个词或词组合的信息,此信息包括比如此词或词组合的拼法(比如大五码),频率编号,发音信息数据(比如词形拼音)等,在解码时可在词数据库里提取语句中各词及词组合对应的记录项里的发音信息数据(比如词或词组合的词形拼音),以构成语句的发音。After sorting the words and word combinations, store the information of these words and word combinations in the word database (of course, if only words are used for encoding, then only the information of these words needs to be stored in the word data table instead of storing words Combination information), store a word or word combination information in each record item of the word database, this information includes such as the spelling of this word or word combination (such as big five yards), frequency number, pronunciation information data (such as word Pinyin) etc., can extract the pronunciation information data (such as the phonetic pinyin of word or word combination) in the record item corresponding to each word and word combination in the sentence in the word database during decoding, to form the pronunciation of the sentence.

词的分类:Classification of words:

这与前面印欧语系语种的处理类似。按词的属性进行归类,比如可分为:This is similar to the previous treatment of Indo-European languages. Classified according to the attributes of words, for example, it can be divided into:

社会,家庭,人,食品,服装,建筑,动作,相貌,表情,运动,艺术,上班,经贸,连词,助词等。Society, family, people, food, clothing, architecture, action, appearance, expression, sports, art, work, economy and trade, conjunctions, particles, etc.

可把每个类里具体包含的词都记录下来,成为词分类数据(词分类数据里一般只需包括词,不需包括词组合),可供编解码时使用。每一个类里面包含的词数量可以固定,也可不固定,在不固定时,可按其词数量的多少分配类里的每个词的序号的位长,比如某一个类里面,有32个词,此类里的每个词的序号就分配5个二进制位,另一个类有6个词,此类里的每个词的序号就分配3个二进制位。The words specifically contained in each class can be recorded and become word classification data (generally only need to include words in the word classification data, and do not need to include word combinations), which can be used during encoding and decoding. The number of words contained in each class can be fixed or not. When it is not fixed, the bit length of the serial number of each word in the class can be assigned according to the number of words. For example, there are 32 words in a class , the serial number of each word in this class is assigned 5 binary bits, and the other class has 6 words, and the serial number of each word in this class is assigned 3 binary bits.

另外,词的分类也可增加按词性来分,比如除以上的‘社会’,‘家庭’等类外,再增加动词,名词,形容词等类,然后象后面述的建立关联关系也是同样可行的,比如一个词‘工程师’,可以用形容词去修饰此词,所以在词数据库里此词对应的记录项里可以记录此词与形容词类关联,编解码时就可利用此关联性来达到压缩目的。In addition, the classification of words can also be divided by part of speech. For example, in addition to the above categories of 'society' and 'family', verbs, nouns, adjectives and other categories can be added, and then it is also feasible to establish associations as described later. , For example, a word 'engineer' can be modified with an adjective, so the record item corresponding to this word in the word database can record the association between this word and the adjective class, and this association can be used to achieve the purpose of compression when encoding and decoding .

建立词的关联:Create word associations:

这也与前面印欧语系语种的处理类似。This is also similar to the previous Indo-European languages.

对于语句中出现的词,其附近都可能会有词来修饰它或有词与它有关联关系,而修饰它或与它有关联关系的词的数量是有限的,然后就可以利用这个原理达到压缩编码的目的;对于一个名词,一般有形容词,数词去修饰它,或有动词与此名词有关联关系(比如一个名词‘人’会执行有关的动作‘吃’,‘吃’是动词);而对于一个动词,一般有副词形容词去修饰。本说明书在编解码举例中记录词的关联信息是采取这种方式的:对于词数据库里的各个词,就在词数据库里此词对应的记录项里记录下此词所属词类编号及与此词关联的各词类的编号,比如‘工程师’,与其可能关联的词类可能是:动作,相貌,运动,表情等,然后就在词数据库里词‘工程师’对应的记录项的一个字段里记录下词‘工程师’所属词类编号并在此记录项的另一个字段里记录下与此词‘工程师’关联的动作,相貌,运动,表情等词类的编号。For the words that appear in the sentence, there may be words nearby to modify it or have words associated with it, but the number of words that modify it or have an associated relationship with it is limited, and then this principle can be used to achieve The purpose of compression coding; for a noun, there are generally adjectives and numerals to modify it, or there are verbs related to the noun (for example, a noun 'person' will perform the related action 'eat', and 'eat' is a verb) ; and for a verb, there are generally adverb adjectives to modify it. This manual records the related information of words in the codec example in this way: for each word in the word database, record the number of the part of speech to which the word belongs and the number associated with the word in the record item corresponding to the word in the word database. The number of each associated part of speech, such as 'engineer', and the part of speech that may be associated with it may be: action, appearance, movement, expression, etc., and then record the word in a field of the record item corresponding to the word 'engineer' in the word database The part-of-speech number to which 'engineer' belongs, and record the part-of-speech numbers of actions, looks, sports, expressions, etc. associated with the word 'engineer' in another field of this record item.

对于有些词组合,也可如上法编入类里面,但一般可以定义各词组合不属于任一类而且与其它类没有关联关系,在实际编解码中如有需考察词组合与旁边词的关联性的时候可用词组合里包括的词来考察其与旁边词是否关联,如有需考察两个词组合之间的关联性,则可考察分属于两个词组合的词之间的关联性而得到词组合之间的关联性。For some word combinations, it can also be incorporated into the class as above, but generally it can be defined that each word combination does not belong to any class and has no relationship with other classes. When it is time to use the words included in the word combination to examine whether it is related to the next word, if it is necessary to examine the relevance between the two word combinations, you can examine the relationship between the words belonging to the two word combinations and Get the correlation between word combinations.

另外,记录词的关联信息还可采取另一种方式:在词数据库里各词对应的记录项里记录下此词所属词类编号,而不需记录此词与哪些词类关联,但另有数据库记录各词类之间的关联关系(比如人这一类的词与相貌类的词就有关联,比如社会类的词与运动类的词就没有关联),在编解码考察两个词之间的关联性时,实际是考察这两个词所属词类的关联性,此种方式比较节约关联信息数据的存储空间。In addition, another method can be used to record the associated information of words: record the number of the part of speech to which the word belongs in the record item corresponding to each word in the word database, without recording which parts of speech this word is associated with, but there is another database record The relationship between the various parts of speech (for example, the words of people are related to the words of appearance, such as the words of society and the words of sports are not related), and the relationship between the two words is examined in the encoding and decoding In fact, it is to examine the relevance of the parts of speech to which the two words belong. This method saves the storage space of the associated information data.

建立词的关联还可采取另外一种方式:即对应于每个词,都穷举罗列与其有关联的词(而不是词类);当然这样做的缺点是词的关联数据会很多。Another way to establish word associations is to enumerate the associated words (rather than parts of speech) corresponding to each word; of course, the disadvantage of doing this is that there will be a lot of associated data for words.

建立关联关系,是为了在后面述的有些词段编解码方式里作压缩之用。The purpose of establishing an association relationship is to use it for compression in some word segment encoding and decoding methods described later.

这样词数据表里的每个词或词组合对应的记录项的结构(即一个记录项的各字段)就如下:(本发明文件在述及非印欧语系语种编解码处理及以汉语为例举例时各处所称词表,词数据表与词数据库是相同概念,这是显然的,而且在数据库技术范畴,数据表和数据库通常是一个概念),这个词数据表与前述印欧语系语种用的单词数据表的结构及功用类似:The structure (that is, each field of a record item) of the record item corresponding to each word or word combination in the word data table is just as follows: (the document of the present invention is dealing with non-Indo-European languages coding and decoding and taking Chinese as an example The so-called vocabulary everywhere during the example, word data table and word database are the same concept, and this is obvious, and in the database technology category, data table and database are usually a concept), this word data table and the aforementioned Indo-European languages are used The structure and function of the word data table are similar:

‘男人’:(词的)拼法字段'man': (word) spelling field

011001100:频率编号字段(一般为二进制格式)011001100: frequency number field (usually in binary format)

3:所属词类编号字段(‘男人’所属词类编号)3: The part of speech number field (the number of the part of speech to which 'man' belongs)

7,8,9,10:关联词类编号字段(‘男人’词与7,8,9,10类词关联)7, 8, 9, 10: Associated part of speech number field (‘man’ word is associated with 7, 8, 9, 10 words)

nan2ren2:发音信息数据字段(词形拼音)nan2ren2: Pronunciation information data field (morphological pinyin)

或(再看一个词组合对应的记录项的各字段,因在编码规则中一般是把词组合当成与词一样处理的,所以词组合对应的记录项的各字段与词对应的记录项的各字段是基本相同的;当然如词数据库里没有记录词组合的信息,那词数据库里就没有这样的词组合的记录项):Or (see each field of the record item corresponding to a word combination again, because generally word combination is treated as the same as word in the encoding rule, so each field of the record item corresponding to the word combination and each field of the record item corresponding to the word The fields are basically the same; of course, if there is no record of word combination information in the word database, then there is no such word combination record in the word database):

‘是一个’:(词组合的)拼法字段'is a': spelling field (of word combinations)

011001:频率编号字段(一般为二进制格式)011001: frequency number field (usually in binary format)

254:所属词类编号字段(假设255表示本词或词组合不属于任何词类)254: Part of speech number field (assuming 255 indicates that the word or word combination does not belong to any part of speech)

255:关联词类编号字段(假设255表示本词或词组合与任何词类无关联关系)255: associated part of speech number field (assuming 255 indicates that the word or word combination has no association with any part of speech)

shi4yi1ge4:发音信息数据字段(词形拼音)shi4yi1ge4: Pronunciation information data field (morphological pinyin)

这里示的词数据表的词或词组合对应的记录项的结构仅为例子,还可以包括词性等其它字段。(这里定义的是记录项的数据结构,至于各字段里的数据用什么字符格式不关紧要由编码者自定,比如词的拼法一般可用大五码,这也是常识,频率编号一般是二进制格式)The structures of record items corresponding to words or word combinations in the word data table shown here are only examples, and may also include other fields such as part of speech. (The data structure of the record item is defined here. As for the character format of the data in each field, it does not matter. It is up to the coder to decide. For example, the spelling of words can generally use big five codes. This is also common sense, and the frequency number is generally binary. Format)

以上关于词数据表里的每个词或词组合对应的记录项的结构,是可以从英语编码用的单词数据表里的每个单词或单词组合对应的记录项的结构推出的,因为非印欧语系语种和印欧语系语种的编码处理方式,数据组织方式等是基本相同的。The structure of the record item corresponding to each word or word combination in the word data table above can be deduced from the structure of the record item corresponding to each word or word combination in the word data table used for English encoding, because non-printing The encoding processing methods and data organization methods of European languages and Indo-European languages are basically the same.

对于汉语里的形同音不同的词及词组合的情况,在编解码时可把形同音不同的都分别当成不同的词(或词组合),并在词数据表里分别开记录项来记录其信息,比如‘为’是词,有两种发音,那可把第1种发音的‘为’当成一个词并在词数据表里用一个记录项来记录此词的频率编号及发音信息数据等信息,并把第2种发音的‘为’当成另一个词并也在此词数据表里用另一个记录项来记录此词的频率编号(此频率编号与上句的频率编号不同)及发音信息数据等信息,编解码时发第1种发音的词‘为’用前面那个记录项里的频率编号表示,编解码时发第2种发音的词‘为’用后面那个记录项里的频率编号表示;(一个词或词组合的轻读重读不算形同音不同的情况,本发明文件约定变音情况也不算形同音不同的情况)。For the situation of words and word combinations with different homonyms in Chinese, different words (or word combinations) with different homophones can be regarded as different words (or word combinations) when encoding and decoding, and record items are opened respectively in the word data table. Record its information, for example, 'is' is a word with two pronunciations, then you can treat the 'is' of the first pronunciation as a word and use a record item in the word data table to record the frequency number and pronunciation information of this word Data and other information, and regard the second pronunciation of 'is' as another word and also use another record item in the word data table to record the frequency number of this word (this frequency number is different from the frequency number in the previous sentence) And pronunciation information data and other information, when encoding and decoding, the word "为" with the first pronunciation is represented by the frequency number in the previous record item; when encoding and decoding, the word "为" with the second pronunciation is expressed in the latter record item (the light reading and stress reading of a word or combination of words is not counted as the case of different homonyms, and the document of the present invention stipulates that the situation of diacritics is also not considered as the case of different homophones).

就可把经常使用的约几千个词和词组合,编成词数据库,即这约几千个词和词组合,每个都在词数据库里开一个记录项(形同音不同的词或词组合开多个记录项),记录项的结构见上几段述,各记录项里的频率编号,所属词类编号,关联词类编号等字段里的数据内容可根据本说明书第50页至第53页所述来得到。Just can about several thousand words and word combination frequently used, be compiled into word database, promptly these about several thousand words and word combination, each all open a record item in word database (words with different sounds or homonyms) word combination to open multiple record items), the structure of the record item is described in the previous paragraphs, the frequency number in each record item, the number of the part of speech to which it belongs, the data content in the fields such as the number of associated part of speech can be based on the 50th to 53rd pages of this manual as described on the page.

建立词的关联的作用,是为了在采用词段的频率关联混合方式或关联方式编解码时,以提高压缩效率,而如是采用词段的频率排序方式或整句差别方式编解码,就不使用词的关联信息。The function of establishing the association of words is to improve the compression efficiency when using the frequency association mixing method or association method of word segments to encode and decode, but if the frequency sorting method of word segments or the whole sentence difference method is used for encoding and decoding, it is not used Word related information.

在同一个词数据库里,词组合和用于组成它的各个词可分别开频率编号。比如‘是一个’,‘是’,‘一个’三个可共存于一个词数据库,在三个不同的记录项里分别记录,并各自具有不同的频率编号。(本说明书默认各词组合和用于组成它们的各个词在词数据库里都分别对应有各自的记录项,并在各自的记录项里记录有各自的频率编号)In the same word database, the word combination and the individual words used to form it can be assigned frequency numbers separately. Such as 'is one', 'is', 'one' three can coexist in a word database, record respectively in three different record items, and respectively have different frequency numbers. (This manual defaults that each word combination and each word used to form them have their own record items in the word database, and record their respective frequency numbers in the respective record items)

汉语一般没有词根的概念,但是其带调音节(单字拼音)类似于英语中的词根,可以统计所有可能的字的发音(约有1300个,字的发音也即单字拼音,也即带调音节),集中起来建立一个数据库(单字拼音表),单字拼音表里需记录各单字拼音的信息(每个单字拼音对应一个记录项,每个记录项里记录一个单字拼音的顺序号和拼音字符拼法及其语音的地址等信息,比如4个ASCII字符‘D’,‘U’,‘O’,‘1’组合起来成‘DUO1’就可看成是‘多’字的发音的拼音字符拼法),在处理陌生词时可使用单字拼音表。(韩语,日语一般也没有词根的概念,也可按本段上面述的方式处理,可把韩语的所有的单个音节分别当成词根,可把日语的所有的假名分别当成词根)Chinese generally does not have the concept of a word root, but its toned syllable (single-character pinyin) is similar to the root in English, and the pronunciation of all possible words can be counted (there are about 1300, and the pronunciation of the word is also a single-character pinyin, that is, a tonal syllable ), gather together and set up a database (single-character phonetic table), need to record the information of each individual-character phonetic alphabet in the individual-character phonetic table (each individual-character phonetic is corresponding to a recording item, records the serial number of a single-character phonetic alphabet and the phonetic character spelling in each recording item For example, the combination of 4 ASCII characters 'D', 'U', 'O', and '1' to form 'DUO1' can be regarded as the pinyin character spelling of the pronunciation of the word 'multi' method), the single-character pinyin table can be used when dealing with unfamiliar words. (Korean and Japanese generally do not have the concept of roots, and can also be processed in the above-mentioned manner in this paragraph. All single syllables in Korean can be regarded as roots respectively, and all pseudonyms in Japanese can be regarded as roots respectively)

编码的数据分成以下顺序排列的段(对编码的数据进行分段,也可认为其是属于本编码规则中的一个编码细则):The coded data is divided into segments arranged in the following order (segmenting the coded data can also be considered as a coding rule in this coding rule):

头段:记录本编码语句所属语种,本编码的词段所用词段编码方式,本编码语句中的词及词组合个数等信息(头段里的各信息不是必需的可以都不记录或记录部分信息,比如词及词组合个数可以不用记录,而在用频率关联混合方式对词段进行编码时,如在头段里记录词及词组合个数可有助于解码,再比如如果某一本书里的全部码字涉及播放的语音都属于一个语种,在这本书的第一页面有一个特殊码字携带了本书的语种种类的信息,那这本书其余页面上的各码字里的头段就可不带其编码语句所属语种的信息,在读者使用时,要求读者先用拾音器读第一页面的特殊码字取得语种种类信息比如汉语,然后其它页面的码字的值就按约定是按汉语来对其解码处理,所以其它页面的各码字的头段里就不需记录本编码语句所属语种;甚至如果编解码都默认使用频率排序编码方式,那头段里就不需再记录本编码的词段用哪种词段编码方式;所以如头段里的各信息都省略的话,编码就可省略头段。在头段里哪些信息需记录哪些信息不需记录,编码规则里需约定好;对于头段里约定好的需记录的每个信息可以各自固定分配一个或多个BIT,本编码规则及编解码举例中约定头段里记录有本编码语句属于什么语种和本处理单位的词段使用何种词段编码方式这两个信息,每个信息是各自固定分配2个BIT来记录的)。在后面的以汉语为例的编解码举例中,对头段用的一些信息是这样定义的:用两个BIT来表示本编码语句所属语种的信息,用00代表英语,用01代表汉语;用两个BIT来表示词段编码方式信息,用00代表频率排序编码方式,用01代表频率关联混合编码方式,用10代表关联编码方式,用11代表整句差别编码方式。Header section: record the language to which this coded sentence belongs, the word segment coding method used in the word segment of this coded sentence, information such as the words and the number of word combinations in this coded sentence (each information in the head section is not necessary and can not be recorded or recorded Some information, such as the number of words and word combinations can not be recorded, but when encoding word segments in a frequency-associated mixed way, such as recording the number of words and word combinations in the header can be helpful for decoding. For example, if a certain All the codewords in a book involve the playback of voices belonging to one language, and there is a special codeword on the first page of the book that carries the information of the language type of the book, then the codewords on the other pages of the book The head section in the word may not carry the information of the language to which the coded sentence belongs. When the reader uses it, the reader is required to use the pickup to read the special code word on the first page to obtain the language type information such as Chinese, and then the value of the code word on other pages is According to the agreement, it is decoded according to Chinese, so there is no need to record the language of this coded sentence in the header of each codeword on other pages; even if the codec uses the frequency sorting encoding method by default, then there is no need to record it in the header. It is necessary to record which word segment encoding method is used for the word segment of this encoding; so if all the information in the header is omitted, the encoding can omit the header. In the header, which information needs to be recorded and which information does not need to be recorded, the encoding It must be agreed in the rules; for each information agreed to be recorded in the header, one or more BITs can be fixedly allocated respectively. In this encoding rule and codec example, it is agreed that the header includes the language and language of the encoding statement. The word segment encoding method used by the word segment of this processing unit is two pieces of information, each of which is fixedly assigned 2 BITs to record). In the following codec examples taking Chinese as an example, some information used in the header is defined as follows: use two BITs to represent the language information of the coded sentence, use 00 to represent English, and 01 to represent Chinese; A BIT is used to represent the word segment coding method information, 00 is used to represent the frequency sorting coding method, 01 is used to represent the frequency-associated mixed coding method, 10 is used to represent the associated coding method, and 11 is used to represent the whole sentence difference coding method.

词段:这段的数据是对语句中的全部词进行编码后的主要数据(对于语句中的词组合,可当成与词同等对待处理)。Word segment: the data in this section is the main data after encoding all the words in the sentence (for the word combination in the sentence, it can be treated as the same as the word).

时态词形选择段:是对语句中的词的词形选择。(此段一般不用)Tensile word form selection segment: It is the word form selection of the words in the sentence. (This paragraph is generally not used)

陌生词段:是先在词段里用111111作引导,本段再放置陌生词具体的编码信息。Unfamiliar words: 111111 is used as a guide in the words first, and then the specific coding information of unfamiliar words is placed in this paragraph.

前后调用段:是先在词段里用111作引导,本段再放置往前调用或往后调用的词的距离等信息。Before and after calling section: use 111 as a guide in the word section first, and then place information such as the distance of the word called forward or called back in this section.

把这些段的数据集合起来(如按顺序排组起来)即成语句的编码值,可被拾音器解码播出语句发音,可把上述各段集合起来(如按顺序排组起来)定义为一个处理单位(包括头段,词段,陌生词段,前后调用段)。(对一个处理单位,其中的词段是必需的,其它段可能省略不用,比如对各语句编码时陌生词段经常不用,前后调用段是可能有,也可能没有的)The data of these sections are assembled (such as arranged in order) to form the code value of the sentence, which can be decoded by the pickup to broadcast the pronunciation of the sentence. The collection of the above-mentioned sections (such as arranged in order) can be defined as a processing Units (including head paragraphs, phrases, unfamiliar phrases, front and back calling paragraphs). (For a processing unit, the word segments in it are necessary, and other segments may be omitted. For example, when encoding each sentence, unfamiliar word segments are often not used, and the front and rear calling segments may or may not be present.)

语句可以是一个句子或一个句子的部分,语句也可以是含有多个句子,一个处理单位对应一个语句,表示语句发音的拼合信息的值就是一个处理单位的值,此处理单位的值可被拾音器解码播出一个语句的发音,一个码字的值可以有一个表示语句发音的拼合信息的值。A sentence can be a sentence or a part of a sentence, and a sentence can also contain multiple sentences. A processing unit corresponds to a sentence, and the value of the combined information representing the pronunciation of a sentence is the value of a processing unit. The decoding broadcasts the pronunciation of a sentence, and the value of a codeword can have a value representing the combined information of the pronunciation of the sentence.

本说明书涉及的汉语语种的编解码举例主要是这样的:语句就是一个中间无逗号的句子,一个处理单位对应一个语句,表示语句发音的拼合信息的值就是一个处理单位的值。The encoding and decoding examples of the Chinese language involved in this manual are mainly as follows: a sentence is a sentence without a comma in the middle, a processing unit corresponds to a sentence, and the value of the combined information indicating the pronunciation of the sentence is the value of a processing unit.

词段编码方式主要是用于词段的编码的,一般有以下几种(其中频率排序方式本身无压缩作用,但其使用的频率编号可以有压缩作用,其它的频率关联混合方式,关联方式,整句差别方式等词段编码方式本身是有压缩作用的):The word segment encoding method is mainly used for the encoding of word segments, and generally has the following types (the frequency sorting method itself has no compression effect, but the frequency number used by it can have a compression effect, other frequency association mixing methods, association methods, The word segment coding methods such as the whole sentence difference method itself have a compression effect):

频率排序方式:Sort by frequency:

这也与前面英语的处理类似。This is also similar to the previous treatment of English.

编码时,先在词段的最前面记录词段里含有的词及词组合的个数信息,语句都是由词及词组合顺序组成的,对语句里的词及词组合,全部换成其频率编号,并且每个频率编号前都带一个2BIT的长度信息(取值为00,01,10,11,分别对应频率编号的BIT长度为3,6,9,12),以表示其频率编号的BIT长度是3,6,9,12中的哪一个。然后把这些长度信息,频率编号按与其对应的词及词组合在语句中的顺序排在词段里即可。这种频率排序编码方式及其对应的频率排序解码方式本身无压缩作用,但涉及使用的词及词组合的频率编号是有压缩作用的(比如用一个频率编号来表示一含两个字的词,需用2BIT的此频率编号的长度信息加12BIT的此频率编号共14个BIT,而如采用常规文本记录方式用两个大五码来表示同一含两个字的词需用2*16即32个BIT,显然频率编号有压缩作用;这里说的大五码,一个大五码一般占16BIT)。When encoding, first record the information on the number of words and word combinations contained in the word segment at the front of the word segment. The sentences are all composed of words and word combinations. For the words and word combinations in the sentence, all of them are replaced by other words. Frequency number, and each frequency number is preceded by a 2BIT length information (values are 00, 01, 10, 11, respectively corresponding to the BIT length of the frequency number is 3, 6, 9, 12) to indicate its frequency number Which one of the BIT length is 3, 6, 9, 12. Then the length information and the frequency number are arranged in the word segment according to the order in which the words corresponding to it and the words are combined in the sentence. This frequency sorting coding method and its corresponding frequency sorting decoding method itself have no compression effect, but the frequency numbers involving the used words and word combinations have a compression effect (such as using a frequency number to represent a word containing two characters , it is necessary to use the length information of the frequency number of 2BIT plus the frequency number of 12BIT for a total of 14 BITs, and if the conventional text recording method is used to use two big five codes to represent the same word containing two characters, 2*16 is required 32 BITs, obviously the frequency number has a compression effect; the big five codes mentioned here, one big five codes generally occupy 16BIT).

举例:语句为:‘这个男人是一个工程师’Example: The sentence is: 'This man is an engineer'

编码的各BIT如下:The encoded BITs are as follows:

00101:在词段的最前面记录词段里的词及词组合的个数信息,共5个00101: Record the number of words and word combinations in the word segment at the front of the word segment, a total of 5

再将各词及词组合换成其频率编号,每个频率编号前面附带2个BIT的此频率编号的长度信息:Then replace each word and word combination with its frequency number, and the length information of this frequency number with 2 BITs is attached in front of each frequency number:

00:‘这个’词的频率编号的长度信息00: The length information of the frequency number of the word 'this'

011:‘这个’词的频率编号011: Frequency number of word 'this'

10:‘男人’词的频率编号的长度信息10: length information of the frequency number of the word 'man'

011001100:‘男人’词的频率编号011001100: frequency number for the word 'man'

01:‘是’词的频率编号的长度信息01: Length information of the frequency number of the word 'is'

010101:‘是’词的频率编号010101: Frequency number for the word 'is'

00:‘一个’词的频率编号的长度信息00: length information of frequency number of word 'one'

110:‘一个’词的频率编号110: frequency number of word 'one'

10:‘工程师’词的频率编号的长度信息10: Length information of the frequency number of the word 'engineer'

011101010:‘工程师’词的频率编号011101010: frequency number for the word 'engineer'

再把以上各BIT按顺序合起来,就得到编码值:Then combine the above BITs in order to get the encoded value:

001010001110011001100 01 0101010011010011101010001010001110011001100 01 0101010011010011101010

此编码值是词段的值,为了使举例简洁,本举例没涉及头段,头段里一般含有语种信息以及词段编码方式选择信息等数据。This encoding value is the value of the word segment. In order to make the example concise, this example does not involve the header, which generally contains data such as language information and word segment encoding method selection information.

本编码举例涉及的都是词(音码),对于在需编码的语句中有词组合(音码)的情况与词(音码)同等处理。What this coding example relates to is all words (phonetic sign indicating number), for the situation that word combination (phonetic sign indicating number) is arranged in the sentence that needs coding is handled equally with word (phonetic sign indicating number).

(本词段编码方式里,各词及词组合的频率编号除了可以是本例示的阶梯编号外,还可以是等长编号或哈夫曼编号等;如为等长编号,哈夫曼编号那其前面不用带频率编号长度信息。)(in this speech segment encoding mode, the frequency numbering of each word and word combination can also be equal-length numbering or Huffman numbering etc. except the ladder numbering of this example; as for equal-length numbering, Huffman numbering that It is not preceded by frequency number length information.)

频率关联混合方式:Frequency correlation mixing method:

本词段编码方式及其举例主要以用词(音码)来阐述的,如有用词组合(音码)的类似于词(音码)处理。This speech segment encoding mode and example thereof are mainly set forth with speech (sound code), as the similar speech (sound code) processing of useful word combination (sound code).

这也与前面英语的处理类似。This is also similar to the previous treatment of English.

这是对频率排序(使用阶梯编号)编码方式的改进,由于完全用频率排序(使用阶梯编号)编码时,每个词或词组合的阶梯编号前面需带2个二进制位以表示此阶梯编号长度,如15个词及词组合的语句总共就要耗30个二进制位,占位较多;可以结合关联的方式进行改进。具体做法为:This is an improvement on the coding method of frequency sorting (using ladder numbering), because when coding with frequency sorting (using ladder numbering) completely, each word or word combination needs to have 2 binary bits in front of the ladder numbering to indicate the length of the ladder numbering , For example, a statement with 15 words and word combinations will consume a total of 30 binary bits, occupying a lot of space; it can be improved in combination with the way of association. The specific method is:

本编码方式以下述以语句由词组成为例,语句中有词组合的类似处理,先把语句里的所有词都转成阶梯编号并按原顺序排列,每个阶梯编号前不带此阶梯编号的长度信息,然后假如排列并组合起来的值(可命名为阶梯信息值)的二进制最长长度为84位(BIT),就在第21位,第42位,第63位处分别设点分节,在第21位处向后查与最近的单个词的阶梯编号的结束处的距离值(每增加3BIT,距离值就加1)并记录下来(距离值占2个二进制位),并把分节处挪到这个阶梯编号结束处,对第42位,第63位处如法处理,这三个距离值就是三个分节处的偏移信息;这样就将阶梯信息值分成4节,每节长度大约为21位(当然如阶梯信息值比较短,那后面几节可能是空的没有词),在每节里可能有一个或多个词,每个词的阶梯编号的长度可能为3,6,9,12位,但是每节里具体的第一个词占几位,第二个词占几位,还是未知的。但由于分节后每节位数较少,所以可以对其阶梯编号分法进行穷举,对于每种阶梯编号长度可能分法情况都计算,计算这种长度可能分法情况时,节里的各词之间的关联度;(假设某种分法下各频率编号对应的各词正好就是原语句在本节位置的各词时,可称这种分法为本义分法,否则是歧义分法);因为在语句中紧排在一起的几个词或隔1,2个词的词,一般是互相有关联的,利用前述的词关联机制,计算节里的各词间的关联度,然后比较每种分法的关联度,如关联度值最大的那种分法为本义分法,则本节可采用这种编法;如每节(各节BIT长度相同)都可采用这种编法,那本语句的编码就可采用频率关联混合方式;当在编码时发现分节后有某节的歧义分法的关联度值比本义分法的关联度值还大时,可调整基本节长,再试,如仍不行,就改另外的编码方式。In this encoding method, the following sentence is composed of words as an example. Similar to the combination of words in the sentence, first convert all the words in the sentence into ladder numbers and arrange them in the original order. There is no ladder number before each ladder number. Length information, and if the longest binary length of the arranged and combined value (which can be named as the ladder information value) is 84 bits (BIT), set points and subsections at the 21st, 42nd, and 63rd bits , check backward at the 21st position and the distance value at the end of the ladder number of the nearest single word (every increase of 3BIT, the distance value is just increased by 1) and record it (the distance value occupies 2 binary bits), and put the score The section is moved to the end of the ladder number, and the 42nd and 63rd positions are processed in the same way. These three distance values are the offset information of the three subsections; in this way, the ladder information value is divided into 4 sections, each The length of the section is about 21 bits (of course, if the ladder information value is relatively short, then the next few sections may be empty without words), there may be one or more words in each section, and the length of the ladder number of each word may be 3 , 6, 9, and 12 digits, but it is still unknown how many digits the first word occupies in each section, and how many digits the second word occupies. However, due to the small number of digits in each section after subdivision, it is possible to exhaustively enumerate its ladder numbering method, and calculate the possible division method of each ladder number length. When calculating the possible division method of this length, the section in the The degree of association between each word; (assuming that each word corresponding to each frequency number under a certain division method is exactly the original sentence in this section position, this division method can be called the original meaning division method, otherwise it is an ambiguous division method. method); because several words that are closely arranged together in a sentence or words that are separated by 1 or 2 words are generally related to each other, and the aforementioned word association mechanism is used to calculate the degree of association between each word in the section. Then compare the correlation degree of each division method, if the division method with the highest correlation degree value is the original division method, then this section can adopt this compilation method; such as each section (each section BIT length is the same) can adopt this method If the coding method of this sentence can adopt the frequency correlation mixing method; when it is found that the correlation value of the ambiguous division method of a certain section is larger than the correlation value of the original meaning division method after the division, the basic division method can be adjusted. Section length, try again, if it still doesn't work, change another encoding method.

每节的基本节长(BIT)除了可以是21外,也可以是其它比如24,18等,在编码值中可放置基本节长的选择信息值(以下的编解码举例是以只有21或24两个基本节长选择值为例)In addition to 21, the basic section length (BIT) of each section can also be other such as 24, 18, etc., and the selection information value of the basic section length can be placed in the coded value (the following codec example is based on only 21 or 24 Two basic section length selection values as an example)

这种词段编码方式需对每个分节处设2个二进制位的偏移信息,如分4节(每节长约21BIT),有3个分节处,需3*2=6位二进制位。This word segment encoding method needs to set 2 binary bits of offset information for each subsection, such as 4 subsections (each section is about 21BIT long), and there are 3 subsections, 3*2=6 binary bits are required bit.

举例:语句为:‘这个胖男人是一个工程师’。Example: The sentence is: 'This fat man is an engineer'.

编码如下:The encoding is as follows:

01100:阶梯信息值的长度是12个单位(3个bit一个单位,本例有3*12个bit)01100: The length of the ladder information value is 12 units (3 bits per unit, this example has 3*12 bits)

0:每节基本节长(BIT)选为21(0:表示每节基本节长为21,1:表示每节基本节长为24)0: The basic section length (BIT) of each section is selected as 21 (0: indicates that the basic section length of each section is 21, 1: indicates that the basic section length of each section is 24)

010000:三个分节处的偏移信息(第21位是阶梯信息值中的’是’词的频率编号的中间,将此分节处挪到’是’词的频率编号末,并记录距离值为01,因阶梯信息值的长度不到42,所以后面第42位和第63位处的距离值是00和00,然后01与00与00合并成010000)010000: Offset information at the three sections (the 21st bit is the middle of the frequency number of the word "yes" in the ladder information value, move this section to the end of the frequency number of the word "yes", and record the distance The value is 01, because the length of the ladder information value is less than 42, so the distance values at the 42nd and 63rd bits are 00 and 00, and then 01, 00 and 00 are combined into 010000)

011:’这个’词的频率编号(也即阶梯编号,本例以下同)011: the frequency number of the word 'this' (that is, the ladder number, the same as in this example below)

011011:’胖’词的频率编号011011: Frequency number for the word 'fat'

011001100:’男人’词的频率编号011001100: frequency number for the word 'man'

010101:’是’词的频率编号010101: Frequency number of the word 'is'

      |\分节处|\section

110:’一个’词的频率编号110: frequency number of word 'one'

011101010:’工程师’词的频率编号011101010: frequency number for the word 'engineer'

把以上各BIT按顺序合起来得编码值是:Combining the above BITs in sequence, the coded value is:

011000010000011011011011001100010101110011101010011000010000011011011011001100010101110011101010

此编码值是词段的值,为了使举例简洁,本举例没涉及头段,头段里一般含有语种信息以及词段编码方式选择信息等数据。This encoding value is the value of the word segment. In order to make the example concise, this example does not involve the header, which generally contains data such as language information and word segment encoding method selection information.

至于这样编的编码值是否可以正常解码,要看下面的判断,如不能正常解码,可调整基本节长再试或换别的词段编码方式来对词段进行编码:As for whether the coded value encoded in this way can be decoded normally, it depends on the following judgments. If it cannot be decoded normally, you can adjust the basic section length and try again or use another word segment encoding method to encode the word segment:

这里词段里的阶梯信息值分成两节:011011011011001100010101 110011101010The ladder information value in the phrase here is divided into two sections: 011011011011001100010101 110011101010

如对于后一节110011101010,可分解为7种情况(阶梯编号组合情况):For example, for the latter section 110011101010, it can be decomposed into 7 situations (the combination of ladder numbers):

110,011,101,010(被逗号隔开成4个阶梯编号)110, 011, 101, 010 (separated by commas into 4 ladder numbers)

110,011,101010(被逗号隔开成3个阶梯编号,以下类推)110, 011, 101010 (separated by commas into 3 ladder numbers, and so on)

110,011101010110, 011101010

110,011101,010110, 011101, 010

110011101,010110011101,010

110011,101,010110011, 101, 010

110011,101010110011, 101010

(本来还有第8种分法110011101010,即这12个BIT组成一个阶梯编号,但因这种情况时这节内只有一个阶梯编号,不能计算这节内各阶梯编号之间的关联度,所以如出现这种某节的所有BIT完全就是实际语句这个位置的这个词的阶梯编号的情况,可调整基本节长再试或放弃这种编法)(Originally there is an 8th division method 110011101010, that is, these 12 BITs form a ladder number, but in this case, there is only one ladder number in this section, and the correlation between the ladder numbers in this section cannot be calculated, so If it happens that all BITs in a certain section are exactly the ladder number of the word at the position of the actual sentence, you can adjust the basic section length and try again or give up this compilation method)

分别计算这7种情况的各词(阶梯编号)间的关联度(本词段编解码方式所称某一节的某分法的各词间的关联度与某分法的关联度是同一个概念),比如前一词所属词类信息记录在后一词对应的记录项(记录项是在词数据库里的)的关联词类编号字段里(即前一词与后一词关联,比如词‘吃’属于第7个词类即动作词类,在词数据库里词‘男人’对应的记录项的关联词类编号字段里记录有7,所以词‘吃’与词‘男人’关联),这种分法的关联度值就加1,其计算标准自定(一般是对一节的某分法的所以词之间都要考察关联度,比如某一节的某分法是分成3个词,可这样计算这节这种分法的关联度值:关联度值起始为0,如词1与词2关联,关联度值就加1,如词1与词3关联,关联度值再加1,如词2与词3关联,关联度值再加1),如关联度值最大的分法是本义分法,就可使用这种编法。Calculate the degree of association between each word (ladder numbering) of these 7 kinds of situations respectively (the degree of association between each word of a certain sub-method of a certain section and the degree of association of a certain sub-method are the same concept), such as the part-of-speech information of the former word is recorded in the associated part-of-speech number field of the record item corresponding to the latter word (the record item is in the word database) (that is, the former word is associated with the latter word, such as the word 'eat 'belongs to the 7th part of speech, that is, the action part of speech, andrecords 7 in the associated part of speech number field of the record item corresponding to the word 'man' in the word database, so the word 'eat' is associated with the word 'man'), thisdivision Add 1 to the correlation degree value, and its calculation standard is self-determined (generally, it is for a certain division of a section, so the correlation between words must be examined. For example, a certain division of a certain section is divided into 3 words, which can be calculated in this way The correlation value of this division method in this section: the correlation value starts from 0, ifword 1 is associated withword 2, the correlation value will be increased by 1, ifword 1 is associated withword 3, the correlation value will be added by 1, such asWord 2 is associated withword 3, and the degree of association value adds 1), as the division method that the degree of association value is maximum is the original meaning division method, just can use this compilation.

这里除了可以定量地计算其关联度外,还可以利用排除方法,比如在存储器集里存储一些信息数据库,指示在某些词后面或前面不会出现某些词,这样如果某种分法出现前后非法的搭配,即排除这种分法。比如‘的’,‘个’这样的两词一般不会在句子中前后紧挨在一起的,如出现‘的个’这样分法,即可判断为不正确的分法。另外也可记录词类与词类非法相邻的信息,比如食品类的词与建筑类的词挨在一起是非法等。如果词数据库的记录项里有词性字段的话,那还可规定词性的非法搭配,比如不能前名词后量词紧邻,一般对于单一词性的处理比较方便,对于多词性的可不考滤其词性的非法搭配;如果词数据库的记录项里没有词性字段的话,就不考虑词性的非法搭配。Here, in addition to quantitatively calculating its relevance, you can also use exclusion methods, such as storing some information databases in the memory set, indicating that certain words will not appear behind or in front of certain words, so that if a certain division method appears before and after Illegal collocation means excluding this division. For example, two words such as 'de' and 'ge' are generally not placed next to each other in a sentence. If there is a division like 'de', it can be judged as an incorrect division. In addition, the information that the part of speech is illegally adjacent to the part of speech can also be recorded, for example, it is illegal for a word of food and a word of construction to be next to each other. If there is a part-of-speech field in the record item of the word database, the illegal collocation of part-of-speech can also be stipulated. For example, the quantifier cannot be adjacent to the front noun. Generally, it is more convenient to deal with a single part of speech. For multiple parts of speech, illegal collocations of its parts of speech can not be considered. ; If there is no part-of-speech field in the record item of the word database, the illegal collocation of part-of-speech will not be considered.

对于词段里的阶梯信息值的某节的各种分法,就可先排除有这些非法情况的分法,然后再计算余下的各分法的各词间的关联度,如关联度最大的分法是本义分法即表示本节可采用这种编法。第1节完后再处理第2节直至完,如每节都可采用这种编法,则此语句编码的词段可采用这种编法,如有某一节关联度值最大的分法是歧义分法,可调整基本节长再试或可放弃这种编法(当然调整基本节长后,又需重新考察每节是否都可用这种编法)。(在以上述及频率关联混合编码方式时提及的‘这种编法’,是指频率关联混合方式)。For the various divisions of a certain section of the ladder information value in the word segment, the divisions with these illegal situations can be excluded first, and then the degree of association between each word of the remaining divisions is calculated, such as the highest degree of association The division method is the original meaning division method, which means that this section can adopt this compilation method. After the 1st section, the 2nd section is processed until the end. If this method can be used in each section, the word segment encoded by this sentence can adopt this method. It is an ambiguous division method, and the basic section length can be adjusted to try again or this weaving method can be abandoned (of course, after adjusting the basic section length, it is necessary to re-examine whether each section can use this weaving method). ('this coding method' mentioned in the time of above-mentioned and frequency-correlation hybrid coding mode refers to the frequency-correlation hybrid mode).

对于其中有词组合的,可取其前后词(可在词数据库里各词组合对应的记录项里的拼法字段得到,由词的拼法就可在词数据库里找到此词对应的记录项),再与旁边的词计算关联度及判断是否非法。For wherein there are word combinations, the words before and after it can be taken (can be obtained in the spelling field in the record item corresponding to each word combination in the word database, and the record item corresponding to this word can be found in the word database by the spelling of the word) , and then calculate the degree of association with the words next to it and judge whether it is illegal.

本词段编码方式,涉及的频率编号是阶梯编号。In this word segment encoding method, the frequency numbers involved are ladder numbers.

在叙述频率关联混合编解码方式时,经常有提及词之间的关联,词的词性等,实际上是通过阶梯编号到词数据库等数据库里查此阶梯编号对应的记录项,再在记录项里取出此阶梯编号(即词)的关联关系,词性等信息。When describing the frequency-associated hybrid encoding and decoding method, the association between words and the part of speech of words are often mentioned. In fact, the record item corresponding to the ladder number is checked in the database such as the word database through the ladder number, and then in the record item Take out the association relationship, part of speech and other information of this ladder number (ie word) in.

关联方式:Association:

这也与前面英语的处理类似。This is also similar to the previous treatment of English.

语句由词组成,(本编码方式以下述以词为例,有词组合的类似处理),对于语句,先将其词及词组合分成几层,下面一层词是对其紧邻的上面一层词的修饰,如某词其下面没有词修饰它,它就处于最上层。Sentence is made up of words, (this coding mode is with following words as example, the similar treatment of word combination is arranged), for sentence, first its word and word combination are divided into several layers, and the following layer of words is the upper layer next to it Modification of words, if there is no word to modify it below a certain word, it is at the top level.

下层的词,如果本身频率编号的长度很短,可以直接放在最高层。The words in the lower layer can be directly placed in the highest layer if the length of the frequency number itself is very short.

对于包含较多词的词组合,由于一般没有与其它词的关联,直接放在最高层。For word combinations that contain more words, they are placed directly at the highest level because they generally have no association with other words.

编码的前面是词的个数信息。The front of the encoding is the number information of words.

接着是分层信息,其中包含了各层个数信息,位置信息,所以不再需附加其它bit,即可正确解读。Then there is the layered information, which contains the number information and location information of each layer, so it can be interpreted correctly without adding other bits.

接着顺序排出最高层的频率编号及其编码长度。Then, the frequency number of the highest layer and its code length are sequentially arranged.

再顺序排出下一层词最上一层词的修饰偏移信息。The modified offset information of the word of the next layer and the word of the upper layer is sequentially discharged.

再顺序排出下一层词最上一层词的修饰偏移信息。The modified offset information of the word of the next layer and the word of the upper layer is sequentially discharged.

直至所有层排列完。until all layers are arranged.

这里的频率编号可以是等长编号,阶梯编号,哈夫曼编号。如为等长编号,哈夫曼编号那其前面不用带编号长度。The frequency number here can be an equal-length number, a ladder number, or a Huffman number. If the number is of equal length, the Huffman number does not need to be preceded by the number length.

整句差别方式:The difference in the whole sentence:

这也与前面英语的处理类似。可以是用词(音码)及词组合(音码)来编码。This is also similar to the previous treatment of English. Can be coded with word (phonetic sign indicating number) and word combination (phonetic sign indicating number).

先统计常用的范句,并做成数据库(范句库,当然汉语用的范句库与英语用的范句库是不同的),对每条范句按顺序编号。范句库里每条范句的记录方式类似于频率排序编码方式;然后在编语句时,在库中找到与其相似的范句,将那范句的顺序号排在词段的前面,然后在词段后面跟着删词信息和增词信息。First count the commonly used model sentences, and make a database (the model sentence database, of course the model sentence database used in Chinese is different from the model sentence database used in English), and number each model sentence in sequence. The recording method of each model sentence in the model sentence library is similar to the frequency sorting coding method; then when compiling a sentence, find a model sentence similar to it in the library, arrange the sequence number of that model sentence in front of the word segment, and then Word segments are followed by word deletion information and word addition information.

这里的频率编号可以是等长编号,阶梯编号,哈夫曼编号。如为等长编号,哈夫曼编号那其前面不用带编号长度信息。The frequency number here can be an equal-length number, a ladder number, or a Huffman number. If it is an equal-length number, the Huffman number does not need to carry number length information in front of it.

如把含多个句子的语句也录入范句库,那就可以对多个句子的语句编码,处理类似于单句子的语句。If the sentences containing multiple sentences are also entered into the model sentence database, the sentences of multiple sentences can be encoded, and the processing is similar to the sentences of single sentences.

显然,在编码生成一个处理单位的词段时,4种词段编码方式(频率排序方式,频率关联混合方式,关联方式和整句差别方式)是只选其一的。除这4种词段编解码方式(频率排序方式,频率关联混合方式,关联方式和整句差别方式)外,还可以使用本发明文件没提及的其它词段编解码方式来对词段进行编解码处理。Apparently, when encoding and generating a word segment of a processing unit, only one of the four word segment encoding methods (frequency sorting method, frequency-associated mixing method, association method and whole sentence difference method) is selected. Except these 4 kinds of word segment encoding and decoding modes (frequency sorting mode, frequency association mixed mode, association mode and whole sentence difference mode), can also use other word segment encoding and decoding modes that the document of the present invention does not mention to carry out word segment Codec processing.

前后调用处理:Call processing before and after:

这也与前面英语的处理类似。在对词段进行编码时,前后调用处理是与前几种词段编码方式中的某一种合用的。This is also similar to the previous treatment of English. When the word segment is encoded, the forward and backward call processing is used in conjunction with one of the previous several word segment encoding methods.

对于紧邻的几个句子,里面可能有一些比较偏僻的词或词组合被重复调用,而其本身的频率编号较长,这时可以使用前后调用处理,(下面以词为例,对词组合的处理是与词的处理类似的),即,例如某词出现在前面句子时,使用频率编号,而当紧邻一句子又出现这词时,就可以将指针指到前面的词的位置,而不必用其较长的频率编号;具体可以在频率编号中占用一个111,当出现与前面句子中或后面句子中或本句子前面或本句子后面的某词相同的词时,就在词段里用111代替此当前词,然后在整个编码的前后调用段里附上往前调用或往后调用的标志,并附上5个bit来表示当前词位置与前面或后面被调用的词位置的距离。For the next few sentences, there may be some relatively remote words or word combinations that are repeatedly called, and the frequency number itself is relatively long. At this time, the front and rear call processing can be used. The processing is similar to the processing of words), that is, for example, when a certain word appears in the previous sentence, the frequency number is used, and when the word appears again in the next sentence, the pointer can be pointed to the position of the previous word without having to Use its longer frequency number; specifically, you can take a 111 in the frequency number, and when there is a word identical to a certain word in the front sentence or the back sentence or the front of the sentence or the back of the sentence, use it in the word segment 111 replaces this current word, then adds the sign of calling forward or calling backward in the front and rear calling segments of the whole encoding, and attaching 5 bits to represent the distance of the current word position and the word position called before or after.

显然如有使用前后调用处理的条件(即比如语句的前后有相同的词),但编码时也可不使用前后调用处理。Obviously, if there is a condition for using front and back call processing (that is, for example, there are the same words before and after the statement), but the front and back call processing may not be used during encoding.

陌生词处理:Strange word processing:

这也与前面英语的处理类似。This is also similar to the previous treatment of English.

由于词数据表里的词的数量总是有限的,所以随时会遇到陌生词,所以在编解码规则里一般需处理陌生词;汉语的陌生词的发音构成方式一般可看成为几种(其它非印欧语系语种与汉语是类似的):一种可看作是由多个元辅音的发音组成的,一种是由常用的单字拼音的发音组成的,比如陌生词‘课程’由两个单字拼音的发音(ke4和cheng2)组成。Because the quantity of words in the word data table is always limited, so can run into strange words at any time, so generally need to deal with strange words in the codec rule; Non-Indo-European languages are similar to Chinese): one can be regarded as composed of the pronunciation of multiple vowels and consonants, and the other is composed of the pronunciation of commonly used single-character pinyin. For example, the unfamiliar word 'course' consists of two It consists of the pronunciation (ke4 and cheng2) of the pinyin of a single character.

所以陌生词的处理可为,在词的频率编号中,留一个频率编号专用于代替陌生词,比如111111,这个编码只出现在词段中以代替陌生词(显然在使用频率排序或整句差别等编码方式来生成词段时,都可以在词段里使用111111这个频率编号来代替语句中的陌生词,并把陌生词的具体编码信息附在陌生词段里),然后陌生词的具体编码信息附在陌生词段里,陌生词的编码信息的格式可分几种方式(以汉语为例):Therefore, the treatment of unfamiliar words can be, in the frequency number of words, reserve a frequency number dedicated to replace unfamiliar words, such as 111111, this code only appears in the word segment to replace unfamiliar words (obviously in the use of frequency sorting or whole sentence difference When the word segment is generated by the encoding method, the frequency number 111111 can be used in the word segment to replace the unfamiliar word in the sentence, and the specific encoding information of the unfamiliar word is attached to the unfamiliar word segment), and then the specific encoding of the unfamiliar word The information is attached to the unfamiliar words, and the format of the encoding information of the unfamiliar words can be divided into several ways (taking Chinese as an example):

1:元辅音作组合:陌生词的编码信息,完全由元辅音组成,陌生词的编码信息就是:开始4个bit表示组成陌生词发音的带调元辅音个数,接着是组成陌生词发音的各带调元音和带调辅音的顺序号(每个顺序号占8BIT),解码时,要发陌生词的音时,直接把组成陌生词发音的各带调元音和带调辅音的语音按顺序播放出来即可。1: Combination of vowels and consonants: the encoding information of unfamiliar words is completely composed of vowels and consonants. The encoding information of unfamiliar words is: the first 4 bits indicate the number of tonal consonants that form the pronunciation of unfamiliar words, and then the number of tonal consonants that form the pronunciation of unfamiliar words The sequence numbers of each tone vowel and tone consonant (each sequence number accounts for 8BIT), when decoding, when the sound of an unfamiliar word is to be pronounced, the voice of each tone vowel and tone consonant forming the pronunciation of the unfamiliar word is directly used Play it out in order.

2:单字拼音作组合:陌生词的发音由单字拼音的发音组成,常用字一般仅几千个,其中有些字的发音互相相同,比如‘班’和‘搬’的发音完全相同,可当成一个单字的发音,这些常用的不同的单字的发音(元辅音组成情况相同而声调不同的单字发音算不同的单字发音)总共约1300个(对于汉语),各单字的发音也可看成是单字拼音;所以可先建立这约1300个单字拼音的数据库(即单字拼音表),并对各单字拼音进行编号(即赋予顺序号),单字拼音表里需记录各单字拼音的顺序号及其拼音字符拼法及其语音的地址;这样陌生词的编码信息就是:开始4个bit表示组成陌生词发音的单字拼音的个数,接着是组成陌生词发音的各单字拼音的顺序号,每个单字拼音的顺序号占11bit(以单字拼音表里约有1300个单字拼音计)。解码时,要发陌生词的音时,直接把组成陌生词发音的各单字拼音的语音顺次发出来即可。2: Combination of single-character pinyin: the pronunciation of unfamiliar words is composed of the pronunciation of single-character pinyin. Generally, there are only a few thousand commonly used words, and some of them have the same pronunciation. The pronunciation of a single character, the pronunciation of these commonly used different single characters (words with the same composition of vowels and consonants but different pronunciations are counted as different pronunciations of a single word) totals about 1300 (for Chinese), and the pronunciation of each single character can also be regarded as a single-character pinyin ; So the database of these about 1300 single-character phonetics (i.e. the single-character phonetic table) can be set up earlier, and each single-character phonetic alphabet is numbered (promptly given a sequence number), and the sequence number of each single-character phonetic alphabet and its phonetic characters need to be recorded in the single-character phonetic table The address of the spelling and its pronunciation; the encoding information of such unfamiliar words is: the first 4 bits represent the number of single-character pinyins that make up the pronunciation of unfamiliar words, followed by the sequence numbers of each single-character pinyin that constitutes the pronunciation of unfamiliar words, and each single-character pinyin The serial number occupies 11 bits (there are approximately 1300 individual-character phonetic alphabets in the individual-character phonetic alphabet table). During decoding, when the sound of an unfamiliar word is to be pronounced, the phonetic sounds of each single-character phonetic alphabet that forms the pronunciation of the unfamiliar word are directly sent out in sequence.

3:差别式:找与陌生词相似读音的词数据库里的词,然后记录此词与陌生词的读音差别(删增音);这样陌生词的编码信息就是:开始12bit是词数据库里词的寻址地址,接着0,表示删,接着是删的起始位置,删几个元辅音,还可以接着再是0,再删;接着是1,开始增,增的起始位置,增几个元辅音,接着是各元音或辅音或声调的字符(7bit,可为ASCII字符),还可以接着再是1,再增,最后以0结束(在解码时,这些元音和辅音和声调的字符又可看成是组成一个或多个单字拼音的拼音字符拼法的)。解码时,要发陌生词的音时,直接把组成陌生词发音的各单字拼音(每个单字拼音的拼音字符拼法由元音,辅音和声调的字符组成)的语音顺次发出来即可。3: Difference method: find the word in the word database with similar pronunciation to the unfamiliar word, and then record the pronunciation difference between this word and the unfamiliar word (delete and add pronunciation); the coding information of the unfamiliar word is: the first 12bit is the word in the word database Addressing address, followed by 0, means deletion, followed by the starting position of deletion, delete a few consonants, and then 0, and then delete; followed by 1, start to increase, increase the starting position, increase a few Vowels and consonants, followed by the characters of each vowel or consonant or tone (7bit, which can be an ASCII character), it can also be followed by 1, then increase, and finally end with 0 (when decoding, these vowels and consonants and tones Character can be regarded as the pinyin character spelling that forms one or more single-character pinyin again). When decoding, when the sound of an unfamiliar word is to be pronounced, the voices of each single-character pinyin (the spelling of each single-character pinyin is composed of vowels, consonants and tonal characters) that make up the pronunciation of the unfamiliar word can be sent in sequence .

陌生词的发音信息数据即是用元辅音,词,单字拼音(单字拼音也是元辅音组合音码)这些音码中的一种或多种音码来构成的。The pronunciation information data of unfamiliar word promptly is constituted with one or more phonetic codes in these phonetic codes of vowel consonant, word, single-character phonetic alphabet (single-character phonetic alphabet is also vowel-consonant combination phonetic code).

在陌生词的编码信息的前面有编码信息的格式选择信息,以下的汉语的编解码举例中,涉及陌生词的编码信息的格式选择时,以00(二进制)代表元辅音作组合,01代表单字拼音作组合,10代表差别式。In front of the encoding information of unfamiliar words, there is format selection information of encoding information. In the following codec example of Chinese, when the format selection of encoding information of unfamiliar words is involved, 00 (binary) represents the combination of vowels and consonants, and 01 represents single characters. Pinyin is combined, and 10 represents the difference.

如词段里没陌生词不出现111111,那陌生词段也不占BIT,这是显然的。If there are no unfamiliar words in the phrase and 111111 does not appear, then the unfamiliar phrase will not occupy BIT, which is obvious.

显然在一个词段中可以有一个或多个陌生词(一个词段中对应含有多个111111),如一个词段中有多个陌生词,那各个陌生词的具体编码信息(即比如由哪些元辅音或字组成等信息)就按各陌生词在语句里(词段里)的先后顺序依次放在陌生词段里即可。本段所述是技术常识。Obviously, there can be one or more unfamiliar words in a word segment (correspondingly containing a plurality of 111111 in a word segment). Information such as vowels and consonants or word composition) just be placed in the strange word segment successively by each strange word in the sentence (in the word segment) sequentially. This paragraph is technical common sense.

当然在编码规则中也可以不处理陌生词,比如如在读物中某汉语语句遇到陌生词,此语句可不编码且读物的整页面上此语句旁不设码字,或者修改此语句内容,使语句中不再含有陌生词,再对语句进行编码。Of course, unfamiliar words may not be processed in the coding rules. For example, if a Chinese sentence encounters an unfamiliar word in a reading material, this sentence may not be coded and there is no code word next to this sentence on the entire page of the reading material, or modify the content of this sentence so that The sentence no longer contains unfamiliar words, and then the sentence is encoded.

以上10段所述是以只适用于汉语为例的编码规则所采用的陌生词处理方式,而且涉及到解码时陌生词的发音处理是以解码规则是采用整步直接发音方式来播放语句发音为例的(也适用于解码规则是采用分步直接发音方式来播放语句发音的);如解码规则采用分步直接发音方式来播放语句发音,那对于任一个陌生词,在解码过程中得到的此陌生词的发音信息数据(有关陌生词的发音信息数据的具体阐述在后面解码时有述)一般应放在一个部分预备发音数据里(一般不会分开放在几个部分预备发音数据里)。对于非印欧语系的其它语种的陌生词处理也可按汉语这样处理。The above 10 paragraphs describe the unfamiliar word processing method adopted by the coding rules that are only applicable to Chinese as an example, and the pronunciation processing of unfamiliar words when it involves decoding is that the decoding rule adopts the full-step direct pronunciation method to play the sentence pronunciation as Example (also applicable to the decoding rule is to adopt the step-by-step direct pronunciation mode to play the sentence pronunciation); as the decoding rule adopts the step-by-step direct pronunciation mode to play the sentence pronunciation, then for any unfamiliar word, the obtained in the decoding process The pronunciation information data of unfamiliar words (the specific elaboration of the pronunciation information data of relevant unfamiliar words is described when decoding later) should generally be placed in a part of the preparation pronunciation data (generally can not be placed separately in several parts preparation pronunciation data). The treatment of unfamiliar words in other languages of the non-Indo-European language family can also be handled in the same way as Chinese.

时态词形变化的处理:在汉语里,一般没有词的时态词形变化及单复数变化,遇到需表示过去,将来等时态时,一般是附加一些词以表示时态,比如要表示将来时态,句子中一般会加有‘将’这样的时态词,而如需表示复数,则可在名词后加‘们’。所以对汉语(或其它类似汉语的)语种,其时态词形的处理可以是:在编码时不考虑时态词形变化,附加的时态词(如‘将’)或表示复数的词(如‘们’)作为平常词处理。对于韩语或日语这样的语种,其名词也无复数形式,有些时态变化是在动词后面附不同音节(韩语是附音节,音节类似于汉语的字,日语是附假名),另外有些词性的词比如日语里的形容词也是在形容词后面附不同音节(假名)来产生变化,这些有点类似于英语的词形变化(词后缀变化);这后面附的音节可以当成平常词来处理,这样一来音节前面的词(如前述的动词或形容词)就可当成没有时态词形变化的词了(即同汉语一样处理,不考虑时态词形变化),另也可以使用前述的印欧语系语种的时态词形选择机制,将词(如前述的动词或形容词)与其后附加的音节当成一个有不同时态词形的词。(本说明书在作为非印欧语系语种例子的汉语的各编解码举例中是采用不考虑时态词形变化的方式,不设时态词形选择段)Handling of tense and inflection changes: In Chinese, there are generally no tense and inflection changes of words and singular and plural changes. When encountering tenses such as the past and the future, it is generally necessary to add some words to express tenses, such as to To express the future tense, tense words such as 'will' are generally added to the sentence, and if it is necessary to express plurals, 'we' can be added after the noun. Therefore, for Chinese (or other Chinese-like) languages, the processing of the tense word form can be: when encoding, the tense word form change is not considered, and the additional tense words (such as 'will') or plural words ( Such as 'we') are treated as ordinary words. For languages such as Korean or Japanese, there is no plural form for nouns, and some tense changes are attached to different syllables after the verb (Korean has syllables, syllables are similar to Chinese characters, and Japanese has kana), and some parts of speech words For example, adjectives in Japanese are also changed by attaching different syllables (kana) to the end of the adjective. These are somewhat similar to English inflections (word suffix changes); the syllables attached to them can be treated as ordinary words. The previous words (such as the aforementioned verbs or adjectives) can be regarded as words without tense inflections (i.e., treated the same as Chinese, regardless of tense inflection), and the aforementioned Indo-European languages can also be used. The tense word form selection mechanism regards a word (such as the aforementioned verb or adjective) and the additional syllables as a word with different tense word forms. (In this manual, in the codec examples of Chinese as an example of non-Indo-European languages, the method of not considering the change of tense and word form is adopted, and there is no tense and word form selection section)

下面举一个较完整的汉语的编码例子:Here is a more complete example of Chinese encoding:

语句为:’这个漂亮男人是一个工程师’The statement is: 'This beautiful man is an engineer'

依照上述汉语的编码规则,分开编码是:According to the above Chinese encoding rules, the separate encoding is:

头段:header:

01:本语句是汉语语种01: This sentence is in Chinese language

00:词段采用第1种词段编码方式(即频率排序编码方式)00: The word segment adopts the first word segment encoding method (that is, the frequency sorting encoding method)

词段:Phrases:

00110:词段里的词及词组合的个数(6个)00110: the number of words and word combinations in the word segment (6)

00:’这个’词的频率编号的长度信息00: The length information of the frequency number of the word 'this'

011:’这个’词的频率编号011: frequency number of word 'this'

01:111111的频率编号长度信息01: Frequency number length information of 111111

111111:111111代表陌生词(‘漂亮’),(111111也计入词及词组合的个数)111111: 111111 represents an unfamiliar word ('beautiful'), (111111 is also included in the number of words and word combinations)

10:’男人’词的频率编号的长度信息10: length information of the frequency number of the word 'man'

011001100:’男人’词的频率编号011001100: frequency number for the word 'man'

01:’是’词的频率编号的长度信息01: The length information of the frequency number of the word 'is'

010101:’是’词的频率编号010101: Frequency number of the word 'is'

00:’一个’词的频率编号的长度信息00: length information of the frequency number of the word 'one'

110:’一个’词的频率编号110: frequency number of word 'one'

10:’工程师’词的频率编号的长度信息10: length information of the frequency number of the word 'engineer'

011101010:’工程师’词的频率编号011101010: frequency number for the word 'engineer'

陌生词段:Unfamiliar phrases:

01:陌生词的编码信息的格式选择信息(单字拼音作组合格式)01: Format selection information for encoding information of unfamiliar words (single-character pinyin as combined format)

01:单字拼音个数(2个)01: The number of pinyin of a single character (2)

01001011010:单字’漂’的拼音在单字拼音表里的顺序号01001011010: the sequence number of the pinyin of the single character 'Piao' in the single-character pinyin table

00101110000:单字’亮’的拼音在单字拼音表里的顺序号00101110000: the sequence number of the pinyin of the single character 'liang' in the single-character pinyin table

把以上各BIT按顺序合起来得语句的编码值为(这里的语句的编码值是一个处理单位的值,也是表示语句发音的拼合信息的值):Combine the above BITs in order to get the coded value of the sentence (the coded value of the sentence here is the value of a processing unit, and also the value of the combined information representing the pronunciation of the sentence):

01 00 00110000110111111110011001100010101010011001 00 001100001101111111100110011000101010100110

10 0111010100101010010110100010111000010 01110101001010100101101000101110000

(本词段编码方式里,各词及词组合的频率编号除了可以是本例示的阶梯编号外,还可以是等长编号或哈夫曼编号等;如为等长编号,哈夫曼编号那其前面不用带频率编号长度信息。)。(in this speech segment encoding mode, the frequency numbering of each word and word combination can also be equal-length numbering or Huffman numbering etc. except the ladder numbering of this example; as for equal-length numbering, Huffman numbering that There is no frequency number length information in front of it.).

解码是拾音器的单片机芯片按照解码规则,将拾音器用读码头读有声读物的整页面上一个或多个码字而得到的表示某语种的语句发音的拼合信息的值,还原成语句的全部发音或非全部发音。解码规则可以是多种多样的,一个解码规则可以是只适用于一个语种或可适用于多个语种的;由某个只适用于一个语种或适用于多个语种的编码规则中有关某语种的编码细则来编码生成的表示此语种的语句发音的拼合信息的值可以是由某个只适用于一个语种或适用于多个语种的解码规则中有关此语种的解码细则来解码的(本句和上句说的‘一个语种’,‘某语种’,‘此语种’都是相同所指)。Decoding means that the single-chip microcomputer chip of the pickup uses the reading terminal to read one or more codewords on the entire page of the audiobook, and restores the value of the combined information representing the pronunciation of a sentence in a certain language to the entire pronunciation of the sentence or Not all pronounced. Decoding rules can be varied, and a decoding rule can be applicable only to one language or applicable to multiple languages; Encoding rules to encode the generated value of the combined information representing the sentence pronunciation of this language can be decoded by the decoding rules about this language in a decoding rule that is only applicable to one language or applicable to multiple languages (this sentence and The 'a language', 'a certain language', and 'this language' mentioned in the previous sentence all refer to the same).

解码涉及的硬件发音的方式可以是直接产音方式或TTS产音方式。如使用直接产音方式,那实现发音(也即播出语音)的可以是主MCU,也可以是外接的语音芯片(此语音芯片可与主MCU连接使用,此语音芯片不是TTS芯片);如述及时无特别指明有其他情况,在本说明书中各处述及直接产音方式时,是以主MCU发音的方式为例来阐述的;如使用TTS产音方式,那实现发音(也即播出语音)的是TTS器件(比如可以是TTS芯片)。The hardware pronunciation method involved in the decoding may be a direct sound generation method or a TTS sound generation method. If the direct sound production method is used, then the main MCU or an external voice chip (this voice chip can be connected to the main MCU, and this voice chip is not a TTS chip) can be used to realize the pronunciation (that is, broadcast voice); if When mentioning, there are no other situations that are specifically indicated. When referring to the direct sound production mode in this manual, the mode of the main MCU pronunciation is used as an example to illustrate; if the TTS sound production mode is used, then the pronunciation (that is, broadcast output voice) is a TTS device (for example, it can be a TTS chip).

直接产音方式,是先在拾音器里存储诸音码的发音数据,在解码时,用从诸音码的发音数据中得到的多个音码的发音数据(比如多个音码的录音数据)连续发音(在分步直接发音时在听觉效果上仍是连续的)以形成语句的全部发音或语句的一些发音。如拾音器在对一拼合信息的值解码时硬件发音总是采用直接产音方式,那拾音器的MCU在对此一拼合信息的值解码时,是采取直接发音方式来解码的(而且如一拾音器的MCU在对一拼合信息的值解码时是采取直接发音方式来解码的,那此拾音器在对此拼合信息的值解码时硬件发音总是采用直接产音方式的),直接发音方式可以是整步直接发音方式或分步直接发音方式或本说明书提及的其它更多具体形式的直接发音方式或本说明书没提及的其它更多具体形式的直接发音方式。Direct sound production mode is to store the pronunciation data of all sound codes in the pickup earlier, when decoding, use the pronunciation data of a plurality of sound codes (such as the recording data of a plurality of sound codes) obtained from the pronunciation data of all sound codes Continuous utterances (still aurally continuous when articulated in steps) to form all utterances of a sentence or some utterances of a sentence. When decoding the value of a mosaic information as the pickup, the hardware pronunciation always adopts the direct sound production mode, then the MCU of the pickup adopts the direct pronunciation mode to decode (and as the MCU of a pickup) when decoding the value of this mosaic information When decoding the value of a combination of information, it adopts the direct pronunciation mode to decode, then the hardware pronunciation of the pickup always adopts the direct sound production mode when decoding the value of the combination information), and the direct pronunciation mode can be the whole step directly Pronunciation mode or step-by-step direct pronunciation mode or the direct pronunciation mode of other more specific forms mentioned in this manual or the direct pronunciation mode of other more specific forms not mentioned in this manual.

TTS产音方式,是在解码时采用用TTS器件发音的方式来播出语句的全部发音或语句的一些发音。如拾音器在对一拼合信息的值解码时硬件发音总是采用TTS产音方式,那拾音器的MCU在对此一拼合信息的值解码时,是采取TTS发音方式来解码的(而且如一拾音器的MCU在对一拼合信息的值解码时是采取TTS发音方式来解码的,那此拾音器在对此拼合信息的值解码时硬件发音总是采用TTS产音方式的),TTS发音方式可以是整步TTS发音方式或分步TTS发音方式或本说明书提及的其它更多具体形式的TTS发音方式或本说明书没提及的其它更多具体形式的TTS发音方式;如是整步TTS发音方式,解码时MCU得到语句的预备发音数据(语句的拼法格式数据)后,将语句的全部预备发音数据传给TTS器件(比如TTS芯片)播出语句的全部发音,或将语句的非全部预备发音数据传给TTS器件(比如TTS芯片)播出语句的非全部发音;如是分步TTS发音方式,解码时MCU每次得到一个部分预备发音数据(语句的部分拼法格式数据),并将此部分预备发音数据传给TTS器件(比如TTS芯片)播放发音,按这样MCU把组成语句的全部预备发音数据(语句的拼法格式数据)的那些部分预备发音数据(语句的部分拼法格式数据)依次分次传给TTS器件发音,即形成语句的全部发音(或按这样MCU把组成语句的非全部预备发音数据(即语句的非全部的拼法格式数据)的那些部分预备发音数据(即语句的部分拼法格式数据)依次分次传给TTS器件发音,即形成语句的非全部发音)。The TTS sound production method is to broadcast all the pronunciations of the sentence or some pronunciations of the sentence by using the TTS device pronunciation mode during decoding. When decoding the value of a mosaic information as the pickup, the hardware pronunciation always adopts the TTS sound production mode, then the MCU of the pickup adopts the TTS pronunciation mode to decode (and as the MCU of a pickup) when decoding the value of this mosaic information When decoding the value of a mosaic information, the TTS pronunciation method is adopted to decode, then the pickup always adopts the TTS sound production method when decoding the value of the mosaic information), and the TTS pronunciation method can be full-step TTS Pronunciation method or step-by-step TTS pronunciation method or other more specific forms of TTS pronunciation methods mentioned in this manual or other more specific forms of TTS pronunciation methods not mentioned in this manual; if it is a full-step TTS pronunciation method, MCU when decoding After obtaining the prepared pronunciation data (spelling format data) of the sentence, pass all the prepared pronunciation data of the sentence to the TTS device (such as a TTS chip) to broadcast the whole pronunciation of the sentence, or pass the non-all prepared pronunciation data of the sentence to TTS devices (such as TTS chips) broadcast non-full pronunciation of sentences; as in the step-by-step TTS pronunciation mode, MCU obtains a part of prepared pronunciation data (partial spelling format data of sentences) at a time during decoding, and uses this part of prepared pronunciation data Pass to TTS device (such as TTS chip) and play pronunciation, by MCU in this way, those part prepared pronunciation data (partial spelling format data of sentence) that make up all the prepared pronunciation data (spelling format data of sentence) of sentence are transmitted successively Pronunciation to the TTS device, i.e. forming the whole pronunciation of the sentence (or those parts of the non-all preparation pronunciation data (that is, the spelling format data of the sentence) that form the sentence by the MCU in this way (that is, the partial spelling of the sentence) Format data) is passed to the TTS device in turn for pronunciation, that is, not all pronunciations of the sentence are formed).

拾音器解码涉及的硬件发音方式除了可用直接产音方式或TTS产音方式外,可能还可用别的产音方式,这些别的产音方式应是直接产音方式或TTS产音方式的等同替换,如采用别的产音方式,那解码时要采取相应的发音方式,解码规则甚至编码规则一般都需相应调整。In addition to the direct sound production method or the TTS sound production method, other sound production methods may be used for the hardware pronunciation method involved in the pickup decoding. These other sound production methods should be equivalent replacements of the direct sound production method or the TTS sound production method. If other sound production methods are used, the corresponding pronunciation method should be adopted when decoding, and the decoding rules and even the encoding rules generally need to be adjusted accordingly.

这里先讲有关直接产音方式的内容,至于有关TTS产音方式的内容在后面述。Here we will first talk about the content of the direct sound production method, and the content of the TTS sound production method will be described later.

整步直接发音方式是指,在解码时,按拼合信息的值,MCU把各音码的发音信息数据(比如词形音标或词形拼音)按某种约定顺序排组起来成预备发音数据,并且把预备发音数据中的全部音码中各音码的发音信息数据按它们对应的发音在语句发音中的排列顺序依次转换成其对应的发音(比如由主MCU以播放ADPCM格式的录音数据的方式来播音)连续播放出来,即为语句的全部发音,或把预备发音数据中的部分音码中各音码的发音信息数据按它们对应的发音在语句发音中的排列顺序依次转换成其对应的发音(比如由主MCU以播放ADPCM格式的录音数据的方式来播音)连续播放出来,即为语句的非全部发音。一般情况,整步直接发音方式是指,在解码时,按拼合信息的值,MCU把各音码的发音信息数据(比如词形音标或词形拼音)按这些数据对应的发音在语句发音中的顺序排组起来成预备发音数据,并且把预备发音数据中的各音码的发音信息数据按它们在预备发音数据中的排列顺序依次转换成其对应的发音(比如由主MCU以播放ADPCM格式的录音数据的方式来播音)连续播放出来,即为语句的全部发音,如述及时无特别指明有其他情况,本说明书各处在述及整步直接发音方式时默认是以从本段第7行‘一般情况’至本段第12行‘全部发音’为止所述方式为例的。整步直接发音方式适用于印欧语系各语种和非印欧语系各语种和任一复合语种。整步直接发音方式在本发明文件里有反复叙述。The full-step direct pronunciation method means that when decoding, according to the value of the combined information, the MCU arranges the pronunciation information data (such as word-form phonetic symbols or word-form pinyin) of each phonetic code into a preparatory pronunciation data in a certain agreed order. And the pronunciation information data of each phonetic code in the whole phonetic code in the preparation pronunciation data is converted into its corresponding pronunciation successively by their corresponding pronunciation in the order of arrangement in the sentence pronunciation (such as by the main MCU to play the recording data of ADPCM format method to broadcast) to play out continuously, which is the whole pronunciation of the sentence, or the pronunciation information data of each phonetic code in the part of the phonetic codes in the prepared pronunciation data is converted into its corresponding pronunciation in sequence according to their corresponding pronunciation in the sentence pronunciation. The pronunciation of (for example, broadcast by the main MCU in the form of recording data in ADPCM format) is played continuously, that is, the partial pronunciation of the sentence. In general, the full-step direct pronunciation method refers to that when decoding, according to the value of the combined information, the MCU puts the pronunciation information data (such as word-form phonetic symbols or word-form pinyin) of each phonetic code in the sentence pronunciation according to the pronunciation corresponding to these data The order of arrangement is arranged into preparation pronunciation data, and the pronunciation information data of each phonetic sign indicating number in the preparation pronunciation data is converted into its corresponding pronunciation successively by their order of arrangement in the preparation pronunciation data (such as by main MCU to play ADPCM format The way of recording data to broadcast) is played continuously, that is, the entire pronunciation of the sentence. If there are no other circumstances specified in the above, when referring to the whole-step direct pronunciation method in this manual, the default is to start from the seventh paragraph of this paragraph. Line 'general situation' to the 12th line of this paragraph 'all pronunciation' as an example. The full-step direct pronunciation method is applicable to all Indo-European languages and non-Indo-European languages and any compound languages. Whole-step direct pronunciation mode has repeated narration in the document of the present invention.

对于印欧语系语种涉及的整步直接发音方式,在以下英语各解码举例中,MCU在解码时,按拼合信息的值进行解码,解码的最后几步,把解出的各代表音码的编号(比如代表单词及单词组合音码的频率编号,或比如用以组成陌生单词发音的词根,元辅音音码的顺序号)转成其发音信息数据(如词形音标),并把这些音码的发音信息数据按这些数据对应的发音在语句发音中的顺序排组起来,即得语句的预备发音数据,再可由MCU把各音码的发音信息数据按它们在预备发音数据中的排列顺序依次转换成其对应的语音连续发出音来(对于英语,可由MCU用各音标对应的ADPCM格式的录音数据,对预备发音数据中的各单词的词形音标按它们在预备发音数据中的排列顺序依次连续播放出它们对应的发音来),即为语句的全部发音。For the full-step direct pronunciation method involved in the Indo-European languages, in the following decoding examples of English, the MCU decodes according to the value of the combined information when decoding. (such as representing the frequency number of words and word combination phonetic codes, or such as the root used to form the pronunciation of unfamiliar words, the sequence number of vowel and consonant phonetic codes) into its pronunciation information data (such as word-form phonetic symbols), and these phonetic codes The pronunciation information data of each phonetic code is arranged according to the order of the pronunciation corresponding to these data in the pronunciation of the sentence, and the prepared pronunciation data of the sentence is obtained, and then the pronunciation information data of each phonetic code can be sequentially arranged in the prepared pronunciation data by the MCU Convert to its corresponding voice and continue to pronounce (for English, the recording data of the ADPCM format corresponding to each phonetic symbol can be used by the MCU, and the word-form phonetic symbols of each word in the preliminary pronunciation data are pressed in order in their order in the preliminary pronunciation data Continuously play out their corresponding pronunciations), which is the entire pronunciation of the sentence.

对于非印欧语系语种涉及的整步直接发音方式,在以下汉语各解码举例中,MCU在解码时,按拼合信息的值进行解码,解码的最后几步,把解出的各代表音码的编号(比如代表词及词组合音码的频率编号,或比如用以组成陌生词发音的单字拼音的顺序号)转成其发音信息数据(如词形拼音),并把这些音码的发音信息数据按这些数据对应的发音在语句发音中的顺序排组起来,即得语句的预备发音数据(本段说的单字拼音也是元音辅音组合音码),再可由MCU把各音码的发音信息数据按它们在预备发音数据中的排列顺序依次转换成其对应的语音连续发出音来(对于汉语,可由MCU用各带调音节对应的ADPCM格式的录音数据,对预备发音数据中的各词的词形拼音按它们在预备发音数据中的排列顺序依次连续播放出它们对应的发音来),即为语句的全部发音。For the full-step direct pronunciation method involved in non-Indo-European languages, in the following Chinese decoding examples, the MCU decodes according to the value of the combined information when decoding. In the last few steps of decoding, the decoded representative phonetic codes Numbering (such as the frequency numbering of representative words and word combination phonetic codes, or such as the sequence number used to form the single-character pinyin of unfamiliar word pronunciation) is converted into its pronunciation information data (such as word form pinyin), and the pronunciation information of these phonetic codes The data are arranged in the order of the pronunciation corresponding to these data in the pronunciation of the sentence to get the prepared pronunciation data of the sentence (the single-character pinyin mentioned in this paragraph is also the combined sound code of vowel and consonant), and then the pronunciation information of each sound code can be processed by the MCU. The data is converted into its corresponding voice successively by their order of arrangement in the preparation pronunciation data and is pronounced continuously (for Chinese, the recording data of the ADPCM format corresponding to each band tone syllable can be used by the MCU, to each word in the preparation pronunciation data Word form pinyin plays their corresponding pronunciation successively successively according to their sequence in the prepared pronunciation data), which is the whole pronunciation of the sentence.

如拾音器采用直接产音方式,一般是需把涉及发音的诸音码的发音数据(比如英语语种时各元辅音的录音数据)放在与拾音器有关的存储器集里(这种情况比如是用主MCU来发音的),另也可放在只能被MCU外接的语音芯片读取的存储器里(这种情况一般是由MCU把语句的全部或非全部预备发音数据传给语音芯片,由语音芯片自己发音,预备发音数据是指比如按语句发音顺序排列的表示英语语句发音的各音标的编号信息或按语句发音顺序排列的表示汉语语句发音的各拼音的代号信息,这里说的传语句的全部或非全部预备发音数据是指整步直接发音方式时;而对于分步直接发音方式,每次是传部分预备发音数据)。本说明书各处说的拾音器里存储的诸音码的发音数据(或诸音码的录音数据)所涉及的诸音码的发音数据在拾音器里存储的情况一般就是如本段所说的两种存储情况。Adopt the direct sound production mode as pickup, generally be to need the pronunciation data (the recording data of each vowel consonant during the time such as English language kind) to be placed in the memory collection relevant with pickup (this situation is such as using main It is pronounced by the MCU), and it can also be placed in a memory that can only be read by an external voice chip connected to the MCU (in this case, the MCU generally transmits all or part of the prepared pronunciation data of the sentence to the voice chip, and the voice chip Self-pronunciation, prepared pronunciation data refers to the numbering information of each phonetic symbol that represents the pronunciation of English sentences arranged in the sentence pronunciation order or the code number information of each pinyin representing the pronunciation of Chinese sentences arranged in the sentence pronunciation order, the whole of the passed sentence is said here Or not all of the prepared pronunciation data refers to the whole-step direct pronunciation mode; and for the step-by-step direct pronunciation mode, it is to transmit part of the prepared pronunciation data each time). The pronunciation data of all phonetic codes (or the recording data of all phonetic codes) involved in the pronunciation data of all phonetic codes stored in the pickup said in this specification is stored in the pickup generally is exactly as this paragraph said two kinds storage situation.

另在解码采用直接发音方式时,为了使音码发音组合起来的语音听起来更自然,处理或衔接语音波形时可采取一些措施使语音衔接更流畅和更合乎发音习惯,这可参考现成的语音合成技术。解码采用直接发音方式时用于播出语音(也即发音)的可以是主MCU(本发明文件有此约定:如是主MCU用喇叭播出语音,那发音器件是主MCU外接的喇叭)或主MCU片外接的被当作发音器件的带喇叭的语音芯片(这里讲的语音芯片比如可以是采用播放ADPCM格式录音数据方式并且接受主MCU发来的ADPCM格式录音数据来播出语音的,而不是TTS芯片,因这里讲的是直接产音方式,至于TTS产音方式在后面述)。In addition, when the decoding adopts the direct pronunciation method, in order to make the voice combined with the pronunciation of the phonetic code sound more natural, some measures can be taken to make the voice connection smoother and more in line with the pronunciation habits when processing or connecting the voice waveform. This can refer to the ready-made voice synthesis technology. When the decoding adopts the direct pronunciation mode, it can be the main MCU (the document of the present invention has this agreement: if the main MCU broadcasts the voice with the loudspeaker, then the pronunciation device is the external loudspeaker of the main MCU) or the main MCU for broadcasting the voice (that is, pronunciation) A voice chip with a speaker that is externally connected to the MCU and is used as a pronunciation device (the voice chip mentioned here can, for example, use the method of playing ADPCM format recording data and accepting the ADPCM format recording data sent by the main MCU to broadcast voice, instead of TTS chip, so the direct sound production method is mentioned here, as for the TTS sound production method, it will be described later).

另外,对整步直接发音方式还可变通一下,即在对拼合信息的值进行解码时,把各播放发音的动作穿插在解码过程中,这些播放发音的动作产生的发音在听觉上是连续的而且可形成语句的全部发音或非全部发音,可称这种解码过程中播放语句发音的方式为分步直接发音方式;比如语句的全部(或非全部)预备发音数据由一些部分预备发音数据组成,在解码过程中每得到一个发音接续的部分预备发音数据,就将此部分预备发音数据转换成其对应的发音播放出音来,照这样可把这些部分预备发音数据依次分次转换成发音,直到解码完成为止,这些发音在听觉上是连续的即形成所解码语句的全部发音(或非全部发音),本说明书默认是以这种形式来阐述分步直接发音方式的;这里说的发音接续的部分预备发音数据是指:在解码过程中,如刚才得到一个部分预备发音数据并将其转换成发音了,而在需发的音中紧接此刚才一个部分预备发音数据对应的发音后的发音,正是现在得到的这个部分预备发音数据对应的发音,那现在得到的这个部分预备发音数据就是发音接续的部分预备发音数据(本发明文件其它各处述及分步直接发音方式时,提及的发音接续的部分预备发音数据也是按这样理解);如解码过程中得到一个部分预备发音数据但不是发音接续的部分预备发音数据,那就可先放一下等解码到该此部分预备发音数据发音时再将其转换成发音,当然解码时得到的此次对一拼合信息的值解码发音中需最开始发音对应的部分预备发音数据应算为是发音接续的部分预备发音数据并应将其转换成发音;分步直接发音方式还可以有别的具体形式,比如是采用一种语音芯片发音,MCU这边在解码过程中分次(不一定按发音次序)把各部分预备发音数据传给此语音芯片,但要到语句的全部或非全部预备发音数据(或约定好的所有部分预备发音数据)全部传完后语音芯片才发音,这种形式可看成是前面那种(分步)形式的等同替换。显然一个部分预备发音数据是由数个音码的发音信息数据组成,这些数据一般是按它们对应的发音在此部分预备发音数据对应的发音中的顺序排列的(也可按其它约定顺序排列);分步直接发音方式可适用于印欧语系各语种和非印欧语系各语种和任一复合语种。In addition, the whole-step direct pronunciation method can also be modified, that is, when decoding the value of the combined information, the actions of playing pronunciations are interspersed in the decoding process, and the pronunciations produced by these actions of playing pronunciations are continuous in hearing And can form the whole pronunciation of sentence or non-full pronunciation, can claim that the mode of playing sentence pronunciation in this decoding process is step-by-step direct pronunciation mode; For example all (or non-all) preparation pronunciation data of sentence are made up of some part preparation pronunciation data , in the decoding process, each part of the prepared pronunciation data that is obtained as a pronunciation continuation is converted into its corresponding pronunciation and played out. In this way, these parts of the prepared pronunciation data can be converted into pronunciations successively. Until the decoding is completed, these pronunciations are auditory continuous, that is, all pronunciations (or not all pronunciations) of the decoded sentence are formed. This manual uses this form by default to explain the step-by-step direct pronunciation method; the pronunciation mentioned here continues Part of the prepared pronunciation data refers to: in the decoding process, if a part of the prepared pronunciation data has just been obtained and converted into a pronunciation, and in the sound to be issued immediately after the pronunciation corresponding to the part of the prepared pronunciation data just now Pronunciation is the pronunciation corresponding to this part of the preparation pronunciation data obtained now, and this part of the preparation pronunciation data obtained now is exactly the part preparation pronunciation data of pronunciation continuation (when other places of the document of the present invention refer to the step-by-step direct pronunciation mode, mention The part of the preparatory pronunciation data of the continuation of the pronunciation is also understood in this way); if a part of the preparatory pronunciation data is obtained during the decoding process but not the partial preparatory pronunciation data of the continuation of the pronunciation, then you can first wait for a while to decode to this part of the preparatory pronunciation data Then convert it into pronunciation during pronunciation. Of course, the value of the combined information obtained during decoding will be the part of the preliminary pronunciation data corresponding to the initial pronunciation in the decoding pronunciation. Converted into pronunciation; the step-by-step direct pronunciation method can also have other specific forms, such as using a voice chip to pronounce, and the MCU sends each part of the prepared pronunciation data to the This voice chip, but the voice chip will not pronounce until all or part of the prepared pronunciation data of the sentence (or all parts of the prepared pronunciation data agreed upon) are all transmitted, this form can be regarded as the previous one (step by step) equivalent substitution of the form. Obviously a part of the prepared pronunciation data is made up of the pronunciation information data of several phonetic codes, and these data are generally arranged by their corresponding pronunciations in the pronunciation corresponding to this part of the prepared pronunciation data (also can be arranged in other agreed orders) ; The step-by-step direct pronunciation method can be applied to various languages of the Indo-European language family and non-Indo-European languages and any compound language.

对于非印欧语系语种的分步直接发音方式,举一个例子:比如汉语,如对某语句的编码值(一个处理单位的值)进行解码,此语句编码值是主要以词及词组合为音码来编码的,其词段编码采用本说明书述的频率排序编码方式,如编码语句中无陌生词,并且假设对词段编码时不使用前后调用处理;MCU在解码时,先查语句编码值知道无陌生词,并且汉语本来无时态词形选择段,接着看词段,就可以在词段里每取一个词或词组合的频率编号(按频率排序解码方式,取出词段里词及词组合的频率编号的顺序就是这些词及词组合原来在语句中的顺序),就在词数据库里去查此词或词组合的发音信息数据(比如词形拼音),此发音信息数据就是部分预备发音数据,并由MCU调发音信息数据(比如词形拼音)的发音数据将此词的发音播放出来(比如是按播放ADPCM格式录音数据的方式,假设单字拼音表里的约1300个不同的单字拼音也即带调音节都被各自录音成录音数据),接着在词段里再取下个词或词组合的信息,并依此办法播放下个词或词组合的发音(即每解出一个词或词组合,就播这个词或词组合的音),直到把词段里的所有词及词组合都取完播放完,解码过程完成,这些词及词组合的发音就形成语句的全部发音。(实际上每次解出的部分预备发音数据,可以是一个词或词组合的发音信息数据,也可以是多个词或词组合的发音信息数据)For the step-by-step direct pronunciation method of non-Indo-European languages, take an example: such as Chinese, such as decoding the coded value (value of a processing unit) of a sentence, the coded value of this sentence is mainly based on words and word combinations. Code to code, the word segment code adopts the frequency sorting coding method described in this manual, if there are no unfamiliar words in the coded sentence, and it is assumed that the word segment code is not used before and after calling processing; when the MCU decodes, first check the sentence code value Know that there are no unfamiliar words, and Chinese has no tense word form selection segment, and then look at the word segment, you can get the frequency number of each word or word combination in the word segment (according to the frequency sorting decoding method, take out the word and word in the word segment The order of the frequency numbers of word combinations is exactly the order of these words and word combinations in the sentence), just go to look up the pronunciation information data (such as word shape pinyin) of this word or word combination in the word database, this pronunciation information data is exactly part Prepare the pronunciation data, and play the pronunciation of the word by the pronunciation data of the pronunciation information data (such as word shape pinyin) by the MCU (for example, by playing the mode of recording data in ADPCM format, assuming about 1300 different characters in the single-character pinyin table Single-character pinyin (i.e., tone-banded syllables are all recorded into recording data respectively), and then the information of the next word or word combination is removed in the word segment, and the pronunciation of the next word or word combination is played in this way (that is, every solution A word or combination of words, just broadcast the sound of this word or combination of words), until all the words and combination of words in the word segment have been taken and played, the decoding process is completed, and the pronunciation of these words and combination of words will form the whole of the sentence pronounce. (In fact, the part of the prepared pronunciation data solved each time can be the pronunciation information data of a word or a combination of words, or the pronunciation information data of a plurality of words or a combination of words)

此分步直接发音方式对印欧语系各语种也是可以的,举一个例子:可以在编解码时不使用时态词形选择段,比如英语,比如把元辅音及元音辅音组合作为音码,使用类似本说明书述的频率排序那样的编码方式来对单词段编码(编码时只涉及对单词来编码,单词组合都分拆成单词来处理),而且编码时不涉及处理陌生单词(因所有单词都可由元辅音及元音辅音组合组成起来表示),也不用带时态词形选择段,并且假设对单词段编码时不使用前后调用处理;MCU在解码时,可每次在语句编码值的单词段里解出组成一个单词的元辅音及元音辅音组合(单词都是由元辅音及元音辅音组合组成的;当然单词段里各单词的数据之间需有间隔标志,这里说的单词的数据是指组成单词发音的元辅音及元音辅音组合数据;按类似于频率排序那样的解码方式,取出单词段里单词的数据的顺序就是这些单词原来在语句中的顺序),组成此单词的元辅音及元音辅音组合就是单词的发音信息数据,也是部分预备发音数据,就将此单词的发音信息数据(比如词形音标,可由组成此单词的元辅音及元音辅音组合得到)转成语音播出音来(比如是按播放ADPCM格式录音数据的方式,顺次播放出各音标的音来),照此方式逐个解出单词段里的全部单词(且每解出一个单词,就播这个单词的音),解码过程完成,这些单词的发音就形成语句的全部发音。(实际上每次解出的部分预备发音数据,可以是一个单词或单词组合的发音信息数据,也可以是多个单词或单词组合的发音信息数据)This step-by-step direct pronunciation method is also possible for various languages of the Indo-European language family. For example: you can not use tense word form selection segments when encoding and decoding, such as English, such as combining vowel consonants and vowel consonants as phonetic codes, Use the encoding method similar to the frequency sorting described in this manual to encode word segments (encoding only involves encoding words, and word combinations are split into words for processing), and encoding does not involve processing unfamiliar words (because all words can be composed of vowels and consonants and vowels and consonants), and there is no need to have a tense word form selection segment, and it is assumed that the front and rear call processing is not used when encoding the word segments; Solve the vowels and consonants and combinations of vowels and consonants that make up a word in the word segment (words are composed of vowels and consonants; of course, there must be interval marks between the data of each word in the word segment, the words mentioned here The data refers to the vowel and consonant and vowel and consonant combination data that make up the pronunciation of the word; according to the decoding method similar to frequency sorting, the order of taking out the data of the words in the word segment is the original order of these words in the sentence), to form this word The vowel and consonant and the combination of vowel and consonant are the pronunciation information data of the word, which is also part of the prepared pronunciation data, so the pronunciation information data of the word (such as the phonetic symbol of the word shape can be obtained by the combination of the vowel and consonant and the vowel and consonant of the word) Become voice and broadcast the sound (for example, by playing the mode of ADPCM format recording data, the sound of each phonetic symbol is played out sequentially), in this way, solve all the words in the word segment one by one (and every time you solve a word, just Play the sound of this word), the decoding process is completed, and the pronunciation of these words just forms the whole pronunciation of sentence. (In fact, the part of the prepared pronunciation data solved each time can be the pronunciation information data of a single word or word combination, or the pronunciation information data of multiple words or word combinations)

对于非印欧语系各语种和印欧语系各语种,解码采用分步直接发音方式时,在每个部分预备发音数据的发音之后,需有点延时,MCU在这个延时期间要解码取下一个发音接续的部分预备发音数据,然后再播这下一个部分预备发音数据的音,不过这个延时很短,对形成语句的全部发音和非全部发音没什么影响,而且可在划分各部分预备发音数据时,使每个部分预备发音数据的发音之后本来就需有一小段静音延时(比如每个部分预备发音数据的发音是一个词的发音,每个词的发音之后本来就需有一小段静音延时)。For non-Indo-European languages and Indo-European languages, when the decoding adopts the step-by-step direct pronunciation method, after the pronunciation of each part of the prepared pronunciation data, there is a delay, and the MCU needs to decode and take the next one during this delay period. Pronunciation continues part of the prepared pronunciation data, and then broadcast the sound of the next part of the prepared pronunciation data, but this delay is very short, it has no effect on the entire pronunciation and non-full pronunciation of the formed sentence, and can be divided into each part of the prepared pronunciation data When making the pronunciation of each part of the prepared pronunciation data, a small section of silence delay is originally needed (such as the pronunciation of each part of the prepared pronunciation data is the pronunciation of a word, and a small section of silence delay is originally required after the pronunciation of each word) ).

对于某语种,因为解码时可以把语句的预备发音数据全解码出来后一起发音,也可以把各发音动作穿插在解码过程中,即拾音器的解码程序里的对此语种的拼合信息的值的解码动作与发音动作可能是交织的(直接发音方式或TTS发音方式时都有这种可能),而解码程序又是根据解码规则中有关此语种的解码细则来解码的,所以本发明文件述及的编解码规则举例里是把用发音形式播放语句发音的解码细则也归入解码规则中有关此语种的解码细则里的;如果解码规则中有关此语种的解码细则里不包括用发音形式播放语句发音的解码细则,此解码规则中有关此语种的解码细则只用于表示如何把语句的预备发音数据全解码出来,而用解码出来的语句的全部或非全部预备发音数据来发音则是解码规则中有关此语种的解码细则以外的事,显然这里解码和发音两步骤可等同看成是一个步骤,所以就可等同看成此解码规则中有关此语种的解码细则里包括了用发音形式播放语句发音的解码细则,其技术特征及处理等可参考本说明书其它地方讨论的解码规则中有关某语种的解码细则(这句话说的解码规则中有关某语种的解码细则是含有用发音形式播放语句发音的解码细则的)。For a certain language, when decoding, the preparatory pronunciation data of the sentence can be decoded and pronounced together, and each pronunciation action can also be interspersed in the decoding process, that is, the decoding of the value of the combination information of the language in the decoding program of the pickup Actions and pronunciation actions may be intertwined (all have this possibility during direct pronunciation mode or TTS pronunciation mode), and the decoding program decodes according to the decoding rules about this language in the decoding rules, so the documents mentioned in the present invention In the codec rule example, the decoding rules for playing sentence pronunciation in pronunciation form are also included in the decoding rules for this language in the decoding rules; if the decoding rules for this language in the decoding rules do not include playing sentence pronunciation in pronunciation form The decoding rules for this language in this decoding rule are only used to indicate how to decode all the pre-pronunciation data of the sentence, and use all or part of the pre-pronunciation data of the decoded sentence to pronounce is in the decoding rule For things other than the decoding rules for this language, obviously the two steps of decoding and pronunciation can be regarded as one step, so it can be regarded as the same as the decoding rules for this language in this decoding rule. For the decoding rules, its technical features and processing, etc., please refer to the decoding rules about a certain language in the decoding rules discussed elsewhere in this manual (the decoding rules about a certain language in the decoding rules in this sentence include playing the pronunciation of sentences in the form of pronunciation) decoding details).

如在以下举例中没特别指明有其他情况,以下直播方式的以只适用于英语为例或以只适用于汉语为例的解码规则涉及的发音形式默认是全用整步直接发音方式的,以下直播方式的以只适用于英语为例或以只适用于汉语为例的解码规则涉及的各解码举例默认是以整步直接发音方式为例的。If there are no other circumstances specified in the following examples, the pronunciation forms involved in the decoding rules of the following live broadcast methods are only applicable to English as an example or only applicable to Chinese as an example. By default, the full-step direct pronunciation method is used. The following In the live broadcast mode, the decoding rules that only apply to English as an example or only apply to Chinese as an example refer to the whole-step direct pronunciation method as an example by default.

用于直接产音方式的被当成发音器件(也是子发音器件)的带喇叭的语音芯片的技术特征,在本说明书第123页有述(而本说明书第123页和第124页在对单片机芯片使发音器件播放语音的方式进行解释时述及的语音芯片是涉及语音芯片的应用的)。可以是MCU外接一种形式的语音芯片,由MCU把从拾音器主体内的存储器里(或数据卡上的存储器里或部件式数据卡上的存储器里)的诸音码的录音数据得到的一些音码的录音数据(比如ADPCM格式的录音数据)通过通讯线传给外接的语音芯片(如是用整步直接发音方式,MCU就一次把组成语句发音的全部或部分音码中各音码的录音数据传给语音芯片发音,如是用分步直接发音方式,MCU就在解码过程中分多次每次把一些音码的录音数据传给语音芯片发音),由带喇叭的外接的语音芯片(带喇叭的外接的语音芯片为发音器件)播出语句的全部发音或非全部发音;或者是MCU外接另一种形式的语音芯片,如是用整步直接发音方式,就由MCU一次把语句的全部或非全部预备发音数据(比如按语句发音顺序排组起来的表示英语语句发音的各音标的编号信息或按语句发音顺序排组起来的表示汉语语句发音的各拼音的代号信息)通过通讯线传给外接的语音芯片以播出语句的全部发音或非全部发音,如是用分步直接发音方式,MCU就在解码过程中分次把各部分预备发音数据(部分预备发音数据是指比如按发音顺序排组起来的表示部分英语语句发音的各音标的编号信息或按发音顺序排组起来的表示部分汉语语句发音的各拼音的代号信息)传给语音芯片发音,带喇叭的外接的这种形式语音芯片(带喇叭的外接的语音芯片为发音器件)是从其存储器(此存储器一般仅能被此语音芯片读取而不能被作为主处理芯片的MCU读取)里的诸音码的录音数据得到一些音码的录音数据(比如ADPCM格式的音标或拼音的录音数据)并用这些音码的录音数据来发音的。The technical characteristics of the voice chip with the speaker as the sounding device (also a sub-sounding device) for the direct sound production mode are described on the 123rd page of this specification (and the 123rd and 124th pages of this specification are in the single-chip microcomputer chip The voice chip mentioned when explaining how to make the pronunciation device play voice relates to the application of the voice chip). It can be that the MCU is externally connected to a form of voice chip, and the MCU uses some voice recording data obtained from the recording data of the voice codes in the memory in the pickup body (or in the memory on the data card or in the memory on the component data card). Code recording data (such as the recording data in ADPCM format) is passed to the external voice chip through the communication line (if using the full-step direct pronunciation method, the MCU will once make up the recording data of all or part of the voice codes in the pronunciation of the sentence. Pass it to the voice chip for pronunciation. If you use the step-by-step direct pronunciation method, the MCU will transmit the recording data of some sound codes to the voice chip for multiple times during the decoding process), and the external voice chip with the speaker (with the speaker) The externally connected voice chip is the pronunciation device) to broadcast all or part of the pronunciation of the sentence; or the MCU is connected to another form of voice chip. All pre-pronunciation data (such as the numbering information of each phonetic symbol representing the pronunciation of English sentences arranged in the order of pronunciation of the sentence or the code information of each pinyin representing the pronunciation of Chinese sentences arranged in the order of pronunciation of the sentence) is passed to the external connection through the communication line The voice chip of the voice broadcasts the whole or part of the pronunciation of the sentence. If it is a step-by-step direct pronunciation method, the MCU will divide each part of the prepared pronunciation data in stages during the decoding process (partial prepared pronunciation data refers to, for example, arranging groups in the order of pronunciation) The numbering information of each phonetic symbol representing the pronunciation of part of the English sentence or the code information of each pinyin representing the pronunciation of part of the Chinese sentence arranged in the order of pronunciation) is passed to the voice chip for pronunciation, and this form of voice chip connected externally with a loudspeaker ( An external voice chip with a speaker is a sounding device) to obtain some voices from the recording data of various voice codes in its memory (this memory can only be read by this voice chip and cannot be read by the MCU as the main processing chip). The recording data of code (such as the phonetic symbol of ADPCM format or the recording data of pinyin) and use the recording data of these phonetic codes to pronounce.

如拾音器采用直接产音方式,可以是用主MCU发音或用主MCU外接的语音芯片(比如上段述的接收录音数据形式的语音芯片或接收预备发音数据形式的语音芯片)来发音,其发音的形式,除了可以是播音码的录音数据外,还可有别的形式,比如用语音波形的矢量技术来模拟发音码的语音(采用这种形式时一个音码的发音数据就是指用语音波形的矢量技术来模拟发音码的语音时所采用的参数),这些其它的形式应是播一个音码的录音数据形式的等同替换,如用这些其它的形式来发音那解码时的播出语句发音也可看成是采用直接产音方式,而且采取这些其它的形式时,还可能使用一些别的类型的子发音器件(比如带喇叭的用语音波形的矢量技术来模拟发语音的芯片),这些子发音器件与喇叭以及上段说的带喇叭的语音芯片等子发音器件是等同替换。If the pickup adopts a direct sound production method, it can be pronounced by the main MCU or by an external voice chip connected to the main MCU (such as the voice chip in the form of receiving recording data or receiving the voice chip in the form of prepared voice data) to pronounce the voice. Form, except that it can be the recording data of the broadcast sign indicating number, also can have other forms, such as using the vector technology of the speech waveform to simulate the speech of the pronunciation sign indicating number (when adopting this form, the pronunciation data of a sound sign indicating number refers to the use of the speech waveform The parameters used when simulating the pronunciation of the pronunciation sign indicating number by vector technology), these other forms should be the equivalent replacement of the recording data form of broadcasting a tone sign indicating number, as using these other forms to pronounce, the broadcast sentence pronunciation when decoding is also It can be regarded as a direct sound production method, and when these other forms are adopted, some other types of sub-sounding devices (such as a chip with a speaker that uses the vector technology of the voice waveform to simulate the voice) may also be used. Pronunciation devices are equivalent replacements with sub-pronunciation devices such as speakers and the voice chip with speakers mentioned in the previous paragraph.

本发明文件各处述及印欧语系或非印欧语系的直接产音方式时,音码的发音是以采用播音码的录音数据为例的,这种情况下诸音码的发音数据就是指诸音码的录音数据,且这种情况下音码的发音数据就是指音码的录音数据,而且是以把诸音码的发音数据(如诸音码的录音数据)存储在与拾音器有关的存储器集里而不是存储在只有语音芯片可读取的存储器里为例的,而且是以主MCU发音(而不是主MCU外接的语音芯片发音)为例的。When the document of the present invention mentioned the direct sound production mode of Indo-European language family or non-Indo-European language family everywhere, the pronunciation of sound sign indicating number is to adopt the recording data of broadcast sign indicating number to be example, in this case the pronunciation data of all sound sign indicating number refers to The recording data of all sound sign indicating numbers, and the pronunciation data of sound sign indicating number refers to the recording data of sound sign indicating number in this case, and is stored in the sound recording data (as the recording data of all sound sign indicating numbers) of all sound sign indicating numbers in relevant with pickup The memory set is not stored in the memory that only the voice chip can read, and the pronunciation of the main MCU (not the pronunciation of the voice chip external to the main MCU) is taken as an example.

本发明文件各处涉及解码采用直接发音方式时所说的把发音信息数据转换成语音,以英语为例,发音信息数据一般是指词形音标,是组成一个单词发音的数个音标,如从拾音器里存储的诸音码的发音数据可得到各单词(音码)的发音数据(比如录音数据),那直接用MCU或MCU外接的语音芯片把此发音信息数据对应的单词的音发出来即可,如从拾音器里存储的诸音码的发音数据可得到各元辅音(音码)的发音数据(比如录音数据),那可用MCU或MCU外接的语音芯片把此发音信息数据里的各音标对应的元辅音语音顺次发出来;对于汉语,发音信息数据一般是指词形拼音,如从拾音器里存储的诸音码的发音数据可得到各词(音码)的发音数据(比如录音数据),那直接用MCU或MCU外接的语音芯片把此发音信息数据对应的词的音发出来即可,如从拾音器里存储的诸音码的发音数据可得到各带调音节(元音辅音组合音码)的发音数据(比如录音数据),那可用MCU或MCU外接的语音芯片把此发音信息数据里的各带调音节的语音顺次发出来。Documents of the present invention relate to the said pronunciation information data conversion into voice when decoding adopts the direct pronunciation mode everywhere, take English as example, the pronunciation information data generally refers to word form phonetic symbols, is to form several phonetic symbols of a word pronunciation, as from The pronunciation data (such as recording data) of each word (sound code) can be obtained by the pronunciation data of all sound codes stored in the pickup, and then the pronunciation of the word corresponding to this pronunciation information data is sent out directly with the voice chip that MCU or MCU connects. Yes, as the pronunciation data (such as recording data) of each vowel and consonant (sound code) can be obtained from the pronunciation data of the various phonetic codes stored in the pickup, then each phonetic symbol in the pronunciation information data can be converted by the MCU or the external voice chip of the MCU. Corresponding vowel and consonant sounds are sent out sequentially; For Chinese, the pronunciation information data generally refers to word-form pinyin, can obtain the pronunciation data (such as recording data) of each word (sound code) as the pronunciation data of all phonetic codes stored in the pickup ), then directly use the MCU or the voice chip externally connected to the MCU to send out the pronunciation of the word corresponding to the pronunciation information data, as the pronunciation data of the various phonetic codes stored in the pickup can be obtained with each tone syllable (vowel and consonant combination Tone code) pronunciation data (such as recording data), then available MCU or the voice chip that MCU connects can send out the pronunciation of each band tone syllable in this pronunciation information data sequentially.

印欧语系的解码处理:Decoding processing for Indo-European languages:

以下叙述印欧语系语种的解码处理时,涉及到举例时是以英语为例的。When describing the decoding processing of the Indo-European languages in the following, English is used as an example when it comes to examples.

显然,对印欧语系任一语种,解码时对音码的定义方式及使用的音码情况与对拼合信息的值编码(这里说的解码与编码是专门对应的)时对音码的定义方式及使用的音码情况一般是相同或兼容的;可被当成解码时涉及的音码的,可以是表示发音的元辅音或元音辅音组合,也可以是可以转换成发音的词根,单词,单词组合或范句,也可以是经组合后可转换为发音的字符组合或字母,可由这些音码中的一种或多种来拼合解码时定义的解码语句集发音。Obviously, for any language of the Indo-European language family, the way of defining the phonetic code and the situation of the used phonetic code during decoding and the way of defining the phonetic code when encoding the value of the combined information (the decoding and encoding mentioned here are specially corresponding) and the phonetic codes used are generally the same or compatible; what can be regarded as the phonetic codes involved in decoding can be vowels and consonants or combinations of vowels and consonants that represent pronunciation, or can be converted into root words, words, and words that are pronounced Combinations or model sentences can also be converted into character combinations or letters that can be pronounced after being combined, and one or more of these phonetic codes can be used to assemble the pronunciation of the decoding sentence set defined during decoding.

在对印欧语系的某语种的拼合信息的值进行解码时,首先在与拾音器有关的存储器集里需存储解码数据,解码数据一般是与用某编码规则中有关此语种的编码细则对此语种语句编码时所使用的编码数据内容有关的,解码数据一般与此编码数据是一样或兼容的(本说明书述的以只适用于英语为例的编码规则涉及的编码数据和与此编码规则对应的以只适用于英语为例的解码规则涉及的解码数据是一样的,这里说的一样是指比如编码数据包括有A1,A2,A3三个数据库,解码数据包括有B1,B2,B3三个数据库,并且A1与B1的数据库名称及数据库结构及数据内容都相同,并且A2与B2的数据库名称及数据库结构及数据内容都相同,并且A3与B3的数据库名称及数据库结构及数据内容都相同),再使用解码程序来进行解码,此解码程序一般是按与对此语种语句编码时所用编码规则(此编码规则可以是只适用于此语种或适用于包括此语种在内的多个语种)有关此语种对应的解码规则(此解码规则可以是只适用于此语种或适用于包括此语种在内的多个语种)中有关此语种的解码细则来进行解码的,(前一句里提及的两个‘多个’不一定是相同数目)。(本发明文件各处提及的兼容概念是属于计算机领域常规概念,比如对于某一语种,2.0版解码程序及解码数据,其能解码编码版本是2.0版的拼合信息的值,1.0版解码程序及解码数据,其能解码编码版本是1.0版的拼合信息的值,如2.0版解码程序及解码数据还能用于解码编码版本是1.0版的拼合信息的值,那2.0版的解码程序及解码数据就是兼容1.0版的解码程序及解码数据;再是比如解码数据里包括全部编码数据,那解码数据就是与编码数据兼容的)When decoding the value of the mosaic information of a certain language of the Indo-European language family, it is first necessary to store the decoded data in the memory set related to the pickup. The decoded data is generally related to the encoding rules related to this language in a certain encoding rule. It is related to the content of the encoded data used when encoding the sentence, and the decoded data is generally the same or compatible with the encoded data (the encoded data involved in the encoding rule that is only applicable to English as an example described in this manual and the corresponding encoding rule Taking the decoding rules that only apply to English as an example, the decoding data involved are the same. The same here means that for example, the encoded data includes three databases A1, A2, and A3, and the decoded data includes three databases B1, B2, and B3. , and the database name, database structure and data content of A1 and B1 are the same, and the database name, database structure and data content of A2 and B2 are the same, and the database name, database structure and data content of A3 and B3 are the same), Then use the decoding program to decode. This decoding program is generally related to the encoding rules used when encoding the language sentence (this encoding rule can be only applicable to this language or applicable to multiple languages including this language). The decoding rules corresponding to the language (this decoding rule can be only applicable to this language or applicable to multiple languages including this language) to decode the decoding rules of this language, (the two mentioned in the previous sentence 'Multiple' is not necessarily the same number). (The compatible concepts mentioned in the documents of the present invention belong to the conventional concepts in the field of computers, such as for a certain language, version 2.0 decoding program and decoding data, which can decode the value of the information that the encoding version is 2.0 version, version 1.0 decoding program And decoded data, which can decode the value of the combined information whose encoding version is version 1.0, such as version 2.0 The decoding program and decoded data can also be used to decode the value of the combined information whose encoding version is version 1.0, then the decoding program and decoding of version 2.0 The data is compatible with version 1.0 of the decoding program and decoded data; for example, if the decoded data includes all encoded data, then the decoded data is compatible with the encoded data)

对于印欧语系任一语种,语音编码有声读物的整页面上的码字涉及的表示此语种语句发音的拼合信息的值是可被某种解码规则中有关此语种的直播方式的解码细则来解码的(此解码规则可以是只适用于此语种或适用于包括此语种在内的多个语种的)。For any language of the Indo-European language family, the value of the combined information representing the pronunciation of sentences in this language involved in the codewords on the entire page of the phonetic coded audiobook can be decoded by the decoding rules related to the live broadcast method of this language in a certain decoding rule (This decoding rule can be only applicable to this language or applicable to multiple languages including this language).

如解码时播放语句发音是采取直接发音方式,那还需使用到诸音码的发音数据(一般是诸音码的录音数据),至于如何对印欧语系某语种的音码进行录音以产生诸音码的录音数据,在前面阐述直播方式的以只适用于英语为例的编码规则时有述,且在编解码过程中用于编解码的主要音码与解码时的发音音码不一定是相同种类,这在前面阐述直播方式的以只适用于英语为例的编码规则时已有述。Take the direct pronunciation mode when playing sentence pronunciation during decoding, then also need to use the pronunciation data of all phonetic codes (generally the recording data of all phonetic codes), as for how to record the phonetic codes of a certain language of the Indo-European language family to produce various phonetic codes The recording data of the sound code is described in the live broadcast mode, which is only applicable to English as an example, and the main sound code used for encoding and decoding in the process of encoding and decoding is not necessarily the same as the pronunciation sound code during decoding. The same type, which has been mentioned in the previous explanation of the encoding rules for live broadcasts, which only applies to English as an example.

只适用于印欧语系任一语种或可适用于包括数个印欧语系语种在内的多个语种的编码规则可以有很多种,编码规则的种数实际上是没法穷举的,同样,只适用于印欧语系任一语种或可适用于包括数个印欧语系语种在内的多个语种的解码规则的种数也是没法穷举的(只适用于一个语种的解码规则就可有多种)。(本段提及的两个‘数个印欧语系语种’不一定是相同所指,本段提及的两个‘多个语种’不一定是相同所指)There can be many kinds of encoding rules that are only applicable to any language of the Indo-European language family or applicable to multiple languages including several Indo-European languages. The number of encoding rules is actually inexhaustible. Similarly, The number of decoding rules that are only applicable to any language of the Indo-European language family or applicable to multiple languages including several Indo-European languages is also inexhaustible (the decoding rules that are only applicable to one language can have variety). (The two 'several Indo-European languages' mentioned in this paragraph are not necessarily the same reference, and the two 'multiple languages' mentioned in this paragraph are not necessarily the same reference)

权利要求3,12,20或23中提及的其有关某语种的解码细则含有压缩解码处理的解码规则(此解码规则可以是只适用于此一个语种或适用于包括此一个语种在内的多个语种的)可以是与各种编码规则有关此语种对应的各种具体的解码规则。(本段提及的‘对应’是指一般对应或专门对应)The decoding rules about a certain language mentioned inclaim 3, 12, 20 or 23 contain decoding rules for compression decoding processing (this decoding rule can be only applicable to this one language or be applicable to many including this one language) language) can be various specific decoding rules corresponding to this language related to various coding rules. (The 'correspondence' mentioned in this paragraph refers to general correspondence or special correspondence)

权利要求3,12,20或23中提及的解码规则,可以是与本说明书提及或没提及的适用于印欧语系任一语种(英语或法语或德语或俄语等)的其有关此语种的编码细则含有压缩编码处理的任一编码规则(此编码规则可能是只适用于此语种或适用于包括此语种在内的多个语种的)有关此语种对应的本说明书提及或没提及的适用于此语种的其有关此语种的解码细则含有压缩解码处理且是直播方式的的各种解码规则(这各种解码规则中任一解码规则可能是只适用于此语种或适用于包括此语种在内的多个语种的)。(本段提及的两个‘多个’不一定是相同数目;本段提及的‘对应’是指一般对应或专门对应)The decoding rule mentioned inclaim 3, 12, 20 or 23 may be related to any language (English or French or German or Russian) of the Indo-European language family (English or French or German or Russian, etc.) mentioned or not mentioned in this specification. The coding rules of the language contain any coding rules for compression coding processing (this coding rule may be applicable only to this language or to multiple languages including this language). And the decoding rules for this language applicable to this language include various decoding rules for compressed decoding processing and live broadcasting (any decoding rule in these various decoding rules may only be applicable to this language or apply to include multiple languages including this one). (The two "multiples" mentioned in this paragraph are not necessarily the same number; the "correspondence" mentioned in this paragraph refers to general correspondence or special correspondence)

以下是以英语作为印欧语系语种的例子来阐述印欧语系语种的解码技术的。The following uses English as an example of the Indo-European language to illustrate the decoding technology of the Indo-European language.

以下述的是对与本说明书前面述的直播方式的以只适用于英语为例的编码规则有关英语对应的一种解码规则及其解码实例的阐述,此解码规则包括如下一些解码细则:解码语句集范围完整定义,用于解码时按直接发音方式播放语句发音的音码语音的处理的方式定义,用于解码的音码的种类选择,对音码赋编号,单词数据库建立,词类的建立,各种词段解码方式,时态词形解码处理,陌生单词解码处理,头段信息读取及用发音形式播放语句发音等,并且此解码规则是以只适用于一个语种即英语为例的,即此解码规则中的所有解码细则都是属于此解码规则中有关英语语种的解码细则,以只适用于英语为例的此解码规则是与直播方式的以只适用于英语为例的编码规则对应的,其中对音码赋编号是有压缩处理的,解码时可供选用的几种词段解码方式(频率排序,频率关联混合,关联,整句差别等解码方式)中除频率排序解码方式外都是含有压缩解码处理的,所以直播方式的以只适用于英语为例的此解码规则是含有压缩解码处理的,显然此解码规则里的压缩解码处理与直播方式的以只适用于英语为例的编码规则里的压缩编码处理是对应的。此解码规则是直播方式的。印欧语系其它语种比如法语,德语,俄语等也可采用此解码规则,当然其具体细节与英语有别,需作相应修改(当然这种情况时,相应的这些语种编码时需选用基本上同于本说明书前面述的直播方式的以只适用于英语为例的编码规则里的那些编码细则)。此直播方式的以只适用于英语为例的解码规则及其解码实例的具体阐述是从下一行起至89页为止的:The following is an explanation of a decoding rule and its decoding examples corresponding to the encoding rules of the live broadcast mode mentioned above in this manual, which are only applicable to English as an example. This decoding rule includes the following decoding rules: Complete definition of set range, definition of the way of processing the phonetic code voice for playing sentence pronunciation in the direct pronunciation mode when decoding, selection of the type of phonetic code used for decoding, numbering of phonetic codes, establishment of word database, establishment of part of speech, Various word segment decoding methods, tense word form decoding processing, unfamiliar word decoding processing, header information reading and sentence pronunciation playing in pronunciation form, etc., and this decoding rule is only applicable to one language, namely English, as an example. That is to say, all the decoding rules in this decoding rule belong to the decoding rules related to the English language in this decoding rule. The decoding rules that only apply to English as an example correspond to the encoding rules that only apply to English in the live broadcast mode. Among them, there is a compression process for assigning numbers to the phonetic code, and several word segment decoding modes (frequency sorting, frequency correlation mixing, association, whole sentence difference, etc. decoding modes) that can be selected during decoding are except for the frequency sorting decoding mode Both contain compression and decoding processing, so the decoding rules for live broadcasting are only applicable to English as an example. Obviously, the compression and decoding processing in this decoding rule is only applicable to English as an example The compression encoding processing in the encoding rules is corresponding. This decoding rule is live. Other languages of the Indo-European language family such as French, German, Russian, etc. can also use this decoding rule. Of course, the specific details are different from English and need to be modified accordingly (of course, in this case, the corresponding codes of these languages need to be basically the same. Those encoding rules in the encoding rules that are only applicable to English as an example in the aforementioned live broadcast mode in this specification). The specific elaboration of the decoding rules and decoding examples of this live broadcast method, which is only applicable to English, starts from the next line and ends on page 89:

完整定义的解码语句集的范围是任意英语语句,也即解码语句集包括英语任意语句。The scope of the fully defined set of decoded sentences is any English sentence, that is, the set of decoded sentences includes any sentence in English.

解码规则中的用于解码时按直接发音方式播放语句发音的音码语音的处理的方式定义,用于解码的音码的种类选择,对音码赋编号,单词数据库建立,词类的建立等解码细则是与编码时用的直播方式的以只适用于英语为例的编码规则中相应的那些编码细则(这里说的那些编码细则也即用于解码时按直接发音方式播放语句发音的音码语音的处理的方式定义,用于解码的音码的种类选择,对音码赋编号等)相同的,相应建立的解码数据也是与编码时用的编码数据相同的(其中解码时用的单词数据库与编码时用的单词数据库是相同的);而此解码规则中的各种词段解码方式,时态词形解码处理,陌生单词解码处理这些解码细则是与编码时用的直播方式的以只适用于英语为例的编码规则中的各词段编码方式,时态词形编码处理,陌生单词编码处理这些编码细则对应的。In the decoding rule, it is used to play the sound code speech processing method of the sentence pronunciation in the direct pronunciation mode when decoding, the type selection of the sound code used for decoding, the numbering of the sound code, the establishment of the word database, the establishment of the part of speech, etc. decoding Details are those encoding rules corresponding to the encoding rules that are only applicable to English in the live broadcast mode used during encoding (those encoding rules mentioned here are also the phonetic sign indicating number voice that is used to play the pronunciation of the sentence in the direct pronunciation mode when decoding The mode definition of the processing, be used for the kind selection of the sound sign indicating number of decoding, numbering etc. is assigned to sound sign indicating number) identical, the decoding data of corresponding establishment is also identical with the encoding data used when encoding (wherein the word database used when decoding and The word database used for encoding is the same); and the various word segment decoding methods in this decoding rule, tense and word form decoding processing, and unfamiliar word decoding processing, these decoding details are only applicable to the live broadcast method used in encoding Taking English as an example in the encoding rules for each word segment encoding method, tense word form encoding processing, and unfamiliar word encoding processing are corresponding to these encoding rules.

此解码规则涉及的发音形式是:解码表示此解码规则定义的解码语句集里任一语句的发音的拼合信息的值时播放语句发音用的发音方式都是用整步直接发音方式。The pronunciation form involved in this decoding rule is: when decoding the value of the combined information representing the pronunciation of any sentence in the decoding sentence set defined by this decoding rule, the pronunciation mode used for playing the pronunciation of the sentence is all the full-step direct pronunciation mode.

在前面直播方式的以只适用于英语为例的编码规则里已约定好解码这边是只对各个元音和辅音(即音标)进行录音以形成诸音码的录音数据的,解码时语句的发音是由音标的语音来组成的。以只适用于英语为例的解码规则所用的诸音码的录音数据是按本说明书第16页述的第3种方式形成的。In the encoding rules that are only applicable to English in the previous live broadcast mode, it has been agreed that the decoding here only records each vowel and consonant (that is, the phonetic symbol) to form the recording data of the various phonetic codes. When decoding, the sentence Pronunciation is made up of phonetic symbols. The recording data of all phonetic sign indicating numbers used by the decoding rule that is only applicable to English is to form by the 3rd kind of mode described in the 16th page of this specification sheet.

如采用上述第3种方式对元辅音来录音及在解码时用元辅音的发音来组成单词及语句的发音,如解码时需处理变音,那可制定一些有关变音的发音规则(比如记录解码时遇到哪些单词的词形形态靠在一起时要变音等),在解码生成语句的全部(或非全部)预备发音数据时,考察要生成的全部(或非全部)预备发音数据里各单词的词形形态的排列情况(比如在解码过程中考察由各频率编号和时态词形选择段的数据来得到的各单词的词形形态的排列情况),按有关变音的发音规则,来确定哪些单词的词形形态要变音及怎样变音,并在生成全部(或非全部)预备发音数据时把全部(或非全部)预备发音数据中要变音的单词的词形形态的词形音标替换成要实际发音的词形音标,再可按全部(或非全部)预备发音数据把语句的全部(或非全部)发音发出来,对于可能有变音的单词的词形形态(或单词组合的词形形态),在单词数据库里只需记录其一个发音(基本发音)的信息(比如词形音标),而不需记录其怎样变音的信息;(当然这种方式要求所录的元辅音的音里要有这些变音的音,如有些变音发音特殊,在原元辅音里没有,那应补录进去);本段是按解码时用整步直接发音方式来说的。可把本段说的有关变音的发音规则写成直播方式的以只适用于英语为例的解码规则里的一解码细则。If the above-mentioned third method is used to record vowels and consonants and use the pronunciation of vowels and consonants to form the pronunciation of words and sentences when decoding. When decoding, when the morphological forms of which words are close together, they need to change their pronunciation, etc.), when decoding all (or not all) of the prepared pronunciation data of the generated sentence, examine all (or not all) of the prepared pronunciation data to be generated. The arrangement of the morphological form of each word (such as investigating the arrangement of the morphological form of each word obtained by the data of each frequency number and tense morphological selection segment in the decoding process), according to the pronunciation rules of the relevant inflection , to determine the morphological form of which words will be changed and how to change the pronunciation, and when generating all (or not all) prepared pronunciation data, the morphological form of the word to be changed in all (or not all) prepared pronunciation data The morphological phonetic symbols of the word are replaced with the morphological phonetic symbols to be actually pronounced, and then all (or not all) of the sentence can be pronounced according to all (or not all) of the prepared pronunciation data. (or the morphological form of word combination), only need to record the information of one pronunciation (basic pronunciation) (such as morphological phonetic symbols) in the word database, and do not need to record the information of how it changes its pronunciation; (of course, this method requires The recorded vowel and consonant sounds must have these inflected sounds, if there are some inflected sounds that are special and not in the original vowel consonants, then they should be added in); this section is based on the whole-step direct pronunciation method when decoding of. The pronunciation rules about inflection mentioned in this paragraph can be written as a decoding rule in the decoding rules that are only applicable to English as an example in the live broadcast mode.

按常规,语句可以是一个句子或一个句子的部分,语句也可以是含有多个句子。编码时,是用一个处理单位对应一个语句(一个处理单位包括头段,单词段,时态词形选择段,陌生单词段,前后调用段等),如语句含多个句子的,这时需定义一个编号作为句子间停顿(断句)的标志(此停顿编号可与单词的频率编号统一编号,即占用一个单词的频率编号,编解码时把停顿当成单词来同等对待处理),对于把一个句子的部分(非完整句子)作为一个语句,用一个处理单位对应此语句的,对此语句的编解码处理仍可使用本说明书反复叙述的对于一完整句子的编解码处理方式(因如一个句子缺少如主语或谓语等后,并不影响采用频率排序词段编解码方式等对其编解码)。下面例是这样的:语句就是一个中间无逗号的句子,一个处理单位对应一个语句,一个处理单位的值就是表示语句发音的拼合信息的值。Conventionally, a statement can be a sentence or part of a sentence, or a statement can consist of multiple sentences. During encoding, a sentence is corresponding to a processing unit (a processing unit includes a head section, a word section, a tense word form selection section, an unfamiliar word section, and a front and back call section, etc.). Define a number as the sign of a pause (sentence sentence) between sentences (this pause number can be numbered uniformly with the frequency number of a word, that is, the frequency number of a word is occupied, and the pause is treated equally as a word during encoding and decoding), for a sentence part (non-complete sentence) as a sentence, and use a processing unit corresponding to this sentence, the coding and decoding processing of this sentence can still use the coding and decoding processing method for a complete sentence repeatedly described in this specification (because a sentence lacks For example, after the subject or predicate, etc., it does not affect the encoding and decoding of frequency-sorted word segments, etc.). The following example is like this: a sentence is a sentence without a comma in the middle, a processing unit corresponds to a sentence, and the value of a processing unit is the value of the combined information representing the pronunciation of the sentence.

一个处理单位可以看作含有:头段,单词段,时态词形选择段,陌生单词段,前后调用段等,(这些段中的单词段是必需的,其它段是可能有,也可能无的,比如某被编码语句的所有各单词及单词组合的词形形态都只有一个,那其处理单位里就没有时态词形选择段,再比如某被编码语句里没陌生单词,那其处理单位里就没有陌生单词段;前后调用段也是可能有,可能没有的);解码时,也是相应地按段来读取处理单位中各段的数据(按段来读取处理单位中各段的数据,也可认为其是属于解码规则中的一个解码细则)。A processing unit can be regarded as containing: head segment, word segment, tense word form selection segment, unfamiliar word segment, front and rear call segments, etc. (the word segment in these segments is required, and other segments may or may not exist. Yes, for example, there is only one morphological form of all words and word combinations in a coded sentence, then there is no tense word form selection segment in the processing unit, and for example, if there is no unfamiliar word in a coded sentence, then its processing unit There is no unfamiliar word segment in the unit; there may or may not be called segments before and after; when decoding, the data of each segment in the processing unit is also read correspondingly by segment (read the data of each segment in the processing unit by segment) data, it can also be regarded as a decoding rule belonging to the decoding rule).

先读取头段含的信息:本编码语句属于什么语种,本处理单位的单词段使用何种词段编码方式;头段里也可带有本编码语句中的单词及单词组合的个数信息(但不必需带);根据不同需要,头段里的各信息可以省略;与前面的直播方式的以只适用于英语为例的编码规则相对应,本解码规则及解码举例里约定所读取的头段里记录有本编码语句属于什么语种和本处理单位的单词段使用何种词段编码方式这两个信息,每个信息是各自固定分配2个BIT来记录的。First read the information contained in the header: what language does the coded statement belong to, and what kind of word segment coding method is used for the word segment of this processing unit; the header can also contain the number information of the words and word combinations in the coded sentence (but not necessary); according to different needs, the information in the header can be omitted; corresponding to the encoding rules that are only applicable to English in the previous live broadcast method, the decoding rules and decoding examples agreed to read In the head section of the file, there are two information records, which language the coded statement belongs to and which word segment encoding method is used for the word segment of this processing unit. Each information is fixedly assigned 2 BITs to record.

接下来是单词段,按其使用的词段编码方式而使用相应的词段解码方式来解码(相关数据的数据结构参见前面编码时的介绍)。Next is the word segment, which is decoded using the corresponding word segment decoding method according to the word segment encoding method used (for the data structure of relevant data, refer to the introduction during the previous encoding).

接下来是时态词形选择段,对此段数据的解码处理是与对此段编码时的处理对应的。Next is the tense word form selection segment, and the decoding processing of this segment data is corresponding to the processing when this segment is encoded.

再接下来是陌生单词段,对此段数据的解码处理是与对此段编码时的处理对应的。解码时,如在前面解出的单词段里有代表陌生单词的频率编号111111,就在此段取陌生单词的编码信息来组成陌生单词的发音信息数据。如单词段中没出现频率编号111111,那本处理单位就不存在陌生单词段。Next is an unfamiliar word segment, and the decoding processing of this segment of data corresponds to the processing of this segment of encoding. During decoding, if there is a frequency number 111111 representing an unfamiliar word in the word segment solved in the front, just get the encoding information of the unfamiliar word to form the pronunciation information data of the unfamiliar word in this section. If the frequency number 111111 does not appear in the word segment, then there is no unfamiliar word segment in the processing unit.

再接下来是前后调用段,对此段数据的解码处理是与对此段编码时的处理对应的。解码时,如在前面解出的单词段里有代表前后调用的频率编号111,就在此段取往前或往后调用及调用的单词或单词组合的位置信息。如单词段中没出现频率编号111,那本处理单位就不存在前后调用段。Next is the front and rear calling segments, and the decoding processing of this segment of data corresponds to the processing of this segment of encoding. During decoding, if there is the frequency number 111 of calling before and after being represented in the word segment that is solved in front, just get the position information of the word or the combination of words that calls before or after and calls in this section. If the frequency number 111 does not appear in the word segment, then the processing unit does not have the front and rear call segments.

以下是与(直播方式的以只适用于英语为例的编码规则里的)几种词段编码方式相对应的(直播方式的以只适用于英语为例的解码规则里的)几种词段解码方式(这几种词段解码方式是用于对单词段的解码;频率排序,频率关联混合,关联,整句差别等这几种词段解码方式中除频率排序解码方式外都是含有压缩解码处理的):The following are several word segments (in the decoding rules of live broadcast mode only applicable to English as an example) corresponding to several word segment encoding methods (in the live broadcast method only applicable to English as an example) Decoding method (these kinds of word segment decoding methods are used to decode word segments; frequency sorting, frequency correlation mixing, correlation, whole sentence difference, etc., these kinds of word segment decoding methods all contain compression except for the frequency sorting decoding method decoded):

频率排序方式:Sort by frequency:

先在单词段的前面取5个bit的单词段里含有的单词及单词组合的个数信息,然后只需在单词段里再顺序取出各单词及单词组合的信息,先是取2BIT的一个单词或单词组合的频率编号长度信息,根据长度,再取此单词或单词组合的频率编号,然后再取下一个单词或单词组合的频率编号长度信息和频率编号,直到取完语句(单词段)的全部单词及单词组合的频率编号长度信息和频率编号为止,把这些频率编号按各频率编号原来在单词段里的顺序排组起来即可。First take the number information of words and word combinations contained in the 5-bit word segment in front of the word segment, and then only need to sequentially extract the information of each word and word combination in the word segment, first take a word of 2BIT or The frequency number length information of word combination, according to length, get the frequency number of this word or word combination again, then get the frequency number length information and frequency number of next word or word combination, until get all of sentence (word segment) As far as the frequency number length information and the frequency number of words and word combinations, these frequency numbers can be arranged according to the original order of each frequency number in the word segment.

例:语句原话是:The man is a Chinese.Example: The original words of the sentence are: The man is a Chinese.

编码值为:001010001010011100100010101100010110011001101The encoded value is: 001010001010011100100010101100010110011001101

顺序看编码值中的各BIT(本举例的编码值是单词段的值,没包括时态词形选择段,为了使举例简洁,本举例的编码值也没包括头段,头段里一般含有语种信息以及词段编码方式选择信息等数据):Look at each BIT in the coded value in order (the coded value in this example is the value of a word segment, and does not include the tense word form selection segment. In order to make the example concise, the coded value in this example does not include the header, which generally contains Language information and word segment coding method selection information and other data):

00101:单词段里的单词及单词组合个数(5个)00101: the number of words and word combinations in the word segment (5)

00:the单词的频率编号的长度信息00: Length information of the frequency number of the word

010:the单词的频率编号010: The frequency number of the word

10:man单词的频率编号的长度信息10: The length information of the frequency number of the man word

011100100:man单词的频率编号011100100: the frequency number of the word man

01:is单词的频率编号的长度信息01: The length information of the frequency number of the is word

010110:is单词的频率编号010110: the frequency number of the word is

00:a单词的频率编号的长度信息00: length information of the frequency number of a word

101:a单词的频率编号101: frequency number of a word

10:Chinese单词的频率编号的长度信息10: The length information of the frequency number of the Chinese word

011001101:Chinese单词的频率编号011001101: Frequency number of Chinese words

解码过程为:The decoding process is:

解码时,先取单词段里的单词及单词组合个数信息(5个),再往下先取2BIT的一个频率编号的长度信息,再按长度取一个频率编号,依此顺次取得5个单词及单词组合的频率编号,得:010 011100100 010110 101 011001101(这5个频率编号分别是the,man,is,a,Chinese这5个单词的频率编号)When decoding, first get the word and word combination number information (5) in the word segment, then get the length information of a frequency number of 2BIT first, and then get a frequency number according to the length, and then get 5 words and The frequency number of the word combination is: 010 011100100 010110 101 011001101 (these 5 frequency numbers are the frequency numbers of the, man, is, a, Chinese, respectively)

解码到这一步,得到语句中各单词及单词组合的频率编号,用频率排序解码方式对单词段的解码也到此为止;接下来的解码步骤一般是(这里仅以此语句来说,没涉及陌生单词等):结合时态词形段的信息(时态词形段在本例暂没涉及)来查单词数据表,将各频率编号换成单词或单词组合的实际的词形形态的发音信息数据(比如词形音标),得到语句的预备发音数据,由MCU顺次发出语句的预备发音数据中各音标的音来(与拾音器有关的存储器集里应存有各音标的录音数据比如ADPCM格式的录音数据),就是解码得到的语句的全部发音。Decode to this step, get the frequency numbers of each word and word combination in the sentence, and the decoding of the word segment with the frequency sorting decoding method also stops here; the next decoding step is generally (here only for this sentence, not involved Unfamiliar words, etc.): check the word data table in combination with the information of the tense word form (the tense word form is not involved in this example), and replace each frequency number with the pronunciation of the actual word form of the word or word combination Information data (such as morphological phonetic symbols) obtains the prepared pronunciation data of sentence, sends the sound of each phonetic symbol in the prepared pronunciation data of sentence in turn by MCU (the recording data of each phonetic symbol should be stored in the memory set relevant with pickup such as ADPCM format recording data), that is, the entire pronunciation of the decoded sentence.

本解码举例涉及的都是单词(音码),对于编解码的语句中有单词组合(音码)的情况与单词(音码)同等处理。What this decoding example relates to is all words (sound codes), and the situation that there are word combinations (sound codes) in the sentence of encoding and decoding is treated equally with words (sound codes).

(本词段解码方式里,各单词及单词组合的频率编号除了可以是本例示的阶梯编号外,还可以是等长编号或哈夫曼编号等;如为等长编号,哈夫曼编号那其前面是不带频率编号长度信息的。)(in this word segment decoding mode, the frequency numbering of each word and word combination can also be equal-length numbering or Huffman numbering etc. except the ladder numbering of this example; if it is equal-length numbering, then the Huffman numbering There is no frequency number length information in front of it.)

频率关联混合方式:Frequency correlation mixing method:

本词段解码方式及其举例主要以用单词(音码)来阐述的,如有用单词组合(音码)的类似于单词(音码)处理。This speech segment decoding mode and its examples are mainly set forth with words (sound codes), as the processing similar to words (sound codes) of useful word combinations (sound codes).

在单词段里先读取5个bit的阶梯信息值的长度信息,再读取1个BIT的基本节长信息,然后读取三个分节处的偏移信息,得出实际的分节情况,再对各节,排出各种单词(阶梯编号)分法,计算各分法下各单词(阶梯编号)之间的关联度(如编码时有采用排除法,那计算以前应根据组句规则先排除一些非法的单词分法),取关联度值最大的分法为解得的结果。本词段解码方式涉及的频率编号是阶梯编号。In the word segment, first read the length information of the ladder information value of 5 bits, then read the basic section length information of 1 BIT, and then read the offset information at the three sections to obtain the actual section situation , to each section, discharge various word (ladder numbering) sub-methods, and calculate the degree of association between each word (ladder numbering) under each sub-method (if the method of exclusion is adopted during encoding, then the calculation should be based on the rule of forming sentences before First exclude some illegal word division methods), and take the division method with the largest correlation value as the solution result. The frequency numbers involved in this word segment decoding method are ladder numbers.

举例:语句原话是:The fat man is an engineer.Example: The original sentence is: The fat man is an engineer.

编码值是:011000010000010001011011100100010110101010110010The encoded value is: 011000010000010001011011100100010110101010110010

顺序看编码值中的各BIT(本举例的编码值是单词段的值,没包括时态词形选择段,为了使举例简洁,本举例的编码值也没包括头段,头段里一般含有语种信息以及词段编码方式选择信息等数据):Look at each BIT in the coded value in order (the coded value in this example is the value of a word segment, and does not include the tense word form selection segment. In order to make the example concise, the coded value in this example does not include the header, which generally contains Language information and word segment coding method selection information and other data):

01100:阶梯信息值的长度是12个单位(3个bit一个单位,本例有3*12个bit)01100: The length of the ladder information value is 12 units (3 bits per unit, this example has 3*12 bits)

0:每节基本节长(BIT)选为21(0:表示每节基本节长为21,1:表示每节基本节长为24)0: The basic section length (BIT) of each section is selected as 21 (0: indicates that the basic section length of each section is 21, 1: indicates that the basic section length of each section is 24)

010000:三个分节处的偏移信息(第21位是阶梯信息值中的is词的频率编号的中间,将此分节处挪到is词的频率编号末,并记录距离值为01,因阶梯信息值的长度不到42,所以后面第42位和第63位处的距离值是00和00,即01与00与00合并成010000)010000: Offset information at the three sections (the 21st bit is the middle of the frequency number of the word is in the ladder information value, move this section to the end of the frequency number of the word is, and record the distance value as 01, Because the length of the ladder information value is less than 42, the distance values at the 42nd and 63rd bits are 00 and 00, that is, 01, 00 and 00 are combined into 010000)

010:the单词的频率编号(阶梯编号,本例以下同)010: the frequency number of the word (step number, the same as in this example below)

001011:fat单词的频率编号001011: frequency number of fat word

011100100:man单词的频率编号011100100: the frequency number of the word man

010110:is单词的频率编号010110: the frequency number of the word is

      |\分节处|\section

101:an单词的频率编号101: frequency number of word an

010110010:engineer单词的频率编号010110010: the frequency number of the word engineer

解码过程为:The decoding process is:

第1次在阶梯信息值的第21位BIT位置分节:The first subsection at the 21st BIT position of the ladder information value:

010001011011100100010 110101010110010010001011011100100010 110101010110010

第2次根据三个分节处的偏移信息值010000对分节处位置进行修正:For the second time, the position of the subsection is corrected according to the offset information value 010000 at the three subsections:

010001011011100100010110 101010110010010001011011100100010110 101010110010

对两节分别穷举分解为各种分法(阶梯编号的分法),并计算各种分法的关联度值。The two sections are exhaustively decomposed into various division methods (division of ladder numbers), and the correlation value of each division method is calculated.

比如对于后一节101010110010,可分解为7种情况:For example, for the latter section 101010110010, it can be decomposed into seven situations:

101,010,110,010(被逗号隔开成4个阶梯编号)101, 010, 110, 010 (separated by commas into 4 ladder numbers)

101,010,110010(被逗号隔开成3个阶梯编号,以下类推)101, 010, 110010 (separated by commas into 3 ladder numbers, and so on)

101,010110010101, 010110010

101,010110,010101, 010110, 010

101010110,010101010110, 010

101010,110,010101010, 110, 010

101010,110010101010, 110010

如在编码时有使用排除法,那在解码时应先用排除法,排除一些明显不合符语法等习惯的分法,比如在与拾音器有关的存储器集里存储一些信息数据库,可记录单词类与单词类非法相邻的信息,比如食品类的单词与建筑类的单词挨在一起是非法等,这些规则需与编码时的规则相同或兼容。这样如果某种分法出现有单词之间非法的搭配情况,即排除这种分法。If the method of elimination is used when encoding, then the method of elimination should be used first when decoding to eliminate some divisions that obviously do not conform to the habits of grammar and the like, such as storing some information databases in the memory set related to the pickup, which can record words and For information about illegal adjacent words, for example, it is illegal for food words and construction words to be next to each other. These rules need to be the same as or compatible with the encoding rules. In this way, if there is an illegal collocation between words in a certain division, this division will be excluded.

对于经过排除后剩下的各分法(或如没使用排除法,那就是原来7种分法),计算这些分法情况的各单词(阶梯编号)间的关联度(对某节某种分法的关联度的计算方式在前面阐述编码时有介绍,并且解码时对各节各种分法的关联度的计算方式应与编码时的计算方式相同),关联度值最大的分法即为解,关联度值最大的分法就与原语句的单词分法情况相同。For the remaining sub-methods after exclusion (or if the exclusion method is not used, that is the original 7 sub-methods), calculate the correlation between each word (ladder number) of these sub-methods (for a certain section and a certain sub-level The calculation method of the correlation degree of the method was introduced in the previous explanation of the encoding, and the calculation method of the correlation degree of each division method in each section during decoding should be the same as the calculation method of the encoding), and the division method with the largest correlation value is Solution, the division method with the largest correlation value is the same as the word division method of the original sentence.

对于其中有单词组合的,可取单词组合中的前后单词(可在单词数据库里各单词组合对应的记录项里的拼法字段得到,由单词的拼法就可在单词数据库里找到此单词对应的记录项),再与旁边的单词计算关联度及判断是否非法。For wherein there are word combinations, the words before and after the desirable word combination (can be obtained in the spelling field in the record item corresponding to each word combination in the word database, just can find the corresponding word of this word in the word database by the spelling of the word record item), and then calculate the degree of association with the words next to it and judge whether it is illegal.

结果这两节分别分解成(都是阶梯编号):As a result, these two sections are decomposed into (both ladder numbers):

010 001011 011100100 010110和101 010110010010 001011 011100100 010110 and 101 010110010

上面一行的6个阶梯编号分别是the,fat,man,is和an,engineer这6个单词的频率编号。The 6 ladder numbers in the upper line are the frequency numbers of the 6 words of the, fat, man, is and an, engineer.

解码到这一步,得到语句中各单词及单词组合的频率编号(阶梯编号),用频率关联混合解码方式对单词段的解码也到此为止;接下来的解码步骤一般是(这里仅以此语句来说,没涉及陌生单词等):结合时态词形段的信息(时态词形段在本例暂没涉及)来查单词数据表,将各频率编号换成单词或单词组合的实际的词形形态的发音信息数据(比如词形音标),得到语句的预备发音数据,由MCU顺次发出语句的预备发音数据中各音标的音来(与拾音器有关的存储器集里应存有各音标的录音数据比如ADPCM格式的录音数据),就是这语句‘The fat man is an engineer’的语音。Decode to this step, get the frequency numbers (ladder numbers) of each word and word combination in the sentence, and use the frequency-associated mixed decoding method to decode the word segment so far; the next decoding step is generally (here only this sentence For example, unfamiliar words, etc. are not involved): In combination with the information of the tense word form (the tense word form is not involved in this example), check the word data table, and replace each frequency number with the actual number of the word or word combination The pronunciation information data (such as word form phonetic symbol) of morphological form obtains the prepared pronunciation data of sentence, sends the sound of each phonetic symbol in the prepared pronunciation data of sentence sequentially by MCU (should store each phonetic symbol in the memory set relevant to pickup device The recording data such as the recording data in ADPCM format), is the voice of the sentence 'The fat man is an engineer'.

关联方式:Association:

先读取语句中含的单词个数的信息,再读取分层的信息,这段信息的长度与单词个数相等,bit为0表示处于第1层,bit为1表示处于第2层(如是4层需两个bit),而且从这些个bit可以看见总语句某个位置的单词是处于某层词的第几个,以及每层各共有多少词。由第1层词的个数顺序取出第1层词(每词都是先频率编号长度信息,再频率编号),接着读取第2层词的信息,这层每词的信息包含:是上层第几个词的修饰,属于其第几个关联类,本词在关联类中的编号;顺序读取完第2层词,如还有下层的,继续此步骤。First read the information of the number of words contained in the sentence, and then read the layered information. The length of this piece of information is equal to the number of words. A bit of 0 means that it is in the first layer, and a bit of 1 means that it is in the second layer ( If it is 4 layers, two bits are needed), and from these bits, it can be seen which word in a certain position of the total sentence is in a certain layer of words, and how many words there are in each layer. Take out the words of the first layer in order of the number of words in the first layer (each word is the frequency number length information first, and then the frequency number), and then read the information of the second layer of words. The information of each word in this layer includes: it is the upper layer The modification of which word belongs to which association class it belongs to, and the number of the word in the association class; after reading the second-level words sequentially, if there are still lower-level words, continue this step.

整句差别方式:The difference in the whole sentence:

本词段解码方式及其举例主要以用单词(音码)来阐述的,如有用单词组合(音码)的类似于单词(音码)处理。This speech segment decoding mode and its examples are mainly set forth with words (sound codes), as the processing similar to words (sound codes) of useful word combinations (sound codes).

解码步骤:在单词段里先读取出所用范句在范句库中的顺序号,并在范句库中取出范句,接着在单词段里读取出删词标志,及从第几个词开始删,删几个等信息,并做删词处理,再重复此删词过程,直到接着读取的是增词标志,再读取从第几个词的位置开始增,增几个,增的词的频率编号长度信息及频率编号,重复增词至读取到结束标志止。(这段话说的‘几’可以是小于10或等于10或大于10的任何数)Decoding step: first read out the sequence number of the used model sentence in the model sentence bank in the word segment, and take out the model sentence in the model sentence bank, then read the deletion sign in the word segment, and from the number Words start to be deleted, delete a few and other information, and delete words, and then repeat the process of deleting words, until the next thing to read is the sign of adding words, and then read from the position of the first word to increase, how many, The frequency number length information and the frequency number of the increased words are repeated until the end mark is read. (The "several" mentioned in this passage can be any number less than 10 or equal to 10 or greater than 10)

例:语句原话是:The young man is not an engineer.Example: The original sentence is: The young man is not an engineer.

编码值为:0000100101000001001 10000100110010100110100100001010011010The encoded value is: 0000100101000001001 10000100110010100110100100001010011010

顺序看各BIT(本举例的编码值是单词段的值,没包括时态词形选择段,为了使举例简洁,本举例的编码值也没包括头段,头段里一般含有语种信息以及词段编码方式选择信息等数据):Look at each BIT in order (the coded value of this example is the value of the word segment, and does not include the tense word form selection segment. In order to make the example concise, the coded value of this example does not include the header, which generally contains language information and words. Section encoding method selection information and other data):

0000100101:所用范句在范句库中的顺序号(固定10个bit)0000100101: The sequence number of the sample sentence used in the sample sentence database (fixed 10 bits)

0:删词标志0: delete word flag

00001:从第2个单词开始删(如为11111,就结束编码)00001: Delete from the second word (if it is 11111, end the encoding)

001:删1个单词(一次删增单词数限制8个内或4个内等)001: Delete 1 word (the number of words deleted at a time is limited to 8 or 4, etc.)

1:增词标志1: add word sign

00001:从第2个单词的位置开始增00001: Increment from the position of the second word

001:增1个单词001: add 1 word

10:增的单词young的频率编号的长度信息10: The length information of the frequency number of the increased word young

010100110:增的单词young的频率编号010100110: The frequency number of the word young added

1:增词标志1: add word sign

00100:从第5个单词的位置开始增00100: Increment from the position of the fifth word

001:增1个单词001: add 1 word

01:增的单词not的频率编号的长度信息01: The length information of the frequency number of the increased word not

001101:增的单词not的频率编号001101: Increased frequency number of the word not

0:遇到0,结束编码0: Encounter 0, end encoding

解码过程为:The decoding process is:

取出范句库里第0000100101句(第37句):The fat man is an engineer。Take out sentence 0000100101 (sentence 37) in the sample sentence library: The fat man is an engineer.

还应在范句库里读取此范句的时态词形选择段以修整此范句,这里修正后仍不变。Also should read the tense word form selection paragraph of this model sentence in the model sentence storehouse to revise this model sentence, still unchanged after amending here.

第1步作删操作后,得:the man is an engineerAfter the deletion operation instep 1, get: the man is an engineer

第2步作增操作,增young,得:the young man is an engineerThe second step is to increase the operation, increase young, get: the young man is an engineer

第3步作增操作,增not,得:the young man is not an engineerThe third step is to increase the operation, increase not, get: the young man is not an engineer

一般码后面还有此语句的时态词形选择段。(本例没写出)There is also the tense word form selection segment of this sentence behind the general sign indicating number. (not shown in this example)

(解码过程中,以上各步得的句里的各单词实际都是用频率编号表示的,这里仅为方便看才用单词本身来表示)。(in the decoding process, each word in the sentence obtained in the above steps is actually all represented by a frequency number, which is only represented by the word itself for convenience here).

解码到这一步,得到语句中各单词及单词组合的频率编号,用整句差别解码方式对单词段的解码也到此为止;接下来的解码步骤一般是(这里仅以此语句来说,没涉及陌生单词等):结合时态词形段的信息(时态词形段在本例暂没涉及)来查单词数据表,将各频率编号换成单词或单词组合的实际的词形形态的发音信息数据(比如词形音标),得到语句的预备发音数据;由MCU顺序发出预备发音数据中各音标的音来(与拾音器有关的存储器集里应存有各音标的录音数据比如ADPCM格式的录音数据),就是这语句‘The young man is not anengineer’的语音。Decode to this step, obtain the frequency numbering of each word and word combination in the sentence, and also end here with the decoding of the word segment by the whole sentence difference decoding mode; Involving unfamiliar words, etc.): check the word data table in combination with the information of the tense word form segment (the tense word form segment is not involved in this example), and replace each frequency number with the actual word form of the word or word combination Pronunciation information data (such as morphological phonetic symbols) obtains the prepared pronunciation data of sentence; The sound of each phonetic symbol in the prepared pronunciation data is sent out in sequence by MCU (the recording data of each phonetic symbol such as the ADPCM format should be stored in the memory set relevant to the pickup recording data), which is the voice of the sentence 'The young man is not an engineer'.

含多句子的语句的解码处理类似于单句子语句(需将多句子语句的范句录在范句库里)The decoding processing of sentences containing multiple sentences is similar to that of single sentence sentences (the model sentences of multi-sentence sentences need to be recorded in the sample sentence library)

(本词段解码方式里,各单词及单词组合的频率编号除了可以是本例示的阶梯编号外,还可以是等长编号或哈夫曼编号等;如为等长编号,哈夫曼编号那其前面是不带频率编号长度信息的。)(in this word segment decoding mode, the frequency numbering of each word and word combination can also be equal-length numbering or Huffman numbering etc. except the ladder numbering of this example; if it is equal-length numbering, then the Huffman numbering There is no frequency number length information in front of it.)

前后调用处理:是与前面几种词段解码方式中的一种结合使用的,比如与频率排序方式合用,先按频率排序解码方式解出单词段,如其中某频率编号为111,即表示此单词或单词组合是采用前后调用处理方式的,然后调后面的前后调用段信息来得到这个单词或单词组合的实际频率编号。Front and back call processing: It is used in combination with one of the previous word segment decoding methods, for example, it is used together with the frequency sorting method, and the word segment is first decoded by the frequency sorting decoding method. The word or word combination adopts the forward and backward call processing method, and then adjusts the back and forth call section information to obtain the actual frequency number of the word or word combination.

接下来是时态词形选择段(有关时态词形选择的处理以单词为例,单词组合的处理与单词相同):解出前面的单词段的各频率编号(单词)以后,依次查每个频率编号(单词)在单词数据库里对应的记录项里此频率编号(单词)的词形形态个数(1个或多个,可以是在单词数据表的此单词的记录项里专门开一个字段来表示此单词有多少个词形形态,也可以是采取其它方式来表示的),确定此频率编号(单词)对应的时态词形选择值的长度是没有,还是1位BIT,2位BIT(有3个或4个词形形态的单词就占2位bit),或更多位,然后依次在时态词形选择段里读取各频率编号(单词)的时态词形选择值BIT,并依一个时态词形选择值表示的顺序值在单词数据库里此频率编号(单词)对应的记录项里查到此频率编号(单词)的词形形态,并取出其词形形态的发音信息数据(比如词形音标)即可;依次按这样处理完读得的各个时态词形选择值即可;对于一个单词有几种词性,各词性下又有不同词形变化,其各词性下的各词形形态都统一记录在单词数据库的此单词对应的记录项里,解码时不分其各词形形态的词性仍按上述步骤解码即可。Next is tense word form selection section (the processing of relevant tense word form selection is example with word, and the processing of word combination is identical with word): after solving each frequency numbering (word) of preceding word section, check each successively A frequency number (word) in the record item corresponding to this frequency number (word) in the record item of the word database (1 or more, can be to open a specially in the record item of this word in the word data table field to indicate how many morphological forms this word has, and it can also be expressed in other ways), to determine whether the length of the tense morphological selection value corresponding to this frequency number (word) is not, or 1 bit BIT, 2 bits BIT (the word with 3 or 4 morphological forms just accounts for 2 bits), or more bits, then read the temporal morphological selection value of each frequency number (word) in the temporal morphological selection segment successively BIT, and according to the sequence value represented by a tense word form selection value, find the word form of this frequency number (word) in the record item corresponding to this frequency number (word) in the word database, and take out the word form of its word form Pronunciation information data (such as morphological phonetic symbols) get final product; Process each tense morphological selection value that has been read in turn and get final product; There are several parts of speech for a word, and different morphological changes are arranged under each part of speech, and its each Each form of speech under the part of speech is all uniformly recorded in the record item corresponding to this word in the word database, and the part of speech of each form of speech is still decoded according to the above steps when decoding.

举一例来说明对时态词形选择段的解码处理:语句原话是:The man is an engineer.Give an example to illustrate the decoding processing of tense word form selection: the original sentence is: The man is an engineer.

编解码时无陌生词,是用频率排序方式来对单词段编码的,以单词作编解码为例(单词组合的处理与单词相同);此语句的编码值即是一个处理单位的值;There are no unfamiliar words during encoding and decoding, and word segments are encoded by frequency sorting, taking words as an example (the processing of word combinations is the same as that of words); the encoding value of this sentence is the value of a processing unit;

(处理单位的头段的值可不用看)(The value of the header section of the processing unit can be ignored)

处理单位的单词段的数据为:001010001010011100100010101100010110010110010(前面5个BIT 00101表示本单词段里单词的个数,后面的BIT是5个频率编号的信息,依次是各频率编号的长度信息和频率编号,此单词段的数据格式同于前面阐述的单词段的频率排序编解码方式里述及的单词段的数据格式)The data of the word section of the processing unit is: 001010001010011100100010101100010110010110010 (the first 5 BITs 00101 represent the number of words in this word section, and the following BITs are the information of 5 frequency numbers, followed by the length information and frequency numbers of each frequency number. The data format of the word segment is the same as the data format of the word segment mentioned in the frequency sorting codec method of the word segment described above)

处理单位的时态词形选择段的数据为:000010The data of the tense word form selection segment of the processing unit is: 000010

解码过程:Decoding process:

先用频率排序解码方式(具体解码过程参见前述)从单词段解出各频率编号是:First use the frequency sorting decoding method (see the foregoing for the specific decoding process) to solve each frequency number from the word segment:

010 011100100 010110 101 010110010010 011100100 010110 101 010110010

再在单词数据库里查这5个频率编号(单词)对应的纪录项,得010(单词)只有一个词形形态,没有时态词形选择值,011100100(单词)有2个词形形态,时态词形选择值占1个BIT,010110(单词)有5个词形形态,时态词形选择值占3个BIT,101(单词)有2个词形形态,时态词形选择值占1个BIT,010110010(单词)有2个词形形态,时态词形选择值占1个BIT。Check the record items corresponding to these 5 frequency numbers (words) in the word database again, get 010 (word) only has one morphological form, there is no tense morphological selection value, 011100100 (word) has 2 morphological forms, time Morphological form selection value occupies 1 BIT, 010110 (word) has 5 morphological forms, tense morphological selection value accounts for 3 BITs, 101 (word) has 2 morphological forms, tense morphological selection value accounts for 1 BIT, 010110010 (word) has 2 morphological forms, and the tense morphological selection value occupies 1 BIT.

其中010(the)一词不占时态词形选择值,然后从时态词形选择段的数据(000010)取各频率编号(单词)的时态词形选择值:Wherein 010 (the) word does not account for tense form selection value, then gets the tense form selection value of each frequency number (word) from the data (000010) of tense form selection section:

0:011100100(频率编号)的时态词形选择值0: tense word form selection value of 011100100 (frequency number)

000:010110的时态词形选择值000: tense word form selection value for 010110

1:101的时态词形选择值1:101 tense word form selection value

0:010110010的时态词形选择值0: tense word form selection value of 010110010

再在单词数据库的这4个频率编号对应的4个记录项里,各依时态词形选择值表示的顺序值得到实际的各单词的词形形态:In the 4 record items corresponding to these 4 frequency numbers of the word database, each according to the sequence value represented by the tense morphological selection value obtains the morphological form of each word actually:

0:在单词数据库此频率编号(011100100)对应的记录项里有man,men共两个词形形态,由于时态词形选择值(顺序值)是0,所以得到实际的词形形态是第1个man;0: In the record item corresponding to this frequency number (011100100) in the word database, there are two morphological forms of man and men. Since the tense morphological selection value (order value) is 0, the actual morphological form obtained is the first 1 man;

000:在单词数据库此频率编号(010110)对应的记录项里有is,be,was,been,being共五个词形形态,由于时态词形选择值是000,所以得到实际的词形形态是第1个is;000: In the record item corresponding to this frequency number (010110) in the word database, there are five morphological forms of is, be, was, been, and being. Since the tense morphological selection value is 000, the actual morphological form is obtained is the first is;

1:在单词数据库此频率编号(101)对应的记录项里有a,an共两个词形形态,由于时态词形选择值是1,所以得到实际的词形形态是第2个an;1: There are two morphological forms of a and an in the record item corresponding to the frequency number (101) in the word database. Since the tense morphological selection value is 1, the actual morphological form is the second an;

0:在单词数据库此频率编号(010110010)对应的记录项里有engineer,engineers共两个词形形态,由于时态词形选择值是0,所以得到实际的词形形态是第1个engineer;0: In the record item corresponding to this frequency number (010110010) in the word database, there are engineer and engineers, a total of two morphological forms. Since the tense morphological selection value is 0, the actual morphological form obtained is the first engineer;

对于解出的最前面的010(the),在单词数据库找到此频率编号对应的记录项,取其唯一的词形形态即可。在解码时,在确定各频率编号(单词)的实际的词形形态后,一般是在单词数据库里各频率编号(单词)对应的记录项里取出各单词实际的词形形态的发音信息数据(比如词形音标),所以解码到这里得:For the first 010 (the) that is solved, find the record item corresponding to this frequency number in the word database, and get its unique word form. When decoding, after determining the actual morphological form of each frequency number (word), generally the pronunciation information data ( For example, word form phonetic symbols), so the decoding is here:

’the’单词的词形音标The phonetic transcription of the word 'the'

’man’单词的词形音标The phonetic transcription of the word 'man'

’is’单词的词形音标The phonetic transcription of the word 'is'

’an’单词的词形音标The phonetic transcription of the word 'an'

’engineer’单词的词形音标The phonetic transcription of the word 'engineer'

把以上5行数据合起来就是语句的预备发音数据,由MCU顺次发出上面5行词形音标的音(与拾音器有关的存储器集里应存有各音标的录音数据比如ADPCM格式的录音数据),即是语句的发音(当然发音时各单词的词形音标之间需有停顿),这里是用整步直接发音方式来播放所解码语句的发音的。Combining the above 5 lines of data is the preparatory pronunciation data of the sentence, and the MCU sends out the phonetic symbols of the above 5 lines in sequence (the recording data of each phonetic symbol should be stored in the memory set related to the pickup, such as the recording data of ADPCM format) , which is the pronunciation of the sentence (certainly there needs to be a pause between the word-form phonetic symbols of each word during pronunciation), here is to play the pronunciation of the decoded sentence with the direct pronunciation mode of the whole step.

显然在单词段里的特殊频率编号(比如表示陌生单词的111111或表示前后调用的111)是没有时态词形选择值的。Obviously, the special frequency numbers in the word segments (such as 111111 representing unfamiliar words or 111 representing front and rear calls) do not have tense word form selection values.

当在词段中解出111111,表示是陌生单词,陌生单词的编码信息放在后面的陌生单词段里,在陌生单词的编码信息中,先取出陌生单词的编码信息的格式选择信息(元辅音作组合,词根元辅音组合,差别式,短单词元辅音组合),接着按不同的编码格式处理:When 111111 is solved in the word segment, it means that it is a strange word, and the coding information of the strange word is placed in the strange word segment at the back. Combination, root vowel and consonant combination, differential formula, short word vowel and consonant combination), and then processed according to different encoding formats:

1:如是元辅音作组合的编码格式时,接着在陌生单词的编码信息里取出组成陌生单词发音的元辅音的个数信息,再取出各元辅音的编号(即音标编号,每个音标编号为6BIT),直接由这些元辅音的编号来组成此陌生单词的发音信息数据;在解码发此陌生单词的音时,顺次发出组成此陌生单词的发音信息数据中的各元辅音编号(即音标)对应的语音即可。1: When the encoding format of the combination of vowels and consonants, then take out the number information of the vowels and consonants forming the pronunciation of the unfamiliar word in the encoding information of the unfamiliar word, and then take out the numbering of each vowel and consonant (that is, the phonetic symbol numbering, each phonetic symbol numbering is 6BIT), directly form the pronunciation information data of this unfamiliar word by the numbering of these vowels and consonants; ) corresponding to the voice.

2:如是词根元辅音组合的编码格式时,接着在陌生单词的编码信息里取出组成陌生单词发音的词根和元辅音的总个数信息,再取出各词根及元辅音的顺序号,并需查词根数据库,将各词根顺序号转成其发音信息数据(即由哪些音标编号组成其发音),还需查词根数据库,将各元辅音顺序号转成其发音信息数据(即对应的音标编号);用这些词根的发音信息数据及元辅音的发音信息数据来组成此陌生单词的发音信息数据(用组合起来的音标编号表示);在解码发此陌生单词的音时,顺次发出组成此陌生单词的发音信息数据的各音标编号对应的语音即可;(实际中在某陌生单词的编码信息里也可能只有词根或只有元辅音的顺序号,解码时就只用词根的发音信息数据或只用元辅音的发音信息数据来组成此陌生单词的发音信息数据)2: If it is the encoding format of root vowel and consonant combination, then take out the total number information of the root and vowel consonant that make up the pronunciation of the unfamiliar word in the encoding information of the unfamiliar word, and then take out the sequence number of each root and vowel consonant, and need to check The root database converts the sequence number of each root into its pronunciation information data (that is, which phonetic symbols numbers form its pronunciation), and also needs to check the root database, and converts the sequence numbers of each element and consonant into its pronunciation information data (that is, the corresponding phonetic symbols number) ; Form the pronunciation information data of this unfamiliar word (with the combined phonetic symbol numbering) with the pronunciation information data of these word roots and the pronunciation information data of vowels and consonants; The pronunciation corresponding to each phonetic symbol numbering of the pronunciation information data of word gets final product; (actually also may only have root or only the sequence number of vowel consonant in the coding information of certain strange word, just only only use the pronunciation information data of word root or only Use the pronunciation information data of vowel and consonant to form the pronunciation information data of this unfamiliar word)

3:如是差别式的编码格式时,接着在陌生词的编码信息里取出12bit的单词数据库里单词的寻址地址,并在单词数据库里去找到此单词对应的记录项,并取出其发音信息数据(由音标编号组成),接着看在陌生单词的编码信息里如是0,表示删,接着是删的起始位置,删几个音标编号,接着如还是0,继续删;直到1,开始增,增的起始位置,增几个音标编号,接着是各音标的编号(6bit),接着如还是1,继续增,直到0结束,最后得到陌生单词的发音信息数据(由音标编号组成);在解码发此陌生单词的音时,顺次发出组成此陌生单词的发音信息数据的各音标编号对应的语音即可;3: If it is a differential encoding format, then take out the address address of the word in the 12-bit word database in the encoding information of the unfamiliar word, and find the record item corresponding to the word in the word database, and take out its pronunciation information data (composed of phonetic symbol numbers), then see that in the coding information of unfamiliar words, if it is 0, it means deletion, then the starting position of deletion, delete several phonetic symbol numbers, and then if it is still 0, continue to delete; until 1, start to increase, Increase the starting position, increase several phonetic symbol numbers, then be the numbering (6bit) of each phonetic symbol, then continue to increase as still 1, until 0 ends, finally obtain the pronunciation information data (made up of phonetic symbol numbers) of unfamiliar words; When decoding the sound of the unfamiliar word, the voice corresponding to each phonetic symbol number that forms the pronunciation information data of the unfamiliar word can be issued sequentially;

4:如是短单词元辅音组合的编码格式时,接着在陌生单词的编码信息里取出组成陌生单词发音的短单词和元辅音的总个数信息,再取出各短单词及元辅音的顺序号,并需查短单词数据库,将各短单词顺序号转成其发音信息数据(即由哪些音标编号组成其发音),还需查短单词数据库,将各元辅音顺序号转成其发音信息数据(即对应的音标编号);用这些短单词的发音信息数据及元辅音的发音信息数据来组成此陌生单词的发音信息数据(用组合起来的音标编号表示);在解码发此陌生单词的音时,顺次发出组成此陌生单词的发音信息数据的各音标编号对应的语音即可。4: If it is the encoding format of short words and vowels and consonants, then take out the total number information of short words and vowels and consonants that make up the pronunciation of unfamiliar words in the encoding information of unfamiliar words, and then take out the sequence numbers of each short word and vowels and consonants, And need to look up short word database, each short word sequence number is changed into its pronunciation information data (that is, it is made up of its pronunciation by which phonetic symbol numbers), also need look up short word database, each element consonant sequence number is changed into its pronunciation information data ( That is, the corresponding phonetic symbol numbering); use the pronunciation information data of these short words and the pronunciation information data of vowel consonant to form the pronunciation information data of this strange word (with the phonetic symbol numbering that combines); When decoding the sound of this strange word , sequentially send out the voice corresponding to each phonetic symbol number that forms the pronunciation information data of this unfamiliar word.

如一个单词段里有多个陌生单词(一个单词段里对应有多个111111),那么各陌生单词的编码信息是在陌生单词段里顺序排列的,解码时,对单词段里的各111111,在陌生单词段里顺次取出各个陌生单词的编码信息来组成各陌生单词的发音信息数据,并且按前述有关陌生单词的解码步骤,在陌生单词段里取完一个陌生单词的编码信息接着就是下一个陌生单词的编码信息;本段述的这种处理是技术常识。If there are multiple unfamiliar words in a word segment (corresponding to multiple 111111 in a word segment), then the coding information of each unfamiliar word is arranged sequentially in the unfamiliar word segment. When decoding, each 111111 in the word segment, The coding information of each unfamiliar word is sequentially taken out in the unfamiliar word section to form the pronunciation information data of each unfamiliar word, and according to the aforementioned decoding steps about unfamiliar words, the encoding information of a strange word is taken out in the unfamiliar word section, and then the next unfamiliar word is obtained. The encoded information of the word; this processing described in this paragraph is common sense.

如单词段里不出现111111,显然处理单位中的陌生单词段也不占BIT(也即处理单位中不存在陌生单词段)。If 111111 does not appear in the word segment, it is obvious that the unfamiliar word segment in the processing unit does not occupy BIT (that is, there is no unfamiliar word segment in the processing unit).

以上7段所述是以只适用于英语为例的解码规则所采用的陌生单词处理方式,而且涉及到解码时陌生单词的发音处理是以解码规则是采用整步直接发音方式来播放语句发音为例的(也可以是用分步直接发音方式,如解码规则采用分步直接发音方式来来播放语句发音,那对于任一个陌生单词,在解码过程中得到的此陌生单词的发音信息数据一般应放在一个部分预备发音数据里,一般不会分开放在多个部分预备发音数据里)。对于印欧语系其它语种的陌生单词处理也可按英语这样处理。The above 7 paragraphs are the unfamiliar word processing method adopted by the decoding rule that is only applicable to English as an example, and the pronunciation processing of the unfamiliar word when it is related to decoding is that the decoding rule adopts the whole-step direct pronunciation method to play the sentence pronunciation as (also can be to use step-by-step direct pronunciation mode, adopt step-by-step direct pronunciation mode to play sentence pronunciation as decoding rule, then for any strange word, the pronunciation information data of this strange word that obtains in decoding process generally should be It is placed in a part of the prepared pronunciation data, and generally it is not placed in multiple parts of the prepared pronunciation data). It can also be handled in the same way as English for unfamiliar words in other languages of the Indo-European language family.

再接下来是前后调用段,如单词段里出现频率编号111,就要到这段顺序读,每个前后调用的具体信息包括往前(或往后)调的单词或单词组合的相对位置等,再根据这些位置信息得到单词段里的111对应的实际频率编号。The next step is the forward and backward call section. If the frequency number 111 appears in the word section, it is necessary to read this section sequentially. The specific information of each front and back call includes the relative position of the forward (or backward) tuned word or word combination, etc. , and then obtain the actual frequency number corresponding to 111 in the word segment according to these position information.

以上的BIT数据读取采用顺序读取即可,前面处理完,前面解出的数据又是后面段解码的基础。到一语句解完,本语句数据也读取完。The above BIT data reading can be read sequentially. After the previous processing is completed, the data deciphered earlier is the basis for the subsequent decoding. When a statement is solved, the data of this statement is also read.

解码最后,将语句的预备发音数据转成语音,即为语句的发音。At the end of decoding, the prepared pronunciation data of the sentence is converted into speech, which is the pronunciation of the sentence.

下面举一个较完整的英语的解码例子:Here is a more complete English decoding example:

语句原话是:The handsome man is an engineer.The original sentence is: The handsome man is an engineer.

语句编码值(即表示语句发音的拼合信息的值,也即一个处理单位的值)为:The sentence encoding value (that is, the value of the combined information representing the pronunciation of the sentence, that is, the value of a processing unit) is:

00 00 001100001001111111100111001000101011000101100101100100 00 0011000010011111111001110010001010110001011001011001

00000100101010010110000101110010000010010101001011000010111001

顺序看各BIT:Look at each BIT in order:

头段:header:

00:本语句是英语语种00: This statement is in English language

00:单词段采用第1种词段编码方式(即频率排序编码方式)00: The word segment adopts the first word segment encoding method (that is, the frequency sorting encoding method)

单词段:word segment:

00110:单词段里的单词及单词组合个数(6个)00110: the number of words and word combinations in the word segment (6)

00:the单词的频率编号的长度信息00: Length information of the frequency number of the word

010:the单词的频率编号010: The frequency number of the word

01:111111的频率编号长度信息01: Frequency number length information of 111111

111111:111111代表陌生单词(handsome),(111111也计入单词及单词组合个数)111111: 111111 represents an unfamiliar word (handsome), (111111 is also included in the number of words and word combinations)

10:man单词的频率编号的长度信息10: The length information of the frequency number of the man word

011100100:man单词的频率编号011100100: the frequency number of the word man

01:is单词的频率编号的长度信息01: The length information of the frequency number of the is word

010110:is单词的频率编号010110: the frequency number of the word is

00:an单词的频率编号的长度信息00: Length information of the frequency number of an word

101:an单词的频率编号101: frequency number of word an

10:engineer单词的频率编号的长度信息10: The length information of the frequency number of the word engineer

010110010:engineer单词的频率编号010110010: the frequency number of the word engineer

时态词形选择段(象the这样没词形变化的单词在此段不占BIT):Tenses and forms selection section (words without inflection like the do not occupy BIT in this section):

0:man单词的时态词形选择值(此单词在单词数据库此词对应的记录项里有man,men共两个词形形态,用1个bit表示时态词形选择值,本语句用的man词是此词两个词形形态man,men中的第1个,所以时态词形选择值是0)0: tense word form selection value of the word man (there are two word form forms of man and men in the record item corresponding to this word in the word database, and 1 bit is used to represent the tense word form selection value. This statement uses The word man is the first of the two morphological forms of this word, man and men, so the tense morphological selection value is 0)

000:is单词的时态词形选择值(此单词有is,be,was,been,being共五个词形形态,用3个bit表示,is是5个词形形态中的第1个,所以时态词形选择值是000)000: tense word form selection value of the word is (this word has five form forms of is, be, was, been, being, represented by 3 bits, is is the first of the five form forms, So tense word form selection value is 000)

1:an单词的时态词形选择值(此单词有a,an共两个词形形态,用1个bit表示,an是2个词形形态中的第2个,所以时态词形选择值是1)1: The tense and word form selection value of the word an (this word has two forms, a and an, represented by 1 bit, and an is the second of the two forms, so the tense form is selected value is 1)

0:engineer单词的时态词形选择值(此单词有engineer,engineers共两个词形形态,用1个bit表示,engineer是2个词形形态的第1个,所以时态词形选择值是0)0: tense word form selection value of the word engineer (this word has engineer and engineers, which are represented by 1 bit, and engineer is the first of the two word forms, so the tense word form selection value is 0)

陌生单词段:Unfamiliar word segment:

01:陌生单词的编码信息的格式选择信息(词根元辅音组合格式)01: Format selection information for encoding information of unfamiliar words (root vowel and consonant combination format)

01:组成陌生单词发音的词根和元辅音的总个数(2个,本例只用了2个词根,没用元辅音;词根包括词基,前缀,后缀)01: The total number of roots and vowels that make up the pronunciation of unfamiliar words (2, only 2 roots are used in this example, no vowels; roots include word bases, prefixes, and suffixes)

0100101100:词根hand在词根数据库里的顺序号0100101100: The sequence number of the root hand in the root database

0010111001:词根some在词根数据库里的顺序号0010111001: the sequence number of the root some in the root database

解码过程如下:(这些解码步骤可由直播方式的以只适用于英语为例的编码规则对照推出,也可由前面那些页对直播方式的以只适用于英语为例的解码规则的描述来推出)The decoding process is as follows: (These decoding steps can be deduced from the encoding rules that are only applicable to English as an example in the live broadcast mode, or can be deduced from the descriptions of the decoding rules that are only applicable to English in the live broadcast mode on the previous pages)

依照上述各BIT表示的信息,According to the information represented by the above-mentioned BITs,

先取头段信息,得知本语句是英语语种且单词段是使用频率排序编码方式来编码的;First fetch the header information, and know that the sentence is in English and the word segment is coded using the frequency sorting coding method;

第1步解码,使用频率排序解码方式(直播方式的以只适用于英语为例的解码规则中的频率排序解码方式的更具体阐述在前面有述),先在单词段里取单词段里的单词及单词组合个数信息(6个),再在单词段里往下顺序取得6个频率编号,得:The first step is to decode, use the frequency sorting decoding method (the live broadcast method is only applicable to English as an example, the frequency sorting decoding method in the decoding rules is described in the front), first take the word segment in the word segment Information on the number of words and word combinations (6), and then obtain 6 frequency numbers sequentially in the word segment, and get:

010 111111 011100100 010110 101 010110010010 111111 011100100 010110 101 010110010

(这时单词段的数据取完,接着在语句编码值中就是时态词形选择段数据的起始位置)(at this moment, the data of the word segment has been fetched, and then in the sentence encoding value, it is the initial position of the tense word form selection segment data)

第2步解码,对于上一步解码得到的111111,查语句编码值中的陌生单词段(上一步解码得到语句编码值中单词段里各频率编号后,再查单词数据表,得各频率编号的时态词形选择值占的BIT位数,而得到时态词形选择段的BIT长度,即可定位陌生单词段在语句编码值中的起始位置,这是常识,象111111,111这样表示陌生词或前后调用的频率编号在时态词形选择段里就不占时态词形选择值BIT),先得陌生单词的编码信息的格式是词根元辅音组合格式,再在陌生单词段往下取词根元辅音的总个数信息及各词根元辅音的顺序号(本例只用了2个词根,没用元辅音),并用词根元辅音组合格式来构造陌生单词发音,得:The 2nd step is decoded, for the 111111 that the decoding of the previous step obtains, look up the unfamiliar word segment in the sentence encoding value (after decoding in the previous step to obtain each frequency number in the word segment in the sentence encoding value, check the word data table again, and get the number of each frequency number The BIT digits occupied by the tense word form selection value, and the BIT length of the tense word form selection segment can be obtained to locate the starting position of the unfamiliar word segment in the sentence encoding value. This is common sense, such as 111111, 111. Unfamiliar words or the frequency numbers that are called before and after do not account for the tense morphological selection value BIT) in the tense morphological selection section. Get the total number information of root vowel consonants and the sequence number of each root vowel consonant (this example has only used 2 roots, no vowel consonants), and use the root vowel consonant combination format to construct the pronunciation of unfamiliar words, get:

010(01001011000010111001)011100100 010110 101 010110010010(01001011000010111001)011100100 010110 101 010110010

第3步,查单词数据表得上一步解码得到的各频率编号的时态词形选择值占的BIT位数,再查语句编码值中的时态词形选择段,得上一步解码得到的各频率编号(即单词或单词组合)的时态词形选择值(有些单词或单词组合比如THE是没选择值的),再结合各频率编号的时态词形选择值查单词数据表,将第2步解码得到的各频率编号换成单词或单词组合的实际的词形形态的发音信息数据(本解码例的发音信息数据是以词形音标为例的),再查处理陌生单词要涉及的词根数据库,将用于组成陌生单词发音的两个词根顺序号(第2步得的结果中的括号内的,’hand’词根和’some’词根的顺序号)各自换成音标编号组合并将这两个词根的音标编号组合排在一起成陌生单词的发音信息数据,得语句的预备发音数据顺序为:The 3rd step, look up the word data table to get the BIT digits that the tense word form selection value of each frequency number obtained by decoding in the previous step accounts for, and then check the tense word form selection segment in the sentence code value, and obtain the previous step decoding The tense morphological selection value of each frequency number (i.e. word or word combination) (some words or word combination such as THE do not have selection value), then in conjunction with the tense morphological selection value of each frequency numbering check word data table, will Each frequency number that the 2nd step decoding obtains is changed into the pronunciation information data (the pronunciation information data of this decoding example is with the word form and phonetic symbol) of the actual morphological form of word or combination of words, then look into and deal with unfamiliar words and will involve root database, the two root sequence numbers used to form the pronunciation of unfamiliar words (in the brackets in the results obtained in the second step, the sequence numbers of the 'hand' root and the 'some' root) are respectively replaced by phonetic symbols and combined The phonetic symbols numbers of these two roots are combined to form the pronunciation information data of unfamiliar words, and the order of the prepared pronunciation data of the sentence is:

‘the’单词的词形音标The phonetic transcription of the word 'the'

‘hand’的音标编号组合和‘some’的音标编号组合(此行是陌生单词的发音信息数据)The phonetic number combination of 'hand' and the phonetic number combination of 'some' (this line is the pronunciation information data of unfamiliar words)

‘man’单词的词形音标The phonetic transcription of the word 'man'

‘is’单词的词形音标The phonetic transcription of the word 'is'

‘an’单词的词形音标The phonetic transcription of the word 'an'

‘engineer’单词的词形音标The phonetic transcription of the word 'engineer'

把以上6行数据顺次合起来即语句的预备发音数据,可由MCU驱动喇叭顺次播出以上6行(即语句的预备发音数据中)各音标的音(与拾音器有关的存储器集里应存有各音标的录音数据比如ADPCM格式的录音数据),即是语句的全部发音。至此,此语句编码值的解码过程结束。本解码例子是用整步直接发音方式来播放所解码的语句的发音的。在发音时,在每个单词或单词组合的音标的结束处(即各单词或单词组合之间)需有停顿,这是常识。The above 6 lines of data are put together in order to be the preparatory pronunciation data of the sentence, and the sound of each phonetic symbol of the above 6 lines (that is, in the preparatory pronunciation data of the sentence) can be played in sequence by the MCU driving the speaker (should be stored in the memory set related to the pickup device) There are recording data of each phonetic symbol, such as recording data in ADPCM format), that is, the entire pronunciation of the sentence. At this point, the decoding process of the encoded value of this statement ends. This decoding example uses the full-step direct pronunciation method to play the pronunciation of the decoded sentence. When pronouncing, it is common sense that there should be a pause at the end of the phonetic symbols of each word or word combination (that is, between each word or word combination).

在本例解码的第3步中有关对时态词形选择段的解码处理的具体细节,还可参考本说明书第83页及第83页后面几页所述对时态词形选择段的解码处理。In the 3rd step of decoding in this example, about the specific details of the decoding process to the tense word form selection segment, you can also refer to the decoding of the tense word form selection segment described on the 83rd page and the following pages of the 83rd page of this manual deal with.

(本解码举例里,各单词及单词组合的频率编号除了可以是本例示的阶梯编号外,还可以是等长编号或哈夫曼编号等;如为等长编号,哈夫曼编号那其前面是不带频率编号长度信息的。)(In this decoding example, the frequency number of each word and word combination can be the ladder number of this example, it can also be a number of equal length or a Huffman number, etc.; if it is a number of equal length, the Huffman number that precedes it It does not carry frequency number length information.)

非印欧语系的解码处理:Decode processing for non-Indo-European languages:

以下叙述非印欧语系语种的解码处理时,涉及到举例时是以汉语为例的。When describing the decoding processing of non-Indo-European languages in the following, Chinese is used as an example when it comes to examples.

显然,对非印欧语系任一语种,解码时对音码的定义方式及使用的音码情况与对拼合信息的值编码(这里说的解码与编码是专门对应的)时对音码的定义方式及使用的音码情况一般是相同或兼容的;可被当成解码时涉及的音码的,可以是表示发音的元辅音或元音辅音组合,也可以是可以转换成发音的字,词,词组合或范句,也可以是经组合后可转换为发音的偏旁部首组合或偏旁部首,可由这些音码中的一种或多种来拼合解码时定义的解码语句集发音。Obviously, for any language of the non-Indo-European language family, the definition of the phonetic code and the situation of the used phonetic code when decoding and the definition of the phonetic code when encoding the value of the combined information (the decoding and encoding mentioned here are specially corresponding) The way and the phonetic codes used are generally the same or compatible; what can be regarded as the phonetic codes involved in decoding can be vowels and consonants or combinations of vowels and consonants that can be converted into pronunciations, words, words, etc. Word combination or model sentence, also can be the radical combination or radical that can be converted into pronunciation after being combined, and the decoding sentence set pronunciation that defines when spelling out decoding can be spliced by one or more in these phonetic codes.

在对非印欧语系的某一语种的拼合信息的值进行解码时,首先在与拾音器有关的存储器集里需存储解码数据,解码数据一般是与用某编码规则中有关此语种的编码细则对此语种语句编码时所使用的编码数据内容有关的,解码数据一般与此编码数据是一样或兼容的(本说明书述的以只适用于汉语为例的编码规则涉及的编码数据和与此编码规则对应的以只适用于汉语为例的解码规则涉及的解码数据是一样的,这里说的一样是指比如编码数据包括有A1,A2,A3三个数据库,解码数据包括有B1,B2,B3三个数据库,并且A1与B1的数据库名称及数据库结构及数据内容都相同,并且A2与B2的数据库名称及数据库结构及数据内容都相同,并且A3与B3的数据库名称及数据库结构及数据内容都相同),再使用解码程序来进行解码,此解码程序一般是按与对此语种语句编码时所用编码规则(此编码规则可以是只适用于此语种或适用于包括此语种在内的多个语种)有关此语种对应的解码规则(此解码规则可以是只适用于此语种或适用于包括此语种在内的多个语种)中有关此语种的解码细则来进行解码的,(前一句里提及的两个‘多个’不一定是相同数目)。When decoding the value of the mosaic information of a certain language of the non-Indo-European language family, the decoding data needs to be stored in the memory set related to the pickup at first. Regarding the content of the coded data used in the coding of sentences in this language, the decoded data is generally the same or compatible with the coded data (the coded data and the coded data involved in the coded rules that are only applicable to Chinese as an example described in this manual are related to the coded data. The corresponding decoded data involved in the decoding rules that only apply to Chinese as an example are the same. The same here means that, for example, the encoded data includes three databases A1, A2, and A3, and the decoded data includes three databases: B1, B2, and B3. database, and the database name, database structure and data content of A1 and B1 are the same, and the database name, database structure and data content of A2 and B2 are the same, and the database name, database structure and data content of A3 and B3 are the same ), and then use the decoding program to decode. This decoding program is generally according to the encoding rules used when encoding sentences in this language (this encoding rule can be only applicable to this language or applicable to multiple languages including this language) The decoding rules corresponding to this language (this decoding rule can be only applicable to this language or applicable to multiple languages including this language) are used to decode the decoding rules of this language, (mentioned in the previous sentence) The two 'pluralities' are not necessarily the same number).

对于非印欧语系任一语种,语音编码有声读物的整页面上的码字涉及的表示此语种语句发音的拼合信息的值是可被某种解码规则中有关此语种的直播方式的解码细则来解码的(此解码规则可以是只适用于此语种或适用于包括此语种在内的多个语种的)。For any language of the non-Indo-European language family, the value of the combined information representing the pronunciation of sentences in this language involved in the codewords on the entire page of the phonetic coded audiobook can be determined by the decoding rules related to the live broadcast method of this language in a certain decoding rule Decoded (this decoding rule can be only applicable to this language or applicable to multiple languages including this language).

如解码时播放语句发音是采取直接发音方式,那还需用到诸音码的发音数据(一般是诸音码的录音数据),至于如何对非印欧语系某语种的音码进行录音以产生诸音码的录音数据,在前面阐述直播方式的以只适用于汉语为例的编码规则时有述,并且在编解码过程中用于编解码的主要音码与解码时的发音音码不一定是相同种类,这在前面阐述直播方式的以只适用于汉语为例的编码规则时已有述。Take the direct pronunciation mode when playing sentence pronunciation during decoding, then also need to use the pronunciation data of all phonetic codes (generally the recording data of all phonetic codes), as to how to record the phonetic code of non-Indo-European language to produce The recording data of various phonetic codes is described in the live broadcast mode, which is only applicable to Chinese as an example, and the main phonetic codes used for coding and decoding are not necessarily the same as the pronunciation codes during decoding. It is the same type, which has been mentioned in the previous explanation of the coding rules of the live broadcast method, which is only applicable to Chinese as an example.

只适用于非印欧语系任一语种或可适用于包括数个非印欧语系语种在内的多个语种的编码规则可以有很多种,编码规则的种数实际上是没法穷举的,同样,只适用于非印欧语系任一语种或可适用于包括数个非印欧语系语种在内的多个语种的解码规则的种数也是没法穷举的(只适用于一个语种的解码规则就可有多种)。(本段提及的两个‘数个非印欧语系语种’不一定是相同所指,本段提及的两个‘多个语种’不一定是相同所指)There can be many kinds of encoding rules that are only applicable to any language of the non-Indo-European language family or applicable to multiple languages including several non-Indo-European languages. The number of encoding rules cannot be exhausted. Similarly, the number of decoding rules that are only applicable to any non-Indo-European language or applicable to multiple languages including several non-Indo-European languages is also inexhaustible (only applicable to decoding of one language) There are many kinds of rules). (The two 'several non-Indo-European languages' mentioned in this paragraph do not necessarily refer to the same, and the two 'multiple languages' mentioned in this paragraph do not necessarily refer to the same)

权利要求3,12,20或23中提及的其有关某语种的解码细则含有压缩解码处理的解码规则(此解码规则可以是只适用于此一个语种或适用于包括此一个语种在内的多个语种的)可以是与各种编码规则有关此语种对应的各种具体的解码规则。(本段提及的‘对应’是指一般对应或专门对应)The decoding rules about a certain language mentioned inclaim 3, 12, 20 or 23 contain decoding rules for compression decoding processing (this decoding rule can be only applicable to this one language or be applicable to many including this one language) language) can be various specific decoding rules corresponding to this language related to various coding rules. (The 'correspondence' mentioned in this paragraph refers to general correspondence or special correspondence)

权利要求3,12,20或23中提及的解码规则,可以是与本说明书提及或没提及的适用于非印欧语系任一语种(汉语或韩语或日语等)的其有关此语种的编码细则含有压缩编码处理的任一编码规则(此编码规则可能是只适用于此语种或适用于包括此语种在内的多个语种的)有关此语种对应的本说明书提及或没提及的适用于此语种的其有关此语种的解码细则含有压缩解码处理且是直播方式的的各种解码规则(这各种解码规则中任一解码规则可能是只适用于此语种或适用于包括此语种在内的多个语种的)。(本段提及的两个‘多个’不一定是相同数目,本段提及的‘对应’是指一般对应或专门对应)The decoding rules mentioned inclaim 3, 12, 20 or 23 may be related to any language (Chinese or Korean or Japanese, etc.) that is not mentioned in this specification and is applicable to any non-Indo-European language (Chinese or Korean or Japanese, etc.) The coding rules contain any coding rules for compression coding processing (this coding rule may be applicable only to this language or to multiple languages including this language). The decoding rules for this language applicable to this language contain various decoding rules for compressed decoding processing and live broadcasting (any decoding rule of these various decoding rules may only be applicable to this language or apply to include this multiple languages including one language). (The two "multiple" mentioned in this paragraph are not necessarily the same number, and the "correspondence" mentioned in this paragraph refers to general correspondence or special correspondence)

以下是以汉语作为非印欧语系语种的例子来阐述非印欧语系语种的解码技术的。The following uses Chinese as an example of a non-Indo-European language to illustrate the decoding technology of a non-Indo-European language.

以下述的是对与本说明书前面述的直播方式的以只适用于汉语为例的编码规则有关汉语对应的一种解码规则及其解码实例的阐述,此解码规则包括如下一些解码细则:解码语句集范围完整定义,用于解码时按直接发音方式播放语句发音的音码语音的处理的方式定义,用于解码的音码的种类选择,对音码赋编号,词数据库建立,词类的建立,各种词段解码方式,陌生词解码处理,头段信息读取及用发音形式播放语句发音等,并且此解码规则是以只适用于一个语种即汉语为例的(有时如附带提及到韩语,日语等其它语种的内容时是有注明是韩语或日语语种的),即此解码规则中的所有解码细则都是属于此解码规则中有关汉语语种的解码细则,以只适用于汉语为例的此解码规则是与直播方式的以只适用于汉语为例的编码规则对应的,其中对音码赋编号是有压缩处理的,解码时可供选用的几种词段解码方式(频率排序,频率关联混合,关联,整句差别等解码方式)中除频率排序解码方式外都是含有压缩解码处理的,所以直播方式的以只适用于汉语为例的此解码规则是含有压缩解码处理的,显然此解码规则里的压缩解码处理与直播方式的以只适用于汉语为例的编码规则里的压缩编码处理是对应的。此解码规则是直播方式的。非印欧语系其它语种比如韩语,日语等也可采用此解码规则,当然其具体细节与汉语有别,需作相应修改(当然这种情况时,相应的这些语种编码时需选用基本上同于本说明书前面述的直播方式的以只适用于汉语为例的编码规则的编码规则)。此直播方式的以只适用于汉语为例的解码规则及其解码实例的具体阐述是从下一行起至102页为止的:The following is a description of a decoding rule and its decoding examples corresponding to the Chinese language corresponding to the encoding rule of the live broadcast method mentioned above in this specification, which is only applicable to Chinese. This decoding rule includes the following decoding rules: decoding sentences Complete definition of set range, definition of the processing mode of the phonetic code voice for playing sentence pronunciation in the direct pronunciation mode when decoding, selection of the type of phonetic code used for decoding, assigning numbers to the phonetic code, establishment of word database, establishment of part of speech, Various word segment decoding methods, unfamiliar word decoding processing, header information reading and sentence pronunciation playing in pronunciation form, etc., and this decoding rule is only applicable to one language, that is, Chinese as an example (sometimes as mentioned in Korean , Japanese and other languages are marked as Korean or Japanese), that is, all the decoding rules in this decoding rule belong to the decoding rules related to Chinese language in this decoding rule, and only applicable to Chinese as an example This decoding rule corresponds to the encoding rule applicable to Chinese as an example in the live broadcast mode, in which the numbering of the phonetic code is compressed, and there are several decoding methods (frequency sorting, Decoding methods such as frequency correlation mixing, correlation, and whole sentence difference) all contain compression decoding processing except for the frequency sorting decoding method, so the live broadcast method is only applicable to Chinese as an example. This decoding rule contains compression decoding processing. Apparently, the compression decoding processing in this decoding rule corresponds to the compression coding processing in the coding rule applicable only to Chinese as an example in the live broadcast mode. This decoding rule is live. Other non-Indo-European languages such as Korean, Japanese, etc. can also use this decoding rule. Of course, the specific details are different from Chinese and need to be modified accordingly (of course, in this case, the corresponding codes for these languages need to be selected basically the same as The above-mentioned live mode of this instruction manual is only applicable to the coding rules of the coding rules of Chinese as an example). The specific elaboration of the decoding rules and decoding examples of this live broadcast method, which is only applicable to Chinese as an example, starts from the next line and ends on page 102:

完整定义的解码语句集的范围是任意汉语语句,也即解码语句集包括汉语任意语句。The scope of the fully defined decoding sentence set is any Chinese sentence, that is, the decoding sentence set includes any Chinese sentence.

解码规则中的用于解码时按直接发音方式播放语句发音的音码语音的处理的方式定义,用于解码的音码的种类选择,对音码赋编号,词数据库建立,词类的建立等解码细则是与编码时用的直播方式的以只适用于汉语为例的编码规则中相应的那些编码细则(那些编码细则也即用于解码时按直接发音方式播放语句发音的音码语音的处理的方式定义,用于解码的音码的种类选择,对音码赋编号等)相同的,相应建立的解码数据也是与编码时用的编码数据相同的(其中解码时用的词数据库与编码时用的词数据库是相同的);而此解码规则中的各种词段解码方式,陌生词解码处理这些解码细则是与编码用的直播方式的以只适用于汉语为例的编码规则中的各种词段编码方式,陌生词编码处理这些编码细则对应的。In the decoding rule, it is used to play the sound code voice processing method of the pronunciation of the sentence in the direct pronunciation mode when decoding, the type selection of the sound code used for decoding, the numbering of the sound code, the establishment of the word database, the establishment of the part of speech, etc. decoding Details are those encoding rules corresponding to the encoding rules that are only applicable to Chinese as an example in the live broadcast mode used during encoding (those encoding rules are also used for the processing of the phonetic code voice of playing the pronunciation of sentences in the direct pronunciation mode when decoding Mode definition, be used for the type selection of the sound sign indicating number of decoding, numbering etc. is assigned to sound sign indicating number) identical, the decoding data of corresponding establishment is also identical with the encoding data used when encoding (wherein the word database used when decoding is the same as the word database used when encoding) The word database is the same); and the various word segment decoding methods in this decoding rule, and the unfamiliar word decoding processing. Word segment encoding method and unfamiliar word encoding process correspond to these encoding rules.

此解码规则涉及的发音形式是:解码表示此解码规则定义的解码语句集里任一语句的发音的拼合信息的值时播放语句发音用的发音方式都是用整步直接发音方式。The pronunciation form involved in this decoding rule is: when decoding the value of the combined information representing the pronunciation of any sentence in the decoding sentence set defined by this decoding rule, the pronunciation mode used for playing the pronunciation of the sentence is all the full-step direct pronunciation mode.

在直播方式的以只适用于汉语为例的编码规则里已约定好解码这边是只对于汉语所有的可能的字的发音(约1300个带调音节)进行录音以形成诸音码的录音数据的,解码时语句的发音是由这些带调音节来组成的。以只适用于汉语为例的解码规则所用的诸音码的录音数据是按本说明书第46页和第47页述的第1种方式形成的。In the encoding rules that are only applicable to Chinese as an example in the live broadcast mode, it has been agreed that the decoding here is to record only the pronunciation of all possible characters in Chinese (about 1300 syllables with tones) to form the recording data of various phonetic codes Yes, the pronunciation of the sentence during decoding is composed of these toned syllables. The recording data of the used all phonetic codes of the decoding rule that is only applicable to Chinese is to form by the 1st kind of mode described in the 46th page and the 47th page of this specification sheet.

如采用上述第1种方式对带调音节(即拼音)来录音及在解码时用带调音节的发音来组成词及语句的发音,解码时如需处理变音,那可制定一些有关变音的发音规则(比如记录解码时遇到哪些词的频率编号靠在一起时要变音等),在解码生成语句的全部(或非全部)预备发音数据时,考察要生成的全部(或非全部)预备发音数据里各词的排列情况(比如在解码过程中考察各词的频率编号的排列情况),按有关变音的发音规则,来确定哪些词要变音及怎样变音,并在生成全部(或非全部)预备发音数据时将全部(或非全部)预备发音数据中组成要变音的词的那些带调音节(拼音)替换成要实际发的带调音节(拼音),再可按全部(或非全部)预备发音数据把语句的全部(或非全部)发音发出来,而编码这边可不管其是否有变音,对于可能有变音的词(或词组合),在词数据库里只需记录其一个发音(基本发音)的信息(比如词形拼音),而不需记录其怎样变音的信息,(当然这种方式要求所录的约1300个带调音节的音里要有这些变音的音节的音,如有些变音发音特殊,在原1300个带调音节里没有,那应补录进去);本段是按整步直接发音方式来说的。可把本段说的有关变音的发音规则写成直播方式的以只适用于汉语为例的解码规则里的解码细则。If the above-mentioned 1st method is adopted to record the syllable with tone (i.e. pinyin) and use the pronunciation of the syllable with tone to form the pronunciation of words and sentences when decoding, if it is necessary to deal with inflection during decoding, some relevant inflection can be formulated Pronunciation rules (for example, when recording the frequency numbers of words encountered when decoding, the pronunciation should be changed when they are close together, etc.), when decoding all (or not all) pre-pronunciation data of the generated sentence, examine all (or not all) to be generated ) Prepare the arrangement of each word in the pronunciation data (such as examining the arrangement of the frequency numbers of each word in the decoding process), according to the pronunciation rules of the relevant pronunciation, determine which words will be changed and how, and generate When all (or not all) prepare the pronunciation data, those band tone syllables (pinyin) that form the word to change the pronunciation in all (or not all) prepare pronunciation data are replaced with the band tone syllables (pinyin) that will actually send out, and then can All (or not all) of the sentence is pronounced according to all (or not all) of the prepared pronunciation data, and the coding side can no matter whether it has an inflection, for the word (or word combination) that may have an inflection, in the word In the database, only the information of one pronunciation (basic pronunciation) (such as word form pinyin) needs to be recorded, and the information of how to change the pronunciation is not required. If there are syllables with these inflections, if some inflections have special pronunciations and are not included in the original 1300 syllables with tones, they should be added in); this section is based on the whole-step direct pronunciation method. The pronunciation rules about inflection mentioned in this paragraph can be written as the decoding rules in the decoding rules that only apply to Chinese as an example in the live broadcast mode.

按常规,语句可以是一个句子或一个句子的部分,语句也可以是含有多个句子。编码时,是用一个处理单位对应一个语句(一个处理单位包括头段,词段,陌生词段,前后调用段),如语句含有多个句子的,这时需定义一个编号作为句子间停顿(断句)的标志(此停顿编号可与词的频率编号统一编号,即占用一个词的频率编号,编解码时把停顿当成词来同等对待处理);对于把一个句子的部分(非完整句子)作为一个语句,用一个处理单位对应此语句的,对此语句的编解码处理仍可使用本说明书反复叙述的对于一完整句子的编解码处理方式(因如一个句子缺少如主语或谓语等后,并不影响采用频率排序词段编解码方式等对其编解码)。下面例是这样的:语句就是一个中间无逗号的句子,用一个处理单位对应一个语句,一个处理单位的值就是表示语句发音的拼合信息的值。Conventionally, a statement can be a sentence or part of a sentence, or a statement can consist of multiple sentences. When encoding, a processing unit is used to correspond to a sentence (a processing unit includes a header, a word segment, an unfamiliar word segment, and a call segment before and after). If the sentence contains multiple sentences, a number needs to be defined as a pause between sentences ( Sentence) mark (this pause numbering can be uniformly numbered with the frequency numbering of words, promptly takes up the frequency numbering of a word, treats the pause as words equally during encoding and decoding); for the part (non-complete sentence) of a sentence as If a sentence corresponds to a sentence with a processing unit, the coding and decoding processing of this sentence can still use the coding and decoding processing method for a complete sentence repeatedly described in this manual (because if a sentence lacks such as a subject or a predicate, etc., and It does not affect the encoding and decoding of frequency sorting word segments, etc.). The following example is like this: a sentence is a sentence without a comma in the middle, and a processing unit corresponds to a sentence, and the value of a processing unit is the value of the combined information representing the pronunciation of the sentence.

一个处理单位可以看作含有:头段,词段,陌生词段,前后调用段等,(这些段中的词段是必需的,其它段是可能有,也可能无的,比如在某被编码语句中没陌生词,那其处理单位里就没有陌生词段;前后调用段也是可能有,可能没有的);解码时,也是相应地按段来读取处理单位中各段的数据(按段来读取处理单位中各段的数据,也可认为其是属于解码规则中的一个解码细则)。A processing unit can be regarded as including: head segment, word segment, unfamiliar word segment, front and back calling segment, etc. If there is no unfamiliar word in the statement, then there will be no unfamiliar word segment in its processing unit; there may or may not be a calling segment before and after); when decoding, the data of each segment in the processing unit is also read correspondingly by segment (by segment To read the data of each segment in the processing unit, it can also be considered as a decoding rule in the decoding rule).

先读取头段含的信息:本编码语句属于什么语种,本处理单位的词段使用何种词段编码方式;头段里也可带有本编码语句中的词及词组合的个数信息(但不必需带);根据不同需要,头段里的各信息可以省略;与前面的直播方式的以只适用于汉语为例的编码规则相对应,本解码规则及解码举例里约定所读取的头段里记录有本编码语句属于什么语种和本处理单位的词段使用何种词段编码方式这两个信息,每个信息是各自固定分配2个BIT来记录的。First read the information contained in the header: what language does the coded statement belong to, and which word segment coding method is used for the word segment of this processing unit; the header can also contain the number information of the words and word combinations in the coded sentence (but not necessary); according to different needs, the information in the header can be omitted; corresponding to the encoding rules that are only applicable to Chinese in the previous live broadcast method, the decoding rules and decoding examples agreed to read In the head section of the file, there are two information records, which language the coded statement belongs to and which word segment encoding method is used for the word segment of this processing unit. Each information is fixedly assigned 2 BITs to record.

接下来是词段,按其使用的词段编码方式而使用相应的词段解码方式来解码(相关数据的数据结构参见前面编码时的介绍)。Next is the word segment, which is decoded using the corresponding word segment decoding method according to the word segment encoding method used (for the data structure of relevant data, refer to the introduction during the previous encoding).

再接下来是陌生词段,对此段数据的解码处理是与对此段编码时的处理对应的。解码时,如在前面解出的词段里有代表陌生词的频率编号111111,就在此段取陌生词的编码信息来组成陌生词的发音信息数据。如词段中没出现频率编号111111,那本处理单位就不存在陌生词段。Next is an unfamiliar word segment, and the decoding processing of this segment of data corresponds to the processing of this segment of encoding. During decoding, if the frequency number 111111 representing the unfamiliar word is arranged in the word segment solved in front, just get the encoding information of the unfamiliar word to form the pronunciation information data of the unfamiliar word in this section. If the frequency number 111111 does not appear in the word segment, then there is no unfamiliar word segment in the processing unit.

再接下来是前后调用段,对此段数据的解码处理是与对此段编码时的处理对应的。解码时,如在前面解出的词段里有代表前后调用的频率编号111,就在此段取往前或往后调用及调用的词或词组合的位置信息。如词段中没出现频率编号111,那本处理单位就不存在前后调用段。Next is the front and rear calling segments, and the decoding processing of this segment of data corresponds to the processing of this segment of encoding. During decoding, if there is the frequency number 111 of calling before and after being represented in the word section that is unwrapped in front, just get the position information of the word or the word combination of calling and calling before this section. If the frequency number 111 does not appear in the word segment, then there is no front and rear calling segments in this processing unit.

以下是与(直播方式的以只适用于汉语为例的编码规则里的)几种词段编码方式相对应的(直播方式的以只适用于汉语为例的解码规则里的)几种词段解码方式(这几种词段解码方式是用于对词段的解码;频率排序,频率关联混合,关联,整句差别等这几种词段解码方式中除频率排序解码方式外都是含有压缩解码处理的):The following are several word segment encoding methods (in the live broadcast method only applicable to Chinese as an example) several word segment encoding methods Decoding method (these kinds of word segment decoding methods are used to decode the word segment; frequency sorting, frequency correlation mixing, association, whole sentence difference, etc., all of these word segment decoding methods contain compression except for the frequency sorting decoding method decoded):

频率排序方式:Sort by frequency:

先在词段的前面取5个bit的词段里含有的词及词组合的个数信息,然后只需在词段里再顺序取出各词及词组合的信息,先是取2BIT的一个词或词组合的频率编号长度信息,根据长度,再取此词或词组合的频率编号,然后再取下一个词或词组合的频率编号长度信息和频率编号,直到取完语句(词段里)的全部词及词组合的频率编号长度信息和频率编号为止,把这些频率编号按各频率编号原来在词段里的顺序排组起来即可。First take the number information of words and word combinations contained in the 5-bit word segment in front of the word segment, and then only need to sequentially extract the information of each word and word combination in the word segment, first take a word or word combination of 2BIT The frequency numbering length information of word combination, according to length, get the frequency numbering of this word or word combination again, then get the frequency numbering length information and frequency numbering of next word or word combination, until getting the sentence (in the word segment) As far as the frequency number length information and the frequency number of all words and word combinations, these frequency numbers can be arranged according to the original order of each frequency number in the word segment.

举例:语句原话是:‘这个男人是一个工程师’Example: The original sentence is: 'This man is an engineer'

编码值为:001010001110011001100 01 0101010011010011101010The encoding value is: 001010001110011001100 01 0101010011010011101010

顺序看编码值中的各BIT(本举例的编码值是词段的值,为了使举例简洁,本举例的编码值没包括头段,头段里一般含有语种信息以及词段编码方式选择信息等数据):Look at each BIT in the coded value in order (the coded value in this example is the value of the word segment. In order to make the example concise, the coded value in this example does not include the header, which generally contains language information and word segment encoding method selection information, etc. data):

00101:词段里的词及词组合个数(5个)00101: Number of words and word combinations in the word segment (5)

00:‘这个’词的频率编号的长度信息00: The length information of the frequency number of the word 'this'

011:‘这个’词的频率编号011: Frequency number of word 'this'

10:‘男人’词的频率编号的长度信息10: length information of the frequency number of the word 'man'

011001100:‘男人’词的频率编号011001100: frequency number for the word 'man'

01:’是’词的频率编号的长度信息01: The length information of the frequency number of the word 'is'

010101:’是’词的频率编号010101: Frequency number of the word 'is'

00:’一个’词的频率编号的长度信息00: length information of the frequency number of the word 'one'

110:’一个’词的频率编号110: frequency number of word 'one'

10:‘工程师’词的频率编号的长度信息10: Length information of the frequency number of the word 'engineer'

011101010:‘工程师’词的频率编号011101010: frequency number for the word 'engineer'

解码过程为:The decoding process is:

第1步解码,先取词段里的词及词组合个数信息(5个),再往下先取2BIT的一个频率编号的长度信息,再按长度取一个频率编号,依此顺次取得5个词及词组合的频率编号,得:The first step of decoding is to first get the number of words and word combinations (5) in the word segment, and then get the length information of a frequency number of 2BIT first, and then get a frequency number according to the length, and then get 5 in sequence Frequency codes of words and word combinations, get:

011 011001100 010101 110 011101010(这5个频率编号分别是‘这个’,‘男人’,‘是’,‘一个’,‘工程师’这5个词的频率编号)011 011001100 010101 110 011101010 (These 5 frequency numbers are respectively the frequency numbers of the 5 words 'this', 'man', 'yes', 'one', 'engineer')

解码到这一步,得到语句中各词及词组合的频率编号,用频率排序解码方式对词段的解码也到此为止;接下来的解码步骤一般是(这里仅以此语句来说,没涉及陌生词等):查词数据库,将各频率编号转成发音信息数据(比如词形拼音;词形拼音由单字拼音组成),得到语句的预备发音数据,由MCU顺次发出语句的预备发音数据中各拼音的音(这些拼音是属于单字拼音表里约1300个单字拼音中的拼音,在与拾音器有关的存储器集里应存有这约1300个单字拼音的各单字拼音的录音数据比如ADPCM格式的录音数据),就是解码得到的语句的全部发音。Decoding to this step, the frequency numbers of each word and word combination in the sentence are obtained, and the decoding of the word segment with the frequency sorting decoding method is also so far; the next decoding step is generally (here only for this sentence, not involved) Unfamiliar words, etc.): search word database, convert each frequency number into pronunciation information data (such as morphological pinyin; morphological pinyin is composed of single-character pinyin), get the prepared pronunciation data of the sentence, and send the prepared pronunciation data of the sentence in sequence by the MCU (these phonetics belong to the pinyin in about 1300 single-character phonetics in the single-character phonetic table, and the recording data of each single-character phonetic of these about 1300 single-character phonetics should be stored in the memory set relevant to the pickup, such as the ADPCM format recording data) is the entire pronunciation of the decoded sentence.

本解码举例涉及的都是词(音码),对于编解码的语句中有词组合(音码)的情况与词(音码)同等处理。What this decoding example relates to is all words (sound codes), for the situation that word combination (sound codes) is arranged in the sentence of codec and word (sound codes) equal treatment.

(本词段解码方式里,各词及词组合的频率编号除了可以是本例示的阶梯编号外,还可以是等长编号或哈夫曼编号等;如为等长编号,哈夫曼编号那其前面是不带频率编号长度信息的。)(in this speech segment decoding mode, the frequency numbering of each word and word combination can also be equal-length numbering or Huffman numbering etc. except the ladder numbering of this example; if it is equal-length numbering, the Huffman numbering There is no frequency number length information in front of it.)

频率关联混合方式:Frequency correlation mixing method:

本词段解码方式及其举例主要以用词(音码)来阐述的,如有用词组合(音码)的类似于词(音码)处理。This speech segment decoding mode and its examples are mainly set forth with words (phonetic codes), as the similar word (phonetic codes) processing of useful word combinations (phonetic codes).

先在词段里读取5个bit的阶梯信息值的长度信息,再读取1个BIT的基本节长信息,然后读取三个分节处的偏移信息,得出实际的分节情况,再对各节,排出各种词(阶梯编号)分法,计算各分法下各词(阶梯编号)之间的关联度(如编码时有采用排除法,那计算以前应根据组句规则先排除一些非法的词分法),取关联度值最大的分法为解得的结果。本词段解码方式里涉及的频率编号是阶梯编号。First read the length information of the 5-bit ladder information value in the word segment, then read the basic section length information of 1 BIT, and then read the offset information at the three sections to obtain the actual section situation , to each section, discharge various word (ladder numbering) sub-methods, calculate the degree of relevance between each word (ladder numbering) under each sub-method (if the method of exclusion is adopted during coding, then the calculation should be based on the sentence composition rule First exclude some illegal word segmentation methods), and take the segmentation method with the largest correlation value as the solution result. The frequency numbers involved in this word segment decoding method are ladder numbers.

举例:语句原话是:‘这个胖男人是一个工程师’Example: The original sentence is: 'This fat man is an engineer'

编码值为:011000010000011011011011001100010101110011101010The encoded value is: 011000010000011011011011001100010101110011101010

顺序看编码值中的各BIT(本举例的编码值是词段的值,为了使举例简洁,本举例的编码值没包括头段,头段里一般含有语种信息以及词段编码方式选择信息等数据):Look at each BIT in the coded value in order (the coded value in this example is the value of the word segment. In order to make the example concise, the coded value in this example does not include the header, which generally contains language information and word segment encoding method selection information, etc. data):

01100:阶梯信息值的长度为12个单位(每3个bit一个单位,本例有3*12个bit)01100: The length of the ladder information value is 12 units (one unit per 3 bits, 3*12 bits in this example)

0:每节基本节长(BIT)选为21(0:表示每节基本节长为21,1:表示每节基本节长为24)0: The basic section length (BIT) of each section is selected as 21 (0: indicates that the basic section length of each section is 21, 1: indicates that the basic section length of each section is 24)

010000:三个分节处的偏移信息(第21位是阶梯信息值中的’是’词的频率编号的中间,将此分节处挪到’是’词的频率编号末,并记录距离值为01,因阶梯信息值的长度不到42,所以后面第42位和第63位处的距离值是00和00,即01与00与00合并成010000)010000: Offset information at the three sections (the 21st bit is the middle of the frequency number of the word "yes" in the ladder information value, move this section to the end of the frequency number of the word "yes", and record the distance The value is 01, because the length of the ladder information value is less than 42, so the distance values at the 42nd and 63rd bits are 00 and 00, that is, 01, 00 and 00 are combined into 010000)

011:’这个’词的频率编号011: frequency number of word 'this'

011011:’胖’词的频率编号011011: Frequency number for the word 'fat'

011001100:’男人’词的频率编号011001100: frequency number for the word 'man'

010101:’是’词的频率编号010101: Frequency number of the word 'is'

|\分节处|\section

110:’一个’词的频率编号110: frequency number of word 'one'

011101010:’工程师’词的频率编号011101010: frequency number for the word 'engineer'

解码过程为:The decoding process is:

第1次在阶梯信息值的第21位BIT位置分节:For the first time, it is divided into sections at the 21st BIT position of the ladder information value:

011011011011001100010 101110011101010011011011011001100010 101110011101010

第2次根据三个分节处的偏移信息值010000对分节处位置进行修正:For the second time, the position of the subsection is corrected according to the offset information value 010000 at the three subsections:

01101101101100110001010 111001110101001101101101100110001010 1110011101010

对两节分别穷举分解为各种分法(阶梯编号分法),并计算各种分法的关联度值。The two sections are exhaustively decomposed into various division methods (step number division method), and the correlation value of each division method is calculated.

如对于后一节110011101010,可分解为7种情况:For example, for the latter section 110011101010, it can be decomposed into seven situations:

110,011,101,010(被逗号隔开成4个阶梯编号)110, 011, 101, 010 (separated by commas into 4 ladder numbers)

110,011,101010(被逗号隔开成3个阶梯编号,以下类推)110, 011, 101010 (separated by commas into 3 ladder numbers, and so on)

110,011101010110, 011101010

110,011101,010110, 011101, 010

110011101,010110011101,010

110011,101,010110011, 101, 010

110011,101010110011, 101010

如在编码时有使用排除法,那在解码时应先用排除法,排除一些明显不合符语法等习惯的分法,比如在与拾音器有关的存储器集里存储一些信息数据库,可记录词类与词类非法相邻的信息,比如食品类的词与建筑类的词挨在一起是非法等,这些规则需与编码时的规则相同或兼容。这样如果某种分法出现有词之间非法的搭配情况,即排除这种分法。If the method of elimination is used in encoding, the method of elimination should be used first in decoding to eliminate some divisions that obviously do not conform to the grammar and other habits, such as storing some information databases in the memory set related to the pickup, which can record parts of speech and parts of speech Illegal adjacent information, such as food words and construction words are illegal to be next to each other, these rules need to be the same as or compatible with the encoding rules. In this way, if there is an illegal collocation between words in a certain division, this division will be excluded.

对于经过排除后剩下的各分法(或如没使用排除法,那就是原来7种分法),计算这些分法情况的各词(阶梯编号)间的关联度(对某节某种分法的关联度的计算方式在前面阐述编码时有介绍,并且解码时对各节各种分法的关联度的计算方式显然应与编码时的计算方式相同),关联度值最大的分法即为解,关联度值最大的分法就与原语句的词分法情况相同。For the remaining divisions after exclusion (or if the exclusion method is not used, that is the original 7 divisions), calculate the correlation between each word (ladder number) of these divisions (for a certain section, a certain division) The calculation method of the correlation degree of the method was introduced in the previous explanation of the encoding, and the calculation method of the correlation degree of the various sub-methods in each section during decoding should obviously be the same as the calculation method during the encoding), and the method with the largest correlation value is For solution, the segmentation method with the largest correlation value is just the same as the word segmentation method of the original sentence.

对于其中有词组合的,可取词组合中的前后词(可在词数据库里各词组合对应的记录项里的拼法字段得到,由词的拼法就可在词数据库里找到此词对应的记录项),再与旁边的词计算关联度及判断是否非法。For wherein there are word combinations, the words before and after the optional word combination (can be obtained in the spelling field in the record item corresponding to each word combination in the word database, and the word corresponding to this word can be found in the word database by the spelling of the word) record item), and then calculate the degree of association with the words next to it and judge whether it is illegal.

结果这两节分别分解成:The resulting two sections break down into:

011 011011 011001100 010101和110 011101010011 011011 011001100 010101 and 110 011101010

上面一行的6个阶梯编号分别是‘这个’,‘胖’,‘男人’,‘是’和‘一个’,‘工程师’这6个词的频率编号。The 6 ladder numbers in the upper row are respectively the frequency numbers of the 6 words 'this', 'fat', 'man', 'yes' and 'one', 'engineer'.

解码到这一步,得到语句中各词及词组合的频率编号,用频率关联混合解码方式对词段的解码也到此为止;接下来的解码步骤一般是(这里仅以此语句来说,没涉及陌生词等):查词数据库,将各频率编号转成发音信息(比如词形拼音;词形拼音由单字拼音组成),得到语句的预备发音数据,由MCU顺次发出语句的预备发音数据中各拼音的音(这些拼音是属于单字拼音表里约1300个单字拼音中的拼音,在与拾音器有关的存储器集里应存有这约1300个单字拼音的各单字拼音的录音数据比如ADPCM格式的录音数据),就是解码得到的语句的发音。Decode to this step, get the frequency numbers of each word and word combination in the sentence, and the decoding of the word segment with the frequency-related mixed decoding method also stops here; the next decoding step is generally (here only for this sentence, there is no Involving unfamiliar words, etc.): Look up the word database, convert each frequency number into pronunciation information (such as morphological pinyin; morphological pinyin is composed of single-character pinyin), obtain the prepared pronunciation data of the sentence, and the MCU will sequentially send out the prepared pronunciation data of the sentence (these phonetics belong to the pinyin in about 1300 single-character phonetics in the single-character phonetic table, and the recording data of each single-character phonetic of these about 1300 single-character phonetics should be stored in the memory set relevant to the pickup, such as the ADPCM format recording data) is the pronunciation of the decoded sentence.

关联方式:(语句由词组成,本词段解码方式以词为例,对于词组合类似于词处理)Association method: (sentences are composed of words, the decoding method of this word segment takes words as an example, and the word combination is similar to word processing)

先读取语句中含的词个数的信息,再读取分层的信息,这段信息的长度与词个数相等,bit为0表示处于第1层,bit为1表示处于第2层(如是4层需两个bit),而且从这些个bit可以看见总语句某个位置的词是处于某层词的第几个,以及每层各共有多少词。由第1层词的个数顺序取出第1层词(每词都是先频率编号长度信息,再频率编号),接着读取第2层词的信息,这层每词的信息包含:是上层第几个词的修饰,属于其第几个关联类,本词在关联类中的编号;顺序读取完第2层词,如还有下层的,继续此步骤。First read the information of the number of words contained in the sentence, and then read the layered information. The length of this piece of information is equal to the number of words. A bit of 0 means it is in the first layer, and a bit of 1 means it is in the second layer ( If it is 4 layers, two bits are needed), and from these bits, it can be seen which word in a certain position of the total sentence is in a certain layer of words, and how many words there are in each layer. Take out the words of the first layer in order of the number of words in the first layer (each word is the frequency number length information first, and then the frequency number), and then read the information of the second layer of words. The information of each word in this layer includes: it is the upper layer The modification of which word belongs to which association class it belongs to, and the number of the word in the association class; after reading the second-level words sequentially, if there are still lower-level words, continue this step.

整句差别方式:The difference in the whole sentence:

本词段解码方式及其举例主要以用词(音码)来阐述的,如有用词组合(音码)的类似于词(音码)处理。This speech segment decoding mode and its examples are mainly set forth with words (phonetic codes), as the similar word (phonetic codes) processing of useful word combinations (phonetic codes).

解码过程:先在词段里读出本范句在范句库中的顺序号,接着在词段里读取到删词标志,及从第几个词开始删,删几个等信息,并做删词处理,再重复此删词过程,直到接着读取的是增词标志,再读取从第几个词的位置开始增,增几个,增的词的拼音编号长度信息及频率编号,重复增词至读取到结束标志止。增和删的还可以是词组合。(这段话说的‘几’可以是小于10或等于10或大于10的任何数)Decoding process: first read out the sequence number of the model sentence in the sample sentence library in the word segment, then read the deletion mark in the word segment, and delete from the first word, how many to delete and other information, and Do the word deletion process, and then repeat the word deletion process until the word addition mark is read next, and then read the pinyin number length information and frequency number of the increased word starting from the position of the number, how many, and the added word , repeat adding words until the end mark is read. Adding and deleting can also be word combinations. (The "several" mentioned in this passage can be any number less than 10 or equal to 10 or greater than 10)

例:语句原话是:‘这个胖男人是一个工程师’Example: The original sentence is: 'This fat man is an engineer'

编码值为:0000100101000001001000100001100001001010110110The encoded value is: 0000100101000001001000100001100001001010110110

顺序看各BIT(本举例的编码值是词段的值,为了使举例简洁,本举例的编码值没包括头段,头段里一般含有语种信息以及词段编码方式选择信息等数据):Look at each BIT in order (the coded value in this example is the value of the word segment. In order to make the example concise, the coded value in this example does not include the header, which generally contains data such as language information and word segment encoding method selection information):

0000100101:所用范句在范句库中的顺序号(固定10个bit)0000100101: The sequence number of the sample sentence used in the sample sentence database (fixed 10 bits)

0:删词标志0: delete word flag

00001:从第2个词开始删(如为11111,就结束编码)00001: Delete from the second word (if it is 11111, end the encoding)

001:删1个词(一次删增词数限制8个内或4个内等)001: Delete 1 word (the number of words deleted or added at a time is limited to 8 or 4, etc.)

0:删词标志0: delete word flag

00100:从第6个词开始删(如为11111,就结束编码)00100: delete from the 6th word (if it is 11111, end the coding)

001:删1个词(一次删增词数限制8个内或4个内等)001: Delete 1 word (the number of words deleted or added at a time is limited to 8 or 4, etc.)

1:增词标志1: add word sign

00001:从第2个词开始增00001: increase from the second word

001:增1个词001: add 1 word

01:增的词’胖’的频率编号的长度信息01: The length information of the frequency number of the added word 'fat'

011011:增的词’胖’的频率编号011011: Increased frequency number of word 'fat'

0:再遇0,结束编码0: Encounter 0 again, end encoding

解码过程为:The decoding process is:

取出范句库里第0000100101句(第37句):这个高男人是一个好工程师。Take out sentence 0000100101 (sentence 37) in the sample sentence library: This tall man is a good engineer.

第1步作删操作后,得:这个男人是一个好工程师。After the delete operation instep 1, you get: This man is a good engineer.

第2步作删操作后,得:这个男人是一个工程师。After the delete operation instep 2, you get: This man is an engineer.

第3步作增操作,增1个词’胖’,得:这个胖男人是一个工程师。The third step is to increase operation, add 1 word 'fat', get: this fat man is an engineer.

(解码过程中,以上各步得的句里的各词实际都是用频率编号表示的,这里仅为方便看才用字词本身来表示)。解码到这一步,得到语句中各词及词组合的频率编号,用整句差别解码方式对词段的解码也到此为止;接下来的解码步骤一般是(这里仅以此语句来说,没涉及陌生词等):查词数据库,将各频率编号转成发音信息数据(比如词形拼音;词形拼音由单字拼音组成),得到语句的预备发音数据,MCU顺次发出语句的预备发音数据中各拼音的音(这些拼音是属于单字拼音表里约1300个单字拼音中的拼音,在与拾音器有关的存储器集里应存有这约1300个单字拼音的各单字拼音的录音数据比如ADPCM格式的录音数据),就是解码得到的语句的发音。(in the decoding process, each word in the sentence obtained in the above steps is actually all represented by a frequency number, which is only represented by the word itself for convenience here). Decoding to this step, the frequency numbering of each word and word combination in the sentence is obtained, and the decoding of the word segment with the whole sentence difference decoding method is also so far; the next decoding step is generally (here only for this sentence, there is no Involving unfamiliar words, etc.): look up the word database, convert each frequency number into pronunciation information data (such as word-form pinyin; word-form pinyin is composed of single-character pinyin), obtain the prepared pronunciation data of the sentence, and the MCU sequentially sends out the prepared pronunciation data of the sentence (these phonetics belong to the pinyin in about 1300 single-character phonetics in the single-character phonetic table, and the recording data of each single-character phonetic of these about 1300 single-character phonetics should be stored in the memory set relevant to the pickup, such as the ADPCM format recording data) is the pronunciation of the decoded sentence.

含多句子的语句的解码处理类似于单句子语句(需有多句子语句的范句录在范句库里)。The decoding processing of the sentence that contains many sentences is similar to the single sentence sentence (the model sentence that needs the multi-sentence sentence is recorded in the model sentence library).

(本词段解码方式里,各词及词组合的频率编号除了可以是本例示的阶梯编号外,还可以是等长编号或哈夫曼编号等;如为等长编号,哈夫曼编号那其前面是不带频率编号长度信息的。)(in this speech segment decoding mode, the frequency numbering of each word and word combination can also be equal-length numbering or Huffman numbering etc. except the ladder numbering of this example; if it is equal-length numbering, the Huffman numbering There is no frequency number length information in front of it.)

前后调用处理,是与前面几种词段解码方式中的一种结合使用的,比如与频率排序方式合用,先按频率排序解码方式解出词段,如其中某频率编号为111,即表示此词或词组合是采用前后调用处理方式的,然后调后面的前后调用段信息来得到这个词或词组合的实际频率编号。The front and rear call processing is used in combination with one of the previous word segment decoding methods. For example, it is used together with the frequency sorting method. First, the frequency sorting decoding method is used to solve the word segment. The word or word combination adopts the forward and backward call processing mode, and then adjusts the back and forth call segment information to obtain the actual frequency number of the word or word combination.

汉语(或韩语,日语)一般不设时态词形选择段。Chinese (or Korean, Japanese) generally does not have a choice of tense and word form.

当在词段中解出111111,表示是陌生词,陌生词的编码信息放在后面的陌生词段里,When 111111 is solved in the word segment, it means that it is an unfamiliar word, and the encoding information of the unfamiliar word is placed in the following unfamiliar word segment.

在陌生词的编码信息中,先取出陌生词的编码信息的格式选择信息(元辅音作组合,单字拼音作组合,差别式),接着按不同的编码格式处理(以汉语为例):In the encoding information of unfamiliar words, the format selection information of the encoding information of unfamiliar words is first taken out (the combination of vowels and consonants, the combination of single-character pinyin, differential formula), and then processed according to different encoding formats (taking Chinese as an example):

1:如是元辅音作组合的编码格式时,接着在陌生词的编码信息里取出组成陌生词发音的带调元辅音的个数信息(4BIT),再取出各带调元音和带调辅音的顺序号(每个顺序号占8个BIT),直接由这些带调元音和带调辅音的顺序号来组成此陌生词的发音信息数据;解码时,要发陌生词的音时,直接把组成陌生词的发音信息数据的各带调元音和带调辅音的顺序号的语音顺次发出来即可;1: When the encoding format of vowels and consonants is combined, then take out the number information (4BIT) of the tone vowels that form the pronunciation of the unfamiliar words in the encoding information of unfamiliar words, and then take out the number information (4BIT) of each tone vowel and tone consonants Sequence number (each sequence number accounts for 8 BITs), the pronunciation information data of this unfamiliar word is formed directly by the sequence number of these tone vowels and tone consonants; The voices of the serial numbers of the tone vowels and the tone consonants forming the pronunciation information data of unfamiliar words are sent out in sequence;

2:如是单字拼音作组合的编码格式时,接着在陌生词的编码信息里取出组成陌生词发音的单字拼音的个数信息(4BIT),再取出各单字拼音的顺序号,并需查单字拼音表,将各单字拼音的顺序号转成其发音信息数据(即拼音字符拼法);由这些单字拼音的发音信息数据来组成此陌生词的发音信息数据;解码时,要发陌生词的音时,直接把组成陌生词的发音信息数据的各单字拼音的发音信息数据的语音顺次发出来即可(每次用一个单字拼音的拼音字符拼法对应发一个拼音的音);2: If the single-character pinyin is used as the combined encoding format, then take out the number information (4BIT) of the single-character pinyin that forms the pronunciation of the unfamiliar word in the encoding information of the unfamiliar word, and then take out the sequence number of each single-character pinyin, and need to check the single-character pinyin Table, the sequence number of each individual character pinyin is changed into its pronunciation information data (being pinyin character spelling); The pronunciation information data of this strange word is formed by the pronunciation information data of these individual character pinyin; During decoding, the pronunciation of unfamiliar word will be sent When, directly the pronunciation of the pronunciation information data of each single-character phonetic alphabet that forms the pronunciation information data of unfamiliar words is sent out in sequence (correspondingly sending out the sound of a phonetic alphabet with the phonetic character spelling of a single-character phonetic alphabet at every turn);

3:如是差别式的编码格式时,接着在陌生词的编码信息里取出12bit的词数据库里词的寻址地址,并在词数据库里去找到此词对应的记录项,并取出其发音信息数据(由元音和辅音和声调字符组成),接着看在陌生词的编码信息里如是0,表示删,接着是删的起始位置,删几个元辅音,还可再是0,继续删;直到1,开始增,增的起始位置,增几个元辅音,接着是各元音或辅音或声调的字符(7bit,可为ASCII字符),还可再是1,继续增,直到0结束,最后得到陌生词的发音信息数据(由元音和辅音和声调的字符组成,在解码时,这些元音和辅音和声调的字符可看成是组成一个或多个单字拼音的拼音字符拼法的);解码时,要发陌生词的音时,直接把组成陌生词的发音信息数据的各单字拼音的发音信息数据(每个单字拼音的发音信息数据即拼音字符拼法由元音,辅音和声调的字符组成)的语音顺次发出来即可(每次用一个单字拼音的拼音字符拼法对应发一个拼音的音)。3: If it is a differential encoding format, then take out the address address of the word in the 12-bit word database in the encoding information of the unfamiliar word, and find the record item corresponding to the word in the word database, and take out its pronunciation information data (consisting of vowels and consonants and tone characters), then see that it is 0 in the coding information of unfamiliar words, which means deletion, then the starting position of deletion, delete several vowels and consonants, and can be 0 again, continue to delete; Up to 1, start to increase, increase the starting position, increase several vowels and consonants, followed by the characters of each vowel or consonant or tone (7bit, can be ASCII characters), and then 1, continue to increase until 0 ends , finally get the pronunciation information data of unfamiliar words (composed of vowels, consonants and tone characters, when decoding, these vowels and consonants and tone characters can be regarded as the spelling of pinyin characters forming one or more single-character pinyin during decoding, when the sound of strange word will be sent out, directly the pronunciation information data of each individual character phonetic of forming the pronunciation information data of unfamiliar word (the pronunciation information data of each individual character phonetic is that the phonetic character spelling is by vowel, consonant The pronunciation that the character of harmony tone is formed) sends out in sequence (the phonetic character spelling of a single-character phonetic alphabet corresponds to sending out a phonetic phonetic sound at a time).

如一个词段中有多个陌生词(一个词段里对应有多个111111),那么各陌生词的编码信息是在陌生词段里顺序排列的,解码时,对词段里的各111111,在陌生词段里顺序取出各个陌生词的编码信息来组成各陌生词的发音信息数据,并且按前述有关陌生词的解码步骤,在陌生词段里取完一个陌生词的编码信息接着就是下一个陌生词的编码信息;本段述的这种处理是技术常识。If there are multiple unfamiliar words in a word segment (corresponding to a plurality of 111111 in a word segment), then the coding information of each unfamiliar word is arranged in order in the strange word segment. When decoding, each 111111 in the word segment, Sequentially take out the coding information of each strange word in the strange word segment to form the pronunciation information data of each strange word, and according to the decoding steps about the aforementioned strange word, take out the coding information of a strange word in the strange word segment and then be the next one Encoding information of unfamiliar words; this processing described in this paragraph is technical common sense.

如词段里不出现111111,显然那处理单位里的陌生词段也不占BIT(也即处理单位中不存在陌生词段)。If 111111 does not appear in the word segment, it is obvious that the unfamiliar word segment in the processing unit does not occupy BIT (that is, there is no unfamiliar word segment in the processing unit).

以上6段所述是以只适用于汉语为例的解码规则所采用的陌生词处理方式,而且涉及到解码时陌生词的发音处理是以解码规则是采用整步直接发音方式来播放语句发音为例的(也可以是用分步直接发音方式,如解码规则采用分步直接发音方式来播放语句发音,那对于任一个陌生词,在解码过程中得到的此陌生词的发音信息数据一般应放在一个部分预备发音数据里,一般不会分开放在多个部分预备发音数据里)。对于非印欧语系其它语种的陌生词处理也可按汉语这样处理。The above 6 paragraphs describe the unfamiliar word processing method adopted by the decoding rules that are only applicable to Chinese as an example, and when it comes to the pronunciation processing of unfamiliar words when decoding, the decoding rule is to use the full-step direct pronunciation method to play the sentence pronunciation as (also can be to use step-by-step direct pronunciation mode, adopt step-by-step direct pronunciation mode to play sentence pronunciation as decoding rule, then for any strange word, the pronunciation information data of this strange word that obtains in decoding process generally should put In a part of the pre-pronunciation data, it is generally not placed separately in multiple parts of the pre-pronunciation data). The treatment of unfamiliar words in other languages of the non-Indo-European language family can also be handled in the same way as Chinese.

再接下来是前后调用段,如词段里出现频率编号111,就要到这段顺序读,每个前后调用的具体信息包括往前(或往后)调的词或词组合的相对位置等,再根据这些位置信息得到词段里的111对应的实际频率编号。The next step is the front and back call sections. If the frequency number 111 appears in the word section, it is necessary to read this section sequentially. The specific information of each front and back call includes the relative position of the forward (or backward) tuned word or word combination, etc. , and then obtain the actual frequency number corresponding to 111 in the word segment according to these position information.

以上的BIT数据读取采用顺序读取即可,前面处理完,前面解出的数据又是后面段解码的基础。到一语句解完,本语句数据也读取完。The above BIT data reading can be read sequentially. After the previous processing is completed, the data deciphered earlier is the basis for the subsequent decoding. When a statement is solved, the data of this statement is also read.

解码到这里,将语句的预备发音数据转成语音,即为语句的发音。At this point in decoding, the prepared pronunciation data of the sentence is converted into speech, which is the pronunciation of the sentence.

下面举一个较完整的汉语的解码例子:Here is a more complete Chinese decoding example:

语句原话是:‘这个漂亮男人是一个工程师’The original sentence is: 'This beautiful man is an engineer'

语句编码值(即表示语句发音的拼合信息的值,也即一个处理单位的值)为:The sentence encoding value (that is, the value of the combined information representing the pronunciation of the sentence, that is, the value of a processing unit) is:

01 0000110000110111111110011001100010101010011001 00001100001101111111100110011000101010100110

10 0111010100101010010110100010111000010 01110101001010100101101000101110000

顺序看各BIT:Look at each BIT in order:

头段:header:

01:本语句是汉语语种01: This sentence is in Chinese language

00:词段采用第1种词段编码方式(即频率排序编码方式)00: The word segment adopts the first word segment encoding method (that is, the frequency sorting encoding method)

词段:Phrases:

00110:词段里的词及词组合的个数(6个)00110: the number of words and word combinations in the word segment (6)

00:’这个’词的频率编号的长度信息00: The length information of the frequency number of the word 'this'

011:’这个’词的频率编号011: frequency number of word 'this'

01:111111的频率编号长度信息01: Frequency number length information of 111111

111111:111111代表陌生词(‘漂亮’),(111111也计入词及词组合个数)111111: 111111 represents an unfamiliar word ('beautiful'), (111111 is also included in the number of words and word combinations)

10:’男人’词的频率编号的长度信息10: length information of the frequency number of the word 'man'

011001100:’男人’词的频率编号011001100: frequency number for the word 'man'

01:’是’词的频率编号的长度信息01: The length information of the frequency number of the word 'is'

010101:’是’词的频率编号010101: Frequency number of the word 'is'

00:’一个’词的频率编号的长度信息00: length information of the frequency number of the word 'one'

110:’一个’词的频率编号110: frequency number of word 'one'

10:’工程师’词的频率编号的长度信息10: length information of the frequency number of the word 'engineer'

011101010:’工程师’词的频率编号011101010: frequency number for the word 'engineer'

陌生词段:Unfamiliar phrases:

01:陌生词的编码信息的格式选择信息(单字拼音作组合格式)01: Format selection information for encoding information of unfamiliar words (single-character pinyin as combined format)

01:单字拼音个数(2个)01: The number of pinyin of a single character (2)

01001011010:单字’漂’的拼音在单字拼音表里的顺序号01001011010: the sequence number of the pinyin of the single character 'Piao' in the single-character pinyin table

00101110000:单字’亮’的拼音在单字拼音表里的顺序号00101110000: the sequence number of the pinyin of the single character 'liang' in the single-character pinyin table

解码过程如下:(这些解码步骤可由直播方式的以只适用于汉语为例的编码规则对照推出,也可由前面那些页对直播方式的以只适用于汉语为例的解码规则的描述来推出)The decoding process is as follows: (These decoding steps can be deduced from the encoding rules that are only applicable to Chinese as an example in the live broadcast mode, or can be deduced from the descriptions of the decoding rules that are only applicable to Chinese in the live broadcast mode on the previous pages)

依照上述各BIT表示的信息,According to the information represented by the above-mentioned BITs,

先取头段信息,得知本语句是汉语语种且词段是使用频率排序编码方式来编码的;Get the header information first, and know that the sentence is in Chinese language and the word segment is coded by using the frequency sorting coding method;

第1步解码,使用频率排序解码方式(直播方式的以只适用于汉语为例的解码规则中的频率排序解码方式的更具体阐述在前面有述),先在词段里取词段里的词及词组合个数信息(6个),再在词段里往下顺序取得6个频率编号,得:The first step is to decode, use the frequency sorting decoding method (the live broadcast mode takes the frequency sorting decoding method in the decoding rules that are only applicable to Chinese as an example to be described in the front), and first take the words in the word segment Information on the number of words and word combinations (6), and then obtain 6 frequency numbers in the order of the word segment, and get:

011 111111 011001100 010101 110 011101010011 111111 011001100 010101 110 011101010

第2步解码,对于上一步得的111111,查语句编码值中的陌生词段(显然,上一步解码取完语句编码值中词段里的各频率编号的数据后,接着在语句编码值中就是陌生词段数据的起始位置),先得陌生词的编码信息的格式是单字拼音作组合格式,再在陌生词段往下取单字拼音的个数信息及各单字拼音的顺序号,并用单字拼音作组合格式来构造陌生词发音,得:011(01001011010 00101110000)011001100 010101 110 011101010The 2nd step decodes, for the 111111 that the previous step gets, look up the unfamiliar word segment in the sentence coding value (obviously, after the previous step decodes and gets the data of each frequency number in the word segment in the sentence coding value, then in the sentence coding value It is the starting position of the unfamiliar phrase data), and the format of the encoding information of the unfamiliar phrase is the single-character pinyin as the combined format, and then the number information of the single-character pinyin and the sequence number of each single-character pinyin are obtained in the unfamiliar phrase, and used Single-character pinyin is used as a combined format to construct the pronunciation of unfamiliar words: 011(01001011010 00101110000) 011001100 010101 110 011101010

第3步,查词数据表,将第2步解码得到的各频率编号转成发音信息数据(本解码例的发音信息数据是以词形拼音为例的;词形拼音由拼音组成),再查处理陌生词要涉及的单字拼音表,将用于组成陌生词的发音信息数据的两个单字拼音的顺序号(第2步得的结果中的括号内的)各自转成拼音(拼音由拼音字符拼法表示,‘piao4’和‘liang4’)并将这两个拼音排在一起成陌生词的发音信息数据,得语句的预备发音数据为:The 3rd step, look up word data table, each frequency number that the 2nd step decodes and obtains is converted into pronunciation information data (the pronunciation information data of this decoding example is example with word form phonetic alphabet; Word form phonetic alphabet is made up of pinyin), again Look up and handle the single-character phonetic table that unfamiliar word will relate to, will be used for forming the sequence number of two individual-character phonetic alphabets of the pronunciation information data of unfamiliar word (in the parentheses in the result that the 2nd step gets) change into phonetic respectively (phonetic by phonetic Character spelling representation, 'piao4' and 'liang4') and these two pinyins are arranged together to form the pronunciation information data of unfamiliar words, and the prepared pronunciation data of getting sentence is:

zhe4ge4 piao4liang4 nan2ren2 shi4 yi1ge4 gong1cheng2shi1zhe4ge4 piao4liang4 nan2ren2 shi4 yi1ge4 gong1cheng2shi1

MCU将解码执行到这里,再可由MCU驱动喇叭顺次播出语句的预备发音数据中各拼音的音(这些拼音,比如zhe4ge4中的zhe4,是属于单字拼音表里约1300个单字拼音中的拼音,在与拾音器有关的存储器集里应存有这约1300个单字拼音的各单字拼音的比如ADPCM格式的录音数据;这里的陌生词的发音信息数据piao4liang4中的拼音比如piao4,也是属于1300个单字拼音中的拼音),即是语句的全部发音。至此,此语句编码值的解码过程结束。本解码例子是用整步直接发音方式来播放所解码的语句的发音的。The MCU executes the decoding until here, and then the MCU can drive the speaker to broadcast the sounds of each pinyin in the prepared pronunciation data of the sentence in sequence (these pinyin, such as zhe4 in zhe4ge4, belong to the pinyin of about 1300 single-character pinyin in the single-character pinyin table , in the memory set relevant to the pickup, there should be recorded data such as the ADPCM format of each of these about 1300 single-character pinyins; the pronunciation information data piao4liang4 of unfamiliar words here, such as piao4, also belong to 1300 single-characters Pinyin in Pinyin), that is, the entire pronunciation of the sentence. At this point, the decoding process of the encoded value of this statement ends. This decoding example uses the full-step direct pronunciation method to play the pronunciation of the decoded sentence.

(本解码举例里,各词及词组合的频率编号除了可以是本例示的阶梯编号外,还可以是等长编号或哈夫曼编号等;如为等长编号,哈夫曼编号,那其前面是不带频率编号长度信息的。)(in this decoding example, the frequency numbering of each word and word combination can also be equal-length numbering or Huffman numbering etc. except the ladder numbering of this example; as for equal-length numbering, Huffman numbering, then its The front is without frequency number length information.)

如述及时无特别指明有其他情况,从下面一段至115页述及的码字涉及的任一拼合信息的值,默认可以是用编码规则中有关某语种的直播方式的编码细则来编码生成的,也可以是用编码规则中有关某语种的TTS方式的编码细则来编码生成的,也可以是用编码规则中有关某语种的混合方式的编码细则来编码生成的;本段所说的某语种可以是任一非复合语种或任一复合语种。If there are no other circumstances specified in the above, the value of any combined information involved in the codewords mentioned in the following paragraph to page 115 can be encoded by default in the encoding rules related to the live broadcast method of a certain language. , can also be generated by coding with the coding rules of the TTS mode of a certain language in the coding rules, or it can be coded and generated with the coding rules of the mixed mode of a certain language in the coding rules; the certain language mentioned in this paragraph Can be any non-composite language or any compound language.

关于有声读物整页面上可使用的码字的码种及读码头的形式:Regarding the types of code words that can be used on the entire page of audiobooks and the form of reading docks:

一:使用一维条形码,一般一个码至多可表示15个左右十进制位,转换成二进制约有50个二进制位,位数较少,所以一般只适于携带表示短的语句发音的拼合信息的值;对此种码字的识读可使用光笔或ccd读码头等。One: Use a one-dimensional barcode. Generally, a code can represent at most about 15 decimal digits. When converted into binary, there are about 50 binary digits. The number of digits is small, so it is generally only suitable for carrying the value of the combined information representing the pronunciation of short sentences. ; To read such code words, light pen or ccd can be used to read docks, etc.

二:使用二维条形码,一般可使用行排式二维条形码,如果是两排的话,就可表示到100个二进制位以上,即可携带表示一般长度的语句发音的拼合信息的值;对此种码字的识读一般可使用线性ccd读码头,使用线性ccd读码头时需沿垂直方向拖动读码头。Two: Use two-dimensional barcodes. Generally, row-type two-dimensional barcodes can be used. If there are two rows, it can be expressed to more than 100 binary digits, which can carry the value of the combined information representing the pronunciation of sentences of general length; The recognition and reading of this kind of code word generally can use linear ccd to read dock, need to drag along vertical direction to read dock when using linear ccd to read dock.

三:使用导电码,一般每个导电码能表示100个以上的二进制位,可携带表示一般长度的语句发音的拼合信息的值;对此种码字的识读可使用机电式导电码读码头。(有关导电码的技术内容,可参看本发明人在另一个专利2006102005537号申请里对导电码及其读码原理的阐述,也可参看本说明书后面对导电码技术的介绍以及本说明书对图6的拾音器实施例的阐述中对导电码及读码头及读码原理的描述)Three: Use conductive codes. Generally, each conductive code can represent more than 100 binary bits, and can carry the value of the combination information representing the pronunciation of sentences of general length; the reading of such code words can be read using electromechanical conductive codes. . (For the technical content of the conduction code, please refer to the elaboration of the conduction code and its code reading principle in another patent application No. 2006102005537 by the present inventor, and also refer to the introduction of the conduction code technology at the back of this specification and the description of the diagram in this specification The description of the conduction code and the reading dock and the principle of reading code in the elaboration of the pickup embodiment of 6)

四:使用图形码,磁码或本说明书没提及的其它码种的码字;其识读应使用其相应的读码头(比如可用光电式图形码读码头读图形码,用磁码读码头读磁码)。Four: Use graphic codes, magnetic codes or code words of other codes not mentioned in this manual; the corresponding reading terminal should be used for its reading (for example, the photoelectric graphic code reading terminal can be used to read graphic codes, and the magnetic code can be used to read terminals read magnetic code).

有声读物整页面上诸个码字中的各码字的特点:每个码字可被相应拾音器读取以用于拾音器播出语句发音,每个码字的码值是共与数个表示如何用音码拼合成语句发音的拼合信息的值有关的,并由拾音器根据读得的一个或多个码字的码值来播出语句发音。The characteristics of each codeword in the codewords on the entire page of the audiobook: each codeword can be read by the corresponding pickup for the pronunciation of the sentence played by the pickup, and the code value of each codeword is shared and how many represent It is related to the value of the pieced information of the pronunciation of the sentence that is stitched together by the phonetic code, and the pronunciation of the sentence is played out by the pickup according to the code value of one or more code words read.

关于在实际使用时拾音器读取码字并发音的形式,一般涉及两种情形,一是每次是由拾音器读一个或多个码字,并根据这一个码字的码值来处理播出一个或多个语句发音(根据一个码字的码值来处理播出多个语句发音,一般是指此码字的值有多个拼合信息的值),或根据这多个码字的码值来处理播出一个或多个语句发音,二是,使用时重复情形一的步骤,拾音器读一个或多个码字并发音,下一次又可读这一个或多个码字并发音或可读另外的一个或多个码字来播出另外一个或多个语句的发音,即拾音器可不断读取各码字并不断播出语句发音;按情形一,拾音器读取一个或多个码字后,可播出数个语句发音(比如一个码字的值里有多个拼合信息的值),而如一个拾音器读取一个或多个码字后播出多个语句发音(拾音器读取多个码字后播出多个语句发音的例子比如:拼合信息的值1分成两部分分别放在码字1和码字2里,拼合信息的值2也放在码字2里,拾音器读码字1和码字2后,将这两个码字的值合起来得到拼合信息的值1,从码字2又可得到拼合信息的值2,然后拾音器可解码播出拼合信息的值1对应的语句发音和拼合信息的值2对应的语句发音,甚至这两个语句可以是不同语种的,一般要用与拾音器有关的存储器集里不同的解码程序来解码以播出各自语句发音),就可推出此拾音器读取一个或多个码字后有播出一个语句发音(如播出的是多个语句发音,此一个语句发音是播出的这多个语句发音中的一个语句发音),所以本发明文件为使内容简洁,在有的地方叙述时,是只取其下位概念,即说成拾音器读取一个或多个码字后可播出语句发音(这里的语句是指某一个语句)。在权利要求10,19,22中提及的‘由前述拾音器的前述单片机芯片根据读得的前述部诸码字中的码字的码值使前述拾音器的前述发音器件播出语句发音’以及在权利要求2中提及的‘前述拾音器的前述单片机芯片根据读得的前述部诸码字中的码字的码值使前述拾音器的前述发音器件播出语句发音’以及本说明书中各处的类似表述都是与此有关的。Regarding the form of the code word read and pronounced by the pickup in actual use, it generally involves two situations. One is to read one or more code words by the pickup each time, and process and broadcast a code word according to the code value of this code word. or a plurality of sentence pronunciations (according to the code value of a code word to process and broadcast a plurality of sentence pronunciations, generally referring to the value of this code word has a plurality of values of the combined information), or according to the code values of these multiple code words to Handle and broadcast one or more sentence pronunciations, the 2nd, repeat the step of situation one when using, pick-up reads one or more code words and pronounce, next time can read again this one or more code words and pronounce or can read another One or more code words to broadcast the pronunciation of another one or more sentences, that is, the pickup can constantly read each code word and continuously broadcast the pronunciation of the sentence; by case one, after the pickup reads one or more code words, Several sentence pronunciations can be played out (such as the value of a codeword with multiple values of the combined information), and if a pickup reads one or more codewords, multiple sentence pronunciations are broadcast (the pickup reads multiple codewords). An example of multiple sentence pronunciations after the word is played: For example: the value 1 of the combined information is divided into two parts and placed in code word 1 and code word 2 respectively, the value 2 of the combined information is also placed in code word 2, and the pickup reads code word 1 After the code word 2, combine the values of the two code words to obtain the value 1 of the combined information, and obtain the value 2 of the combined information from the code word 2, and then the pickup can decode and broadcast the statement corresponding to the value 1 of the combined information Pronunciation and the corresponding sentence pronunciation of the value 2 of combining information, even these two sentences can be different languages, generally will use different decoding programs in the memory set relevant to the pickup to decode to broadcast the respective sentence pronunciation), just can roll out After this pickup reads one or more code words, there is a sentence pronunciation to be broadcasted (as broadcasting is a plurality of sentence pronunciations, and this sentence pronunciation is a sentence pronunciation in these multiple sentence pronunciations broadcasted), so this Invention document is for making content succinct, when narrating in some places, is to only get its sub-concept, that is to say, the pronunciation of the sentence can be played after the pickup reads one or more code words (the sentence here refers to a certain sentence). Inclaim 10, 19, mentioned in 'the code value of the aforementioned single-chip microcomputer chip of the aforementioned pickup makes the aforementioned pronunciation device of the aforementioned pickup play the sentence pronunciation according to the code value of the codeword in the aforementioned codewords read by the aforementioned pickup, and in 22 Mentioned in claim 2' the aforementioned single-chip microcomputer chip of the aforementioned pickup makes the aforementioned pronunciation device of the aforementioned pickup play out the pronunciation of sentences according to the code value of the codewords in the aforementioned codewords read out' and the similarities in various places in this specification. Expressions are all about it.

对于同一语种的语句,如使用不同的编码规则,那编码得到的拼合信息的值(拼合信息的值也就是表示语句发音的拼合信息的值)的具体格式(具体格式涉及拼合信息的值中那些值放前面,哪些值放后面等)一般是不同的,这是显然的。本说明书主要是以一个拼合信息的值就是一个处理单位的值为例的。For sentences of the same language, if different encoding rules are used, the specific format of the value of the combined information obtained by the encoding (the value of the combined information is also the value of the combined information representing the pronunciation of the sentence) (the specific format involves those in the value of the combined information) It is obvious that the value is placed in the front, which values are placed in the back, etc.) are generally different. This description mainly takes the example that the value of one piece of information is the value of one processing unit.

本发明文件各处所说的(拾音器)播出语句发音,是指播出一个语句发音,一般有两种情况:1是拾音器读一个码字后,拾音器根据从这一个码字的值里得到(直接得到或转换得到)的一个拼合信息的值来解码播出一个语句的发音(从这一个码字的值里可能只可以得到这一个拼合信息的值,或可以得到包括这一个拼合信息的值在内的多个拼合信息的值),2是指拾音器读多个码字后,拾音器根据把这多个码字的值组合起来得到(直接得到或转换得到)的一个拼合信息的值来解码播出的一个语句的发音(把这多个码字的值组合起来可能只可以得到这一个拼合信息的值,或可以得到包括这一个拼合信息的值在内的多个拼合信息的值)。显然,如从一个码字可得到(或如把这多个码字的值组合起来可得到)多个拼合信息的值比如3个拼合信息的值,那拾音器读此码字后,一般应分别对这多个拼合信息的值作解码及分别播出语句发音。The said (pickup device) broadcasting sentence pronunciation in the document of the present invention refers to broadcasting a sentence pronunciation, generally has two kinds of situations: 1, after the pickup device reads a code word, the pickup device obtains according to the value from this code word ( directly obtained or converted) to decode the pronunciation of a sentence (from the value of this codeword, only the value of this piece of information may be obtained, or the value including this piece of information may be obtained 2 means that after the pickup reads multiple codewords, the pickup decodes according to the value of a piece of information obtained by combining the values of the multiple codewords (obtained directly or converted) Pronunciation of a sentence that is broadcast (combining the values of these multiple codewords may only obtain the value of this one pieced together information, or may obtain the values of multiple pieced together information including the value of this one pieced together information). Obviously, as can obtain from a codeword (or can obtain as combining the value of these multiple codewords) the value of a plurality of splicing information such as the value of 3 splicing information, after that pick-up reads this codeword, generally should respectively Decoding the values of the multiple pieces of combined information and broadcasting the pronunciation of the sentence respectively.

本发明文件各处提及的有关单片机芯片根据读取的部诸码字中的一个码字的码值或多个码字的码值来使发音器件播出语句发音,一般包括两种情况:1是从一个码字就可得到一个拼合信息的值,拾音器的单片机芯片读取有声读物的整页面上的这一个码字得到拼合信息的值,并对拼合信息的值执行解码使发音器件播出语句发音(这种情况,可以是只把此一个拼合信息的值涉及的BIT放在此一个码字里,或,在此码字的值里放有此一个拼合信息的值涉及的BIT和别的拼合信息的值涉及的BIT),本说明书主要以这种方式来举例阐述的,2是对某一拼合信息的值,需读多个码字才能得到此拼合信息的值,拾音器的单片机芯片需读取有声读物的整页面上的这多个码字并根据把这多个码字合起来得到此拼合信息的值,并对此拼合信息的值执行解码使发音器件播出语句发音(这种情况,可以是只把此一个拼合信息的值涉及的BIT分散放在这多个码字里,或,把此一个拼合信息的值涉及的BIT分散放在这多个码字里而且在这多个码字中各码字的值里还放有别的拼合信息的值涉及的BIT);当然以上述及这两种情况时涉及的这些码字都是本段内前面提及的部诸码字中的码字。The relevant single-chip microcomputer chip that the document of the present invention mentions everywhere makes the pronunciation device broadcast sentence pronunciation according to the code value of a code word or the code value of a plurality of code words in the all code words of reading, generally comprises two kinds of situations: 1 is to get a value of the combined information from one code word, the single-chip microcomputer chip of the pickup reads this code word on the entire page of the audiobook to obtain the value of the combined information, and performs decoding on the value of the combined information to make the pronunciation device play Pronunciation of a sentence (in this case, it is possible to only put the BIT related to the value of this piece of information in this codeword, or put the BIT and the value of this piece of information in the value of this codeword. The BIT involved in the value of other combined information), this manual mainly uses this method as an example to illustrate, 2 is the value of a certain combined information, it is necessary to read multiple codewords to obtain the value of the combined information, the single-chip microcomputer of the pickup The chip needs to read the multiple codewords on the entire page of the audiobook and obtain the value of the mosaic information by combining the codewords, and decode the value of the mosaic information to make the pronunciation device play the sentence pronunciation ( In this case, only the BITs involved in the value of this piece of information can be scattered in these multiple codewords, or, the BITs involved in the value of this piece of information can be scattered in these multiple codewords and placed in these multiple codewords. In the value of each codeword in these multiple codewords, there are also BITs involved in the value of other combined information); of course, these codewords involved in the above-mentioned and these two cases are the parts mentioned earlier in this paragraph Codewords in all codewords.

如一个码字的值是共与数个表示语句发音的拼合信息的值有关的,那这数个拼合信息的值各自涉及的语句的语种可以是相同或不相同的。If the value of a codeword is related to several values of combined information representing the pronunciation of the sentence, the languages of the sentences involved in the values of the combined information may be the same or different.

一个码字的值是共与数个拼合信息的值有关的,一个码字的值可以有数个拼合信息的值(或数个拼合信息的值的部分),但一般情况是一个码字的值有一个拼合信息的值(或一个拼合信息的值的部分,这里的一个是修饰拼合信息的值的),这时就是码字的值是只与(一个)拼合信息的值有关的。如码字的值是只与多个拼合信息的值有关的,那这多个拼合信息的值一般可以是顺序放置在此码字里。显然,本发明文件中各处述及码字的值有一个拼合信息的值时,是隐含有此码字的值是与此拼合信息的值有关的而且是只与此拼合信息的值有关的的意思的。The value of a codeword is related to the value of several pieces of combined information. The value of a codeword can have several values of the pieced together information (or parts of the value of several pieced together information), but in general, it is the value of one codeword There is a value of the combined information (or a part of the value of the combined information, here one is to modify the value of the combined information), at this time the value of the codeword is only related to the (one) value of the combined information. If the value of the code word is only related to the values of multiple combined information, the values of the multiple combined information can generally be sequentially placed in the code word. Obviously, when the value of the code word mentioned in the document of the present invention has a value of the combined information, it is implied that the value of the code word is related to the value of the combined information and only related to the value of the combined information the meaning of.

以上5段提及的各个其前面没数量词的‘码字’都是指单个码字。Each of the above 5 paragraphs mentioned does not have 'codeword' before the quantifier all refers to a single codeword.

表示某语种语句发音的拼合信息的值是按某编码规则中有关此语种的编码细则来编码生成的。The value of the combined information representing the pronunciation of sentences in a certain language is coded and generated according to the coding rules related to this language in a certain coding rule.

本发明文件中的权利要求书和说明书中各处所提及的‘拼合信息的值’与‘表示语句发音的拼合信息的值’与‘表示如何用音码拼合成语句发音的拼合信息的值’,这3种提法是相同的,这里说的‘语句’是指某一语种的某一语句。并且有时可对语句作语种限定,比如可分别表示成‘此语种的拼合信息的值’和‘表示此语种的语句发音的拼合信息的值’和‘表示如何用音码拼合成此语种的语句发音的拼合信息的值’,这3种表示也是相同的。本段所说的语种可以是任一非复合语种或任一复合语种。The claims in the document of the present invention and the value of the combination information mentioned in various places in the description and the value of the combination information representing the pronunciation of the sentence and the value of the combination information indicating how to use the phonetic code to form the pronunciation of the sentence ', these three formulations are the same, and the 'sentence' mentioned here refers to a certain sentence in a certain language. And sometimes the language can be limited to the sentence, for example, it can be expressed as "the value of the combined information of this language" and "the value of the combined information indicating the pronunciation of the sentence in this language" and "indicating how to use the phonetic code to form a sentence of this language" The value of the combined information of pronunciation', these three representations are also the same. The language mentioned in this paragraph can be any non-compound language or any compound language.

本发明文件各处提及的‘码字的值,是共与数个拼合信息的值有关的’(或类似提法,这里的码字是指单个码字),一般是指:这数个拼合信息的值全部放在此码字里,或,这数个拼合信息的值中每个拼合信息的值都只有部分放在此码字里,或,这数个拼合信息的值中(这里的数个是指多个)有些拼合信息的值是全部放在此码字里的而有些拼合信息的值是只有部分放在此码字里的,或,这数个拼合信息的值的变换值全部放在此码字里,或,这数个拼合信息的值的变换值的部分放在此码字里等;至于在一个码字里这些值(拼合信息的值,拼合信息的值的部分,拼合信息的值的变换值等,还可能有其它信息的值比如语速定义信息)在码字里的共处格式,这是由编码者自定义的(一般是把各信息的值顺序排列这时需考虑谁前谁后,另如是把各信息的值互相交织放置那还涉及到放置的格式),MCU在读码后可根据定义的格式内容,将各信息的值从码值里各自取出,各自作解码等处理;一个码字的值是共与数个拼合信息的值有关的具体形式除可以是以上这些形式外,还可以是其它形式。本段说的‘有些拼合信息的值是只有部分放在此码字里’的意思是:比如所说的有些拼合信息的值包括两个拼合信息的值(拼合信息的值1,拼合信息的值2),那拼合信息的值1的部分BIT就放在此码字里,拼合信息的值2的部分BIT也放在此码字里;本段说的‘有些拼合信息的值是全部放在此码字里’的意思是:比如所说的有些拼合信息的值包括两个拼合信息的值(拼合信息的值1,拼合信息的值2),那拼合信息的值1的全部BIT就放在此码字里,拼合信息的值2的全部BIT也放在此码字里;对于‘每个拼合信息的值都只有部分放在此码字里’也是按这样类推理解。'The value of the codeword is related to the value of several pieces of combined information' (or similar formulations, the codeword here refers to a single codeword) mentioned throughout the document of the present invention generally refers to: these several The values of the combined information are all placed in this codeword, or, the values of each combined information in the values of the combined information are only partially placed in the codeword, or, in the values of the combined information (here The number refers to multiple) the values of some combination information are all placed in this codeword and the values of some combination information are only partially placed in this codeword, or, the transformation of the value of these number combination information Value is all placed in this code word, or, the part of the transformation value of the value of these several pieces of combined information is placed in this code word etc.; As for these values (the value of combined information, the value of combined information) in a code word part, the conversion value of the value of the combined information, etc., and there may also be other information values (such as speech rate definition information) in the co-existing format of the codeword, which is customized by the coder (generally, the values of each information are arranged in order At this time, it is necessary to consider who is ahead and who is behind. In addition, if the values of each information are interleaved and placed, it also involves the format of the placement). After reading the code, the MCU can take out the value of each information from the code value according to the defined format content. , respectively perform decoding and other processing; the value of a codeword is a specific form related to the value of several pieces of combined information. In addition to the above forms, it can also be other forms. The meaning of "the value of some combination information is only partly placed in this codeword" in this paragraph means: for example, the value of some combination information includes two values of combination information (the value of combination information is 1, and the value of combination information is 1). value 2), the BIT part of thevalue 1 of the combined information is placed in this codeword, and the part of the BIT of thevalue 2 of the combined information is also placed in this codeword; In this codeword, ' means: for example, the value of some combined information includes two values of combined information (the value of combined information is 1, and the value of combined information is 2), then all the BITs ofvalue 1 of combined information are In this codeword, all the BITs of thevalue 2 of the combined information are also placed in this codeword; for "only part of the value of each combined information is placed in this codeword", it is also understood by analogy.

本发明文件各处所说的把某值(或BIT值)放置或放或包括在一个码字里,是指把此值(比如拼合信息的值)的BIT放置在此码字的一些位里(一般是顺序放);而本说明书各处所说的一个码字的值有数个拼合信息的值,是指可以从此码字的值得到这数个拼合信息的值,一般是把这数个拼合信息的值本身的BIT或对这数个拼合信息的值进行变换得到的变换值的BIT放置在此码字的一些位里,同理,本说明书各处所说的一个码字的值有数个拼合信息的值的部分,一般是把这数个拼合信息的值中每个拼合信息的值的一部分BIT都放置在此码字的一些位里或是把对这数个拼合信息的值进行变换得到的变换值的部分BIT放置在此码字的一些位里,本说明书各处所说的其它有关一个码字的值有某值的表达也按此类推理解。Putting or putting or including a certain value (or BIT value) in a codeword as mentioned throughout the documents of the present invention means that the BIT of this value (such as the value of the combined information) is placed in some bits of the codeword ( Generally put in order); and the value of a code word mentioned in this manual has several values of combined information, which means that the value of these combined information can be obtained from the value of the code word. The BIT of the value itself or the BIT of the transformed value obtained by transforming the values of these combined information is placed in some bits of this code word. Similarly, the value of a code word mentioned throughout this manual has several combined information The part of the value is generally obtained by placing a part of the BIT of each value of the combined information in some bits of the codeword or by transforming the values of the combined information. The part BIT of conversion value is placed in some bits of this code word, and the expression that other relevant value of a code word has certain value said everywhere in this specification also understands by analogy.

并且,本发明文件所说的一个码字的值有一个(或多个或数个)拼合信息的值,这里‘有’的意思,是指除这一个(或多个或数个)拼合信息的值外,此码字的值不再与其它拼合信息的值有关;其它类似说法,比如一个码字的值有一个(或多个或数个)拼合信息的值的部分,其中‘有’的意思,也是按此类推的。And, the value of a codeword mentioned in the document of the present invention has one (or more or several) values of the combined information, and the meaning of "has" here refers to the addition of this one (or more or several) combined information In addition to the value of the codeword, the value of this codeword is no longer related to the value of other combined information; other similar statements, such as the value of a codeword has one (or more or several) part of the value of the combined information, where 'has' means, and so on.

当码字的值是共与数个拼合信息的值有关时,有关拼合信息的值在码字里的情况,可以是这几种情况:1,一个码字的值有数个拼合信息的值,2,一个码字的值有数个拼合信息的值的部分(本句和本段前面提及的3次‘数个’都是指相同数目),3,一个码字的值有数个拼合信息的值加数个拼合信息的值的部分等情况(显然这句的两个‘数个’加起来等于前一句‘数个’的数目),显然这第3种情况是在码字的值是共与多个拼合信息的值有关的前提下的;本段提及的各个其前面没数量词的‘码字’都是指单个码字。本段说的一个码字的值是指一个码字上除开校验位等辅助信息后的值。对于码字里放的仅是数个拼合信息的值的变换值的情况,仍可说此一个码字的值有数个拼合信息的值(本句的‘数个’与上一句的‘数个’是相同数目);对于码字里放的仅是数个拼合信息的值的部分的变换值的情况,仍可说此一个码字的值有数个拼合信息的值的部分(本句的‘数个’与上一句的‘数个’是相同数目);对于码字里放的仅是数个拼合信息的值的变换值的部分的情况,仍可说此一个码字的值有数个拼合信息的值的部分(本句的‘数个’与上一句的‘数个’是相同数目)。When the value of the codeword is related to the value of several pieces of combined information, the situation that the value of the relevant pieced together information is in the codeword can be these situations: 1, the value of a codeword has the value of several pieced together information, 2. The value of a codeword has several parts of the value of the combined information (this sentence and the 3 times "several" mentioned earlier in this paragraph all refer to the same number), 3. The value of a codeword has several parts of the combined information value plus several parts of the value of the combined information, etc. (obviously the two 'numbers' of this sentence add up to the number of the previous sentence 'numbers'), obviously this third case is when the value of the codeword is in common On the premise that it is related to the value of multiple spliced information; each 'codeword' mentioned in this paragraph without a quantifier before it refers to a single codeword. The value of a codeword mentioned in this paragraph refers to the value of a codeword after auxiliary information such as a check digit is removed. For what put in the codeword is only the situation of the transformation value of the value of several pieced together information, still can say that the value of this codeword has the value of several pieced together information (' several ' of this sentence and ' several ' of last sentence ' is the same number); for the situation of the transformation value of the part of the value of several pieces of information that is put together in the codeword, it can still be said that the value of this codeword has the part of the value of several pieces of information (' in this sentence' "several" is the same number as the "several" in the previous sentence); for the situation that only the transformed value of the value of several spliced information is put in the codeword, it can still be said that the value of this codeword has several spliced The part of the value of the information ('several' in this sentence is the same number as 'several' in the previous sentence).

导电码表示码值的方式可参考2006102005537号申请里的阐述,也可参看本说明书对图6的拾音器实施例的阐述中对导电码技术的描述,导电码一般可直接或通过通断代名码数来表示二进制码值,而拼合信息的值或拼合信息的值的变换值或拼合信息的值的部分等一般是二进制格式,所以导电码直接或通过通断代名码数来表示的二进制码值可以有二进制格式的数个拼合信息的值或数个拼合信息的值的变换值或拼合信息的值的部分等。The way that the conductive code represents the code value can refer to the elaboration in No. 2006102005537 application, and also refer to the description of the conductive code technology in the explanation of the pickup embodiment of Figure 6 in this specification. To represent the binary code value, and the value of the combined information or the transformation value of the combined information value or the part of the value of the combined information is generally in binary format, so the binary code value represented by the conductive code directly or through the number of on-off code names can be There are several values of the combined information in binary format or transformed values of the values of the combined information or parts of the values of the combined information, and the like.

一般条形码的码制表示的字符集是10个十进制值(09),或者表示的字符集是几十个ascii字符,可以建表将10个十进制值(09)或几十个ascii字符按顺序赋与二进制编号,即建立了条形码与二进制值的对应关系;而拼合信息的值或拼合信息的值的变换值或拼合信息的值的部分等一般是二进制格式,这样就可把二进制格式的数个拼合信息的值或数个拼合信息的值的变换值或拼合信息的值的部分等转换成条形码格式的值,并放入条形码里。比如如条形码的码制表示的字符集是10个十进制值(09),那就可用条形码表示的字符集中的十进制的07来对应表示二进制的000111,如拼合信息的值为011101001,那每3个BIT一组对应转换成为十进制值351,这个351就是放入条形码的值,与此对应的,如在读码时得到条形码里的十进制值为351,可将其每个十进制数对应转换成3BIT的二进制值,即得011101001,这个011101001就是拼合信息的值。本段所述也是技术常识。The character set represented by the code system of the general barcode is 10 decimal values (09), or the character set represented is dozens of ascii characters, you can build a table to assign 10 decimal values (09) or dozens of ascii characters in order And the binary number, that is, the corresponding relationship between the barcode and the binary value is established; and the value of the combined information or the transformed value of the combined information or the part of the value of the combined information is generally in the binary format, so that the binary format can be used. The value of combined information or the converted value of several combined information values or part of the value of combined information is converted into a barcode format value and put into the barcode. For example, if the character set represented by the code system of the barcode is 10 decimal values (09), then thedecimal 07 in the character set represented by the barcode can be used to correspond to the binary 000111. If the value of the combined information is 011101001, then every 3 A group of BIT is correspondingly converted into a decimal value 351, and this 351 is the value put into the barcode. Correspondingly, if the decimal value in the barcode is 351 when reading the code, each decimal number can be converted into a binary value of 3BIT Value, namely 011101001, this 011101001 is the value of the combined information. This paragraph is also technical common sense.

如提及时无特别指明有他义,本发明文件的权利要求书和说明书中各处提及的读物,默认是指一个计量单位的读物,比如一本书,一张地图,一本杂志,这是显然的,也是常识。If there is no other meaning when mentioning, the claims and the reading materials mentioned in the description of the present invention document refer to reading materials of a unit of measurement by default, such as a book, a map, and a magazine. It is obvious and common sense.

本发明文件涉及的语音编码有声读物的整页面上的码字的形式,可以是常规的,即直接在作为有声读物主体的读物任一页面上印刷条形码,或直接在读物任一页面上印刷或涂敷导电物质形成导电码,等,可称这样形成的码字(码字可以是条形码,导电码,图形码或磁码等)的载体是读物的页,码字的载体平面是读物的一个页面(在本说明书其它涉及码字的形式的地方都是这样阐述的);另外也可在一小块平面材料的表面上形成码字(码字可以是条形码,导电码,图形码或磁码等)并把此小块平面材料粘贴到作为有声读物主体的读物的页面上,即以这一小块平面材料作为码字的载体,此平面材料的表面作为码字的载体平面(如码字是导电码,那一般要求平面材料的表面不导电);可以以一小块纸作为一小块平面材料的例子,这一小块纸的形状可为长方形或其它形,面积能容下一个或多个码字(码字可以是导电码或条形码等码种码字)即可,再采用前面述及的方式,采用印刷导电油墨或涂敷导电物质等方式,在这一小块纸表面上形成导电码码字,或在这一小块纸表面上印刷条形码,或是形成其它种类的码字,在这一小块纸上可形成一个或多个码字(多个码字中各码字可以是或不是同一码种的),再把这一小块纸粘贴到作为有声读物主体的读物的一个页面上,对要布置到读物整页面上的其它码字都可按这样形成到小块纸上并粘贴到读物相应页面上,这也是一种有关有声读物的整页面上的码字的形式;一种等同替换形式,是把要布置到读物各页面上的许多个码字,一齐形成(比如印刷)到一张或数张大的纸张上,再剪成一些小块纸,每小块纸表面上含有一个或多个码字(码字可以是导电码或条形码等码种码字;多个码字中各码字可以是或不是同一码种的),再把各小块纸粘贴到读物各页面需要其码字对应发音的地方;还可采用其它形式,比如可把各小块表面没有码字的纸先粘贴到读物各页面上,再在各小块纸的表面上采取印刷或涂敷等方式形成一个或多个码字。采用本段述的粘贴小块纸方式的码字形式后,如印在小块纸上的是导电码,其码字仍可以是2006102005537号申请里述的甲方式导电码或乙方式导电码,其甲方式或乙方式读码头的结构及其读码的原理过程显然仍可按照2006102005537号申请里的说明书及附图相关描述;采用粘贴小块纸方式,如印在小块纸上的是其它码种的码比如条形码等,其读码方式显然与将其印刷在读物的一个页面上的方式是一样的。此方式类似于百货商场里,把条形码打印到一小块纸质标签上,再把标签粘贴到商品表面。(本发明文件各处所说的码字的载体平面,显然是指码字的载体的一个平面,而且此码字处于这个平面上)The form of the code word on the entire page of the phonetic coded audiobook that the document of the present invention relates to can be conventional, that is, directly print a barcode on any page of the reading as the main body of the audiobook, or directly print or print on any page of the reading. Coating conductive material forms conductive code, etc., can say that the carrier of the code word (code word that can be bar code, conductive code, graphic code or magnetic code etc.) that forms like this is the page of reading material, and the carrier plane of code word is one of reading material Page (all other places in this manual that relate to the form of codewords are set forth in this way); in addition, codewords can also be formed on the surface of a small piece of planar material (codewords can be barcodes, conductive codes, graphic codes or magnetic codes) etc.) and paste this small piece of plane material on the page of the reading as the main body of the audiobook, that is, use this small piece of plane material as the carrier of the code word, and the surface of this plane material as the carrier plane of the code word (such as the code word It is a conductive code, which generally requires the surface of the plane material to be non-conductive); a small piece of paper can be used as an example of a small piece of plane material, the shape of this small piece of paper can be rectangular or other, and the area can accommodate one or A plurality of codewords (codewords can be codewords such as conductive codes or barcodes), and then adopt the method mentioned above, using printing conductive ink or coating conductive substances, etc., on the surface of this small piece of paper Form conductive code code words, or print barcodes on the surface of this small piece of paper, or form other types of code words, one or more code words can be formed on this small piece of paper (each code in multiple code words Words may or may not be of the same code type), and then this small piece of paper is pasted on a page of the reading material as the main body of the audiobook, and other code words to be arranged on the entire page of the reading material can be formed into small pieces in this way This is also a form of codewords on the entire page of the audiobook; an equivalent replacement form is to put many codewords to be arranged on each page of the book together. Formed (such as printed) on one or several large sheets of paper, and then cut into small pieces of paper, each small piece of paper contains one or more codewords on the surface (codewords can be conductive codes or barcodes, etc.) each code word in a plurality of code words can be or not the same code type), and then each small piece of paper is pasted to the place where its code word corresponding pronunciation is needed on each page of the reading material; other forms can also be adopted, such as each small piece of paper can be The paper that does not have code word on the block surface is pasted on each page of the reading material earlier, and then take printing or coating on the surface of each small piece of paper to form one or more code words. After adopting the code word form of pasting the small piece of paper described in this paragraph, if the conductive code is printed on the small piece of paper, the code word can still be the conductive code of mode A or the conductive code of mode B mentioned in the application No. 2006102005537. The structure of the wharf read by method A or method B and the principle process of code reading can obviously still be described in accordance with the instructions and drawings in the application No. 2006102005537; the method of pasting small pieces of paper is adopted, such as printing on the small piece of paper is other The code of code kind such as barcode etc., its code reading mode is obviously the same with the mode that it is printed on a page of reading material. This method is similar to that in a department store, where the barcode is printed on a small piece of paper label, and then the label is pasted on the surface of the product. (the carrier plane of the code word mentioned in the documents of the present invention obviously refers to a plane of the carrier of the code word, and this code word is on this plane)

本发明文件各处所指的语音编码有声读物的整页面上的任一码字的形式,可以是直接把有声读物的页面(也即作为有声读物主体的读物的页面)当成载体平面的码字的形式(这是本说明书其它页涉及到码字的形式的地方主要述及的码字的形式,比如在读物的页面上直接印刷导电码或条形码),也可以是如上段述的一小块平面材料的表面上有此码字(即以这一小块平面材料作为码字的载体,这一小块平面材料的表面上只有此码字或有包括此码字在内的多个码字,这里说的多个码字中各码字可以是或不是同一码种的)且这一小块平面材料是粘贴在读物的页面上的形式;本段所指码字可以是导电码,条形码,图形码或磁码等。而且本段说的两种码字的形式(直接把读物页面当成载体平面的码字的形式与小块平面材料的表面上有码字且这一小块平面材料是粘贴在读物的页面上的形式)互相是等同替换。且一本语音编码有声读物的整页面上可只有这两种形式的码字中的一种码字,也可以同时存在这两种形式的码字。The form of any code word on the entire page of the phonetic coded audiobook referred to in the document of the present invention can be a code word that directly regards the page of the audiobook (that is, the page of the reading material as the main body of the audiobook) as the carrier plane (this is the form of the code word that is mainly mentioned in other pages of this specification that relate to the form of the code word, such as directly printing conductive codes or barcodes on the pages of reading materials), or it can be a small piece as mentioned in the above paragraph This codeword is arranged on the surface of the planar material (promptly with this small piece of planar material as the carrier of codeword, only have this codeword or have a plurality of codewords including this codeword on the surface of this small piece of planar material , each code word in the multiple code words mentioned here can be or not the same code type) and this small piece of plane material is the form pasted on the page of the reading material; the code word indicated in this paragraph can be conductive code, bar code , graphic code or magnetic code, etc. And the form of two kinds of code words that this paragraph says (directly regard reading material page as the form of the code word of carrier plane and the surface of small piece plane material have code word and this little piece plane material is pasted on the page of reading material form) are equivalent replacements for each other. And the entire page of a speech coding audiobook may have only one codeword in these two forms of codewords, or these two forms of codewords may exist at the same time.

码字的位置是在有声读物的整页面上的,并且一本有声读物的整页面上的诸个码字中所有码字可都是同一语种的码字(一般是各个码字涉及的各拼合信息的值都是由只适用于此语种的一种编码规则来编码生成的),或不同码字可以是不同语种的码字(一般情况是,每个语种的各码字涉及的各拼合信息的值都是由只适用于此语种的一种编码规则来编码生成的,用以编码生成不同语种的码字涉及的拼合信息的值的编码规则是不同的编码规则,即此有声读物的整页面上有多少个语种的码字就有用多少个编码规则),各个码字涉及的各拼合信息的值里一般带有表示是哪个语种的信息,另也可能是一个码字涉及的各拼合信息的值是不同语种的拼合信息的值(一般情况可以是:一个码字涉及的不同语种的拼合信息的值由各自只适用于各语种的编码规则编码生成,或一个码字涉及的不同语种的拼合信息的值由某适用于多个语种的编码规则中有关各语种的编码细则分别编码生成;还可能有其它情况)。有声读物的一个页面上可有一个或多个处于任意不同位置的码字;有声读物的整页面上的诸个码字中的各码字,可以是导电码,条形码,图形码,磁码或其它码种的码字;这些码字中每个码字的值是共与数个拼合信息的值有关的(每个码字涉及的任一某语种的拼合信息的值,可以是按照只适用于此语种或适用于包括此语种在内的多个语种的编码规则中有关此语种的编码细则来编码生成的),拾音器读取码字,并根据码值来解码播出的语句发音,可以是一般提示音,也可以是把有声读物的整页面上这些码字旁边的句子文字或段落文字的内容照读一遍。本发明文件各处所称语音编码有声读物,其主体为读物,在读物的整页面上有诸个码字,读物的页是码字的载体,这诸个码字中的每个码字可被相应拾音器读取以用于拾音器播出语句发音(如这诸个码字中有不同码种的码字,那不同码种的码字可能需用带有相应读码头的不同的拾音器来读取以播出语句发音,不同码种的码字具体是指条形码,图形码,导电码等),这诸个码字中的每个码字的值是共与数个拼合信息的值有关的,比如有一个拼合信息的值,拾音器根据读得的这诸个码字中的部诸码字中的码字的码值来播出语句发音(拾音器可以是根据读得的这部诸码字中的一个或多个码字的码值来播出语句发音);各码字可以是处在读物的整页面上需播此码字对应播的数个语句发音的位置,各码字在读物的整页面上所处位置一般是在印刷着码字对应播的数个语句发音的文字的旁边。当然处于不同页面或同一页面不同位置的不同码字涉及的表示语句发音的拼合信息的值一般是不同的,一般情况下不同的码字涉及的拼合信息的值是表示不同语句发音的拼合信息的值。如图3,是一种语音编码有声读物的实例,图3所示的语音编码有声读物的主体是书,此有声读物的整页面上印有导电码码字10(此导电码是双排码线多码段甲导电码),当前两页面中每个页面上各有2个导电码码字(当然如是其它情况比如左边页面上有2个码字而右边页面上有4个码字,都是可以的),此有声读物的整页面上的每个导电码码字的值有一个表示在此有声读物的页面上此码字旁边印刷的英语语句文字的发音的拼合信息的值,图3中的44显然是此有声读物的页面上的文字部分。如图7,是另一种语音编码有声读物的实例,图7所示的语音编码有声读物的主体是书,书的当前两页面上印有条形码码字11,书的整页面上的每个条形码码字11的值有一个表示英语语句发音的拼合信息的值,图7中的44显然是此有声读物的页面上的文字部分。如提及时无特别指明有他义,本发明文件的权利要求书和说明书中各处提及的语音编码有声读物,默认是指一个计量单位的语音编码有声读物,比如一本书(此书本身不包括拼合信息码,此书是读物)与其整页面上所有的拼合信息码一起形成的一本语音编码有声读物,这是显然的,也是常识。在读码时,显然每次读码是使一个读码头与一单个码字接触或接近以读码。本发明文件各处所说的码字对应播的数个(一个或多个)语句发音是指一个码字被拾音器读取后,拾音器根据从此码字的码值得到的数个拼合信息的值来解码播出的这数个语句发音,也指包括此码字在内的多个码字被拾音器读取后,拾音器根据将这多个码字的码值组合起来后得到的数个拼合信息的值(一般这数个拼合信息的值是分散放置在这多个码字里的)来解码播出的这数个语句发音,(以上4句几次提及的‘数个’都是相同数目)。The position of the codeword is on the entire page of the audiobook, and all the codewords in the codewords on the entire page of an audiobook can be all codewords of the same language (generally, each codeword involved in each combination The value of the information is encoded by a coding rule that is only applicable to this language), or different codewords can be codewords of different languages (generally, each codeword of each language involves each combination of information The values of all are coded and generated by a coding rule that is only applicable to this language, and the coding rules used to code the values of the combined information involved in generating code words in different languages are different coding rules, that is, the entire audiobook There are as many coding rules as there are codewords of languages on the page), and the value of each combination information involved in each codeword generally contains the information indicating which language it is, and it may also be the combination information involved in a codeword The value of is the value of the combined information of different languages (generally, it can be: the values of the combined information of different languages involved in a codeword are generated by coding rules that are only applicable to each language, or the values of different languages involved in a codeword The value of the spliced information is generated by encoding the encoding rules for each language in an encoding rule applicable to multiple languages; there may be other cases). There can be one or more codewords in any different positions on a page of the audiobook; each codeword among the codewords on the entire page of the audiobook can be a conductive code, a barcode, a graphic code, a magnetic code or Code words of other code types; the value of each code word in these code words is related to the value of several pieced together information (the value of the pieced together information of any certain language that each code word relates to can be according to only applicable In this language or applicable to the coding rules of this language in the coding rules of multiple languages including this language to encode and generate), the pickup reads the code word, and decodes the pronunciation of the sentence broadcast according to the code value, which can be It is a general prompt sound, and it can also be to read the content of the sentence text or paragraph text next to these code words on the entire page of the audiobook. The voice coded audiobook referred to in the documents of the present invention, its main body is the reading material, there are various code words on the entire page of the reading material, and the page of the reading material is the carrier of the code word, and each code word in these code words can be Corresponding pick-up reads for the pick-up to broadcast sentence pronunciation (as there are code words of different code types in these code words, then the code words of different code types may need to be read with different pick-ups with corresponding reading docks Pronunciation with the broadcast sentence, the codewords of different code types refer to barcodes, graphic codes, conductive codes, etc.), the value of each codeword in these codewords is related to the value of several pieces of combined information, For example, there is a value of combining information, and the pick-up broadcasts the sentence pronunciation according to the code value of the code words in the code words in the code words in the code words read (the pick-up device can be based on these code words read) The code value of one or more code words to broadcast sentence pronunciation); Each code word can be on the whole page of reading material and needs to broadcast the position of several sentence pronunciations of this code word corresponding broadcasting, and each code word is in the reading material The position on the whole page is generally next to the text that is printed with codewords corresponding to several sentences pronounced. Of course, the values of the stitching information involved in different codewords on different pages or in different positions on the same page are generally different. In general, the values of the stitching information involved in different codewords represent the stitching information of different sentence pronunciations. value. As shown in Fig. 3, it is an example of a kind of speech coding audiobook, the main body of the speech coding audiobook shown in Fig. 3 is a book, and the conductivecode code word 10 is printed on the whole page of this audiobook (this conductive code is a double row code Line multi-code segment A conductive code), each of the current two pages has 2 conductive code words (of course, in other cases, such as 2 code words on the left page and 4 code words on the right page, both is possible), the value of each conductive code word on the entire page of the audio book has a value representing the pronunciation of the English sentence text printed next to the code word on the page of the audio book, Figure 3 The 44 in is obviously the portion of text on the page of this audiobook. As shown in Figure 7, it is an example of another speech-coded audiobook. The main body of the speech-coded audiobook shown in Figure 7 is a book, andbarcode code words 11 are printed on the current two pages of the book. The value of thebarcode code word 11 has a value representing the spliced information of English sentence pronunciation, and 44 in Fig. 7 is obviously the text part on the page of this audio book. If there is no special meaning when mentioned, the speech coded audiobook mentioned in the claims of the present invention document and the description everywhere refers to the speech coded audiobook of a unit of measurement by default, such as a book (the book itself Does not comprise the splicing information code, and this book is reading material) and all splicing information codes on its whole page form a phonetic coding audiobook together, this is obvious, also is common sense. When reading the code, it is obvious that each time the code is read, a reading terminal is brought into contact with or close to a single code word to read the code. The several (one or more) sentence pronunciations that the said codeword of the document of the present invention corresponds to broadcasting refers to that after a codeword is read by the pick-up, the pick-up is based on the value of several pieced together information obtained from the code value of the codeword. The pronunciation of these several sentences that are decoded and broadcasted also means that after multiple codewords including this codeword are read by the pickup, the pickup will combine the code values of the multiple codewords according to the combination of several pieces of information obtained after combining the codewords. value (generally the values of these combined information are scattered in these multiple codewords) to decode the pronunciation of these several sentences broadcast, (the 'several' mentioned several times in the above 4 sentences are all the same number ).

在权利要求10,19,22中提及的‘前述拾音器的前述单片机芯片可以利用前述拾音器的前述读码头来读取前述诸个码字中的部诸码字中的每个码字’以及在权利要求2中提及的‘前述拾音器的前述单片机芯片利用前述拾音器的前述读码头可以读取前述部诸码字中的每个码字’,显然是指单片机芯片利用读码头可以分别读取有声读物的整页面上的部诸码字中的每个码字。Inclaim 10, 19, the aforementioned single-chip microcomputer chip of the aforementioned pickup can utilize the aforementioned reading pier of the aforementioned pickup to read each codeword in the portions of the aforementioned codewords inclaim 10, 19, and 22. Mentioned in claim 2' the aforementioned single-chip microcomputer chip of the aforementioned pickup can read each codeword in the codewords of the aforementioned parts by the aforementioned reading dock of the aforementioned pickup. Each of the codewords in the codewords on the entire page of the reading material.

把语音编码有声读物所有页面上的所有其码值是共与数个表示如何用音码拼合成语句发音的拼合信息的值有关的码字都集合起来(也即把语音编码有声读物所有页面上的所有拼合信息码都集合起来),就是本发明文件各处提及的语音编码有声读物的整页面上的诸个码字,语音编码有声读物的整页面上的诸个码字与读物的整页面上的诸个码字是同一概念(此读物是作为语音编码有声读物的主体),即把读物所有页面上的所有其码值是共与数个表示如何用音码拼合成语句发音的拼合信息的值有关的码字都集合起来(也即把读物所有页面上的所有拼合信息码都集合起来),就是本发明文件各处提及的读物的整页面上的诸个码字,诸个码字可以是一个或多个码字,比如某个有声读物共有4个页面,各页面分别有2,3,0,1个码字,如这6个(2+3+1)码字的码值都是共与数个表示语句发音的拼合信息的值有关的,那此有声读物的整页面(整页面只包括那4个页面)上的诸个码字就是这6个码字;一本有声读物的整页面上的诸个码字中的各码字,一般是同一码种的码字,比如都是导电码,但如在一本有声读物的整页面上存在多种码种的码字也可(即诸个码字中含有多种码种的码字),比如既有导电码也有图形码,一般可以用带有机电式导电码读码头的拾音器或带有光电式图形码读码头的拾音器分别读取这两种码字以各自发音;显然,一本语音编码有声读物整页面上的诸个码字中的每个码字都可被拾音器读取以用于拾音器解码播出语句发音(码字被拾音器读取的情况一般有:不同码字可被同一种拾音器读取发音,不同码字只能被不同种拾音器读取发音,一个码字可分别被多种拾音器读取以用于这些拾音器各自播出语句发音;不同码字只能被不同种拾音器读取的情况比如一个导电码和一个磁码,需用带有机电式导电码读码头的拾音器或带有磁码读码头的拾音器分别读取这两个码字以各自发音,一个码字可分别被多种拾音器读取的情况比如导电码可被有机电式导电码读码头的拾音器读取,也可被有光电式读码头的拾音器读取);一本有声读物的整页面上的诸个码字中的各码字,一般都是同一语种的,比如都是英语(语种)的码字(这里各码字涉及的各拼合信息的值可都是由只适用于英语的一种编码规则来编码生成的),但如在一本有声读物的整页面上的各码字是不同语种的也可(即此有声读物的整页面上有两个或两个以上语种的码字),比如一本有声读物的整页面上既有英语(语种)的码字也有汉语(语种)的码字,英语(语种)的各码字涉及的各拼合信息的值可由只适用于英语的一种编码规则来编码生成,汉语(语种)的各码字涉及的各拼合信息的值可由只适用于汉语的一种编码规则来编码生成,可以用一个拾音器读取这两种码字并分别解码发出英语语音和汉语语音,这种情况下与此拾音器有关的存储器集里一般应存有只适用于英语的解码程序及解码数据和只适用于汉语的解码程序及解码数据,当然有声读物的整页面上的这两种码字也可以分别被适用于英语的拾音器和适用于汉语的拾音器分别解读(有关有声读物的整页面上的码字还可有其它情况,比如一本有声读物的整页面上是既有英语的码字也有汉语的码字也有韩语的码字也可);甚至一本有声读物的整页面上的诸个码字中的全部或部分码字中的每个码字里可以是有多个不同语种的拼合信息的值的,比如一本有声读物的整页面上的诸个码字中的有些码字中的每个码字的值里同时有英语(语种)的拼合信息的值和汉语(语种)的拼合信息的值,英语(语种)的拼合信息的值可由只适用于英语的一种编码规则来编码生成,汉语(语种)的拼合信息的值可由只适用于汉语的一种编码规则来编码生成,可以用一个拾音器读取这种码字并分别解码播出英语语音和汉语语音,这种情况下与此拾音器有关的存储器集里一般应存有只适用于英语的解码程序及解码数据和只适用于汉语的解码程序及解码数据。诸个码字中的任一码字涉及的任一拼合信息的值可以是任一非复合语种的也可以是任一复合语种的。在本发明文件中使用‘诸个码字’的概念,是为了方便区别于本发明文件各处提及的单个码字。All the code words on all the pages of the phonetic coded audiobook are related to the value of the combined information of how to spell the pronunciation of the sentence with several representations (that is, all the pages of the phonetic coded audiobook are combined) All the combined information codes are assembled), that is, the code words on the entire page of the speech-coded audiobook mentioned in the document of the present invention, the codewords on the entire page of the speech-coded audiobook and the entire page of the reading material The codewords on the page are the same concept (this reading is the main body of the audio-coded audiobook), that is, all the code values on all the pages of the reading are combined with several representations of how to use phonetic codes to form a combination of sentence pronunciation The code words relevant to the value of the information are all assembled (that is, all the combined information codes on all pages of the reading material are assembled), that is, the code words on the entire page of the reading material mentioned in the documents of the present invention everywhere, all the code words Codewords can be one or more codewords. For example, an audiobook has 4 pages in total, and each page has 2, 3, 0, and 1 codewords. For example, the 6 (2+3+1) codewords The code values are all related to the values of several combined information representing the pronunciation of sentences, so the code words on the entire page of the audiobook (the entire page only includes those 4 pages) are exactly these 6 code words; Each codeword in the codewords on the entire page of the audiobook is generally a codeword of the same code type, such as a conductive code, but if there are multiple codewords on the entire page of an audiobook The code word also can (that is to say, the code word that contains multiple codes in all code words), such as existing conductive code also has graphic code, generally can use the pick-up that has electromechanical type conductive code to read dock or have photoelectric type graphic code The pickup of the reading dock reads the two codewords separately to pronounce them separately; obviously, each of the codewords on the entire page of a speech-encoded audiobook can be read by the pickup for the pickup to decode the playback. Pronunciation of a sentence (codewords are generally read by pickups: different codewords can be read and pronounced by the same pickup, different codewords can only be read and pronounced by different pickups, and one codeword can be read by multiple pickups respectively It is used for these pickups to broadcast the pronunciation of sentences; different code words can only be read by different kinds of pickups, such as a conductive code and a magnetic code, it is necessary to use a pickup with an electromechanical conductive code reading terminal or a pickup with a magnetic The pickups of the code-reading terminal read the two codewords to pronounce them separately. One codeword can be read by multiple pickups. For example, the conductive code can be read by the pickup of the electromechanical conductive code-reading terminal. There is the pickup of the photoelectric reading pier to read); each codeword in the codewords on the whole page of an audio book is generally of the same language, such as all English (language) codewords (here each The value of each stitched information involved in the codeword can be coded and generated by a coding rule that is only applicable to English), but if the codewords on the entire page of an audiobook are in different languages ( That is, there are two or more codewords in two or more languages on the entire page of the audiobook), for example, there are both English (language) codewords and Chinese (language) codewords on the entire page of an audiobook, English (language) The value of each piece of information involved in each codeword of Chinese (language) can be coded and generated by a coding rule that is only applicable to English, and the value of each piece of information involved in each codeword of Chinese (language) can be generated by a kind of coding rule that is only applicable to Chinese Encoding rules are used to encode and generate. A pickup can be used to read the two codewords and decode them to produce English and Chinese speech respectively. In this case, the memory set related to the pickup should generally have a decoding program that is only applicable to English. and decoding data, and a decoding program and decoding data that are only applicable to Chinese, of course, these two codewords on the entire page of the audio book can also be interpreted respectively by the pickup that is applicable to English and the pickup that is applicable to Chinese (the relevant audio book The code words on the whole page can also have other situations, such as on the whole page of an audio book, there are both English code words and Chinese code words and also Korean code words); even the entire page of an audio book All or part of the codewords in the above codewords may have the value of the combined information in multiple different languages, such as the codewords in the codewords on the entire page of an audio book The value of the mosaic information of English (language) and the value of the mosaic information of Chinese (language) are arranged in the value of each code word in some codewords simultaneously, and the value of the mosaic information of English (language) can be only applicable to English The value of the combined information of Chinese (language) can be coded and generated by a coding rule that is only applicable to Chinese, and a pickup can be used to read this code word and decode and broadcast English voice and Chinese voice respectively In this case, generally there should be a decoding program and decoding data that are only applicable to English and a decoding program and decoding data that are only applicable to Chinese in the relevant memory set of this pickup. The value of any spliced information involved in any codeword among the codewords may be of any non-compound language or any composite language. The concept of 'codewords' is used in this patent document to facilitate the distinction from the single codewords mentioned throughout this patent document.

如述及时无特别指明有其他情况,本说明书各处述及的有声读物的整页面上的诸个码字中的码字,默认是以同一码字涉及的各拼合信息的值的语种是相同的为例的(但诸个码字中不同码字涉及的拼合信息的值的语种可以不同)。诸个码字显然可以是一个或多个码字。诸个码字中的各码字显然可以是全部都处于有声读物的一个页面上或分散处于有声读物的多个页面上;而且诸个码字中的各码字显然可以是分散处于有声读物的全部页面上或部分页面上。If there are no other circumstances specified when it is mentioned, the codewords in the codewords on the entire page of the audiobook mentioned in this manual are defaulted to be the same language for the value of each piece of information involved in the same codeword (but the languages of the values of the combined information involved in different codewords among the various codewords may be different). The codewords can obviously be one or more codewords. Each of the codewords can obviously be all on one page of the audiobook or be scattered on multiple pages of the audiobook; and each of the codewords can obviously be scattered on the audiobook on all or part of the page.

在权利要求书和本说明书中各处提及的‘部诸码字’,是一种称谓,是指语音编码有声读物的整页面上的诸个码字中的全部码字或语音编码有声读物的整页面上的诸个码字中的部分码字,部诸码字中的每个码字是都可被同一种拾音器读取以用于此拾音器播出语句发音的(本句说的拾音器是指某一种拾音器);在权利要求书和本说明书中各处提及有关拾音器读取有声读物的整页面上的部诸码字中的码字,此部诸码字是指此有声读物的整页面上的诸个码字中能被此拾音器读取以用于此拾音器播出语句发音的所有码字或部分码字(至于把诸个码字中能被此拾音器读取以用于此拾音器播出语句发音的所有码字中的哪些码字划到此部诸码字集合里,并无特别规定,划多划少随便;而且一遇到几个可被某拾音器读取以用于此拾音器播出语句发音的码字,就可把这几个码字集合起来定义成部诸码字来进行技术讨论)。本发明文件引入部诸码字概念的原因是,因为一本语音编码有声读物的整页面上可能只有一个码种的码字,或有多个码种的码字(比如既有图形码也有导电码也有条形码等),一般一种拾音器只有一种读码头只能读取一种码字(比如机电式导电码读码头可读导电码)或可读两种码字(比如光电式读码头可读导电码和条形码),而且诸个码字中能被某拾音器读码的码字,其码值不一定可用于此拾音器解码播出语句发音(比如码字的值有一个汉语的拼合信息的值,而此拾音器是只能对英语的拼合信息的值来解码播出语句发音的),即语音编码有声读物的整页面上的诸个码字中的全部或部分码字中的每个码字可被同一种拾音器读取以用于此拾音器播出语句发音,所以为撰些权利要求书的方便,就引入部诸码字的概念。部诸码字中的各码字可能全是属于一个码种的码字,或是分属于多个码种的码字(比如部诸码字中有些码是导电码,有些码是条形码,而拾音器的读码头是光电式读码头,可以读这两种码)。本段各处说的拾音器可以是非插卡方式拾音器或插数据卡方式拾音器或插部件式数据卡方式拾音器。The "all codewords" mentioned in the claims and this specification are a kind of appellation, referring to all the codewords in the codewords on the entire page of the speech-coded audiobook or the speech-coded audiobook Part codewords in all codewords on the whole page, each codeword in part all codewords all can be read by same kind of pick-up for this pick-up to broadcast sentence pronunciation (the pick-up that this sentence says refers to a certain kind of pickup); in the claims and this specification, it is mentioned that the relevant pickup reads the codewords in the codewords on the entire page of the audiobook, and these codewords refer to the audiobook Among the code words on the whole page, all code words or some code words that can be read by this pickup for the pronunciation of the sentence spoken by this pickup (as for the code words that can be read by this pickup for use in all code words Which codewords in all the codewords pronounced by the pickup broadcast sentence are drawn into the collection of all codewords in this department, there is no special regulation, more strokes are less random; The code word that the sound pickup broadcasts sentence pronunciation here just can these several code words be assembled and defined into department all code words and carry out technical discussion). The reason why the document of the present invention introduces the concept of all codewords is because there may only be codewords of one code type on the entire page of a speech coded audiobook, or code words of multiple code types (such as both graphic codes and conductive codes) Codes also have bar codes, etc.), generally, a pickup has only one reading terminal and can only read one codeword (such as electromechanical conductive code reading terminal can read conductive code) or can read two codewords (such as photoelectric reading terminal can read read conduction code and bar code), and among the code words that can be read by a certain pickup, its code value may not necessarily be used for the pickup to decode and broadcast the sentence pronunciation (for example, the value of the code word has a Chinese combination information value, and this pickup can only decode the pronunciation of the broadcast sentence to the value of the spelling information in English), that is, each code word in all or part of the code words on the entire page of the audio-coded audiobook Words can be read by the same pickup for this pickup to broadcast sentence pronunciation, so for the convenience of writing some claims, the concept of all codewords is introduced. Each codeword in all codewords of the department may all belong to the codeword of a code kind, or belong to the codeword of a plurality of code kinds (for example, some codes are conductive codes in the codewords of the part, some codes are barcodes, and The reading terminal of the pickup is a photoelectric reading terminal, which can read these two codes). The pickups mentioned everywhere in this paragraph can be non-card mode pickups or data card mode pickups or plug-in data card mode pickups.

本发明文件各处述及有关某拾音器读取有声读物的整页面上的部诸码字中的(一个或多个)码字时,显然这部诸码字中的任一码字涉及的全部数个拼合信息的值中至少有一个拼合信息的值可以被此拾音器解码,这拼合信息的值是表示某语种语句发音的拼合信息的值,此拾音器解码时可以是根据与用于编码生成此拼合信息的值的某编码规则中有关此语种的编码细则对应的某解码规则中有关此语种的解码细则来解码的(此对应可以是专门对应或一般对应)。When the document of the present invention refers to the (one or more) codewords in the codewords on the entire page of the audio book read by a certain pickup, it is obvious that any codeword in these codewords relates to all At least one of the values of the combined information can be decoded by the pickup. The value of the combined information is the value of the combined information that represents the pronunciation of a certain language sentence. When the pickup is decoded, it can be based on the encoding used to generate this The decoding rules for this language in a certain encoding rule for the value of the combined information correspond to the decoding rules for this language in a certain decoding rule (this correspondence can be a special correspondence or a general correspondence).

部诸码字显然可以是一个或多个码字。部诸码字中的各码字显然可以是全部都处于有声读物的一个页面上或分散处于有声读物的多个页面上;而且部诸码字中的各码字显然可以是分散处于有声读物的全部页面上或部分页面上。一部诸码字中的任一码字涉及的全部数个拼合信息的值中的任一拼合信息的值可以是任一非复合语种的也可以是任一复合语种的。All codewords can obviously be one or more codewords. Each code word among all code words can obviously be all on one page of the audiobook or be scattered on multiple pages of the audiobook; on all or part of the page. The value of any piece of information in all the values of pieced information involved in any codeword in a section of codewords can be of any non-compound language or any compound language.

显然‘诸个码字’的概念和‘部诸码字’的概念都是显而易见的概念,都可由优先权申请文件推出。Apparently, the concepts of 'codewords' and 'all codewords' are obvious concepts, which can be deduced from the priority application documents.

有声读物的整页面上的一个码字被一种拾音器读取以用于此拾音器播出语句发音的情况一般有2种:1是从一个码字就可得到表示此语句发音的拼合信息的值,此拾音器读取这一个码字得到此拼合信息的值,即可对此拼合信息的值执行解码使发音器件播出语句发音(这种情况,可以是只把此一个拼合信息的值的BIT放在此一个码字里,或,可以是在此码字的值里放有此一个拼合信息的值的BIT和别的拼合信息的值的BIT),2是对于表示此语句发音的拼合信息的值,需读包括此码字在内的多个码字才能得到此拼合信息的值,此拾音器读取包括此码字在内的这多个码字并把这多个码字合起来以得到此拼合信息的值,并对此拼合信息的值执行解码使发音器件播出语句发音(这种情况,可以是只把此一个拼合信息的值的BIT分散放在这多个码字里,或,可以是把此一个拼合信息的值的BIT分散放在这多个码字里而且在这多个码字中各码字的值里还放有别的拼合信息的值的BIT)。There are generally two situations in which a codeword on the entire page of an audiobook is read by a pickup for the pronunciation of a sentence by the pickup: 1. The value of the combined information representing the pronunciation of the sentence can be obtained from a codeword , the pickup reads this codeword to obtain the value of the combined information, and then decodes the value of the combined information to make the pronunciation device broadcast the pronunciation of the sentence (in this case, only the BIT of the value of the combined information can be Be placed in this one code word, or, can be to put the BIT of the value of this one pieced together information and the BIT of the value of other pieced together information in the value of this codeword), 2 is for the pieced together information representing the pronunciation of this sentence value, it is necessary to read multiple codewords including this codeword to obtain the value of this splicing information, the pickup reads these multiple codewords including this codeword and puts these multiple codewords together to obtain Obtain the value of this merging information, and carry out decoding on the value of this merging information so that the pronunciation device broadcasts the sentence pronunciation (in this case, it is possible to only disperse the BIT of the value of this one merging information in these multiple codewords, Or, it can be that the BIT of the value of this piece of information is scattered in these multiple codewords and the value of other piece of information is also placed in the value of each codeword in the multiple codewords).

由‘部诸码字中的每个码字是都可被同一种拾音器读取以用于此拾音器播出语句发音的’显然可推出‘部诸码字中的每个码字是都可被同一个拾音器读取以用于此拾音器播出语句发音的’。It is obvious that each of the codewords can be read by the same kind of pickup for the pronunciation of the sentence pronounced by the pickup. The same pickup reads the ' that is used for the pronunciation of the sentence pronounced by this pickup.

部诸码字可能有这几种情况(还可能有除这3种情况外的其它情况):1是对于部诸码字中的每个码字,都有此特征:拾音器读此一个码字后就可解码播出语句发音(比如部诸码字中每个码字的值有一个拼合信息的值);2是对于部诸码字中的每个码字,都有此特征:拾音器读此一个码字后还需读此部诸码字中的其它码字由这多个码字得到拼合信息的值才可解码播出语句发音(比如部诸码字中只含有3个码字,一个拼合信息的值的BIT被分散放在这3个码字的值里,这3个码字中每个码字的值有这个拼合信息的值的部分);3是对于部诸码字中的部分码字中的每个码字,都有此特征:拾音器读此一个码字后就可解码播出语句发音(比如这部分码字中每个码字的值有一个拼合信息的值),而对于部诸码字中的另外部分码字中的每个码字,都有此特征:拾音器读此一个码字后还需读部诸码字中的其它码字由这多个码字得到拼合信息的值才可解码播出语句发音(比如这部分码字中每个码字的值有一个拼合信息的值的部分),这里第3种情况可举一例:如部诸码字中含有3个码字(码字1,码字2,码字3),拼合信息的值1的BIT分开放在这3个码字里,拼合信息的值2的BIT也放在码字1里,那码字1可看成部诸码字中的一个部分,码字2和码字3可看成部诸码字中的另一部分,读前一部分中码字1的值后即可根据码值里的拼合信息的值2来解码播出语句发音,读后一部分中的任一码字(码字2或码字3)后,都需读另外两个码字才能由这3个码字的值组合起来得到拼合信息的值1来解码播出语句发音(即如果先读码字2就还需读码字3和码字1,如果先读码字3就还需读码字2和码字1;一般在码字2和码字3里有BIT标志表示需读多个码),而且因为把码字1也读进来了,所以解码播的语句发音还可能包括码字1里的拼合信息的值2对应的语句发音(这要看具体需要)。All code words of part may have these several situations (also may have other situations except these 3 kinds of situations): 1 is for each code word in all code words of part, has this feature: pickup reads this code word Just can decode and broadcast the sentence pronunciation (for example, the value of each code word in the code words of the part has a value of assembling information); 2 is for each code word in the code words of the part, this feature is arranged: the pickup read Also need to read other code words in all code words of this part after this one code word and obtain the value of stitching information by these many code words and just can decode broadcast sentence pronunciation (for example only contain 3 code words in all code words of part, The BIT of the value of a pieced together information is scattered in the value of these 3 codewords, and the value of each codeword has the part of the value of this pieced together information in these 3 codewords); 3 is for all codewords in the department Each codeword in the part of codewords has this feature: after the pickup reads this codeword, it can decode and broadcast the sentence pronunciation (for example, the value of each codeword in this part of codewords has a value of splicing information) , and for each codeword in other part codewords in all codewords, this characteristic is arranged: after the pickup reads this codeword, also need to read other codewords in all codewords by these multiple codewords Only when the value of the combined information is obtained can the pronunciation of the broadcast sentence be decoded (for example, the value of each code word in this part of the codeword has a part of the value of the combined information), the third case here can be an example: as in the code words Contains 3 codewords (codeword 1, codeword 2, codeword 3), the BIT with a value of 1 in the combined information is placed in these 3 codewords separately, and the BIT with a value of 2 in the combined information is also placed in codeword 1 , then codeword 1 can be regarded as one part of all codewords, codeword 2 and codeword 3 can be regarded as another part of all codewords, after reading the value of codeword 1 in the former part, the The value 2 of the stitching information in the value is used to decode the pronunciation of the broadcast sentence. After reading any codeword (codeword 2 or codeword 3) in the latter part, it is necessary to read the other two codewords to understand the 3 codewords. Combine the values of the combined information to get thevalue 1 of the stitching information to decode the pronunciation of the broadcast sentence (that is, if you readcode word 2 first, you need to readcode word 3 andcode word 1, if you readcode word 3 first, you also need to readcode word 2 and code word 1)Codeword 1; generally there is a BIT mark incodeword 2 andcodeword 3 to indicate that multiple codes need to be read), and becausecodeword 1 is also read in, the sentence pronunciation of decoding and broadcasting may also includecodeword 1 The sentence pronunciation corresponding to thevalue 2 of the combined information (it depends on specific needs).

一部诸码字中的各码字,一般都是同一语种的,比如都是英语(语种)的码字;但如一部诸码字中的各码字是不同语种的也可(即此部诸码字中有两个或两个以上语种的码字),这种情况时拾音器可对这不同语种的拼合信息的值分别解码播出语句发音,比如一部诸码字中既有英语的码字也有汉语的码字;甚至一部诸码字中的全部或部分码字中的每个码字里可以是有多个不同语种的拼合信息的值的,这种情况时拾音器一般可对这不同语种的拼合信息的值分别解码播出语句发音,比如一部诸码字中的有些码字中的每个码字的值里同时有英语的拼合信息的值和汉语的拼合信息的值。本段提到的语种可以是任一非复合语种或任一复合语种。Each code word in a part of all code words is generally all of the same language, such as being a code word of English (language); but as each code word in a part of all code words is also different languages (that is, this part There are code words of two or more languages in all code words), in this case, the pickup can decode and broadcast sentence pronunciation respectively to the value of the mosaic information of these different languages, such as existing English in one of all code words The code word also has the code word of Chinese; Even in each code word in all or part code words in a part of all code words, there can be the value of the splicing information of a plurality of different languages, and in this case, the pickup can generally The values of the combined information in different languages are respectively decoded and broadcast to the pronunciation of the sentence. For example, the value of each code word in some code words in a set of code words has the value of the combined information in English and the value of the combined information in Chinese. . The language mentioned in this paragraph may be any non-composite language or any compound language.

某拾音器可读的部诸码字可以是只包括一个码字,此拾音器读这一个码字就可播出语句发音,此部诸码字中的一个码字就是指这仅有的一个码字。The readable codewords of a certain pickup can only include a codeword, and the pronunciation of the sentence can be broadcasted by reading this codeword by the pickup, and a codeword in this part of the codewords refers to this unique codeword .

另,用作有声读物主体的读物除了可以是书外,还可以是书籍,印有图文的卡片,报纸杂志或地图等(读物自身范围内是与拼合信息码无关的,如读物的整页面上形成有拼合信息码,那读物整页面上的这些拼合信息码与读物就一起形成有声读物),有声读物的整页面上的码字(即拼合信息码)供拾音器读取。(按常识,书籍包含书的概念,但可看成书与书籍是等同概念)In addition, besides the book, the reading material used as the main body of the audiobook can also be a book, a card printed with pictures and texts, a newspaper, a magazine or a map, etc. There is a combined information code formed on the reading material, and these combined information codes on the entire page of the reading material and the reading material together form an audio book), and the code words (that is, the combined information code) on the entire page of the audio book are read by the pickup. (According to common sense, books contain the concept of books, but it can be regarded as the same concept as books)

码字所含的信息位长度,并无特别限制,一般短码字可携带表示短语句发音的拼合信息的值,长码字可携带表示长语句发音的拼合信息的值。The length of the information bits contained in the codeword is not particularly limited. Generally, the short codeword can carry the value of the combination information representing the pronunciation of a short sentence, and the long codeword can carry the value of the combination information representing the pronunciation of a long sentence.

本发明文件各处所指语音编码有声读物,其主体为读物,在读物的整页面上有诸个码字(但不一定是读物的每个页面都有码字),在读物的多个页面上可以同时有码字(如某书的第2,3,5,6几个页面都印有条形码),任一页面上的码字数量可以是零个或一个或多个,当然各页面的各码字的码值一般是各异的,各码字的码值是共与数个表示如何用音码拼合成语句发音的拼合信息的值有关的,甚至每个页面都可有码字,有的页面如不需发音可没码字;所以本发明文件各处所说的‘在读物的整页面上有诸个码字’,可能是读物的每个页面上都有码字,也可能是只有读物的部分页面上有码字。读物的整页面上的码字,可以是导电码,条形码,图形码,磁码或其它码种的码字。翻开各页,由拾音器读码,并处理播出语句发音。对于语音编码有声读物的整页面上的各码字涉及的某一语种的各拼合信息的值,一般都是由编码规则中有关此语种的含有压缩编码处理的编码细则来编码生成的(比如一本语音编码有声读物的整页面上只有英语语种的码字,这些英语语种的码字涉及的各拼合信息的值是用含有压缩编码处理的只适用于英语的一种编码规则来编码生成的,只适用于英语的与此编码规则对应的解码规则也含有压缩解码处理;再比如一本语音编码有声读物的整页面上有多个语种的码字,每个语种的码字涉及的各拼合信息的值是用只适用于此语种的一种编码规则来编码生成的,分别只适用于各语种的各编码规则都是含有压缩编码处理的,拾音器涉及的分别只适用于各语种的与这些编码规则对应的各解码规则也是含有压缩解码处理的)。(本段说的‘编码规则中有关此语种的含有压缩编码处理的编码细则’,其中‘编码规则中有关此语种’是修饰‘编码细则’的,‘含有压缩编码处理’也是修饰‘编码细则’的)The voice-coded audiobook referred to in the documents of the present invention, its main body is the reading material, there are various code words on the whole page of the reading material (but not necessarily every page of the reading material has a code word), on multiple pages of the reading material There can be codewords on it at the same time (such as the 2nd, 3rd, 5th, and 6th pages of a certain book are all printed with barcodes), and the number of codewords on any page can be zero or one or more. Of course, the number of codewords on each page The code value of each code word is generally different, and the code value of each code word is altogether relevant with the value of the information of how to piece together into sentence pronunciation with several representations, even each page all can have code word, Some pages do not have codewords if they do not need to be pronounced; so 'there are codewords on the whole page of the reading material' said everywhere in the document of the present invention may be that codewords are arranged on each page of the reading material, and may also be Only some pages of the reading material have code words. The codewords on the entire page of the reading material can be conductive codes, barcodes, graphic codes, magnetic codes or other codewords. Turn over each page, read the code by the pickup, and process the pronunciation of the broadcast sentence. For the value of each splicing information of a certain language involved in each code word on the entire page of the speech coded audiobook, it is generally all coded and generated by the coding rules related to this language in the coding rules that contain compression coding processing (such as a The entire page of this phonetic coded audiobook has only codewords in English language, and the values of the stitching information involved in these codewords in English language are coded and generated by using a coding rule that is only applicable to English and contains compression coding processing. The decoding rule corresponding to this encoding rule, which is only applicable to English, also includes compression decoding processing; another example is that there are codewords in multiple languages on the entire page of a speech-encoded audiobook, and the codewords in each language involve various splicing information The value of is encoded by a coding rule that is only applicable to this language. Each coding rule that is only applicable to each language contains compression coding processing. Each decoding rule corresponding to the rule also includes compression decoding processing). (This paragraph says "encoding rules about this language in the encoding rules that contain compression encoding processing", wherein "in the encoding rules about this language" is a modification of "encoding rules", and "includes compression encoding processing" is also a modification of "encoding rules" 'of)

直播方式拾音器与TTS方式拾音器的硬件技术特征基本相同,从下一段到第125页所述的拾音器的硬件技术特征各内容(第125页对图1拾音器主体实施例的阐述不算在硬件技术特征内),默认是针对直播方式拾音器的,这些硬件技术特征中除了少数外其余也都是适用于TTS方式拾音器,在从下一段到第125页不适用于TTS方式拾音器的地方是有专门注明TTS方式拾音器(或用TTS产音方式)会怎样的,没有作这样注明的硬件技术特征是适用于TTS方式拾音器的。The hardware technical features of the live mode pickup and the TTS mode pickup are basically the same, from the next paragraph to the hardware technical characteristics of the pickup described on page 125 (page 125 is not counted as the hardware technical feature In), the default is for live broadcast pickups, except for a few of these hardware technical features, the rest are also applicable to TTS pickups, from the next paragraph to page 125 where it is not applicable to TTS pickups are specially marked What will happen to the TTS mode pickup (or use the TTS sound production method), the hardware technical characteristics that are not marked in this way are applicable to the TTS mode pickup.

所使用的拾音器的结构,特点:The structure of the pickup used, features:

本发明文件涉及的拾音器,有非插卡方式拾音器,插数据卡方式拾音器和插部件式数据卡方式拾音器三种,非插卡方式拾音器是指不需插数据卡也不需插部件式数据卡的非插卡方式拾音器主体本身(即非插卡方式拾音器与非插卡方式拾音器主体是等同的),插数据卡方式拾音器是指把数个数据卡插入插数据卡方式拾音器主体使这数个数据卡与插数据卡方式拾音器主体合为插数据卡方式拾音器,插部件式数据卡方式拾音器是指把数个部件式数据卡插入插部件式数据卡方式拾音器主体使这数个部件式数据卡与插部件式数据卡方式拾音器主体合为插部件式数据卡方式拾音器;如阐述时无特别指明有他义,本说明书各段阐述的有关拾音器的技术,默认是针对非插卡方式拾音器和插数据卡方式拾音器的,而插部件式数据卡方式拾音器的各技术特征可参照本说明书对非插卡方式拾音器和插数据卡方式拾音器的技术描述,有声读物与插部件式数据卡方式拾音器的配合使用关系也可参照本说明书对有声读物与非插卡方式拾音器(或插数据卡方式拾音器)的配合使用关系的描述。如提及时无特别指明有他义,本说明书各处提及的拾音器,默认是指非插卡方式拾音器或插数据卡方式拾音器。而且如提及时无特别指明有其他情况,本说明书各段提及插数据卡方式拾音器时,默认是只以把一个数据卡插入一个插数据卡方式拾音器主体来形成此插数据卡方式拾音器为例的;如提及时无特别指明有其他情况,本说明书各段提及插部件式数据卡方式拾音器时,默认是以只把一个部件式数据卡插入一个插部件式数据卡方式拾音器主体来形成此插部件式数据卡方式拾音器为例的。The pickups involved in the document of the present invention include non-insert-card pickups, data-card-plug pickups, and plug-in data card pickups. The non-card pickup means that neither a data card nor a plug-in data card is required. The main body of the non-inserting card mode pickup itself (that is, the main body of the non-inserting card mode pickup and the non-inserting card mode pickup is equivalent), and the data card mode pickup refers to inserting several data cards into the main body of the data card mode pickup to make these several The main body of the pickup of the data card and the plug-in data card mode is combined into a pickup of the plug-in data card mode. Combined with the main body of the plug-in data card pickup, it is a plug-in data card pickup; if there is no special meaning in the explanation, the technologies related to the pickup described in each paragraph of this manual are by default for non-plug-in pickups and plug-in pickups. The data card mode pickup, and the technical characteristics of the plug-in data card mode pickup can refer to the technical description of the non-plug-in card mode pickup and the data card mode pickup in this manual, and the cooperation between the audiobook and the plug-in data card mode pickup The use relationship can also refer to the description of the cooperative use relationship between the audiobook and the non-inserted card mode pickup (or the data card mode pickup) in this manual. If there is no other meaning when it is mentioned, the pickups mentioned in this manual refer to non-card pickups or data card pickups by default. And if there are no other circumstances specified when mentioning, when the data card insertion pickup is mentioned in each paragraph of this manual, the default is to insert a data card into a data card insertion pickup body to form the data card insertion pickup as an example. If there are no other circumstances specified when mentioning, when each paragraph of this manual refers to the pickup of the plug-in data card mode, the default is to only insert a component data card into the main body of the pickup of the plug-in data card mode to form the pickup. Take the plug-in data card pickup as an example.

除非提及时有特别指明有他义或可从提及处的上下文推出有他义,本发明文件各处述及非插卡方式时,默认提及的拾音器是指非插卡方式拾音器,默认提及的拾音器主体是指非插卡方式拾音器主体。除非提及时有特别指明有他义或可从提及处的上下文推出有他义,本发明文件各处述及插数据卡方式时,默认提及的拾音器是指插数据卡方式拾音器,默认提及的拾音器主体是指插数据卡方式拾音器主体,默认提及的卡是指数据卡。除非提及时有特别指明有他义或可从提及处的上下文推出有他义,本发明文件各处述及插部件式数据卡方式时,默认提及的拾音器是指插部件式数据卡方式拾音器,默认提及的拾音器主体是指插部件式数据卡方式拾音器主体,默认提及的卡是指部件式数据卡。Unless it is specifically indicated that there are other meanings when it is mentioned or it can be deduced from the context of the mention that it has other meanings, when the document of the present invention mentions the non-card mode everywhere, the pickup mentioned by default refers to the non-card mode pickup, and the default mention And the main body of the pickup refers to the main body of the non-card pickup. Unless it is mentioned that it has other meanings or can be deduced from the context of the mentions, when the document of the present invention refers to the way of inserting data cards, the pickup mentioned by default refers to the pick-up of the way of inserting data cards. And the main body of the pickup refers to the main body of the pickup in the way of inserting the data card, and the card mentioned by default refers to the data card. Unless it is specifically indicated that there are other meanings or it can be deduced from the context of the mention, when the document of the present invention refers to the plug-in data card method, the pickup mentioned by default refers to the plug-in data card method Pickup, the pickup body mentioned by default refers to the pickup body of the plug-in data card method, and the card mentioned by default refers to the component data card.

非插卡方式拾音器或插数据卡方式拾音器的拾音器主体的电路实现:拾音器主体一般包括一个读码头,一个单片机芯片(拾音器的主处理芯片),一个发音器件,存储器,按键模块等。(按键模块一般由一个或多个按键组成,如由超过两个按键组成,那一般是接成按键矩阵形式的,这是常识,并且对于拾音器主体来说,按键模块不是必需的;这里的存储器是指单片机芯片外拾音器主体内的存储器,如单片机芯片内无存储器而且拾音器主体不可插数据卡,那单片机芯片外拾音器主体内的存储器就是必需的,如单片机芯片内有存储器或如单片机芯片内无存储器但拾音器主体可插数据卡,那单片机芯片外拾音器主体内的存储器是可根据情况可有可无的,这里有关单片机芯片外拾音器主体内的存储器取舍的处置也属于技术常识)The circuit realization of the pickup main body of the non-card mode pickup or the data card pickup: the pickup main body generally includes a reading terminal, a single-chip microcomputer chip (the main processing chip of the pickup), a pronunciation device, memory, key module, etc. (The button module is generally composed of one or more buttons. If it is composed of more than two buttons, it is generally connected in the form of a button matrix. This is common sense, and for the main body of the pickup, the button module is not necessary; the memory here It refers to the memory in the main body of the pickup outside the single-chip microcomputer chip. If there is no memory in the single-chip microcomputer chip and the main body of the pickup cannot be inserted with a data card, then the memory in the main body of the pickup outside the single-chip microcomputer chip is necessary. If there is memory in the single-chip microcomputer chip or if there is no memory in the single-chip microcomputer chip Memory but the main body of the pickup can be plugged with a data card, the memory in the main body of the pickup outside the single-chip microcomputer chip can be dispensable according to the situation.

本发明文件各处所指单片机芯片(即拾音器的主处理芯片),可以是MCU,CPU等,一般是指MCU,所以本说明书在描述中以MCU作为单片机芯片的实例。本发明文件各处有提及的MCU(采用大写字母的,其前面没‘主’字的)或主MCU(其中的字母是采用大写字母的)或主MCU芯片(其中的字母是采用大写字母的)都是代指作为拾音器的主处理芯片的单片机芯片,而采用小写字母的mcu,也是指单片机芯片但不是专指作为拾音器的主处理芯片的单片机芯片的;如提及时无特别指明有他义,本发明文件各处提及的单片机芯片默认是作为拾音器的主处理芯片的单片机芯片,但如是插部件式数据卡的拾音器方案,其作为主处理芯片的单片机芯片不一定在拾音器主体内而是可在部件式数据卡上。The document of the present invention refers to single-chip microcomputer chip (being the main processing chip of pickup) everywhere, can be MCU, CPU etc., generally refers to MCU, so this specification sheet uses MCU as the example of single-chip microcomputer chip in description. There are MCUs mentioned everywhere in the document of the present invention (in uppercase letters, without the word 'main' in front of it) or main MCU (in which letters are in uppercase letters) or main MCU chip (in which letters are in uppercase letters) ) all refer to the single-chip microcomputer chip as the main processing chip of the pickup, and the mcu in lowercase letters also refers to the single-chip microcomputer chip but not specifically refers to the single-chip microcomputer chip as the main processing chip of the pickup; if there is no special indication when it is mentioned, there are other In other words, the single-chip microcomputer chip mentioned everywhere in the document of the present invention is the single-chip microcomputer chip as the main processing chip of the pickup by default, but as the pickup scheme of the plug-in data card, the single-chip microcomputer chip as the main processing chip is not necessarily in the pickup main body but Yes Available on Component Data Cards.

对作为拾音器的主处理芯片的单片机芯片的一般要求:单片机芯片本身有播语音功能(指在解码采用直接产音方式时由单片机芯片播放比如ADPCM格式的录音数据),或可外接语音芯片,单片机芯片需有一些IO脚完成键扫描,读码等功能。(如拾音器解码时总采用TTS产音方式则不需单片机芯片本身有采用直接产音方式播语音功能)General requirements for the single-chip microcomputer chip as the main processing chip of the pickup: the single-chip microcomputer chip itself has the function of broadcasting voice (referring to the recording data in the ADPCM format played by the single-chip microcomputer chip when the decoding adopts a direct sound production method), or an external voice chip, single-chip microcomputer The chip needs to have some IO pins to complete key scanning, code reading and other functions. (If the pickup always uses the TTS sound production method when decoding, it does not need the single-chip microcomputer chip itself to use the direct sound production method to play the voice function)

读码头是用于读取语音编码有声读物的整页面上的码字的码值的,可以是光电式读码头(比如光笔或CCD读码头)读取有声读物的整页面上的条形码或导电码,也可以是本发明人发明的机电式导电码读码头读取有声读物的整页面上的导电码,或其它形式的读码头读取有声读物的整页面上的其它码种的码字,被读入的码字(条形码,导电码等)的码值是共与数个拼合信息的值有关的,这些拼合信息的值是按编码规则(比如前面所述的直播方式的以只适用于英语为例的或以只适用于汉语为例的编码规则)来形成的。读码头上一般是没有小mcu的(比如俯图6所示的导电码甲读码头1或俯图1所示的光笔),另在读码头里也可有小mcu,这时可把读码头当成有小mcu的读码头,另也可把此小mcu与作为主处理芯片的MCU一齐当成一个大的MCU并且这时可把原来读码头里除去小mcu的部分当成读码头。The reading terminal is used to read the code value of the code word on the entire page of the audio-coded audiobook, which can be a photoelectric reading terminal (such as a light pen or a CCD reading terminal) to read the barcode or conductive code on the entire page of the audiobook , it can also be the electromechanical conductive code reading dock invented by the inventor to read the conductive code on the entire page of the audiobook, or other forms of reading docks to read the codewords of other code types on the entire page of the audiobook, and be read The code value of the read-in codeword (barcode, conductive code, etc.) is related to the value of several pieces of combined information. For example or with the encoding rules that are only applicable to Chinese as an example) to form. There is generally no small mcu on the reading dock (such as the conductive codeA reading dock 1 shown in Figure 6 above or the light pen shown in Figure 1), and there may also be a small MCU in the reading dock. At this time, the reading dock can be regarded as There is a reading port of a small mcu, and this small mcu and the MCU as the main processing chip can also be regarded as a large MCU and at this time the part of the original reading port that removes the small mcu can be used as a reading port.

很显然,本发明文件各处所指的存储器,是指其存储的数据在掉电后不消失的存储器,比如ROM,也可以是EPROM,EEPROM或FLASH等,而且本发明文件各处所指一个存储器,一般是指一个与外界相对独立的一定数量存储单元的集合,比如一个存储器芯片;而且按常识一个单片机工作系统中可有一个存储器或可有处于不同位置的多个存储器,比如单片机芯片内部和外部各有存储器,或单片机芯片内无存储器需外接存储器。而象RAM这样的暂存器,按常识默认包含在单片机芯片里或作为单片机芯片周边的惯例附加芯片,本发明文件中不再特别单独提及。本段所述都是技术常识。Obviously, the memory referred to in the documents of the present invention refers to the memory whose stored data does not disappear after power failure, such as ROM, which can also be EPROM, EEPROM or FLASH, etc., and the documents of the present invention refer to a Memory generally refers to a collection of a certain number of storage units that are relatively independent from the outside world, such as a memory chip; and according to common sense, there can be one memory or multiple memories in different locations in a single-chip microcomputer operating system, such as inside a single-chip microcomputer chip There is memory separately with the outside, or there is no memory in the single-chip microcomputer chip and an external memory is required. And temporary registers like RAM are included in the single-chip microcomputer chip by default by common sense or as a conventional additional chip around the single-chip microcomputer chip, and are no longer specifically mentioned separately in the documents of the present invention. This paragraph is all technical knowledge.

本发明文件有关存储器集的概念属于常识,也是显而易见的。The concept of storage sets in the present invention document is common sense and obvious.

在权利要求2,10,19,22中提及的‘此存储器集里的各存储器可被前述拾音器的前述单片机芯片读取使用’,是指,既可以是在单片机芯片外的存储器,可通过串行连线或并行总线等方式与单片机芯片连接并被单片机芯片读取使用,也可以是位于单片机芯片内的存储器(芯片),可被单片机芯片本身读取使用,这涉及的本来也是技术常识。(如是插数据卡方式拾音器,那插入拾音器主体的数据卡上的存储器显然也可以是通过串行连线或并行总线等方式通过卡插槽与单片机芯片连接并被拾音器主体内的单片机芯片读取使用的;对于插部件式数据卡方式拾音器,部件式数据卡上的非单片机芯片内的存储器显然也可以是通过串行连线或并行总线等方式与单片机芯片连接并被单片机芯片读取使用的,此单片机芯片可以是处于拾音器主体内或部件式数据卡上)Inclaim 2, 10, 19, the 'each memory in this memory set can be read and used by the aforementioned single-chip microcomputer chip of the aforementioned pickup' mentioned in claim 22, means that both can be the memory outside the single-chip microcomputer chip, can be passed through The serial connection or parallel bus is connected to the single-chip microcomputer chip and read by the single-chip microcomputer chip. It can also be a memory (chip) located in the single-chip microcomputer chip, which can be read and used by the single-chip microcomputer chip itself. This involves technical common sense . (if it is a pick-up in the form of a data card, the memory on the data card inserted into the main body of the pick-up can obviously also be connected to the single-chip microcomputer chip through a card slot through a serial connection or a parallel bus, and read by the single-chip microcomputer chip in the main body of the pick-up Used; for the plug-in data card mode pickup, the memory in the non-single-chip microcomputer chip on the component data card can obviously also be connected with the single-chip microcomputer chip through a serial connection or a parallel bus and read by the single-chip microcomputer chip. , this microcontroller chip can be in the main body of the pickup or on a component data card)

本发明文件所说的有用信息是供单片机芯片使用的信息,比如程序,数据等。The useful information mentioned in the document of the present invention is the information used by the single-chip microcomputer chip, such as program, data and so on.

对于非插卡方式拾音器,本发明文件各处所说的与其有关的存储器集,是指作为拾音器的主处理芯片的单片机芯片能读取的所有存储器;存储器集可以是单片机芯片内的数个存储器,或可以是单片机芯片外拾音器主体内的数个存储器(一般是存储器芯片),或可以是单片机芯片内的数个存储器加上单片机芯片外拾音器主体内的数个存储器;拾音器涉及的程序及拾音器涉及的数据存储在此存储器集里;对于在单片机芯片外拾音器主体内的有些存储器,这些存储器能被单片机芯片读取,这些存储器属于存储器集,但可能没存储有用信息。For the non-plug-in card mode sound pick-up, the said memory sets relevant to it in various places of the documents of the present invention refer to all memories that can be read by the single-chip microcomputer chip as the main processing chip of the sound pick-up; The memory set can be several memories in the single-chip microcomputer chip, Or can be several memories (generally memory chips) in the main body of the pickup outside the single-chip microcomputer chip, or can be several memories in the single-chip microcomputer chip plus several memories in the pickup main body outside the single-chip microcomputer chip; The data stored in this memory set; For some memory in the main body of the pickup outside the single-chip microcomputer chip, these memories can be read by the single-chip microcomputer chip, and these memories belong to the memory set, but may not store useful information.

对于插数据卡方式拾音器,本发明文件各处所说的与其有关的存储器集,是指拾音器在读码发音使用时(即在把数据卡插入插数据卡方式拾音器主体使此卡和此拾音器主体合为此拾音器后),作为拾音器的主处理芯片的单片机芯片能读取的所有存储器;存储器集可以是单片机芯片内的数个存储器加上数据卡上的存储器,或可以是单片机芯片外拾音器主体内的数个存储器加上数据卡上的存储器,或可以是单片机芯片内的数个存储器加上单片机芯片外拾音器主体内的数个存储器加上数据卡上的存储器,或可以是数据卡上的存储器;拾音器涉及的程序及拾音器涉及的数据存储在此存储器集里;对于在单片机芯片外拾音器主体内的有些存储器,在读码使用时这些存储器能被单片机芯片读取,这些存储器属于存储器集,但可能没存储有用信息。数据卡上的存储器(一个数据卡上的各存储器是被当成一个存储器的)总是存储有有用信息且一定属于此存储器集;至于此数据卡上还有别的什么信息无所谓,甚至还可有别的种类拾音器的运行程序(此卡还可插到别的种类拾音器主体上运行),只要不干涉这里的插卡运行就行。(本段是以插一个数据卡到插数据卡方式拾音器主体而形成拾音器为例的,对于插多个数据卡到插数据卡方式拾音器主体而形成拾音器的情况,每个数据卡上的存储器都属于与此拾音器有关的存储器集)For the pickup of the data card mode, the said memory set related to it in the documents of the present invention refers to that the pickup is used when the code reading and pronunciation are used (that is, the data card is inserted into the data card mode pickup body so that the card and the pickup body are combined into one After this pickup), all memories that can be read by the single-chip microcomputer chip as the main processing chip of the pickup; the memory set can be several memory storages in the single-chip microcomputer chip plus the memory on the data card, or can be in the pickup main body outside the single-chip microcomputer chip Several memories plus the memory on the data card, or can be several memories in the single-chip microcomputer chip plus several memories in the pickup body outside the single-chip microcomputer chip plus the memory on the data card, or can be the memory on the data card; The programs related to the pickup and the data related to the pickup are stored in this memory set; for some memories in the main body of the pickup outside the single-chip microcomputer chip, these memories can be read by the single-chip microcomputer chip when reading codes. These memories belong to the memory set, but may not Store useful information. The memory on the data card (each memory on a data card is regarded as a memory) always stores useful information and must belong to this memory set; The operating program of other types of pickups (this card can also be inserted into the main body of other types of pickups to run), as long as it does not interfere with the operation of the plug-in card here. (This paragraph is an example of a pickup formed by inserting one data card into the main body of the pickup in the way of inserting data cards. For the situation of forming a pickup by inserting multiple data cards into the main body of the pickup in the way of inserting data cards, the memory on each data card is belongs to the memory set associated with this pickup)

拾音器(非插卡方式拾音器或插数据卡方式拾音器)在使用时,需用到存储器集,存储器集里存储有拾音器涉及的程序及拾音器涉及的数据(拾音器涉及的程序包括拾音器运行主控程序,解码有关程序等,拾音器涉及的数据包括有解码有关数据,还可能包括有其它数据),由单片机芯片执行存储器集里的程序来运行,调存储器集里的解码有关程序及解码有关数据来作解码;这也是技术常识。When the pickup (non-insert-card pickup or data-card pickup) is in use, a memory set is required, and the program and data related to the pickup are stored in the storage set (the programs involved in the pickup include the main control program of the pickup, Decoding-related programs, etc. The data involved in the pickup includes decoding-related data, and may also include other data), the single-chip microcomputer chip executes the program in the memory set to run, and adjusts the decoding-related programs and decoding-related data in the memory set for decoding ; This is also technical common sense.

拾音器涉及的程序(也是指作为拾音器的主处理芯片的单片机芯片涉及使用的程序),一般包括读码程序,拾音器运行主控程序,发音程序,解码有关程序等(如拾音器采用TTS产音方式那拾音器一般就不需发音的程序),还可包括其它语音处理等程序;其解码有关程序,可以是只适用于一个语种或适用于多个语种的解码有关程序,一般是指按只适用于一个语种或适用于多个语种的解码规则来对读入的码字涉及的拼合信息的值进行解码的一套解码程序。拾音器涉及的数据(也是指作为拾音器的主处理芯片的单片机芯片涉及使用的数据),包括解码有关数据,还可能包括诸音码的发音数据(如拾音器采用直接产音方式,一般需用到诸音码的发音数据),还可能包括其它数据;如拾音器采用TTS产音方式那拾音器就不需诸音码的发音数据;解码有关数据,是解码有关程序所使用的,一般是一套或多套解码程序用的解码数据,解码数据是指比如英语解码用的单词数据库,词根数据库等或比如汉语解码用的词数据库,单字拼音表等(解码有关数据,解码数据都不包括诸音码的发音数据)。如拾音器可解码多个语种,那每个语种都可能有一套解码数据和诸音码的发音数据,这时每个语种的一套解码数据和诸音码的发音数据都应包括在拾音器涉及的数据里(且每个语种的解码数据都包括在解码有关数据里),这时如对于每个语种,其解码程序是分开的,那每个语种的解码程序都应包括在拾音器涉及的程序里(且每个语种的解码程序都包括在解码有关程序里)。本段所说的一个语种,可以是任一非复合语种或任一复合语种;本段所说的多个语种,可以全是非复合语种或全是复合语种或既包含非复合语种也包含复合语种。The program involved in the pickup (also refers to the program involved in the use of the single-chip microcomputer chip as the main processing chip of the pickup), generally including the code reading program, the pickup running the main control program, the pronunciation program, the decoding related program, etc. (if the pickup adopts the TTS sound production method, then Pickup generally just does not need the program of pronunciation), also can comprise other programs such as speech processing; Its decoding relevant program can be only applicable to one language or be applicable to the decoding relevant program of multiple languages, generally refers to by only being applicable to one A set of decoding procedures for decoding the value of the stitched information involved in the read-in codeword according to the language or the decoding rules applicable to multiple languages. The data related to the pickup (also referring to the data related to the use of the single-chip microcomputer chip as the main processing chip of the pickup), including decoding relevant data, may also include the pronunciation data of all sound codes (as the pickup adopts the direct sound production mode, generally need to use various The pronunciation data of sound sign indicating number), also may comprise other data; As pick-up adopts the TTS sound production mode that pick-up just does not need the pronunciation data of all sound sign indicating numbers; Decode relevant data, is that decoding relevant program uses, is generally one or more Cover the decoding data that decoding program is used, and decoding data refers to such as word database that English decoding is used, root database etc. or such as the word database that Chinese decoding is used, single-character phonetic table etc. pronunciation data). Can decode multiple languages as the pickup, then each language may have a set of decoding data and the pronunciation data of all phonetic codes, at this moment a set of decoding data of each language and the pronunciation data of all phonetic codes should be included in the relevant audio pickup In the data (and the decoding data of each language is included in the decoding related data), at this time, if the decoding program is separate for each language, then the decoding program of each language should be included in the program involved in the pickup (and the decoding program of each language is included in the decoding related program). A language mentioned in this paragraph may be any non-compound language or any compound language; multiple languages mentioned in this paragraph may be all non-compound languages or all compound languages, or include both non-compound languages and compound languages .

如提及时无特别指明含有他义(涉及插部件式数据卡方式拾音器时属于含有他义),本发明文件各处提及的‘拾音器主体内有存储器’,默认一般可有三种情况:1是MCU片内有存储器,MCU片外拾音器主体内无存储器,2是MCU片内无存储器,MCU片外拾音器主体内有存储器,3是MCU片内有存储器,MCU片外拾音器主体内也有存储器;如提及时无特别指明含有他义(涉及插部件式数据卡方式拾音器时属于含有他义),本发明文件各处提及的‘拾音器主体内无存储器’,默认是指MCU片内以及MCU片外拾音器主体内都无存储器。If there is no special meaning when it is mentioned (it contains other meanings when it involves the plug-in data card mode pickup), the "storage in the pickup body" mentioned in the document of the present invention generally has three situations by default: 1 is There is a memory in the MCU chip, but there is no memory in the main body of the MCU off-chip pickup. 2. There is no memory in the MCU chip. There is a memory in the main body of the MCU off-chip pickup. When mentioned, there is no special indication that it contains other meanings (it contains other meanings when it involves plug-in data card mode pickups), and the "no memory in the pickup body" mentioned throughout the document of the present invention refers to the inside of the MCU chip and the outside of the MCU chip by default. There is no memory inside the pickup body.

与上段对应的,如提及时无特别指明含有他义(涉及插部件式数据卡方式拾音器时属于含有他义),本发明文件各处提及的‘拾音器主体内的存储器’,默认一般也可有三种情况:1是只指MCU片内的数个存储器(要求MCU片内有存储器),2是只指MCU片外拾音器主体内的数个存储器(要求MCU片外拾音器主体内有存储器),3是指MCU片内的数个存储器加上MCU片外拾音器主体内的数个存储器(要求MCU片外拾音器主体内有存储器而且MCU片内也有存储器);如拾音器主体内有存储器,那拾音器主体内的存储器,显然有一个或多个。(数据卡不是拾音器主体部件,不属于拾音器主体,数据卡与拾音器主体是不同的产品)Corresponding to the previous paragraph, if there is no special meaning when it is mentioned (it contains other meanings when it involves a plug-in data card pickup), the 'storage in the pickup body' mentioned in the documents of the present invention can generally be used by default. There are three situations: 1 refers only to several memories in the MCU chip (requiring that there is a memory in the MCU chip), and 2 refers to only several memories in the main body of the MCU off-chip pickup (requiring that there is a memory in the main body of the MCU off-chip pickup), 3 refers to several memories in the MCU chip plus several memories in the main body of the MCU off-chip pickup (it is required that there is a memory in the main body of the off-chip pickup of the MCU and there is also a memory in the MCU chip); if there is a memory in the main body of the pickup, then the main body of the pickup Within the memory, there are obviously one or more. (The data card is not a part of the main body of the pickup, and does not belong to the main body of the pickup. The data card and the main body of the pickup are different products)

一个数据卡上显然也可有一个或多个存储器,但可把一个数据卡上的全部存储器(一个或多个存储器)合起来等同当成是一个存储器(大存储器)。本发明文件各处提及的‘数据卡上的存储器’就是指一个大存储器(此大存储器相当于此数据卡上的全部存储器合起来而成的)。A data card can obviously also have one or more memories, but all the memories (one or more memories) on a data card can be collectively regarded as one memory (large memory). The 'memory on the data card' mentioned throughout the documents of the present invention refers to a large memory (this large memory is equivalent to the combination of all memory on the data card).

如拾音器是非插卡方式拾音器,那拾音器主体(也即拾音器)内须有存储器(这些存储器的集合就是与此拾音器有关的存储器集),拾音器涉及的程序及涉及的数据,存储在拾音器主体内的一个或多个存储器里(至于如何把拾音器涉及的程序及涉及的数据分散存储到多个存储器里是根据需要来定的,这是技术常识)。If the pickup is a non-plug-in pickup, there must be a memory in the pickup body (that is, the pickup) (the collection of these memories is the storage set related to the pickup), and the programs involved in the pickup and the data involved are stored in the pickup body. In one or more memories (as for how to disperse and store the programs involved in the pickup and the data involved in multiple memories according to needs, this is technical common sense).

如拾音器是插数据卡方式拾音器,数据卡上有存储器,而其拾音器主体内有存储器或没有存储器,此数据卡是与此拾音器主体配合使用(插入使用)的,如果此拾音器主体内有存储器(本发明文件默认数据卡和拾音器主体是两个不同产品,数据卡不属于拾音器主体本身的部件),那拾音器涉及的程序及涉及的数据,可存储在数据卡上的存储器里和拾音器主体内的一个或多个存储器里的;另如拾音器主体内有存储器但可不使用,可把拾音器涉及的程序及涉及的数据,都存储在数据卡上的存储器里;如拾音器主体内无存储器,那拾音器涉及的程序及涉及的数据,就存储在数据卡上的存储器里。至于本段叙述中有涉及到的把拾音器涉及的程序及涉及的数据如何分散存储到数据卡上的存储器里和拾音器主体内的一个或多个存储器里(也即如何分散存储到与拾音器有关的存储器集里)是根据需要来定的,这是技术常识,权利要求21提及的‘所述拾音器涉及的程序及所述拾音器涉及的数据存储在所述存储器集里’也是与此有关的,当然无论如何分配存储程序和数据,都默认要求至少有部分拾音器涉及的程序或部分拾音器涉及的数据存储到数据卡上,否则这个卡就不是与此拾音器主体配合使用的数据卡了。对于插数据卡方式拾音器,一般是在拾音器主体内存储拾音器的读码,发音等程序而在数据卡上存储解码有关程序及解码有关数据及诸音码的发音数据等(如插数据卡方式拾音器采用TTS产音方式,那就不需诸音码的发音数据)。关于数据卡的技术阐述在后面。(本段说的诸音码的发音数据一般就是指诸音码的录音数据,但如拾音器采用TTS产音方式的话那拾音器涉及的数据就不需包括诸音码的发音数据)If the pickup is a pickup with a data card, there is a memory on the data card, and there is a memory or no memory in the main body of the pickup. The document of the present invention defaults that the data card and the main body of the pickup are two different products, and the data card is not a part of the main body of the pickup), the programs and data involved in the pickup can be stored in the memory on the data card and in the main body of the pickup In one or more memories; if there is memory in the main body of the pickup but it can not be used, the program and data involved in the pickup can be stored in the memory on the data card; if there is no memory in the main body of the pickup, then the pickup involves The program and related data are stored in the memory on the data card. As for how to disperse and store the programs and related data involved in the pickup in the memory on the data card and one or more memories in the main body of the pickup (that is, how to disperse and store them in the pickup-related The storage set) is determined according to the needs, which is technical common sense, and the 'programs related to the pickup and the data related to the pickup are stored in the storage set' mentioned in claim 21 is also related to this, Of course, no matter how the storage programs and data are allocated, it is required by default that at least some of the programs related to the pickups or some of the data related to the pickups be stored on the data card, otherwise this card is not a data card used with the main body of the pickup. For inserting the data card mode pickup, generally be to store the code reading of the pickup in the pickup main body, programs such as pronunciation and store decoding relevant program and decoding relevant data and the pronunciation data etc. of all sound codes etc. Adopt the TTS sound production mode, then do not need the pronunciation data of all sound codes). The technical description of the data card will be described later. (the pronunciation data of all sound codes mentioned in this paragraph generally refers to the recording data of all sound codes, but if the pickup adopts the TTS sound production mode, then the data that the pickup involves does not need to include the pronunciation data of all sound codes)

以上七段所述都是技术常识。The above seven paragraphs are all technical common sense.

本发明文件各处述及有关某一拾音器的相关技术(比如拾音器主体本身或与拾音器主体配合使用的数据卡或部件式数据卡的技术)时提及的有用信息,显然是指拾音器涉及的程序,或拾音器涉及的数据,或部分拾音器涉及的程序,或部分拾音器涉及的数据,或拾音器涉及的程序加上部分拾音器涉及的数据,或部分拾音器涉及的程序加上拾音器涉及的数据,或部分拾音器涉及的程序加上部分拾音器涉及的数据,或拾音器涉及的程序加上拾音器涉及的数据。(本段内除第1句外提及的拾音器都是指本段内第1句提及的拾音器)The useful information mentioned in the document of the present invention when referring to the related technology of a certain pickup (such as the technology of the pickup body itself or the data card or component data card used in conjunction with the pickup body) obviously refers to the program involved in the pickup , or the pickup-related data, or part of the pickup-related program, or part of the pickup-related data, or the pickup-related program plus part of the pickup-related data, or part of the pickup-related program plus the pickup-related data, or part of the pickup The program referred to plus the data referred to by some pickups, or the program referred to by the pickup plus the data referred to by the pickup. (The pickups mentioned in this paragraph except the first sentence refer to the pickups mentioned in the first sentence of this paragraph)

如提及时无特别指明有他义,本发明文件各处提及的拾音器涉及的程序默认是指拾音器涉及使用的所有程序(也是指作为拾音器的主处理芯片的单片机芯片涉及使用的所有程序);如提及时无特别指明有他义,本发明文件各处提及的拾音器涉及的数据默认是指拾音器涉及使用的所有数据(也是指作为拾音器的主处理芯片的单片机芯片涉及使用的所有数据)。If there is no special indication when mentioning, there are other meanings, and the programs involved in the pickups mentioned in the documents of the present invention refer to all the programs that the pickups use by default (also refer to all the programs that the single-chip microcomputer chip as the main processing chip of the pickups involves in use); There is no special meaning when mentioning, the data that the pickup mentioned in the document of the present invention relates to all data that the pickup relates to using by default (also refers to all the data that the single-chip microcomputer chip as the main processing chip of the pickup relates to and uses).

一般是把拾音器做成一个拾音器读一个语种码字,这时在存储器集里需存储只适用于此语种的解码程序及解码数据,在存储器集里一般还需存储有此语种的诸音码的发音数据(在拾音器是直播方式的拾音器时);也可以把拾音器做成一个拾音器可以读多个语种码字,比如既可读英语也可读汉语语种码字,这时在存储器集里一般需分别存储有只适用于各自语种的各套解码程序及解码数据(也可以是存储有可适用于多个语种的一套解码程序及解码数据),而且在存储器集里一般还需存储有各语种各自的诸音码的发音数据(在拾音器是直播方式的拾音器时),MCU可以对各语种拼合信息的值分别解码,而有声读物的整页面上可有一个或多个语种的码字,有声读物的整页面上的诸个码字中的各码字涉及的各拼合信息的值里一般带有表示其拼合信息的值是哪个语种的信息。本段所说的一个语种可以是任一非复合语种或任一复合语种,本段所说的多个语种可以全是非复合语种或全是复合语种或既包括非复合语种也包括复合语种。Generally, the pick-up is made into a pick-up to read a code word of a language. At this time, the decoding program and decoding data only applicable to this language need to be stored in the memory set, and the sound codes of this language generally need to be stored in the memory set. Pronunciation data (when the pickup is the pickup of the live mode); also the pickup can be made into a pickup that can read multiple language codewords, such as both English and Chinese language codewords can be read, at this time generally need in the memory set Each set of decoding programs and decoding data applicable to their respective languages is stored separately (or a set of decoding programs and decoding data applicable to multiple languages can also be stored), and various languages are generally stored in the storage set. The pronunciation data of respective various phonetic codes (when the pickup is a pickup in a live broadcast mode), the MCU can decode the values of the combined information in each language respectively, and there can be codewords of one or more languages on the entire page of the audiobook, and the audiobook The value of each piece of information involved in each of the codewords on the entire page of the reading material generally has information indicating which language the value of the piece of information is. A language mentioned in this paragraph can be any non-compound language or any compound language, and the multiple languages mentioned in this paragraph can be all non-compound languages or all compound languages, or include both non-compound languages and compound languages.

本发明文件提及的拾音器涉及的发音程序与解码程序里涉及的发音处理一般是有点区别的,解码程序里涉及的发音处理一般是按照解码规则中有关某语种的解码细则涉及的发音方式(比如全用直接发音方式)来作的,比如解码时用整步直接发音方式,用解码程序对一个拼合信息的值进行解码,解码最后作的发音处理就是调发音程序把预备发音数据里各音码的发音信息数据顺次转成语音以形成语句的全部发音,拾音器涉及的发音程序是更接近硬件的,比如可以是由MCU用发音程序调用与拾音器有关的存储器集里的诸音码的录音数据里的一些音码的录音数据(比如ADPCM格式的音标或拼音的录音数据)来驱动在MCU芯片内与播放语音有关的硬件电路,并由此硬件电路驱动MCU芯片外接的喇叭或喇叭电路,由喇叭把语音发出来;或者是MCU外接语音芯片,由MCU用发音程序把从与拾音器有关的存储器集里的诸音码的录音数据得到的一些音码的录音数据(比如ADPCM格式的音标或拼音的录音数据),通过通讯线传给外接的语音芯片,由带喇叭的外接的语音芯片播出语音;而解码程序里涉及的发音处理一般是更高一级的,一般是分一次或多次调用发音程序来发音的,把发音程序当子程序来调用,这是常识。一般是在拾音器采用直接产音方式时,会用到发音程序;如拾音器采用TTS产音方式,那拾音器一般就不用发音程序,也不用诸音码的发音数据。The pronunciation process involved in the pickup mentioned in the document of the present invention is generally somewhat different from the pronunciation processing involved in the decoding program, and the pronunciation processing involved in the decoding program is generally according to the pronunciation mode involved in the decoding rules of a certain language in the decoding rules (such as All use the direct pronunciation method) to do, for example, use the full-step direct pronunciation method during decoding, decode the value of a piece of information with the decoding program, and the final pronunciation processing of decoding is to adjust the pronunciation program to convert each phonetic code in the prepared pronunciation data Pronunciation information data is converted into voice in turn to form the whole pronunciation of sentence, and the pronunciation program that pick-up relates is closer to hardware, such as can be called by MCU with the recording data of all sound codes in the memory set relevant to pick-up with pronunciation program The recording data of some phonetic codes (such as the phonetic symbol of ADPCM format or the recording data of pinyin) to drive the hardware circuit related to playing voice in the MCU chip, and thus the hardware circuit drives the external speaker or speaker circuit of the MCU chip, by The loudspeaker sends out the voice; or the MCU is externally connected to the voice chip, and the MCU uses the pronunciation program to obtain the recording data of some phonetic codes (such as phonetic symbols or pinyin in ADPCM format) The recording data) is transmitted to the external voice chip through the communication line, and the voice is played by the external voice chip with the speaker; the pronunciation processing involved in the decoding program is generally a higher level, usually divided into one or more times It is common sense to call the pronunciation program to pronounce, and to call the pronunciation program as a subroutine. Generally, when the pickup adopts the direct sound production method, the pronunciation program can be used; if the pickup adopts the TTS sound production method, then the pickup generally does not need the pronunciation program, nor the pronunciation data of all sound codes.

本发明文件各处在述及非插卡方式拾音器或插数据卡方式拾音器时提及的‘单片机芯片外拾音器主体内’显然是指‘单片机芯片外且拾音器主体内’;权利要求书各处在述及非插卡方式拾音器或插数据卡方式拾音器时提及的‘所述单片机芯片外所述拾音器主体内’显然是指‘所述单片机芯片外且所述拾音器主体内’;本发明文件各处在述及非插卡方式拾音器或插数据卡方式拾音器时提及的单片机芯片外拾音器主体内的存储器,显然一般就是指单片机芯片外挂的存储器芯片(此存储器是没在数据卡上的)。The document of the present invention refers to the "inside the main body of the pickup outside the chip of the single-chip microcomputer" when referring to the pickup of the non-plug-in card mode or the pickup of the data card mode, which obviously refers to "outside the chip of the single-chip microcomputer and in the main body of the pickup"; the claims are everywhere When referring to the non-inserting card mode pickup or the data card insertion pickup, the "inside the pickup body outside the single-chip microcomputer chip" obviously refers to "outside the single-chip microcomputer chip and inside the pickup body"; each of the documents of the present invention The memory in the main body of the pickup outside the single-chip microcomputer chip mentioned when referring to the non-card mode pickup or the data card mode pickup, obviously generally refers to the memory chip attached to the single-chip microcomputer chip (this memory is not on the data card).

单片机芯片(MCU),单片机芯片是拾音器的主处理芯片,是用于对整机的读码,解码,发音等的控制处理,一般使用一个MCU即可(一般需MCU本身可发语音);如分开使用多个mcu,是原来一个MCU方式的等同替换,权利要求2,10,19或22述及的拾音器可以是这种分开使用多个mcu的拾音器。Single-chip microcomputer chip (MCU), the single-chip microcomputer chip is the main processing chip of the pickup, which is used to control the code reading, decoding, pronunciation, etc. of the whole machine. Generally, one MCU can be used (generally, the MCU itself needs to be able to send voice); Using multiple MCUs separately is an equivalent replacement of the original MCU mode, and the pickup mentioned inclaim 2, 10, 19 or 22 can be the pickup that uses multiple MCUs separately.

本发明文件所涉及的有读码头(机电式导电码读码头或光电式读码头等)的拾音器,一般情况用一个MCU完成读码,读键及发音等任务即可;但如分开使用多个mcu也可,比如使用两个mcu,互相通讯,一个mcu(假设为主控mcu)完成读码及其它(如读键等)主要控制任务,另一个mcu(假设为从mcu)从通讯线接受主控mcu命令(比如主控mcu通过通讯线向从mcu传输其读码得到的拼合信息的值)来完成解码播放语句发音动作,也是可以的,从mcu具有解码并播放语句发音的能力比如:可按直接产音方式播放语句发音而且存储有诸音码的发音数据的存储器需能被从mcu读取以用于从mcu播放语句发音(此存储器可位于从mcu片内,或者此存储器是位于从mcu片外的存储器芯片并与从mcu连接使用)。此分开使用多个(两个或两个以上)mcu方案,因这多个mcu可分别看成是一个大的MCU的各个部分,可把这多个mcu一齐当成一个大的MCU,所以此分开使用多个mcu方案的本质与一个MCU方案同,是一个MCU方案的等同替换;比如可把本段前面说到的主控mcu和从mcu当成一个大的MCU,从mcu一般是外接喇叭来发音的,可把喇叭当成发音器件(喇叭也是子发音器件);权利要求2,10,19或22中提及的单片机芯片都可以是按这样由多个mcu组合而成的形式。甚至还有另一种方案:拾音器内有一个主控mcu和一个解码播放语句发音的从mcu(这两个mcu的性能使用及搭配及诸音码的发音数据在存储器里如何布置存储等情况如本段前面述的两个mcu的情况),读码头里也有一个小mcu,读码头的小mcu完成读码字并将读码结果传给主控mcu,主控mcu通过通讯线向从mcu传输读码得到的拼合信息的值,由从mcu解码播放语句发音,同样可把此主控mcu和从mcu当成一个mcu,并可把此mcu与读码头里的小mcu又当成一个大的MCU(这时可把原读码头里除去小mcu外的部分当成读码头),即把主控mcu,从mcu和读码头里的小mcu一齐当成一个大的MCU,所以此方案的本质与一个MCU方案同,是一个MCU方案的等同替换,而且从mcu一般是外接喇叭来发音的,可把喇叭当成发音器件(喇叭也是子发音器件),权利要求2,10,19或22中提及的单片机芯片都可以是按这样由多个mcu组合而成的形式。(本段所提及的码字是与拼合信息的值有关的码字;因本段前面提到的多个mcu可被当成一个MCU,所以本段提及的存储器仍是可被作为主处理芯片的MCU读取使用的存储器)The pickups that the documents of the present invention relate to have reading docks (electromechanical conductive code reading docks or photoelectric reading docks, etc.), generally use an MCU to complete code reading, read keys and pronunciation and other tasks; but as separate use of multiple MCU can also, for example, use two MCUs to communicate with each other. One MCU (assumed to be the main control MCU) completes code reading and other (such as key reading, etc.) It is also possible for the master control mcu command (for example, the master control mcu to transmit the value of the combined information obtained by reading the code to the slave mcu through the communication line) to complete the decoding and playback sentence pronunciation action, and the slave mcu has the ability to decode and play sentence pronunciation, such as: Can play sentence pronunciation by direct sound production mode and store the memory of the pronunciation data of all phonetic codes need to be able to be read from mcu for from mcu playback sentence pronunciation (this memory can be positioned at from mcu chip, or this memory is positioned at From the memory chip outside the mcu and connected with the mcu). This separates the use of multiple (two or more) MCU solutions, because these multiple MCUs can be regarded as various parts of a large MCU, and these multiple MCUs can be regarded as a large MCU, so this separate The essence of using multiple MCU solutions is the same as that of one MCU solution, and it is an equivalent replacement of an MCU solution; for example, the master MCU and slave MCU mentioned earlier in this paragraph can be regarded as a large MCU, and the slave MCU is usually an external speaker to pronounce Yes, the loudspeaker can be used as a sounding device (the loudspeaker is also a sub-sounding device); the single-chip microcomputer chip mentioned inclaim 2, 10, 19 or 22 can be formed by combining multiple mcu in this way. There is even another kind of scheme: a master control mcu and a decoding playback sentence pronunciation are arranged in the pickup from the mcu (the performance usage and collocation of these two mcu and how the pronunciation data of all sound codes are arranged and stored in the memory, etc. In the case of the two MCUs mentioned above in this paragraph), there is also a small MCU in the reading terminal. The small MCU of the reading terminal completes the code reading and transmits the code reading result to the master control mcu, and the master control mcu transmits it to the slave mcu through the communication line The value of the combined information obtained by reading the code is pronounced by the decoding and playback sentences of the slave mcu. The master control mcu and the slave mcu can also be regarded as one mcu, and the mcu and the small mcu in the reading dock can be regarded as a big MCU ( At this time, the part except the small mcu in the original reading terminal can be regarded as the reading terminal), that is, the main control mcu, the slave mcu and the small mcu in the reading terminal can be regarded as a large MCU, so the essence of this solution is the same as that of an MCU solution The same, it is an equivalent replacement of an MCU solution, and the mcu generally uses an external speaker to make a sound, and the speaker can be used as a sounding device (the speaker is also a sub-sounding device), the single-chip microcomputer chip mentioned inclaim 2, 10, 19 or 22 All can be in the form of combining multiple mcu in this way. (The code word mentioned in this paragraph is the code word related to the value of the information being assembled; because the multiple mcu mentioned earlier in this paragraph can be regarded as an MCU, so the memory mentioned in this paragraph can still be used as the main processing The chip's MCU reads the memory used)

单片机芯片(MCU)与读码头意向连接,单片机芯片(MCU)与发音器件意向连接。本发明文件中各处使用‘意向连接’词主要是为了撰写权利要求书的方便,本发明文件中各处提及的意向连接,是指直接连接或间接连接;单片机芯片与读码头意向连接即单片机芯片与读码头直接连接或间接连接,单片机芯片与发音器件意向连接即单片机芯片与发音器件直接连接或间接连接;比如读码头为光笔时,可以直接从读码头连线到MCU,是直接连接;而如读码头是机电式甲方式导电码读码头时,一般中间需有通断检测电路,即读码头先连线到通断检测电路,从通断检测电路再连线到MCU,所以读码头与MCU是间接连接,此技术点在2006102005537号申请文件里有描述,在本说明书有关图6的拾音器的实施例中,电路中的读码头1也是先连线到两片4021,从两片4021再连线到MCU,所以读码头与MCU是间接连接;实际上,有关读码头与MCU是间接连接的技术点也属显而易见的常识。MCU与发音器件一般是直接连接(比如MCU直接接喇叭电路,此点在本发明文件里都有记载,发音器件是喇叭电路;再比如MCU直接接带喇叭的TTS芯片也是直接连接,发音器件是带喇叭的TTS芯片),而如MCU是经过IO扩展芯片(比如类似于8255的芯片)来驱动语音芯片(或TTS芯片)来发语音声,中间多了个IO扩展芯片所以可称MCU与发音器件是间接连接,这时发音器件应是此带喇叭的语音芯片(或是此带喇叭的TTS芯片),此语音芯片(或TTS芯片)仍是MCU外接的语音芯片(或TTS芯片),此间接连接应是直接连接的等同替换,这也是单片机技术的常规替代技术。另外单片机芯片与读码头的间接连接,甚至可以是无线连接比如红外连接,其读码头较复杂,需自己读码,并需有无线通讯电路。The single-chip microcomputer chip (MCU) is intentionally connected with the reading terminal, and the single-chip microcomputer chip (MCU) is intentionally connected with the pronunciation device. In the document of the present invention, the word "intended connection" is used everywhere mainly for the convenience of writing claims, and the intentional connection mentioned in the document of the present invention refers to direct connection or indirect connection; The single-chip microcomputer chip is connected directly or indirectly to the reading terminal, and the intended connection between the single-chip microcomputer chip and the pronunciation device is the direct or indirect connection between the single-chip microcomputer chip and the pronunciation device; for example, when the reading terminal is a light pen, it can be directly connected from the reading terminal to the MCU ; And if the reading terminal is an electromechanical type A conductive code reading terminal, generally there needs to be an on-off detection circuit in the middle, that is, the reading terminal is first connected to the on-off detection circuit, and then connected to the MCU from the on-off detection circuit, so the reading The terminal and the MCU are indirectly connected. This technical point is described in the application document No. 2006102005537. In the embodiment of the pickup in this specification related to Figure 6, the readingterminal 1 in the circuit is also first connected to two pieces of 4021. From the two pieces of The 4021 is then connected to the MCU, so the reading terminal is indirectly connected to the MCU; in fact, the technical points about the indirect connection between the reading terminal and the MCU are also obvious common sense. MCU and pronunciation device are generally directly connected (for example MCU directly connects the speaker circuit, this point is all recorded in the document of the present invention, and the pronunciation device is the speaker circuit; For example, the TTS chip of MCU directly connecting band speaker is also directly connected, and the pronunciation device is TTS chip with speaker), and for example, MCU drives the voice chip (or TTS chip) to make voice through an IO expansion chip (such as a chip similar to 8255). There is an IO expansion chip in the middle, so it can be called MCU and pronunciation The device is an indirect connection. At this time, the pronunciation device should be the voice chip with the horn (or the TTS chip with the horn), and the voice chip (or TTS chip) is still the voice chip (or TTS chip) connected to the MCU. Indirect connection should be an equivalent replacement of direct connection, which is also a conventional replacement technology for single-chip technology. In addition, the indirect connection between the single-chip microcomputer chip and the reading dock can even be a wireless connection such as an infrared connection. The reading dock is more complicated, and it needs to read the code by itself and needs a wireless communication circuit.

发音器件可以是喇叭或喇叭电路(在本说明书有关图1的拾音器主体的实施例中,是由喇叭和三极管8050和1K电阻一起构成喇叭电路来作为发音器件,在有关图6的拾音器的实施例中,是用喇叭作为发音器件),也可以是带喇叭的外接的语音芯片(本发明文件所称语音芯片是专用于直接产音方式的,是与TTS芯片不同的),另也可以是带喇叭的TTS芯片;另还可能是用其它种类的发音器件。本发明文件各处所称外接的语音芯片(如提及时无特别指明有他义,本发明文件各处提及的外接的语音芯片默认是指主MCU外接的语音芯片),一种形式可以是:语音芯片由一些不含小mcu的逻辑器件及必要的电子元件(如声音放大器等)构成(也可含小mcu),语音芯片可从通讯线上接受外界(如主MCU)发来的比如ADPCM格式(或类似格式)的录音数据,并将其转成语音声音用喇叭播出来;另一种形式可以是:语音芯片是可以用录音数据(比如ADPCM格式的录音数据)来播语音的,语音芯片可调用存储器(此存储器可位于语音芯片片内,或者此存储器是位于语音芯片片外的存储器芯片并与语音芯片连接使用并且可把此存储器芯片看成是与语音芯片一体的;这些存储器不是能被作为主处理芯片的单片机芯片读取使用的存储器)里的ADPCM格式(或类似格式)的音标或拼音的录音数据,芯片可从通讯线上接受外界(如主MCU)发来的预备发音数据(预备发音数据即语句的拼法格式数据,比如按语句发音顺序排组起来的表示英语语句发音的各音标的编号信息或按语句发音顺序排组起来的表示汉语语句发音的各拼音的代号信息)或部分预备发音数据或类似格式的数据,并调音标或拼音的录音数据,来使此语音芯片接的喇叭发音。Pronunciation device can be loudspeaker or horn circuit (in the embodiment of the pickup main body of relevant Fig. 1 in this specification sheet, be to form the horn circuit together by horn andtriode 8050 and 1K resistance as pronunciation device, in the embodiment of the pickup of relevant Fig. 6 Among them, the loudspeaker is used as the pronunciation device), and it can also be an external voice chip with a loudspeaker (the said voice chip in the document of the present invention is dedicated to the direct sound production mode, which is different from the TTS chip), and it can also be an external sound chip with a loudspeaker. The TTS chip of the speaker; it is also possible to use other types of sounding devices. The external voice chip referred to in the documents of the present invention (if there is no special indication when mentioning, there are other meanings, the external voice chip mentioned in the documents of the present invention refers to the external voice chip of the main MCU by default), a form can be: The voice chip is composed of some logic devices without small mcu and necessary electronic components (such as sound amplifiers, etc.) (it can also contain small mcu). Format (or similar format) recording data, and convert it into voice sound and broadcast it with a speaker; another form can be: the voice chip can use recording data (such as ADPCM format recording data) to broadcast voice, The voice chip can call the memory (this memory can be located in the voice chip, or this memory is a memory chip located outside the voice chip and connected with the voice chip and can be regarded as integrated with the voice chip; these memory chips It is not the phonetic symbol or pinyin recording data in ADPCM format (or similar format) that can be read by the single-chip microcomputer chip used as the main processing chip. The chip can receive the preparation from the outside world (such as the main MCU) from the communication line. Pronunciation data (prepared pronunciation data is the spelling format data of the sentence, such as the number information of each phonetic symbol representing the pronunciation of the English sentence arranged in the pronunciation order of the sentence or the phonetic symbols of the pronunciation of the Chinese sentence arranged in the pronunciation order of the sentence) code information) or part of the prepared pronunciation data or data in a similar format, and the recording data of phonetic symbols or pinyin to make the speaker connected to the voice chip pronounce.

本发明文件涉及的拾音器的单片机芯片(MCU)使发音器件播出语音的方式(也是在权利要求10,19,22中的一句‘由前述拾音器的前述单片机芯片根据读得的前述部诸码字中的码字的码值使前述拾音器的前述发音器件播出语句发音’以及在权利要求2中的一句‘前述拾音器的前述单片机芯片根据读得的前述部诸码字中的码字的码值使前述拾音器的前述发音器件播出语句发音’中以及其它地方,涉及的单片机芯片使发音器件播出语音的方式),可以是由MCU用发音程序调用从拾音器主体内的存储器里(或数据卡上的存储器里或部件式数据卡上的存储器里)的诸音码的录音数据得到的一些音码的录音数据(比如ADPCM格式的音标或拼音的录音数据,音标是对应元音辅音音码的,拼音是对应元音辅音组合音码的)来驱动在MCU芯片内与播放语音有关的硬件电路,并由此硬件电路驱动MCU芯片外接的喇叭或喇叭电路(喇叭或喇叭电路为发音器件也是子发音器件),由喇叭把音码的语音播出来,可按这样连续播出多个音码的语音以形成语句的全部发音或非全部发音(也即解码采用直接发音方式);或者是MCU外接语音芯片,可由MCU用发音程序把从拾音器主体内的存储器里(或数据卡上的存储器里或部件式数据卡上的存储器里)的诸音码的录音数据得到的一些音码的录音数据(比如ADPCM格式的音标或拼音的录音数据),通过通讯线传给此外接的语音芯片,由带喇叭的外接的语音芯片(带喇叭的外接的语音芯片是发音器件也是子发音器件)播音码的语音,可按这样连续播出多个音码的语音以形成语句的全部发音或非全部发音(此也是采用直接发音方式);或者是MCU外接另一种形式的语音芯片,可由MCU把语句的全部或非全部预备发音数据(比如各音标或拼音的代号或编号信息)一次(整步直接发音方式)或分次(分步直接发音方式)通过通讯线传给此外接的语音芯片,由带喇叭的外接的语音芯片(带喇叭的外接的语音芯片为发音器件也是子发音器件)调用从存储器(此存储器可位于语音芯片片内,或者此存储器是位于语音芯片片外的存储器芯片并与语音芯片连接使用并且可把此存储器芯片看成是与语音芯片一体的;这些存储器不是能被作为主处理芯片读取使用的存储器)里的诸音码的录音数据得到的一些音码的录音数据(比如ADPCM格式的音标或拼音的录音数据)来播出语句的全部发音或非全部发音(此也是采用直接发音方式);或者是用MCU外接的TTS芯片来发音,解码时相应采用TTS发音方式,可由MCU把语句的全部或非全部拼法格式数据(或如是分步TTS发音方式时,每次是语句的部分拼法格式数据)通过通讯线传给TTS芯片,由带喇叭的TTS芯片(带喇叭的TTS芯片为发音器件也是子发音器件)播出语句的全部发音或非全部发音;或者是用MCU内有的TTS器件来发音,可用MCU内的TTS器件驱动MCU外接的喇叭播出语句发音,这时发音器件是MCU外接的喇叭;或者还可有其它种类的发音器件以及单片机芯片使发音器件播出语句发音的其它方式;这几种方式都属技术常识。The single-chip microcomputer chip (MCU) of the pickup that the document of the present invention relates to makes the sounding device play the mode of voice (also inclaim 10,19, the sentence in the 22' by the aforementioned single-chip microcomputer chip of the aforementioned pickup according to the code words of the aforementioned parts read. The code value of the code word in the aforementioned sound pick-up makes the aforementioned pronunciation device of the aforementioned sound pick-up play the sentence pronunciation 'and a sentence in claim 2'. Make the aforementioned pronunciation device of the aforementioned pickup play the sentence pronunciation 'and other places, the single-chip microcomputer chip involved makes the mode of the pronunciation device broadcast voice), can be called by the MCU with the pronunciation program in the memory (or data card) in the pickup main body The recording data of some phonetic codes (such as the phonetic symbols of the ADPCM format or the recorded data of pinyin in the memory on the device or in the memory of the component data card) of the various phonetic codes obtained , Pinyin is the corresponding vowel and consonant combined sound code) to drive the hardware circuit related to the playback voice in the MCU chip, and thus the hardware circuit drives the external speaker or speaker circuit of the MCU chip (the speaker or speaker circuit is a pronunciation device and is also a sub- Pronunciation device), the voice of the sound code is broadcast by the loudspeaker, and the voice of a plurality of sound codes can be played continuously in this way to form all or part of the pronunciation of the sentence (that is, the decoding adopts the direct pronunciation mode); or MCU External voice chip, the recording data of some phonetic codes obtained from the recording data of various phonetic codes in the memory in the pickup main body (or in the memory on the data card or in the memory on the component data card) can be obtained by the MCU with the pronunciation program (such as ADPCM format phonetic symbols or pinyin recording data), through the communication line to the external voice chip, the external voice chip with the speaker (the external voice chip with the speaker is a pronunciation device and also a sub-pronunciation device) broadcast code The voice of a plurality of phonetic codes can be played continuously in this way to form all or part of the pronunciation of the sentence (this is also a direct pronunciation method); All or not all of the prepared pronunciation data (such as the code or number information of each phonetic symbol or pinyin) is transmitted to this external voice chip through the communication line once (full-step direct pronunciation mode) or divided (step-by-step direct pronunciation mode), and the The externally connected voice chip with the horn (the externally connected voice chip with the horn is a sounding device and also a sub-sounding device) calls the slave memory (this memory can be located in the voice chip, or this memory is a memory chip located outside the voice chip and is connected with the voice chip) The voice chip is connected and used and this memory chip can be regarded as integrated with the voice chip; these memories are not the recording data of the various voice codes obtained by the recording data of the voice codes in the memory that can be read and used as the main processing chip (such as ADPCM format phonetic symbols or pinyin recording data) to broadcast all or part of the pronunciation of the sentence (this is also a direct pronunciation method); or use the TTS chip external to the MCU to pronounce, and use the TTS pronunciation method when decoding. , all or not all of the spelling format data of the sentence (or when it is a step-by-step TTS pronunciation mode, each time a part of the spelling format data of the sentence) can be passed to the TTS chip by the MCU through the communication line, and the TTS chip with a speaker ( The TTS chip with a speaker is a pronunciation device and also a sub-sounding device) to broadcast all or part of the pronunciation of the sentence; or use the TTS device in the MCU to pronounce, and the TTS device in the MCU can be used to drive the external speaker of the MCU to broadcast the sentence Pronunciation, at this moment the pronunciation device is the loudspeaker that MCU connects; Or also can also have other kinds of pronunciation device and single-chip microcomputer chip to make the pronunciation device play other modes of sentence pronunciation; These several modes all belong to technical common sense.

在解码时如采用直接产音方式,对于用主MCU发音或用主MCU外接的语音芯片(比如上段述的接收录音数据形式的语音芯片或接收语句的预备发音数据形式的语音芯片)发音的情况,除了可以是播那些音码的录音数据外,还可有别的形式,比如用语音波形的矢量技术来模拟发音码的语音,这些其它形式应是播那些音码的录音数据方式的等同替换,采用这些其它形式来播出语句发音也可看成属于直接产音方式。When decoding, if the direct sound production method is adopted, for the pronunciation of the pronunciation of the main MCU or the voice chip (such as the voice chip receiving the recording data form mentioned in the previous paragraph or the voice chip receiving the voice chip of the prepared pronunciation data form of the sentence) with the main MCU pronunciation , in addition to broadcasting the recording data of those codes, there are also other forms, such as using the vector technology of voice waveforms to simulate the voice of the codes, these other forms should be equivalent replacements for the recording data of those codes , the use of these other forms to broadcast sentence pronunciation can also be regarded as belonging to the direct sound production mode.

本发明文件涉及的拾音器的单片机芯片利用读码头来读码的方式(也就是权利要求10,19,22提及的‘前述拾音器的前述单片机芯片可以利用前述拾音器的前述读码头来读取前述诸个码字中的部诸码字中的每个码字’以及权利要求2提及的‘前述拾音器的前述单片机芯片利用前述拾音器的前述读码头可以读取前述部诸码字中的每个码字’中,涉及的单片机芯片利用读码头来读码的方式),是指,既可以是由单片机芯片作主动控制来读码,比如由单片机芯片的IO脚输出电流,电流通过通断式机电读码头流到有声读物一页面上的甲导电码的甲码盘,电流再流到甲导电码各码线上,再流回读码头,再对从甲导电码各码线触区位置流回的电流有无情况进行检测,检测结果由单片机芯片取得以得到码值(见2006102005537号‘一种有声读物及拾音器主体’申请里的阐述,读码时,是要使通断式机电读码头与有声读物的页面上的甲导电码接触的),也可以是读码头在读码时不需单片机芯片作控制,比如光电式读码头(如光笔)读取有声读物的页面上的导电码或条形码并输出读码结果或输出读码信息(读码信息是指比如表示码字的码线区的各条空宽度情况的波形),由单片机芯片取得读码结果(本段各处所提及的读码结果即码字的码值)或由单片机芯片取得读码信息并根据读码信息来得到读码结果(读码时,是要使光电式读码头与有声读物的页面上的导电码或条形码接触或接近的);另还可有别的读码方式(比如本说明书中阐述的图6的拾音器的实施例中有关读导电码的方式,其从读码头输出的电流是直接从正电源来的)。这些技术点在本发明文件都有记载。本段以上所述也可看成是对权利要求2中提及的‘前述部诸码字中的每个码字是通过前述拾音器的读码头被前述拾音器读取的’的解释。The single-chip microcomputer chip of the pickup that the document of the present invention relates to utilizes the mode that reads the yard to read code (just claim 10,19, the aforementioned single-chip microcomputer chip of the aforementioned pickup that mentions in 22 can utilize the aforementioned reading yard of the aforementioned pickup to read the aforementioned Each code word in the part code words in a code word' andclaim 2 mentioned ' the aforementioned single-chip microcomputer chip of the aforementioned pick-up utilizes the aforementioned reading terminal of the aforementioned pick-up to be able to read each code word in the aforementioned part all code words In the word ', the involved single-chip microcomputer chip uses the reading terminal to read the code), which means that the single-chip microcomputer chip can be actively controlled to read the code, for example, the IO pin of the single-chip microcomputer chip outputs current, and the current passes through the on-off electromechanical The reading terminal flows to the first code plate of the A conductive code on the first page of the audiobook, and the current flows to the code lines of the A conductive code, and then flows back to the reading terminal, and then flows back to the contact area of each code line of the A conductive code. The presence or absence of the current is detected, and the detection result is obtained by the single-chip microcomputer chip to obtain the code value (see the elaboration in the application of No. 2006102005537 'A kind of audiobook and the main body of the pickup device', when reading codes, it is necessary to make the on-off electromechanical reading dock and The conductive code on the page of the audiobook is contacted), and it can also be that the reading terminal does not need a single-chip microcomputer chip to control when reading the code, such as the photoelectric reading terminal (such as a light pen) to read the conductive code or barcode on the page of the audiobook and Output code-reading result or output code-reading information (code-reading information refers to the waveform of each empty width situation of the code line area that represents code word for example), obtains code-reading result by single-chip microcomputer chip (the reading that this paragraph mentions everywhere The code result is the code value of the code word) or the code reading information is obtained by the single-chip microcomputer chip and the code reading result is obtained according to the code reading information (when reading the code, it is necessary to make the photoelectric reading terminal and the conductive code or barcode on the page of the audiobook contact or approach); there can also be other code-reading modes (such as the mode of reading conduction code in the embodiment of the pickup of Fig. 6 set forth in this specification, and its current output from the reading terminal is directly from the positive power supply of). These technical points are all recorded in the documents of the present invention. The foregoing description of this paragraph also can be regarded as the interpretation of 'each codeword in the aforementioned codewords is read by the aforementioned pickup by the reading pier of the aforementioned pickup' mentioned inclaim 2.

本发明文件各处有提及的根据需要把一个(或多个)拼合信息的值分散(或分开)放在多个码字里,这里的根据需要的意思是指一个(或多个)拼合信息的值的BIT太多在一个码字里放不下,或者码字里还需存储其它与拼合信息的值无关的信息致使一个码字放不下此一个(或多个)拼合信息的值,或者是没什么特别原因而把一个(或多个)拼合信息的值分散放在多个码字里。The documents of the present invention are mentioned everywhere to disperse (or separate) the value of one (or more) combined information into multiple codewords according to the needs. The meaning here refers to one (or more) combined The value of the information has too many BITs to fit in one codeword, or the codeword also needs to store other information that has nothing to do with the value of the combined information, so that one codeword cannot fit the value of the one (or more) combined information, or The value of one (or more) combined information is scattered in multiple codewords for no particular reason.

一般情况下,拾音器可有按键(比如用按键来调节音量),但按键不是必需的。In general, pickups can have buttons (such as buttons to adjust the volume), but buttons are not required.

本发明文件涉及的各电路举例及实施例的供电电压(VCC)选择范围是2.6V至5V(查本发明文件涉及的单片机芯片型号及其它芯片型号的资料也可得到此工作电压选择范围值)。实际拾音器的供电电压可能可在2.6V至5V的范围以外。The supply voltage (VCC) selection range of each circuit example and embodiment that the document of the present invention relates to is 2.6V to 5V (check the single-chip microcomputer chip model involved in the document of the present invention and the data of other chip models and also can obtain this operating voltage selection range value) . Actual pickups may have supply voltages outside the 2.6V to 5V range.

实施例:附图1是一个拾音器主体(因不插数据卡,也不插部件式数据卡,所以此拾音器主体也就是拾音器)的实施例的电路图,其读码头使用光笔,主MCU芯片3是SNC5A9,其配有24个IO脚,有双通道语音功能,有384K*10BIT的数据语音存储器。发音器件显然是由喇叭和8050和1K电阻构成的喇叭电路(此喇叭电路也是一个子发音器件),两个按键可分别作为音量循环调节键和工作唤醒休眠键(这两个按键组成按键模块);本实施例拾音器主体解码时发音总采用直接产音方式;光笔是成熟共知技术,不属于本发明内容,实施可参考公知电路,利用其光电原理读出码的条空值(比如读条形码,光笔读条形码的原理也属于公知技术),这里给出的是一个参考电路。本例以读条形码为例来说明(这里假定印刷条形码的油墨,是黑色的,条形码的载体平面也即有声读物的各页面是白色的,条形码的颜色与载体平面的颜色不同以便于光笔读取),与此拾音器主体配合使用的语音编码有声读物以如图7示的语音编码有声读物为例,此语音编码有声读物的主体是书,此有声读物的整页面上只有一些条形码11。Embodiment: Accompanying drawing 1 is the circuit diagram of the embodiment of a pick-up main body (because of not inserting data card, also not inserting component type data card, so this pick-up main body is also pick-up), its reading terminal uses light pen, andmain MCU chip 3 is SNC5A9, which is equipped with 24 IO pins, has a dual-channel voice function, and has a data voice memory of 384K*10BIT. The sounding device is obviously a speaker circuit composed of a speaker, 8050 and 1K resistors (this speaker circuit is also a sub-sounding device), and the two buttons can be used as a volume cycle adjustment button and a work wake-up sleep button respectively (these two buttons form a button module) Pronunciation always adopts the direct sound production mode during present embodiment pickup main body decoding; Light pen is ripe common knowledge technology, does not belong to content of the present invention, implementation can refer to known circuit, utilizes the bar empty value of its photoelectric principle to read code (such as reading bar code , the principle of light pen reading barcodes also belongs to known technology), what provides here is a reference circuit. This example takes reading a barcode as an example (here it is assumed that the ink for printing the barcode is black, and the carrier plane of the barcode, that is, each page of the audiobook is white, and the color of the barcode is different from the color of the carrier plane to facilitate light pen reading ), the audio-coded audiobook used in conjunction with the main body of the pickup takes the audio-coded audiobook as shown in Figure 7 as an example, the main body of the audio-coded audiobook is a book, and there are only somebarcodes 11 on the entire page of the audiobook.

读码时,使光笔与有声读物的页面上的条形码接触并拖动光笔匀速扫过条形码,D1是发光二极管照射条形码,D2是光敏二极管,用于感受从条形码回来的光(条形码上有油墨的地方是黑的不反射光,无油墨的地方是白的反射光,这是常识),O1运放是信号的第1级放大,O2运放组成滞回比较器,用于信号整形成方波输出,其中Rx的阻值可以调节,D4,D5是稳压管,Qb组成开关回路,提供适合SNC5A9读取的方波输出。这个实施例,将解码的工作放在SNC5A9里面进行,光笔的输出在Out那里是方波输出,方波波形的波峰波谷宽度与条形码的条空宽度比是一致的,这样就可以在SNC5A9里记录从P12脚进来的方波波形的每个波峰的高电平持续时间及每个波谷的低电平持续时间,即可得到条形码的条空布置情况,然后根据条形码的条空布置情况,用软件解出其最后码值来,当然软件里需有对从P12脚进来的方波的波形误差的处理;对高低电平持续时间的计时可使用SNC5A9的中断来做。When reading the code, make the light pen touch the barcode on the page of the audiobook and drag the light pen to scan the barcode at a constant speed. D1 is a light-emitting diode that illuminates the barcode, and D2 is a photosensitive diode that is used to feel the light coming back from the barcode (there is ink on the barcode) The place is black without reflecting light, and the place without ink is white reflecting light, which is common sense), the O1 op amp is the first stage amplification of the signal, and the O2 op amp forms a hysteresis comparator for signal shaping into a square wave Output, where the resistance of Rx can be adjusted, D4 and D5 are regulator tubes, and Qb forms a switch circuit to provide a square wave output suitable for SNC5A9 reading. In this embodiment, the decoding work is carried out in the SNC5A9, and the output of the light pen is a square wave output at Out. The width of the peak and valley of the square wave waveform is consistent with the bar-space width ratio of the barcode, so that it can be recorded in the SNC5A9. The high-level duration of each peak and the low-level duration of each trough of the square wave waveform coming in from the P12 pin can get the bar-space layout of the barcode, and then according to the bar-space layout of the barcode, use the software To solve the final code value, of course, the software needs to deal with the waveform error of the square wave coming in from the P12 pin; the timing of the high and low level duration can be done by using the interrupt of SNC5A9.

(与此拾音器主体配合使用的语音编码有声读物也可如图3示,此有声读物的主体是书,此有声读物的页面上有双排码线多码段甲导电码10,至于光笔读导电码的原理过程,也是通过光从码线区反射回来以在光笔电路上产生条空布置情况的方波波形,SNC5A9从Out那里得到方波波形,并根据导电码的码线区形成原理由方波波形得到码值,这是与读条形码类似的,在此不再述;对于图3示的双排码线的导电码,显然需用一个光笔顺次扫过上下两排码线以分别读码;用光笔读导电码,需要求导电码的质材比如导电油墨的颜色与导电码的载体平面比如有声读物的各页面的颜色显著不同,比如导电油墨的颜色是黑色,有声读物的各页面的颜色是白色。用光笔还可读取如图4所示形式的导电码,读码的原理过程同于本段前面述)(the voice coding audiobook used in conjunction with this pickup main body can also be shown in Figure 3, the main body of this audiobook is a book, and there are double-row code lines and multi-code segment Aconductive code 10 on the page of this audiobook, as for light pen reading conductive The principle process of the code is also to generate a square wave waveform on the light pen circuit through the reflection of light from the code line area. Wave waveform to get the code value, which is similar to reading a barcode, and will not be described here; for the conductive code of the double-row code line shown in Figure 3, it is obviously necessary to use a light pen to scan the upper and lower two rows of code lines in order to read them separately. Code; to read the conductive code with a light pen, the material of the conductive code, such as the color of the conductive ink, is significantly different from the carrier plane of the conductive code, such as the color of each page of the audiobook. For example, the color of the conductive ink is black, and the color of each page of the audiobook The color is white. The conductive code in the form shown in Figure 4 can also be read with a light pen, and the principle process of reading the code is the same as that mentioned above in this paragraph)

语音编码有声读物(如图7示)的整页面上的每个条形码11的值有一个表示英语语句发音的拼合信息的值(这些表示英语语句发音的拼合信息的值都是按本说明书述的直播方式的以只适用于英语的编码规则来编码生成的,把此有声读物的整页面上的所有条形码11集合起来就是此有声读物整页面上的诸个码字);按常识,拾音器涉及的读码,拾音器运行主控,发音等程序都是存储在SNC5A9片内的存储器里,按单片机技术公知惯例可知,这里SNC5A9的内存储器足够大,所以解码数据以及诸音码的录音数据等都可存储在SNC5A9片内的存储器里,解码程序也可存储在SNC5A9片内存储器里(本实施例的拾音器主体是非插卡方式拾音器主体),此解码程序是按照本说明书述的直播方式的以只适用于英语为例的解码规则来解码的,此解码数据是此解码程序解码时使用的单词数据库,词根数据库等数据(这些数据库在本说明书述的直播方式的以只适用于英语为例的解码规则里有述),此诸音码的录音数据是按照本说明书第16页述的第3种方式形成的,本实施例拾音器主体在解码时是采用整步直接发音方式(用SNC5A9驱动喇叭电路播语音,发音器件是由喇叭和8050和1K电阻构成的喇叭电路);使用时,由SNC5A9利用光笔每次读取一个此有声读物的整页面上的诸个码字中的部诸码字中的任一个码字(此任一个码字即条形码11;本来此拾音器主体能读这诸个码字中的每个码字且可按码值播出语句发音,可把此有声读物的整页面上的诸个码字中的一部分条形码定义成此拾音器主体能读的部诸码字),再由SNC5A9片内主处理器调片内存储器里的此解码程序及此解码数据对读入的此一个条形码中的英语语种的拼合信息的值进行解码播出语句发音(可以是播出语句的全部发音或播出语句的非全部发音),解码最后阶段,SNC5A9播出语句发音是要调用SNC5A9片内的存储器里的诸音码的录音数据中的一些音码(元音,辅音音码)的录音数据的(即连续播出多个音码的语音以形成语句发音)。The value of eachbarcode 11 on the entire page of the voice coded audiobook (as shown in Figure 7) has a value representing the combined information of English sentence pronunciation (the values of the combined information representing the English sentence pronunciation are all described in this manual Live mode is coded and generated with coding rules that are only applicable to English, and all thebarcodes 11 on the entire page of the audiobook are assembled to be the codewords on the entire page of the audiobook); according to common sense, the pickups involved Code reading, pickup operation main control, pronunciation and other programs are all stored in the internal memory of SNC5A9. According to the known practice of single-chip technology, the internal memory of SNC5A9 is large enough, so the decoding data and recording data of various sound codes can be used. Stored in the on-chip memory of SNC5A9, the decoding program can also be stored in the on-chip memory of SNC5A9 (the main body of the pickup in this embodiment is the main body of the non-card pickup), this decoding program is only applicable to the live broadcast method described in this manual The decoding rules are based on English as an example. This decoding data is the word database, word root database and other data used by this decoding program for decoding (these databases are only applicable to English as an example in the live broadcast method described in this manual. The recording data of these sound codes is formed according to the 3rd mode stated on the 16th page of this manual. Voice, pronunciation device is the horn circuit that horn and 8050 and 1K resistance are formed); During use, by SNC5A9 utilizes light pen to read at every turn one in all code words on the whole page of this audio book in the part code words Any code word (this arbitrary code word is thebar code 11; originally this pick-up main body can read each code word in these code words and can broadcast the sentence pronunciation according to the code value, can put the whole page of this audiobook Part of the barcodes in the codewords are defined as part of the codewords that the pickup body can read), and then the decoding program in the on-chip memory and the decoding data are read in by the SNC5A9 on-chip main processor. The value of the combined information of the English language in the barcode is decoded to pronounce the pronunciation of the broadcast sentence (it can be the entire pronunciation of the broadcast sentence or not all the pronunciation of the broadcast sentence). In the final stage of decoding, the SNC5A9 broadcast sentence pronunciation is to call the SNC5A9 chip (promptly continuously broadcast the voice of a plurality of phonetic codes to form sentence pronunciation) of some phonetic codes (vowels, consonant phonetic codes) in the recorded data of all phonetic codes in the memory.

本实施例提及的‘拾音器’就是指本实施例说的‘拾音器主体’;本实施例所说的拾音器主体,是包括图1中的所有部分的。The 'pickup' that the present embodiment mentions refers to the 'pickup main body' that the present embodiment says; the pickup main body that the present embodiment says includes all parts in Fig. 1.

本实施例的光笔还可由其它种类的光笔代替,也可由其它类型的光电式读码头代替,比如用ccd读码头(读码时的ccd读码头不需在码字上拖动扫过),可用ccd读码头读取语音编码有声读物的整页面上的条形码或导电码(如将ccd读码头用于读导电码,那读码涉及的软件要编写成适应于读导电码)。The light pen of the present embodiment can also be replaced by other types of light pens, and can also be replaced by other types of photoelectric reading docks, such as reading docks with ccd (the ccd reading docks during code reading do not need to be dragged and swept over the code words), available The ccd reading terminal reads the barcode or the conductive code on the entire page of the voice-coded audiobook (if the ccd reading terminal is used to read the conductive code, the software involved in the code reading should be written to be suitable for reading the conductive code).

关于导电码及拾音器的发明内容,可参看本说明书后面对导电码技术的介绍,也请见本发明人申请的另一项发明申请“一种有声读物及拾音器主体”,申请号:2006102005537。在那个申请里有个实施例,请见其附图26及在其说明书里第57页至第58页对附图26实施例的说明,此拾音器(也即拾音器主体)的读码头为机电式导电码甲读码头,MCU也是SNC5A9,外围也有按键模块及发音回路,导电码的读取在那个申请里有详述,如果将其MCU片内的导电码读码程序保留,MCU片内的其它语音处理及语音播放的程序和数据用为本说明书前面述的比如用于对表示英语语句发音的的拼合信息的值解码的解码程序及解码数据以及诸音码的录音数据(这些具体处理参考从此页往前13页写的‘所使用的拾音器的结构,特点:’后的文字叙述),此解码程序是按照本说明书述的直播方式的以只适用于英语为例的解码规则来解码的,此解码数据是此解码程序解码时使用的单词数据库,词根数据库等数据(这些数据在本说明书述的直播方式的以只适用于英语为例的解码规则里有述),本实施例的拾音器主体(拾音器)在解码时采用整步直接发音方式,即可达到前述用光笔加SNC5A9的一样的功效。假设语音编码有声读物的整页面上有数个甲导电码,每个甲导电码的码值有一个表示英语语句发音的拼合信息的值(这些表示英语语句发音的拼合信息的值都是按本说明书述的直播方式的以只适用于英语的编码规则来编码生成的);使用时,使拾音器主体的机电式导电码甲读码头与此语音编码有声读物的整页面上的一个甲导电码接触,拾音器主体的MCU(即SNC5A9)利用机电式导电码甲读码头读得语音编码有声读物的整页面上的一个甲导电码的码值后,再从码值得到表示英语语句发音的拼合信息的值,并按此拼合信息的值调解码程序及解码数据来解码播出语句发音,解码最后阶段,SNC5A9播放语句发音是要调用SNC5A9片内的存储器里的诸音码的录音数据里的一些音码的录音数据的(即连续播出多个音码的语音以形成语句发音)。这是本发明文件的拾音器主体的第2个实施例。(因是非插卡方式的,所以本实施例提及的拾音器与拾音器主体是等同的)For the content of the invention of the conductive code and the pickup, please refer to the introduction of the conductive code technology at the end of this manual, and also refer to another invention application "A Audiobook and the Main Body of the Pickup" filed by the inventor, application number: 2006102005537. There is an embodiment in that application, please refer to its accompanying drawing 26 and the description of the accompanying drawing 26 embodiment on page 57 topage 58 in its description, the reading terminal of this pickup (that is, the pickup main body) is an electromechanical type Conductive code A reading terminal, the MCU is also SNC5A9, and there are also key modules and pronunciation circuits on the periphery. The reading of the conductive code is detailed in that application. If the conductive code reading program in the MCU chip is reserved, other components in the MCU The program and data of voice processing and voice playback are used as the decoding program and the decoding data and the recording data of all phonetic codes (these specific processing refer to this The first 13 pages of 'the structure of the pickup used, features:' are described in the text), this decoding program is based on the decoding rules that are only applicable to English as an example in the live broadcast method described in this manual. This decoding data is the word database used when this decoding program decodes, the data such as root database (these data are stated in the decoding rules that are only applicable to English as an example in the live broadcast mode described in this manual), the pickup main body of the present embodiment (Pickup) adopts the whole-step direct pronunciation method during decoding, which can achieve the same effect as the aforementioned light pen plus SNC5A9. Assume that there are several A conductive codes on the entire page of the phonetic coded audiobook, and the code value of each A conductive code has a value representing the combined information of the English sentence pronunciation (the values of the combined information representing the English sentence pronunciation are all according to this specification. The above-described live broadcast method is coded and generated by encoding rules that are only applicable to English); when in use, the electromechanical conductive code A reading terminal of the pickup body is contacted with a conductive code A on the entire page of this speech coded audiobook, The MCU (namely SNC5A9) of the main body of the pickup uses the electromechanical conductive code reading terminal to read the code value of a conductive code on the entire page of the speech coded audiobook, and then obtains the value of the combined information representing the pronunciation of the English sentence from the code value , and adjust the decoding program and decoding data according to the value of the combined information to decode the pronunciation of the broadcast sentence. In the final stage of decoding, the pronunciation of the SNC5A9 playback sentence is to call some sound codes in the recording data of the various sound codes in the memory of the SNC5A9 chip (that is, the voice of a plurality of phonetic codes is played continuously to form a sentence pronunciation). This is the 2nd embodiment of the pickup main body of the present invention file. (Because it is a non-plug-in card mode, the pickup mentioned in this embodiment is equivalent to the pickup body)

关于本说明书所述的具有机电式或光电式读码头的拾音器主体与语音编码有声读物的形状,这仅是机械结构等方面的问题,对形状没必要硬性规定,按照本说明书阐述的语音编码有声读物及拾音器主体的原理,可把语音编码有声读物与拾音器主体做成多种形式,比如:一种方式是把拾音器主体做成大机体连读码头形式,把MCU和发音器件和按键模块等做在机体内(如为插数据卡方式或外插部件式数据卡方式,卡插槽也可做在机体上),读码头可做成移动的,并通过导线与机体连接,把大的机体连读码头当成拾音器主体,大机体的外型可以是适于让语音编码有声读物放入机体里的,使用时,将语音编码有声读物(如图3所示为语音编码有声读物示例)放入机体里使用,移动读码头去读语音编码有声读物的整页面上的码字,由机体内的MCU取得读码值并解码得到发音;另外,大机体的外型也可以是适于让语音编码有声读物放在机体上的,使用时,将语音编码有声读物(如图3所示或如图10下半部分所示为语音编码有声读物示例)放在机体上使用,移动读码头去读语音编码有声读物的整页面上的码字,由机体内的MCU取得读码值并解码使发音器件播出语句发音。另一种方式不做大机体,而把MCU和读码头和发音器件和按键模块等集成在一个小壳体里成为适于手持的拾音器主体(当然读码头一般需露出在小壳体外以便读码,如为插数据卡方式或外插部件式数据卡方式,卡插槽也可做在小壳体上),而语音编码有声读物是与拾音器主体独立的,语音编码有声读物可仍是以读物(比如书)为主体(如图3所示或图10下半部分所示为语音编码有声读物示例),在读物的整页面上有诸个码字,读物各页面上各码字的位置是随意的;使用时此有声读物可放在桌面不动,手持小壳体,移动小壳体用小壳体上的读码头去接触或接近(如为机电式导电码读码头时可以用接触的方式,如为光电式读码头时可用接触或接近的方式)在有声读物的整页面上的码字以读码播出语句发音,当有声读物的一个页面上有多个码字时,可实现想读哪个码字就读哪个码字。另一种方式还可把读码头做成移动的,将MCU电路和发音器件等集成在一个小壳体里,此小壳体可放在桌面上,读码头是移动的,读码头与小壳体用导线连接,把小壳体连读码头当成拾音器主体;使用时,移动读码头去读语音编码有声读物的整页面上的码字,由小壳体内的MCU取得读码值并解码使发音器件播出语句发音;除以上几种拾音器主体的外型形式外,还可以有其它的拾音器主体的外型形式。Regarding the shape of the main body of the pickup with an electromechanical or photoelectric reading dock and the voice-coded audiobook described in this manual, this is only a matter of mechanical structure and other aspects, and there is no need for rigid regulations on the shape. The principle of the reading material and the main body of the pickup can make the voice-coded audiobook and the main body of the pickup into various forms, for example: one way is to make the main body of the pickup into a large-body continuous reading terminal form, and make the MCU and pronunciation devices and button modules, etc. In the body (such as the way of inserting a data card or an external component type data card, the card slot can also be made on the body), the reading terminal can be made mobile, and connected with the body through wires, so that the large body can be connected The reading dock is used as the main body of the pickup, and the appearance of the large body can be suitable for putting voice-coded audiobooks into the body. When in use, put the voice-coded audiobook (as shown in Figure 3 is an example of the voice-coded audiobook) into the body It is used in the mobile reading dock to read the codewords on the entire page of the audio-coded audiobook, and the MCU in the body obtains the code-reading value and decodes it to obtain the pronunciation; in addition, the appearance of the large body can also be suitable for making the speech coded with sound If the reading material is placed on the body, when in use, put the voice-coded audiobook (as shown in Figure 3 or the second half of Figure 10 is an example of a voice-coded audiobook) on the body for use, and move the reading dock to read the voice code The code words on the entire page of the audio book are read by the MCU in the body and decoded to make the pronunciation device broadcast the sentence pronunciation. Another way is not to make a large body, but to integrate the MCU, the reading terminal, the pronunciation device and the button module into a small housing to become the main body of the pickup suitable for handheld (of course, the reading terminal generally needs to be exposed outside the small housing for code reading. , such as the way of inserting a data card or an external component type data card, the card slot can also be made on the small casing), and the voice-coded audiobook is independent of the main body of the pickup, and the voice-coded audiobook can still be based on the reading material (such as a book) as the main body (as shown in Figure 3 or in the lower part of Figure 10 is an example of a speech coded audiobook), there are various codewords on the entire page of the reading material, and the positions of each codeword on each page of the reading material are Random; during use, this audiobook can be placed on the desktop and hold the small casing, and move the small casing to touch or approach with the reading dock on the small casing (such as the electromechanical conductive code reading dock can be used to touch Way, such as the photoelectric reading dock can be contacted or approached) the code word on the entire page of the audiobook is pronounced by reading the code to broadcast the sentence, when there are multiple codewords on one page of the audiobook, it can be realized Which codeword is just read which codeword wants to read. In another way, the reading terminal can also be made mobile, and the MCU circuit and the pronunciation device are integrated in a small case. This small case can be placed on the desktop, and the reading terminal is mobile. The reading terminal and the small case The body is connected with a wire, and the small shell continuous reading dock is used as the main body of the pickup; when in use, the mobile reading dock reads the code words on the entire page of the voice-coded audiobook, and the MCU in the small shell obtains the code reading value and decodes it to make the pronunciation The device broadcasts the pronunciation of sentences; except the external forms of the above several pickup main bodies, there can also be other external forms of the pickup main bodies.

如图8,是一个大机体连读码头形式的拾音器主体外型实例和语音编码有声读物的实例,拾音器主体外型是大机体连读码头形式的(即把MCU和发音器件和按键模块等集成在一个机体里,读码头可做成移动的,并通过导线与机体连接,把大的机体连读码头当成拾音器主体,本实例不需插数据卡,所以机体上没有做卡插槽),机体的外型是适于让语音编码有声读物放入机体里的,1是机电式导电码甲读码头,5是按键模块(即两个按键,此按键模块是由两个按键组成的,按键不是必需的可省略),语音编码有声读物的形式可如图3所示,此语音编码有声读物的主体是书,使用时,将语音编码有声读物放入机体里,手持移动甲读码头1,用甲读码头1去接触语音编码有声读物的整页面上的导电码10以读码,机体内的MCU根据码字的码值来解码使发音器件播出语句发音。(本实例的读码头显然可以换成光电式读码头比如光笔或CCD读码头以读取语音编码有声读物的整页面上的导电码;语音编码有声读物的整页面上的码字还可以是条形码等,如码字是条形码,那拾音器主体的读码头应换成光电式读码头比如光笔或CCD读码头)As shown in Figure 8, it is an example of the appearance of the main body of the pickup in the form of a large-body continuous reading dock and an example of a voice-coded audiobook. In one body, the reading terminal can be made mobile and connected to the body through wires, and the large body connecting terminal is used as the main body of the pickup. This example does not need to insert a data card, so there is no card slot on the body), the body The external appearance is suitable for putting voice-coded audiobooks into the body, 1 is the electromechanical conductive code reading dock, 5 is the button module (that is, two buttons, this button module is composed of two buttons, and the buttons are not Necessary can be omitted), the form of the voice coded audiobook can be as shown in Figure 3, the main body of this voice coded audiobook is a book, when in use, the voice coded audiobook is put into the body, hand-held mobileA reading terminal 1, use Areading dock 1 touches theconductive code 10 on the entire page of the audio-coded audiobook to read the code, and the MCU in the body decodes it according to the code value of the code word to make the pronunciation device broadcast the sentence pronunciation. (the reading dock of this example can obviously be replaced with a photoelectric reading dock such as a light pen or a CCD reading dock to read the conductive code on the entire page of the audio-coded audiobook; the codeword on the entire page of the audio-coded audiobook can also be a barcode etc., if the code word is a barcode, then the reading terminal of the main body of the pickup should be replaced with a photoelectric reading terminal such as a light pen or a CCD reading terminal)

如图9,是一个大机体连读码头形式的拾音器主体外型实例和语音编码有声读物的实例,拾音器主体外型是大机体连读码头形式的(即把MCU和发音器件和按键模块等集成在一个机体里,读码头可做成移动的,并通过导线与机体连接,把大的机体连读码头当成拾音器主体,本实例不需插数据卡,所以机体上没有做卡插槽),机体的外型是适于让语音编码有声读物放在机体上的,1是机电式导电码甲读码头,5是按键模块(即两个按键,此按键模块是由两个按键组成的,按键不是必需的可省略),语音编码有声读物的形式可如图10下半部分所示,此语音编码有声读物的主体是书,使用时,将语音编码有声读物放在机体上,手持移动甲读码头1,用甲读码头1去接触语音编码有声读物的整页面上的导电码20以读码,机体内的MCU取得读码值并解码使发音器件播出语句发音。(本实例的读码头显然可以换成光电式读码头比如光笔或CCD读码头以读取有声读物的整页面上的导电码;有声读物的整页面上的码字还可以是条形码等,如码字是条形码,那拾音器主体的读码头应换成光电式读码头比如光笔或CCD读码头)As shown in Figure 9, it is an example of the appearance of the main body of the pickup in the form of a large-body continuous reading dock and an example of a voice-coded audiobook. In one body, the reading port can be made mobile and connected to the body through wires, and the large body connecting port is used as the main body of the pickup. This example does not need to insert a data card, so there is no card slot on the body), the body The external appearance is suitable for voice coding audiobooks to be placed on the body, 1 is the electromechanical conductive code A reading dock, 5 is the button module (that is, two buttons, this button module is composed of two buttons, the buttons are not Necessary can be omitted), the form of the voice-coded audiobook can be shown in the lower part of Figure 10, the main body of the voice-coded audiobook is a book, when in use, put the voice-coded audiobook on the body, hand-held mobileA reading dock 1. Use a readingterminal 1 to touch theconductive code 20 on the entire page of the audio-coded audiobook to read the code, and the MCU in the body obtains the code reading value and decodes it to make the pronunciation device broadcast the sentence pronunciation. (the reading terminal of this example can obviously be replaced with a photoelectric reading terminal such as a light pen or a CCD reading terminal to read the conductive code on the entire page of the audiobook; the code word on the entire page of the audiobook can also be a barcode etc., such as a code The word is a barcode, then the reading terminal of the main body of the pickup should be replaced with a photoelectric reading terminal such as a light pen or a CCD reading terminal)

如图10,是一个手持式拾音器主体外型实例和页面上印有导电码的语音编码有声读物的实例,语音编码有声读物与拾音器主体是独立的,图10上半部分示的是一个手持式拾音器主体(即把MCU和读码头和发音器件和按键模块等集成在一个小壳体里成为适于手持的拾音器主体,读码头一般需露出在小壳体外以便读码,本拾音器主体例不需插数据卡,所以本拾音器主体上没有做卡插槽),1是机电式导电码读码头,5是按键模块(即两个按键,此按键模块是由两个按键组成的),读码头1的形式可以是如图6下半部分所示走线的读码头,语音编码有声读物的整页面上的导电码20是如图4所示单排码线单码段甲导电码,使用时,手持移动小壳体,用小壳体上的机电式导电码读码头1去接触图10下半部分示的语音编码有声读物的整页面上的单排码线单码段甲导电码20,读码并根据码字的码值解码播出语句发音(读码的过程原理可参考后面对图6的拾音器的实施例的阐述)。语音编码有声读物的整页面上的码字还可以是条形码等,如码字是条形码,那手持式拾音器主体的读码头应换成光电式读码头比如光笔或CCD读码头。As shown in Figure 10, it is an example of the appearance of the main body of a hand-held pickup and an example of a voice-coded audiobook printed with conductive codes on the page. The voice-coded audiobook is independent from the main body of the pickup. The main body of the pickup (that is, the MCU, the reading terminal, the pronunciation device and the button module are integrated into a small shell to become a pickup main body suitable for hand-held, and the reading terminal generally needs to be exposed outside the small shell for code reading. This example of the pickup main body does not need Insert the data card, so there is no card slot on the main body of the pickup), 1 is the electromechanical conductive code reading terminal, 5 is the button module (that is, two buttons, this button module is composed of two buttons), the readingterminal 1 The form can be the reading terminal of alignment as shown in the lower part of Figure 6, and theconduction code 20 on the whole page of the audio-coded audiobook is theconduction code 20 of single code line single code segment A as shown in Figure 4, when using, Hold the mobile small housing, use the electromechanical conductivecode reading terminal 1 on the small housing to contact the single-row code line single code segment Aconductive code 20 on the entire page of the speech coded audiobook shown in the lower part of Figure 10, and read code and according to the code value of the code word to decode the pronunciation of the broadcast sentence (the process principle of code reading can refer to the elaboration of the embodiment of the pickup of FIG. 6 ). The code words on the entire page of the voice-coded audiobook can also be bar codes, etc., as the code words are bar codes, then the reading dock of the hand-held pickup body should be replaced with a photoelectric reading dock such as a light pen or a CCD reading dock.

如图11,是一个小壳体连读码头形式的拾音器主体外型实例和语音编码有声读物的实例,拾音器主体外型是小壳体连读码头形式的(即把MCU和发音器件和按键模块等集成在一个小壳体里,此小壳体可放在桌面上,读码头可做成移动的,并通过导线与小壳体连接,把小壳体连读码头当成拾音器主体,本实例可插数据卡,所以小壳体上做有卡插槽),4是光笔读码头,5是按键模块(即两个按键,此按键模块是由两个按键组成的,按键不是必需的可省略),58是外插的数据卡,语音编码有声读物的主体是书,使用时,手持移动光笔读码头4,用光笔读码头4去接触语音编码有声读物的整页面上的条形码11以读码,小壳体内的MCU取得读码值并解码使发音器件播出语句发音。(也可用本实例的拾音器主体的光笔读码头4去读取页面上有导电码的语音编码有声读物的整页面上的导电码,本实例的读码头显然可换成机电式导电码读码头,以读取页面上有导电码的语音编码有声读物的整页面上的导电码)As shown in Figure 11, it is an example of the appearance of the main body of the pickup in the form of a small housing continuous reading dock and an example of a voice-coded audiobook. etc. integrated in a small case, this small case can be placed on the desktop, the reading dock can be made mobile, and connected to the small case through wires, the small case connecting reading port can be used as the main body of the pickup, this example can Insert the data card, so there is a card slot on the small shell), 4 is the light pen reading terminal, 5 is the button module (that is, two buttons, this button module is composed of two buttons, the button is not necessary and can be omitted) , 58 is an externally inserted data card, the main body of the voice-coded audiobook is a book, when in use, the mobile light pen is held in hand to read theterminal 4, and the light pen is used to read theterminal 4 to touch thebarcode 11 on the entire page of the voice-coded audiobook to read the code, The MCU in the small housing obtains the code reading value and decodes it to make the pronunciation device broadcast the pronunciation of the sentence. (can also use the lightpen reading terminal 4 of the pickup main body of this example to read the conductive code on the entire page of the voice coded audiobook with conductive code on the page, the reading terminal of this example can obviously be replaced with an electromechanical conductive code reading terminal, to read conductive codes on entire pages of speech-coded audiobooks with conductive codes on the page)

以上所说的大机体连读码头形式或小壳体形式的拾音器主体,在大机体或小壳体里一般应有供电的电池盒,这是常识。The pickup main body of the above-mentioned large body continuous reading dock form or small shell form should generally have a battery box for power supply in the large body or small shell form, which is common sense.

一般机电式导电码读码头上有有弹性的导电胶条,读码时用导电胶条去接触导电码,导电胶条要被按压,所以对于采用机电式导电码读码头的手持式拾音器主体,为了减小读码时手的用力,可将手持式拾音器主体的结构做布置将重量大的元件集中到读码头附近,使其在使用时重量比较集中在读码头的导电胶条上,使用时,由于手持式拾音器主体的重量起到了一些对导电胶条的按压作用,所以用户使用起来就比较省力。Generally, there is an elastic conductive rubber strip on the electromechanical conductive code reading dock. When reading the code, use the conductive rubber strip to touch the conductive code, and the conductive rubber strip must be pressed. Therefore, for the main body of the hand-held pickup that uses the electromechanical conductive code reading dock, In order to reduce the hand force when reading the code, the structure of the main body of the hand-held pickup can be arranged to concentrate the heavy components near the reading dock, so that the weight is concentrated on the conductive rubber strip of the reading dock during use. When in use, Since the weight of the main body of the hand-held pickup acts as a pressing force on the conductive adhesive strip, it is less labor-intensive for the user to use.

以下关于使用TTS技术的方案是适用于印欧语系各语种及非印欧语系各语种的,分别以英语和汉语作为例子(对于其它语种,比如法语,韩语或日语等,完全也可以有这些语种的TTS器件比如TTS芯片,然后在这些适用于法语,韩语或日语等语种的拾音器内使用法语,韩语或日语等语种的TTS芯片,具体的使用处理显然可参考以下对英语或汉语TTS芯片的使用处理)。The following schemes about using TTS technology are applicable to all Indo-European languages and non-Indo-European languages, taking English and Chinese as examples respectively (for other languages, such as French, Korean or Japanese, etc., these languages are also completely possible TTS devices such as TTS chips, and then use French, Korean or Japanese TTS chips in these pickups that are suitable for French, Korean or Japanese languages. The specific use and processing can obviously refer to the following use of English or Chinese TTS chips. deal with).

本说明书前面描述的编解码技术,解码时有涉及的预备发音数据(或部分预备发音数据)一般是音标或拼音这种可以直接转换成语音的数据形式,因其不是文本流的形式,而且不需对其作分解词等处理,所以与常见的TTS技术(文本转语音技术)有点不同,所以实现将预备发音数据(或部分预备发音数据)转成发音也简单些,用一般可发语音的MCU,配合一些语音合成的规则(合成规则的作用是使语音听起来圆润自然,更合乎发音习惯),即可将语句的全部或非全部预备发音数据(或部分预备发音数据)中的各音码的语音按各音码在语句的全部或非全部预备发音数据(或部分预备发音数据)中的排列顺序顺次连续播出来(比如对各音码按播放ADPCM格式的录音数据的方式来播各音码的语音),这是本说明书前面描述的直接产音方式一般涉及的发音处理,(本段提及的部分预备发音数据是指在解码采用分步直接发音方式时涉及的)。编解码处理等都有可能涉及直接发音方式,如拾音器在对一拼合信息的值解码时硬件发音总是采取直接产音方式,那对此一拼合信息的值解码处理所用的发音方式相应地是直接发音方式;在本说明书里将其涉及的发音形式是全用直接发音方式的只适用于一个语种的解码规则称为直播方式的只适用于一个语种的解码规则,将其涉及的发音形式是全用直接发音方式的适用于多个语种的解码规则中有关其中某语种的解码细则称为解码规则中有关某语种的直播方式的解码细则,将按其涉及的发音形式是全用直接发音方式的解码规则中有关一个语种的解码细则来作的解码处理称为直播方式的解码处理,将按其涉及的发音形式是全用直接发音方式的解码规则中有关一个语种的解码细则来作的解码称为直播方式的解码。在本说明书里将用其编码得到的所有拼合信息的值都是本来用于采用直接发音方式来解码的只适用于一个语种的编码规则称为直播方式的只适用于一个语种的编码规则,将用其编码得到的所有拼合信息的值都是本来用于采用直接发音方式来解码的适用于多个语种的编码规则中有关其中某语种的编码细则称为编码规则中有关某语种的直播方式的编码细则,将按用其编码得到的所有拼合信息的值都是本来用于采用直接发音方式来解码的编码规则中有关一个语种的编码细则来作的编码处理称为直播方式的编码处理,将按用其编码得到的所有拼合信息的值都是本来用于采用直接发音方式来解码的编码规则中有关一个语种的编码细则来作的编码称为直播方式的编码。本发明文件所说的直播方式的编解码处理是指直播方式的编码处理和直播方式的解码处理。如提及时无特别注明有他义,本段提及的编码规则默认是只适用于一个语种或适用于多个语种的,如提及时无特别注明有他义,本段提及的解码规则默认是只适用于一个语种或适用于多个语种的。For the codec technology described earlier in this manual, the pre-pronunciation data (or part of the pre-pronunciation data) involved in decoding is generally phonetic symbols or pinyin, which can be directly converted into voice data, because it is not in the form of text streams, and it is not It needs to be processed by decomposing words, so it is a bit different from the common TTS technology (text-to-speech technology), so it is easier to convert the prepared pronunciation data (or part of the prepared pronunciation data) into pronunciation. MCU, in conjunction with some speech synthesis rules (the function of the synthesis rules is to make the speech sound round and natural, more in line with pronunciation habits), can convert all or part of the prepared pronunciation data (or part of the prepared pronunciation data) of the sentence. The pronunciation of sign indicating number is successively played out (such as the mode of recording data of playing ADPCM format to each sound sign indicating number by the arrangement order of each sound sign indicating number in all or not all preparation pronunciation data (or part preparation pronunciation data) of sentence broadcast the voice of each sound sign indicating number), this is the pronunciation processing generally involved in the direct sound production mode described above in this manual, (the part preparation pronunciation data mentioned in this paragraph refers to when decoding adopts the step-by-step direct pronunciation mode to relate to). Codec processing and so on may involve the direct pronunciation mode, such as when the pickup decodes the value of a combined information, the hardware pronunciation always adopts the direct sound production mode, then the pronunciation mode used for the value decoding process of this combined information is correspondingly Direct pronunciation method; In this manual, the pronunciation forms involved are all used in the direct pronunciation method. The decoding rule applicable to only one language is called the decoding rule applicable to only one language in the live broadcast method, and the pronunciation forms involved are The decoding rules for a certain language in the decoding rules applicable to multiple languages that use the direct pronunciation method are called the decoding rules for the live broadcast method of a certain language in the decoding rules, and will use the direct pronunciation method in accordance with the pronunciation forms involved The decoding processing performed by the decoding rules of a language in the decoding rules is called the decoding processing of the live broadcast method, and the decoding will be done according to the decoding rules of a language in the decoding rules of the direct pronunciation method according to the pronunciation forms involved. It is called the decoding of live mode. In this manual, the values of all the stitching information obtained by encoding are originally used for direct pronunciation decoding. The encoding rule that is only applicable to one language is called the encoding rule that is only applicable to one language in the live mode. The value of all the combined information obtained by encoding is originally used for decoding by direct pronunciation. The encoding rules for a certain language in the encoding rules applicable to multiple languages are called the live broadcast method for a certain language in the encoding rules. Coding rules, the encoding process that the values of all the combined information obtained by encoding are originally used to decode the encoding rules of a language in the direct pronunciation mode is called the encoding process of the live mode. The value of all the combined information obtained by its coding is originally used to adopt the coding rules of a language in the coding rules of the direct pronunciation mode to decode, and the coding is called the coding of the live mode. The encoding and decoding process of the live mode mentioned in the document of the present invention refers to the encoding process of the live mode and the decoding process of the live mode. If there is no special indication when it is mentioned, the encoding rules mentioned in this paragraph are only applicable to one language or multiple languages by default. If there is no special indication when it is mentioned, the decoding mentioned in this paragraph By default, the rules are only applicable to one language or applicable to multiple languages.

在实际中解码涉及的硬件发音可以是使用TTS器件来播出语句发音,这种硬件发音的方式是TTS产音方式;相应的,编解码涉及的有些编解码数据的数据结构及拾音器硬件技术特征要作调整。TTS器件有适用于英语,汉语等不同语种的,所以此方案可适用于汉语,英语等各个语种。编解码处理等都有可能涉及TTS发音方式,如拾音器在对一拼合信息的值解码时硬件发音总是采取TTS产音方式,那对此一拼合信息的值解码处理所用的发音方式相应地是TTS发音方式;本说明书将其涉及的发音形式是全用TTS发音方式的只适用于一个语种的解码规则称为TTS方式的只适用于一个语种的解码规则,将其涉及的发音形式是全用TTS发音方式的适用于多个语种的解码规则中有关其中某语种的解码细则称为解码规则中有关某语种的TTS方式的解码细则,将按其涉及的发音形式是全用TTS发音方式的解码规则中有关一个语种的解码细则来作的解码处理称为TTS方式的解码处理,将按其涉及的发音形式是全用TTS发音方式的解码规则中有关一个语种的解码细则来作的解码称为TTS方式的解码。本说明书将用其编码得到的所有拼合信息的值都是本来用于采用TTS发音方式来解码的只适用于一个语种的编码规则称为TTS方式的只适用于一个语种的编码规则,将用其编码得到的所有拼合信息的值都是本来用于采用TTS发音方式来解码的适用于多个语种的编码规则中有关其中某语种的编码细则称为编码规则中有关某语种的TTS方式的编码细则,将按用其编码得到的所有拼合信息的值都是本来用于采用TTS发音方式来解码的编码规则中有关一个语种的编码细则来作的编码处理称为TTS方式的编码处理,将按用其编码得到的所有拼合信息的值都是本来用于采用TTS发音方式来解码的编码规则中有关一个语种的编码细则来作的编码称为TTS方式的编码。本发明文件所说的TTS方式的编解码处理是指TTS方式的编码处理和TTS方式的解码处理。如提及时无特别注明有他义,本段提及的编码规则默认是只适用于一个语种或适用于多个语种的,如提及时无特别注明有他义,本段提及的解码规则默认是只适用于一个语种或适用于多个语种的。本段和上一段所说的一个语种可以是任一非复合语种或任一复合语种;本段和上一段所说的多个语种可全是非复合语种或全是复合语种或既含有非复合语种也含有复合语种。In practice, the hardware pronunciation involved in the decoding can be to use the TTS device to broadcast the pronunciation of the sentence. This hardware pronunciation method is the TTS sound production method; correspondingly, the data structure of some codec data involved in the codec and the technical characteristics of the pickup hardware To make adjustments. TTS devices are applicable to different languages such as English and Chinese, so this solution can be applied to various languages such as Chinese and English. Codec processing and the like may involve the TTS pronunciation method. For example, when the pickup decodes the value of a piece of information, the hardware pronunciation always adopts the TTS sound production method. Then the pronunciation method used for the value decoding process of this piece of information is correspondingly TTS pronunciation method; in this manual, the pronunciation forms involved are all used in the TTS pronunciation method. The decoding rule that is only applicable to one language is called the decoding rule that is only applicable to one language in the TTS method, and the pronunciation forms involved are all used. The decoding rules for a certain language in the decoding rules applicable to multiple languages in the TTS pronunciation mode are called the decoding rules for a certain language in the TTS mode in the decoding rules, and will be decoded according to the pronunciation forms involved in the TTS pronunciation mode. The decoding processing done by the decoding rules of a language in the rules is called the decoding processing of the TTS mode, and the decoding is done according to the decoding rules of a language in the decoding rules of the TTS pronunciation mode. Decoding in TTS mode. In this manual, the values of all the combined information obtained by encoding are originally used for decoding by using the TTS pronunciation method. The encoding rule that is only applicable to one language is called the TTS encoding rule that is only applicable to one language. It will be used The values of all the stitching information obtained by encoding are originally used for decoding by using the TTS pronunciation method. The coding rules for a certain language in the coding rules applicable to multiple languages are called the coding rules for the TTS method of a certain language in the coding rules. , the value of all the combined information obtained by encoding is originally used to decode the encoding rules of a language in the encoding rules for decoding in the TTS pronunciation mode. The encoding process is called the encoding process of the TTS mode. The values of all the stitching information obtained by its encoding are originally used to decode the encoding rules of a language in the encoding rules using the TTS pronunciation mode, and the encoding is called the encoding of the TTS mode. The encoding and decoding process of the TTS mode mentioned in the document of the present invention refers to the encoding process of the TTS mode and the decoding process of the TTS mode. If there is no special indication when it is mentioned, the encoding rules mentioned in this paragraph are only applicable to one language or multiple languages by default. If there is no special indication when it is mentioned, the decoding mentioned in this paragraph By default, the rules are only applicable to one language or applicable to multiple languages. The language mentioned in this paragraph and the previous paragraph can be any non-compound language or any compound language; the multiple languages mentioned in this paragraph and the previous paragraph can be all non-compound languages or all compound languages or both non-compound languages Compound languages are also included.

本说明书所说的直播方式拾音器,是指此拾音器不使用TTS器件发音,在此拾音器解码时拾音器硬件发音都是采用直接产音方式的;本说明书中各处所说的采用直接产音方式的拾音器也是指直播方式拾音器;本发明文件各处所说的拾音器采用直接产音方式,是指此拾音器的MCU解码时硬件发音是采用直接产音方式,且默认此拾音器是直播方式拾音器。The live mode pickup mentioned in this manual refers to the pickup that does not use the TTS device to pronounce. When the pickup is decoded, the pickup hardware pronunciation is all in the direct sound generation method; the pickups that use the direct sound generation method are mentioned everywhere in this manual It also refers to the live mode pickup; the pickup mentioned in the documents of the present invention adopts the direct sound production mode, which means that the hardware pronunciation is to adopt the direct sound production mode when the MCU decoding of this pickup is used, and this pickup is a live broadcast mode pickup by default.

本说明书所说的TTS方式拾音器,是指此拾音器只用TTS器件发音(不用MCU来发音,也不用MCU外接的语音芯片来发音),在此拾音器解码时拾音器硬件发音都是采用TTS产音方式的;在本说明书中各处所说的采用TTS产音方式的拾音器也是指TTS方式拾音器;本发明文件各处所说的拾音器采用TTS产音方式,是指此拾音器的MCU解码时硬件发音是采用TTS产音方式,且默认此拾音器是TTS方式拾音器。The TTS mode pickup mentioned in this manual means that the pickup only uses the TTS device to pronounce (no MCU to pronounce, nor the MCU external voice chip to pronounce), when the pickup is decoded, the pickup's hardware pronunciation is all using the TTS sound production method The pickup that adopts the TTS sound production mode mentioned everywhere in this specification sheet also refers to the TTS mode pickup; the said pickup in the documents of the present invention adopts the TTS sound production mode everywhere, which means that the hardware pronunciation is to adopt TTS when the MCU decoding of this pickup is used. The sound production method, and the pickup is the TTS pickup by default.

虽然一个适用于多个语种的解码规则中所有有关各自语种的解码细则可以同时既有直播方式也有TTS方式的,但如在一个直播方式(或TTS方式)拾音器里,如解码是按照适用于多个语种的解码规则,那此解码规则中所有有关各自语种的解码细则一般应都是直播方式(或都是TTS方式)的。Although all the decoding rules about their respective languages in a decoding rule applicable to multiple languages can have both live mode and TTS mode at the same time, but as in a live mode (or TTS mode) pickup, if the decoding is in accordance with the applicable multi-language The decoding rules for each language, then all the decoding rules for the respective languages in the decoding rules should generally be in the live broadcast mode (or all in the TTS mode).

如果有拾音器,其内部既有TTS器件可用来发音,也可用MCU或MCU外接的语音芯片来发音(即解码时可采用直接产音方式),此拾音器解码时有时采用直接产音方式有时采用TTS产音方式,那此拾音器可称为既采用直接产音方式又采用TTS产音方式的拾音器,此拾音器内一般有多个子发音器件,这多个子发音器件被当成一个发音器件,此拾音器可在对某语种的拼合信息的值执行解码时用TTS发音方式来播放语句发音而在对另外语种的拼合信息的值执行解码时可用直接发音方式来播放语句发音,或对同一语种的拼合信息的值执行解码时既有用直接发音方式也有用TTS发音方式(比如可对此语种不同拼合信息的值进行解码时分别用直接发音方式和TTS发音方式,也可在对表示一个语句发音的拼合信息的值进行解码时对此语句的部分用直接发音方式发音而对此语句的另外部分用TTS发音方式发音等),那显然此拾音器涉及的各部分(硬件及编解码等)技术原理可参考直播方式拾音器和TTS方式拾音器涉及的各部分(硬件及编解码等)技术原理。If there is a pickup, there is a TTS device inside it that can be used for pronunciation, and it can also be pronounced by an MCU or an external voice chip connected to the MCU (that is, the direct sound production method can be used when decoding). This pickup sometimes uses the direct sound production method and sometimes uses TTS when decoding. The sound production method, the pickup can be called a pickup that adopts both the direct sound production method and the TTS sound production method. Generally, there are multiple sub-sounding devices in the pickup, and these multiple sub-sounding devices are regarded as a sounding device. This pickup can be used in When decoding the value of the combined information in a certain language, use the TTS pronunciation method to play the pronunciation of the sentence, and when decoding the value of the combined information in another language, you can use the direct pronunciation method to play the pronunciation of the sentence, or for the value of the combined information in the same language Both the direct pronunciation mode and the TTS pronunciation mode are useful when performing decoding (for example, the direct pronunciation mode and the TTS pronunciation mode can be used respectively when decoding the values of different combination information of this language, and the value of the combination information representing the pronunciation of a sentence can also be used. When decoding, the part of the sentence is pronounced in the direct pronunciation mode and the other part of the sentence is pronounced in the TTS pronunciation mode, etc.), then obviously the technical principles of each part (hardware and codec, etc.) involved in this pickup can refer to the live pickup. And the technical principles of each part (hardware, codec, etc.) involved in the TTS mode pickup.

对在权利要求2,10,19,22中提及的拾音器并没有特别限制,在权利要求2,10,19或22中提及的拾音器可能只能用TTS器件来发音(即采用TTS产音方式的拾音器),也可能只能用MCU或MCU外接的语音芯片来发音(即采用直接产音方式的拾音器),也可能既能用TTS器件来发音也能用MCU或MCU外接的语音芯片来发音(即既采用直接产音方式又采用TTS产音方式的拾音器),还可能是采用其它种类的发音器件来发音的;本说明书中各处提及的对其没作采用直接产音方式或采用TTS产音方式或既采用直接产音方式又采用TTS产音方式这样来限定的拾音器也是这样的没有特别限制(除非提及时有特别指明不是这样或可从提及处的上下文推出不是这样)。本发明文件说的既采用直接产音方式又采用TTS产音方式的拾音器,此一个拾音器内的一个发音器件可能由多个子发音器件组成(比如一个拾音器内的子发音器件既有带喇叭的TTS芯片,也有带喇叭的MCU外接的语音芯片),一般可对能解码的语种中的各个语种的拼合信息的值进行解码时用各自相应的子发音器件发音(比如对一个语种的拼合信息的值进行解码时用带喇叭的TTS芯片发音即采用TTS产音方式,对另一个语种的拼合信息的值进行解码时用带喇叭的MCU外接的语音芯片发音即采用直接产音方式),甚至可对同一语种的不同拼合信息的值解码时用不同的子发音器件发音,权利要求2,10,19或22里述的拾音器也可以是这种一个拾音器有一个发音器件且这一个发音器件由多个子发音器件组成的情况。对于既采用直接产音方式又采用TTS产音方式的拾音器,其发音器件也可能就是一个子发音器件,此一个子发音器件既可采用直接产音方式发音,又可采用TTS产音方式发音,权利要求2,10,19或22里述的拾音器也可以是一个拾音器有一个这种发音器件的情况。There is no special limitation to the pickups mentioned inclaims 2, 10, 19, and 22, and the pickups mentioned inclaims 2, 10, 19 or 22 may only be pronounced with TTS devices (i.e., adopt TTS sound production mode of pickup), or it may only be pronounced by MCU or a voice chip external to MCU (that is, a pickup that uses a direct sound production method), or it may be possible to use both a TTS device for pronunciation and an MCU or a voice chip external to MCU. Pronunciation (that is, the pickup that both adopts the direct sound production mode and the TTS sound production mode) may also be pronounced by using other types of pronunciation devices; Pickups defined as TTS sound production or both direct sound production and TTS sound production are also not specifically limited (unless the reference specifically indicates otherwise or can be deduced otherwise from the context of the reference) . What the document of the present invention said not only adopts the pickup of the direct sound production mode but also adopts the TTS sound production mode, a sounding device in this pickup may be made up of a plurality of sub-sounding devices (such as the sub-sounding device in a pickup has a TTS with horn chip, and there is also a voice chip externally connected to the MCU with a loudspeaker), generally the value of the combined information in each language that can be decoded can be decoded with the corresponding sub-pronunciation device (for example, the value of the combined information of a language When decoding, use the TTS chip with a speaker to pronounce (that is, adopt the TTS sound production method; when decoding the value of the spliced information in another language, use the voice chip externally connected to the MCU with a speaker to pronounce (that is, use the direct sound production method), even for When the value decoding of different combination information of the same language is pronounced with different sub-pronunciation devices, the pickup described inclaim 2, 10, 19 or 22 can also be such that a pickup has a pronunciation device and this pronunciation device is composed of a plurality of sub-pronunciation devices. The composition of the pronunciation device. For the pickup that not only adopts the direct sound production mode but also adopts the TTS sound production mode, its pronunciation device may also be a sub-sounding device, and this sub-sounding device can both adopt the direct sound production mode to pronounce, and can adopt the TTS sound production mode to pronounce again. The pickup described inclaim 2, 10, 19 or 22 can also be the case that a pickup has a sounding device of this kind.

在某一解码规则中有关某语种的解码细则涉及的发音形式可以全用直接发音方式(或全用TTS发音方式),另如果某解码规则中有关此语种的解码细则涉及的发音既有用直接发音方式使直接产音方式子发音器件(如MCU或MCU外接的语音芯片等)发音也有用TTS发音方式使TTS器件发音(比如可对此语种不同拼合信息的值进行解码时分别用直接发音方式和TTS发音方式,也可在对表示一个语句发音的拼合信息的值进行解码时对此语句的部分用直接发音方式发音而对此语句的另外部分用TTS发音方式发音等),可把这种发音形式称为用混合发音方式(显然解码时涉及的发音形式是用混合发音方式的拾音器是既采用直接产音方式又采用TTS产音方式的拾音器,但既采用直接产音方式又采用TTS产音方式的拾音器可能对有些语种的拼合信息的值解码时涉及的发音形式是全用直接发音方式或全用TTS发音方式);编码也可能涉及到用混合发音方式。本说明书将其涉及的发音形式是用混合发音方式的只适用于一个语种的解码规则称为混合方式的只适用于一个语种的解码规则,将其涉及的发音形式是用混合发音方式的适用于多个语种的解码规则中有关其中某语种的解码细则称为解码规则中有关某语种的混合方式的解码细则,将按其涉及的发音形式是用混合发音方式的解码规则中有关一个语种的解码细则来作的解码处理称为混合方式的解码处理,将按其涉及的发音形式是用混合发音方式的解码规则中有关一个语种的解码细则来作的解码称为混合方式的解码,此混合方式显然可看成是直播方式或TTS方式的等同替换。本说明书将用其编码得到的各拼合信息的值是本来用于采用不同发音方式来解码的只适用于一个语种的编码规则称为混合方式的只适用于一个语种的编码规则,将用其编码得到的各拼合信息的值是本来用于采用不同发音方式来解码的适用于多个语种的编码规则中有关其中某语种的编码细则称为编码规则中有关某语种的混合方式的编码细则,将按用其编码得到的各拼合信息的值是本来用于采用不同发音方式来解码的编码规则中有关一个语种的编码细则来作的编码处理称为混合方式的编码处理,将按用其编码得到的各拼合信息的值是本来用于采用不同发音方式来解码的编码规则中有关一个语种的编码细则来作的编码称为混合方式的编码;混合方式显然可看成是直播方式或TTS方式的等同替换。本发明文件所说的混合方式的编解码处理是指混合方式的编码处理和混合方式的解码处理。如提及时无特别注明有他义,本段提及的编码规则默认是只适用于一个语种或适用于多个语种的,如提及时无特别注明有他义,本段提及的解码规则默认是只适用于一个语种或适用于多个语种的。本段和上面3段和下面3段所说的一个语种可以是任一非复合语种或任一复合语种;本段和上面3段和下面3段所说的多个语种可全是非复合语种或全是复合语种或既含有非复合语种也含有复合语种。The pronunciation forms involved in the decoding rules of a certain language in a certain decoding rule can all use the direct pronunciation method (or all use the TTS pronunciation method), and if the pronunciations involved in the decoding rules of a certain language in a certain decoding rule can use direct pronunciation The way to make the sub-pronunciation device (such as MCU or MCU external voice chip, etc.) of the direct sound production method is also useful to make the TTS device sound in the TTS pronunciation method (for example, the direct pronunciation method and the value of the combined information of this language can be decoded respectively) TTS pronunciation mode, also can when decoding the value of the combined information representing a sentence pronunciation, the part of this sentence is pronounced with direct pronunciation mode and the other part of this sentence is pronounced with TTS pronunciation mode etc.), this pronunciation can be The form is called using the mixed pronunciation method (obviously the pronunciation form involved in decoding is to use the mixed pronunciation method). The pickup of the mode may relate to the pronunciation form when decoding the value of the pieced information of some languages is the direct pronunciation mode or the TTS pronunciation mode); encoding may also relate to the mixed pronunciation mode. In this manual, the pronunciation form involved is a decoding rule that is only applicable to one language in a mixed pronunciation mode, and is called a decoding rule that is only applicable to a language in a mixed mode, and the pronunciation form it involves is a decoding rule that is applicable to a mixed pronunciation mode. The decoding rules about a certain language in the decoding rules of multiple languages are called the decoding rules about the mixed mode of a certain language in the decoding rules. The decoding processing done by the detailed rules is called the decoding processing of the mixed mode, and the decoding is done according to the decoding rules of a language in the decoding rules of the mixed pronunciation mode according to the pronunciation form involved, which is called the decoding of the mixed mode. Obviously, it can be regarded as an equivalent replacement of the live broadcast method or the TTS method. In this manual, the value of each piece of information obtained by encoding is originally used for decoding with different pronunciation methods. The encoding rule that is only applicable to one language is called the mixed encoding rule that is only applicable to one language. It will be used to encode The obtained value of each stitched information is originally used to decode in different pronunciation ways and is applicable to multiple languages. The coding rules about a certain language in the coding rules are called the coding rules about the mixed mode of a certain language in the coding rules. The value of each piece of information obtained by encoding is originally used to decode the encoding rules of a language in the encoding rules using different pronunciation methods. The encoding process is called the mixed encoding process, which will be obtained by encoding. The value of each stitching information is originally used to use different pronunciation methods to decode the coding rules about a language in the encoding rules. The encoding is called mixed encoding; the mixed approach can obviously be regarded as the live mode or TTS mode. equivalent replacement. The hybrid encoding and decoding process mentioned in the document of the present invention refers to the encoding process of the mixed mode and the decoding process of the mixed mode. If there is no special indication when it is mentioned, the encoding rules mentioned in this paragraph are only applicable to one language or multiple languages by default. If there is no special indication when it is mentioned, the decoding mentioned in this paragraph By default, the rules are only applicable to one language or applicable to multiple languages. One of the languages mentioned in this paragraph and the above 3 paragraphs and the following 3 paragraphs can be any non-composite language or any compound language; the multiple languages mentioned in this paragraph and the above 3 paragraphs and the following 3 paragraphs can all be non-compound languages or All are compound languages or contain both non-compound languages and compound languages.

解码规则中有关某语种的解码细则涉及的发音形式除了可以全用直接发音方式或全用TTS发音方式或是用混合发音方式外,还可能还有别的形式,本说明书主要以全用直接发音方式,全用TTS发音方式为例来阐述。The pronunciation forms involved in the decoding rules of a certain language in the decoding rules can be used in addition to direct pronunciation, TTS pronunciation or mixed pronunciation. There may also be other forms. This manual mainly uses direct pronunciation The method is explained by using the TTS pronunciation method as an example.

因本说明书中各编解码规则举例都是只适用于一个语种的,所以可以说这些举例的编码规则(或解码规则)是直播方式(或TTS方式或混合方式)的;而如编码规则是适用于多个语种的时,那对其中某一语种可说成编码规则里有关此语种的编码细则是直播方式(或TTS方式或混合方式)的,如解码规则是适用于多个语种的时,那对其中某一语种可说成解码规则里有关此语种的解码细则是直播方式(或TTS方式或混合方式)的(权利要求书里主要是按这样来说的)。Because each encoding and decoding rule example in this manual is only applicable to one language, it can be said that the encoding rules (or decoding rules) of these examples are in the live mode (or TTS mode or mixed mode); and if the encoding rule is applicable For multiple languages, for one of the languages, it can be said that the coding details of this language in the coding rules are live broadcast (or TTS or mixed). If the decoding rules are applicable to multiple languages, It can be said that the decoding details about this language in the decoding rules are live mode (or TTS mode or mixed mode) for a certain language among them (mainly according to such terms in the claims).

如将其整页面上的各码字涉及的各拼合信息的值都是按TTS方式的编码处理(比如按某编码规则中有关某一个语种的TTS方式的编码细则来作的)来编码生成的的有声读物称为TTS方式的语音编码有声读物,将其整页面上的各码字涉及的各拼合信息的值都是按直播方式的编码处理(比如按某编码规则中有关某一个语种的直播方式的编码细则来作的)来编码生成的的有声读物称为直播方式的语音编码有声读物;那一般情况下,对于同一语种,用于编码生成TTS方式的语音编码有声读物的整页面上的各码字涉及的此语种的各拼合信息的值的编码规则中有关此语种的TTS方式的编码细则一般与用于编码生成直播方式的语音编码有声读物的整页面上的各码字涉及的此语种的各拼合信息的值的另一编码规则中有关此语种的直播方式的编码细则相同,这在后面有述;TTS方式拾音器与直播方式拾音器是不同的;甚至同一本语音编码有声读物的整页面上的码字既可由TTS方式拾音器也可由直播方式拾音器读取并解码发音,显然这里说的码字应可被这两种拾音器的读码头分别读取,当然这里要求被读取的语音编码有声读物的整页面上的码字涉及的拼合信息的值是TTS方式拾音器或直播方式拾音器能解码的。当然对于某一个语种,如特别做成用于编码生成TTS方式的语音编码有声读物的整页面上的各码字涉及的此语种的拼合信息的值的编码规则中有关此语种的TTS方式的编码细则与用于编码生成直播方式的语音编码有声读物的整页面上的各码字涉及的此语种的拼合信息的值的另一编码规则中有关此语种的直播方式的编码细则不同,使TTS方式的语音编码有声读物整页面上的此语种的各码字只能由TTS方式拾音器读取,使直播方式的语音编码有声读物的整页面上的此语种的各码字只能由直播方式拾音器读取,也是可以的。For example, the value of each piece of information involved in each code word on the entire page is coded and generated according to the encoding process of the TTS mode (for example, according to the encoding rules of the TTS mode of a certain language in a certain encoding rule) The audiobooks are called TTS voice-encoded audiobooks, and the values of each piece of information involved in each codeword on the entire page are encoded according to the live mode (for example, according to the live broadcast of a certain language in a certain encoding rule) The audiobooks generated by coding are called live speech coded audiobooks; in general, for the same language, the audiobooks used to encode and generate TTS speech coded audiobooks on the entire page In the coding rules for the value of each mosaic information of this language involved in each codeword, the coding rules related to the TTS method of this language are generally related to the coding rules for each codeword on the entire page of the speech coded audiobook used to generate the live broadcast method. The coding details about the live broadcast mode of this language are the same in another coding rule of the value of each stitched information of the language, which is described later; the TTS mode pickup is different from the live broadcast mode pickup; The codewords on the page can be read and decoded by both TTS pickups and live broadcast pickups. Obviously, the codewords mentioned here should be read separately by the reading terminals of these two pickups. Of course, the voice code to be read is required here. The value of the spliced information involved in the codeword on the entire page of the audiobook can be decoded by the pickup in the TTS mode or the pickup in the live broadcast mode. Of course, for a certain language, if it is specially made to be used for encoding and generating the TTS mode encoding of the value of the splicing information of this language involved in each code word on the entire page of the TTS mode speech coded audiobook. The detailed rules are different from the coding rules related to the live broadcast method of this language in another coding rule used to encode the value of the splicing information of this language involved in each code word on the entire page of the speech coded audiobook in the live broadcast method, so that the TTS method The codewords of this language on the entire page of the voice-coded audiobook can only be read by the TTS mode pickup, so that the codewords of this language on the entire page of the voice-coded audiobook in the live broadcast mode can only be read by the live broadcast mode pickup It is also possible to take.

整步TTS发音方式:是指在解码时,MCU按拼合信息的值,把各音码的拼法数据(其中也可能有音码的发音信息数据)按某种约定顺序排组起来成语句的预备发音数据,并且由MCU将语句的全部预备发音数据传给TTS器件(比如TTS芯片)以播出语句的全部发音,或由MCU将语句的非全部预备发音数据传给TTS器件(比如TTS芯片)以播出语句的非全部发音;一般情况,整步TTS发音方式是指在解码时,MCU按拼合信息的值,把各音码的拼法数据(其中也可能有音码的发音信息数据)按这些数据对应的发音在语句发音中的顺序排组起来成语句的预备发音数据(语句的预备发音数据即语句的拼法格式数据,语句的拼法格式数据里可能既有音码的拼法数据也有音码的发音信息数据,或只有音码的拼法数据,或只有音码的发音信息数据),并且由MCU将预备发音数据传给TTS器件(比如TTS芯片)以播出语句的全部发音;如述及时无特别指明有他义,本说明书各处在述及整步TTS发音方式时默认是以从本段第4行‘一般’至本段第9行‘全部发音’为止所述方式为例的。整步TTS发音方式适用于印欧语系各语种和非印欧语系各语种和任一复合语种。Whole-step TTS pronunciation method: refers to that when decoding, the MCU arranges the spelling data of each phonetic code (which may also have the pronunciation information data of the phonetic code) according to the value of the combined information during decoding to form a sentence Prepare pronunciation data, and pass all the preparation pronunciation data of sentence to TTS device (such as TTS chip) by MCU to broadcast the whole pronunciation of sentence, or pass the non-all preparation pronunciation data of sentence to TTS device (such as TTS chip) by MCU ) to broadcast the non-full pronunciation of the sentence; in general, the full-step TTS pronunciation mode refers to that when decoding, the MCU presses the value of the combined information to convert the spelling data of each phonetic code (therein may also have the pronunciation information data of the phonetic code) ) by the pronunciations corresponding to these data in the order of the pronunciation of the sentence to form the prepared pronunciation data of the sentence (the prepared pronunciation data of the sentence is the spelling format data of the sentence, and the spelling of the existing phonetic code may be arranged in the spelling format data of the sentence The method data also has the pronunciation information data of phonetic sign indicating number, or only have the spelling data of phonetic sign indicating number, or only have the pronunciation information data of phonetic sign indicating number), and the preparation pronunciation data is sent to TTS device (such as TTS chip) by MCU to broadcast sentence All pronunciation; if there is no special meaning when it is mentioned, when referring to the full-step TTS pronunciation method in this manual, the default is from the fourth line of this paragraph "general" to the ninth line of this paragraph "full pronunciation" way as an example. The full-step TTS pronunciation method is applicable to all Indo-European languages and non-Indo-European languages and any compound languages.

分步TTS发音方式:即MCU在对拼合信息的值进行解码时,把各播放发音的动作穿插在解码过程中,这些播放发音的动作产生的发音在听觉上是连续的而且形成语句的全部发音或非全部发音;比如语句的全部(或非全部)预备发音数据可看成由一些部分预备发音数据组成的,解码过程中MCU每得到一个发音接续的部分预备发音数据,就将此部分预备发音数据(部分预备发音数据里可能既有音码的拼法数据也有音码的发音信息数据,或者只有音码的拼法数据,或者只有音码的发音信息数据)传给TTS器件(比如TTS芯片)播放出音来,照这样MCU就可把组成语句的全部(或非全部)预备发音数据的各部分预备发音数据依次分次传给TTS器件发音(每次传一个部分预备发音数据),一直到执行完全部解码为止,那些部分预备发音数据各自的发音在听觉上是连续的即形成所解码语句的全部发音(或非全部发音),本说明书默认是以这种形式来阐述分步TTS发音方式的;这里说的发音接续的部分预备发音数据是指:在解码过程中,如刚才得到一个部分预备发音数据并将其传给TTS器件播放出音来,而在需发的音中紧接此刚才一个部分预备发音数据对应的发音后的发音,正是现在得到的这个部分预备发音数据对应的发音,那现在得到的这个部分预备发音数据就是发音接续的部分预备发音数据(本发明文件其它各处述及分步TTS发音方式时,提及的发音接续的部分预备发音数据也是按这样理解);如解码过程中得到一个部分预备发音数据但不是发音接续的部分预备发音数据,那就可先放一下等解码到该此部分预备发音数据发音时再将其传给TTS器件播放出音来,当然解码时得到的此次对一拼合信息的值解码发音中需最开始发音对应的部分预备发音数据应算为是发音接续的部分预备发音数据并应将其传给TTS器件发音;分步TTS发音方式还可以有别的具体形式,比如MCU这边在解码过程中分次(不一定按发音次序)把各部分预备发音数据传给TTS器件,但要到要传的语句的全部或非全部预备发音数据都传完后TTS器件才发音,这种形式可看成是前面那种(分步)形式的等同替换。显然一个部分预备发音数据是由数个音码的拼法数据(其中也可能有音码的发音信息数据)组成,这些数据一般是按它们对应的发音在此部分预备发音数据对应的发音中的顺序排列的(也可按其它约定顺序排列);分步TTS发音方式适用于印欧语系各语种和非印欧语系各语种和任一复合语种。(本发明文件在叙述TTS技术方式时提及的部分预备发音数据是默认指在用分步TTS发音方式时各步发音过程涉及的部分预备发音数据)Step-by-step TTS pronunciation method: that is, when the MCU decodes the value of the combined information, it intersperses the actions of playing pronunciations in the decoding process. The pronunciations produced by these actions of playing pronunciations are continuous in hearing and form the entire pronunciation of the sentence or non-full pronunciation; for example, all (or not all) of the prepared pronunciation data of a sentence can be regarded as being composed of some partial prepared pronunciation data, and in the decoding process, the MCU every time obtains a partial prepared pronunciation data of a pronunciation continuation, and just this part of the prepared pronunciation Data (may have the spelling data of phonetic sign indicating number and the pronunciation information data of phonetic sign indicating number in part preparation pronunciation data, or only have the spelling data of phonetic sign indicating number, or only have the pronunciation information data of phonetic sign indicating number) pass to TTS device (such as TTS chip ) to play the sound, in this way the MCU can send all (or not all) pre-pronunciation data of each part of the pre-pronunciation data to the TTS device pronunciation in turn (passing a part of the pre-pronunciation data) successively, always Until the complete decoding is performed, the respective pronunciations of those parts of the prepared pronunciation data are auditorily continuous, that is, all pronunciations (or not all pronunciations) of the decoded sentence are formed. This manual defaults to explain the step-by-step TTS pronunciation in this form mode; the part of the prepared pronunciation data that is said here to continue the pronunciation refers to: in the decoding process, as just getting a part of the prepared pronunciation data and sending it to the TTS device to play the sound, and the sound that needs to be sent is followed by This pronunciation after the pronunciation corresponding to a part of the preliminary pronunciation data just now is just the pronunciation corresponding to this part of the preliminary pronunciation data obtained now, then this part of the preliminary pronunciation data obtained now is exactly the part of the preliminary pronunciation data of pronunciation continuation (the document of the present invention other When referring to the step-by-step TTS pronunciation mode everywhere, the part of the preparation pronunciation data that the pronunciation of mentioning is continued is also understood like this); Obtain a part preparation pronunciation data but not the part preparation pronunciation data of pronunciation continuation as in the decoding process, then can Put it on for a while and wait until the part of the prepared pronunciation data is decoded and then pass it to the TTS device to play the sound. Of course, the value of the paired information obtained during decoding needs to be prepared for the part corresponding to the initial pronunciation in the decoding pronunciation. Pronunciation data should be counted as part of the continuation of the pronunciation and should be passed to the TTS device for pronunciation; the step-by-step TTS pronunciation method can also have other specific forms, such as the MCU here in the decoding process. Pronunciation sequence) each part prepares the pronunciation data and passes on to TTS device, but TTS device just pronounces after all or non-all preparation pronunciation data of the sentence to be transmitted has passed, and this form can be regarded as the kind of (point in front) step) equivalent replacement. Obviously a part prepares pronunciation data to be made up of spelling data (wherein the pronunciation information data of phonetic sign indicating number may also be arranged) of several phonetic sign indicating numbers, and these data are generally by their corresponding pronunciation in the corresponding pronunciation of this part preparing pronunciation data Arranged sequentially (arranged in other agreed orders); the step-by-step TTS pronunciation method is applicable to all Indo-European languages and non-Indo-European languages and any compound language. (the part of the prepared pronunciation data that the document of the present invention mentions when narrating the TTS technical mode refers to the part prepared pronunciation data that each step pronunciation process relates to when using the step-by-step TTS pronunciation mode by default)

如阐述时无特别指明有其他情况,以下各有关TTS技术的阐述默认是以整步TTS发音方式为例来阐述的。If there are no other circumstances specified in the explanation, the following explanations about TTS technology are by default based on the full-step TTS pronunciation method as an example.

在硬件上,TTS技术是常规技术,本发明文件各处所称TTS器件,是指可以完成将文本转成语音功能的器件;而其内部的处理算法一般有比如LPC参数合成,共振峰合成,波形合成等,虽然不同的TTS器件内部可能采用不同的处理算法,但都应属于本发明文件所指的TTS器件的范围;TTS器件可以是单独的TTS芯片(现在市面有现成的TTS芯片),所以拾音器硬件上可用一个读码解码用的主MCU与一个TTS芯片(文本转语音芯片)连接使用,由主MCU读码,解码并在解码时可将语句的全部或非全部预备发音数据(语句的预备发音数据是语句的拼法格式数据,是文本流的形式)通过通讯线传输给TTS芯片(这里是以整步TTS发音方式为例的),由TTS芯片完成将文本流形式的语句的全部或非全部预备发音数据转成语音的工作,这样一来,与拾音器有关的存储器集里就不需存储诸音码的发音数据了;如将主MCU与TTS器件集成在一个芯片里,也是可以的,其与分开成两芯片(一个主MCU,一个TTS芯片)本质相同;以下主要以主MCU与一个TTS芯片连接使用(主MCU芯片与TTS芯片是各自独立的芯片)为例来阐述本发明有关TTS方式的相关技术。On hardware, TTS technology is a conventional technology, and the TTS device referred to in the documents of the present invention refers to a device that can complete the text into speech function; and its internal processing algorithm generally has such as LPC parameter synthesis, formant synthesis, waveform Synthesizing etc., although different TTS device internals may adopt different processing algorithm, all should belong to the scope of the TTS device that document of the present invention refers to; On the pick-up hardware, a main MCU for code reading and decoding can be connected with a TTS chip (text-to-speech chip), and the main MCU can read codes, decode and when decoding, all or part of the sentence can be prepared for pronunciation data (statement The prepared pronunciation data is the spelling format data of the sentence, which is in the form of text stream) and is transmitted to the TTS chip through the communication line (here is the full-step TTS pronunciation method as an example), and the TTS chip completes all the sentences in the form of text stream Or the work that not all preparation pronunciation data is changed into speech, like this, just need not store the pronunciation data of all sound codes in the memory set relevant to pickup; As main MCU and TTS device are integrated in a chip, also can It is essentially the same as being divided into two chips (a main MCU and a TTS chip); the following mainly uses the connection between the main MCU and a TTS chip (the main MCU chip and the TTS chip are independent chips) as an example to illustrate the present invention Related technologies related to the TTS method.

以下是TTS方式的实施方案,以主MCU与一个TTS芯片连接使用为例,以WINBOND的WTS701作为TTS芯片例子。以下有关TTS方式的实施方案涉及的以只适用于一个语种为例的编码规则举例及解码规则举例仍是沿承前面有关直播方式的实施方案涉及的直播方式的以只适用于一个语种为例的编码规则举例及解码规则举例,但需在本说明书前面所述的在直播方式的以只适用于一个语种为例的解码规则里使用直接发音方式的技术基础上作些调整修正,其中解码规则需改动,涉及的解码数据需改动(因TTS方式时的编解码规则举例里编码数据是与解码数据相同的,所以也需对编码数据同步改动,以下TTS方式提及的数据的改动默认是同时作用于编码数据和解码数据的,编码规则其余部分可不用改动)。(另外,显然也可根据TTS技术发音的特点而采用别的编解码规则)The following is the implementation plan of the TTS method. Take the main MCU connected with a TTS chip as an example, and take the WTS701 of WINBOND as an example of the TTS chip. The following examples of encoding rules and decoding rules that are only applicable to one language involved in the implementation plan of the TTS method are still following the example of the live broadcast method that is only applicable to one language in the implementation plan related to the previous live broadcast method. Examples of encoding rules and decoding rules, but some adjustments and amendments need to be made on the technical basis of using the direct pronunciation method in the decoding rules that are only applicable to one language in the live broadcast mode described earlier in this manual, and the decoding rules need to be Change, the decoding data involved needs to be changed (because the encoded data and the decoded data in the codec rule example in the TTS mode are the same, so the encoded data needs to be changed synchronously, and the data changes mentioned in the TTS mode below are by default effective at the same time For encoding data and decoding data, the rest of the encoding rules do not need to be changed). (In addition, it is obvious that other codec rules can be adopted according to the characteristics of TTS technology pronunciation)

本说明书以WINBOND的WTS701作为TTS芯片例子。WTS701的主要特点是:此种芯片有几个系列,有单独用于播英语的(WTS701EF),有单独用于播汉语的(WTS701MF);同属于WTS701系列的WTS701EF与WTS701MF,其主要区别是输入文本流的格式上,其它诸如硬件脚位,底层通讯协议,发设置音量大小,设置播放语速等控制命令的方式都基本相同;WTS701EF芯片的英语的输入文本流一般由表示字母的ASCII字符(比如十六进制0X41表示字符‘A’)组成,文本流里面可插入音标字符;WTS701MF芯片的汉语的输入文本流一般由表示汉字的大五码(BIG5)组成,文本流里面可插入拼音字符。This manual uses WTS701 of WINBOND as an example of TTS chip. The main features of WTS701 are: there are several series of this kind of chip, some are used to broadcast English (WTS701EF), and some are used to broadcast Chinese (WTS701MF); In terms of the format of the text stream, other control commands such as hardware pins, underlying communication protocols, sending and setting the volume, and setting the playback speed are basically the same; the English input text stream of the WTS701EF chip is generally composed of ASCII characters representing letters ( For example, the hexadecimal 0X41 represents the character 'A'), and phonetic characters can be inserted into the text stream; the Chinese input text stream of the WTS701MF chip is generally composed of big five codes (BIG5) representing Chinese characters, and pinyin characters can be inserted into the text stream .

整步TTS发音方式时,MCU这边解码时涉及的语句的预备发音数据要改成文本流形式,并由MCU将语句的全部预备发音数据传给TTS芯片以播出语句的全部发音,或由MCU将语句的非全部预备发音数据传给TTS芯片以播出语句的非全部发音。分步TTS发音方式时,MCU这边解码时每次传的部分预备发音数据也要改成文本流形式,并由MCU将组成语句的全部预备发音数据的各部分预备发音数据依次分次传给TTS芯片发音,以形成语句的全部发音,或由MCU将组成语句的非全部预备发音数据的各部分预备发音数据依次分次传给TTS芯片发音,以形成语句的非全部发音。In the whole-step TTS pronunciation mode, the pre-pronunciation data of the sentence involved in the MCU decoding should be changed to the text stream form, and the MCU will send all the pre-pronunciation data of the sentence to the TTS chip to broadcast all the pronunciation of the sentence, or by The MCU transmits the non-full preparation pronunciation data of the sentence to the TTS chip to broadcast the non-full pronunciation of the sentence. In the step-by-step TTS pronunciation mode, the part of the prepared pronunciation data transmitted each time by the MCU is converted into a text stream, and the MCU sends all the prepared pronunciation data of the sentence to each part of the prepared pronunciation data sequentially. The TTS chip pronounces to form the entire pronunciation of the sentence, or the MCU sends each part of the prepared pronunciation data that constitutes the non-all prepared pronunciation data of the sentence to the TTS chip for pronunciation in order to form the partial pronunciation of the sentence.

先述采用TTS产音方式时印欧语系语种的编解码处理:The codec processing of the Indo-European languages when the TTS sound production method is used is described first:

显然,印欧语系任一语种的TTS方式的编解码对音码的定义方式及使用的音码情况是与此语种的直播方式的编解码对音码的定义方式及使用的音码情况一般是相同或兼容的;可被当成此语种的TTS方式编解码涉及的音码的,可以是元辅音或元音辅音组合,也可以是词根,单词,单词组合或范句,也可以是字符组合或字母;可以由这些音码中的一种或多种音码来拼合语句集发音,这基本上与此语种的直播方式的编解码时的情况是一样的。Apparently, the definition of the phonetic codes and the phonetic codes used by the TTS codec of any Indo-European language is generally the same as the definition of the phonetic codes and the phonetic codes used by the live codec of this language. The same or compatible; it can be regarded as the phonetic code involved in the TTS encoding and decoding of this language, which can be a vowel and consonant or a combination of vowels and consonants, or a root, word, word combination or model sentence, or a combination of characters or Letters; one or more phonetic codes in these phonetic codes can be used to assemble the pronunciation of the sentence set, which is basically the same as the encoding and decoding of the live broadcast mode of this language.

对于印欧语系任一语种,语音编码有声读物的整页面上的码字涉及的表示此语种语句发音的拼合信息的值可以是按照某种编码规则中有关此语种的TTS方式的编码细则来编码生成的(此编码规则可以是只适用于此语种或适用于包括此语种在内的多个语种的)。For any language of the Indo-European language family, the value of the combined information representing the pronunciation of sentences in this language involved in the codewords on the entire page of the phonetic coded audiobook can be coded according to the coding rules of the TTS method for this language in a certain coding rule Generated (this coding rule can be only applicable to this language or applicable to multiple languages including this language).

对于印欧语系任一语种,语音编码有声读物的整页面上的码字涉及的表示此语种语句发音的拼合信息的值是可被某种解码规则中有关此语种的TTS方式的解码细则来解码的(此解码规则可以是只适用于此语种或适用于包括此语种在内的多个语种的,此表示此语种语句发音的拼合信息的值一般是按照某种编码规则中有关此语种的TTS方式的编码细则来编码生成的且此解码规则是与此编码规则有关此语种对应的解码规则)。For any language of the Indo-European language family, the value of the combined information representing the pronunciation of sentences in this language involved in the codewords on the entire page of the phonetic coded audiobook can be decoded by the decoding rules related to the TTS method of this language in a certain decoding rule (This decoding rule can be applicable only to this language or to multiple languages including this language. The value of the combined information indicating the pronunciation of sentences in this language is generally the TTS of this language in a certain encoding rule. It is generated by encoding according to the encoding rules of the method, and this decoding rule is related to this encoding rule and corresponds to this language).

以下是以英语作为印欧语系语种的例子来阐述印欧语系语种的TTS方式的编解码技术的。以下编码规则举例是以只适用于英语为例的,解码规则举例是以只适用于英语为例的。The following uses English as an example of an Indo-European language to illustrate the TTS encoding and decoding technology of the Indo-European language. The following examples of encoding rules are only applicable to English, and examples of decoding rules are only applicable to English.

TTS方式的以只适用于英语为例的编码规则是沿承前面有述的直播方式的以只适用于英语为例的编码规则,但需作些改动(主要改动的是编码数据),TTS方式的以只适用于英语为例的解码规则也是沿承前面有述的直播方式的以只适用于英语为例的解码规则,但需作些改动。The encoding rules of the TTS method, which are only applicable to English as an example, follow the encoding rules of the live broadcast method described above, which only apply to English as an example, but some changes are required (the main change is the encoding data), and the TTS method The decoding rule that is only applicable to English as an example is also the same as the decoding rule that only applies to English as an example in the live broadcast mode mentioned above, but some changes are required.

一种TTS方式的以只适用于英语为例的编码规则,包括如下一些编码细则:编码语句集范围完整定义,用于编码的音码的种类选择,对音码赋编号,单词数据库建立,词类的建立,各种词段编码方式,时态词形编码处理,陌生单词编码处理及附加头段信息等,同直播方式的以只适用于英语为例的编码规则一样,此TTS方式的以只适用于英语为例的编码规则是含有压缩编码处理的(因为此编码规则包括的那些编码细则中的对音码赋编号一般是有压缩处理的,几种词段编码方式也是有压缩编码处理的)。权利要求3,12,20或23中提及的编码规则,可以是本说明书提及或没提及的适用于印欧语系任一语种(英语或法语或德语或俄语等)的其有关此语种的编码细则含有压缩编码处理且是TTS方式的的各种编码规则(本句说的‘各种编码规则’中任一编码规则可能是只适用于此语种或适用于包括此语种在内的多个语种)。A TTS encoding rule that is only applicable to English as an example, including the following encoding rules: complete definition of the range of encoding sentence sets, selection of phonetic code types for encoding, numbering of phonetic codes, word database establishment, part of speech The establishment of various word segment encoding methods, tense word form encoding processing, unfamiliar word encoding processing and additional header information, etc., are the same as the encoding rules that are only applicable to English in the live broadcast mode. This TTS method only applies to English. The encoding rules that are applicable to English as an example contain compression encoding processing (because the numbering of phonetic codes in those encoding rules included in this encoding rule generally has compression processing, and several word segment encoding methods also have compression encoding processing ). The coding rule mentioned inclaim 3, 12, 20 or 23 can be any language (English or French or German or Russian etc.) that is applicable to the Indo-European language family (English or French or German or Russian, etc.) that is mentioned or not mentioned in this specification. The encoding rules contain compression encoding processing and various encoding rules in TTS mode (any encoding rule in the "various encoding rules" mentioned in this sentence may be only applicable to this language or applicable to multiple languages including this language. languages).

与上段述的TTS方式的以只适用于英语为例的编码规则有关英语对应的一种TTS方式的以只适用于英语为例的解码规则,包括如下一些解码细则:解码语句集范围完整定义,用于解码的音码的种类选择,对音码赋编号,单词数据库建立,词类的建立,各种词段解码方式,时态词形解码处理,陌生单词解码处理,头段信息读取及用发音形式播放语句发音等。同直播方式的以只适用于英语为例的解码规则一样,此TTS方式的以只适用于英语为例的解码规则是含有压缩解码处理的(因为此解码规则包括的那些解码细则中的对音码赋编号一般是有压缩处理的,几种词段解码方式也是有压缩解码处理的),这些压缩解码处理是与上段述的TTS方式的以只适用于英语为例的编码规则里的压缩编码处理对应的。Corresponding to the encoding rules of the TTS method in the above paragraph that are only applicable to English as an example, there is a decoding rule that is only applicable to English in a TTS method that corresponds to English, including the following decoding rules: a complete definition of the scope of the decoding sentence set, Selection of phonetic codes for decoding, assigning numbers to phonetic codes, establishment of word database, establishment of part of speech, various decoding methods of word segments, tense and word form decoding processing, unfamiliar word decoding processing, reading and using header information Pronunciation form playback sentence pronunciation, etc. The same as the decoding rules applicable only to English in the live broadcast mode, the decoding rules applicable to English in the TTS mode include compression decoding processing (because the decoding rules include those in the decoding rules. Code assignment numbers generally have compression processing, and several word segment decoding methods also have compression decoding processing), these compression decoding processing is the same as the compression coding in the coding rules of the TTS method mentioned in the previous paragraph, which is only applicable to English as an example deal with the corresponding.

权利要求3,12,20或23中提及的解码规则,可以是与本说明书提及或没提及的适用于印欧语系任一语种(英语或法语或德语或俄语等)的其有关此语种的编码细则含有压缩编码处理的任一编码规则(此编码规则可能是只适用于此语种或适用于包括此语种在内的多个语种的)有关此语种对应的本说明书提及或没提及的适用于此语种的其有关此语种的解码细则含有压缩解码处理且是TTS方式的的各种解码规则(这各种解码规则中任一解码规则可能是只适用于此语种或适用于包括此语种在内的多个语种的)。(本段提及的两个‘多个’不一定是相同数目;本段提及的‘对应’是指一般对应或专门对应)The decoding rule mentioned inclaim 3, 12, 20 or 23 may be related to any language (English or French or German or Russian) of the Indo-European language family (English or French or German or Russian, etc.) mentioned or not mentioned in this specification. The coding rules of the language contain any coding rules for compression coding processing (this coding rule may be applicable only to this language or to multiple languages including this language). And the decoding rules for this language applicable to this language include various decoding rules in the form of compressed decoding and TTS (any decoding rule in these various decoding rules may only be applicable to this language or apply to include multiple languages including this one). (The two "multiple" mentioned in this paragraph are not necessarily the same number; the "correspondence" mentioned in this paragraph refers to general correspondence or special correspondence)

TTS方式的以只适用于英语为例的解码规则涉及的发音形式是:解码表示此解码规则定义的解码语句集里任一语句的发音的拼合信息的值时播放语句发音用的发音方式都是用整步TTS发音方式。The pronunciation forms involved in the decoding rule of the TTS mode, which is only applicable to English as an example, are: when decoding the value of the combination information representing the pronunciation of any sentence in the decoding sentence set defined by this decoding rule, the pronunciation mode used for the pronunciation of the playback sentence is Use full-step TTS pronunciation.

因TTS器件(TTS芯片)一般自己带有处理变音(比如TTS器件自己按语句的拼法格式数据连读)的能力,所以印欧语系语种的TTS方式的编解码规则中可不用考虑处理变音。对于涉及形同音不同的单词(或单词组合),一般也是TTS器件(TTS芯片)自己处理发音(比如TTS器件自己按语句的拼法格式数据里各单词的词形形态的排列情况来将单词发出不同的音),所以印欧语系语种的TTS方式的编解码规则中可不用考虑处理形同音不同的发音。并且如发音涉及到变音和形同音不同的情况时,TTS器件一般要考察语句的拼法格式数据中各单词的词形形态的排列情况,所以解码这边一般要用整步TTS发音方式一次把语句的全部(或非全部)拼法格式数据传给TTS器件,而分步TTS发音方式一般可用于不涉及变音和形同音不同的情况。Because TTS devices (TTS chips) generally have the ability to process diacritics (for example, the TTS device itself reads data according to the spelling format of the sentence), so the encoding and decoding rules of the TTS method for Indo-European languages do not need to consider the processing of diacritics. sound. For different words (or word combinations) related to homonyms, generally the TTS device (TTS chip) itself processes the pronunciation (such as the arrangement of the morphological form of each word in the spelling format data of the sentence by the TTS device itself) different sounds), so in the codec rules of the TTS mode of the Indo-European languages, it is not necessary to consider the pronunciation of the same sound but different sounds. And if the pronunciation involves different situations of inflection and homophony, the TTS device generally needs to examine the arrangement of the morphological form of each word in the spelling format data of the sentence, so the whole-step TTS pronunciation method is generally used for decoding. All (or not all) of the spelling format data of the sentence is transmitted to the TTS device at one time, and the step-by-step TTS pronunciation mode can generally be used for situations that do not involve inflections and homonyms.

TTS方式的以只适用于英语为例的编码规则不考虑处理变音和形同音不同的情况(即对读物涉及的英语的任一个单词的任一词形形态,编码时都认为其只有一个发音,对读物涉及的英语的任一单词组合的任一词形形态,编码时都认为其只有一个发音),TTS方式的以只适用于英语为例的解码规则不考虑处理变音和形同音不同的情况(即对读物涉及的英语的任一个单词的任一词形形态,解码时都认为其只有一个发音,对读物涉及的英语的任一单词组合的任一词形形态,解码时都认为其只有一个发音)。The encoding rule of the TTS mode is only applicable to English as an example without considering the different situations of dealing with diacritics and homophones (that is, for any morphological form of any English word involved in the reading material, it is considered that there is only one during encoding) Pronunciation, for any morphological form of any English word combination involved in the reading material, it is considered to have only one pronunciation when encoding), the decoding rules of the TTS method, which are only applicable to English as an example, do not consider processing diacritics and identical The situation where the pronunciation is different (that is, for any morphological form of any English word involved in the reading material, it is considered to have only one pronunciation when decoding, and for any morphological form of any English word combination involved in the reading material, when decoding, it is considered to have only one pronunciation. think that it has only one pronunciation).

与直播方式比较,TTS方式时编解码涉及的语句定义仍然可以是一个句子,一个句子的部分或多个句子,涉及的语种仍然可以是印欧语系的各语种。Compared with the live broadcast mode, the sentence definition involved in the encoding and decoding in the TTS mode can still be a sentence, a part of a sentence or multiple sentences, and the languages involved can still be various languages of the Indo-European language family.

TTS方式的以只适用于英语为例的编码规则完整定义的编码语句集范围是任意英语语句,也即编码语句集包括英语的任意语句;TTS方式的以只适用于英语为例的解码规则完整定义的解码语句集范围是任意英语语句,也即解码语句集包括英语的任意语句。In the TTS method, the coding rules that are only applicable to English as an example completely define the scope of the coding sentence set is any English sentence, that is, the coding sentence set includes any sentence in English; in the TTS method, the decoding rules that are only applicable to English as an example are complete The scope of the defined decoding sentence set is any English sentence, that is, the decoding sentence set includes any sentence in English.

TTS方式的以只适用于英语为例的编解码规则与直播方式的以只适用于英语为例的编解码规则比较,编解码时,对音码的划分定义仍然可不用变(TTS方式时,虽然用来发音的主要是由音码转成的单词拼法,次要是音标,但仍是用音码来作拼合信息的值的编解码),用于编解码的主要音码的选择方式不用变,词的关联性定义可不用变,对音码的编号方式不用变,编码规则中的各种词段编码方式,时态词形编码处理等编码细则还是不用变,解码规则中的各种词段解码方式,时态词形解码处理等解码细则还是不用变,码字中的一个处理单位里各段(头段,单词段,时态词形选择段,陌生单词段等)的定义不用变,范句库格式不用变,在MCU这边不用再对音码录音。在单词数据库里一般不需再记录各单词及单词组合的发音信息数据(词形音标),但如记录还是可以的,虽然一般不用(在TTS芯片里有音标发音功能)。本段说的‘不用变’是指TTS方式与直播方式比较的不用变。(本段所述对于印欧语系其它语种也适合)Compared with the encoding and decoding rules applicable to English as an example of the TTS mode and the encoding and decoding rules applicable to English as an example of the live broadcast mode, during encoding and decoding, the definition of the division of the sound codes still does not need to be changed (in the case of the TTS mode, Although what is used for pronunciation is mainly the word spelling that is converted into by the phonetic sign indicating number, secondary is the phonetic symbol, but still uses the phonetic sign indicating number to do the coding and decoding of the value of assembling information), the selection mode that is used for the main phonetic sign indicating number of coding and decoding does not need Change, the association definition of word needn't be changed, the numbering mode of phonetic code needn't be changed, the coding rules such as various word segment coding modes in coding rules, tense word form coding processing still don't need to change, and various in decoding rules Word segment decoding methods, tense and word form decoding processing and other decoding rules still do not need to be changed, and the definitions of each segment (header segment, word segment, tense word form selection segment, unfamiliar word segment, etc.) in a processing unit in the code word do not need to be changed. Change, the format of the sample sentence library does not need to be changed, and there is no need to record the sound code on the MCU side. Generally do not need to record the pronunciation information data (word form phonetic symbol) of each word and word combination again in word database, but still can as recording, though generally not (the phonetic symbol pronunciation function is arranged in the TTS chip). The "no change" mentioned in this paragraph means that the TTS method does not need to be changed compared with the live broadcast method. (The description in this paragraph is also applicable to other Indo-European languages)

对于英语,一般情况下可把单词及单词组合作为音码,这样比较直观,如定义一些不直观的字符组合作为音码(比如ALLOW分成两个字符组合A和LLOW),各字符组合都赋予频率编号,然后由这些字符组合组成单词及语句的拼法,也是可以的,但其本质与单词及单词组合作音码一样。另外从原理上说,将元辅音(或元音辅音组合)直接作为音码也是可以的,可由元辅音组成语句的全部发音或非全部发音,只是这样一来由主MCU向TTS芯片传输的数据,就基本上是音标数据(音标由元辅音对应而来)了,而没有单词的拼法数据。For English, under normal circumstances, words and word combinations can be used as phonetic codes, which is more intuitive, such as defining some unintuitive character combinations as phonetic codes (such as ALLOW being divided into two character combinations A and LLOW), each character combination is given frequency Numbering, then the spelling of forming word and sentence by these character combinations is also possible, but its essence is the same with word and word combination phonetic sign indicating number. In addition, in principle, it is also possible to directly use vowels and consonants (or combinations of vowels and consonants) as phonetic codes, and all pronunciations or non-all pronunciations of sentences can be composed of vowels and consonants, but in this way the data transmitted from the main MCU to the TTS chip , is basically phonetic symbol data (phonetic symbols correspond to vowels and consonants), and there is no spelling data of words.

TTS方式的以只适用于英语为例的编码规则仍主要是用单词及单词组合这两种音码来编码,TTS方式的以只适用于英语为例的解码规则仍主要是用单词及单词组合这两种音码来解码。The encoding rule of the TTS mode, which is only applicable to English as an example, is still mainly encoded with these two phonetic codes of words and word combinations, and the decoding rule of the TTS mode, which is only applicable to English as an example, is still mainly used of words and word combinations These two kinds of phonetic sign indicating number come decoding.

对于英语这种有词形变化的(印欧语系其它语种也是这样,是有词形变化的),TTS方式的以只适用于英语为例的编码规则用到的单词数据库的记录项字段内容要修改,即对每个单词或单词组合的词形变化,都要记录其拼法,而其发音信息数据(如词形音标)则可以不记录,记录的拼法可以是适于向TTS芯片传输的ASCII字符格式的;另外对于在与拾音器有关的存储器集里其它各数据库(比如词根数据库)里凡是有表示音标的地方,其格式一般需调整一下(原来直播方式编解码时表示音标是用音标编号),可直接用TTS芯片能识别的音标字符(具体用什么字符,各种TTS芯片的定义可能不同,在使用时需参考芯片资料)。在TTS方式的以只适用于英语为例的编码规则用到的单词数据库的记录项举例:比如此单词数据库里的man词的内容(man词对应的记录项)调整如下:For English, which has inflections (other languages of the Indo-European language family also have inflection changes), the content of the record item field of the word database used in the coding rules that are only applicable to English in the TTS mode should be Amendment, namely to the inflection change of each word or word combination, all will record its spelling, and its pronunciation information data (such as word shape and phonetic symbol) then can not record, and the spelling of record can be suitable for transmitting to TTS chip ASCII character format; In addition, for all other databases (such as root databases) in the memory set related to the pickup, where there are phonetic symbols, the format generally needs to be adjusted (the phonetic symbols were used to represent the phonetic symbols when the original live broadcast method was encoded and decoded) number), you can directly use the phonetic characters that can be recognized by the TTS chip (the specific characters used, the definitions of various TTS chips may be different, and you need to refer to the chip information when using it). In the TTS mode, an example of the record item of the word database used by the encoding rule that is only applicable to English: for example, the content of the man word in this word database (the record item corresponding to the man word) is adjusted as follows:

man:(单词)原型的拼法字段(ASCII字符格式)man: spelling field of (word) prototype (ASCII character format)

011100100:频率编号字段(一般为二进制格式)011100100: frequency number field (usually in binary format)

3:所属词类编号字段(man所属词类编号)3: The part of speech number field to which man belongs (the number of the part of speech to which man belongs)

7,8,9,10:关联词类编号字段(man与7,8,9,10几类词关联)7, 8, 9, 10: associated part of speech number field (man is associated with 7, 8, 9, 10 words)

[man]:词形形态1的发音信息数据字段(这里记录的是单词原型man的词形音标,由音标字符组成,本字段可省略,重读和次重读符号也应是归属于音标字符的,音标字符的格式是要能被所使用的TTS芯片识别的音标字符格式)[man]: the pronunciation information data field of morphological form 1 (recorded here is the phonetic symbol of the word prototype man, which is composed of phonetic characters, this field can be omitted, and the accented and sub-stressed symbols should also belong to the phonetic characters. The format of phonetic characters is the phonetic character format that can be recognized by the TTS chip used)

man:词形形态1的拼法字段(单词原型man的拼法,这里记录的不是词形音标,而是单词词形形态的拼法,ASCII字符格式;本字段的数据内容与前面第1个字段的数据内容相同,在这里再开一个字段重复一次是为了醒目)man: the spelling field of morphological form 1 (the spelling of the word prototype man, what is recorded here is not the phonetic symbol of the word form, but the spelling of the word morphological form, in ASCII character format; the data content of this field is the same as the first one above The data content of the field is the same, and another field is opened here to repeat it for eye-catching)

men:词形形态2的拼法字段(man的复数形式men的拼法,这里记录的不是词形音标,而是单词的词形形态的拼法,ASCII字符格式)men: the spelling field of morphological form 2 (the spelling of men in the plural form of man, what is recorded here is not the phonetic symbol of the word form, but the spelling of the morphological form of the word, in ASCII character format)

总之,单词对应的记录项里的词形形态的拼法字段里数据的格式一般需与TTS芯片要求的单词的拼法格式一致。In short, the format of the data in the spelling field of the morphological form in the record item corresponding to the word generally needs to be consistent with the spelling format of the word required by the TTS chip.

单词组合对应的记录项的数据结构是与单词对应的记录项的数据结构类似的。The data structure of the record item corresponding to the word combination is similar to the data structure of the record item corresponding to the word.

就可把大学四级大纲要求的约4千个单词,加上1,2千个常用单词组合(这些单词和单词组合可与直播方式用的英语的单词数据库里记录的那些单词和单词组合相同),编成TTS方式时编码用的单词数据库,即这约4千个单词和1,2千个单词组合,每个都在单词数据库里开一个且只开一个记录项(单词数据库里除了这些单词和单词组合的记录项外没别的记录项),单词数据库里不同记录项里的原型的拼法字段的数据一定是不同的,一个记录项里记录一个单词及其词形形态(或一个单词组合及其词形形态)的拼法信息,而不用管其有变音或形同音不同的情况,记录项的结构见上几段述,各记录项里的频率编号,所属词类编号,关联词类编号等字段里的数据内容可根据本说明书第19页至第22页所述来得到。You can add about 4,000 words required by the university's fourth-level syllabus, plus 1,2,000 commonly used word combinations (these words and word combinations can be the same as those recorded in the English word database used in the live broadcast mode) ), the word database used for encoding when compiling the TTS mode, namely these about 4,000 words and 1,2 thousand word combinations, each of which is opened one and only one record item in the word database (except these in the word database There is no other record item except the record item of word and word combination), the data of the spelling field of the prototype in different record items in the word database must be different, and a word and its morphological form are recorded in a record item (or a The spelling information of word combination and its morphological form), regardless of whether it has diacritics or different homonyms, the structure of the record item is shown in the previous paragraphs, the frequency number in each record item, the number of the part of speech to which it belongs, The data content in fields such as the associated part-of-speech number can be obtained according to the description on pages 19 to 22 of this manual.

TTS方式的以只适用于英语的编码规则中使用的频率排序,整句差别等词段编码方式仍然可与直播方式的以只适用于英语的编码规则中的相应词段编码方式相同,TTS方式的以只适用于英语的解码规则中使用的频率排序,整句差别等词段解码方式仍然可与直播方式的以只适用于英语的解码规则中的相应词段解码方式相同。时态词形编解码处理也同直播方式那样处理。TTS方式的以只适用于英语的编码规则中对陌生词的编码处理,基本上按直播方式的以只适用于英语的编码规则中对陌生词的编码处理即可,相应的,TTS方式的以只适用于英语的解码规则中对陌生词的解码处理,基本上按直播方式的以只适用于英语的解码规则中对陌生词的解码处理即可;只是要注意,如是英语遇到陌生单词,陌生单词的编码信息的格式可采用元辅音作组合(也即音标作组合)或词根元辅音组合等格式,这样在解码时就是用数个音标字符(音标字符应该是TTS芯片能识别的表示音标的字符)来表示陌生单词的发音信息数据,相关的与解码时陌生单词处理有关的词根数据库里各词根对应的记录项的数据里一般不能缺少词根的音标格式的发音信息数据(可用音标字符组合来表示词根的发音信息数据);如解码时是采用整步TTS发音方式来播放语句发音的,那就可把陌生单词的发音信息数据与语句的其它单词的ASCII格式字符一起组成此语句的预备发音数据(文本流),并可由MCU把语句的全部预备发音数据传给TTS芯片播出语句的全部发音或由MCU把语句的非全部预备发音数据传给TTS芯片播出语句的非全部发音(本句和上句提到的‘预备发音数据’都是相同所指);如解码时是采用分步TTS发音方式来播放语句发音的,那任一个部分预备发音数据,一般可以是由数个单词的ASCII格式字符组成,或是由数个陌生单词的发音信息数据与数个单词的ASCII格式字符一起组成,或是由数个陌生单词的发音信息数据一起组成,可由MCU把组成语句的全部预备发音数据的各部分预备发音数据依次分次传给TTS芯片发音以形成语句的全部发音(或由MCU把组成语句的非全部预备发音数据的各部分预备发音数据依次分次传给TTS芯片发音以形成语句的非全部发音),在解码过程中得到的一个陌生单词的发音信息数据一般应放在一个部分预备发音数据里,一般不会分开放在多个部分预备发音数据里。并且在编解码时,除了应把编解码涉及的单词数据库里没有的单词当成陌生单词来对待外,显然也应把单词数据库里的TTS器件(TTS芯片)不能识别的单词当成陌生单词来对待。(单词的ASCII格式字符是指组成单词拼法的所有ASCII字符)In the TTS method, the frequency sorting used in the encoding rules applicable only to English, the word segment encoding method such as the difference of the whole sentence can still be the same as the corresponding word segment encoding method in the live broadcast method in the encoding rule applicable only to English, the TTS method The frequency sorting used in the decoding rules that are only applicable to English, and the segment decoding methods such as the difference in the whole sentence can still be the same as the corresponding segment decoding methods in the decoding rules that are only applicable to English in the live broadcast mode. The tense morphological encoding and decoding process is also processed in the same way as the live broadcast mode. The encoding processing of unfamiliar words in the encoding rules applicable only to English in the TTS method is basically done according to the encoding processing of unfamiliar words in the encoding rules applicable only to English in the live broadcast method. Correspondingly, the encoding processing of unfamiliar words in the TTS method is The decoding processing of unfamiliar words in the decoding rules applicable only to English is basically done according to the live broadcast method, and the decoding processing of unfamiliar words in the decoding rules applicable only to English is sufficient; just note that if English encounters unfamiliar words, The format of the coding information of unfamiliar word can adopt vowel and consonant to make combination (that is phonetic symbol is made combination) or root vowel and consonant combination etc. format, be exactly to use several phonetic symbols character (phonetic symbol character should be the representation phonetic symbol that TTS chip can recognize like this when decoding character) to represent the pronunciation information data of unfamiliar words, the data of the record items corresponding to each root in the root database relevant to the processing of unfamiliar words during decoding generally cannot lack the pronunciation information data of the phonetic symbol format of the word root (the phonetic symbol character combination can be used to represent the pronunciation information data of the word root); if the whole step TTS pronunciation mode is adopted to play the pronunciation of the sentence during decoding, then the pronunciation information data of the unfamiliar word and the ASCII format characters of other words of the sentence can be used to form the preparation of the sentence Pronunciation data (text flow), and can be sent to the whole pronunciation of TTS chip broadcast sentence by MCU all preparation pronunciation data of sentence or pass the not all pronunciation of TTS chip broadcast sentence ( This sentence and the 'ready pronunciation data' mentioned in the previous sentence all refer to the same); as the step-by-step TTS pronunciation mode is used to play the pronunciation of the sentence when decoding, then any part of the prepared pronunciation data can generally be composed of several The ASCII format characters of a word, or the pronunciation information data of several unfamiliar words and the ASCII format characters of several words, or the pronunciation information data of several unfamiliar words together, can be composed by the MCU. Each part of the prepared pronunciation data prepares the pronunciation data to pass to the TTS chip pronunciation successively in batches to form the whole pronunciation of the sentence (or each part of the non-all prepared pronunciation data forming the sentence is passed to the TTS chip pronunciation in successive batches by the MCU To form the non-full pronunciation of the sentence), the pronunciation information data of an unfamiliar word obtained in the decoding process should generally be placed in a part of the prepared pronunciation data, and generally will not be placed separately in multiple parts of the prepared pronunciation data. And when encoding and decoding, except that the words that do not have in the word database involved in encoding and decoding should be treated as unfamiliar words, obviously also the words that cannot be recognized by the TTS device (TTS chip) in the word database should be treated as unfamiliar words. (The ASCII format characters of a word refer to all the ASCII characters that make up the spelling of the word)

本编解码规则举例中,TTS方式的以只适用于英语为例的解码规则涉及用到的解码数据(包括单词数据库等)与TTS方式的以只适用于英语为例的编码规则涉及用到的编码数据(包括单词数据库等)仍是一样的。In this encoding and decoding rule example, the decoding data (including word database, etc.) used in the decoding rule of the TTS method, which is only applicable to English as an example, and the encoding rules used in the TTS method, which is only applicable to English as an example The encoded data (including the word database, etc.) is still the same.

解码采用整步直接发音方式时,对表示英语语句发音的拼合信息的值的解码的最后几个步骤,一般是将频率编号转为词形音标(即把单词及单词组合的频率编号转成能直接发音的格式)以得到语句的预备发音数据,而整步TTS发音方式有点改变,一般是把频率编号转成拼法数据以得到语句的预备发音数据,即要把单词及单词组合的频率编号转为单词及单词组合的拼法(对英语用ASCII字符表示,比如‘man’),但对于陌生单词仍要用音标。解码采用整步TTS发音方式时,对表示英语语句发音的拼合信息的值的解码的最后几个步骤,得到的语句的预备发音数据的格式一般是ASCII码文本流(由单词及单词组合的拼法组成),文本流里面可以插入音标字符(音标字符一般也是ASCII字符格式)来表示陌生单词(本段说明以WTS701EF为例)。(甚至语句的预备发音数据可全由音标字符组成)When decoding adopts whole-step direct pronunciation mode, the last several steps of the decoding of the value of the pieced information that represents English sentence pronunciation generally is that the frequency numbering is converted into the word-form phonetic symbol (promptly the frequency numbering of word and word combination is converted The format of direct pronunciation) to obtain the preparatory pronunciation data of the sentence, and the TTS pronunciation method of the whole step is slightly changed. Generally, the frequency number is converted into spelling data to obtain the preparatory pronunciation data of the sentence, that is, the frequency number of the word and the word combination Convert to the spelling of words and word combinations (expressed in ASCII characters for English, such as 'man'), but still use phonetic symbols for unfamiliar words. When decoding adopts the full-step TTS pronunciation mode, to the last several steps of the decoding of the value of the spelling information representing English sentence pronunciation, the format of the prepared pronunciation data of the sentence that obtains is generally ASCII code text flow (by the spelling of word and word combination) method), and phonetic characters (phonetic characters are generally in ASCII format) can be inserted into the text stream to represent unfamiliar words (this section uses WTS701EF as an example). (Even the prepared pronunciation data of sentence can be made up of phonetic symbols completely)

对于印欧语系任一语种,采用TTS器件发音时,也可用如上一段所述方式来播放语句发音的方式(即整步TTS发音方式),即MCU在解码时,按拼合信息的值进行解码,解码的最后几步,把解出的各单词及单词组合音码的频率编号转成其拼法数据,并如语句编码值中有涉及陌生单词就把用以组成陌生单词的音码的编号(比如词根,元辅音音码的顺序号)转成其发音信息数据(比如音标信息),并把这些拼法数据和发音信息数据(如有陌生单词就有发音信息数据,否则没有发音信息数据)按这些数据对应的发音(由TTS器件发的音)在语句发音中的顺序排组起来形成语句的拼法格式数据(即语句的预备发音数据);并且语句的拼法格式数据也可全部由单词及单词组合的拼法数据组成,或全部由发音信息数据组成(比如在解码的最后几步,可把解出的所有音码的编号或顺序号包括各单词及单词组合的频率编号都转成音标格式的发音信息数据);可由MCU将语句的全部拼法格式数据传给TTS器件(上段是以TTS芯片为例的)播出语句的全部发音或由MCU将语句的非全部拼法格式数据传给TTS器件播出语句的非全部发音。For any language of the Indo-European language family, when the TTS device is used for pronunciation, the mode of pronunciation of the sentence can also be played as described in the previous paragraph (that is, the full-step TTS pronunciation mode), that is, the MCU decodes according to the value of the combined information when decoding. The last few steps of decoding, the frequency numbering of each word and word combination phonetic sign indicating number that solves is changed into its spelling data, and relate to strange word as the numbering of the phonetic sign indicating number ( Such as root, the sequence number of vowel and consonant sound code) is changed into its pronunciation information data (such as phonetic symbol information), and these spelling data and pronunciation information data (if have unfamiliar words just have pronunciation information data, otherwise do not have pronunciation information data) The pronunciation (pronounced by the TTS device) corresponding to these data is arranged in the order in the pronunciation of the sentence to form the spelling format data (that is, the prepared pronunciation data) of the sentence; and the spelling format data of the sentence can also be all by The spelling data of word and word combination are formed, or all are made up of pronunciation information data (for example, in the last few steps of decoding, the numbering or serial number of all phonetic codes that solve can comprise the frequency number of each word and word combination all turn into phonetic symbol format pronunciation information data); MCU can send all the spelling format data of the sentence to the TTS device (the last paragraph is an example with the TTS chip) to broadcast all the pronunciation of the sentence or the non-full spelling format of the sentence by the MCU The data is sent to the TTS device to play out the non-full pronunciation of the sentence.

对于印欧语系语种的分步TTS发音方式,举一个例子:可以在编解码时不使用时态词形选择段,比如英语,比如把字母(共26个)及字符组合作为音码,使用类似频率排序那样的编码方式来对单词段编码(编码时只涉及用单词来编码,单词组合都分拆成单词来处理),而且编码时不涉及处理陌生单词(因所有单词都可由字母及字符组合组成起来表示),也不用带时态词形选择段,并且假设对单词段编码时不使用前后调用处理;MCU解码时,可每次在语句编码值中的单词段里解出组成一个单词的字母及字符组合(单词都是由字母及字符组合组成的;当然单词段里各单词的数据之间需有间隔标志,这里单词的数据是指组成单词的字母及字符组合数据;按类似于频率排序那样的解码方式,取出单词段里单词的数据的顺序就是这些单词原来在语句中的顺序),组成此单词的字母及字符组合就是单词的拼法数据,也是部分预备发音数据,并由MCU将此单词的拼法数据传给TTS器件(比如TTS芯片)播放出语音来,照此方式逐个解出单词段里的全部单词(每解出一个单词,就由MCU把此单词的拼法数据传给TTS器件比如TTS芯片播放语音),即完成全部解码过程,也完成语句的全部发音播出。(实际上每次解出的部分预备发音数据,可以是一个单词或单词组合的拼法数据,也可以是多个单词或单词组合的拼法数据)For the step-by-step TTS pronunciation method of the Indo-European languages, give an example: the tense word form selection segment can not be used in encoding and decoding, such as English, such as using letters (26 in total) and character combinations as phonetic codes, using similar The encoding method of frequency sorting is used to encode word segments (encoding only involves encoding with words, and word combinations are split into words for processing), and encoding does not involve processing unfamiliar words (because all words can be composed of letters and characters. Composed together to represent), also needn't take tense morphological selection section, and assume that word segment encoding does not use before and after call processing; MCU decoding, can solve the composition of a word in the word segment in the sentence code value every time Combinations of letters and characters (words are composed of letters and characters; of course, there must be interval marks between the data of each word in the word segment, where the data of the word refers to the data of the combination of letters and characters that make up the word; according to similar frequency Sort the decoding method, the order of taking out the data of the words in the word segment is the original order of these words in the sentence), the letters and character combinations that make up the word are the spelling data of the word, and it is also part of the prepared pronunciation data, and is provided by the MCU Pass the spelling data of this word to the TTS device (such as a TTS chip) to play the voice, and solve all the words in the word segment one by one in this way (every time a word is solved, the spelling data of this word is sent by the MCU) Pass it to TTS devices such as TTS chips to play voice), that is, complete the entire decoding process, and also complete the broadcast of all pronunciations of sentences. (In fact, part of the prepared pronunciation data solved each time can be the spelling data of a single word or word combination, or the spelling data of multiple words or word combinations)

再述采用TTS产音方式时非印欧语系语种的编解码处理:Restate the codec processing of non-Indo-European languages when adopting the TTS sound production mode:

显然,非印欧语系任一语种的TTS方式的编解码对音码的定义方式及使用的音码情况是与此语种的直播方式的编解码对音码的定义方式及使用的音码情况一般是相同或兼容的;可被当成此语种的TTS方式编解码时涉及的音码的,可以是元辅音或元音辅音组合,也可以是字,词,词组合或范句,也可以是偏旁部首组合或偏旁部首;可以由这些音码中的一种或多种音码来拼合语句集发音,这基本上与此语种的直播方式的编解码时的情况是一样的。Apparently, the definition of the phonetic codes and the phonetic codes used by the TTS codec of any non-Indo-European language is the same as the definition of the phonetic codes and the phonetic codes used by the codec of the live broadcast method of this language Is the same or compatible; can be regarded as the phonetic code involved in the TTS encoding and decoding of this language, it can be a vowel and consonant or a combination of vowels and consonants, it can also be a word, word, word combination or model sentence, or it can be a radical Combination of radicals or radicals; one or more of these phonetic codes can be used to assemble the pronunciation of the sentence set, which is basically the same as the encoding and decoding of the live broadcast mode of this language.

对于非印欧语系任一语种,语音编码有声读物的整页面上的码字涉及的表示此语种语句发音的拼合信息的值可以是按照某种编码规则中有关此语种的TTS方式的编码细则来编码生成的(此编码规则可以是只适用于此语种或适用于包括此语种在内的多个语种的)。For any language of the non-Indo-European language family, the value of the combined information representing the pronunciation of sentences in this language involved in the codewords on the entire page of the phonetic coded audiobook can be determined according to the coding rules of the TTS method for this language in a certain coding rule. Generated by encoding (this encoding rule can be only applicable to this language or applicable to multiple languages including this language).

对于非印欧语系任一语种,语音编码有声读物的整页面上的码字涉及的表示此语种语句发音的拼合信息的值是可被某种解码规则中有关此语种的TTS方式的解码细则来解码的(此解码规则可以是只适用于此语种或适用于包括此语种在内的多个语种的,此表示此语种语句发音的拼合信息的值一般是按照某种编码规则中有关此语种的TTS方式的编码细则来编码生成的且此解码规则是与此编码规则有关此语种对应的解码规则)。For any language of the non-Indo-European language family, the value of the combined information representing the pronunciation of sentences in this language involved in the codewords on the entire page of the phonetic coded audiobook can be determined by the decoding rules related to the TTS method of this language in a certain decoding rule Decoded (this decoding rule can be applicable only to this language or to multiple languages including this language, which means that the value of the combined information of the sentence pronunciation in this language is generally related to this language in a certain encoding rule The encoding rules of the TTS method are encoded and generated, and this decoding rule is the decoding rule corresponding to this language related to this encoding rule).

以下是以汉语作为非印欧语系语种的例子来阐述非印欧语系语种的TTS方式的编解码技术的。以下编码规则举例是以只适用于汉语为例的,解码规则举例是以只适用于汉语为例的The following uses Chinese as an example of a non-Indo-European language to illustrate the TTS encoding and decoding technology of a non-Indo-European language. The following examples of encoding rules are only applicable to Chinese as an example, and examples of decoding rules are only applicable to Chinese as an example

TTS方式的以只适用于汉语为例的编码规则是沿承前面有述的直播方式的以只适用于汉语为例的编码规则,但需作些改动(主要改动的是编码数据),TTS方式的以只适用于汉语为例的解码规则也是沿承前面有述的直播方式的以只适用于汉语为例的解码规则,但需作些改动。The encoding rules of the TTS method, which are only applicable to Chinese as an example, follow the encoding rules of the live broadcast method described above, which are only applicable to Chinese as an example, but some changes are required (mainly the encoding data), the TTS method The decoding rule that is only applicable to Chinese as an example is also the decoding rule that only applies to Chinese as an example following the live broadcast method described above, but some changes are required.

一种TTS方式的以只适用于汉语为例的编码规则,包括以下一些编码细则:编码语句集范围完整定义,用于编码的音码的种类选择,对音码赋编号,词数据库建立,词类的建立,各种词段编码方式,陌生词编码处理及附加头段信息等,同直播方式的以只适用于汉语为例的编码规则一样,此TTS方式的以只适用于汉语为例的编码规则是含有压缩编码处理的(因为此编码规则包括的那些编码细则中的对音码赋编号一般是有压缩处理的,几种词段编码方式也是有压缩编码处理的)。权利要求3,12,20或23中提及的编码规则,可以是本说明书提及或没提及的适用于非印欧语系任一语种(汉语或韩语或日语等)的其有关此语种的编码细则含有压缩编码处理且是TTS方式的的各种编码规则(本句说的‘各种编码规则’中任一编码规则可能是只适用于此语种或适用于包括此语种在内的多个语种)。A TTS encoding rule that is only applicable to Chinese as an example, including the following encoding rules: complete definition of the range of encoding sentence sets, selection of phonetic code types for encoding, numbering of phonetic codes, establishment of word database, and part of speech The establishment of various word segment encoding methods, unfamiliar word encoding processing and additional header information, etc., are the same as the encoding rules that are only applicable to Chinese in the live broadcast method, and the encoding rules that are only applicable to Chinese in this TTS method The rule contains the compression coding processing (because the numbering of the phonetic code in those coding rules that this coding rule includes generally has the compression processing, and several word segment coding modes also have the compression coding processing). The encoding rule mentioned inclaim 3, 12, 20 or 23 may be any language (Chinese or Korean or Japanese, etc.) that is not mentioned in this specification and is applicable to this language. The encoding rules contain compression encoding processing and various encoding rules in TTS mode (any encoding rule in the "various encoding rules" mentioned in this sentence may be only applicable to this language or applicable to multiple languages including this language. language).

与上段述的TTS方式的以只适用于汉语为例的编码规则有关汉语对应的一种TTS方式的以只适用于汉语为例的解码规则,包括如下一些解码细则:解码语句集范围完整定义,用于解码的音码的种类选择,对音码赋编号,词数据库建立,词类的建立,各种词段解码方式,陌生词解码处理,头段信息读取及用发音形式播放语句发音等。同直播方式的以只适用于汉语为例的解码规则一样,此TTS方式的以只适用于汉语为例的解码规则是含有压缩解码处理的(因为此解码规则包括的那些解码细则中的对音码赋编号一般是有压缩处理的,几种词段解码方式也是有压缩解码处理的),这些压缩解码处理是与上段述的TTS方式的以只适用于汉语为例的编码规则里的压缩编码处理对应的。Corresponding to the TTS coding rules that are only applicable to Chinese as an example in the above paragraph, a TTS decoding rule that is only applicable to Chinese as an example includes the following decoding rules: a complete definition of the scope of the decoding sentence set, Selection of phonetic codes for decoding, numbering of phonetic codes, establishment of word database, establishment of part of speech, various decoding methods of word segments, decoding of unfamiliar words, reading of header information and playback of sentence pronunciation in pronunciation form, etc. The same as the decoding rules applicable only to Chinese in the live broadcast method, the decoding rules applicable to Chinese in the TTS method include compression decoding processing (because the decoding rules include the audio pairings in the decoding rules. Code assignment numbers generally have compression processing, and several word segment decoding methods also have compression decoding processing), these compression decoding processing is the same as the compression coding in the coding rules of the TTS method mentioned in the previous paragraph, which is only applicable to Chinese as an example deal with the corresponding.

权利要求3,12,20或23中提及的解码规则,可以是与本说明书提及或没提及的适用于非印欧语系任一语种(汉语或韩语或日语等)的其有关此语种的编码细则含有压缩编码处理的任一编码规则(此编码规则可能是只适用于此语种或适用于包括此语种在内的多个语种的)有关此语种对应的本说明书提及或没提及的适用于此语种的其有关此语种的解码细则含有压缩解码处理且是TTS方式的的各种解码规则(这各种解码规则中任一解码规则可能是只适用于此语种或适用于包括此语种在内的多个语种的)。(本段提及的两个‘多个’不一定是相同数目;本段提及的‘对应’是指一般对应或专门对应)The decoding rules mentioned inclaim 3, 12, 20 or 23 may be applicable to any language (Chinese or Korean or Japanese, etc.) of the non-Indo-European language family mentioned or not mentioned in this specification. The coding rules contain any coding rules for compression coding processing (this coding rule may be applicable only to this language or to multiple languages including this language). The instructions corresponding to this language are mentioned or not mentioned The decoding rules for this language applicable to this language include various decoding rules in the form of compressed decoding and TTS (any decoding rule in these various decoding rules may be only applicable to this language or applicable to the multiple languages including one language). (The two "multiple" mentioned in this paragraph are not necessarily the same number; the "correspondence" mentioned in this paragraph refers to general correspondence or special correspondence)

TTS方式的以只适用于汉语为例的解码规则涉及的发音形式是:解码表示此解码规则定义的解码语句集里任一语句的发音的拼合信息的值时播放语句发音用的发音方式都是用整步TTS发音方式。The pronunciation forms involved in the decoding rule of the TTS mode, which is only applicable to Chinese as an example, are: when decoding the value of the combination information representing the pronunciation of any sentence in the decoding sentence set defined by this decoding rule, the pronunciation modes used for playing the pronunciation of the sentence are Use full-step TTS pronunciation.

因TTS器件(TTS芯片)一般自己带有处理变音(比如TTS器件自己按语句的拼法格式数据连读)的能力,所以非印欧语系语种的TTS方式的编解码规则中可不用考虑处理变音。对于涉及形同音不同的词(或词组合),一般也是TTS器件(TTS芯片)自己处理发音(比如TTS器件自己按语句的拼法格式数据里各词的排列情况来将词发出不同的音),所以非印欧语系语种的TTS方式的编解码规则中可不用考虑处理形同音不同的情况。并且如发音涉及到变音和形同音不同的情况时,TTS器件一般要考察语句的拼法格式数据中各词的排列情况,所以解码这边一般要用整步TTS发音方式一次把语句的全部(或非全部)拼法格式数据传给TTS器件,而分步TTS发音方式一般可用于不涉及变音和形同音不同的情况。Because TTS devices (TTS chips) generally have the ability to process diacritics (for example, the TTS device itself reads data according to the spelling format of the sentence), so the TTS encoding and decoding rules for non-Indo-European languages do not need to consider processing umlaut. For different words (or word combinations) that involve homonyms, generally the TTS device (TTS chip) processes the pronunciation by itself (for example, the TTS device itself emits different sounds according to the arrangement of the words in the spelling format data of the sentence) ), so the encoding and decoding rules of the TTS mode of non-Indo-European languages do not need to consider the situation of dealing with homonyms and different sounds. And if the pronunciation involves different situations of inflection and homophony, the TTS device generally needs to examine the arrangement of each word in the spelling format data of the sentence, so the decoding side generally needs to use the whole-step TTS pronunciation method to convert the words of the sentence once. All (or not all) of the spelling format data is transmitted to the TTS device, and the step-by-step TTS pronunciation method can generally be used for situations that do not involve inflections and homonyms.

TTS方式的以只适用于汉语为例的编码规则不考虑处理变音和形同音不同的情况(即对读物涉及的汉语的任一个词或任一词组合,解码时都认为其只有一个发音),TTS方式的以只适用于汉语为例的解码规则不考虑处理变音和形同音不同的情况(即对读物涉及的汉语的任一个词或任一词组合,解码时都认为其只有一个发音)。对于词数据表里‘为’这类情况,因TTS方式用的词数据表是遵循原来直播方式用的词数据表所以仍保留了两个记录项来分别记录‘为’的两个不同发音的情况,那TTS方式编解码时只需使用其中一个记录项对应的(词的)频率编号,对各语句涉及的不同发音的词‘为’都用此频率编号来表示,解码时TTS器件可自己按得到的语句的拼法格式数据里的‘为’(大五码数据)与其旁边的词(大五码数据)的搭配关系来确定播它的哪个音,对词数据表里其它原来形同音不同的词的情况也按‘为’这样类似处理。The coding rules of the TTS method, which are only applicable to Chinese as an example, do not consider the situation of dealing with different phonetic changes and homonyms (that is, for any word or any combination of words in Chinese involved in the reading, it is considered that it has only one pronunciation when decoding. ), the decoding rules of the TTS method, which are only applicable to Chinese as an example, do not consider the situation of dealing with different inflections and homophones (that is, for any word or any combination of words in Chinese involved in the reading, it is considered that only one pronunciation). For the case of "is" in the word data table, because the word data table used in the TTS method follows the word data table used in the original live broadcast method, two records are still reserved to record the two different pronunciations of "be". In this case, when encoding and decoding in TTS mode, you only need to use the (word) frequency number corresponding to one of the record items, and use this frequency number to represent the words 'is' with different pronunciations involved in each sentence. When decoding, the TTS device can automatically Determine which sound it plays by the collocation relationship between 'for' (big five code data) and the word (big five code data) next to it in the spelling format data of the sentence that is obtained, and it is the same to other originals in the word data table The situation of words with different sounds is also handled similarly by 'for'.

与直播方式比较,TTS方式时编解码涉及的语句定义仍然可以是一个句子,一个句子的部分或多个句子,涉及的语种仍然可以是非印欧语系里的各语种。Compared with the live broadcast method, the sentence definition involved in the encoding and decoding in the TTS method can still be a sentence, a part of a sentence or multiple sentences, and the languages involved can still be various languages in the non-Indo-European language family.

TTS方式的以只适用于汉语为例的编码规则完整定义的编码语句集范围是任意汉语语句,也即编码语句集包括汉语的任意语句;TTS方式的以只适用于汉语为例的解码规则完整定义的解码语句集范围是任意汉语语句,也即解码语句集包括汉语的任意语句。In the TTS method, the encoding rules that are only applicable to Chinese as an example completely define the scope of the encoding sentence set is any Chinese sentence, that is, the encoding sentence set includes any sentence in Chinese; the decoding rules in the TTS method are only applicable to Chinese as an example. The scope of the defined decoding sentence set is any Chinese sentence, that is, the decoding sentence set includes any Chinese sentence.

TTS方式的以只适用于汉语为例的编解码规则与直播方式的以只适用于汉语为例的编解码规则比较,编解码时,对音码的划分定义仍然可不用变(TTS方式时,虽然用来发音的主要是由音码转成的词的拼法,次要是拼音,但仍是用音码来作拼合信息的值的编解码),用于编解码的主要音码的选择方式不用变,词的关联性定义可不用变,对音码的编号方式不用变,编码规则中的几种词段编码方式等编码细则还是不用变,解码规则中的几种词段解码方式等解码细则还是不用变,码字中一个处理单位里各段(头段,词段,陌生词段等)的定义不用变,范句库格式不用变,在与拾音器有关的存储器集里不用再存诸音码的发音数据。虽然在词数据库里一般不需再记录各词及词组合的发音信息数据(词形拼音),但如记录还是可以的,虽然一般不用(在TTS芯片里有拼音发音功能)。本段说的‘不用变’是指TTS方式与直播方式比较的不用变。(本段所述对非印欧语系其它语种也是适合的)Compared with the coding and decoding rules of the TTS mode, which are only applicable to Chinese as an example, and the coding and decoding rules of the live broadcast mode, which are only applicable to Chinese as an example, during encoding and decoding, the definition of the division of the sound codes still does not need to be changed (in the case of the TTS mode, Although what is used for pronunciation is mainly the spelling of the word that is converted into by the phonetic sign indicating number, secondary is pinyin, but still uses the phonetic sign indicating number to do the encoding and decoding of the value of assembling information), is used for the selection mode of the main phonetic sign indicating number of encoding and decoding No need to change, the definition of relevance of words needn’t be changed, the way of numbering phonetic codes needn’t be changed, the coding details such as several word segment encoding methods in the coding rules still needn’t be changed, and the decoding of several word segment decoding methods in the decoding rules, etc. Detailed rules still needn't be changed, and the definition of each section (head section, speech section, strange phrase section etc.) in a processing unit in the code word does not need to change, and the model sentence storehouse format does not need to change, and needn't deposit all again in the memory collection relevant with pickup The pronunciation data of the phonetic code. Though generally do not need to record the pronunciation information data (word form pinyin) of each word and word combination again in word database, still can as record, although generally do not need (in TTS chip, pinyin pronunciation function is arranged). The "no change" mentioned in this paragraph means that the TTS method does not need to be changed compared with the live broadcast method. (This paragraph is also suitable for other languages of the non-Indo-European language family)

从原理上说对于汉语等非印欧语系的语种,将元辅音或元音辅音组合(拼音)直接作为音码也是可以的,由元辅音或元音辅音组合(拼音)组成语句发音,这样一来主MCU向TTS芯片传输的数据,就基本上是拼音数据(汉语的拼音数据由元音,辅音和声调的字符组成),而没有字词的大五码数据。In principle, for non-Indo-European languages such as Chinese, it is also possible to directly use vowel-consonant or vowel-consonant combination (pinyin) as the phonetic code, and to form a sentence pronunciation by vowel-consonant or vowel-consonant combination (pinyin), such The data transmitted from the main MCU to the TTS chip is basically pinyin data (Chinese pinyin data is composed of vowels, consonants and tonal characters), and there is no big five code data of words.

TTS方式的以只适用于汉语为例的编码规则仍主要是用词及词组合这两种音码来编码,TTS方式的以只适用于汉语为例的解码规则仍主要是用词及词组合这两种音码来解码。The encoding rule of the TTS mode, which is only applicable to Chinese as an example, is still mainly coded with the two phonetic codes of words and word combinations, and the decoding rule of the TTS mode, which is only applicable to Chinese as an example, is still mainly to use words and word combinations These two kinds of phonetic sign indicating number come decoding.

对于汉语这种不需词形变化的(非印欧语系其它语种也是这样或可看成是这样),TTS方式的以只适用于汉语为例的编码规则用到的词数据库仍然可使用前面定义的词数据库的数据格式。其记录的词的拼法,应该是适于向TTS芯片输出的格式,一般可以是大五码这类表示汉字的格式。比如在TTS方式的以只适用于汉语为例的编码规则用到的词数据库里的一个记录项(对应一个词)的内容为:For Chinese that does not need inflections (other languages of the non-Indo-European language family are also like this or can be regarded as like this), the word database used by the coding rules that are only applicable to Chinese in the TTS mode can still use the previous definition The data format of the word database. The spelling of the words recorded by it should be suitable for outputting to the TTS chip, generally it can be a format representing Chinese characters such as big five codes. For example, the content of a record item (corresponding to a word) in the word database used by the coding rules that are only applicable to Chinese in the TTS mode is:

‘男人’:(词的)拼法字段(这里应该用大五码格式记录各汉字)'Man': (word) spelling field (each Chinese character should be recorded in the big five format here)

011001100:频率编号字段(一般为二进制格式)011001100: frequency number field (usually in binary format)

3:所属词类编号字段(‘男人’词所属词类编号)3: The part of speech number field (the part of speech number of the word 'man')

7,8,9,10:关联词类编号字段(‘男人’词与7,8,9,10几类词关联)7, 8, 9, 10: associated part of speech number field (the word 'man' is associated with 7, 8, 9, 10 words)

nan2ren2:发音信息数据字段(词形拼音,一般是ASCII字符格式,其中的数字是声调,本字段可以省略)nan2ren2: Pronunciation information data field (word form pinyin, generally in ASCII character format, the numbers in it are tones, this field can be omitted)

总之,词对应的记录项里的拼法字段里数据的格式一般需与TTS芯片要求的词的拼法格式一致。In short, the format of the data in the spelling field in the record item corresponding to the word generally needs to be consistent with the spelling format of the word required by the TTS chip.

词组合对应的记录项的数据结构是与词对应的记录项的数据结构类似的。The data structure of the record item corresponding to the word combination is similar to the data structure of the record item corresponding to the word.

就可把经常使用的约几千个词和词组合(这些词和词组合可与直播方式用的汉语的词数据库里记录的那些词和词组合相同),编成TTS方式时编码用的词数据库,即这约几千个词和词组合,每个都在词数据库里开一个记录项,记录项的结构见上几段述,各记录项里的频率编号,所属词类编号,关联词类编号等字段里的数据内容可根据本说明书第50页至第53页所述来得到。Just can be about several thousand words and word combinations frequently used (these words and word combinations can be identical with those words and word combinations recorded in the Chinese word database that the live broadcast mode is used), the word that encoding is used when being compiled into TTS mode Database, that is, about thousands of words and word combinations, each of which opens a record item in the word database. The structure of the record item is described in the previous paragraphs, the frequency number in each record item, the number of the part of speech it belongs to, and the number of the associated part of speech The data content in the etc. fields can be obtained according to the description on pages 50 to 53 of this manual.

对于原来直播方式编解码时定义的词数据表里的在多个记录项里分别记录的本来同一个词(或词组合)的不同发音情况信息(形同音不同,这多个记录项中每个记录项对应是一个词或词组合),TTS方式编解码时定义的词数据表里仍可保留在直播方式编解码时定义的词数据表里里这些分别记录原俩同一个词(或词组合)的不同发音情况信息的多个记录项的形式,且每个记录项的数据结构仍可如以上几段所述(这多个记录项中各记录项里拼法字段的大五码数据都相同),只是TTS方式使用时只使用这多个记录项中其中一个记录项的信息。For the original different pronunciation situation information of the same word (or word combination) recorded separately in the word data table defined in the original live mode encoding and decoding in multiple record items (the same sound is different, each in these multiple record items Each record item corresponds to a word or a combination of words), and the word data table defined in the TTS mode encoding and decoding can still be retained in the word data table defined in the live broadcast mode encoding and decoding. Combination) in the form of a plurality of record items of different pronunciation situation information, and the data structure of each record item can still be as described in the above paragraphs (the big five code data of the spelling field in each record item in these multiple record items are the same), except that when using the TTS method, only the information of one of the multiple record items is used.

TTS方式的以只适用于汉语的编码规则中使用的频率排序,整句差别等词段编码方式仍然可与直播方式的以只适用于汉语的编码规则中的相应词段编码方式相同,TTS方式的以只适用于汉语的解码规则中使用的频率排序,整句差别等词段解码方式仍然可与直播方式的以只适用于汉语的解码规则中的相应词段解码方式相同。TTS方式的以只适用于汉语的编码规则中对陌生词的编码处理,基本上按直播方式的以只适用于汉语的编码规则中对陌生词的编码处理即可,相应的,TTS方式的以只适用于汉语的解码规则中对陌生词的解码处理,基本上按直播方式的以只适用于汉语的解码规则中对陌生词的解码处理即可;只是要注意,如是汉语遇到陌生词,陌生词的编码信息的格式可采用单字拼音作组合(也即拼音作组合),这样在解码时就是用拼音字符(此拼音字符应该是TTS芯片能识别的表示拼音的字符,一般是ASCII字符格式)来表示陌生词的发音信息数据,相关的与陌生词处理有关的单字拼音表里各单字拼音对应的记录项的数据里一般不能缺少单字拼音的拼法信息(拼音字符拼法,这里的拼音字符应是TTS芯片能识别的表示拼音的字符);如解码时是采用整步TTS发音方式来播放语句发音的,那就可把陌生词的发音信息数据与语句的其它词的大五码一起组成此语句的预备发音数据(文本流),并可由MCU把语句的全部预备发音数据传给TTS芯片播出语句的全部发音或由MCU把语句的非全部预备发音数据传给TTS芯片播出语句的非全部发音(本句和上句提到的‘预备发音数据’都是相同所指);如解码时是采用分步TTS发音方式来播放语句发音的,那任一个部分预备发音数据,一般可以是由数个词的大五码组成,或是由数个陌生词的发音信息数据与数个词的大五码一起组成,或是由数个陌生词的发音信息数据一起组成,可由MCU把组成语句的全部预备发音数据的各部分预备发音数据依次分次传给TTS芯片发音以形成语句的全部发音或由MCU把组成语句的非全部预备发音数据的各部分预备发音数据依次分次传给TTS芯片发音以形成语句的非全部发音,在解码过程中得到的一个陌生词的发音信息数据一般应放在一个部分预备发音数据里,一般不会分开放在多个部分预备发音数据里。并且在编解码时,除了应把编解码涉及的词数据库里没有的词当成陌生词来对待外,显然也应把词数据库里的TTS器件(TTS芯片)不能识别的词当成陌生词来对待。(词的大五码是指组成词拼法的所有大五码字符)The TTS method is based on the frequency sorting used in the coding rules that are only applicable to Chinese, and the word segment coding methods such as differences in the whole sentence can still be the same as the corresponding word segment coding methods in the live broadcast method that are only applicable to Chinese coding rules. The TTS method The frequency sorting used in the decoding rules that are only applicable to Chinese, and the segment decoding methods such as the difference in the whole sentence can still be the same as the corresponding segment decoding methods in the decoding rules that are only applicable to Chinese in the live broadcast mode. In the TTS mode, the encoding processing of unfamiliar words in the encoding rules applicable only to Chinese is basically done according to the encoding processing of unfamiliar words in the encoding rules applicable only to Chinese in the live broadcast mode. Correspondingly, the TTS method uses The decoding processing of unfamiliar words in the decoding rules applicable only to Chinese is basically the same as the decoding processing of unfamiliar words in the decoding rules applicable only to Chinese according to the live broadcast method; just note that if Chinese encounters unfamiliar words, The format of the encoding information of unfamiliar words can be combined with single-character pinyin (that is, pinyin is combined), so that when decoding, pinyin characters are used (this pinyin character should be a character that can be recognized by the TTS chip to represent pinyin, generally ASCII character format ) to represent the pronunciation information data of unfamiliar words, and the data of the record items corresponding to each word in the phonetic alphabet table related to the processing of unfamiliar words generally cannot lack the phonetic information of the phonetic alphabet (pinyin character spelling, the phonetic alphabet here Character should be the character that the TTS chip can recognize to represent pinyin); As adopting the full-step TTS pronunciation mode to play the pronunciation of the sentence when decoding, then the pronunciation information data of the unfamiliar word can be combined with the big five yards of other words of the sentence Make up the prepared pronunciation data (text flow) of this sentence, and can pass all the prepared pronunciation data of the sentence to the TTS chip to broadcast the whole pronunciation of the sentence by the MCU or pass the non-all prepared pronunciation data of the sentence to the TTS chip to broadcast the sentence by the MCU Not all pronunciations (this sentence and the "prepared pronunciation data" mentioned in the previous sentence all refer to the same); if the step-by-step TTS pronunciation method is used to play the pronunciation of the sentence during decoding, then any part of the prepared pronunciation data, generally It can be composed of the big five codes of several words, or it can be composed of the pronunciation information data of several unfamiliar words and the big five codes of several words, or it can be composed of the pronunciation information data of several unfamiliar words together, and can be controlled by the MCU Send each part of the prepared pronunciation data of the whole prepared pronunciation data of the sentence to the TTS chip pronunciation in order to form the whole pronunciation of the sentence; Pronunciation to the TTS chip to form part of the pronunciation of the sentence, the pronunciation information data of an unfamiliar word obtained in the decoding process should generally be placed in a part of the prepared pronunciation data, and generally will not be placed separately in multiple parts of the prepared pronunciation data. And when encoding and decoding, except that the words that should not be treated as unfamiliar words in the word database involved in encoding and decoding, obviously also should treat the words that the TTS device (TTS chip) in the word database cannot recognize as unfamiliar words. (The big five codes of the word refer to all the big five code characters that make up the spelling of the word)

本编解码规则举例中,TTS方式的以只适用于汉语为例的解码规则涉及用到的解码数据(包括词数据库等)与TTS方式的以只适用于汉语为例的编码规则涉及用到的编码数据(包括词数据库等)仍是一样的。In this example of encoding and decoding rules, the decoding data (including word database, etc.) used in the decoding rules of the TTS method, which is only applicable to Chinese as an example, and the encoding rules of the TTS method, which are only applicable to Chinese as an example, involve the used The coded data (including word database, etc.) is still the same.

解码采用整步直接发音方式时,对表示汉语语句发音的拼合信息的值的解码的最后几步,一般是将频率编号转为词形拼音(即把词及词组合的频率编号转成能直接发音的格式)以得到语句的预备发音数据,而整步TTS发音方式有点改变,一般是把频率编号转成拼法数据以得到语句的预备发音数据,即是把词及词组合的频率编号转为词及词组合的拼法(对汉语用大五码表示,比如‘男人’),但对于陌生词仍要用拼音。解码采用整步TTS发音方式时,对表示汉语语句发音的拼合信息的值的解码的最后几步,主MCU输出给TTS芯片的语句的全部或非全部预备发音数据的主要格式是大五码文本流,大五码文本流里面可以插入拼音字符(拼音字符一般是ASCII字符格式)表示陌生词(本段以WTS701MF为例)。(甚至语句的预备发音数据可全由拼音字符组成)When decoding adopts whole-step direct pronunciation mode, the last few steps of the decoding of the value of the pieced information to represent Chinese sentence pronunciation, generally be that the frequency numbering is converted into word form pinyin (promptly the frequency numbering of word and word combination is converted into can directly Pronunciation format) to obtain the preparatory pronunciation data of the sentence, and the whole step TTS pronunciation method is slightly changed, generally the frequency number is converted into spelling data to obtain the preparatory pronunciation data of the sentence, that is, the frequency number of the word and word combination is transferred It is the spelling method of words and word combinations (indicated by big five yards for Chinese, such as 'man'), but still use pinyin for unfamiliar words. When the decoding adopts the full-step TTS pronunciation method, in the last few steps of decoding the value of the combined information representing the pronunciation of the Chinese sentence, the main format of all or part of the prepared pronunciation data of the sentence output by the main MCU to the TTS chip is a big five code text Stream, the Big Five text stream can insert pinyin characters (pinyin characters are generally in ASCII character format) to represent unfamiliar words (this paragraph takes WTS701MF as an example). (Even the prepared pronunciation data of sentence can be made up of pinyin character completely)

对于非印欧语系任一语种,也可用如上一段所述播放语句发音的方式(即整步TTS发音方式),即MCU在解码时,按拼合信息的值进行解码,解码的最后几步,把解出的各代表词及词组合音码的频率编号转成其拼法数据,并如语句编码值中有涉及陌生词就把组成陌生词的音码的编号(比如单字拼音的顺序号,单字拼音也是元音辅音组合音码)转成其发音信息数据(比如拼音),并把这些拼法数据和发音信息数据(如有陌生词就有发音信息数据,如没有陌生词就没有发音信息数据)按这些数据对应的发音(由TTS器件发的音)在语句发音中的顺序排组起来形成语句的拼法格式数据(即为语句的预备发音数据);并且语句的拼法格式数据也可全部由词及词组合的拼法数据组成,或全部由发音信息数据组成(比如在解码的最后几步,可把解出的所有音码的编号或顺序号包括各词及词组合的频率编号都转成拼音格式的发音信息数据);可由MCU将语句的全部拼法格式数据传给TTS器件(上段是以TTS芯片为例的)播出语句的全部发音或由MCU将语句的非全部拼法格式数据传给TTS器件播出语句的非全部发音。For any language of the non-Indo-European language family, the method of playing sentence pronunciation as described in the previous paragraph (that is, the full-step TTS pronunciation method) can also be used, that is, when the MCU decodes, it decodes according to the value of the combined information. In the last few steps of decoding, the The frequency numbering of each representative word and word combination phonetic sign indicating number that solves changes into its spelling data, and relates to the numbering of the phonetic sign indicating number (such as the sequence number of individual character phonetic alphabet, individual character) of forming unfamiliar word as having to relate to unfamiliar word in the sentence encoding value Pinyin is also a combination of vowel and consonant sound code) into its pronunciation information data (such as pinyin), and these spelling data and pronunciation information data (if there are unfamiliar words, there is pronunciation information data, as there is no pronunciation information data without unfamiliar words ) according to the pronunciation (pronounced by the TTS device) corresponding to these data is arranged in the order in the pronunciation of the sentence to form the spelling format data of the sentence (that is, the prepared pronunciation data of the sentence); and the spelling format data of the sentence can also be All are made up of the spelling data of word and word combination, or all are made up of pronunciation information data (for example, in the last few steps of decoding, the numbering or sequence numbers of all phonetic codes that can be solved include the frequency numbering of each word and word combination are all converted into pronunciation information data in pinyin format); MCU can transmit all the spelling format data of the sentence to the TTS device (the above paragraph is an example of the TTS chip) to broadcast all the pronunciation of the sentence or the non-full spelling of the sentence by the MCU The French format data is sent to the TTS device to broadcast the non-full pronunciation of the sentence.

对于非印欧语系语种的分步TTS发音方式,举一个例子:比如汉语,如对某语句的编码值(一个处理单位的值)进行解码,此语句编码值是主要以词及词组合为音码来编码的,其词段编码采用频率排序编码方式,如编码语句中无陌生词,并且假设对词段编码时不使用前后调用处理;解码时,MCU先查语句编码值知道无陌生词,并且汉语本来无时态词形选择段,接着看词段,就可以在词段里每取一个词或词组合的频率编号(按频率排序解码方式,取出词段里词及词组合的频率编号的顺序就是这些词及词组合原来在语句中的顺序),就在词数据库里去查此词或词组合对应的记录项里的拼法数据(比如大五码),此拼法数据就是部分预备发音数据,并由MCU将此拼法数据传给TTS器件(比如TTS芯片)播放出音来,接着MCU在词段里再取下个词或词组合的频率编号,并依此办法播放下个词或词组合的发音(即每解出一个词或词组合,就播这个词或词组合的音),直到把词段里的所有词及词组合都取完播放完,解码过程完成,语句的全部发音播出也完成了。(实际上每次解出的部分预备发音数据,可以是一个词或词组合的拼法数据,也可以是多个词或词组合的拼法数据)For the step-by-step TTS pronunciation method of non-Indo-European languages, give an example: such as Chinese, such as decoding the coded value (value of a processing unit) of a sentence, the coded value of this sentence is mainly based on words and word combinations. Code to encode, the word segment encoding adopts the frequency sorting encoding method, if there are no unfamiliar words in the encoding sentence, and it is assumed that the front and back call processing is not used when encoding the word segment; when decoding, the MCU first checks the sentence encoding value to know that there are no unfamiliar words, And Chinese originally has no tense word form selection segment, and then look at the word segment, you can get the frequency number of each word or word combination in the word segment (according to the frequency sorting decoding method, take out the frequency number of the word and word combination in the word segment The order of these words and word combinations is the original order in the sentence), just check the spelling data (such as big five codes) in the record item corresponding to this word or word combination in the word database, this spelling data is part Prepare the pronunciation data, and send the spelling data to the TTS device (such as a TTS chip) by the MCU to play the sound, and then the MCU takes the frequency number of the next word or word combination in the word segment, and plays it in this way The pronunciation of each word or word combination (that is, every time a word or word combination is solved, the sound of this word or word combination is played), until all the words and word combinations in the word segment are taken and played, the decoding process is completed, The entire pronunciation broadcast of the sentence has also been completed. (In fact, part of the prepared pronunciation data solved each time can be the spelling data of a word or word combination, or the spelling data of multiple words or word combinations)

对非印欧语系各语种和印欧语系各语种,解码采用分步TTS发音方式时,在每个部分预备发音数据的发音之后,需有点延时,MCU在这个延时期间要解码取下一个发音接续的部分预备发音数据,然后再播这下一个部分预备发音数据的音(将此部分预备发音数据传给TTS芯片发音),不过这个延时很短,对形成语句的全部发音和非全部发音没什么影响,而且可在划分各部分预备发音数据时,使每个部分预备发音数据的发音之后本来就需有一小段静音延时(比如每个部分预备发音数据的发音是一个词的发音,每个词的发音之后本来就需有一小段静音延时)。For non-Indo-European languages and Indo-European languages, when the decoding adopts the step-by-step TTS pronunciation method, after the pronunciation of each part of the prepared pronunciation data, there is a delay, and the MCU needs to decode and take the next one during this delay period. Pronunciation continuation part preparation pronunciation data, then broadcast the sound of this next part preparation pronunciation data (pass this part preparation pronunciation data to TTS chip pronunciation), but this time delay is very short, to form the whole pronunciation of sentence and not all Pronunciation has no influence, and can when dividing each part preparation pronunciation data, make the pronunciation of each part preparation pronunciation data originally need to have a short silence delay (such as the pronunciation of each part preparation pronunciation data is the pronunciation of a word, every A short silence delay would have been required after the pronunciation of the word).

以下涉及印欧语系语种和非印欧语系语种的TTS方式时各解码举例是以整步TTS发音方式为例的。The following decoding examples for the TTS modes of Indo-European languages and non-Indo-European languages are based on the full-step TTS pronunciation mode as an example.

印欧语系语种或非印欧语系语种的TTS方式涉及的实施方案中,有关语音编码有声读物及拾音器,以及编码规则及解码规则,凡是涉及与直播方式相同内容的技术,其技术特征请参阅前面对直播方式的技术的详细描述。In the implementation schemes involved in the TTS method of Indo-European languages or non-Indo-European languages, regarding speech coding audiobooks and pickups, as well as encoding rules and decoding rules, any technology involving the same content as the live broadcast method, for its technical characteristics, please refer to the preceding A detailed description of the technology facing the live broadcast method.

虽然TTS方式时编码数据(解码数据)的内容有修改,但修改的地方不涉及编码处理(对编码数据的修改比如是将单词数据库里的各词形形态的发音信息数据字段改成拼法字段),所以TTS方式时对语句的编码过程可同于直播方式时对语句的编码过程。Though the content of coded data (decoded data) has modification during TTS mode, the place of modification does not involve encoding process (for example, the pronunciation information data field of each morphological form in the word database is changed into spelling field to the modification of coded data ), so the encoding process of the sentence in the TTS mode can be the same as the encoding process of the sentence in the live mode.

印欧语系语种TTS方式编码举例:同直播方式述,请参考本说明书第39页至第41页对‘The handsome man is an engineer’语句的编码举例。Example of TTS encoding for Indo-European languages: Same as the live broadcast method, please refer to the encoding example of the sentence "The handsome man is an engineer" on pages 39 to 41 of this manual.

非印欧语系语种TTS方式编码举例:同直播方式述,请参考本说明书第65页至第66页对‘这个年轻漂亮男人是一个年轻工程师’语句的编码举例。Example of TTS coding in non-Indo-European languages: the same as the live broadcast method, please refer to the coding example of the sentence "this young beautiful man is a young engineer" on pages 65 to 66 of this manual.

拾音器这边的解码处理有变,主要是解码的最后几步有改变。The decoding process on the pickup side has changed, mainly in the last few steps of decoding.

印欧语系语种解码举例:原话是:The handsome man is an engineer.Indo-European language decoding example: The original words are: The handsome man is an engineer.

语句编码值(即表示语句发音的拼合信息的值,也即一个处理单位的值)为:The sentence encoding value (that is, the value of the combined information representing the pronunciation of the sentence, that is, the value of a processing unit) is:

0000001010001001111111100111001000111001110010110010000000010100010011111111001110010001110011100101100100

01100101010010110000101110010110010101001011000010111001

顺序看各BIT:Look at each BIT in order:

头段:header:

00:本语句是英语语种00: This statement is in English language

00:单词段采用第1种词段编码方式(即频率排序编码方式)00: The word segment adopts the first word segment encoding method (that is, the frequency sorting encoding method)

单词段:word segment:

00101:单词段里的单词及单词组合个数(5个)00101: the number of words and word combinations in the word segment (5)

00:the单词的频率编号的长度信息00: Length information of the frequency number of the word

010:the单词的频率编号010: The frequency number of the word

01:111111的频率编号长度信息01: Frequency number length information of 111111

111111:111111代表陌生单词(handsome),(111111也计入单词及单词组合个数)111111: 111111 represents an unfamiliar word (handsome), (111111 is also included in the number of words and word combinations)

10:man单词的频率编号的长度信息10: The length information of the frequency number of the man word

011100100:man单词的频率编号011100100: the frequency number of the word man

01:is an单词组合的频率编号的长度信息01: The length information of the frequency number of the is an word combination

110011:is an单词组合的频率编号110011: Frequency number of is an word combination

10:engineer单词的频率编号的长度信息10: The length information of the frequency number of the word engineer

010110010:engineer单词的频率编号010110010: the frequency number of the word engineer

时态词形选择段:Tenses and Morphological Choices:

0:man单词的时态词形选择值(此单词在单词数据库此词对应的记录项里有man,men共两个词形形态,用1个bit表示时态词形选择值,本语句用的man词是此词两个词形形态man,men中的第1个,所以时态词形选择值是0)0: tense word form selection value of the word man (there are two word form forms of man and men in the record item corresponding to this word in the word database, and 1 bit is used to represent the tense word form selection value. This statement uses The word man is the first of the two morphological forms of this word, man and men, so the tense morphological selection value is 0)

011:is an单词组合的词形选择(此单词组合有is a,be a,was a,is an,bean,was an共6个词形形态,用3个bit表示,is an是6个词形形态的第4个,所以时态词形选择值是011)011: Word form selection of is an word combination (this word combination has is a, be a, was a, is an, bean, was an a total of 6 word forms, represented by 3 bits, is an is 6 words The 4th of the shape, so the tense word shape selection value is 011)

0:engineer单词的时态词形选择值(此单词有engineer,engineers共两个词形形态,用1个bit表示,engineer是2个词形形态的第1个,所以时态词形选择值是0)0: tense word form selection value of the word engineer (this word has engineer and engineers, which are represented by 1 bit, and engineer is the first of the two word forms, so the tense word form selection value is 0)

陌生单词段:Unfamiliar word segment:

01:陌生单词的编码信息的格式选择信息(词根元辅音组合格式)01: Format selection information for encoding information of unfamiliar words (root vowel and consonant combination format)

01:组成陌生单词发音的词根和元辅音的总个数(2个,本例用了2个词根,没用元辅音;词根包括词基,前缀,后缀)01: The total number of roots and consonants that make up the pronunciation of unfamiliar words (2, this example uses 2 roots, no consonants; roots include bases, prefixes, and suffixes)

0100101100:词根hand在词根数据库里的顺序号0100101100: The sequence number of the root hand in the root database

0010111001:词根some在词根数据库里的顺序号0010111001: the sequence number of the root some in the root database

解码过程:Decoding process:

依照上述各BIT表示的信息,先取头段信息,得知本语句是英语语种且单词段是使用频率排序编码方式来编码的。According to the information represented by the above-mentioned BITs, the header information is obtained first, and it is known that the sentence is in English and the word segments are coded using a frequency sorting coding method.

第1步解码,使用频率排序解码方式(频率排序解码方式的更具体阐述在前面直播方式的以只适用于英语为例的解码规则中有述),先在单词段里取单词段里的单词及单词组合个数信息(5个),再在单词段里往下顺序取得5个频率编号,得:The first step is to decode, use the frequency sorting decoding method (the more specific explanation of the frequency sorting decoding method is described in the decoding rules that only apply to English in the live broadcast mode above), first take the words in the word segment in the word segment And the word combination number information (5), and then obtain 5 frequency numbers in the order of the word segment, get:

010 111111 011100100 110011 010110010010 111111 011100100 110011 010110010

(这时单词段的数据取完,接着在语句编码值中就是时态词形选择段数据的起始位置)(at this moment, the data of the word segment has been fetched, and then in the sentence encoding value, it is the initial position of the tense word form selection segment data)

第2步解码,对于上一步解码得到的111111,查语句编码值中的陌生单词段(上一步解码得到语句编码值中单词段里各频率编号后,再查单词数据表,得各频率编号的时态词形选择值占的BIT位数,而得到时态词形选择段的BIT长度,即可定位陌生单词段在语句编码值中的起始位置,这是常识,象111111,111这样表示陌生词或前后调用的频率编号在时态词形选择段里就不占时态词形选择值BIT),先得陌生单词的编码信息的格式是词根元辅音组合格式,再在陌生单词段往下取词根元辅音的总个数信息及各词根元辅音的顺序号(本例只用了2个词根,没用元辅音),并用词根元辅音组合格式来构造陌生单词发音,得:The 2nd step is decoded, for the 111111 that the decoding of the previous step obtains, look up the unfamiliar word segment in the sentence encoding value (after decoding in the previous step to obtain each frequency number in the word segment in the sentence encoding value, check the word data table again, and get the number of each frequency number The BIT digits occupied by the tense word form selection value, and the BIT length of the tense word form selection segment can be obtained to locate the starting position of the unfamiliar word segment in the sentence encoding value. This is common sense, such as 111111, 111. Unfamiliar words or the frequency numbers that are called before and after do not account for the tense morphological selection value BIT) in the tense morphological selection section. Get the total number information of root vowel consonants and the sequence number of each root vowel consonant (this example has only used 2 roots, no vowel consonants), and use the root vowel consonant combination format to construct the pronunciation of unfamiliar words, get:

010(0100101100 0010111001)011100100 110011 010110010010(0100101100 0010111001)011100100 110011 010110010

第3步,查单词数据表得上一步解码得到的各频率编号的时态词形选择值占的BIT位数,再查语句编码值中的时态词形选择段,得上一步解码得到的各频率编号(即单词或单词组合)的时态词形选择值(有些单词或单词组合比如THE是没选择值的),再结合各频率编号的时态词形选择值查单词数据表,将第2步解码得到的各频率编号换成单词或单词组合的实际的词形形态的拼法,再查处理陌生单词要涉及的词根数据库,将用于组成陌生单词发音的两个词根顺序号(第2步得的结果中的括号内的,’hand’词根和’some’词根的顺序号)各自换成音标字符组合(由一个或多个音标字符组成)并将这两个词根的音标字符组合排在一起成陌生单词的发音信息数据,得(下面第2行即‘‘hand’的音标字符组合和‘some’的音标字符组合’那一行是陌生单词的发音信息数据,组成这两个词根的音标字符组合的音标字符应该是TTS芯片能识别的表示音标的字符,一般是ASCII格式字符):The 3rd step, look up the word data table to get the BIT digits that the tense word form selection value of each frequency number obtained by decoding in the previous step accounts for, and then check the tense word form selection segment in the sentence code value, and obtain the previous step decoding The tense morphological selection value of each frequency number (i.e. word or word combination) (some words or word combination such as THE do not have selection value), then in conjunction with the tense morphological selection value of each frequency numbering check word data table, will The spelling of each frequency number that the 2nd step decoding obtains is changed into the actual morphological form of word or combination of words, then check and process the root database that unfamiliar words will relate to, will be used to form two root sequence numbers ( In the result obtained in step 2, in the parentheses, the sequence numbers of 'hand' root and 'some' root) are replaced by phonetic character combinations (consisting of one or more phonetic characters) and the phonetic characters of these two roots Combining and arranging the pronunciation information data of unfamiliar words together, get (the second line below is the combination of phonetic symbols of ''hand' and the combination of phonetic symbols of 'some'' that line is the pronunciation information data of unfamiliar words, forming these two The phonetic characters of the phonetic character combination of the word root should be the characters that can be recognized by the TTS chip to represent the phonetic symbols, generally ASCII format characters):

’the’单词的拼法(ASCII字符格式)The spelling of the word 'the' (ASCII character format)

‘hand’的音标字符组合和‘some’的音标字符组合The phonetic character combination of 'hand' and the phonetic character combination of 'some'

’is an’单词组合的拼法(ASCII字符格式)The spelling of the word combination of 'is an' (ASCII character format)

’engineer’单词的拼法(ASCII字符格式)The spelling of the word 'engineer' (ASCII character format)

MCU执行解码到这里,再把以上4行字符顺序连接起来就是语句的拼法格式数据(各单词或单词组合的拼法数据之间应有空格符号,如有需要可再插入一些控制字符),也即语句的预备发音数据,可由MCU把语句的拼法格式数据顺序发给TTS芯片,由TTS芯片播放出语音,即是语句的全部发音(在ASCII码文本流中插入音标字符的方式,文本流中间相应要加一些控制字符,具体可参考芯片资料);本解码例子是用整步TTS发音方式来播放所解码的语句的发音的。The MCU executes the decoding here, and then connects the above 4 lines of characters sequentially to form the spelling format data of the sentence (there should be space symbols between the spelling data of each word or word combination, and some control characters can be inserted if necessary), That is, the prepared pronunciation data of the sentence, the spelling format data of the sentence can be sent to the TTS chip sequentially by the MCU, and the voice is played out by the TTS chip, which is the entire pronunciation of the sentence (the method of inserting phonetic symbols in the ASCII code text stream, text Some control characters should be added in the middle of the stream, please refer to the chip documentation for details); this decoding example uses the full-step TTS pronunciation method to play the pronunciation of the decoded sentence.

在本例解码的第3步中有关对时态词形选择段的解码处理的具体细节,还可参考本说明书第83页及第83页后面几页所述对时态词形选择段的解码处理。In the 3rd step of decoding in this example, about the specific details of the decoding process to the tense word form selection segment, you can also refer to the decoding of the tense word form selection segment described on the 83rd page and the following pages of the 83rd page of this manual deal with.

(本解码举例里,各单词及单词组合的频率编号除了可以是本例示的阶梯编号外,还可以是等长编号或哈夫曼编号等;如为等长编号,哈夫曼编号那其前面是不带频率编号长度信息的。)(In this decoding example, the frequency number of each word and word combination can be the ladder number of this example, it can also be a number of equal length or a Huffman number, etc.; if it is a number of equal length, the Huffman number that precedes it It does not carry frequency number length information.)

非印欧语系语种解码举例:语句原话是’这个漂亮男人是一个工程师’Non-Indo-European language decoding example: the original sentence is 'this beautiful man is an engineer'

语句编码值(即表示语句发音的拼合信息的值,也即一个处理单位的值)为:The sentence encoding value (that is, the value of the combined information representing the pronunciation of the sentence, that is, the value of a processing unit) is:

01 00 001100001101111111100110011000101010100110100111010100101010010110101 00 0011000011011111111001100110001010101001101001110101001010100101101

000101110000000101110000

顺序看各BIT:Look at each BIT in order:

头段:header:

01:本语句是汉语语种01: This sentence is in Chinese language

00:词段采用第1种词段编码方式(即频率排序编码方式)00: The word segment adopts the first word segment encoding method (that is, the frequency sorting encoding method)

词段:Phrases:

00110:词段里的词及词组合的个数(6个)00110: the number of words and word combinations in the word segment (6)

00:’这个’词的频率编号的长度信息00: The length information of the frequency number of the word 'this'

011:’这个’词的频率编号011: frequency number of word 'this'

01:111111的频率编号长度信息01: Frequency number length information of 111111

111111:111111代表陌生词(‘漂亮’),(111111也计入词及词组合个数)111111: 111111 represents an unfamiliar word ('beautiful'), (111111 is also included in the number of words and word combinations)

10:’男人’词的频率编号的长度信息10: length information of the frequency number of the word 'man'

011001100:’男人’词的频率编号011001100: frequency number for the word 'man'

01:’是’词的频率编号的长度信息01: The length information of the frequency number of the word 'is'

010101:’是’词的频率编号010101: Frequency number of the word 'is'

00:’一个’词的频率编号的长度信息00: length information of the frequency number of the word 'one'

110:’一个’词的频率编号110: frequency number of word 'one'

10:’工程师’词的频率编号的长度信息10: length information of the frequency number of the word 'engineer'

011101010:’工程师’词的频率编号011101010: frequency number for the word 'engineer'

陌生词段:Unfamiliar phrases:

01:陌生词的编码信息的格式选择信息(单字拼音作组合格式)01: Format selection information for encoding information of unfamiliar words (single-character pinyin as combined format)

01:单字拼音个数(2个)01: The number of pinyin of a single character (2)

01001011010:单字’漂’的拼音在单字拼音表里的顺序号01001011010: the sequence number of the pinyin of the single character 'Piao' in the single-character pinyin table

00101110000:单字’亮’的拼音在单字拼音表里的顺序号00101110000: the sequence number of the pinyin of the single character 'liang' in the single-character pinyin table

解码过程:Decoding process:

依照上述各BIT表示的信息,先取头段信息,得知本语句是汉语语种且词段是使用频率排序编码方式来编码的。According to the information represented by the above-mentioned BITs, the header information is obtained first, and it is known that the sentence is in the Chinese language and the word segments are coded using the frequency sorting coding method.

第1步解码,使用频率排序解码方式(频率排序解码方式的更具体阐述在前面直播方式的以只适用于汉语为例的解码规则中有述),先在词段里取词段里的词及词组合个数信息(6个),再在词段里往下顺序取得6个频率编号,得:The first step is to decode, use the frequency sorting decoding method (the more specific explanation of the frequency sorting decoding method is described in the decoding rules that only apply to Chinese as an example in the live broadcast mode), and first take the words in the word segment And word combination number information (6), and then obtain 6 frequency numbers in the following order in the word segment, get:

011 111111 011001100 010101 110 011101010011 111111 011001100 010101 110 011101010

第2步解码,对于上一步得的111111,查语句编码值中的陌生词段(显然,上一步解码取完语句编码值中词段里的各频率编号的数据后,接着在语句编码值中就是陌生词段数据的起始位置),先得陌生词的编码信息的格式是单字拼音作组合格式,再在陌生词段往下取单字拼音的个数信息及各单字拼音的顺序号,并用单字拼音作组合格式来构造陌生词发音,得:011(01001011010 00101110000)011001100 010101 110 011101010The 2nd step decodes, for the 111111 that the previous step gets, look up the unfamiliar word segment in the sentence coding value (obviously, after the previous step decodes and gets the data of each frequency number in the word segment in the sentence coding value, then in the sentence coding value It is the starting position of the unfamiliar phrase data), and the format of the encoding information of the unfamiliar phrase is the single-character pinyin as the combined format, and then the number information of the single-character pinyin and the sequence number of each single-character pinyin are obtained in the unfamiliar phrase, and used Single-character pinyin is used as a combined format to construct the pronunciation of unfamiliar words: 011(01001011010 00101110000) 011001100 010101 110 011101010

第3步,再查词数据表,将第2步解码得到的各频率编号转成词的拼法(大五码),再查处理陌生词要涉及的单字拼音表,将用于组成陌生词发音的两个单字拼音的顺序号(第2步得的结果中的括号内的)各自转成拼音(拼音由拼音字符拼法表示,‘piao4’和‘liang4’)并将这两个拼音排在一起成陌生词的发音信息数据,得(下面第2行即‘piao4liang4拼音’那一行是陌生词的发音信息数据,组成表示这两个拼音的拼音字符应是TTS芯片能识别的表示拼音的字符,一般是ASCII格式字符):Step 3, check the word data table again, convert each frequency number obtained by decoding instep 2 into the spelling of the word (big five codes), and then check the single-character pinyin table to be involved in the processing of unfamiliar words, which will be used to form unfamiliar words The sequence numbers of the two single-character pinyins (in the parentheses in the result obtained in the second step) are converted into pinyin respectively (pinyin is represented by the spelling of pinyin characters, 'piao4' and 'liang4') and these two pinyin are arranged Together to form the pronunciation information data of unfamiliar words, get (the second line below is the pronunciation information data of 'piao4liang4 pinyin', and the phonetic characters forming the representation of these two pinyin should be the ones that can be recognized by the TTS chip to represent pinyin Characters, generally ASCII format characters):

‘这个’      词的拼法(大五码)The spelling of the word 'this' (big five yards)

piao4liang4   拼音piao4liang4 pinyin

‘男人’      词的拼法(大五码)The spelling of the word 'man' (big five)

‘是’        词的拼法(大五码)The spelling of the word 'is' (big five yards)

‘一个’      词的拼法(大五码)The spelling of the word 'one' (big five yards)

‘工程师’    词的拼法(大五码)The spelling of the word 'engineer' (big five yards)

MCU将解码执行到这里,再把以上6行字符顺序连接起来就是语句的拼法格式数据,(如有需要可再插入一些控制字符),也即语句的预备发音数据,可由MCU把此语句的拼法格式数据顺序发给TTS芯片,由TTS芯片播放出语音,即是语句的全部发音(在大五码文本流中插入拼音字符的方式,文本流中间相应要加一些控制字符,具体可参考芯片资料);也可由MCU把语句的拼法格式数据的部分(即语句的部分预备发音数据)顺序发给TTS芯片,由TTS芯片播放出语音,即是语句的非全部发音;本解码例子是用整步TTS发音方式来播放所解码的语句的发音的。(本解码举例里,各词及词组合的频率编号除了可以是本例示的阶梯编号外,还可以是等长编号或哈夫曼编号等;如为等长编号,哈夫曼编号那其前面是不带频率编号长度信息的。)The MCU executes the decoding until here, and then connects the above 6 lines of characters in order to form the spelling format data of the sentence (if necessary, some control characters can be inserted), that is, the prepared pronunciation data of the sentence, which can be converted by the MCU. The spelling format data is sent to the TTS chip in sequence, and the voice is played by the TTS chip, which is the entire pronunciation of the sentence (in the way of inserting pinyin characters in the big five text stream, some control characters should be added in the middle of the text stream, for details, refer to chip data); the MCU can also send the part of the spelling format data of the sentence (that is, the part of the prepared pronunciation data of the sentence) to the TTS chip in sequence, and the voice will be played by the TTS chip, that is, the non-full pronunciation of the sentence; this decoding example is The pronunciation of the decoded sentence is played in a full-step TTS pronunciation mode. (In this decoding example, the frequency numbering of each word and word combination can be except the ladder numbering of this example, can also be equal-length numbering or Huffman numbering etc.; If it is equal-length numbering, the Huffman numbering that its front It does not carry frequency number length information.)

在主MCU解码得到ASCII码文本流或大五码文本流之后,主MCU在向TTS芯片传输文本流以前,有时需在文本流数据中插入一些控制字节,比如暂停,恢复等,再传输数据,这点属于芯片本身要求,请参阅芯片资料。After the main MCU decodes the ASCII code text stream or the big five code text stream, before the main MCU transmits the text stream to the TTS chip, sometimes it needs to insert some control bytes in the text stream data, such as pause, resume, etc., and then transmit the data , which is required by the chip itself, please refer to the chip information.

按本说明书所述的采用TTS产音方式的拾音器,其TTS芯片与用于其它常规产品的TTS芯片的使用形式没什么实质差别,所以TTS芯片原来有的一些功能,仍然可以使用,比如由主MCU发命令给TTS芯片,以设置TTS芯片系统参数,操作芯片运行,调节音量大小,调节语音播放速度,做播放暂停,缩略语处理等,其中调节播放速度可实现语句的变速播放功能,至于如何发命令及其命令格式,请参考具体芯片资料。According to the pickup that adopts the TTS sound production method described in this manual, the TTS chip has no substantial difference from the TTS chip used in other conventional products, so some original functions of the TTS chip can still be used, such as by the main MCU Send commands to the TTS chip to set the system parameters of the TTS chip, operate the chip, adjust the volume, adjust the playback speed of the voice, pause the playback, process abbreviations, etc., and adjust the playback speed to realize the variable-speed playback function of the sentence. As for how to send For the command and its format, please refer to the specific chip information.

一般情况下TTS方式的语音编码有声读物与直播方式的语音编码有声读物是没什么特别区别的(本段以下所提及的有声读物是指TTS方式的语音编码有声读物),作为有声读物主体的读物仍然可以是书,书籍,卡片或报纸杂志等(读物自身范围内是与拼合信息码无关的,如读物的整页面上形成有拼合信息码,那读物整页面上的这些拼合信息码与读物就一起形成有声读物),有声读物的整页面上的诸个码字中各码字,仍然可以是条形码,导电码,图形码或磁码等码种,多个页面可同时有码字,任一页面上的码字数量可以是零个或一个或多个,码的位置也仍然是在有声读物各页面上任意位置,一本有声读物的整页面上可有一个或多个语种的码字(或可有其同一个码字涉及的各拼合信息的值是不同语种的的码字),每个码字的值是共与数个拼合信息的值有关的(比如每个码字的值可以有一个拼合信息的值),拾音器根据读得的码字的码值来播出语句发音,各码字可以是处在有声读物的整页面上需播此码字对应播的数个语句发音的位置,各码字在有声读物的整页面上所处位置一般是在印刷着码字对应播的数个语句发音的文字的旁边。对于一本有声读物,可以是其所有页面都有码字,也可以是只有部分页面有码字。本段所说的任一码字,其有涉及的各拼合信息的值中的任一拼合信息的值都是由编码规则中有关某一语种的TTS方式的编码细则形成的(不同拼合信息的值可以是用相同或不同编码规则编码生成的)。Under normal circumstances, there is no special difference between speech-encoded audiobooks in TTS mode and speech-encoded audiobooks in live broadcast mode (the audiobooks mentioned below in this paragraph refer to speech-encoded audiobooks in TTS mode), as the main body of audiobooks Still can be books, books, cards or newspapers and magazines etc. together form audiobooks), each codeword in the codewords on the entire page of the audiobook can still be a barcode, conductive code, graphic code or magnetic code, etc., multiple pages can have codewords at the same time, any The number of codewords on the page can be zero or one or more, and the position of the code is still any position on each page of the audiobook. There can be codewords of one or more languages on the entire page of an audiobook ( Or can have the value of each mosaic information that its same code word relates to be the code word of different languages), the value of each code word is to be related to the value of several mosaic information altogether (such as the value of each code word can be There is a value of splicing information), and the pickup will broadcast the pronunciation of the sentence according to the code value of the read code word, and each code word can be the pronunciation of several sentences that need to be played on the entire page of the audio book. Position, the position of each code word on the entire page of the audiobook is generally next to the text that the code word is printed corresponding to the pronunciation of several sentences broadcast. For an audiobook, all pages may have codewords, or only some pages may have codewords. Any code word said in this paragraph, it has the value of any combination information in the value of each combination information involved all to be formed by the encoding rules of the TTS mode of relevant certain language in the encoding rule (different combination information Values can be encoded using the same or different encoding rules).

如有一本语音编码有声读物,其整页面上有些码字涉及的各拼合信息的值是由某编码规则中有关某一语种的TTS方式的编码细则形成的,另一些码字涉及的各拼合信息的值是由某编码规则中有关某一语种的直播方式的编码细则形成的(如果这里的直播方式的编码细则与TTS方式的编码细则有不同时),那此语音编码有声读物还是有上段述的那些特征。如有一本语音编码有声读物,其整页面上有些码字涉及的全部数个拼合信息的值中有些拼合信息的值是由某编码规则中有关某一语种的TTS方式的编码细则形成的,而另一些拼合信息的值是由某编码规则中有关某一语种的直播方式的编码细则形成的(如果这里的直播方式的编码细则与TTS方式的编码细则有不同时),那此语音编码有声读物还是有上段述的那些特征。For example, if there is a speech coded audiobook, the value of each piece of information involved in some codewords on the entire page is formed by the encoding rules of the TTS mode of a certain language in a certain encoding rule, and the value of each piece of information involved in other codewords The value of is formed by the encoding rules of the live broadcast method of a certain language in a certain encoding rule (if the encoding rules of the live broadcast method here are different from the encoding rules of the TTS method), then the speech coded audiobook still has the above paragraph of those characteristics. For example, if there is a speech coded audiobook, the value of some of the combined information involved in some codewords on the entire page is formed by the coding rules of the TTS mode of a certain language in a certain coding rule, and The value of other stitching information is formed by the encoding rules of the live broadcast mode of a certain language in a certain encoding rule (if the encoding rules of the live broadcast mode here are different from the encoding rules of the TTS mode), then the voice coded audiobook It still has the characteristics mentioned in the previous paragraph.

拾音器这边,TTS方式拾音器仍可以是非插卡方式拾音器或插数据卡方式拾音器或插部件式数据卡方式拾音器,TTS方式的这几种拾音器的电路结构与本说明书其它地方述及的直播方式的这几种拾音器的电路结构基本相同,只是用TTS器件来播语音;TTS方式时,非插卡方式拾音器主体就是非插卡方式拾音器,插入数个数据卡到插数据卡方式拾音器主体上以形成插数据卡方式拾音器,插入数个部件式数据卡到插部件式数据卡方式拾音器主体上以形成插部件式数据卡方式拾音器;TTS方式的非插卡方式拾音器或插数据卡方式拾音器的拾音器主体的硬件一般仍包括一个读码头,一个单片机芯片,一个发音器件(带喇叭的TTS芯片),存储器,按键模块等;TTS方式拾音器的读码头仍然可以是光电式读码头(用以读条形码或导电码),或机电式导电码读码头(用以读导电码),或其它读码头。解码有关程序及解码有关数据可以存储在与TTS方式拾音器有关的存储器集里,因使用TTS芯片发音,所以可不需在与存储器集里存储诸音码的发音数据。For the pickup, the TTS pickup can still be a non-card pickup, a data card pickup, or a plug-in data card pickup. The circuit structure of these TTS pickups is the same as that of the live broadcast method mentioned elsewhere in this manual. The circuit structures of these types of pickups are basically the same, except that TTS devices are used to broadcast voice; in the TTS mode, the main body of the non-card pickup is the non-card pickup, and several data cards are inserted into the main body of the data card pickup to form Plug-in data card mode pickup, insert several component type data cards into the main body of the plug-in component type data card mode pickup to form a plug-in component type data card mode pickup; TTS mode non-plug-in mode pickup or pickup body of the data card mode pickup The hardware generally still includes a reading terminal, a single-chip microcomputer chip, a pronunciation device (TTS chip with a speaker), memory, key module, etc.; the reading terminal of the TTS mode pickup can still be a photoelectric reading terminal (used to read barcodes or conductive Code), or electromechanical conductive code reading terminal (used to read conductive code), or other reading terminals. Decoding relevant programs and decoding relevant data can be stored in the memory set relevant to the TTS mode pickup, because of using the TTS chip to pronounce, so it is not necessary to store the pronunciation data of all sound codes in the memory set.

然后本说明书后面述的直播方式编解码处理所使用的几个技术点:1。做多个词数据库以适应不同题材短文,来提高压缩率。2。在单词(词)编号中开10个编号来存储前面句子使用的偏僻词。3。使用相同或兼容的编解码规则及数据,可实现语音编码有声读物与拾音器主体的互相通用。4。语句停顿处理。这几个技术点在TTS方式编解码处理时仍然可用的。Then several technical points used in the live codec processing described later in this manual: 1. Make multiple word databases to adapt to short essays of different themes to improve the compression rate. 2. Open 10 numbers in the word (word) numbering to store the remote words used in the previous sentence. 3. By using the same or compatible codec rules and data, the intercommunication between the voice coded audiobook and the pickup body can be realized. 4. Statement stall processing. These technical points are still available in TTS codec processing.

并且采用TTS器件(比如TTS芯片)来发音的插数据卡方式拾音器(包括印欧语系各语种或非印欧语系各语种),数据卡上的存储器里可存储解码有关程序及解码有关数据,而拾音器的运行主控,读码等程序可存储在拾音器主体内的存储器里(或也可存储在数据卡上的存储器里),使用时,把此数据卡插入拾音器主体使数据卡与拾音器主体合为一个拾音器(本段仍是以在拾音器主体上插一个卡而不是插多个卡为例),由此拾音器读码并调卡上的解码有关程序及解码有关数据执行解码,并可由MCU把语句的全部或非全部预备发音数据(语句的全部或非全部拼法格式数据)通过通讯线传给TTS器件(比如TTS芯片,TTS芯片一般是在拾音器主体内,不是在数据卡上的)播出语句的全部发音或非全部发音(这里解码时的发音方式是以整步TTS发音方式为例的,也可以是分步TTS发音方式),有关插数据卡的更详细技术描述在后面。采用TTS器件来发音的插数据卡方式拾音器是可适用于印欧语系各语种及非印欧语系各语种及任一复合语种的。And adopt the TTS device (such as TTS chip) to insert the data card mode pickup (comprising various languages of Indo-European language family or non-Indo-European language family) of pronunciation, can store decoding relevant program and decoding relevant data in the memory on the data card, and The operation master control of the pickup, code reading and other programs can be stored in the memory in the pickup body (or can also be stored in the memory on the data card). When in use, insert the data card into the pickup body to make the data card and pickup body combined It is a pickup (this paragraph is still an example of inserting one card on the pickup body instead of inserting multiple cards), so that the pickup reads the code and adjusts the decoding-related programs and decoding-related data on the card to perform decoding, and can be decoded by the MCU. All or part of the prepared pronunciation data of the sentence (all or part of the spelling format data of the sentence) is transmitted to the TTS device (such as the TTS chip, which is generally in the main body of the pickup, not on the data card) through the communication line. Pronounce all or part of the sentence (the pronunciation mode during decoding here is an example of the full-step TTS pronunciation mode, and can also be a step-by-step TTS pronunciation mode), and the more detailed technical description of the relevant data card is described later. The data card mode pickup that adopts the TTS device to pronounce is applicable to various languages of the Indo-European language family, various languages of the non-Indo-European language family and any compound language.

常规的TTS器件是做成独立芯片的,本说明书也主要是按此描述的,但如专门将主MCU和TTS器件集成做成一个芯片,也是可以的,其本质都一样,只是由主MCU解码出来的数据,不用传到外面只需在芯片内传输,传输的数据格式仍是前述的文本流(ASCII码流或大五码流)。本发明文件约定,采用TTS产音方式的拾音器,如其TTS器件是独立的TTS芯片,那拾音器的发音器件就是指带喇叭的TTS芯片,如TTS器件是与主MCU集成为一个芯片,此芯片外接喇叭来发音,那拾音器的发音器件就是喇叭。The conventional TTS device is made into an independent chip, and this manual is mainly described according to this, but it is also possible to integrate the main MCU and the TTS device into one chip, the essence of which is the same, but it is decoded by the main MCU The output data does not need to be transmitted outside but only needs to be transmitted within the chip, and the format of the transmitted data is still the aforementioned text stream (ASCII code stream or big five code stream). The document of the present invention stipulates that if the pickup adopts the TTS sound production method, if its TTS device is an independent TTS chip, then the pronunciation device of the pickup refers to the TTS chip with a horn. If the TTS device is integrated with the main MCU as a chip, the chip is externally connected The horn is used to pronounce the sound, and the sounding device of the pickup is the horn.

实施例:图2是一个使用TTS芯片的拾音器主体(因不插数据卡,也不插部件式数据卡,所以此拾音器主体也就是拾音器)的实施例的电路图,3是主MCU,主MCU型号选用SNC5A9,TTS芯片选用WTS701(可选用只发英语的WTS701EF型号),为了使图清晰直观,图中此芯片有些没使用的脚位没标出;发音器件是带喇叭的WTS701芯片(即发音器件是带喇叭的TTS芯片,此带喇叭的TTS芯片也是一个子发音器件),两个按键可分别作为音量循环调节键和工作唤醒休眠键(这两个按键组成按键模块);本实施例的拾音器主体在解码时发音总是采用TTS产音方式的;读码头选用光笔4(此光笔不一定采用附图1中所示结构的光笔,可以是其它种类光笔),用于读取语音编码有声读物的整页面上的条形码(或导电码),与本实施例的拾音器主体配合使用的语音编码有声读物如图7示,此有声读物的整页面上只有一些条形码11(与此拾音器配合使用的语音编码有声读物也可如图3示,此有声读物的整页面上只有一些导电码10)。光笔是现成公知技术,请参考相关资料,读码时,使光笔接触有声读物的整页面上的条形码并在条形码上沿水平方向匀速扫过;光从条形码的条空反射回来在光笔电路上形成方波,方波形式与条形码的条空性状相同(比如一个条形码码字的一个条宽是0.4mm,与其相邻的空宽是0.2mm,那由此条空得到的方波的波峰与相邻波谷的宽度比也是2:1),此方波从SNC5A9的P33脚进入SNC5A9,由SNC5A9分析方波的峰谷宽度数据来解出码值,当然SNC5A9软件里需有对从P33脚进来的方波的波形误差的处理。WTS701EF芯片的复位端可接专用复位电路,这里是用SNC5A9的P63脚来产生WTS701EF的复位信号(用10微秒高电平复位)。SNC5A9芯片上的P20,P21,P22,P23,P60,P61,P62各脚是与TTS芯片通讯的通讯脚。(光笔读导电码的原理与读条形码的原理类似,读码时,使光笔接触导电码并在导电码的码线区沿水平方向匀速扫过,由光从码线区的条空反射回来在光笔电路上形成方波,SNC5A9用P33脚读取方波并根据方波来算出码值)Embodiment: Fig. 2 is the circuit diagram of the embodiment of a pickup main body (because no data card is inserted, nor component type data card inserted, so this pickup main body is also pickup) that uses TTS chip, 3 is main MCU, main MCU model SNC5A9 is selected, and WTS701 is selected as the TTS chip (the WTS701EF model that only sends out English can be selected). In order to make the picture clear and intuitive, some unused pins of this chip are not marked in the figure; It is a TTS chip with a horn, and this TTS chip with a horn is also a sub-pronunciation device), and two buttons can be used as a volume cycle adjustment key and a work wake-up dormancy key respectively (these two buttons form a key module); the pickup of the present embodiment The main body always adopts the TTS sound production mode when decoding; the reading dock selects the light pen 4 (this light pen does not necessarily adopt the light pen of the structure shown in the accompanying drawing 1, it can be other kinds of light pens), and is used to read the speech coding audiobook The bar code (or conduction code) on the whole page of the audiobook used in conjunction with the pickup main body of this embodiment is shown in Figure 7, and there are only some barcodes 11 (the voice code used in conjunction with this pickup) on the entire page of this audiobook. Encoding audiobooks can also be shown in Figure 3, there are only some conductive codes 10) on the entire page of this audiobook. The light pen is a ready-made known technology, please refer to the relevant information. When reading the code, make the light pen touch the barcode on the entire page of the audiobook and sweep across the barcode at a uniform speed along the horizontal direction; Square wave, the shape of the square wave is the same as that of the bars and spaces of the barcode (for example, the bar width of a barcode code word is 0.4mm, and the width of the space adjacent to it is 0.2mm, then the peak of the square wave obtained from this space is the same as that of the adjacent space. The width ratio of the trough is also 2:1), this square wave enters the SNC5A9 from the P33 pin of the SNC5A9, and the SNC5A9 analyzes the peak-to-valley width data of the square wave to decode the code value. Waveform error processing. The reset terminal of the WTS701EF chip can be connected to a special reset circuit. Here, the P63 pin of the SNC5A9 is used to generate the reset signal of the WTS701EF (reset with a high level of 10 microseconds). P20, P21, P22, P23, P60, P61, and P62 on the SNC5A9 chip are communication pins for communicating with the TTS chip. (The principle of reading a conductive code with a light pen is similar to the principle of reading a barcode. When reading a code, make the light pen touch the conductive code and sweep across the code line area of the conductive code at a uniform speed in the horizontal direction, and the light is reflected from the bar space in the code line area to return to the bar code. A square wave is formed on the light pen circuit, SNC5A9 reads the square wave with P33 pin and calculates the code value according to the square wave)

如本实施例涉及的语音编码有声读物(如图7示)的整页面上的每个条形码的值有一个表示英语语句发音的拼合信息的值(这些表示英语语句发音的拼合信息的值都是按本说明书述的TTS方式的以只适用于英语的编码规则来编码生成的,把此有声读物的整页面上的所有条形码11集合起来就是此有声读物整页面上的诸个码字);按常识,拾音器涉及的读码,拾音器运行主控等程序都是存储在SNC5A9片内的存储器里,按单片机技术公知惯例可知,这里SNC5A9的内存储器足够大,所以解码数据都可存储在SNC5A9片内的存储器里,解码程序也可存储在SNC5A9片内存储器里(本实施例的拾音器主体是非插卡方式拾音器主体),此解码程序是按照本说明书述的TTS方式的以只适用于英语为例的解码规则来解码的,此解码数据是此解码程序解码时使用的单词数据库,词根数据库等数据,本实施例的拾音器在解码时采用整步TTS发音方式;使用时,由SNC5A9利用光笔读码头每次读取语音编码有声读物的整页面上的诸个码字中的部诸码字中的任一个码字(此任一个码字即条形码11;本来此拾音器主体能读这诸个码字中的每个码字且可按码值播出语句发音,可把此有声读物的整页面上的诸个码字中的全部条形码定义成此拾音器主体能读的部诸码字),由SNC5A9片内主处理器调片内存储器里的此解码程序和此解码数据对此一个条形码涉及(也就是条形码里)的英语语种的拼合信息的值进行解码,解码最后几步得到的语句的预备发音数据是适于向TTS芯片传输的语句的拼法格式数据,SNC5A9可再通过通讯脚将语句的全部预备发音数据(即语句的拼法格式数据的全部)传给TTS芯片播出语句的全部发音,(或SNC5A9可通过通讯脚将语句的非全部预备发音数据即语句的拼法格式数据的一部分传给TTS芯片播出语句的非全部发音)。这里TTS芯片型号是WTS701EF,其能接收处理英语的ASCII码文本流,文本流中间可插入音标字符。The value of each barcode on the entire page of the voice coded audiobook (as shown in Figure 7) involved in this embodiment has a value representing the spliced information of the pronunciation of English sentences (the values of the spliced information representing the pronunciation of English sentences are all According to the TTS method described in this manual, it is generated by encoding only applicable to English, and all thebarcodes 11 on the entire page of the audiobook are assembled to be the codewords on the entire page of the audiobook); press Common sense, the code reading involved in the pickup, the main control program of the pickup operation, etc. are all stored in the memory in the SNC5A9 chip. According to the known practice of single-chip technology, the internal memory of the SNC5A9 is large enough, so the decoded data can be stored in the SNC5A9 chip. In the memory, the decoding program can also be stored in the SNC5A9 on-chip memory (the main body of the pickup in this embodiment is the main body of the non-card pickup), this decoding program is in accordance with the TTS method described in this manual, and it is only applicable to English as an example The decoding rule decodes, and this decoding data is the word database that this decoding program uses when decoding, data such as word root database, and the pickup of the present embodiment adopts full-step TTS pronunciation mode when decoding; Read any one of the codewords in the codewords on the entire page of the speech coded audiobook (this arbitrary codeword is the bar code 11; originally this pickup main body can read these codewords Each code word of each code word and the sentence pronunciation can be played out according to the code value, and all the barcodes in the code words on the entire page of the audiobook can be defined as all the code words that can be read by the main body of the pickup), composed of SNC5A9 The internal main processor adjusts the decoding program and the decoding data in the internal memory of the chip to decode the value of the combined information of the English language involved in this barcode (that is, in the barcode), and decode the prepared pronunciation data of the sentences obtained in the last few steps It is suitable for the spelling format data of the sentence transmitted to the TTS chip. SNC5A9 can then pass all the prepared pronunciation data of the sentence (that is, all the spelling format data of the sentence) to the TTS chip to broadcast all the pronunciation of the sentence through the communication foot. (or SNC5A9 can pass the non-full pronunciation of sentence to the non-full pronunciation data of sentence, that is, the part of the spelling format data of sentence, by communication foot). The TTS chip model here is WTS701EF, which can receive and process English ASCII code text streams, and phonetic characters can be inserted in the middle of the text streams.

本实施例提及的‘拾音器’就是指本实施例说的‘拾音器主体’;本实施例所说的拾音器主体,是包括图2中的所有部分的。The 'pickup' that the present embodiment mentions refers to the 'pickup main body' that the present embodiment says; the pickup main body that the present embodiment says includes all parts in Fig. 2.

本实施例的拾音器在播放使用过程时,可由SNC5A9发命令给WTS701EF,以调节音量,调节语音播放速度,做播放暂停等,其中调节播放速度可实现语句的变速播放功能,发命令的方式与用于其它常规产品的方式相比没什么特别之处,一般是将所发命令做成字节格式穿插在语句文本流中,发给WTS701EF,至于如何发命令及其命令格式,具体请参考芯片资料。另外还有专门的设置芯片系统参数及操作芯片运行的命令(在不播放时发送),以控制芯片工作状态,及设置调节音量大小,语速等参数。使用不同的语音播放速度参数可实现语句的不同速度播放功能。When the pickup of this embodiment is playing and using, the SNC5A9 can send commands to WTS701EF to adjust the volume, adjust the voice playback speed, and do playback pauses, etc., wherein adjusting the playback speed can realize the variable-speed playback function of sentences, and the way of sending commands is different from that of users. Compared with other conventional products, there is nothing special about it. Generally, the sent commands are made into byte format and interspersed in the statement text stream, and sent to WTS701EF. As for how to send commands and their command formats, please refer to the chip documentation for details. In addition, there are special commands for setting chip system parameters and operating the chip (sent when not playing), to control the working state of the chip, and to set and adjust parameters such as volume and speech rate. Using different voice playback speed parameters can realize the different speed playback function of sentences.

TTS芯片也可以是可播多个语种的,那在与拾音器有关的存储器集里,可存储有适用于各自语种的各解码程序及解码数据,或可存储有可适用于这多个语种的一套解码程序及解码数据,比如TTS芯片既可播英语又可播汉语的,那一般在与拾音器有关的存储器集里既存储有只适用于英语的解码程序及解码数据,也存储有只适用于汉语的解码程序及解码数据的话,这样一来一个拾音器就既可读其页面上有英语语种的码字的语音编码有声读物,又可读其页面上有汉语语种的码字的语音编码有声读物,也可读其页面上既有英语语种的码字也有汉语语种码字的语音编码有声读物,也可读其页面上有其同一码字涉及的多个拼合信息的值中既有英语语种的也有汉语语种的拼合信息的值的码字的语音编码有声读物。The TTS chip can also broadcast multiple languages. In the memory set relevant to the pickup, it can store each decoding program and decoding data applicable to each language, or can store a program applicable to these multiple languages. A set of decoding programs and decoding data, such as TTS chips that can play both English and Chinese, generally in the memory set related to the pickup, there are both decoding programs and decoding data that are only applicable to English, and there are also storages that are only applicable to Chinese decoding program and decoding data, so that a pickup can not only read the audio-coded audiobooks with code words in English language on its page, but also read the audio-coded audiobooks with code words in Chinese language on its page , you can also read the phonetic coded audiobooks with both English and Chinese codewords on the page, and you can also read the English-language audiobooks on the page where there are multiple spliced information related to the same codeword There are also speech coded audiobooks of the codewords of the value of the spliced information in the Chinese language.

机械形状方面,仍然可以象前面直播方式述的,关于本说明书所述的TTS方式具有机电式或光电式读码头的拾音器主体与语音编码有声读物的形状,可以把语音编码有声读物与拾音器主体做成多种形式,比如:一种方式是把拾音器主体做成大机体连读码头形式,把MCU和TTS芯片和喇叭和按键模块做在机体内(如为插数据卡方式,卡插槽也可做在机体上),读码头可做成移动的,并通过导线与机体连接,把大的机体连读码头当成拾音器主体,大机体的外型可以是适于让语音编码有声读物放入机体里的,使用时,将语音编码有声读物(如图3所示为语音编码有声读物示例)放入机体里使用,移动读码头去读语音编码有声读物的整页面上的码字,由机体内的MCU取得读码值并解码使TTS芯片播出语句发音;另外,大机体的外型也可以是适于让语音编码有声读物放在机体上的,使用时,将语音编码有声读物(如图3所示为语音编码有声读物示例)放在机体上使用,移动读码头去读语音编码有声读物的整页面上的码字,由机体内的MCU取得读码值并解码使TTS芯片播出语句发音。另一种方式不做大机体,而把MCU和TTS芯片和读码头和喇叭和按键模块等集成在一个小壳体里成为适于手持的拾音器主体(当然读码头一般需露出在小壳体外以便读码,如为插数据卡方式,卡插槽也可做在小壳体上),而语音编码有声读物是与拾音器主体独立的,使用时语音编码有声读物可放在桌面不动,手持小壳体,移动小壳体用小壳体上的读码头去接触或接近(如为机电式导电码读码头时可以用接触的方式,如为光电式读码头时可用接触或接近的方式)在语音编码有声读物的整页面上的码字以读码播出语句发音。另一种方式还可把读码头做成移动的,将MCU和TTS芯片和喇叭集成在一个小壳体里,此小壳体可放在桌面上,读码头与小壳体用导线连接,把小壳体连读码头当成拾音器主体。In terms of mechanical shape, it can still be described as the live broadcast method above. Regarding the TTS method described in this manual, the pickup body of the electromechanical or photoelectric reading dock and the shape of the voice-coded audiobook can be used to make the voice-coded audiobook and the pickup body. In many forms, for example: one way is to make the main body of the pickup into the form of a large body continuous reading dock, and make the MCU and TTS chip, speaker and button module in the body (for example, the way of inserting a data card, the card slot can also be made on the body), the reading dock can be made mobile, and connected to the body through wires, the large body connecting dock is used as the main body of the pickup, and the appearance of the large body can be suitable for putting voice-coded audiobooks into the body Yes, when in use, put the voice-coded audiobook (as shown in Figure 3 as an example of the voice-coded audiobook) into the body for use, and move the reading dock to read the codewords on the entire page of the voice-coded audiobook, which is controlled by the inside of the body The MCU obtains the code reading value and decodes it to make the TTS chip broadcast the pronunciation of the sentence; in addition, the appearance of the large body can also be suitable for allowing the voice-coded audiobook to be placed on the body. When in use, the voice-coded audiobook (as shown in Figure 3 Shown is an example of a voice-coded audiobook) used on the body, and the mobile reading dock is used to read the code words on the entire page of the voice-coded audiobook, and the MCU in the body obtains the code reading value and decodes it to make the TTS chip broadcast the sentence pronunciation . Another way is not to make a large body, but to integrate the MCU and TTS chip, the reading terminal, the speaker and the button module into a small housing to become the main body of the pickup suitable for hand-holding (of course, the reading terminal generally needs to be exposed outside the small housing to facilitate Code reading, such as the way of inserting a data card, the card slot can also be made on the small shell), and the audio-coded audiobook is independent of the main body of the pickup. When in use, the audio-coded audiobook can be placed on the desktop and held in a small Housing, move the small housing and use the reading terminal on the small housing to touch or approach (such as the electromechanical conductive code reading terminal can use the contact method, such as the photoelectric reading terminal can use the contact or approach method) in the Speech-coding codewords on a full page of an audiobook reads the pronunciation of the sentence in code-reading. In another way, the reading terminal can also be made mobile, and the MCU, TTS chip and speaker are integrated in a small case, which can be placed on the table, and the reading terminal is connected with the small case by wires, and the The small shell connects the dock as the main body of the pickup.

对于某一语种,本发明文件所说的编码规则中有关此语种的直播方式的编码细则与另一编码规则中有关此语种的TTS方式的编码细则一般是相同的,但也可能不同,不论相同还是不同,由编码规则中有关此语种的直播方式的编码细则编码生成的拼合信息的值,都可能可由某解码规则中有关此语种的直播方式或TTS方式或混合方式的解码细则解码,不论相同还是不同,由编码规则中有关此语种的TTS方式的编码细则编码生成的拼合信息的值,都可能可由某解码规则中有关此语种的直播方式或TTS方式或混合方式的解码细则解码;因为作为直播方式编解码涉及的音码与TTS方式编解码涉及的音码种类一般是相同的(比如元辅音,单词等),而一种音码既可用于直播方式解码,也可用于TTS方式解码(比如一个单词,用于直播方式解码时其可以转换成词形音标这样的发音信息数据,在TTS方式解码时可以转换为用于TTS器件发音的拼法数据或发音信息数据),而既可用于直播方式解码也可用于TTS方式解码就推出可用于混合方式解码;本段所说的编码规则可以是只适用于一个语种或适用于多个语种的,本段所说的解码规则可以是只适用于一个语种或适用于多个语种的(这句和上一句提及的两个‘多个’不一定是相同数目)。For a certain language, the coding rules of the live broadcast mode of this language in the coding rules mentioned in the document of the present invention are generally the same as the coding rules of the TTS mode of this language in another coding rule, but they may also be different, regardless of the same Still different, the value of the spliced information encoded by the coding rules of the live broadcast mode of this language in the encoding rule may be decoded by the decoding rules of the live broadcast mode or TTS mode or mixed mode of the language in a certain decoding rule, regardless of the same Still different, the value of the combined information encoded by the TTS coding rules of this language in the encoding rules may be decoded by the decoding rules of the live broadcast mode or TTS mode or mixed mode of this language in a certain decoding rule; because as The sound codes involved in the live codec and the TTS codecs are generally the same (such as vowels, consonants, words, etc.), and a kind of sound code can be used for both live broadcast decoding and TTS decoding ( For example, a word can be converted into pronunciation information data such as word form and phonetic symbol when it is used for live broadcast mode decoding, and can be converted into spelling data or pronunciation information data for TTS device pronunciation when TTS mode decoding), and can be used for both Live broadcast decoding can also be used for TTS decoding, and it can be used for hybrid decoding; the encoding rules mentioned in this paragraph can be only applicable to one language or multiple languages, and the decoding rules mentioned in this paragraph can be applicable only In one language or applicable to multiple languages (this sentence and the two "multiple" mentioned in the previous sentence are not necessarily the same number).

比如采用本说明书述的直播方式的以只适用于英语为例的编码规则来编码生成拼合信息的值,而如要采用本说明书述的TTS方式的以只适用于英语为例的解码规则来解码,此解码规则中定义的解码语句集范围一般需与此编码规则中定义的编码语句集范围相同,这种解码规则所用解码数据与直播方式的以只适用于英语为例的解码规则所用的解码数据一般是基本相同,只是单词数据库的每个记录项里的每个词形形态有关发音的信息的记录格式不同(TTS方式的是记录拼法数据,直播方式的是记录词形音标),而这点区别只是影响到解码生成预备发音数据(或部分预备发音数据)时有点改动,而其它解码过程TTS方式与直播方式是基本相同的,所以TTS方式的以只适用于英语为例的解码规则也可以去解码采用直播方式的以只适用于英语为例的编码规则来编码生成的拼合信息的值;同理,直播方式的以只适用于英语为例的解码规则也可以去解码采用TTS方式的以只适用于英语为例的编码规则来编码生成的拼合信息的值。对于汉语的情况与英语类似。For example, use the encoding rules that are only applicable to English as an example in the live broadcast method described in this manual to encode the value of the combined information, and if you want to use the TTS method described in this manual to decode using the decoding rules that are only applicable to English as an example , the scope of the decoding sentence set defined in this decoding rule generally needs to be the same as the range of the encoding sentence set defined in this encoding rule. The data are generally basically the same, except that the recording format of the pronunciation-related information of each morphological form in each record item of the word database is different (the TTS mode is to record the spelling data, and the live mode is to record the morphological phonetic symbols), and This difference only affects a little change when decoding and generating pre-pronunciation data (or part of the pre-pronunciation data), and the other decoding process TTS method is basically the same as the live broadcast method, so the TTS method only applies to English as an example of decoding rules It is also possible to decode the value of the combined information generated by encoding the encoding rules that are only applicable to English in the live broadcast mode; similarly, the decoding rules that are only applicable to English in the live broadcast mode can also be decoded using the TTS method The value of the generated flattened information is encoded using the encoding rules applicable only to English as an example. The situation for Chinese is similar to that for English.

对某语种,显然如要用解码规则中有关此语种的TTS方式(或混合方式)的解码细则来解码用编码规则中有关此语种的直播方式的编码细则编码生成的拼合信息的值,那一般要求这里编码时音码的定义方式及使用的音码情况与解码时音码的定义方式及使用的音码情况相同或兼容(比如都是使用元辅音);如要用解码规则中有关此语种的直播方式(或混合方式)的解码细则来解码用编码规则中有关此语种的TTS方式的编码细则编码生成的拼合信息的值,那一般要求这里编码时音码的定义方式及使用的音码情况与解码时音码的定义方式及使用的音码情况相同或兼容(比如都是使用元辅音)。For a certain language, it is obvious that if you want to use the decoding rules related to the TTS mode (or mixed mode) of this language in the decoding rules to decode the value of the spliced information encoded with the encoding rules related to the live mode of this language in the encoding rules, then generally It is required that the definition method of the sound code and the situation of the sound code used during encoding are the same or compatible with the definition method of the sound code and the situation of the sound code used in decoding (for example, all use vowels and consonants); if you want to use the relevant language in the decoding rules The decoding rules of the live broadcast mode (or mixed mode) to decode the value of the spliced information encoded by the encoding rules of the TTS mode of this language in the coding rules, which generally requires the definition method of the sound code and the sound code used when encoding here The situation is the same as or compatible with the definition mode of the phonetic code and the situation of the used phonetic code when decoding (for example, all use vowels and consonants).

以上3段述及用解码规则中有关某语种的一发音方式的解码细则来解码用编码规则中有关此语种的另一发音方式的编码细则编码生成的拼合信息的值,主要是针对编码时定义的编码语句集与解码时定义的解码语句集是完全相同来说的,也即用此编码规则中有关此语种的本段前面述的另一发音方式的编码细则编码生成的表示此编码语句集中任一语句发音的拼合信息的值,都可由此解码规则中有关此语种的本段前面述的一发音方式的解码细则来解码;对于编码语句集与解码语句集部分相同时,也是与上述类似的。以上3段中除第2段外涉及编码时定义编码语句集不一定是采用编码全集定义的,以上3段中除第2段外涉及解码时定义解码语句集不一定是采用解码全集定义的。The above 3 paragraphs refer to the decoding rules about one pronunciation mode of a certain language in the decoding rules to decode the value of the combined information encoded by the encoding rules about another pronunciation mode in the encoding rules, mainly for the definition at the time of encoding The coded sentence set is exactly the same as the decoded sentence set defined at the time of decoding, that is to say, the coded sentence set represented by the encoding rules generated by encoding the other pronunciation method mentioned above in this paragraph about this language in this coding rule The value of the spliced information of any sentence pronunciation can be decoded by the decoding rules of the above-mentioned pronunciation mode in this paragraph about this language in this decoding rule; when the coded sentence set and the decoded sentence set are partly the same, it is also similar to the above of. In the above 3 paragraphs exceptparagraph 2, when encoding is involved, the encoding statement set is not necessarily defined by the encoding ensemble, and in the above 3 paragraphs except the second paragraph, when decoding is involved, the decoding statement set is not necessarily defined by using the decoding ensemble.

关于本发明文件所说的解码规则与编码规则对应的定义,对于某语种,至少有某一语句,如用某编码规则中有关此语种的编码细则来编码生成的表示此语句发音的拼合信息的值,可被某解码规则中有关此语种的解码细则来解码播放语句发音,那此解码规则中有关此语种的解码细则就是与此编码规则中有关此语种的编码细则对应的,而且可说成此解码规则是与此编码规则有关此语种对应的解码规则,此解码规则中有关此语种的解码细则定义的此语种的解码语句集与此编码规则中有关此语种的编码细则定义的此语种的编码语句集是完全相同或部分相同的,如果此解码规则中有关此语种的解码细则定义的此语种的解码语句集与此编码规则中有关此语种的编码细则定义的此语种的编码语句集是部分相同的,那一般是此解码规则中有关此语种的解码细则可能是专门用来解码用另外的编码规则(而不是此编码规则)中有关此语种的编码细则来编码生成的拼合信息的值。(本段说的编码规则可以是只适用于一个语种或适用于多个语种的,本段说的解码规则可以是只适用于一个语种或适用于多个语种的,以上两句提及的两个‘多个’不一定是相同数目)Regarding the definition corresponding to the decoding rule and the encoding rule mentioned in the document of the present invention, for a certain language, there is at least a certain sentence, such as using the coding rules related to this language in a certain coding rule to encode the combined information generated to represent the pronunciation of this sentence value, the pronunciation of the playback sentence can be decoded by the decoding rules related to this language in a certain decoding rule, then the decoding rules related to this language in this decoding rule correspond to the encoding rules related to this language in this encoding rule, and can be said to be This decoding rule is a decoding rule corresponding to this language related to this encoding rule. The decoding statement set of this language defined in the decoding rules of this language in this decoding rule is the same as the language of this language defined in the encoding rules of this encoding rule. The encoding statement set is completely or partially identical, if the decoding statement set of this language defined in the decoding rules for this language in this decoding rule is the same as the encoding statement set for this language defined in the encoding rules for this language in this encoding rule Part of the same, that is generally the decoding rules about this language in this decoding rule may be specially used to decode the value of the splicing information generated by encoding the encoding rules about this language in another encoding rule (instead of this encoding rule) . (The encoding rules mentioned in this paragraph can be applicable only to one language or to multiple languages, and the decoding rules mentioned in this paragraph can be applicable to only one language or multiple languages. The two mentioned in the above two sentences 'multiple' are not necessarily the same number)

一个解码规则中有关某语种的TTS方式的解码细则既可以与一个编码规则中有关此语种的TTS方式的编码细则对应,也可以与一个编码规则中有关此语种的直播方式(或混合方式)的编码细则对应;一个解码规则中有关某语种的直播方式的解码细则既可以与一个编码规则中有关此语种的直播方式的编码细则对应,也可以与一个编码规则中有关此语种的TTS方式(或混合方式)的编码细则对应;一个解码规则中有关某语种的混合方式的解码细则既可以与一个编码规则中有关此语种的混合方式的编码细则对应,也可以与一个编码规则中有关此语种的直播方式(或TTS方式)的编码细则对应。The decoding rules about the TTS mode of a certain language in a decoding rule can correspond to the encoding rules about the TTS mode of this language in an encoding rule, and can also correspond to the live mode (or mixed mode) about this language in an encoding rule. Corresponding to the encoding rules; the decoding rules about the live broadcast mode of a certain language in a decoding rule can correspond to the encoding rules about the live broadcast mode of this language in an encoding rule, and can also correspond to the TTS mode (or TTS) about this language in an encoding rule mixed mode) corresponding to the encoding rules; the decoding rules about the mixed mode of a certain language in a decoding rule can correspond to the encoding rules about the mixed mode of this language in an encoding rule, and can also correspond to the encoding rules about this language in an encoding rule Corresponds to the encoding rules of the live broadcast mode (or TTS mode).

本发明文件各处提及的只适用于某语种的编码规则,如没指明是直播方式或TTS方式或混合方式,那就是既可以是直播方式的也可以是TTS方式的也可以是混合方式的;本发明文件各处提及的适用于多个语种的编码规则中有关某语种的编码细则,如没指明是直播方式或TTS方式或混合方式,那就是既可以是直播方式的也可以是TTS方式的也可以是混合方式的。本发明文件各处提及的只适用于某语种的解码规则,如没指明是直播方式或TTS方式或混合方式,那就是既可以是直播方式的也可以是TTS方式的也可以是混合方式的;本发明文件各处提及的适用于多个语种的解码规则中有关某语种的解码细则,如没指明是直播方式或TTS方式或混合方式,那就是既可以是直播方式的也可以是TTS方式的也可以是混合方式的。The coding rules mentioned everywhere in the document of the present invention are only applicable to a certain language. If it is not specified as a live broadcast mode or a TTS mode or a mixed mode, then it can be either a live mode or a TTS mode or a mixed mode. ; The relevant coding rules of a certain language in the coding rules applicable to multiple languages mentioned in the documents of the present invention, if it is not specified as a live broadcast mode or a TTS mode or a mixed mode, then it can be either a live broadcast mode or a TTS The method can also be a mixed method. The decoding rules mentioned everywhere in the document of the present invention are only applicable to a certain language. If it is not specified as a live broadcast mode or a TTS mode or a mixed mode, then it can be either a live broadcast mode or a TTS mode or a mixed mode. ; The decoding rules applicable to a certain language in the decoding rules applicable to multiple languages mentioned in the documents of the present invention, if it is not specified as a live broadcast mode or a TTS mode or a mixed mode, then it can be either a live broadcast mode or a TTS The method can also be a mixed method.

本说明书所说的编码规则,可以是只适用于一个语种的或适用于多个语种的,本说明书所说的解码规则,也可以是只适用于一个语种的或适用于多个语种的,编码规则与解码规则的对应情况可以有:1:编码规则只适用于一个语种,解码规则也只适用于此一个语种,此解码规则中有关此语种的解码细则与此编码规则中有关此语种的编码细则是对应的(这种情况也可说成此解码规则与此编码规则是对应的),在本说明书以上有关编码规则和解码规则的阐述中,就是这样的,这种情况也可说成解码规则与编码规则有关此一个语种是对应的;2:编码规则只适用于一个语种,解码规则适用于包括此一个语种在内的多个语种,这种情况时解码规则与编码规则有关此语种是对应的,3:解码规则只适用于一个语种,编码规则适用于包括此一个语种在内的多个语种,这种情况时解码规则与编码规则有关此语种是对应的,4:编码规则适用于多个语种,解码规则适用于多个语种(这句和上一句提及的两个‘多个’不一定是相同数目),解码规则与编码规则有关一个或超过一个的语种是对应的,这种情况可推出解码规则与编码规则有关某一个语种是对应的;本段内容是对权利要求13,14,15,16中有关解码规则与编码规则对应的内容的解释,本发明文件其它地方提及的有关解码规则与编码规则对应的内容,也应按本段前面述的几种方式来理解。The encoding rules mentioned in this manual can be applicable to only one language or applicable to multiple languages, and the decoding rules mentioned in this manual can also be applicable to only one language or applicable to multiple languages. The correspondence between the rules and the decoding rules can be as follows: 1: The encoding rules are only applicable to one language, and the decoding rules are only applicable to this one language. The details are corresponding (this situation can also be said that the decoding rule corresponds to the encoding rule), which is the case in the above description of the encoding rule and decoding rule in this manual. The rules are related to the encoding rules, which corresponds to one language; 2: The encoding rules are only applicable to one language, and the decoding rules are applicable to multiple languages including this one language. In this case, the decoding rules are related to the encoding rules. This language is Correspondingly, 3: The decoding rule is only applicable to one language, and the encoding rule is applicable to multiple languages including this one language. In this case, the decoding rule is related to the encoding rule and this language is corresponding. 4: The encoding rule is applicable to For multiple languages, the decoding rules are applicable to multiple languages (this sentence is not necessarily the same number as the two "multiple" mentioned in the previous sentence), and the decoding rules correspond to one or more than one language related to the coding rules. In this case, it can be deduced that the decoding rule corresponds to a certain language related to the encoding rule; the content of this paragraph is an explanation of the content corresponding to the decoding rule and the encoding rule inclaims 13, 14, 15, and 16, and other places in the document of the present invention provide The content corresponding to the decoding rules and encoding rules should also be understood in the ways mentioned above in this paragraph.

本说明书中述的各编码规则举例和各解码规则举例都是以只适用于一个语种为例的,对于要适用于多个语种的编码规则,显然其对各语种的编码原理是可参照本说明书阐述的直播方式的以只适用于英语为例的编码规则或者直播方式的以只适用于汉语为例的编码规则或者TTS方式的以只适用于英语为例的编码规则或者TTS方式的以只适用于汉语为例的编码规则的,对于要适用于多个语种的解码规则,显然其对各语种的解码原理是可参照本说明书阐述的直播方式的以只适用于英语为例的解码规则或者直播方式的以只适用于汉语为例的解码规则或者TTS方式的以只适用于英语为例的解码规则或者TTS方式的以只适用于汉语为例的解码规则的;对于要适用于多个语种的编码规则,可简单的把分别只适用于各自语种的编码规则(每个编码规则的形式就如本说明书阐述的直播方式的以只适用于英语为例的编码规则,这些解码规则可能是全是直播方式的,或全是TTS方式的,或既有直播方式也有TTS方式的,或全是混合方式的等情况)并排合起来(也就是把这些编码规则中的编码细则并排合起来)即成一种适用于多个语种的编码规则,也可再编一种新的适用于多个语种的编码规则(此编码规则中所有有关各自语种的编码细则可能是全是直播方式的,或全是TTS方式的,或既有直播方式也有TTS方式的,或全是混合方式的等情况);对于要适用于多个语种的解码规则,可简单的把分别只适用于各自语种的解码规则(每个解码规则的形式就如本说明书阐述的直播方式的以只适用于英语为例的解码规则,这些解码规则可能是全是直播方式的,或全是TTS方式的,或既有直播方式也有TTS方式的,或全是混合方式的等情况)并排合起来(也就是把这些解码规则中的解码细则并排合起来)即成一种适用于多个语种的解码规则,也可再编一种新的适用于多个语种的解码规则(此解码规则中所有有关各自语种的解码细则可能是全是直播方式的,或全是TTS方式的,或既有直播方式也有TTS方式的,或全是混合方式的等情况);比如最简单的可把直播方式的以只适用于英语为例的编码规则和直播方式的以只适用于汉语为例的编码规则简单并排合起来就成为一种可适用于英语和汉语的编码规则(且此编码规则中有关英语的编码细则和有关汉语的编码细则都是直播方式的)。The examples of coding rules and decoding rules described in this manual are all examples that are only applicable to one language. For coding rules that are applicable to multiple languages, it is obvious that the coding principles for each language can be referred to in this manual. The coding rules of the live broadcast method that are only applicable to English as an example or the coding rules of the live broadcast method that are only applicable to Chinese as an example or the coding rules that are only applicable to English in the TTS method or the TTS method that is only applicable to English For the encoding rules for Chinese as an example, for the decoding rules applicable to multiple languages, it is obvious that the decoding principles for each language can refer to the live broadcast method explained in this manual. The decoding rules or live broadcasts that are only applicable to English The decoding rules that are only applicable to Chinese in the mode or the decoding rules that are only applicable to English in the TTS mode or the decoding rules that are only applicable to Chinese in the TTS mode; for multiple languages Coding rules can be simply the coding rules that are only applicable to their respective languages (the form of each coding rule is just like the coding rules that are only applicable to English in the live broadcast mode described in this manual, these decoding rules may be all Live mode, or all TTS mode, or both live mode and TTS mode, or all mixed modes) are combined side by side (that is, the coding rules in these coding rules are combined side by side) to form one One encoding rule applicable to multiple languages, or a new encoding rule applicable to multiple languages (all the encoding rules for the respective languages in this encoding rule may be all live broadcast, or all TTS mode, or both live mode and TTS mode, or all mixed modes); for the decoding rules to be applicable to multiple languages, you can simply set the decoding rules that are only applicable to their respective languages (each The form of the decoding rules is just like the decoding rules for the live broadcast method described in this manual, which only applies to English as an example. These decoding rules may be all live broadcast, or all TTS, or both live broadcast and TTS. or all of them are mixed) and combined side by side (that is, the decoding rules in these decoding rules are combined side by side) to form a decoding rule applicable to multiple languages, and a new applicable decoding rule can also be compiled. Decoding rules for multiple languages (in this decoding rule, all the decoding rules for their respective languages may be all live, or all TTS, or both live and TTS, or all mixed etc.); for example, the simplest coding rules for live broadcasting that are only applicable to English and the coding rules for live broadcasting that are only applicable to Chinese as an example can be simply combined side by side to become a method that is applicable to both English and Chinese The encoding rules of Chinese (and the encoding rules about English and the encoding rules about Chinese in the encoding rules are all in the live mode).

按上一段所述,对于要适用于多个语种的编码规则,可简单的把分别只适用于各自(一个)语种的编码规则(每个编码规则的形式就如本说明书阐述的直播方式的以只适用于英语为例的编码规则)并排合起来(也就是把这些编码规则中的编码细则并排合起来)即成一种适用于多个语种的编码规则,把这些分别只适用于一个语种的编码规则中的编码细则并排合起来的时候,可把不同编码规则的编码细则中有些编码细则作成共用的,这样一来,编码规则中有关某一语种的编码细则与编码规则中有关另一语种的编码细则中有些编码细则可能是相同的;按上一段所述,对于要适用于多个语种的解码规则,可简单的把分别只适用于各自(一个)语种的解码规则(每个解码规则的形式就如本说明书阐述的直播方式的以只适用于英语为例的解码规则)并排合起来(也就是把这些解码规则中的解码细则并排合起来)即成一种适用于多个语种的解码规则,把这些只适用于一个语种的解码规则中的解码细则并排合起来的时候,可把不同解码规则的解码细则中有些解码细则作成共用的,这样一来,解码规则中有关某一语种的解码细则与解码规则中有关另一语种的解码细则中有些解码细则可能是相同的。As mentioned in the previous paragraph, for coding rules applicable to multiple languages, the coding rules applicable only to each (one) language can be simply combined (the form of each coding rule is as described in the description of the live broadcast mode in the following description) Coding rules that are only applicable to English as an example) are combined side by side (that is, the coding rules in these coding rules are combined side by side) to form a coding rule that is applicable to multiple languages, and these coding rules that are only applicable to one language When the coding rules in the rules are combined side by side, some of the coding rules in different coding rules can be shared. Some coding rules in the coding rules may be the same; as mentioned in the previous paragraph, for the decoding rules applicable to multiple languages, the decoding rules (each decoding rule’s The form is just like the decoding rules that are only applicable to English in the live broadcast method described in this manual) and combined side by side (that is, the decoding rules in these decoding rules are combined side by side) to form a decoding rule applicable to multiple languages , when the decoding rules in the decoding rules that are only applicable to one language are combined side by side, some of the decoding rules in different decoding rules can be shared. In this way, the decoding rules related to a certain language in the decoding rules Some of the decoding rules may be the same as the decoding rules for another language in the decoding rules.

对于一种适用于多个语种的编码规则,此编码规则中所有有关各自语种的编码细则可能是全是直播方式的,或全是TTS方式的,或既有直播方式也有TTS方式的,或全是混合方式的等情况;对于一种适用于多个语种的解码规则,此解码规则中所有有关各自语种的解码细则可能是全是直播方式的,或全是TTS方式的,或既有直播方式也有TTS方式的,或全是混合方式的等情况。For a coding rule applicable to multiple languages, all the coding rules for the respective languages in this coding rule may be all in the live mode, or all in the TTS mode, or have both the live mode and the TTS mode, or all It is a mixed mode; for a decoding rule applicable to multiple languages, all the decoding rules related to the respective languages in this decoding rule may be all in the live broadcast mode, or all in the TTS mode, or the existing live broadcast mode There are also TTS methods, or all mixed methods.

本发明文件各处所说的编码规则中所有有关各自语种的编码细则,是指此编码规则中有关各自语种的编码细则的集合(此集合里的每个元素是此编码规则中有关一语种的编码细则),比如一个适用于3个语种(语种1,语种1,语种3)的编码规则,那此编码规则中所有有关各自语种的编码细则就包括此编码规则中有关语种1的编码细则和此编码规则中有关语种2的编码细则和此编码规则中有关语种3的编码细则(不同语种涉及的编码细则中可能有部分相同的编码细则,比如此编码规则中有关语种1的编码细则中与此编码规则中有关语种2的编码细则中可能有部分相同的编码细则),然后如有说此编码规则中所有有关各自语种的编码细则全是直播方式的,就是指此编码规则中有关语种1的编码细则是直播方式的且此编码规则中有关语种2的编码细则是直播方式的且此编码规则中有关语种3的编码细则是直播方式的;本发明文件各处所说的解码规则中所有有关各自语种的解码细则,是指此解码规则中有关各自语种的解码细则的集合(此集合里的每个元素是此解码规则中有关一语种的解码细则),比如一个适用于3个语种(语种1,语种1,语种3)的解码规则,那此解码规则中所有有关各自语种的解码细则就包括此解码规则中有关语种1的解码细则和此解码规则中有关语种2的解码细则和此解码规则中有关语种3的解码细则(不同语种涉及的解码细则中可能有部分相同的解码细则,比如此解码规则中有关语种1的解码细则中与此解码规则中有关语种2的解码细则中可能有部分相同的解码细则),然后如有说此解码规则中所有有关各自语种的解码细则全是TTS方式的,就是指此解码规则中有关语种1的解码细则是TTS方式的且此解码规则中有关语种2的解码细则是TTS方式的且此解码规则中有关语种3的解码细则是TTS方式的。All relevant coding rules of respective languages in the coding rules mentioned in the documents of the present invention refer to the collection of relevant coding rules of respective languages in this coding rules (each element in this collection is the code about a language in this coding rules Rules), such as a coding rule applicable to 3 languages (language 1, language 1, language 3), then all the coding rules about their respective languages in this coding rule include the coding rules about language 1 in this coding rule and this The coding rules related to language 2 in the coding rules and the coding rules related to language 3 in this coding rule (the coding rules involved in different languages may have some of the same coding rules, for example, the coding rules related to language 1 in this coding rule are the same as this The coding rules for language 2 in the coding rules may have some of the same coding rules), and if it is said that all the coding rules for the respective languages in this coding rule are live broadcast, it means that the coding rules for language 1 in this coding rule The encoding details are in the live mode and the encoding details about language 2 in the encoding rules are in the live mode and the encoding details about the language 3 in the encoding rules are in the live mode; The decoding rules of the language refer to the set of the decoding rules of the respective languages in the decoding rules (each element in this set is the decoding rules of a language in the decoding rules), for example, one is applicable to 3 languages (language 1 , language 1, language 3) decoding rules, then all the decoding rules related to the respective languages in this decoding rule include the decoding rules related to language 1 in this decoding rule and the decoding rules related to language 2 in this decoding rule and this decoding rule The decoding rules related to language 3 (the decoding rules related to different languages may have some of the same decoding rules, such as the decoding rules related to language 1 in this decoding rule and the decoding rules related to language 2 in this decoding rule may have some The same decoding rules), and then if it is said that all the decoding rules about the respective languages in this decoding rule are all in TTS mode, it means that the decoding rules about language 1 in this decoding rule are in TTS mode and the relevant language in this decoding rule The decoding details of 2 are in TTS mode and the decoding details of language 3 in this decoding rule are in TTS mode.

如有只适用于语种1的编码规则只包括两个编码细则(细则1和细则2),如有只适用于语种2的编码规则只包括两个编码细则(细则3和细则4),将这两个编码规则中的编码细则并排合起来成为适用于语种1和语种2的编码规则,此编码规则中就只包括有这4个编码细则(细则1和细则2和细则3和细则4),然后此编码规则中有关语种1的编码细则就是指细则1和细则2,此编码规则中有关语种2的编码细则就是指细则3和细则4;假设细则1和细则3相同,则可把细则1和细则3合并,这样此编码规则中就只包括有3个编码细则(细则1和细则2和细则4),然后此编码规则中有关语种1的编码细则就是指细则1和细则2,此编码规则中有关语种2的编码细则就是指细则1和细则4。If there is a coding rule that is only applicable tolanguage 1 and only includes two coding rules (rule 1 and rule 2), if there is a coding rule that is only applicable tolanguage 2 and only includes two coding rules (rule 3 and rule 4), these The coding rules in the two coding rules are combined side by side to become the coding rules applicable tolanguage 1 andlanguage 2. This coding rule only includes these 4 coding rules (rules 1 and 2 andrules 3 and 4). Then the coding rules aboutlanguage 1 in this coding rule refer to rule 1 andrule 2, and the coding rules aboutlanguage 2 in this coding rule refer to rule 3 andrule 4; assuming thatrule 1 andrule 3 are the same,rule 1 can be Merge withrule 3, so that this coding rule only includes 3 coding rules (rule 1,rule 2 and rule 4), and then the coding rules aboutlanguage 1 in this coding rule refer to rule 1 andrule 2, this code The coding rules forlanguage 2 in the rules refer to rule 1 andrule 4.

与上段类似,如有只适用于语种1的解码规则只包括两个解码细则(细则1和细则2),如有只适用于语种2的解码规则只包括两个解码细则(细则3和细则4),将这两个解码规则中的解码细则并排合起来成为适用于语种1和语种2的解码规则,此解码规则中就只包括有这4个解码细则(细则1和细则2和细则3和细则4),然后此解码规则中有关语种1的解码细则就是指细则1和细则2,此解码规则中有关语种2的解码细则就是指细则3和细则4;假设细则1和细则3相同,则可把细则1和细则3合并,这样此解码规则中就只包括有3个解码细则(细则1和细则2和细则4),然后此解码规则中有关语种1的解码细则就是指细则1和细则2,此解码规则中有关语种2的解码细则就是指细则1和细则4。Similar to the previous paragraph, if there is a decoding rule that is only applicable tolanguage 1, it only includes two decoding rules (rule 1 and rule 2), and if there is a decoding rule that is only applicable tolanguage 2, it only includes two decoding rules (rule 3 and rule 4). ), the decoding rules in these two decoding rules are combined side by side to become the decoding rules applicable tolanguage 1 andlanguage 2, and this decoding rule only includes these 4 decoding rules (rule 1 andrule 2 andrule 3 and Rule 4), then the decoding rules forlanguage 1 in this decoding rule refer to rule 1 andrule 2, and the decoding rules forlanguage 2 in this decoding rule refer to rule 3 andrule 4; assuming thatrule 1 andrule 3 are the same, then Rules 1 and 3 can be combined, so that the decoding rule only includes 3 decoding rules (rule 1,rule 2 and rule 4), and then the decoding rules forlanguage 1 in this decoding rule refer to rule 1 andrule 2. The decoding rules forlanguage 2 in this decoding rule refer to rule 1 andrule 4.

本说明书对于一个只适用于一个语种的编码规则,所说的此编码规则中的编码细则是指此编码规则中的全部编码细则;本说明书对于一个只适用于一个语种的解码规则,所说的此解码规则中的解码细则是指此解码规则中的全部解码细则。For a coding rule that is only applicable to one language in this manual, the coding rules in this coding rule refer to all the coding rules in this coding rule; for a decoding rule that is only applicable to one language in this manual, the The decoding rules in this decoding rule refer to all the decoding rules in this decoding rule.

本说明书各处所说的‘这些编码规则中的编码细则并排合起来’或类似说法(这些编码规则中的各编码规则都是只适用于一个语种的),是指比如这些编码规则含编码规则1和编码规则2,那‘这些编码规则中的编码细则并排合起来’的意思是编码规则1中的编码细则与编码规则2中的编码细则并排合起来;本说明书各处所说的‘这些解码规则中的解码细则并排合起来’或类似说法(这些解码规则中的各解码规则都是只适用于一个语种的),是指比如这些解码规则含解码规则1和解码规则2,那‘这些解码规则中的解码细则并排合起来’的意思是解码规则1中的解码细则与解码规则2中的解码细则并排合起来。"The coding rules in these coding rules are combined side by side" or similar expressions (each coding rule in these coding rules is only applicable to one language) mentioned in this manual refers to, for example, these coding rules includecoding rule 1 andCoding Rule 2, then 'these coding rules in these coding rules are combined side by side' means that the coding rules incoding rule 1 and the coding rules incoding rule 2 are combined side by side; The decoding rules in the "combined side by side" or similar expressions (each decoding rule in these decoding rules is only applicable to one language), refers to, for example, these decoding rules includedecoding rule 1 anddecoding rule 2, then "these decoding rules "Combine the decoding rules side by side" means that the decoding rules indecoding rule 1 and the decoding rules indecoding rule 2 are combined side by side.

显然,对于只适用于一个语种的编码规则,其编码规则中有关此语种的编码细则就是指编码规则中的全部编码细则;对于只适用于一个语种的解码规则,其解码规则中有关此语种的解码细则就是指解码规则中的全部解码细则;对于适用于多个语种的编码规则,其编码规则中有关这多个语种中的某一语种的编码细则就是指编码规则中与用于对此语种语句编码有关的全部编码细则;对于适用于多个语种的解码规则,其解码规则中有关这多个语种中的某一语种的解码细则就是指解码规则中与用于对此语种的拼合信息的值进行解码有关的全部解码细则。Obviously, for a coding rule that is only applicable to one language, the coding rules related to this language in the coding rule refer to all the coding rules in the coding rule; The decoding rules refer to all the decoding rules in the decoding rules; for the coding rules applicable to multiple languages, the coding rules related to a certain language among the multiple languages in the coding rules refer to the coding rules used in the coding rules for this language. All coding rules related to sentence coding; for the decoding rules applicable to multiple languages, the decoding rules in the decoding rules related to a certain language in the multiple languages refer to the combination of the decoding rules and the combined information used for this language. All decoding details related to decoding the value.

本发明文件各处所说的适用于数个语种的编码规则(或解码规则),是指对这数个语种中每个语种的语句集(各语句集范围在此编码规则里定义)里的任一语句都能编码的编码规则。本发明文件各处所说的适用于数个语种的解码规则,是指对这数个语种中的每个语种,表示此语种的语句集(各语句集范围在此解码规则里定义)里的任一语句发音的拼合信息的值都能被此解码规则解码的解码规则。The coding rules (or decoding rules) applicable to several languages mentioned in the documents of the present invention refer to any sentence set (each sentence set scope is defined in this coding rule) in each of these several languages. An encoding rule that can encode a sentence. The decoding rules that are applicable to several languages mentioned in the documents of the present invention refer to each language in these several languages, represent any sentence set in this language (the scope of each sentence set is defined in this decoding rule) A decoding rule that the value of the combined information of a sentence pronunciation can be decoded by the decoding rule.

一个编码规则下包括一些编码细则,本说明书说一个只适用于某一语种的编码规则里含有压缩编码处理,实际上也就是说其包含的那些编码细则含有压缩编码处理,也可等价说成此编码规则中有关此语种的编码细则是含有压缩编码处理的;而对于一个可适用于多个语种的编码规则,对于其中一个语种,可说此编码规则中有关此语种的编码细则是含有压缩编码处理的(此编码规则中有关别的语种的编码细则可能有也可能没含有压缩编码处理);同理,一个解码规则下包括一些解码细则,本说明书说一个只适用于某一语种的解码规则里含有压缩解码处理,实际上也就是说其包含的那些解码细则含有压缩解码处理,也可等价说成此解码规则中有关此语种的解码细则是含有压缩解码处理的;而对于一个可适用于多个语种的解码规则,对于其中一个语种,可说此解码规则中有关此语种的解码细则是含有压缩解码处理的(此解码规则中有关别的语种的解码细则可能有也可能没含有压缩解码处理)。An encoding rule includes some encoding rules. This specification says that an encoding rule applicable only to a certain language contains compression encoding processing. In fact, it means that the encoding rules it contains include compression encoding processing. It can also be equivalently said The encoding details about this language in this encoding rule contain compression encoding processing; and for an encoding rule that can be applied to multiple languages, for one of the languages, it can be said that the encoding details about this language in this encoding rule contain compression encoding processing (the encoding rules for other languages in this encoding rule may or may not include compression encoding processing); similarly, a decoding rule includes some decoding rules, and this manual says that a decoding that is only applicable to a certain language The rules contain compression decoding processing, in fact, those decoding rules it contains contain compression decoding processing, and it can also be equivalently said that the decoding rules related to this language in this decoding rule contain compression decoding processing; Decoding rules applicable to multiple languages. For one of the languages, it can be said that the decoding rules for this language in this decoding rule include compression decoding processing (the decoding rules for other languages in this decoding rule may or may not contain compression decoding processing).

除非提及时有特别指明有他义,以上19段提及的单个的语种(比如提及的‘某一语种’)默认可以是任一非复合语种或任一复合语种;除非提及时有特别指明有他义,以上19段提及的适用于多个语种的编码规则中涉及的多个语种默认可以全是非复合语种或全是复合语种或既包含非复合语种也包含复合语种;除非提及时有特别指明有他义,以上19段提及的适用于多个语种的解码规则中涉及的多个语种默认可以全是非复合语种或全是复合语种或既包含非复合语种也包含复合语种。Unless otherwise specified when mentioned, the single language mentioned in the above 19 paragraphs (such as "a certain language" mentioned) can be any non-compound language or any composite language by default; unless otherwise specified when mentioned There are other meanings, the multiple languages involved in the encoding rules applicable to multiple languages mentioned in paragraph 19 above can be all non-composite languages or all compound languages or include both non-composite languages and compound languages; unless mentioned In particular, it has other meanings. The multiple languages involved in the decoding rules applicable to multiple languages mentioned in paragraph 19 above can be all non-compound languages or all composite languages, or include both non-compound languages and composite languages.

关于音码的定义及其编解码规则:About the definition of sound code and its encoding and decoding rules:

对于印欧语系:(如描述时无特别指明有其他情况,以下对1类音码或2类音码或其它形式音码的描述及对与1类音码或2类音码或其它形式音码有关的编解码规则描述默认是适用于印欧语系各语种的,而且是以每个编码规则,每个解码规则都是只适用于一个语种为例的,这些编码规则中一个只适用于某一语种的编码规则中的所有编码细则可作为适用于多个语种的编码规则中有关此语种的编码细则,这些解码规则中一个只适用于某一语种的解码规则中的所有解码细则可作为适用于多个语种的解码规则中有关此语种的解码细则;并且权利要求3,12,20或23中提及的编码规则,可以是其有关印欧语系任一语种的编码细则有用1类音码或2类音码或其它形式音码且含有压缩编码处理且是直播方式或TTS方式或混合方式的的只适用于此语种或适用于包括此语种在内的多个语种的各种编码规则,权利要求3,12,20或23中提及的解码规则,可以是其有关印欧语系任一语种的解码细则有用1类音码或2类音码或其它形式音码且含有压缩解码处理且是直播方式或TTS方式或混合方式的的只适用于此语种或适用于包括此语种在内的多个语种的各种解码规则;本段说的‘适用于多个语种’中的各语种的情况可能是全是印欧语系的语种或既有印欧语系的语种也有非印欧语系的语种或既有印欧语系的语种也有复合语种等)For the Indo-European language family: (if there are no other circumstances specified in the description, the following descriptions ofclass 1 phonetic codes orclass 2 phonetic codes or other forms of phonetic codes and the relationship withclass 1 phonetic codes orclass 2 phonetic codes or other forms of phonetic codes Code-related codec rule descriptions are applicable to all Indo-European languages by default, and each encoding rule and each decoding rule is only applicable to one language as an example, and one of these encoding rules is only applicable to a certain All the coding rules in the coding rules for one language can be used as the coding rules for this language in the coding rules applicable to multiple languages. Among these decoding rules, all the decoding rules in the decoding rules that are only applicable to a certain language can be used as applicable The decoding rules related to this language in the decoding rules of multiple languages; and the coding rules mentioned inclaim 3, 12, 20 or 23 can be useful for the coding rules of any language in the Indo-European language family. OrType 2 audio codes or other audio codes that contain compressed coding and are live broadcast or TTS or mixed. Various coding rules that are only applicable to this language or to multiple languages including this language, The decoding rule mentioned inclaim 3, 12, 20 or 23 can be that its decoding rules related to any language of the Indo-European language family haveType 1 phonetic codes orType 2 phonetic codes or other forms of phonetic codes and contain compression decoding processing and It is a live broadcast method or a TTS method or a mixed method that is only applicable to this language or various decoding rules that are applicable to multiple languages including this language; the "applicable to multiple languages" mentioned in this paragraph It may be all Indo-European languages or both Indo-European and non-Indo-European languages or both Indo-European and compound languages etc.)

本说明书前面所说的音码,可被当成音码的,可以是可以表示或转换为发音的元辅音,元音辅音组合,单词,单词组合,词根(包括词基,前缀,后缀)或范句等,可称这些音码为1类音码。The phonetic sign indicating number mentioned earlier in this specification sheet can be regarded as the phonetic sign indicating number, can be the vowel and consonant that can represent or be converted into pronunciation, vowel and consonant combination, word, word combination, root (comprising word base, prefix, suffix) or range Sentence etc., these phonetic codes can be said to beClass 1 phonetic codes.

另还有一种定义音码的方式,是多个音码组合后可转换为发音(这里的发音,也是在语句发音中,多个音码组合后所对应的发音),比如字符组合,字母等,可以当成音码。对于印欧语系语种(以英语为例,其它法语,德语等类似),比如字符组合,是指对语句的分解不是按词或词根的概念,比如wake up这样一个短语,如将其分解成wak和e up这样两段字符组合(其中的空格也当成一个字符),把这两段字符组合分别作为两个音码,对这两个音码分别赋予编号如000011,000101,这两个音码(字符组合)各自单独使用时并不能表示发音,但如组合起来成wake up(ASCII字符格式),再到单词数据库里按单词拼法(‘wake’和‘up’,是ASCII字符格式)去查找wake和up两单词对应的记录项而得到这两个单词的发音信息数据(比如词形音标),就可将组合成的wake up转换为发音,推而广之就可用这些字符组合(音码)来组成语句发音(直接发音方式),在编码时仍使用编码规则来生成拼合信息的值,编码时仍可采用前面述的一些词段编码方式比如频率排序编码方式,在解码时的解码规则是与编码规则对应的;极端的,甚至可以把26个字母作为26个音码,26个字母音码各赋予不同的编号,由字母组成单词的拼法,再按单词拼法在单词数据库里查找此单词对应的记录项而得到此单词的发音信息数据(比如词形音标),由各单词的发音组成语句发音(直接发音方式);在编码时仍使用编码规则来生成拼合信息的值,编码时仍然可以使用词段编码方式比如前面述的频率排序编码方式,在解码时的解码规则是与编码规则对应的。可称这些种经组合后可转换为发音的音码(比如字符组合,字母)为2类音码。(注:即使在编解码时不使用单词音码来编解码,但编解码时仍可建立单词数据库,以记录各常用单词的拼法,词形音标,词形形态等信息,当然这时各单词的频率编号一般就不用记录了)There is also another way of defining phonetic codes, which can be converted into pronunciation after a plurality of phonetic codes are combined (the pronunciation here is also the pronunciation corresponding to the combination of multiple phonetic codes in the sentence pronunciation), such as character combinations, letters, etc. , can be regarded as a phonetic code. For Indo-European languages (English as an example, other French, German, etc.), such as character combinations, it means that the decomposition of a sentence is not based on the concept of words or roots, such as a phrase like wake up, such as decomposing it into wak Combination of two sections of characters such as e up (the space in which is also regarded as a character), these two sections of characters are combined as two phonetic codes respectively, and numbering such as 000011, 000101 is assigned to these two phonetic codes respectively, these two phonetic codes (Character combinations) cannot express pronunciation when used alone, but if they are combined to form wake up (ASCII character format), then go to the word database according to the word spelling ('wake' and 'up', in ASCII character format) Look up the record items corresponding to the two words wake and up to obtain the pronunciation information data (such as word form and phonetic symbols) of these two words, and the wake up that is formed can be converted into pronunciation, and by extension, these character combinations (phonetic symbols) can be used. Code) to form the sentence pronunciation (direct pronunciation method), the encoding rules are still used to generate the value of the combined information during encoding, and some of the above-mentioned word segment encoding methods such as the frequency sorting encoding method can still be used during encoding, and the decoding during decoding The rules are corresponding to the encoding rules; extreme, even 26 letters can be used as 26 phonetic codes, and the 26 alphabetic phonetic codes are assigned different numbers, and the spelling of words is formed by letters, and then listed in the word database according to the spelling of words Look up the record item corresponding to this word in to get the pronunciation information data of this word (such as word form and phonetic symbol), and the sentence pronunciation is composed of the pronunciation of each word (direct pronunciation method); the encoding rule is still used to generate the value of the combined information during encoding , the word segment coding method such as the aforementioned frequency sorting coding method can still be used during encoding, and the decoding rule during decoding corresponds to the encoding rule. Can claim that these kinds can be converted into the phonetic sign indicating number of pronunciation (such as character combination, letter) after being combined and be 2 classes of sound sign indicating numbers. (Note: Even if the word phonetic code is not used to encode and decode, the word database can still be set up during encoding and decoding to record the spelling of each commonly used word, the phonetic symbols of the word shape, the information such as the shape of the word, of course at this time The frequency number of the word generally does not need to be recorded)

如将1类音码用于TTS方式解码时发音,那音码中的元辅音及元音辅音组合可以表示成发音信息数据(比如音标格式数据),而单词,单词组合,词根或范句既可转成发音信息数据(比如音标格式数据)也可转成拼法数据(比如一单词的各字母用ASCII字符来表示),当然需事先在音码的数据库里存储各音码的比如音标或拼法等数据;而且解码时可使用整步TTS发音方式或分步TTS发音方式来播放语句发音,如用整步TTS发音方式,解码时MCU可将解码中得到的各1类音码转成其相应的发音信息数据(比如音标格式数据)或拼法数据,并把这些数据(这里说的‘这些数据’中可能包括发音信息数据,拼法数据中的一种或两种)按它们对应的发音在语句发音中的顺序组合起来成语句的拼法格式数据(也即语句的预备发音数据),再由MCU将语句的拼法格式数据一次传给TTS器件(比如TTS芯片)播出语句发音。Pronunciation when using class 1 phonetic codes for TTS mode decoding, the vowel-consonant and vowel-consonant combination in that phonetic code can be represented as pronunciation information data (such as phonetic symbol format data), and word, word combination, root or model sentence both It can be converted into pronunciation information data (such as phonetic symbol format data) and can also be converted into spelling data (such as each letter of a word is represented by an ASCII character), certainly need to store each phonetic code in advance in the database of phonetic codes such as phonetic symbols or data such as spelling; and when decoding, the full-step TTS pronunciation mode or the step-by-step TTS pronunciation mode can be used to play the sentence pronunciation, such as using the whole-step TTS pronunciation mode, the MCU can convert each class 1 sound code obtained in the decoding into Its corresponding pronunciation information data (such as phonetic symbol format data) or spelling data, and these data (the "these data" mentioned here may include pronunciation information data, one or two types of spelling data) according to their corresponding The order of the pronunciation in the sentence pronunciation is combined into the spelling format data of the sentence (that is, the prepared pronunciation data of the sentence), and then the MCU sends the spelling format data of the sentence to the TTS device (such as a TTS chip) to broadcast the sentence pronounce.

如将2类音码用于TTS方式解码时发音,那字符组合可转成拼法数据(字符格式的),字母可转成拼法数据(也即字符);而且解码时可使用整步TTS发音方式或分步TTS发音方式来播放语句发音,如使用整步TTS发音方式,解码时MCU可将解码中得到的各2类音码转成拼法数据(字符格式的),并可把这些拼法数据按它们对应的发音在语句发音中的顺序组合起来成语句的拼法格式数据(也即语句的预备发音数据),再由MCU将语句的拼法格式数据一次传给TTS器件(比如TTS芯片)播出语句发音。Pronunciation when the 2 class phonetic codes are used for TTS mode decoding, then character combination can be changed into spelling data (of character format), and letter can be changed into spelling data (that is character); And whole step TTS can be used during decoding Pronunciation mode or step-by-step TTS pronunciation mode to play sentence pronunciation, such as using the full-step TTS pronunciation mode, MCU can convert each 2 types of sound codes obtained in decoding into spelling data (in character format) during decoding, and can convert these The spelling data are combined into the spelling format data (that is, the prepared pronunciation data) of the sentence by their corresponding pronunciations in the order of the sentence pronunciation, and then the spelling format data of the sentence is passed to the TTS device once by the MCU (such as TTS chip) broadcast sentence pronunciation.

按本说明书所述,可以由1类音码中一种或多种音码来拼合语句集发音(比如可由单词与单词组合与元辅音一起来拼合语句集发音);而对于2类音码,同样可以由2类音码中的一种或多种音码来拼合语句集发音,比如可单独由字母组合来拼合语句集发音,也可由字母组合与字母来拼合语句集发音;并且可以由1类音码中的一种或多种音码与2类音码中的一种或多种音码共同来拼合语句集发音,比如由2类音码中的字母与1类音码中的范句,采用整句差别编码方式来作编码,例如:(假设解码数据里仍有单词数据库),先对单词段进行编码,选一个与需编码语句相似的范句,先记录此范句在范句库里的顺序号,再记录其在哪个单词的位置比需编码语句删几个字母音码(实际是把要删的单词看成是由几个字母音码组合成的),哪个单词的位置增几个字母音码(实际是把要增的单词看成是由几个字母音码组合成的)即成单词段的编码,至于时态词形选择段和陌生单词段的处理可以参照主要用单词及单词组合来作编解码时的处理。如由1类音码和2类音码来共同拼合语句集发音,那拾音器解码来播放语句发音可以采用整步或分步直接发音方式,另也可采用整步或分步TTS发音方式,如使用整步TTS发音方式,在解码时MCU可把拼合成语句发音的各1类音码转成其相应的发音信息数据(比如音标格式数据)或拼法数据,把各2类音码转成字符格式数据,再把这些数据(这里说的‘这些数据’中可能包括发音信息数据,拼法数据,字符格式数据中的一种或多种)按它们对应的发音在语句发音中的顺序组合起来成语句的拼法格式数据(也即语句的预备发音数据),再由MCU把语句的拼法格式数据一次传给TTS器件播出语句发音。As described in this specification, one or more phonetic codes in the 1 class sound code can be used to piece together the sentence set pronunciation (such as the sentence set pronunciation can be combined with the vowel and consonant by word and word combination); and for the 2 class sound codes, Equally can by one or more phonetic codes in 2 class phonetic sign indicating numbers to put together the sentence collection pronunciation, such as can come together sentence collection pronunciation by letter combination separately, also can put together sentence collection pronunciation by letter combination and letter; And can be by 1 One or more phonetic codes in the class phonetic codes and one or more phonetic codes in the 2 class phonetic codes jointly come together the sentence set pronunciation, such as by the letter in the 2 class phonetic codes and the range in the 1 class phonetic codes Sentence, using the whole sentence differential encoding method to encode, for example: (assuming that there is still a word database in the decoded data), first encode the word segment, choose a model sentence similar to the sentence to be encoded, and first record this model sentence in the model sentence Sequential number in the sentence storehouse, record its position in which word again and delete several letter sound codes (actually the word to be deleted is regarded as being composed of several letter sound codes) than the coded sentence, which word The location increases several letter phonetic codes (actually the word to be added is regarded as being composed of several letter phonetic codes) and becomes the coding of the word segment, as for the processing of the tense word form selection segment and the unfamiliar word segment, you can refer to Mainly use words and word combinations for encoding and decoding processing. If the pronunciation of the sentence set is combined together by the 1-type sound code and the 2-type sound code, the pronunciation of the statement can be played by the pickup decoding and can be directly pronounced in whole steps or in steps, or the TTS pronunciation in whole steps or steps can be used, such as Using the full-step TTS pronunciation method, the MCU can convert eachclass 1 phonetic codes that are spelled into a sentence pronunciation into its corresponding pronunciation information data (such as phonetic symbol format data) or spelling data when decoding, and convert eachclass 2 phonetic codes into character format data, and then combine these data (here, "these data" may include pronunciation information data, spelling data, and character format data) according to the order of their corresponding pronunciation in the sentence pronunciation Put together the spelling format data (that is, the prepared pronunciation data of the sentence) of the sentence, and then send the spelling format data of the sentence to the TTS device once to broadcast the pronunciation of the sentence by the MCU.

对于直播方式,如果此方式(直播方式)的只适用于印欧语系某语种的编码规则有用1类音码或2类音码来编码,那此编码规则中的各编码细则可参照本说明书前面所述的此方式(直播方式)的以只适用于英语为例的编码规则中述及的主要以单词及单词组合音码作编码的比如对音码赋予编号,频率排序等词段编码方式,以及陌生单词处理,时态词形处理等编码细则;如果此方式(直播方式)的只适用于印欧语系某语种的解码规则有用1类音码或2类音码来解码,那此解码规则中的各解码细则可参照本说明书前面所述的此方式(直播方式)的以只适用于英语为例的解码规则中述及的主要以单词及单词组合音码作解码的比如对音码赋予编号,频率排序等词段解码方式,以及陌生单词处理,时态词形处理等解码细则。(本段所说的‘有用1类音码或2类音码’,是指只用1类音码中的一种或多种音码,或者只用2类音码中的一种或多种音码,或者用1类音码中的一种或多种音码和2类音码中的一种或多种音码)For the live broadcast mode, if the encoding rules applicable to a certain language of the Indo-European language family in this mode (live broadcast mode) are coded withclass 1 phonetic codes orclass 2 phonetic codes, then the coding rules in the coding rules can refer to the front of this manual Described this mode (live broadcast mode) with only being applicable to English as example in the coding rule that mentions mainly with word and word combination phonetic sign indicating number to encode such as giving numbering to phonetic sign indicating number, the word segment coding modes such as frequency sorting, And unfamiliar word processing, tense word form processing and other coding rules; if this method (live broadcast method) is only applicable to a certain language of the Indo-European language family, the decoding rules can be decoded withType 1 phonetic code orType 2 phonetic code, then this decoding rule The detailed rules for decoding can refer to the decoding rules that are only applicable to English in the method (live broadcast mode) described earlier in this specification. Word segment decoding methods such as numbering and frequency sorting, as well as decoding rules such as unfamiliar word processing and tense word form processing. (the said ' useful 1 class phonetic code or 2 class phonetic codes ' of this paragraph, refer to only using one or more phonetic codes in 1 class phonetic codes, or only use one or more in 2 class phonetic codes Kind of sound sign indicating number, or with one or more sound sign indicating numbers inclass 1 sound sign indicating number and one or more sound sign indicating numbers inclass 2 sound sign indicating numbers)

对于TTS方式的情况,也如上段所述,只是要把上段所述中的‘直播方式’都换成‘TTS方式’。显然如既对直播方式有效也对TTS方式有效那对混合方式也有效,对于混合方式的情况,显然也如上段所述,只是要把上段所述中的‘直播方式’都换成‘混合方式’。For the situation of the TTS mode, also as described in the previous paragraph, only the 'live broadcast mode' mentioned in the previous paragraph should be replaced by the 'TTS mode'. Obviously, if it is valid for both the live broadcast method and the TTS method, it is also valid for the mixed method. For the mixed method, it is obviously the same as the above paragraph, but the "live method" in the above paragraph should be replaced with "mixed method" '.

1类音码在本发明文件里有阐述,可以对其赋予频率编号,并且对1类音码赋予频率编号本身就是有压缩作用的(但如仅对每个元辅音赋予一个6BIT的编号是没压缩作用的),如有主要使用1类音码中的一种或多种音码来作编解码,然后再结合其它词段编解码方式(比如频率排序方式,整句差别方式等),来进行编解码,其它词段编解码方式一般是有压缩处理的,显然包含这些对音码赋编号和各词段编码方式等编码细则的编码规则是含有压缩编码处理的,包含这些对音码赋编号和各词段解码方式等解码细则的解码规则也是含有压缩解码处理的。但如是用1类音码中的元辅音,每个元辅音(音标)赋予一个6BIT的编号,并且编码规则是把语句发音中所有各元辅音对应的元辅音编号(也就是音标编号)按其元辅音的发音在语句发音中的顺序排组起来,来表示语句发音,这种方式的编码规则及相应的解码规则是没有压缩处理的。Class 1 sound sign indicating number is set forth in the document of the present invention, can assign frequency number to it, and assign frequency number itself toclass 1 sound sign indicating number and just have compression effect (but as only assigning the numbering of a 6BIT to each vowel consonant is no Compression function), if there is one or more phonetic codes in the first class of phonetic codes to be used for encoding and decoding, and then combined with other word segment encoding and decoding methods (such as frequency sorting methods, whole sentence difference methods, etc.), to Encoding and decoding, other word segment encoding and decoding methods generally have compression processing. Obviously, the encoding rules that include these coding rules such as assigning numbering to phonetic codes and encoding methods for each word segment contain compression encoding processing, including these assigning phonetic codes to phonetic codes. The decoding rules of the decoding rules such as numbering and each word segment decoding method also include compression decoding processing. But as using the vowels and consonants in theclass 1 phonetic sign indicating number, each vowel and consonant (phonetic symbol) is given the numbering of a 6BIT, and the encoding rule is that the vowel and consonant numbering (that is, the phonetic symbol numbering) corresponding to all each vowel and consonant in the sentence pronunciation is pressed by its The pronunciation of vowels and consonants is arranged in the order of the sentence pronunciation to represent the sentence pronunciation. The encoding rules and corresponding decoding rules of this method are not compressed.

2类音码,也可以对其赋予频率编号,并且对2类音码赋予频率编号本身就是有压缩作用的(但如对每个字母赋予一个7BIT的ASCII字符作为编号是没压缩作用的),如有主要使用2类音码中的一种或多种音码来作编解码,然后再结合其它词段编解码方式(比如频率排序方式等),来进行编解码,其它词段编解码方式一般是有压缩处理的,显然包含这些对音码赋编号和各词段编码方式等编码细则的编码规则是含有压缩编码处理的,包含这些对音码赋编号和各词段解码方式等解码细则的解码规则也是含有压缩解码处理的。但如是用2类音码中的字母,每个字母用其对应的一个7BIT的ASCII字符来表示,并且编码规则是把语句中的所有各字母的ASCII字符按其字母在语句中的顺序排组起来,来表示语句发音(各字母组成单词后可转换成发音),这种方式的编码规则及相应的解码规则是没有压缩处理的,这种方式也是常规的文本记录方式;这种方式也可采取一些措施来达到压缩的效果,比如可参照本说明书前面述的直播方式的以只适用于英语为例的编码规则里的前后调用处理,如语句后面出现一串字符与语句前面一串字符相同,对前面一串字符可记录其各ASCII字符,对后面这串字符就可不记录其各ASCII字符,而记录此字符串与前面字符串的距离(相隔多少个字符)以及有几个字符相同,如将此种处理纳入编码规则,那此编码规则就是有压缩处理的,相应的解码规则也是有压缩处理的;上段述的编码时对每个元辅音赋予一个6BIT的编号并把语句发音中所有各元辅音对应的元辅音编号按其元辅音的发音在语句发音中的顺序排组起来的方式,也可采取这种压缩处理,相应的编码规则和解码规则也是有压缩处理的。Class 2 sound codes can also be given frequency numbers to it, and the frequency number itself has a compression effect to the 2 class sound codes (but as the ASCII characters of a 7BIT are assigned to each letter, there is no compression effect as the numbering), If one or more phonetic codes in the 2 types of phonetic codes are mainly used for encoding and decoding, and then other word segment encoding and decoding methods (such as frequency sorting methods, etc.) are used to encode and decode, other word segment encoding and decoding methods Generally, there is compression processing. Obviously, the encoding rules that include these encoding rules for assigning numbers to phonetic codes and encoding methods for each word segment contain compression encoding processing, including these decoding rules for assigning numbers to phonetic codes and decoding methods for each word segment. The decoding rules also include compression decoding processing. But if the letters in the 2-type phonetic code are used, each letter is represented by its corresponding ASCII character of 7BIT, and the encoding rule is to arrange the ASCII characters of all letters in the sentence according to the order of the letters in the sentence Get up to represent the pronunciation of the sentence (each letter can be converted into a pronunciation after forming a word), the encoding rules of this method and the corresponding decoding rules are not compressed, and this method is also a conventional text recording method; this method can also be used Take some measures to achieve the effect of compression, for example, you can refer to the before and after call processing in the encoding rules in the live broadcast mode mentioned above in this manual, which only applies to English as an example. , each ASCII character can be recorded for the previous string of characters, and each ASCII character can not be recorded for the latter string of characters, but the distance between this string and the previous string (how many characters are separated) and how many characters are the same, If this kind of processing is incorporated into the coding rules, then this coding rule has compression processing, and the corresponding decoding rules also have compression processing; when encoding in the above paragraph, a 6BIT number is given to each consonant and all the sentences in the pronunciation The number of consonants corresponding to each consonant is arranged in the order of the pronunciation of the consonants in the sentence pronunciation, and this compression process can also be adopted, and the corresponding encoding rules and decoding rules are also compressed.

上面9段所述是适合于英语的,至于印欧语系语种中的法语,德语,俄语等,其语句发音的构成主要特点是与英语相似的,由字母组成单词,由单词组成单词组合和语句,每个单词的发音由元音和辅音组成,单词有词形时态变化,所以可被当成用以拼合语句集发音的音码的,可以是表示发音的元辅音或元音辅音组合这样的1类音码,也可以是转换成发音的词根,单词,单词组合或范句这样的1类音码,也可以是经组合后可转换为发音的字符组合或字母这样的2类音码;由这些音码(1类和2类音码)中的一种或多种音码来拼合语句集发音,这基本上也是按上面9段所述处理。对印欧语系任一语种,如用2类音码中的一种或多种音码(或用2类音码中的一种或多种音码加1类音码中的一种或多种音码)来编解码,那直播方式(或TTS方式或混合方式)的只适用于印欧语系此语种的含有压缩处理的编解码规则也可参照前面述的直播方式(或TTS方式)的以只适用于英语为例的编解码规则,另也可使用别的种类的直播方式(或TTS方式或混合方式)的只适用于印欧语系此语种的含有压缩处理的编解码规则(编解码规则是指编码规则和解码规则)。The above 9 paragraphs are suitable for English. As for French, German, Russian, etc. in the Indo-European languages, the main characteristics of the composition of the pronunciation of the sentences are similar to those of English. Words are composed of letters, word combinations and sentences are composed of words. , the pronunciation of each word is composed of vowels and consonants. Words have word form and tense changes, so they can be regarded as the phonetic codes used to put together the pronunciation of the sentence set.Class 1 phonetic sign indicating number can also be converted into the root of pronunciation, word, word combination or model sentencesuch Class 1 phonetic sign indicating number, also can be converted into such 2 classes of phonetic sign indicating number of character combination or letter of pronunciation after being combined; By one or more phonetic codes in these phonetic codes (class 1 andclass 2 phonetic codes), the sentence set pronunciation is put together, and this also basically processes as described in the above 9 paragraphs. For any language of the Indo-European language family, such as using one or more phonetic codes in the 2 class phonetic codes (or adding one or more phonetic codes in the 1 class phonetic codes with one or more phonetic codes in the 2 class phonetic codes codec) to codec, the codec rules of the live broadcast method (or TTS method or mixed method) that are only applicable to the Indo-European language and contain compression processing can also refer to the above-mentioned live broadcast method (or TTS method) Take the codec rules that are only applicable to English as an example, and other types of live broadcast methods (or TTS methods or mixed methods) that are only applicable to this language of the Indo-European language family include codec rules that contain compression processing (codec rules) Rules refer to encoding rules and decoding rules).

本发明文件各处所称音码,是用以拼合语句集发音的;可被当成印欧语系任一语种的音码的,可以是表示发音的元辅音或元音辅音组合,也可以是可转换成发音的词根,单词,单词组合或范句,也可以是组合起来可转换为发音的字符组合或字母(这些对音码的定义在本说明书前面都有述),除此之外,音码还可以是其它本说明书未提及的形式,比如编码者自定的形式,只要其能用以拼合此语种的语句集发音即可(实际上这些编码者自定的音码形式,也应看作是元辅音,元音辅音组合,词根,单词,范句,字符组合,字母这些音码的等同替换);对印欧语系某一语种,如有用其它形式的音码来拼合语句集发音,那涉及的只适用于此语种的含有压缩编码处理的编码规则以及涉及的只适用于此语种的含有压缩解码处理的解码规则中的各细则仍可参照本说明书前面所述的直播方式(或TTS方式)的以只适用于英语为例的编解码规则中的主要以单词及单词组合音码作编解码的比如对音码赋予编号,频率排序等词段编解码方式,以及陌生单词处理,时态词形编解码处理等细则,或者可按照使用的其它形式的音码,来使用新的只适用于此语种的含有压缩编码处理的编码规则及新的只适用于此语种的含有压缩解码处理的解码规则。本段前面提及的‘如有用其它形式的音码来拼合语句集发音’,其用其它形式的音码来拼合语句集发音的具体含义是指可以由一种或多种其它形式的音码来拼合语句集发音,也可由一种或多种其它形式的音码与元辅音,元音辅音组合,词根,单词,单词组合,范句,字符组合和字母中的一种或多种音码一起来拼合语句集发音。本段所述是可适用于印欧语系的各语种(英语,法语,德语或俄语等)的。本段所说的细则是指编码细则或解码细则。除了述及时有特别注明有确切含义之外,本段以上所述及的编解码规则可以是直播方式或TTS方式或混合方式的。The so-called phonetic code everywhere in the documents of the present invention is used to spell out the pronunciation of sentence sets; what can be regarded as the phonetic code of any language of the Indo-European language family can be the combination of vowels and consonants or vowels and consonants representing pronunciation, and can also be convertible Become the root of pronunciation, word, word combination or model sentence, also can be the combination of characters or letters that can be converted into pronunciation (these definitions of phonetic codes are all stated in front of this specification sheet), in addition, phonetic codes It can also be other forms that are not mentioned in this specification sheet, such as the form determined by the coder, as long as it can be used to put together the sentence set pronunciation of this language (in fact, the phonetic code forms that these coders make themselves, also should see be vowel and consonant, vowel and consonant combination, root, word, model sentence, character combination, the equivalent replacement of these phonetic codes of letters); To a certain language of the Indo-European language family, if there are other forms of phonetic codes to put together the sentence collection pronunciation, For details about the encoding rules that only apply to this language and include compression and encoding processing and the decoding rules that only apply to this language and include compression and decoding processing, you can still refer to the live broadcast method (or TTS) described earlier in this manual. In the encoding and decoding rules that are only applicable to English as an example, words and word combination phonetic codes are mainly used for encoding and decoding, such as assigning numbers to phonetic codes, frequency sorting and other word segment encoding and decoding methods, as well as unfamiliar word processing. Morphological codec processing and other detailed rules, or according to other forms of phonetic codes used, to use the new encoding rules that are only applicable to this language and contain compression and decoding processing that are only applicable to this language decoding rules. Mentioned in front of this paragraph ' if there are other forms of phonetic codes to put together the pronunciation of the sentence collection', its concrete meaning of putting together the pronunciation of the sentence collection with the phonetic codes of other forms refers to the phonetic code that can be formed by one or more other forms To piece together the pronunciation of the sentence set, it can also be one or more sound codes in one or more other forms of phonetic codes and vowels and consonants, vowel and consonant combinations, roots, words, word combinations, model sentences, character combinations and letters Let's put together the pronunciation of the sentence set. The description in this paragraph is applicable to various languages of the Indo-European language family (English, French, German or Russian, etc.). The rules mentioned in this paragraph refer to encoding rules or decoding rules. Except when it is specifically stated that there is an exact meaning, the encoding and decoding rules mentioned above in this paragraph can be in live broadcast mode, TTS mode or mixed mode.

一个解码规则中有关印欧语系任一语种的直播方式的有用1类音码或2类音码或其它形式音码的解码细则,其发音方式显然可以是整步直接发音方式或分步直接发音方式;一个解码规则中有关印欧语系任一语种的TTS方式的有用1类音码或2类音码或其它形式音码的解码细则,其发音方式显然可以是整步TTS发音方式或分步TTS发音方式;一个解码规则中有关印欧语系任一语种的混合方式的有用1类音码或2类音码或其它形式音码的解码细则,其发音形式是用混合发音方式,发音时可能会使用到整步直接发音方式,整步TTS发音方式,分步直接发音方式,分步TTS发音方式等中的多种方式。Useful Class 1 phonetic codes orClass 2 phonetic codes or other forms of phonetic codes in a decoding rule for the live broadcast of any language in the Indo-European language family. The pronunciation method can obviously be the whole-step direct pronunciation method or the step-by-step direct pronunciation Mode; in a decoding rule, there areuseful Class 1 phonetic codes orClass 2 phonetic codes or other decoding rules for the TTS mode of any language in the Indo-European language family. TTS pronunciation mode; in a decoding rule, there areuseful Class 1 phonetic codes orClass 2 phonetic codes or decoding rules for other forms of phonetic codes related to the mixed mode of any language in the Indo-European language family. Will use the whole-step direct pronunciation method, the whole-step TTS pronunciation method, the step-by-step direct pronunciation method, the step-by-step TTS pronunciation method, etc.

对于非印欧语系:(如描述时无特别指明有其他情况,以下对3类音码或4类音码或其它形式音码的描述及对与3类音码或4类音码或其它形式音码有关的编解码规则描述默认是适用于非印欧语系各语种的,而且是以每个编码规则,每个解码规则都是只适用于一个语种为例的,这些编码规则中一个只适用于某一语种的编码规则中的所有编码细则可作为适用于多个语种的编码规则中有关此语种的编码细则,这些解码规则中一个只适用于某一语种的解码规则中的所有解码细则可作为适用于多个语种的解码规则中有关此语种的解码细则;并且权利要求3,12,20或23中提及的编码规则,可以是其有关非印欧语系任一语种的编码细则有用3类音码或4类音码或其它形式音码且含有压缩编码处理且是直播方式或TTS方式或混合方式的的只适用于此语种或适用于包括此语种在内的多个语种的各种编码规则,权利要求3,12,20或23中提及的解码规则,可以是其有关非印欧语系任一语种的解码细则有用3类音码或4类音码或其它形式音码且含有压缩解码处理且是直播方式或TTS方式或混合方式的的只适用于此语种或适用于包括此语种在内的多个语种的各种解码规则,本段说的‘适用于多个语种’中的各语种的情况可能是全是非印欧语系的语种或既有非印欧语系的语种也有印欧语系的语种或既有非印欧语系的语种也有复合语种等)For non-Indo-European languages: (if there are no other circumstances in the description, the following descriptions of 3 types of phonetic codes or 4 types of phonetic codes or other forms of phonetic codes and 3 types of phonetic codes or 4 types of phonetic codes or other forms The description of codec rules related to phonetic codes is applicable to non-Indo-European languages by default, and each encoding rule and each decoding rule is only applicable to one language as an example, and one of these encoding rules is only applicable All the coding rules in the coding rules for a certain language can be used as the coding rules for this language in the coding rules applicable to multiple languages. Among these decoding rules, all the decoding rules in the decoding rules that are only applicable to a certain language can be used as As the decoding rules applicable to this language in the decoding rules applicable to multiple languages; and the coding rules mentioned inclaim 3, 12, 20 or 23 can be useful as the coding rules for any language of the non-Indo-European language family3 Types of audio codes or 4 types of audio codes or other forms of audio codes that contain compressed encoding and are live broadcast or TTS or mixed are only applicable to this language or applicable to multiple languages including this language. Coding rule, the decoding rule mentioned inclaim 3, 12, 20 or 23, can be its relevant non-Indo-European language any kind of decoding rules useful 3 kinds of phonetic codes or 4 kinds of phonetic codes or other forms of phonetic codes and contain Compression and decoding processing and live broadcast or TTS or mixed methods are only applicable to this language or various decoding rules applicable to multiple languages including this language. The "applicable to multiple languages" mentioned in this paragraph It may be that all languages are non-Indo-European languages, or there are both non-Indo-European languages and Indo-European languages, or there are both non-Indo-European languages and compound languages, etc.)

本说明书前面所说的音码,可被当成音码的,可以是可以表示或转换为发音的元辅音,元音辅音组合,字,词,词组合或范句等,可称这些音码为3类音码。The phonetic codes mentioned above in this manual can be regarded as phonetic codes, and can be expressed or converted into vowel and consonant sounds, vowel and consonant combinations, words, words, word combinations or model sentences, etc., and these phonetic codes can be referred to as 3 types of sound codes.

另还有一种定义音码的方式,是多个音码组合后可转换为发音(这里的发音,也是在语句发音中,多个音码组合后所对应的发音),比如偏旁部首组合,偏旁部首等,可以当成音码。对于非印欧语系语种(以汉语为例,其它韩语,日语等类似),比如偏旁部首组合,是指对语句的分解不是按词或词组合的概念,先可以建立一个按偏旁部首查找词的词数据库,里面存储的各词的信息里有发音信息数据(比如词形拼音);然后比如对于一个‘段’(这可看成一个词),可将其左右两部分偏旁部首组合分别定义成为两音码,对这两个音码分别赋予编号如010011,001101,这两个音码(偏旁部首组合)各自单独使用时并不能表示发音,但如组合起来就是一个词的全部偏旁部首信息,再在词数据库里按偏旁部首信息查到此词‘段’的记录项而得到词‘段’的发音信息数据(比如词形拼音),推而广之就可用这些偏旁部首组合(音码)来组成语句发音(直接发音方式);在编码时仍使用编码规则来生成拼合信息的值,编码时仍可采用前面述的一些词段编码方式比如频率排序编码方式,在解码时的解码规则是与编码规则对应的;甚至可以把字可能用的所有偏旁部首分别作为音码(汉字的所有偏旁部首总数大约为5,6百个),这些偏旁部首音码各赋予不同的编号,由偏旁部首音码组成词的偏旁部首信息,再按词的偏旁部首信息在词数据库(词数据库也是做成可按偏旁部首查找词的)里查找此词对应的记录项而得到此词的发音信息数据(比如词形拼音),由各词的发音组成语句发音(直接发音方式);在编码时仍使用编码规则来生成拼合信息的值,编码时仍可采用前面述的一些词段编码方式比如频率排序编码方式,在解码时的解码规则是与编码规则对应的。可称这些种经组合后可转换为发音的音码(比如偏旁部首组合,偏旁部首)为4类音码。(注:即使在编解码时不使用词音码来编解码,但编解码时仍可建立词数据库,以记录各常用词的拼法,词形拼音等信息,当然这时各词的频率编号一般就不用记录了)Also have a kind of mode of definition phonetic sign indicating number in addition, be that a plurality of phonetic sign indicating numbers are combined and can be converted into pronunciation (the pronunciation here is also in the sentence pronunciation, the corresponding pronunciation after a plurality of phonetic sign indicating numbers are combined), such as the combination of radicals, Radicals etc. can be regarded as phonetic codes. For non-Indo-European languages (take Chinese as an example, other Korean, Japanese, etc.), such as radical combination, refers to the concept that the decomposition of sentences is not based on words or word combinations. First, you can create a search by radicals The word database of words, the information of each word stored in it contains pronunciation information data (such as morphological pinyin); then, for example, for a "segment" (this can be regarded as a word), the left and right parts of the radicals can be combined Be defined as two phonetic codes respectively, these two phonetic codes are given numbering respectively as 010011,001101, these two phonetic codes (radical radical combination) can not express pronunciation when using separately separately, but be exactly the whole of a word as combined Radical information, then find the record item of this word 'duan' according to the radical information in the word database to obtain the pronunciation information data (such as word form pinyin) of the word 'duan', and by extension, these radicals can be used The combination of radicals and radicals (sound code) is used to form the pronunciation of the sentence (direct pronunciation mode); the coding rules are still used to generate the value of the combined information during coding, and some of the above-mentioned word segment coding methods such as the frequency sorting coding method can still be used during coding. The decoding rule at the time of decoding corresponds to the encoding rule; even all possible radicals of a word can be used as phonetic codes respectively (the total number of all radicals of Chinese characters is about 5, 6 hundred), these radicals sound Code respectively endows different numbering, is formed the radical information of word by the radical phonetic code, then searches for this in the word database (the word database is also made can look up word by radical) by the radical information of word The record item corresponding to the word is used to obtain the pronunciation information data of the word (such as word shape pinyin), and the pronunciation of each word is used to form a sentence pronunciation (direct pronunciation mode); when encoding, the encoding rule is still used to generate the value of the combined information, when encoding Some word segment encoding methods mentioned above such as frequency sorting encoding methods can still be used, and the decoding rules during decoding correspond to the encoding rules. These kind can be converted into the phonetic sign indicating number of pronunciation after being combined (such as radical combination, radical) be 4 classes of phonetic codes. (Note: Even if the word phonetic code is not used for encoding and decoding, a word database can still be established during encoding and decoding to record the spelling of each commonly used word, word form pinyin and other information. Of course, the frequency number of each word at this time Generally do not need to record)

如将3类音码用于TTS方式解码时发音,那音码中的元辅音,元音辅音组合可以表示成发音信息数据(比如拼音格式数据,可用元辅音或元音辅音组合来组成字的拼音),而字,词,词组合或范句既可转成发音信息数据(比如字可转成拼音,词或词组合可转成词形拼音)也可转成拼法数据(比如大五码格式数据),当然需事先在音码的数据库里存储各音码的比如拼音或大五码等数据;而且可使用整步TTS发音方式或分步TTS发音方式来播放语句发音,如使用整步TTS发音方式,解码时MCU可将解码中得到的各3类音码转成其相应的发音信息数据(比如拼音格式数据)或拼法数据(比如大五码格式数据),并把这些数据(这里说的‘这些数据’中可能包括发音信息数据,拼法数据中的一种或两种)按它们对应的发音在语句发音中的顺序组合起来成语句的拼法格式数据(也即语句的预备发音数据),再由MCU将语句的拼法格式数据一次传给TTS器件(比如TTS芯片)播出语句发音。Pronunciation when 3 kinds of phonetic codes are used for TTS mode decoding, vowel and consonant in that phonetic code, the combination of vowel and consonant can be expressed as pronunciation information data (such as pinyin format data, available vowel and consonant or vowel and consonant combination to form word Pinyin), and words, words, word combinations or model sentences can be converted into pronunciation information data (such as words can be converted into pinyin, words or word combinations can be converted into morphological pinyin) can also be converted into spelling data (such as big five code format data), certainly need to store data such as pinyin or big five codes of each phonetic code in advance in the database of the phonetic code; Step TTS pronunciation method, when decoding, MCU can convert each of the three types of phonetic codes obtained in decoding into its corresponding pronunciation information data (such as pinyin format data) or spelling data (such as big five format data), and convert these data (Said 'the data' here may include pronunciation information data, one or both of the spelling data) are combined into the spelling format data (that is, the sentence) according to the order of their corresponding pronunciation in the sentence pronunciation. prepared pronunciation data), and then the MCU sends the spelling format data of the sentence to the TTS device (such as the TTS chip) to broadcast the pronunciation of the sentence.

如将4类音码用于TTS方式解码时发音,那把偏旁部首音码(或偏旁部首组合音码)组合起来可表示成词,词也可转成发音信息数据(比如词形拼音)或拼法数据(比如大五码格式数据),当然也需事先在音码的数据库里存储各词的比如拼音或大五码等信息;而且解码时可使用整步TTS发音方式或分步TTS发音方式来播放语句发音,如使用整步TTS发音方式,解码时MCU可将解码中得到的各偏旁部首音码(或偏旁部首组合音码)组合成词的偏旁部首信息,再按词的偏旁部首信息在词数据库里查找此词对应的记录项即可得到此词的发音信息(比如词形拼音)或拼法信息(比如大五码信息),即把各词转成其相应的发音信息数据(比如拼音格式数据)或拼法数据(比如大五码格式数据),然后可把这些数据(这里说的‘这些数据’中可能包括发音信息数据,拼法数据中的一种或两种)按它们对应的发音在语句发音中的顺序组合起来成语句的拼法格式数据(也即语句的预备发音数据),再由MCU将语句的拼法格式数据一次传给TTS器件(比如TTS芯片)播出语句发音。Pronunciation when 4 classes of phonetic codes are used for TTS mode decoding, then radical phonetic codes (or radical combination phonetic codes) can be represented as words, and words can also be converted into pronunciation information data (such as word form pinyin) ) or spelling data (such as big five code format data), of course also need to store information such as pinyin or big five codes of each word in the database of the phonetic code in advance; and can use the whole step TTS pronunciation mode or step by step when decoding TTS pronunciation mode plays sentence pronunciation, as using full-step TTS pronunciation mode, MCU can be combined into the radical information of each radical sound sign indicating number (or radical combination sound sign indicating number) obtained in the decoding when decoding, and then Search the record item corresponding to this word in the word database according to the radical information of the word to get the pronunciation information (such as word form pinyin) or spelling information (such as big five code information) of this word, that is, convert each word into Its corresponding pronunciation information data (such as pinyin format data) or spelling data (such as big five code format data), can then these data (saying 'these data' mentioned here may include pronunciation information data, spelling data in One or two) are combined according to the order of their corresponding pronunciation in the pronunciation of the sentence to form the spelling format data of the sentence (that is, the prepared pronunciation data of the sentence), and then the spelling format data of the sentence is transmitted to the TTS once by the MCU A device (such as a TTS chip) broadcasts the pronunciation of the sentence.

按本说明书所述,可以由3类音码中一种或多种音码来拼合语句集发音(比如可由词与词组合一起来拼合语句集发音);而对于4类音码,同样可以由4类音码中的一种或多种音码来拼合语句集发音,比如可单独由偏旁部首组合来拼合语句集发音,也可由偏旁部首组合与偏旁部首一起来拼合语句集发音;并且可以由3类音码中的一种或多种音码与4类音码中的一种或多种音码共同来拼合语句集发音,比如由4类音码中的偏旁部首与3类音码中的范句,采用整句差别编码方式来作编码,例如,(假设解码数据里仍有词数据库),先对词段进行编码,选一个与需编码语句相似的范句,先记录此范句在范句库里的顺序号,再记录其在哪个词的位置比需编码语句删几个偏旁部首音码(实际是把要删的词看成是由几个偏旁部首音码组合成的),哪个词的位置增几个偏旁部首音码(实际是把要增的词看成是由几个偏旁部首音码组合成的)即成词段的编码,至于陌生词段的处理可参照主要用词及词组合来作编解码时的处理。如由3类音码和4类音码来共同拼合语句集发音,那拾音器解码时播放语句发音可以采用整步或分步直接发音方式,另也可采用整步或分步TTS发音方式,如使用整步TTS发音方式,在解码时MCU可把拼合成语句发音的各3类音码转成其相应的发音信息数据(比如拼音格式数据)或拼法数据(比如大五码格式数据),把各4类音码组合成词并把词转成发音信息数据(比如拼音格式数据)或拼法数据(比如大五码格式数据),再把这些数据(这里说的‘这些数据’中可能包括发音信息数据,拼法数据中的一种或两种)按它们对应的发音在语句发音中的顺序组合起来成语句的拼法格式数据(也即语句的预备发音数据),再由MCU把语句的拼法格式数据一次传给TTS器件播出语句发音。According to this description, one or more phonetic codes in 3 classes of phonetic codes can be used to put together the pronunciation of the sentence set (such as the combination of words and words to put together the pronunciation of the sentence set); One or more phonetic codes in the 4 class phonetic sign indicating numbers come together the sentence collection pronunciation, such as can alone by the combination of radicals to put together the pronunciation of the sentence collection, also can come together the pronunciation of the collection of sentences by the combination of radicals and radicals; And one or more phonetic codes in 3 classes of phonetic codes and one or more phonetic codes in 4 classes of phonetic codes can be used to piece together the sentence set pronunciation, such as by radicals and 3 in 4 classes of phonetic codes The model sentence in the class phonetic sign indicating number adopts the whole sentence differential encoding method to encode, for example, (assuming that there is still a word database in the decoding data), first the word segment is encoded, choose a model sentence similar to the sentence to be encoded, first Record the sequence number of this model sentence in the model sentence storehouse, then record its position in which word than need coding sentence to delete several radical phonetic codes (actually the word to be deleted is regarded as being made of several radicals phonetic code is combined), the position of which word increases several radical phonetic codes (actually the word to be increased is regarded as being composed of several radical phonetic codes) and becomes the coding of word segment, as The processing of unfamiliar words can refer to the main words and word combinations for encoding and decoding. If 3 types of sound codes and 4 types of sound codes are used to combine the pronunciation of the sentence set, the playback sentence pronunciation can adopt the full-step or step-by-step direct pronunciation method when the pickup is decoded, or the full-step or step-by-step TTS pronunciation method, such as Using the full-step TTS pronunciation method, the MCU can convert each of the three types of phonetic codes that are spelled into a sentence into its corresponding pronunciation information data (such as pinyin format data) or spelling data (such as big five format data) during decoding. Each 4 types of phonetic codes are combined into words and words are converted into pronunciation information data (such as pinyin format data) or spelling data (such as big five code format data), and then these data (possibly in 'these data' said here) Including the pronunciation information data, one or both of the spelling data) are combined into the spelling format data of the sentence (that is, the prepared pronunciation data of the sentence) according to the order of their corresponding pronunciation in the sentence pronunciation, and then the MCU puts the The spelling format data of the sentence is sent to the TTS device once to broadcast the pronunciation of the sentence.

对于直播方式,如果此方式(直播方式)的只适用于非印欧语系某语种的编码规则有用3类音码或4类音码来编码,那此编码规则中的各编码细则可参照本说明书前面所述的(直播方式)的以只适用于汉语为例的编码规则中的主要以词及词组合音码作编码的比如对音码赋予编号,频率排序等词段编码方式,以及陌生词处理等编码细则;如果此方式(直播方式)的只适用于非印欧语系某语种的解码规则有用3类音码或4类音码来解码,那此解码规则中的各解码细则可参照本说明书前面所述的(直播方式)的以只适用于汉语为例的解码规则中的主要以词及词组合音码作解码的比如对音码赋予编号,频率排序等词段解码方式,以及陌生词处理等解码细则。(本段所说的‘有用3类音码或4类音码’,是指用3类音码中的一种或多种音码,或者用4类音码中的一种或多种音码,或者用3类音码中的一种或多种音码和4类音码中的一种或多种音码)For the live broadcast mode, if the encoding rules applicable to a certain language of the non-Indo-European language family in this mode (live broadcast mode) are coded with 3 types of phonetic codes or 4 types of phonetic codes, then the coding rules in this coding rule can refer to this manual In the above-mentioned (live broadcast mode) encoding rules that are only applicable to Chinese as an example, words and word combination sound codes are mainly used for encoding, such as assigning numbers to sound codes, frequency sorting and other word segment encoding methods, and unfamiliar words Processing and other coding rules; if the decoding rules of this method (live broadcast method) only applicable to a certain language of the non-Indo-Europeanlanguage family use 3 types of phonetic codes or 4 types of phonetic codes to decode, then the decoding rules in this decoding rule can refer to this In the decoding rules mentioned above (live mode) in the manual, which are only applicable to Chinese as an example, the decoding rules mainly use words and word combination sound codes for decoding, such as assigning numbers to sound codes, frequency sorting and other word segment decoding methods, and unfamiliar Decoding rules such as word processing. (the said ' useful 3 class phonetic codes or 4 class phonetic codes ' of this paragraph refers to with one or more phonetic codes in 3 class phonetic codes, or with one or more phonetic codes in 4 class phonetic codes sign indicating number, or with one or more sound codes in 3 class sound codes and one or more sound codes in 4 class sound codes)

对于TTS方式的情况,也如上段所述,只是要把上段所述中的‘直播方式’都换成‘TTS方式’。显然如既对直播方式有效也对TTS方式有效那对混合方式也有效,对于混合方式的情况,也如上段所述,只是要把上段所述中的‘直播方式’都换成‘混合方式’。For the situation of the TTS mode, also as described in the previous paragraph, only the 'live broadcast mode' mentioned in the previous paragraph should be replaced by the 'TTS mode'. Obviously, if it is valid for both the live broadcast method and the TTS method, it is also valid for the hybrid method. For the hybrid method, it is also as described in the previous paragraph, but the "live broadcast method" in the previous paragraph should be replaced with "hybrid method". .

3类音码在本发明文件里有阐述,可以对其赋予频率编号,并且对3类音码赋予频率编号本身就是有压缩作用的(但如对每个元音及辅音赋予一个6BIT的编号是没压缩作用的),如有主要使用3类音码中的一种或多种音码来作编解码,然后再结合其它词段编解码方式(比如频率排序方式,整句差别方式等),来进行编解码,其它词段编解码方式一般是有压缩处理的,显然包含这些对音码赋编号和各词段编码方式等编码细则的编码规则是含有压缩编码处理的,包含这些对音码赋编号和各词段解码方式等解码细则的解码规则也是含有压缩解码处理的。但如是用3类音码中的元辅音,每个元音及辅音赋予一个6BIT的编号,并且编码规则是把这些元辅音编号按其元辅音的发音在语句发音中的顺序排组起来,来表示语句发音,这种方式的编码规则及相应的解码规则是没有压缩处理的。如是用3类音码中的字,每个字用此字对应的一个16BIT的大五码(一个大五码一般占16BIT)来表示,并且编码规则是把语句中的所有各字的大五码按其字在语句中的顺序排组起来,来表示语句发音(字可转成发音),这种方式的编码规则及相应的解码规则是没有压缩处理的,这种方式也是常规的文本记录方式;这种方式也可采取一些措施来达到压缩的效果,比如可参照本说明书前面述的直播方式的以只适用于汉语为例的编码规则里的前后调用处理,如语句后面出现一串字与语句前面一串字相同,对前面一串字可记录其各字的大五码,对后面这串字就可不记录其各字的大五码,而记录此串字与前面那串字的距离(相隔多少个字)以及有几个字相同,如将此种处理纳入编码规则,那编码规则就是有压缩处理的,相应的解码规则也是有压缩处理的;前面述的编码时对每个元辅音赋予一个6BIT的编号并把语句发音中所有各元辅音对应的元辅音编号按其元辅音的发音在语句发音中的顺序排组起来的方式,也可采取这种压缩处理,相应的编码规则和解码规则也是有压缩处理的。3 class sound codes are set forth in the document of the present invention, can be given frequency numbering to it, and the frequency numbering itself just has compression effect to 3 class sound codes (but as assigning a 6BIT numbering to each vowel and consonant is There is no compression effect), if there is one or more phonetic codes in the 3 types of phonetic codes to be used for encoding and decoding, and then in combination with other word segment encoding and decoding methods (such as frequency sorting methods, whole sentence difference methods, etc.), To encode and decode, other word segment encoding and decoding methods generally have compression processing. Obviously, the encoding rules that include these coding rules such as assigning numbers to phonetic codes and encoding methods for each word segment contain compression encoding processing, including these pairing phonetic codes. The decoding rules of the decoding rules such as assigning numbers and decoding methods of each word segment also include compression decoding processing. But as using the vowels and consonants in the 3 class sound codes, each vowel and consonant is given a 6BIT numbering, and the encoding rule is to arrange these vowels and consonants numbers in the order of the pronunciation of the vowels and consonants in the pronunciation of the sentence, to come Indicates the pronunciation of a sentence. The encoding rules and corresponding decoding rules of this method are not compressed. Such as using the words in the 3 class phonetic codes, each word is represented by a big five code (a big five code generally accounts for 16BIT) of a 16BIT corresponding to this word, and the encoding rule is to put all the big five codes of each word in the sentence The codes are arranged according to the order of the words in the sentence to represent the pronunciation of the sentence (words can be converted into pronunciation). The encoding rules and corresponding decoding rules of this method are not compressed. This method is also a conventional text record method; this method can also take some measures to achieve the effect of compression, for example, you can refer to the front and rear call processing in the coding rules of the live broadcast method mentioned above in this manual, which is only applicable to Chinese as an example, such as a string of characters appearing after the sentence It is the same as the string of words in front of the statement. For the string of words in front, the big five codes of each character can be recorded. For the string of words behind, the big five codes of each character can not be recorded, and the string of words and the string of words in front of it can be recorded. The distance (how many words are separated) and how many words are the same, if this processing is included in the coding rules, then the coding rules are compressed, and the corresponding decoding rules are also compressed; The consonants are given a 6BIT number and the consonant numbers corresponding to all the consonants in the sentence pronunciation are arranged according to the order in which the consonants are pronounced in the sentence pronunciation. This compression process can also be adopted, and the corresponding encoding Rules and decoding rules are also compressed.

4类音码,也可以对其赋予频率编号,并且对4类音码赋予频率编号本身就是有压缩作用的(比如对各个偏旁部首赋予一个10BIT的频率编号是有压缩作用的),如有主要使用4类音码中的一种或多种音码来作编解码,然后再结合其它词段编解码方式(比如频率排序方式等),来进行编解码,其它词段编解码方式一般是有压缩处理的,显然包含这些对音码赋编号和各词段编码方式等编码细则的编码规则是含有压缩编码处理的,包含这些对音码赋编号和各词段解码方式等解码细则的解码规则也是含有压缩解码处理的。4 classes of sound codes can also be given frequency numbers to it, and the frequency number itself is given to the 4 classes of phonetic codes to have a compression effect (such as giving a 10BIT frequency number to each radical, which has a compression effect), if any Mainly use one or more of the four types of phonetic codes for encoding and decoding, and then combine other word segment encoding and decoding methods (such as frequency sorting methods, etc.) to perform encoding and decoding. Other word segment encoding and decoding methods are generally If there is compression processing, it is obvious that the encoding rules that include these encoding rules for assigning numbers to phonetic codes and encoding methods for each word segment contain compression encoding processing, including the decoding of these decoding rules for assigning numbers to phonetic codes and decoding methods for each word segment Rules also contain compression and decoding processing.

上面9段所述是适合于汉语的,对于非印欧语系语种中的韩语,其语句发音的构成主要特点是:由元音,辅音符号构成音节(韵尾也算在辅音里),由音节(音节可看成相当于汉语里的字)组成词,由词组成词组合和语句,而且其构成音节的形状的元音,辅音的符号,既可当成发音符号,又可当成偏旁部首,所以可被当成用以拼合语句集发音的音码的,可以是表示发音的元辅音或元音辅音组合这样的3类音码,也可以是可以转换成发音的字,词,词组合或范句这样的3类音码,也可以是经组合后可转换为发音的偏旁部首组合或偏旁部首这样的4类音码;由这些音码(3类和4类音码)中的一种或多种来拼合语句集发音,这基本上也是按上面9段所述处理。非印欧语系语种中的日语,其语句发音的构成主要特点是:由元音,辅音组成假名发音,由假名发音组成字(指汉字)的发音,由假名和字组成词(也可以单独由假名或单独由字组成词),由词组成词组合和语句,日语里的罗马字可对应成假名来对待处理,(日语的各假名和各汉字可分别看成相当于汉语里的字),虽然假名不能分解成偏旁部首但可把假名当成偏旁部首,而且日语里的汉字也可看成是由偏旁部首(这里是类似于汉字的偏旁部首)组成的,这样一来,其语句编解码涉及的音码情况就与汉语,韩语基本相同了,所以可被当成用以拼合语句集发音的音码的,可以是表示发音的元辅音或元音辅音组合这样的3类音码,也可以是可以转换成发音的字(包括汉字和假名),词,词组合或范句这样的3类音码,也可以是经组合后可转换为发音的偏旁部首组合或偏旁部首这样的4类音码;由这些音码(3类和4类音码)中的一种或多种来拼合语句集发音,这基本上也是按上面9段所述处理。对非印欧语系任一语种,如用4类音码中的一种或多种音码(或用4类音码中的一种或多种音码加3类音码中的一种或多种音码)来编解码,相应的直播方式(或TTS方式或混合方式)的只适用于非印欧语系此语种的含有压缩处理的编解码规则也可参照前面的直播方式(或TTS方式)的以只适用于汉语为例的编解码规则的阐述,另也可使用别的种类的直播方式(或TTS方式或混合方式)的只适用于非印欧语系此语种的含有压缩处理的编解码规则(编解码规则是指编码规则和解码规则)。(注:本发明文件各处提及的日语的假名包括平假名和片假名,这是显然的)The above 9 paragraphs are suitable for Chinese. For Korean in non-Indo-European languages, the main characteristics of the pronunciation of its sentences are: vowels and consonant symbols form syllables (rhyme endings are also counted in consonants), and syllables ( A syllable can be regarded as equivalent to a word in Chinese) to form a word, a word combination and a sentence are formed from words, and the vowel and the consonant symbol of the shape of the syllable can be used as pronunciation symbols or radicals, so What can be regarded as the phonetic codes used to put together the pronunciation of the sentence set can be three types of phonetic codes such as vowels and consonants or combinations of vowels and consonants that represent pronunciation, and can also be words, words, word combinations or model sentences that can be converted into pronunciation Such 3 types of phonetic codes can also be converted into radical combination or radicals such 4 types of phonetic codes after being combined; by one of these phonetic codes (3 types and 4 types of phonetic codes) or multiple to piece together the pronunciation of the sentence set, which is also basically processed as described in the above 9 paragraphs. In the non-Indo-European languages of Japanese, the main characteristics of the formation of the pronunciation of sentences are: the pronunciation of kana is formed by vowels and consonants, the pronunciation of characters (referring to Chinese characters) is formed by the pronunciation of kana, and the word is formed by kana and words (also can be composed of Kana or separate words are formed by words), word combinations and sentences are formed by words, and the Roman characters in Japanese can be treated as kana for processing, (each kana and each Chinese character in Japanese can be regarded as equivalent to words in Chinese respectively), Although pseudonyms cannot be decomposed into radicals, pseudonyms can be regarded as radicals, and Chinese characters in Japanese can also be regarded as being composed of radicals (here are radicals similar to Chinese characters), so that other The phonetic codes involved in sentence encoding and decoding are basically the same as those in Chinese and Korean, so they can be used as the phonetic codes used to put together the pronunciation of the sentence set, which can be three types of phonetic codes, such as vowels and consonants or combinations of vowels and consonants that represent pronunciation. , also can be the word (comprising Chinese character and pseudonym) that can be converted into pronunciation, speech, word combination or model sentence such 3 class phonetic codes, also can be the radical combination or radical radical that can be converted into pronunciation after being combined Such 4 classes of phonetic codes; by one or more in these phonetic codes (3 classes and 4 classes of phonetic codes), the sentence set pronunciation is put together, and this basically also is by the described processing of above 9 paragraphs. For any language of the non-Indo-European language family, such as using one or more phonetic codes in the 4 class phonetic codes (or adding one or more of the 3 class phonetic codes with one or more phonetic codes in the 4 class phonetic codes) A variety of audio codes) to encode and decode, the corresponding live broadcast method (or TTS method or mixed method) is only applicable to non-Indo-European languages, and the codec rules containing compression processing can also refer to the previous live broadcast method (or TTS method ) to explain the codec rules that are only applicable to Chinese as an example, and other types of live broadcast methods (or TTS methods or mixed methods) that are only applicable to non-Indo-European languages and include compression processing Decoding rules (codec rules refer to encoding rules and decoding rules). (note: the Japanese pseudonym mentioned in the document of the present invention comprises hiragana and katakana everywhere, and this is obvious)

非印欧语系的其它语种的拼合信息的值的编解码处理,也都可参照汉语,韩语或日语的拼合信息的值的编解码处理。For the codec processing of the value of the combined information in other languages other than the Indo-European language family, reference can also be made to the codec processing of the value of the combined information in Chinese, Korean or Japanese.

本发明文件各处所称音码,是用以拼合语句发音的;可被当成非印欧语系任一语种的音码的,可以是表示发音的元辅音或元音辅音组合,也可以是可转换成发音的字,词,词组合或范句,也可以是组合起来可转换为发音的偏旁部首组合或偏旁部首(这些对音码的定义在本说明书前面都有述),除此之外,音码还可以是其它本说明书未提及的形式,比如编码者自定的形式,只要其能用以拼合此语种的语句集发音即可(实际上这些编码者自定的音码形式,也应看作是元辅音,元音辅音组合,字,词,词组合,范句,偏旁部首组合,偏旁部首这些音码的等同替换),对非印欧语系某一语种,如有用其它形式的音码来拼合语句集发音,那涉及的只适用于此语种的含有压缩编码处理的编码规则以及涉及的只适用于此语种的含有压缩解码处理的解码规则中的细则仍可参照本说明书前面所述的直播方式(或TTS方式)的以只适用于汉语为例的编解码规则中的主要以词及词组合音码作编解码的比如对音码赋予编号,频率排序等词段编解码方式,以及陌生词处理等细则,或者可按照使用的其它形式的音码,来使用新的只适用于此语种的含有压缩编码处理的编码规则及新的只适用于此语种的含有压缩解码处理的解码规则。本段前面提及的‘如有用其它形式的音码来拼合语句集发音’,其用其它形式的音码来拼合语句集发音的具体含义是指可以由一种或多种其它形式的音码来拼合语句集发音,也可由一种或多种其它形式的音码与元辅音,元音辅音组合,字,词,词组合,范句,偏旁部首组合和偏旁部首中的一种或多种音码一起来拼合语句集发音。本段所述是可适用于非印欧语系的各语种(汉语,韩语或日语等)的。本段所说的细则是指编码细则或解码细则。除了在述及时有特别注明有确切含义之外,本段以上所述及的编解码规则可以是直播方式或TTS方式或混合方式的。The so-called phonetic code everywhere in the documents of the present invention is used to spell out the pronunciation of sentences; what can be regarded as the phonetic code of any language of the non-Indo-European language family can be the combination of vowels and consonants or vowels and consonants representing pronunciation, and can also be convertible Pronounced words, words, word combinations or model sentences can also be combined into radical combinations or radicals that can be converted into pronunciation (these definitions of phonetic codes are all stated in front of this specification sheet), in addition In addition, the phonetic code can also be other forms not mentioned in this manual, such as the form determined by the coder, as long as it can be used to put together the pronunciation of the sentence set of this language (in fact, the phonetic code form determined by these coders , should also be regarded as vowel and consonant, vowel and consonant combination, word, word, word combination, model sentence, radical combination, the equivalent replacement of these phonetic codes of radical), to a certain language of the non-Indo-European language family, such as Other forms of phonetic codes are used to combine the pronunciation of the sentence set, and the detailed rules in the encoding rules that are only applicable to this language and contain the compression encoding processing and the decoding rules that are only applicable to this language and contain compression decoding processing can still be referred to In the codec rules of the live broadcast method (or TTS method) mentioned earlier in this manual, which are only applicable to Chinese as an example, the codecs mainly use words and word combination phonetic codes as codecs, such as assigning numbers to phonetic codes, sorting words by frequency, etc. segment coding and decoding methods, and unfamiliar word processing and other details, or according to other forms of phonetic codes used, new coding rules that are only applicable to this language and contain compression coding processing and new coding rules that are only applicable to this language. Decoding rules for compression decoding processing. Mentioned in front of this paragraph ' if there are other forms of phonetic codes to put together the pronunciation of the sentence collection', its concrete meaning of putting together the pronunciation of the sentence collection with the phonetic codes of other forms refers to the phonetic code that can be formed by one or more other forms To piece together the sentence collection pronunciation, also can be by one or more other forms of sound sign indicating number and vowel consonant, vowel consonant combination, word, speech, word combination, model sentence, one or in radical combination and radical A variety of phonetic codes come together to piece together the pronunciation of the sentence set. This paragraph is applicable to non-Indo-European languages (Chinese, Korean or Japanese, etc.). The rules mentioned in this paragraph refer to encoding rules or decoding rules. Except when it is specifically stated that it has an exact meaning, the encoding and decoding rules mentioned above in this paragraph can be live broadcast, TTS, or mixed.

一个解码规则中有关非印欧语系任一语种的直播方式的有用3类音码或4类音码或其它形式音码的解码细则,其发音方式显然可以是整步直接发音方式或分步直接发音方式;一个解码规则中有关非印欧语系任一语种的TTS方式的有用3类音码或4类音码或其它形式音码的解码细则,其发音方式显然可以是整步TTS发音方式或分步TTS发音方式;一个解码规则中有关非印欧语系任一语种的混合方式的有用3类音码或4类音码或其它形式音码的解码细则,其发音形式是用混合发音方式,发音时可能会用到整步直接发音方式,整步TTS发音方式,分步直接发音方式,分步TTS发音方式等中的多种方式。In a decoding rule, there are useful 3-type phonetic codes or 4-type phonetic codes or other decoding rules for the live broadcast of any language in the non-Indo-European language family. The pronunciation method can obviously be the whole-step direct pronunciation method or step-by-step direct Pronunciation mode; the useful decoding rules of 3 types of phonetic codes or 4 types of phonetic codes or other forms of phonetic codes in the TTS mode of any language of the non-Indo-European language family in a decoding rule, and its pronunciation mode can obviously be a whole-step TTS pronunciation mode or Step-by-step TTS pronunciation mode; the decoding rules of useful 3-class phonetic codes or 4-class phonetic codes or other forms of phonetic codes about the mixed mode of any language in the non-Indo-European language family in a decoding rule, and its pronunciation form is to use the mixed pronunciation mode, When pronouncing, you may use the whole-step direct pronunciation method, the whole-step TTS pronunciation method, the step-by-step direct pronunciation method, and the step-by-step TTS pronunciation method.

从这里到190页对数字发音的处理的叙述为止有关编解码技术的阐述是对前面述的有关印欧语系语种的编解码规则举例和非印欧语系语种的编解码规则举例的技术的补充,而且这些补充的编解码技术可以是适用于直播方式或TTS方式或混合方式的编解码处理的,(在述及有关举例的适用于印欧语系语种的编解码规则时仍是以其只适用于英语为例的,在述及有关举例的适用于非印欧语系语种的编解码规则时仍是以其只适用于汉语为例的;这些补充的技术显然也可用于任一复合语种的编解码):From here to the narration of the processing of digital pronunciation on page 190, the elaboration on the codec technology is a supplement to the above-mentioned codec rules for Indo-European languages and non-Indo-European languages. And these supplementary coding and decoding technologies can be applicable to the coding and decoding processing of live mode or TTS mode or mixed mode, (when referring to the coding and decoding rules applicable to the Indo-European languages of relevant examples, it is still only applicable to Taking English as an example, when referring to the codec rules applicable to non-Indo-European languages, it is still only applicable to Chinese as an example; these supplementary technologies can obviously also be used for codec rules of any compound language ):

对于编解码处理,还有一个技术改进点,以英语为例,目的是为了增加压缩效率;在编解码时,因码长限制,能用于编码的单词及单词组合的个数是有限的(比如4千个);对于不同题材的短文,其使用的单词及单词组合往往按题材分成类,在某一种题材下,经常使用相应类下的单词和单词组合,比如把PEN,BOOK,TEACH,GO TO SCHOOL,ASKFor encoding and decoding processing, there is another technical improvement point. Taking English as an example, the purpose is to increase compression efficiency; when encoding and decoding, due to the limitation of code length, the number of words and word combinations that can be used for encoding is limited ( For example, 4,000); for essays of different themes, the words and word combinations used are often divided into categories according to the theme. Under a certain theme, words and word combinations under the corresponding category are often used, such as PEN, BOOK, TEACH , GO TO SCHOOL, ASK

等词和词组归成一类,在学校题材的短文时,这类里面的词显然是经常使用的词,将这些词放在此题材的单词数据库的前边,用短编号表示(3BIT或6BIT表示),而且其经常使用,所以就可达到进一步压缩的目的,而且某题材使用的偏僻词也可编入其题材的单词数据库而不用当陌生单词处理,也可节约码长,当然单词数据库里还应包括THE,FOR这些常用公共词。实际用时,可以做多个单词数据库,每个单词数据库对应不同的题材(比如学校,餐厅,工厂等),然后统计每个题材下所有可能用到的单词及单词组合及它们的使用频率(包括本题材有关单词及THE,FOR这些常用公共词),将其编入相应题材的单词数据库,使用频率高的赋短编号,使用频率低的赋长编号,可在短文编的第1个码字的头段里指示本短文用的是哪个单词数据库,然后其余编码同于本说明书前面述的直播方式的以只适用于英语为例的编码规则的处理。解码时,先在本短文第1个码字里解出得知使用哪个题材的单词数据库,然后其余解码同于本说明书前面述的直播方式的以只适用于英语为例的解码规则的处理。对于以汉语为代表的非印欧语系语种,同样可使用此方式。对于使用其它音码的可类似处理。使用这种按题材分建词数据库的技术后,对拾音器的其它软硬件没有影响,即码字仍可以是印在书上,卡片上等的,码字可以是导电码,条形码等,相应的读码头仍可以是机电式导电码读码头,光电式读码头等。Such words and phrases are grouped into one category. In the short essays of school subjects, the words in this category are obviously frequently used words. Put these words in front of the word database of this theme, and use short numbers to indicate (3BIT or 6BIT representation) , and it is often used, so the purpose of further compression can be achieved, and the remote words used in a certain theme can also be compiled into the word database of the theme instead of being treated as unfamiliar words, and the code length can also be saved. Of course, the word database should also be Including THE, FOR these common common words. In actual use, you can make multiple word databases, each word database corresponds to a different theme (such as schools, restaurants, factories, etc.), and then count all possible words and word combinations and their frequency of use under each theme (including Words related to this theme and THE, FOR these commonly used public words) are compiled into the word database of the corresponding theme, and the short codes with high frequency of use and the long codes with low frequency of use can be edited in the first code word of the short text Indicate which word database this essay uses in the header, and then the rest of the encoding is the same as the encoding rules that are only applicable to English as an example in the live broadcast mode mentioned above in this manual. When decoding, first solve the word database of which subject matter to use in the first code word of this short text, and then the rest of the decoding is the same as the processing of the decoding rules that are only applicable to English as an example in the live broadcast mode mentioned above in this manual. This method can also be used for non-Indo-European languages represented by Chinese. Can be similarly processed for using other phonetic codes. After using this technique of building word databases according to subject matter, it has no effect on other software and hardware of the pickup, that is, code words can still be printed on books, cards, etc., code words can be conductive codes, bar codes, etc., corresponding The reading dock can still be an electromechanical conductive code reading dock, a photoelectric reading dock, etc.

另外,还有一个技术改进点:在一个段落里,有些不常用的词,比如专用名词或动词,在段落前面句子里出现后,经常在段落后面句子里也出现,所以就可采取一点改进措施,这里对词的频率编号,词的分类及关联的定义以及词段编解码方式,陌生词处理,时态词形处理等还是依照本说明书前面反复阐述的,并假设频率编号采取阶梯编号,各阶梯编号的编号位长分别为3,6,9,12,编号位长为3的是8个最常用的词,编号位长为6的编号共64个,是次常用的词,在这64个编号里开10个不定义成固定词的编号(这样一来因这里有10个词多出来,需对词数据库重新编号),这10个编号专门用于编码时存储冷僻词,可称暂存器RAM里用于存储这10个编号对应情况的数据区为暂用区,其频率编号范围可定义为110101-111110,即比如在前面句子遇到编号位长为12的词,在那里还是按正常方式在编码里记录其12位编号位长,并将其频率编号信息记录到暂用区,其编号位长就变为6了,110101,然后到后面的句子,如又用到这个词,就用它在暂用区里的编号代替,对于出现多个12位编号位长的词,可顺序记录在暂用区里,第二个转换成110110,如超出10个,就可使用某种替换策略,比如进暂用区最早的就被新进的替换,其余编码处理比如词段编码方式,陌生词处理,时态词形处理等还是按照本说明书前面述的处理;解码时,从前面句子开始解码,遇到12位编号位长的词就记录在解码暂用区,来一个加一个,超10个时替换也按照先进先出原则(原则需与编码时的定义一样或兼容),解后面句子时,遇到处于暂用区编号的词就查解码暂用区,将其替换成12位的频率编号,用于解码,其余解码处理比如词段解码方式,陌生词处理,时态词形处理等还是按照本说明书前面述的直播方式的以只适用于英语为例的解码规则的处理。这样处理,用短的频率编号来代替多次使用的长的频率编号,达到压缩目的。但有其局限性,就是需对段落的每个码字都能读出,如有一个没能正常读出码值,解码就可能不正常。本段所述适用于印欧语系和非印欧语系各语种,也适用与任一复合语种。In addition, there is another technical improvement point: in a paragraph, some uncommon words, such as proper nouns or verbs, appear in the sentence before the paragraph, and often appear in the sentence behind the paragraph, so some improvement measures can be taken , here the frequency numbering of words, the definition of word classification and association, the encoding and decoding method of word segments, the processing of unfamiliar words, the processing of tense and word form, etc. are still in accordance with the previous repeated explanations in this manual, and assume that the frequency numbering adopts step numbering, each The numbering lengths of the ladder numbers are 3, 6, 9, and 12 respectively. Those with a numbering length of 3 are the 8 most commonly used words, and those with a numbering length of 6 have a total of 64 words, which are the second most commonly used words. Among the 64 There are 10 numbers in each number that are not defined as fixed words (in this way, there are 10 more words here, and the word database needs to be renumbered). These 10 numbers are specially used to store uncommon words during encoding, which can be called temporary The data area used to store the corresponding situations of these 10 numbers in the memory RAM is a temporary area, and its frequency number range can be defined as 110101-111110. Record its 12-digit number bit length in the code in the normal way, and record its frequency number information into the temporary use area, and its number bit length becomes 6, 110101, and then to the following sentence, if this word is used again , just replace it with its number in the temporary area. For words with multiple 12-digit number lengths, they can be recorded in the temporary area in sequence, and the second one is converted into 110110. If there are more than 10 words, a certain word can be used. For example, the earliest entry into the temporary use area will be replaced by the new entry, and the rest of the encoding processing, such as word segment encoding, unfamiliar word processing, tense word form processing, etc., will still be processed according to the above-mentioned processing in this manual; when decoding, start from The previous sentence starts to be decoded. When encountering a word with a length of 12 digits, it will be recorded in the temporary decoding area. Add one by one, and replace it when it exceeds 10. , when solving the following sentence, if you encounter a word in the temporary area number, check the decoding temporary area and replace it with a 12-digit frequency number for decoding. Other decoding processes such as word segment decoding, unfamiliar word processing, time Morphology processing etc. are still according to the processing of the decoding rules that are only applicable to English as an example in the live broadcast mode mentioned above in this specification. In this way, short frequency numbers are used to replace long frequency numbers used many times to achieve the purpose of compression. But it has its limitations, that is, it needs to read every code word of the paragraph, if one fails to read the code value normally, the decoding may not be normal. This paragraph applies to all Indo-European and non-Indo-European languages, and also applies to any compound language.

对缩略语的处理,比如AM,PM,可与普通单词或单词组合一样处理,常用的缩略语编入单词数据库,不常用的缩略语不编入单词数据库,遇到不常用的缩略语就当陌生单词处理。The processing of abbreviations, such as AM and PM, can be processed in the same way as ordinary words or word combinations. Commonly used abbreviations are compiled into the word database, and uncommonly used abbreviations are not compiled into the word database. Strange word processing.

关于语句停顿的问题,对于简单的读物内容编码,一般是一个处理单位对应一个句子(此一个句子是被当成一个语句的),句子中间没逗号(如英语),没其它停顿,解码时每个处理单位结束(句子结束)语句播放就作停顿。而对于一些复杂情况,比如编码时想用一个处理单位对应多个句子而每句子结束需停顿,或者句子内有多个由逗号隔开的分句(或主从句)而遇到逗号需停顿,或者还有其它需停顿的情况;解决方法,可以在词数据库里专门占用几个词的频率编号位置,这些编号不是对应词,而是分别表示句子间停顿(句号断句),逗号停顿和其它停顿,编码时用这些编号分别代替句号,逗号和其它停顿,比如用频率编号000000100来表示逗号,词数据库里此编号就不对应词,然后对语句编码时,将逗号用此编号代替,之后的编码此逗号编号与语句中词的频率编号统一处理,解码时遇到这个000000100编号就知道是逗号,并作语气停顿;另外,为了解决停顿问题,还可在划分语句时,就把被句号(断句),逗号或其它停顿隔开的各句子(或句子的部分)分别划为不同的语句,分别编码生成各处理单位的值;而如是把一个分句划为一个语句,那就是用一个处理单位对应一个句子的部分,对其编解码仍与对是一个完整句子的语句的编解码处理相同(编解码时是把上句说的一个句子的部分当成一个语句的),参看本说明书里的编解码技术叙述即可,这样各处理单位内就没有语气停顿了,只在处理单位后面有停顿。本段所述适用于印欧语系和非印欧语系各语种,也适用与任一复合语种。Regarding the problem of sentence pauses, for simple reading content encoding, generally a processing unit corresponds to a sentence (this sentence is regarded as a sentence), there is no comma in the middle of the sentence (such as English), and there is no other pause. When decoding, each When the processing unit ends (sentence end), the playback of the sentence pauses. For some complex situations, such as when coding, you want to use one processing unit to correspond to multiple sentences and you need to pause at the end of each sentence, or there are multiple clauses (or main clauses) separated by commas in the sentence and you need to pause when encountering commas. Or there are other situations that need to pause; the solution is to take the frequency number positions of several words in the word database. These numbers are not corresponding words, but represent pauses between sentences (periods), comma pauses and other pauses When encoding, use these numbers to replace periods, commas and other pauses. For example, use the frequency number 000000100 to represent commas. This number does not correspond to words in the word database. Then, when encoding sentences, replace commas with this number, and then encode This comma numbering and the frequency numbering of word in the statement are processed in unison, and when decoding, run into this 000000100 numbering and just know that it is a comma, and make tone pause; ), commas or other pauses to separate each sentence (or part of a sentence) are divided into different sentences, respectively coded to generate the value of each processing unit; and if a clause is divided into a sentence, that is to use a processing unit Corresponding to the part of a sentence, its encoding and decoding is still the same as that of a sentence that is a complete sentence (when encoding and decoding, the part of a sentence mentioned in the previous sentence is regarded as a sentence), see the encoding and decoding in this manual The decoding technique can be described, so that there is no tone pause in each processing unit, only a pause behind the processing unit. This paragraph applies to all Indo-European and non-Indo-European languages, and also applies to any compound language.

对于印欧语系各语种,还可把一些与语句发音有关的非常规发音的信息定义为一类特殊的音码,比如前面已有述的语句中的停顿,这类音码可有:句子间停顿,分句间逗号停顿(主从句间逗号停顿也可归属于分句间逗号停顿),重读,次重读,轻读等(这里是指的需重读,次重读,轻读是针对语句中的单词的,而不是指单词内的音节的重读,次重读),这些音码可与其它要用于编码的音码统一编号(即腾出部分单词的频率编号,来赋给这些特殊音码),比如句子间停顿,分句间逗号停顿,重读可分别赋予频率编号000000100,000000101,000000110,然后编码时可把特殊音码当成单词来处理,并与其它音码混合编码,比如英语的‘Is engineer.’是一句话,engineer需重读,就可把重读的编号附在engineer的编号前:For the various languages of the Indo-European language family, some unconventional pronunciation information related to the pronunciation of sentences can also be defined as a class of special phonetic codes, such as the pause in the aforementioned sentence, this class of phonetic codes can have: between sentences Pause, comma pause between clauses (comma pause between main and subordinate sentences can also be attributed to comma pause between clauses), stress, secondary stress, light reading, etc. word, rather than referring to the stress of the syllable in the word, secondary stress), these phonetic codes can be used for encoding with other phonetic codes for unified numbering (promptly vacate the frequency numbering of some words, to give these special phonetic codes) , such as pauses between sentences, comma pauses between clauses, stress can be assigned frequency numbers 000000100, 000000101, 000000110 respectively, and then special phonetic codes can be treated as words when encoding, and mixed with other phonetic codes, such as 'Is in English engineer.' is a sentence, the engineer needs to re-read it, and the re-read number can be attached before the engineer’s number:

举例:(采用频率排序方式来对单词段编码)For example: (using frequency sorting method to encode word segments)

00011:单词段里的单词及单词组合个数(3个)00011: the number of words and word combinations in the word segment (3)

01:is单词的频率编号的长度信息01: The length information of the frequency number of the is word

010110:is单词的频率编号010110: the frequency number of the word is

100:重读的频率编号长度信息100: Reread frequency number length information

000000110:重读的频率编号000000110: reread frequency number

10:engineer单词的频率编号的长度信息10: The length information of the frequency number of the word engineer

010110010:engineer单词的频率编号010110010: the frequency number of the word engineer

把以上各BIT按顺序合起来,就是编码值:000110101011010000000011010010110010Combine the above BITs in order to get the encoded value: 000110101011010000000011010010110010

对于非印欧语系各语种,特殊音码也可有:句子间停顿,分句间逗号停顿,重读,次重读,轻读等,其编号及编解码处理原理与印欧语系任一语种相似,可参照上几段述的对印欧语系语种的处理。对于复合语种也可这样定义特殊音码。For non-Indo-European languages, special phonetic codes can also have: pauses between sentences, comma pauses between clauses, stress, second stress, light reading, etc., its numbering and codec processing principles are similar to any language of the Indo-European language family, You can refer to the treatment of Indo-European languages mentioned in the previous paragraphs. Also can define special phonetic sign indicating number like this for compound language.

对印欧语系各语种或非印欧语系各语种或任一复合语种,如按解码规则MCU解码时有得到语句的预备发音数据,那得到语句的预备发音数据之后,如按直接发音方式来播放语句发音(以整步直接发音方式为例),发音时,如遇到预备发音数据中的重读的符号标志(由重读的频率编号转得),就可把需重读的单词(或词)的音的波形的振幅加大,即可得到重读效果,对于轻读是类似的可把发音的波形的振幅减小以得到轻读效果;如按TTS发音方式来播放语句发音,且TTS器件可处理重读,那就可在由MCU向TTS器件传输的语句的预备发音数据(或部分预备发音数据)里插入有关重读轻读的符号。For each language of the Indo-European language family or non-Indo-European language family or any compound language, if the MCU decodes according to the decoding rules and obtains the prepared pronunciation data of the sentence, after obtaining the prepared pronunciation data of the sentence, if it is played according to the direct pronunciation method Sentence pronunciation (taking whole step direct pronunciation mode as example), during pronunciation, if run into the stressed symbol mark (by the frequency number of stressed reading) in the preparation pronunciation data, just can put the word (or speech) of needing stressed The amplitude of the waveform of the sound is increased, and the stress reading effect can be obtained. It is similar to the light reading, and the amplitude of the pronunciation waveform can be reduced to obtain the light reading effect; if the pronunciation of the sentence is played according to the TTS pronunciation method, and the TTS device can handle Stress reading, then can be inserted in the preparation pronunciation data (or part preparation pronunciation data) of the sentence that transmits to TTS device by MCU the symbol about stress reading light reading.

对不同语系不同语种,具体定义的特殊音码及个数不一定相同。For different languages of different language families, the special phonetic codes and numbers of specific definitions are not necessarily the same.

在编码时,不一定非要用特殊音码;而使用特殊音码是需与别的音码混合使用来编解码的,对于印欧语系语种(英语例),特殊音码一般是需与元辅音,元音辅音组合,词根,单词,单词组合,范句,字符组合和字母中的一种或多种音码混合使用来编解码的,特殊音码可处于语句编码的中间;对于非印欧语系语种(汉语例),特殊音码一般是需与元辅音,元音辅音组合,字,词,词组合,范句,偏旁部首组合和偏旁部首中的一种或多种音码混合使用来编解码的,特殊音码可处于语句编码的中间。不论印欧语系的各语种或非印欧语系的各语种或任一复合语种,如编解码涉及有使用特殊音码,那显然拾音器解码时涉及的产音方式可为直接产音方式或TTS产音方式(或拾音器既可用直接产音方式也可用TTS产音方式)。When encoding, it is not necessarily necessary to use special phonetic codes; and use special phonetic codes to be mixed with other phonetic codes for encoding and decoding. Consonants, vowel-consonant combinations, roots, words, word combinations, model sentences, character combinations and letters are coded and decoded by mixing one or more sound codes, and special sound codes can be in the middle of sentence codes; for non-printed For European languages (Chinese example), special phonetic codes generally need to be combined with vowel-consonant, vowel-consonant combination, character, word, word combination, model sentence, radical combination and radical combination and one or more phonetic codes Mixed use to encode and decode, the special sound sign indicating number can be in the middle of sentence encoding. Regardless of the various languages of the Indo-European language family or non-Indo-European languages or any compound language, if the codec involves the use of special phonetic codes, it is obvious that the sound production method involved in the decoding of the pickup can be the direct sound production method or the TTS production method. sound mode (or the pickup can be used in both direct sound production mode and TTS sound production mode).

下面举一个编码例子和解码例子,来说明语句中含多个句子的编解码,以及单词组合在编解码中的使用,以及句子间停顿的特殊音码的使用:The following is an encoding and decoding example to illustrate the encoding and decoding of multiple sentences in a sentence, the use of word combinations in encoding and decoding, and the use of special phonetic codes for pauses between sentences:

语句里含有两个句子,一个处理单位对应一个语句的编码举例(英语例):A sentence contains two sentences, and one processing unit corresponds to a coding example of a sentence (English example):

语句为:The man is an engineer.The man is fat.The sentence is: The man is an engineer. The man is fat.

编码的各BIT为:The encoded BITs are:

头段:header:

00:本语句是英语语种00: This statement is in English language

00:单词段采用第1种词段编码方式(即频率排序编码方式)00: The word segment adopts the first word segment encoding method (that is, the frequency sorting encoding method)

单词段:word segment:

01001:单词段里的单词及单词组合个数(9个)01001: the number of words and word combinations in the word segment (9)

00:the单词的频率编号的长度信息00: Length information of the frequency number of the word

010:the单词的频率编号010: The frequency number of the word

10:man单词的频率编号的长度信息10: The length information of the frequency number of the man word

011100100:man单词的频率编号011100100: the frequency number of the word man

01:is an单词组合的频率编号的长度信息01: The length information of the frequency number of the is an word combination

110011:is an单词组合的频率编号110011: Frequency number of is an word combination

10:engineer单词的频率编号的长度信息10: The length information of the frequency number of the word engineer

010110010:engineer单词的频率编号010110010: the frequency number of the word engineer

10:表示句子间停顿的特殊音码的频率编号的长度信息10: the length information of the frequency number of the special sound code that represents the pause between sentences

000000100:表示句子间停顿的特殊音码的频率编号000000100: the frequency number of the special sound code that represents the pause between sentences

00:the单词的频率编号的长度信息00: Length information of the frequency number of the word

010:the单词的频率编号010: The frequency number of the word

10:man单词的频率编号的长度信息10: The length information of the frequency number of the man word

011100100:man单词的频率编号011100100: the frequency number of the word man

01:is单词的频率编号的长度信息01: The length information of the frequency number of the is word

010110:is单词的频率编号010110: the frequency number of the word is

01:fat单词的频率编号的长度信息01: The length information of the frequency number of the fat word

001011:fat单词的频率编号001011: frequency number of fat word

时态词形选择段(表示停顿的特殊音码显然没有时态词形选择值):Temporal morphological selection segment (the special phonetic sign indicating number of pause obviously does not have temporal morphological selection value):

0:man单词的时态词形选择值(此单词在单词数据库此词对应的记录项里有man,men共两个词形形态,用1个bit表示时态词形选择值,本语句用的man词是此词两个词形形态man,men中的第1个,所以时态词形选择值是0)0: tense word form selection value of the word man (there are two word form forms of man and men in the record item corresponding to this word in the word database, and 1 bit is used to represent the tense word form selection value. This statement uses The word man is the first of the two morphological forms of this word, man and men, so the tense morphological selection value is 0)

011:is an单词组合的时态词形选择值(此单词组合有is a,be a,was a,isan,be an,was an共6个词形形态,用3个bit表示,is an是6个词形形态中的第4个,所以时态词形选择值是011)011: tense word form selection value of is an word combination (this word combination has is a, be a, was a, isan, be an, was an a total of 6 word forms, represented by 3 bits, is an is 4th of 6 lemmas, so tense morphological selection value is 011)

0:engineer单词的时态词形选择值(此单词有engineer,engineers共两个词形形态,用1个bit表示,engineer是2个词形形态中的第1个,所以时态词形选择值是0)0: tense word form selection value of the word engineer (this word has two word forms of engineer and engineers, represented by 1 bit, and engineer is the first of the two word forms, so the tense word form is selected value is 0)

0:第2个man单词的时态词形选择值(此单词有man,men共两个词形形态,用1个bit表示,man是2个词形形态中的第1个,所以时态词形选择值是0)0: The tense and morphological selection value of the second man word (this word has two morphological forms of man and men, which are represented by 1 bit, and man is the first of the two morphological forms, so the tense Lemmatization value is 0)

000:is单词的时态词形选择值(此单词有is,be,was,been,being共五个词形形态,用3个bit表示,is是5个词形形态中的第1个,所以时态词形选择值是000)000: tense word form selection value of the word is (this word has five form forms of is, be, was, been, being, represented by 3 bits, is is the first of the five form forms, So tense word form selection value is 000)

把以上各BIT按顺序合起来得语句的编码值(即表示语句发音的拼合信息的值,也即一个处理单位的值)为:Combining the above BITs in sequence to obtain the coded value of the sentence (that is, the value of the combined information representing the pronunciation of the sentence, that is, the value of a processing unit) is:

00 00 01001000101001110010001 1100111001011001010000000100000101000 00 01001000101001110010001 11001110010110010100000001000001010

0111001000101011001 0010110011000000111001000101011001 001011001100000

本语句没有陌生单词,所以本处理单位中不含陌生单词段。There are no unfamiliar words in this sentence, so this processing unit does not contain unfamiliar word segments.

本语句编码中有使用单词组合,在编码用的单词数据库里是需要存储有这些所用的单词组合的记录项信息的。There are word combinations used in the encoding of this sentence, and the record item information of these used word combinations needs to be stored in the word database used for encoding.

语句里含有两个句子,一个处理单位对应一个语句的解码举例(英语例):A sentence contains two sentences, and one processing unit corresponds to a decoding example of a sentence (English example):

语句原话仍为上例:The man is an engineer.The man is fat.The original sentence is still the above example: The man is an engineer. The man is fat.

语句编码值(即表示语句发音的拼合信息的值,也即一个处理单位的值)仍为上例:The sentence encoding value (that is, the value of the combined information representing the pronunciation of the sentence, that is, the value of a processing unit) is still the above example:

00 00 01001000101001110010001 1100111001011001010000000100000101000 00 01001000101001110010001 11001110010110010100000001000001010

0111001000101011001 0010110011000000111001000101011001 001011001100000

分开看编码的各BIT为:Look at the BITs of the encoding separately:

头段:header:

00:本语句是英语语种00: This statement is in English language

00:单词段采用第1种词段编码方式(即频率排序编码方式)00: The word segment adopts the first word segment encoding method (that is, the frequency sorting encoding method)

单词段:word segment:

01001:单词段里的单词及单词组合个数(9个)01001: the number of words and word combinations in the word segment (9)

00:the单词的频率编号的长度信息00: Length information of the frequency number of the word

010:the单词的频率编号010: The frequency number of the word

10:man单词的频率编号的长度信息10: The length information of the frequency number of the man word

011100100:man单词的频率编号011100100: the frequency number of the word man

01:;is an单词组合的频率编号的长度信息01: length information of the frequency number of ;is an word combination

110011:is an单词组合的频率编号110011: Frequency number of is an word combination

10:engineer单词的频率编号的长度信息10: The length information of the frequency number of the word engineer

010110010:engineer单词的频率编号010110010: the frequency number of the word engineer

10:表示句子间停顿的特殊音码的频率编号的长度信息10: the length information of the frequency number of the special sound code that represents the pause between sentences

000000100:表示句子间停顿的特殊音码的频率编号000000100: the frequency number of the special sound code that represents the pause between sentences

00:the单词的频率编号的长度信息00: Length information of the frequency number of the word

010:the单词的频率编号010: The frequency number of the word

10:man单词的频率编号的长度信息10: The length information of the frequency number of the man word

011100100:man单词的频率编号011100100: the frequency number of the word man

01:is单词的频率编号的长度信息01: The length information of the frequency number of the is word

010110:is单词的频率编号010110: the frequency number of the word is

01:fat单词的频率编号的长度信息01: The length information of the frequency number of the fat word

001011:fat单词的频率编号001011: frequency number of fat word

时态词形选择段(表示停顿的特殊音码显然没有时态词形选择值):Temporal morphological selection segment (the special phonetic sign indicating number of pause obviously does not have temporal morphological selection value):

0:011100100词(man)的时态词形选择值(此单词在单词数据库此词对应的记录项里有man,men共两个词形形态,用1个bit表示时态词形选择值,本语句用的man词是此词两个词形形态man,men中的第1个,所以时态词形选择值是0)0: 011100100 word (man) tense morphological selection value (this word has two morphological forms of man and men in the record item corresponding to this word in the word database,use 1 bit to represent the tense morphological selection value, The man word used in this sentence is the first of the two forms of this word, man and men, so the tense form selection value is 0)

011:110011词(is an)的时态词形选择值(此单词组合有is a,be a,was a,is an,be an,was an共6个变化,用3个bit表示,is an是6个词形形态中的第4个,所以时态词形选择值是011)011: 110011 The tense word form selection value of the word (is an) (this word combination has 6 changes of is a, be a, was a, is an, be an, was an, represented by 3 bits, is an is the 4th of 6 morphological forms, so the tense morphological selection value is 011)

0:010110010词(engineer)的时态词形选择值(此单词有engineer,engineers共两个变化,用1个bit表示,engineer是2个词形形态中的第1个,所以时态词形选择值是0)0: 010110010 tense word form selection value of the word (engineer) (this word has two changes of engineer and engineers, represented by 1 bit, engineer is the first of the two word forms, so the tense word form select value is 0)

0:第2个011100100词(man)的时态词形选择值(此单词有man,men共两个变化,用1个bit表示,man是2个词形形态中的第1个,所以时态词形选择值是0)0: The tense and word form selection value of the second 011100100 word (man) (this word has two changes of man and men, represented by 1 bit, man is the first of the two word forms, so time Morphological selection value is 0)

000:010110词(is)的时态词形选择值(此单词有is,be,was,been,being共五个词形形态,用3个bit表示,is是5个词形形态中的第1个,所以时态词形选择值是000)000: 010110 tense word form selection value of the word (is) (this word has five form forms of is, be, was, been, being, represented by 3 bits, and is is the first of the fiveform forms 1, so tense word form selection value is 000)

解码过程为:The decoding process is:

依照上述各BIT表示的信息,先取头段信息,得知本语句是英语语种且单词段是使用频率排序编码方式来编码的。According to the information represented by the above-mentioned BITs, the header information is obtained first, and it is known that the sentence is in English and the word segments are coded using a frequency sorting coding method.

第1步解码,使用频率排序解码方式,先在单词段里取单词段里的单词及单词组合个数信息(9个),再在单词段里往下顺序取得9个频率编号,得:The first step is to decode, use the frequency sorting decoding method, first take the word and word combination number information (9) in the word segment, and then obtain 9 frequency numbers in the word segment in order, and get:

010 011100100 110011 010110010 000000100(表示句子间停顿的频率编号)010011100100 010110 001011010 011100100 110011 010110010 000000100 (represents the frequency number of pauses between sentences) 010011100100 010110 001011

(这时单词段的数据取完,接着在语句编码值中就是时态词形选择段的数据的起始位置)(at this moment, the data of the word segment has been fetched, and then in the sentence encoding value, it is exactly the initial position of the data of the tense word form selection segment)

第2步,查单词数据表得上一步解码得到的各频率编号的时态词形选择值占的BIT位数,再查语句编码值中的时态词形选择段,得上一步解码得到的各频率编号(即单词或单词组合)的时态词形选择值(有些单词或单词组合比如THE是没选择值的,表示停顿的特殊音码也没选择值),再结合各频率编号的时态词形选择值查单词数据表,将第2步解码得到的各频率编号(除表示停顿的频率编号外)换成单词或单词组合的实际的词形形态的发音信息数据(本解码例的发音信息数据是以词形音标为例的),得语句的预备发音数据:The 2nd step, look up the word data table to get the BIT digit that the tense word form selection value of each frequency number obtained by decoding in the previous step accounts for, and then check the tense word form selection segment in the sentence code value, and obtain the previous step decoding The tense word form selection value of each frequency numbering (i.e. words or word combinations) (some words or word combinations such as THE do not have selection values, and the special sound codes that represent a pause do not have selection values), and then combine the time of each frequency numbering State morphological selection value checks the word data table, and each frequency number (except the frequency number that expresses pause) that the 2nd step decoding obtains is changed into the pronunciation information data of the actual morphological form of word or word combination (this decoding example Pronunciation information data is based on word shape and phonetic symbols), and the prepared pronunciation data of the sentence is obtained:

句子The man is an engineer中各单词及单词组合的词形音标,接着是句子间停顿标志(由表示停顿的频率编号000000100转得),再接着是句子The man is fat中各单词及单词组合的词形音标。The morphological phonetic symbols of each word and word combination in the sentence The man is an engineer, followed by the inter-sentence pause sign (transferred from the frequency number 000000100 representing the pause), and then the words and word combinations in the sentence The man is fat word form phonetic symbols.

MCU将解码执行到这里,再顺次播放出以上3行(即语句的预备发音数据)中的各词形音标的音(与拾音器有关的存储器集里应有各音标的录音数据比如ADPCM格式的录音数据),即是语句的全部发音。本解码例子是用整步直接发音方式来播放语句发音的。MCU will decode and carry out here, then play the sound of each word-form phonetic symbol in the above 3 rows (i.e. the prepared pronunciation data of sentence) in order (the recording data of each phonetic symbol such as the ADPCM format should be arranged in the memory set relevant to the pickup recording data), that is, the entire pronunciation of the sentence. This decoding example uses the full-step direct pronunciation method to play the sentence pronunciation.

在本例解码的第2步中有关对时态词形选择段的解码处理的具体细节,还可参考本说明书第83页及第83页后面几页所述对时态词形选择段的解码处理。In the 2nd step of decoding in this example, about the specific details of the decoding process to the tense word form selection segment, you can also refer to the decoding of the tense word form selection segment described on the 83rd page and the last few pages of the 83rd page of this specification deal with.

本语句编解码例中有使用单词组合,是需要在解码用的单词数据库里存储有这些所用的单词组合的信息的(可一个记录项存一个单词组合的信息),并且这里解码用的单词数据库里的单词及单词组合的数据是以与编码用的单词数据库里的单词及单词组合的数据相同为例的。In this sentence encoding and decoding example, word combinations are used, and it is necessary to store the information of these word combinations in the word database for decoding (one record item can store the information of one word combination), and here the word database for decoding The data of the words and word combinations in the code are the same as the data of words and word combinations in the word database used for encoding.

下面再举有关非印欧语系语种使用词组合的编解码例子(以汉语例):Here is another example of encoding and decoding of word combinations used in non-Indo-European languages (in the case of Chinese):

下面举一个较完整的涉及使用词组合的汉语的编码例子:Let's give a more complete example of encoding Chinese that involves using word combinations:

语句为:’这个漂亮男人是一个工程师’The statement is: 'This beautiful man is an engineer'

依照上述汉语的编码规则,分开编码是:According to the above Chinese encoding rules, the separate encoding is:

头段:header:

01:本语句是汉语语种01: This sentence is in Chinese language

00:词段采用第1种词段编码方式(即频率排序编码方式)00: The word segment adopts the first word segment encoding method (that is, the frequency sorting encoding method)

词段:Phrases:

00101:词段里的词及词组合的个数(5个)00101: the number of words and word combinations in the word segment (5)

00:’这个’词的频率编号的长度信息00: The length information of the frequency number of the word 'this'

011:’这个’词的频率编号011: frequency number of word 'this'

01:111111的频率编号长度信息01: Frequency number length information of 111111

111111:111111代表陌生词(‘漂亮’),(111111也计入词及词组合的个数)111111: 111111 represents an unfamiliar word ('beautiful'), (111111 is also included in the number of words and word combinations)

10:’男人’词的频率编号的长度信息10: length information of the frequency number of the word 'man'

011001100:’男人’词的频率编号011001100: frequency number for the word 'man'

01:’是一个’词组合的频率编号的长度信息01: The length information of the frequency number of the word combination of 'is a'

011001:’是一个’词组合的频率编号011001: frequency number for the word combination 'is a'

10:’工程师’词的频率编号的长度信息10: length information of the frequency number of the word 'engineer'

011101010:’工程师’词的频率编号011101010: frequency number for the word 'engineer'

陌生词段:Unfamiliar phrases:

01:陌生词的编码信息的格式选择信息(单字拼音作组合格式)01: Format selection information for encoding information of unfamiliar words (single-character pinyin as combined format)

01:单字拼音个数(2个)01: The number of pinyin of a single character (2)

01001011010:单字’漂’的拼音在单字拼音表里的顺序号01001011010: the sequence number of the pinyin of the single character 'Piao' in the single-character pinyin table

00101110000:单字’亮’的拼音在单字拼音表里的顺序号00101110000: the sequence number of the pinyin of the single character 'liang' in the single-character pinyin table

把以上各BIT按顺序合起来得语句编码值为(这里的语句编码值是一个处理单位的值,也是表示语句发音的拼合信息的值):Combine the above BITs in order to get the sentence coding value (the sentence coding value here is the value of a processing unit, and is also the value of the combined information representing the pronunciation of the sentence):

01 00 00101000110111111110011001100010110011001110101001010101 00 001010001101111111100110011000101100110011101010010101

0010110100010111000000101101000101110000

本编码例中的’是一个’是词组合。'is one' in this coding example is word combination.

下面举一个较完整的涉及使用词组合的汉语的解码例子:Here is a more complete example of decoding Chinese that involves the use of word combinations:

语句原话仍为上例:‘这个漂亮男人是一个工程师’The original sentence is still the above example: 'This beautiful man is an engineer'

语句编码值(即表示语句发音的拼合信息的值)仍为上例:Sentence encoding value (that is, the value of the combined information representing the pronunciation of the sentence) is still the above example:

01 00 00101000110111111110011001100010110011001110101001010101 00 001010001101111111100110011000101100110011101010010101

0010110100010111000000101101000101110000

顺序看各BIT:Look at each BIT in order:

头段:header:

01:本语句是汉语语种01: This sentence is in Chinese language

00:词段采用第1种词段编码方式(即频率排序编码方式)00: The word segment adopts the first word segment encoding method (that is, the frequency sorting encoding method)

词段:Phrases:

00101:词段里的词及词组合的个数(5个)00101: the number of words and word combinations in the word segment (5)

00:’这个’词的频率编号的长度信息00: The length information of the frequency number of the word 'this'

011:’这个’词的频率编号011: frequency number of word 'this'

01:111111的频率编号长度信息01: Frequency number length information of 111111

111111:111111代表陌生词(‘漂亮’),(111111也计入词及词组合个数)111111: 111111 represents an unfamiliar word ('beautiful'), (111111 is also included in the number of words and word combinations)

10:’男人’词的频率编号的长度信息10: length information of the frequency number of the word 'man'

011001100:’男人’词的频率编号011001100: frequency number for the word 'man'

01:’是一个’词组合的频率编号的长度信息01: The length information of the frequency number of the word combination of 'is a'

011001:’是一个’词组合的频率编号011001: frequency number for the word combination 'is a'

10:’工程师’词的频率编号的长度信息10: length information of the frequency number of the word 'engineer'

011101010:’工程师’词的频率编号011101010: frequency number for the word 'engineer'

陌生词段:Unfamiliar phrases:

01:陌生词的编码信息的格式选择信息(单字拼音作组合格式)01: Format selection information for encoding information of unfamiliar words (single-character pinyin as combined format)

01:单字拼音个数(2个)01: The number of pinyin of a single character (2)

01001011010:单字’漂’的拼音在单字拼音表里的顺序号01001011010: the sequence number of the pinyin of the single character 'Piao' in the single-character pinyin table

00101110000:单字’亮’的拼音在单字拼音表里的顺序号00101110000: the sequence number of the pinyin of the single character 'liang' in the single-character pinyin table

解码过程如下:The decoding process is as follows:

依照上述各BIT表示的信息,先取头段信息,得知本语句是汉语语种且词段是使用频率排序编码方式来编码的;According to the information represented by the above-mentioned BITs, the header information is first obtained, and it is known that the sentence is in the Chinese language and the word segment is encoded using a frequency sorting encoding method;

第1步解码,使用频率排序解码方式,先在词段里取词段里的词及词组合个数信息(5个),再在词段里往下顺序取得5个频率编号,得:The first step of decoding, using the frequency sorting decoding method, first fetches the number of words and word combinations (5) in the word segment, and then obtains 5 frequency numbers in the word segment in order, and gets:

011 111111 011001100 011001 011101010011 111111 011001100 011001 011101010

第2步解码,对于上一步得的111111,查语句编码值中的陌生词段(显然,上一步解码取完语句编码值中词段里的各频率编号的数据后,接着在语句编码值中就是陌生词段数据的起始位置),先得陌生词的编码信息的格式是单字拼音作组合格式,再在陌生词段往下取单字拼音的个数信息及各单字拼音的顺序号,并用单字拼音作组合格式来构造陌生词发音,得:011(01001011010 00101110000)011001100 011001 011101010The 2nd step decodes, for the 111111 that the previous step gets, look up the unfamiliar word segment in the sentence coding value (obviously, after the previous step decodes and gets the data of each frequency number in the word segment in the sentence coding value, then in the sentence coding value It is the starting position of the unfamiliar phrase data), and the format of the encoding information of the unfamiliar phrase is the single-character pinyin as the combined format, and then the number information of the single-character pinyin and the sequence number of each single-character pinyin are obtained in the unfamiliar phrase, and used Single-character pinyin is used as a combined format to construct the pronunciation of unfamiliar words: 011(01001011010 00101110000) 011001100 011001 011101010

第3步,查词数据表,将第2步解码得到的各频率编号转成发音信息数据(本解码例的发音信息数据是以词形拼音为例的;词形拼音由拼音组成),再查处理陌生词要涉及的单字拼音表,将用于组成陌生词发音的两个单字拼音的顺序号(第2步得的结果中的括号内的)各自转成拼音(拼音由拼音字符拼法表示,‘piao4’和‘liang4’)并将这两个拼音排在一起成陌生词的发音信息数据,得语句的预备发音数据为(拼音):The 3rd step, look up word data table, each frequency number that the 2nd step decodes and obtains is converted into pronunciation information data (the pronunciation information data of this decoding example is example with word form phonetic alphabet; Word form phonetic alphabet is made up of pinyin), again Look up and deal with the single-character phonetic table that unfamiliar word will involve, will be used for forming the sequence number of two single-character phonetic alphabets of unfamiliar word pronunciation (in the parentheses in the result that the 2nd step gets) change into phonetic respectively (phonetic is by phonetic character spelling Express, 'piao4' and 'liang4') and arrange these two pinyins together to form the pronunciation information data of unfamiliar words, and the prepared pronunciation data of the sentence is (pinyin):

zhe4ge4 piao4liang4 nan2ren2 shi4yi1ge4 gong1cheng2shi1zhe4ge4 piao4liang4 nan2ren2 shi4yi1ge4 gong1cheng2shi1

MCU将解码执行到此,再顺次播放出上一行(即语句的预备发音数据)中的各拼音的音(这些拼音,比如zhe4ge4中的zhe4,是属于单字拼音表里约1300个单字拼音中的拼音,在与拾音器有关的存储器集里应存有这约1300个单字拼音的各单字拼音的录音数据比如ADPCM格式的录音数据),即是语句的全部发音。本解码例子是用整步直接发音方式来播放语句发音的。The MCU executes the decoding up to this point, and then plays the sounds of each pinyin in the previous row (that is, the prepared pronunciation data of the sentence) in sequence (these pinyins, such as zhe4 in zhe4ge4, belong to about 1300 single-character pinyins in the single-character pinyin table The pinyin, the recording data of each single-character pinyin of these about 1300 single-character pinyins (such as the recording data of ADPCM format) should be stored in the memory set relevant to the pickup, which is the whole pronunciation of the sentence. This decoding example uses the full-step direct pronunciation method to play the sentence pronunciation.

本解码例中的’是一个’是词组合。'is one' in this decoding example is a combination of words.

在一个码字的值里可以放有一个或多个处理单位的值,对于一个码字的值里放有多个处理单位的值的情况,这多个处理单位的值可以在一个码字的值里顺序排放(或按其它方式比如按约定把各处理单位交叉排放也可),而且显然一个码字里的不同处理单位可由相同或不同的编码规则来编码生成,不同处理单位涉及的语句可以是相同或不同的语种。解码时,取出一个码字里各处理单位的值,对各处理单位的值分别解码即可。至于各处理单位在码字里按顺序排放时,如何表示一个处理单位在哪个BIT结束,一是可在处理单位的前部附加本处理单位的值的BIT长度信息,二是如采用本说明书前面述的以只适用于英语(或汉语)为例的编解码规则,对一个处理单位顺序解码,解码结束,数据指针指到的位置也就是本处理单位的值结束的位置,这就不需在处理单位的前部附长度信息。本段所说的一个码字里各处理单位中的任一处理单位涉及的语句的语种可以是任一非复合语种或任一复合语种。One or more processing unit values can be placed in the value of a codeword. For the case where there are multiple processing unit values in the value of a codeword, the values of these multiple processing units can be placed in the value of a codeword Values are arranged sequentially (or in other ways, such as by agreement, each processing unit can be cross-arranged), and obviously different processing units in a codeword can be encoded by the same or different coding rules, and the sentences related to different processing units can be be the same or different languages. When decoding, take out the value of each processing unit in a code word, and decode the value of each processing unit separately. As for how to indicate which BIT a processing unit ends in when each processing unit is arranged sequentially in the codeword, one is to add the BIT length information of the value of the processing unit to the front of the processing unit; The encoding and decoding rules described above are only applicable to English (or Chinese) as an example. A processing unit is sequentially decoded. After the decoding is completed, the position pointed by the data pointer is the position where the value of the processing unit ends. Handle the front of the unit with length information. The language of the sentence involved in any processing unit in a code word mentioned in this paragraph can be any non-compound language or any compound language.

(以下4段内容涉及的语句的语种可以是任一非复合语种或任一复合语种,以下4段涉及的各处理单位都是用一个编码规则中有关此语种的编码细则来编码生成的,涉及的各处理单位都是用一个解码规则中有关此语种的解码细则来解码的)(The language of the sentences involved in the following 4 paragraphs can be any non-compound language or any compound language, and each processing unit involved in the following 4 paragraphs is coded and generated by coding rules related to this language in a coding rule. Each processing unit of the language is decoded by the decoding rules related to this language in a decoding rule)

另外,因语句内可能含一个或多个句子,一个处理单位对应一个语句,所以一个处理单位可能对应一个或多个句子,一般情况,编码中的一个处理单位的结束就是处理单位内最后一个句子的结束,解码时需由MCU作语气停顿。但也可能一个处理单位是在最后一个句子中间结束(包括是在句子中的分句间的逗号处结束的情况),下一个处理单位是从此句子的中间接着编码,其区别方式属常规技术,可自定,比如可以在每个处理单位的BIT值后面约定附加一个BIT,0表示是本处理单位的最后是句子结束要加语气停顿,如为1表示句子没完不停顿,而逗号编号,其它停顿编号可处于处理单位内语句的任意位置,甚至可以是语句的最后。MCU解码时,对一个码字的值里放有多个处理单位的情况,可把各处理单位取出后,各自解码并顺序播出音来。In addition, because a sentence may contain one or more sentences, and one processing unit corresponds to one sentence, one processing unit may correspond to one or more sentences. In general, the end of a processing unit in coding is the last sentence in the processing unit At the end of the decoding, the MCU needs to pause for tone. But it is also possible that a processing unit ends in the middle of the last sentence (comprising the situation of ending at the comma between the clauses in the sentence), and the next processing unit is to encode in the middle of this sentence, and its distinguishing method belongs to conventional technology. It can be customized. For example, you can agree to add a BIT after the BIT value of each processing unit. 0 means that the end of the processing unit is the end of the sentence to add a tone pause. The stall number can be anywhere in a statement within a processing unit, even at the end of a statement. When the MCU is decoding, if there are multiple processing units in the value of a codeword, each processing unit can be taken out, decoded and played sequentially.

按上段所述,那在语句中,各句子中最前面的句子可以是部分句子(语句中只保留原文第1个句子的后一部分,可称为截头),各句子中最后的句子也可以是部分句子(语句中只保留原文最后1个句子的前一部分。可称为截尾),比如:语句内容如下:The man is anengineer.He is Tom’s brother.He live,语句含有前面两个完整句子加最后一个部分句子,原文最后一句子本来是He live in NewYork,本语句中只保留了句子前一部分He live,后面的in NewYork留到下一个语句编码,且in NewYork成为下一个语句的最开始的一个非完整句子。甚至可把要连着读的数个句子任意截成几节,每节作为一个语句,一节可能只含一个非完整句子,这个非完整句子是既被截头也被截尾即只留下原来完整句子的中间部分,也可能是含有数个完整句子加数个非完整句子等情况,可对每节编码生成一处理单位,可把这些处理单位的值顺序放在码字的值里。(本段所称部分句子是指一个句子的部分,比如一个句子的前半部分或后半部分,本发明文件所说的部分句子与非完整句子是同一概念)According to the above paragraph, in the sentence, the first sentence in each sentence can be a partial sentence (only the second part of the first sentence in the original text is kept in the sentence, which can be called a truncation), and the last sentence in each sentence can also be It is a partial sentence (only the first part of the last sentence in the original text is kept in the sentence. It can be called censored), for example: the content of the sentence is as follows: The man is an engineer.He is Tom's brother.He live, the sentence contains the first two complete sentences Add the last part of the sentence. The last sentence of the original text was originally He live in NewYork. In this sentence, only the first part of the sentence He live is kept, and the following in NewYork is reserved for the next sentence encoding, and in NewYork becomes the beginning of the next sentence an incomplete sentence of . Even several sentences to be read in a row can be arbitrarily cut into several sections, each section is regarded as a sentence, and a section may contain only one incomplete sentence, and this incomplete sentence is either truncated or truncated The middle part of the original complete sentence may also contain several complete sentences plus several incomplete sentences. A processing unit can be generated for each section code, and the value order of these processing units can be placed in the value of the code word. (the said partial sentence in this paragraph refers to the part of a sentence, such as the first half or the second half of a sentence, and the partial sentence and incomplete sentence mentioned in the present invention document are the same concept)

由上两段所述可得,语句可以是含有数个(一个或多个)完整句子和数个非完整句子,或者语句可以是含有一个非完整句子,或者语句可以是含有多个非完整句子(比如含有一个被截头的句子和一个被截尾的句子)等,这些种情况可看成是与本说明书前面反复阐述的‘语句可以是一个完整的句子,或者是一个非完整句子,或者是含有多个完整句子’这几个概念并列的概念。From the above two paragraphs, it can be obtained that a sentence can contain several (one or more) complete sentences and several incomplete sentences, or a sentence can contain one incomplete sentence, or a sentence can contain multiple incomplete sentences (such as containing a truncated sentence and a truncated sentence), etc., these situations can be regarded as the same as the "statement can be a complete sentence, or a non-complete sentence, or It is a concept that contains multiple complete sentences' these concepts are juxtaposed.

如语句中含有数个句子,且还含有非完整句子,一个处理单位对应此语句,对其中的完整句子或非完整句子的编解码处理仍都可参照完整句子的编解码处理方式(因如一个句子缺少如主语或谓语等后,并不影响采用频率排序等词段编解码方式对其编解码),至于各句子间的停顿处理方式前已有述,比如可定义一个频率编号来作为句子间停顿。同理,如语句中含有一个非完整句子(被截头,或被截尾,或既被截头又被截尾的),一个处理单位对应此语句,对此语句的编解码处理仍都可参照完整句子的编解码处理方式Contain several sentences in the statement, and also contain incomplete sentence, a processing unit corresponds to this statement, and the coding and decoding processing of the complete sentence or the incomplete sentence in it can still refer to the coding and decoding processing mode of the complete sentence (because such as a If a sentence lacks such as a subject or a predicate, etc., it will not affect the encoding and decoding of word segments such as frequency sorting). pause. Similarly, if a sentence contains an incomplete sentence (truncated, or truncated, or both truncated and truncated), a processing unit corresponds to this sentence, and the encoding and decoding of this sentence can still be processed. Refer to the codec processing method of the complete sentence

在一本有声读物的整页面上,如在两个不同地方(比如两个不同页面或一个页面上的不同位置)需播的语句发音是完全相同的,那也应把这两个语句分别定义成两个不同语句,可对这两个语句分别编码以生成各自的拼合信息的值,这两个拼合信息的值可分别放在不同的码字里或共处于同一个码字里。On the entire page of an audio book, if the pronunciation of the sentence to be played in two different places (such as two different pages or different positions on the same page) is exactly the same, then these two sentences should also be defined separately The two sentences can be coded separately to generate the value of the combined information, and the values of the two combined information can be placed in different codewords or in the same codeword.

本段内容适于印欧语系各语种和非印欧语系各语种和任一复合语种:对于实际中的语句中的各词,其中有部分词是需要重读的,而且具体的语句中哪些词需重读并无固定规则,但是作为语言学习之用,如语句中的各词的读音是平缓的方式也可接受,在解码时总采用直接产音方式,不区分重读,轻读,当然如非要做到语句中的有些词是重读有些词是不重读,也是可以的,一种解决方案就如在前面叙述特殊音码的处理时是把重读(或轻读)当成特殊音码对待;另一种解决方案,比如:主要使用单词来编码而不用单词组合,在编码时,可在一个处理单位的最后附加一个重读信息段,信息段的BIT位长等于单词段里的单词个数,信息段里每个BIT对应单词段里的一个单词,BIT为0表示此单词不重读(正常发音),BIT为1表示此单词需重读,如解码时采用直接发音方式,发任一个单词音时,结合看重读信息段里的信息,如需重读的,就可采取在播放此单词的音时,将此单词语音波形的振幅加大,起到重读的效果(反之,如把语音波形的振幅减小,就起到轻读的效果);如解码时采用TTS发音方式,对于TTS器件(比如TTS芯片),如其有重读功能的,也可按重读信息段的内容在要用于TTS器件(比如TTS芯片)发音的语句的预备发音数据(或部分预备发音数据)中插入有关重读的符号(当然如是只播出语句的非全部发音,那可能是把语句的预备发音数据的部分传给TTS器件)。(对于次重读,轻读等处理可与本段对重读的处理类似,比如可在一个处理单位的最后的重读信息段后面再附加一个轻读信息段;本发明文件所称词的不重读是指正常的发音,词的重读是指在正常发音的基础上重读,次重读是指在正常发音的基础上次重读,轻读是指在正常发音的基础上轻读。)The content of this paragraph is suitable for all Indo-European languages and non-Indo-European languages and any compound language: For each word in the actual sentence, some of the words need to be re-read, and which words in the specific sentence need to be re-read. There is no fixed rule for stress, but for language learning, it is acceptable if the pronunciation of each word in the sentence is gentle. When decoding, the direct sound production method is always used, without distinguishing between stress and light reading. Of course, if you have to Accomplish some words in the sentence to be to stress and some words are not to stress, also possible, a kind of solution is to treat stress (or light reading) as special phonetic sign indicating number when narrating the processing of special phonetic sign indicating number in front; A solution, such as: mainly use words to encode instead of word combinations. When encoding, a reread information segment can be added at the end of a processing unit. The BIT bit length of the information segment is equal to the number of words in the word segment, and the information segment Each BIT in the word corresponds to a word in the word segment. A BIT of 0 means that the word is not accented (normal pronunciation), and a BIT of 1 means that the word needs to be accented. For example, when decoding, the direct pronunciation method is used. When sending any word sound, combine Pay attention to the information in the re-read information section, if you need to re-read, you can increase the amplitude of the voice waveform of this word when the sound of this word is played, so as to achieve the effect of re-reading (on the contrary, reduce the amplitude of the voice waveform , just play the effect of light reading); if the TTS pronunciation mode is adopted during decoding, for TTS devices (such as TTS chips), if there is a re-reading function, the content of the re-reading information segment can also be used in TTS devices (such as TTS Chip) in the preparatory pronunciation data (or part of the preparatory pronunciation data) of the sentence that is pronounced, inserts the symbol about rereading (certainly as only broadcasting the non-full pronunciation of the sentence, it may be that the part of the preparatory pronunciation data of the sentence is passed to the TTS device) . (for the second stress reading, the processing such as light reading can be similar to the processing of this paragraph to stress reading, such as can add a light reading information segment after the last stress information segment of a processing unit; The non-stress reading of the said word of the present invention document is Refers to the normal pronunciation, the stress of the word refers to the stress on the basis of the normal pronunciation, the secondary stress refers to the secondary stress on the basis of the normal pronunciation, and the light reading refers to the light reading on the basis of the normal pronunciation.)

对于印欧语系里各语种和非印欧语系里各语种和任一复合语种的语句里的数字,虽然在字面上是阿拉伯数字,但其发音仍可看成是单词(词)的发音,所以语句里的数字可对应成一个或多个单词(词)来作编解码处理(可把数字涉及的发音都当成单词或词音码,相应的可以在单词数据库或词数据库里把这些涉及数字发音的音码在不同的记录项里记录下来),这都属于常识。比如汉语里数字涉及的发音有:0,1,2,3,4,5,6,7,8,9,点,十,百,千,万等,可把这些都当成词(音码),语句中数字的发音就由这些词的发音组成。For the numbers in the sentences of various languages in the Indo-European language family and non-Indo-European languages and any compound language, although they are literally Arabic numerals, their pronunciation can still be regarded as the pronunciation of words (words), so The number in the sentence can be corresponding to one or more words (words) to do encoding and decoding processing (can regard the pronunciation that number relates to as word or word sound code, correspondingly these can relate to number pronunciation in word database or word database The phonetic sign indicating number is recorded in different recording items), and this all belongs to common sense. For example, the pronunciations involved in numbers in Chinese are: 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, dot, ten, hundred, thousand, ten thousand, etc., which can be regarded as words (sound codes) , the pronunciation of numbers in the sentence is formed by the pronunciation of these words.

如语音编码有声读物的整页面上有对应播同一语种语音的码字时,一个拾音器内也可能有只适用于此一个语种的一套解码程序及解码数据,只要各出版社对各种语音编码有声读物的整页面上的各码字涉及的此语种的拼合信息的值编码使用统一的编码规则,各拾音器生产厂生产的各种拾音器对此语种的拼合信息的值的解码使用同样的解码规则,并且拼合信息的值的解码规则是与编码规则有关此语种对应的解码规则,那么拾音器可实现各出版社的各种语音编码有声读物与各种拾音器任意通读使用。如语音编码有声读物的整页面上有对应播不同语种语音的码字时,一个拾音器内也可能有只适用于各自语种的数套解码程序及解码数据,只要各出版社对各种语音编码有声读物的整页面上的各码字涉及的相同语种的拼合信息的值编码使用统一的编码规则,各拾音器生产厂生产的各种拾音器对相同语种的拼合信息的值的解码使用同样的解码规则,并且各语种的拼合信息的值的编码规则与解码规则是对应的,那么拾音器仍可实现各出版社的各种语音编码有声读物与各种拾音器任意通读使用。而要做到这点,对拾音器来说,对其读码头并没限制,如果语音编码有声读物采用的是条形码,那读码头用各种光电式读码头都可,如果语音编码有声读物采用的是导电码,那读码头用光电式读码头或机电式导电码读码头都可,并且如拾音器使用光电式读码头的话,那可把此读码头做成只可读条形码,另也可把此读码头做成既可读条形码,又可读导电码(一般在导电码码字上需加有光电读取的起始标志),读入的值,用同样一个解码规则进行解码。拾音器一般可做成这样:各拾音器的解码程序及解码数据是一样的,而读码头可变种类,可以是机电式导电码读码头,读取导电码,也可以是光电式读码头,可读取导电码或条形码,还可以是其他读码头;读入的值经过去校验等处理后的有效值(拼合信息的值),用拾音器的解码程序解码播出语句发音。本段以上所述的拾音器是以非插卡方式拾音器来说的。其实拾音器仍可以是插数据卡方式拾音器(或插部件式数据卡方式拾音器),比如一个数据卡上存储有只适用于某一语种的解码程序及解码数据,那使用时,将此卡插入拾音器主体成为拾音器,由拾音器调卡上数据和程序来对读码得到的拼合信息的值进行解码,并且此插数据卡方式拾音器,可以通读各种语音编码有声读物的整页面上的与卡适用的语种相同语种的码字(或其码字涉及的各拼合信息的值中有此语种的拼合信息的值的码字),并且读这些语音编码有声读物都不需再换卡;而如换插入含有只适用于另一个语种的解码程序及解码数据的数据卡后,就可通读各本语音编码有声读物的整页面上的另一语种的码字(或其码字涉及的各拼合信息的值中有此语种的拼合信息的值的码字),并且读这些语音编码有声读物不用换卡。(为了实现拾音器与语音编码有声读物的通读使用,一般还需统一定义数据信息在各码字上的存在格式,比如拼合信息的值在条形码的码字上从第几个BIT开始放,或在导电码的码字上从第几个BIT开始放等,这只是常识手法)。本段所述适用于印欧语系各语种和非印欧语系各语种和各复合语种。本段所述的(以只适用于一个语种为例的)编解码规则可以是直播方式或TTS方式或混合方式的;本段所述的拾音器,可以是采用直接产音方式的,也可以是采用TTS产音方式的,也可以是既采用直接产音方式又采用TTS产音方式的,如是采用直接产音方式的,那在存储器集(比如拾音器主体内的存储器或数据卡上的存储器)里一般应存储有诸音码的发音数据,如是采用TTS产音方式的,那拾音器主体内应有TTS器件。(本发明文件各处所说的与编码规则有关某一语种对应的解码规则,隐含有有关此语种的编码数据与解码数据对应的意思,因为解码数据是根据解码规则生成的,编码数据是根据编码规则生成的。同理,所说的各解码规则相同或兼容也是包括解码数据相同或兼容的,各编码规则相同或兼容也是包括编码数据相同或兼容的,这里说的编码规则和解码规则是只适用于一个语种的。本发明文件各处所说的‘拼合信息的值的编码规则’显然是指用以产生此拼合信息的值的编码规则,本发明文件各处所说的‘拼合信息的值的解码规则’显然是指可用以对此拼合信息的值进行解码的解码规则。)If there are codewords corresponding to the voice of the same language on the entire page of the voice-coded audiobook, there may also be a set of decoding programs and decoding data that are only applicable to this language in a pickup, as long as each publishing house has a code for each voice code. The codewords on the entire page of the audiobook use a unified coding rule for encoding the value of the combined information in this language, and the various pickups produced by each pickup manufacturer use the same decoding rule for decoding the value of the combined information in this language , and the decoding rule of the value of the merged information is related to the encoding rule and corresponds to the language, then the pickup can be used for arbitrary reading of various speech coded audiobooks of various publishers and various pickups. If there are codewords corresponding to the voices of different languages on the entire page of the voice coded audiobook, there may also be several sets of decoding programs and decoding data that are only applicable to their respective languages in a pickup, as long as each publishing house has a voice code for each voice code. The value encoding of the combined information of the same language involved in each code word on the entire page of the reading material uses a unified coding rule, and the various pickups produced by each pickup manufacturer use the same decoding rule for the decoding of the value of the combined information of the same language. And the encoding rules and decoding rules of the value of the spliced information in each language correspond to each other, so the pickup can still be used for arbitrary reading of various speech coded audiobooks of various publishers and various pickups. And to do this, for the pickup, there is no limit to its reading dock. If the voice coding audiobook adopts a bar code, then the reading dock can be read with various photoelectric reading docks. If the voice coding audiobook adopts If it is a conductive code, then the reading terminal can be read with a photoelectric type reading terminal or an electromechanical conductive code reading terminal, and if the pickup uses a photoelectric type reading terminal, then this reading terminal can be made to only read barcodes, and this can also be used The reading terminal is made to read both the barcode and the conductive code (generally, a photoelectric reading start mark needs to be added on the conductive code code word), and the read value is decoded with the same decoding rule. Pickups can generally be made like this: the decoding program and decoding data of each pickup are the same, and the types of reading docks can be variable, which can be electromechanical conductive code reading docks to read conductive codes, or photoelectric reading docks, which can be read Get conduction code or bar code, also can be other reading docks; The value read in is through the effective value (the value of splicing information) after the processing such as checking, decodes and broadcasts the sentence pronunciation with the decoding program of pickup. The pickups mentioned above in this paragraph are non-card pickups. In fact, the pickup can still be a pickup with a data card (or a pickup with a plug-in data card). For example, a data card stores a decoding program and decoding data that are only applicable to a certain language. When using it, insert this card into the pickup. The main body becomes a pickup, and the pickup reads the data and programs on the card to decode the value of the combined information obtained by reading the code, and the pickup in the way of inserting the data card can read through the entire page of various voice-coded audiobooks that are applicable to the card The codewords of the same language (or the codewords with the value of the spliced information of this language in the value of each spliced information involved in the codeword), and reading these speech coded audiobooks do not need to change the card again; After the data card containing the decoding program and decoding data applicable only to another language, you can read through the codewords of another language (or the value of each spliced information involved in the codewords) on the entire page of each speech coded audiobook There is a codeword of the value of the mosaic information in this language), and reading these speech coded audiobooks does not need to change the card. (In order to realize the read-through use of the pickup and the voice-coded audiobook, it is generally necessary to uniformly define the existence format of the data information on each codeword, such as the value of the combined information on the codeword of the barcode from which BIT is put, or in The code word of the conductive code starts from the first BIT, etc., this is just a common sense method). This paragraph applies to all Indo-European languages and non-Indo-European languages and compound languages. The encoding and decoding rules described in this paragraph (taking only one language as an example) can be in live mode or TTS mode or mixed mode; The TTS sound production method can also be adopted both the direct sound production method and the TTS sound production method. If the direct sound production method is adopted, then the memory set (such as the memory in the main body of the pickup or the memory on the data card) Generally, the pronunciation data of all sound codes should be stored in it. If the TTS sound production method is adopted, there should be a TTS device in the main body of the pickup. (the decoding rules corresponding to a certain language related to the coding rules mentioned in the documents of the present invention imply that the coding data of this language corresponds to the decoding data, because the decoding data is generated according to the decoding rules, and the coding data is based on Generated by encoding rules. Similarly, the same or compatible decoding rules also include the same or compatible decoding data, and the same or compatible encoding rules also include the same or compatible encoding data. The encoding rules and decoding rules mentioned here are Only be applicable to a kind of language. The said " coding rule of the value of assembling information " of the said document of the present invention everywhere obviously refers to the coding rule in order to produce the value of this assembling information, and the said " value of assembling information " of the document of the present invention everywhere The decoding rules' clearly refer to the decoding rules that can be used to decode the value of this flattened information.)

上一段述的编码规则或解码规则都是以只适用于一个语种为例的,实际上显然也可以是编码规则或解码规则是适用于多个语种的,适用于多个语种时,对于一语种,只要各有声读物的码字涉及的此语种的拼合信息的值都是用同一编码规则中有关此语种的编码细则来编码生成的,各拾音器对有声读物的码字涉及的此语种的拼合信息的值的解码是按照同一解码规则中有关此语种的解码细则来解码的,而且此解码规则与此编码规则有关此语种是对应的,那仍可实现各有声读物与拾音器的任意通读使用。The encoding rules or decoding rules mentioned in the previous paragraph are all applicable to one language as an example. In fact, it is obvious that the encoding rules or decoding rules are applicable to multiple languages. When applicable to multiple languages, for one language , as long as the value of the mosaic information of this language involved in the codewords of each audiobook is encoded and generated by the coding rules related to this language in the same coding rule, each pickup will be able to match the mosaic information of this language involved in the codewords of the audiobook The decoding of the value of is based on the decoding rules for this language in the same decoding rule, and this decoding rule is corresponding to this language related to this encoding rule, so it can still be used for arbitrary reading of audiobooks and pickups.

以上2段说的有声读物与拾音器的任意通读使用,显然要求拾音器的读码头应可读取有声读物的整页面上的所有码字或部分码字。显然,上两段说的拾音器和有声读物的通读是指拾音器能读取有声读物的整页面上的所有码字或只能读部分码字并解码发音。以上2段说的解码规则与编码规则对应是指专门对应,一般对应的情况与专门对应的情况类似。实际的情况可能比以上2段所述复杂,比如对于某语种,不同读物的码字涉及的此语种的拼合信息的值是用不同的编码规则中有关此语种的编码细则来编码生成的,这些不同读物的码字又可被同一拾音器按同一解码规则中有关此语种的解码细则来解码,或者,比如对于某语种,不同读物的码字涉及的此语种的拼合信息的值是用同一编码规则中有关此语种的编码细则来编码生成的,这些不同读物的码字又可被同一拾音器按同一解码规则中有关另一语种的解码细则来解码(比如上两句说的某语种是英语,这句说的另一语种是英语加汉语的复合语种)。The audiobook mentioned in the above 2 paragraphs and the random reading of the pickup obviously require that the reading terminal of the pickup should be able to read all or part of the codewords on the entire page of the audiobook. Obviously, the pickup and audiobook read-through mentioned in the last two paragraphs means that the pickup can read all the codewords on the entire page of the audiobook or can only read part of the codewords and decode the pronunciation. The correspondence between decoding rules and encoding rules mentioned in the above two paragraphs refers to special correspondence, and the situation of general correspondence is similar to that of special correspondence. The actual situation may be more complicated than that described in the above two paragraphs. For example, for a certain language, the value of the spliced information of this language involved in the code words of different reading materials is encoded by the coding rules related to this language in different coding rules. These The codewords of different readings can be decoded by the same pickup according to the decoding rules related to this language in the same decoding rule, or, for example, for a certain language, the value of the combined information of this language involved in the codewords of different readings is to use the same coding rule The code words of these different readings can be decoded by the same pickup according to the decoding rules of another language in the same decoding rule (for example, the certain language mentioned in the previous two sentences is English, this The other language spoken in the sentence is the compound language of English and Chinese).

本说明书中所阐述的语音编码有声读物及拾音器,其基本功能是在此有声读物各页面上有诸个码字,诸个码字中每个码字的值是共与数个拼合信息的值有关的,使用时由拾音器通过读码头读码并解码发音,此功能在本说明书前面已反复说明;而语音编码有声读物和拾音器的具体应用范围主要有(以下三条都适用于印欧语系各语种和非印欧语系各语种和任一复合语种,以下三条涉及的各码字,一般是一个码字的值有一个表示语句发音的拼合信息的值):The basic function of the voice coded audiobook and the pickup described in this manual is that there are codewords on each page of the audiobook, and the value of each codeword in the codewords is the value of several combined information Related, when in use, the pick-up reads the code and decodes the pronunciation through the reading dock. This function has been repeatedly explained in the front of this manual; and the specific application scope of the voice-coded audiobook and the pick-up mainly includes (the following three are applicable to all languages of the Indo-European language family. And non-Indo-European languages and any compound language, the code words involved in the following three, generally the value of a code word has a value representing the combined information of the pronunciation of the sentence):

1:码字印在幼儿看的图文卡片上,或幼儿读物上,由拾音器读码播出语句发音,其涉及的内容不一定局限于语言学习,可以是认物品,数个数等,作为幼儿的启蒙读物。1: The code word is printed on the graphic card that the child reads, or on the child's reading material, and the pronunciation of the sentence is broadcast by the pickup device reading the code. The content involved is not necessarily limited to language learning, it can be recognition of objects, counting numbers, etc. Introductory reading for young children.

2:码字印在初级语言教材或读物上,作为语言学习的入门读物(比如《英语学习入门教材》,《韩语入门》等),因涉及语句一般很简单,相应的拾音器的解码程序及解码数据也可比较简单。2: The code words are printed on elementary language teaching materials or reading materials as introductory reading materials for language learning (such as "Introduction to English Learning Textbook", "Introduction to Korean", etc.), because the sentences involved are generally very simple, the corresponding decoding program and decoding of the pickup Data can also be relatively simple.

3:码字印在较高年级(比如初中英语,高中英语,大学英语,大学韩语等)的听力教材或听力训练读物上,将数个表示每几行或每数十行的听力语句发音的拼合信息的值放到一个码字,各码字分布在各自需发声的页面上,拾音器读取一个码字并根据此码字涉及的全部数个拼合信息的值来解码播出听力语音(或连续读多个码字,让拾音器根据这多个码字涉及的全部数个拼合信息的值连续解码播出听力语音),这样就可达到使用听力磁带的效果,成为听力教学的辅助工具,这种方式涉及的语句复杂,编码也复杂,拾音器的解码程序比较复杂,拾音器涉及的解码用的数据也较多。3: The code words are printed on the listening textbooks or listening training reading materials of higher grades (such as junior high school English, high school English, college English, college Korean, etc.), and several lines that represent the pronunciation of listening sentences every few lines or every dozens of lines The value of the mosaic information is put into a code word, and each code word is distributed on the page that needs to be uttered separately, and the pickup reads a code word and decodes and broadcasts the listening voice (or Continuously read multiple codewords, let the pickup continuously decode and broadcast the listening voice according to the values of all the combined information involved in the multiple codewords), so that the effect of using listening tapes can be achieved and become an auxiliary tool for listening teaching. The sentences that a kind of mode involves are complicated, and coding is also complicated, and the decoding program of pickup is more complicated, and the data that the decoding that pickup involves is also more.

关于方言,对于印欧语系某语种的一种方言,显然也是属于印欧语系,可把各方言分别当成单独的语种,对其编解码可参照本说明书前面阐述的直播方式(或TTS方式)的以只适用于英语为例的编解码规则,或可把方言看成是官方用语的等同替换,方言与官方用语在发音及语法等方面很相似,对其编解码可参照官方用语的编解码规则;比如低地德语是德语的方言,低地德语是属于印欧语系的,可把低地德语当成印欧语系的一个独立语种,其编解码规则可以参照本说明书前面阐述的直播方式(或TTS方式)的以只适用于英语为例的编解码规则原理来设计产生,低地德语与官方德语的差别不大,低地德语也可看成是官方德语的等同替换,其编解码规则,也可以是直接把直播方式(或TTS方式或混合方式)的只适用于官方德语的编解码规则作些修改再拿来当成低地德语的编解码规则。同理,对于非印欧语系某语种的一种方言,显然也是属于非印欧语系,可把各方言分别当成单独的语种,对其编解码可参照本说明书前面阐述的直播方式(或TTS方式)的以只适用于汉语为例的编解码规则,或可把方言看成是官方用语的等同替换,方言与官方用语在发音及语法等方面很相似,对其编解码可参照官方用语的编解码规则;比如广东话是汉语的方言,广东话是属于非印欧语系的,可把广东话当成非印欧语系的一个独立语种,其编解码规则可以参照本说明书前面阐述的直播方式(或TTS方式)的以只适用于汉语为例的编解码规则原理来设计产生,广东话与官方汉语的差别不大,广东话也可看成是官方汉语的等同替换,其编解码规则,也可以是直接把直播方式(或TTS方式或混合方式)的只适用于官方汉语的编解码规则(本说明书前面反复举例说明的)作些修改再拿来当成广东话的编解码规则;本段述的编码规则或解码规则都是以只适用于一个语种为例的,对适用于多个语种的编码规则,此编码规则中有关某方言语种的编码细则可与本段前面述的只适用于此一个方言语种的编码规则中的编码细则相同,对适用于多个语种的解码规则,此解码规则中有关某方言语种的解码细则可与本段前面述的只适用于此一个方言语种的解码规则中的解码细则相同;如复合语种包含方言语种的话,那其涉及的编解码处理也可参照本段所述,本段所述都是显而易见的常识。Regarding dialects, for a dialect of a certain language in the Indo-European language family, it obviously also belongs to the Indo-European language family. Each dialect can be regarded as a separate language, and its encoding and decoding can refer to the live broadcast method (or TTS method) explained earlier in this manual. Taking the encoding and decoding rules that only apply to English as an example, dialects may be regarded as equivalent replacements for official language. Dialects and official language are very similar in pronunciation and grammar, and their encoding and decoding can refer to the encoding and decoding rules of official language. For example, Low German is a dialect of German, and Low German belongs to the Indo-European language family. Low German can be regarded as an independent language of the Indo-European language family. It is designed and produced based on the principle of codec rules that only apply to English as an example. The difference between Low German and official German is not big. Low German can also be regarded as an equivalent replacement of official German. The codec rules can also be directly broadcast Mode (or TTS mode or mixed mode) codec rules that only apply to official German are modified and then used as Low German codec rules. Similarly, for a dialect of a certain language that is not an Indo-European language, it obviously also belongs to a non-Indo-European language. Each dialect can be regarded as a separate language, and its encoding and decoding can refer to the live broadcast method (or TTS method) explained earlier in this manual. )’s encoding and decoding rules that are only applicable to Chinese as an example, or dialects can be regarded as equivalent replacements for official languages. Decoding rules; for example, Cantonese is a dialect of Chinese, and Cantonese belongs to the non-Indo-European language family. Cantonese can be regarded as an independent language of the non-Indo-European language family. Its encoding and decoding rules can refer to the live broadcast method described earlier in this manual (or TTS mode) is designed and produced based on the codec rules that are only applicable to Chinese as an example. The difference between Cantonese and official Chinese is not large. Cantonese can also be regarded as an equivalent replacement of official Chinese. Its codec rules can also be It is to directly modify the encoding and decoding rules that are only applicable to official Chinese in the live broadcast mode (or TTS mode or mixed mode) (repeatedly exemplified in the front of this manual) and then use them as Cantonese encoding and decoding rules; Encoding rules or decoding rules are only applicable to one language as an example. For encoding rules applicable to multiple languages, the encoding rules related to a certain dialect in this encoding rule can be the same as those mentioned above in this paragraph that are only applicable to this one. The coding rules in the coding rules for dialects are the same. For the decoding rules applicable to multiple languages, the decoding rules for a certain dialect in this decoding rule can be the same as the decoding rules for only one dialect mentioned earlier in this paragraph. The decoding rules are the same; if the compound language includes dialect languages, then the encoding and decoding processing involved can also refer to the description in this paragraph, which is obvious common sense.

本发明文件各处所称的一个码字的值,可以是数个拼合信息的值等情况;如果编码者将这数个拼合信息的值(原来的值),全部进行码变换,比如加密处理等后成为新的值(此值一般仍是二进制值,可放入导电码或条形码等码字里;可把新的值称为原来值的变换值);可将新的值放入码字里,在MCU读码处理时,读得的此码字的值里的新的值经相应转换(逆变换)后仍可得到原来的值(这里定义的码变换,逆变换是这样的:要使用新的值的所有BIT才能作逆变换,不能对新的值的部分BIT来作逆变换以得到部分原来的值),显然,码字的值是新的值后,仍是一个码字的值是共与数个拼合信息的值有关的这种说法的一种具体表现形式(每个这样的码字仍属于有声读物整页面上的诸个码字的范围),仍可把一些这样的码字定义成诸个码字中的某拾音器能读的部诸码字,此部诸码字中的各码字是可被此拾音器读取以用于此拾音器解码播出语句发音的,此拾音器可读此部诸码字中的数个码字并根据读得的码字的码值里的新的值来作转换并用于播出语句发音,所以新的值是原来的值的等同替换。同样对数个拼合信息的值的部分(或数个拼合信息的值加上数个拼合信息的值的部分)也可按这样作码变换得到新的值。如原来的值是数个拼合信息的值,对原来的值进行码变换后得到新的值,将新的值放入码字里,假设码字里不再放别的值,码字也应被当成是有数个(此‘数个’是与从此句起往前数第5句的‘数个’是相同所指)拼合信息的值;如原来的值是数个拼合信息的值的部分,对原来的值进行码变换后得到新的值,将新的值放入码字里,假设码字里不再放别的值,码字也应被当成是有数个(此‘数个’是与从此句起往前数第5句的‘数个’是相同所指)拼合信息的值的部分,其它情况类推。如原来的值是数个拼合信息的值(或包括数个拼合信息的值的值),对原来的值进行码变换后得到新的值,将此新的值分别放在多个码字里,那每个码字的值仍是与这数个拼合信息的值有关的(每个这样的码字仍属于有声读物整页面上的诸个码字的范围),应当成是每个码字的值有数个(此‘数个’是与从此句起往前数第5句的‘数个’是相同所指)拼合信息的值的部分,这种情况与不作变换的情况仍是等同替换。(本发明文件其它段所说的‘变换值’就是指按本段这样定义的新的值)The value of a codeword referred to in the documents of the present invention can be the values of several pieces of combined information and the like; Then it becomes a new value (this value is generally still a binary value, which can be put into a code word such as a conductive code or a bar code; the new value can be called the transformed value of the original value); the new value can be put into the code word , when the MCU reads the code, the new value in the read value of the codeword can still get the original value after corresponding conversion (inverse transformation) (the code transformation defined here, the inverse transformation is like this: to use Only all BITs of the new value can be inversely transformed, and part of the BIT of the new value cannot be inversely transformed to obtain part of the original value). Obviously, after the value of the codeword is a new value, it is still the value of a codeword is a concrete manifestation of the statement that the value of several stitched information is related (each such codeword still belongs to the range of codewords on the entire page of the audiobook), and some such codewords can still be combined Word is defined as all codewords that can be read by a certain pick-up in the codewords, each codeword in this part of all codewords can be read by this pick-up in order to be used for this pick-up to decode and broadcast sentence pronunciation, this pick-up Several codewords in all codewords of this part can be read and converted according to the new value in the code value of the codewords read and used for broadcasting sentence pronunciation, so the new value is an equivalent replacement of the original value. Similarly, the value part of several combined information (or the value of several combined information plus the part of several combined information values) can also be coded and transformed in this way to obtain a new value. If the original value is the value of several merged information, the new value is obtained after the code transformation of the original value, and the new value is put into the code word. Assuming that no other values are put in the code word, the code word should also be It is regarded as having several (this 'several' is the same as the 'several' in the fifth sentence from this sentence) the value of the combined information; if the original value is part of the value of several combined information , to obtain a new value after performing code transformation on the original value, and put the new value into the code word. Assuming that no other values are placed in the code word, the code word should also be regarded as having several (the 'several' It refers to the same as "several" in the 5th sentence from this sentence) the part of the value of the combined information, and the other cases are analogized. If the original value is the value of several pieces of combined information (or includes the value of several pieces of combined information), a new value is obtained after performing code transformation on the original value, and the new value is placed in multiple codewords respectively , then the value of each codeword is still related to the value of these pieces of combined information (each such codeword still belongs to the range of codewords on the entire page of the audiobook), it should be that each codeword There are several values (this 'several' is the same as the 'several' in the fifth sentence from this sentence) to the value part of the combined information. This situation is still equivalent to the case of no transformation . ('transformation value' said in other paragraphs of the documents of the present invention refers to the new value defined in this paragraph)

本说明书以上所述码字的值有数个拼合信息的值等情况,是针对一个码字来说的,一般情况,只将一个拼合信息的值放入一个码字里,这时此一个码字的值有一个拼合信息的值,拾音器读取一个码字就可处理解码播出语句发音。如果拼合信息的值的BIT长度大于单个码字的BIT码长,那就需要把拼合信息的值的BIT分开放在多个码字里,每个码字的值有拼合信息的值的BIT的一部分,这多个码字在有声读物的整页面上所处位置一般是在印刷着这多个码字对应播的语句发音的文字的旁边,这多个码字在有声读物的页面上一般是印刷排得比较近的并且这多个码字是处于同一页面上,这多个码字也可能分别处于不同页面上,拾音器需按约定(可以在各码字里开一个或多个BIT来表示需读多个码)读取这多个码字(拾音器一般是依次读取这多个码字),然后把读得的这多个码字的值组合起来,得到拼合信息的值,再对此拼合信息的值进行解码播出语句发音。这种情况这多个码字中的各码字仍然是可被相应拾音器读取以用于拾音器播出语句发音,一个码字的码值仍是共与数个(以上是以一个为例的)表示如何用音码拼合成语句发音的拼合信息的值有关的(本段前面述的是一个码字的值放了前面述的拼合信息的值的BIT的部分),此码字仍属于有声读物整页面上的诸个码字的范围,可把一些这样的码字定义成有声读物整页面上的诸个码字中的某拾音器能读的部诸码字,此部诸码字中的各码字是可被此拾音器读取以用于此拾音器解码播出语句发音的,此拾音器可读此部诸码字中的数个码字来得到拼合信息的值来解码播出语句发音;这种情况拾音器读码后解码时播放语句发音仍可以是用直接发音方式或TTS发音方式等发音方式的。对于码字与其它表示与语句发音有关的值有关的情况是与此类似的,比如可把数个拼合信息的值的变换值放在多个码字里。甚至可把一个拼合信息的值(或数个拼合信息的值的变换值)放在多个不同码种的码字里,这多个码字又可被同一拾音器读取以用于解码播出语句发音,这多个码字可以属于此拾音器能读的部诸码字,比如这多个码字中既有导电码又有条形码,可由有光电式读码头的拾音器读取解码播放语句发音。而且显然在一本有声读物的整页面上的各码字,既可有其一个码字的值就是放了拼合信息的值的所有BIT这样的码字,也可有其一个码字的值放了拼合信息的值的BIT的部分这样的码字,而且这两种情况的码字都可被一种拾音器读取,分别处理发音(拾音器每次读一个码字或读多个码字以处理发音)。(关于把拼合信息的值的BIT放在多个码字里的情况,还可在这多个码字中的全部或部分码字里另外再放另外的拼合信息的值)。权利要求11,20,23里有关‘只需从前述一个码字即可得到前述拼合信息的值1或需从包括前述一个码字在内的所述部诸码字中的多个码字才能得到前述拼合信息的值1’的说法以及权利要求3里有关‘只需从所述一个码字即可得到所述拼合信息的值1或需从包括所述一个码字在内的所述部诸码字中的多个码字才能得到所述拼合信息的值1’的说法以及有关拾音器根据读取的一个码字的码值或多个码字的码值来得到拼合信息的值并解码使发音器件播出语句发音的技术点是与本段内容有关的。本说明书中各处述及的有关各种有声读物形式,拾音器主体与卡的配合形式,有声读物与拾音器的配合使用形式等内容中各处涉及读一个码字就可解码播出语句发音的地方,都可用每次读多个码字来解码播出语句发音的情况来做替换,比如原来有声读物的一页面上有一个码字,码字的值是一个拼合信息的值,拾音器读这个码字就可解码播出语句发音,现把这一个码字换成印刷两个码字,把那个拼合信息的值的BIT分开放在这两个码字里,原来的拾音器读这两个码字后可解码播出语句发音。The value of the above-mentioned code word in this manual has several values of combined information, etc., which is for one code word. In general, only the value of one combined information is put into a code word. At this time, this code word The value of has a value of combining information, and the pickup reads a code word to process the pronunciation of the decoded broadcast sentence. If the BIT length of the combined information value is greater than the BIT code length of a single codeword, then the BIT of the combined information value needs to be separated into multiple codewords, and the value of each codeword has the BIT of the combined information value In one part, the position of these multiple codewords on the entire page of the audiobook is generally next to the text that the multiple codewords correspond to the pronunciation of the sentence broadcast. These multiple codewords are generally on the page of the audiobook. The printing arrangement is relatively close and these multiple codewords are on the same page, and these multiple codewords may also be on different pages respectively, and the pickup needs to be indicated by agreement (one or more BITs can be opened in each codeword) need to read multiple codes) to read these multiple codewords (the pickup generally reads these multiple codewords in sequence), then combine the values of the multiple codewords read to obtain the value of the combined information, and then The value of this stitching information is decoded to play out the sentence pronunciation. In this case, each codeword in these multiple codewords still can be read by the corresponding pickup for the pickup to broadcast the sentence pronunciation, and the code value of a codeword is still a total of several (the above is an example with one ) represents how to use the phonetic code to spell the value of the combined information of the pronunciation of the sentence (the above-mentioned is that the value of a code word has put the part of the BIT of the value of the aforementioned combined information), and this code word still belongs to the sound The range of code words on the entire page of the reading material can be defined as some code words that can be read by a certain pickup in the code words on the entire page of the audiobook. Each codeword can be read by the pickup to be used for the pickup to decode and broadcast the pronunciation of the sentence, and the pickup can read several codewords in the codewords to obtain the value of the combined information to decode the pronunciation of the broadcast sentence; In this case, the playback statement pronunciation can still be with pronunciation modes such as direct pronunciation mode or TTS pronunciation mode when decoding after the pickup reads the code. It is similar to this for the case that the codeword is related to other values that represent the pronunciation of the sentence. For example, the transformed values of several combined information values can be placed in multiple codewords. It is even possible to put the value of one pieced information (or the transformation value of several pieces of combined information) in multiple codewords of different code types, and these multiple codewords can be read by the same pickup for decoding and broadcasting Sentence pronunciation, these multiple code words can belong to the part all code words that this pickup can read, such as both conductive code and bar code are arranged in these multiple code words, can be read and decoded and played sentence pronunciation by the pickup that has photoelectric type reading terminal. And obviously each codeword on the whole page of an audio book can have the value of one codeword that is to put all the codewords of the value of the combined information BIT, and also can have the value of one codeword to put codewords such as the part of the BIT of the value of the combined information, and the codewords in both cases can be read by a pickup, and the pronunciation is processed separately (the pickup reads one codeword at a time or reads a plurality of codewords to process pronounce). (about the situation that the BIT of the value of the combined information is placed in a plurality of codewords, it is also possible to put another value of the combined information in all or part of the codewords in these multiple codewords).Claims 11, 20, and 23 relate to 'only need to obtain thevalue 1 of the aforementioned combined information from the aforementioned one codeword or need to obtain a plurality of codewords in the described codewords including the aforementioned one codeword. Obtaining thevalue 1 of the aforementioned combined information andclaim 3 about 'thevalue 1 of the combined information can be obtained only from the one codeword or from the part including the one codeword A plurality of codewords in all codewords just can obtain the value 1 ' of described mosaic information and relevant pickup obtains the value of mosaic information and decodes according to the code value of a codeword or the code value of a plurality of codewords read The technical point of making the pronunciation device broadcast the sentence pronunciation is related to the content of this paragraph. Various forms of audiobooks mentioned in this manual, the cooperation form of the main body of the pickup and the card, the cooperation form of the audiobook and the pickup, etc., refer to the place where reading a code word can decode the pronunciation of the broadcast sentence. , can be replaced by reading multiple codewords each time to decode the pronunciation of the broadcast sentence. For example, there is a codeword on a page of the original audiobook, and the value of the codeword is a value of a combined information. The pickup reads this code Words can be decoded to broadcast the pronunciation of the sentence. Now replace this codeword with two codewords for printing, and put the BIT of the value of the combined information into these two codewords separately. The original pickup reads these two codewords After that, the pronunciation of the broadcast sentence can be decoded.

另外,本发明文件说的拾音器需读多个码字才能解码播放语句发音,还可能是这种情况:即本来读一个码字就可解码播放语句发音(比如一个码字的值是一个拼合信息的值),但在码字里还有标志BIT表示还需读其它码字,然后用拾音器读此码字后,拾音器可根据标志BIT来提示(比如语音提示)使用者再读其它数个码字,使用者用拾音器再读其它码字后,一共得到多个拼合信息的值,拾音器再按这多个拼合信息的值来解码播放多个语句发音(播放多个语句发音包含播放语句发音的情况)。In addition, the pickup that the document of the present invention says needs to read a plurality of codewords and could decode the pronunciation of the playback sentence, and it may also be the case: that is, reading a codeword originally can decode the playback sentence pronunciation (such as the value of a codeword is a combination of information value), but there is also a sign BIT in the codeword to indicate that other codewords need to be read, and then after reading the codeword with the pickup, the pickup can prompt (such as voice prompts) the user to read other codewords according to the sign BIT After the user reads other codewords with the pickup, the value of a plurality of combined information is obtained in total, and the pickup decodes and plays multiple sentence pronunciations according to the values of the multiple combination information (playing multiple sentence pronunciations includes playing sentence pronunciations) Condition).

本发明文件各处提及的‘码字的值,是与唯一拼合信息的值有关的’(或类似提法,这里的码字是指单个码字),就是指此码字的值只与此拼合信息的值有关而不与其它拼合信息的值有关的,一般是指此码字的值,有此拼合信息的值或有此拼合信息的值的部分,一个码字的值是与唯一的拼合信息的值有关的具体形式除可以是以上这些形式外,还可以是其它形式。'The value of the codeword is related to the value of the unique combination information' (or a similar formulation, the codeword here refers to a single codeword) mentioned in the documents of the present invention everywhere, means that the value of this codeword is only related to the value of the codeword The value of this combined information is not related to the value of other combined information. It generally refers to the value of this code word, the value of this combined information or the part with the value of this combined information. The value of a code word is related to the unique In addition to the above forms, the specific forms related to the value of the combined information can also be other forms.

在权利要求2中提及(以及本说明书各处类似提及)的‘前述部诸码字中的每个码字可被前述拾音器读取以用于前述拾音器播出语句发音’,是上位概念,具体的对于部诸码字中的每个码字一般可有两种情况(还可有其它情况):1是从这一个码字就可得到拼合信息的值,拾音器读取有声读物的整页面上的这一个码字得到此拼合信息的值,并对此拼合信息的值执行解码播出语句发音(这种情况,可以是只把此一个拼合信息的值的BIT放在此一个码字里,或,在此码字的值里放有此一个拼合信息的值的BIT和别的拼合信息的值的BIT),2是对于一个拼合信息的值,需读包括此码字在内的多个码字才能得到此拼合信息的值,可以是把此一个拼合信息的值的BIT分散放在这多个码字里,拾音器需读取有声读物的整页面上的这多个码字并根据这多个码字合起来得到的被分散放置在这多个码字里的此拼合信息的值,并对此拼合信息的值执行解码播出语句发音(这种情况,可以是只把此一个拼合信息的值的BIT分散放在这多个码字里,或,把此一个拼合信息的值的BIT分散放在这多个码字里而且在这多个码字中各码字的值里还放有别的拼合信息的值的BIT)。最简单的形式可以是:一本有声读物的整页面上的各码字的码值都是有一个拼合信息的值(不同码字的码值有的拼合信息的值互不同),这些码字可分别被同一个拾音器读取,每次此拾音器读取一个码字以播出一个语句发音。(本段内后面提及的两个‘播出语句发音’都是指本段内前面提及的‘播出语句发音’)Mentioned in claim 2 (and similarly mentioned throughout this specification) 'each codeword in the aforementioned codewords can be read by the aforementioned pickup for the pronunciation of the sentence pronounced by the aforementioned pickup' is a superordinate concept Concretely can have two kinds of situations (also can have other situation) for each codeword in all codewords specifically: 1 just can obtain the value of stitching information from this codeword, and the pickup reads the whole audiobook This code word on the page obtains the value of this combined information, and performs decoding on the value of this combined information and broadcasts the sentence pronunciation (in this case, it can be that only the BIT of the value of this combined information is placed in this code word In, or, put the BIT of the value of this piece of information and the BIT of the value of other pieces of information in the value of this codeword), 2 is for the value of a piece of information, need to read including this codeword Only multiple codewords can get the value of this combined information, and the BIT of the value of this combined information can be scattered in these multiple codewords, and the pickup needs to read these multiple codewords on the entire page of the audiobook and The value of this mosaic information that is dispersedly placed in these a plurality of codewords obtained according to these a plurality of code words is put together, and this value of this mosaic information is carried out decoding broadcast sentence pronunciation (in this case, can be only this The BIT of the value of a pieced information is scattered in these multiple codewords, or the BIT of the value of this pieced together information is scattered in these multiple codewords and the value of each codeword in these multiple codewords There is also a BIT with the value of other combined information). The simplest form can be: the code value of each codeword on the whole page of an audio book all has the value of a splicing information (the code values of different codewords have different values of splicing information), these codewords Can be read by the same pickup, each time the pickup reads a code word to play a sentence pronunciation. (The two "pronunciation of broadcast sentences" mentioned later in this paragraph both refer to the "pronunciation of broadcast sentences" mentioned earlier in this paragraph)

本发明文件中各处所涉及的拼合信息的值(或其变换值)主要是以连续二进制格式为例的,实际上,还可有别的格式,比如可以把拼合信息的值(或其变换值)的二进制BIT分拆开,分散放在一个码字(比如导电码)里的各部分(比如码字里还有一些别的信息的BIT),解码时按约定把各分散开的拼合信息的值(或其变换值)组合起来,甚至不用组合,直接按约定读各分散开的拼合信息的值(或其变换值)来处理解码即可,此种格式是连续二进制格式的等同替换;使用别的格式是常规手法。The value (or its transformation value) of the information of joining together involved in the document of the present invention is mainly based on continuous binary format as an example, in fact, there are other formats, such as the value of the joining information (or its transformation value) can be )’s binary BIT is split apart and scattered into each part of a codeword (such as a conductive code) (for example, there are some BITs of other information in the codeword). Values (or their transformation values) can be combined, even without combination, just read the values (or their transformation values) of the scattered information directly according to the agreement to process and decode. This format is an equivalent replacement of the continuous binary format; use Other formats are conventional.

拾音器可以是非插卡方式拾音器(本说明书前面阐述的有关图1或图2的拾音器主体的实施例也即是非插卡方式拾音器),与拾音器有关的存储器集,是处于拾音器主体内的(在拾音器主体内,MCU芯片内外都可能有存储器);另一种方案,拾音器可以是插数据卡方式拾音器,可做出可插入拾音器主体(如提及时无指明,本段以下提及的拾音器主体是指插数据卡方式拾音器主体)并可从拾音器主体拔出的数据卡,将此数据卡插入此拾音器主体后此数据卡与此拾音器主体合为一个插数据卡方式拾音器,此数据卡上有存储器,此数据卡上的存储器里可以存储解码有关程序及解码有关数据(如拾音器采用直接产音方式那一般还应在数据卡上的存储器里存储诸音码的发音数据),拾音器主体有一个卡插槽(如有插槽,那一般是把插槽做在拾音器主体的机体上),拾音器主体内仍包括有单片机芯片,读码头和发音器件,这三者之间的连接关系(可能这三者在拾音器主体内本来已连接,或者可能是要等把数据卡插入拾音器主体后,通过数据卡上的某些连线才使这三者连接起来)同于非插卡方式拾音器主体内单片机芯片,读码头和发音器件三者之间的连接关系(本行和以上2行提到的连接都是指意向连接,本行和以上3行提到的连接关系都是指意向连接的关系),读码使用时把数据卡插入拾音器主体;可针对不同语种做不同的卡(只适用于某一个语种的卡上的存储器里存储有只适用于相应语种的解码程序及解码数据,不同语音编码有声读物的整页面上的相同语种的码字可使用同一个只适用于此语种的卡,如拾音器采用直接产音方式那一般还应在数据卡上的存储器里存储此语种的诸音码的发音数据),其它读码,拾音器运行主控,发音等程序可仍放在拾音器主体内的存储器里;显然采用直接产音方式的插数据卡方式拾音器的各项技术特征(比如软件的读码及解码处理,硬件特征等)都同于本说明书述及的采用直接产音方式的非插卡方式拾音器的各项技术特征,显然采用TTS产音方式的插数据卡方式拾音器的各项技术特征(比如软件的读码及解码处理,硬件特征等)都同于本说明书述及的采用TTS产音方式的非插卡方式拾音器的各项技术特征。语音编码有声读物的整页面上的诸个码字中每个码字的值是共与数个拼合信息的值有关的,使用时可把只适用于要读取的语音编码有声读物的整页面上的码字涉及的拼合信息的值的语种(一般同一码字涉及的各拼合信息的值的语种是相同语种,即码字是此语种的码字)的数据卡插入拾音器主体使数据卡和拾音器主体合为一个拾音器,由拾音器通过读码头读取语音编码有声读物的整页面上的诸个码字中的部诸码字中的一个或多个码字,并由拾音器的MCU调数据卡上的相应解码程序及解码数据来对从这一个或多个码字的值得到的此语种的拼合信息的值进行解码播出语句发音(一般就是由拾音器读取部诸码字中的此语种的一个或多个码字,并根据码字的码值得到拼合信息的值来解码播出语句发音)。数据卡上的存储器类型可以是ROM等。插数据卡方式拾音器,其解码涉及的产音方式可以是直接产音方式(一般其发音是用主MCU发音或用主MCU外接的语音芯片来发音的),也可以是TTS产音方式,因为对于插数据卡方式拾音器,显然仍可以在其拾音器主体的电路中使用TTS器件(比如TTS芯片)来发音,而且这样一来拾音器主体内和数据卡上的存储器里就不需存诸音码的发音数据和发音程序,显然一个插数据卡方式拾音器甚至既采用直接产音方式又采用TTS产音方式(一般此拾音器内有由多个子发音器件组成的一个发音器件,这多个子发音器件中一般包括作为一个子发音器件的带喇叭的TTS芯片,此拾音器的其它硬件特点与采用直接产音方式或采用TTS产音方式的拾音器类似)且此拾音器的解码处理参照采用直接产音方式的拾音器和采用TTS产音方式的拾音器的解码处理即可;对于插数据卡方式拾音器,其读码头可以是机电式导电码读码头(用于读取语音编码有声读物的整页面上的导电码)或光电式读码头(用于读取语音编码有声读物的整页面上的导电码或条形码)等,下面一个实例是以读码头是机电式导电码甲读码头来举例的,相应的语音编码有声读物的整页面上的码字可以是导电码或条形码等。对于以上所述拾音器,解码有关程序及解码有关数据一般是都存储在数据卡上,但也可能出现其它情况比如解码有关程序及解码有关数据中的一部分存储在数据卡上,另一部分存储在拾音器主体内的存储器里。对于本段内以上述的包括硬软件处置的插数据卡的方案,更极端的,甚至拾音器主体内可无存储器(或拾音器主体内有存储器比如MCU内有存储器但不用其存储程序和数据),而把拾音器涉及的所有程序和所有数据,比如读码程序,拾音器运行主控程序,解码有关程序及解码有关数据(如拾音器采用直接产音方式那一般还应包括发音程序及诸音码的发音数据)等都存储在数据卡上的存储器里(并且仍可以是针对不同语种做不同的卡,即只适用于某一语种的卡上的存储器里存储有只适用于相应语种的解码程序及解码数据);使用时,把卡插入拾音器主体使卡和拾音器主体合为一个拾音器,由拾音器运行卡上的各程序,作读码操作,并调卡上的解码程序及解码数据来解码播出语句发音;而且同前述的一样,这种把拾音器涉及的所有程序和所有数据都放在数据卡上的方式,可适用于采用直接产音方式(一般其发音是用主MCU发音或用主MCU外接的语音芯片来发音的)的拾音器,也可适用于采用TTS产音方式的拾音器(如是TTS产音方式,拾音器主体内应有TTS器件比如TTS芯片,并且存储器集里不需存储发音程序及诸音码的发音数据),也可适用于既采用直接产音方式又采用TTS产音方式的拾音器。对于本段以上所述插数据卡方式拾音器,可能包括在与拾音器有关的存储器集里的可以是单片机芯片内的存储器,拾音器主体内(非单片机芯片内)的存储器,数据卡上的存储器等(当然如这些存储器中有些不存在就不会被包括在存储器集里),可把只适用于一个语种或适用于多个语种的解码有关程序及解码有关数据存储在存储器集里。本段以上所说的拾音器的插卡方式主要是针对不同语种做不同的卡(一种卡只适用于一个语种),实际上也可以做成一种卡可适用于多个语种,即比如一个卡上的存储器里存储有只适用于各自语种的各解码程序及解码数据(适用于不同语种的解码程序一般是互相独立的,适用于不同语种的解码数据一般也是互相独立的)或存储有可适用于多个语种的一套解码程序及解码数据(如拾音器是采用直接产音方式,那一般还需把这多个语种中各自语种的诸音码的发音数据存储在此卡上),把一个此种卡插入拾音器主体使卡和拾音器主体合为一个拾音器后,拾音器可读取多个语种的码字并分别处理解码播出语句发音或读取其同一码字涉及的各拼合信息的值是不同语种的码字来播出各语种的语句发音,这种拾音器(其插的卡是一个卡可适用于多个语种的)可以是采用直接产音方式或采用TTS产音方式或既采用直接产音方式又采用TTS产音方式的。本段以上所述的数据卡可以看成是与拾音器主体配套的(比如卡上存储有只适用于某一语种或适用于多个语种的解码有关程序及解码有关数据,本段以上也主要是按这样来述的)或可看成卡是与语音编码有声读物配套的(比如对于同一语种的不同年级的学生读物,不同年级有声读物配的卡上只存储有只适用于此语种的解码程序而且各卡的解码程序相同,但存储的解码数据不同比如词数据库里的词数量不同),另外数据卡也可在出售时既不与拾音器主体配套也不与有声读物配套。(注:本发明文件所说的数据卡,语音编码有声读物,插数据卡方式拾音器主体是3种不同产品,数据卡可以与语音编码有声读物配套或与拾音器主体配套,配套一般是指配套出售;本段是只述及一个插数据卡方式拾音器主体只插一个数据卡的情况的;本段各处说到的单个的语种比如说到的某一语种,可以是任一非复合语种或任一复合语种,本段各处说到的多个语种,可以全是非复合语种或全是复合语种或既包含非复合语种也包含复合语种)Pickup can be a non-card mode pickup (the embodiment of the pickup body of Fig. 1 or Fig. 2 described above in this specification is also a non-card mode pickup), and the memory set relevant to the pickup is in the pickup body (in the pickup body). In the main body, there may be memory inside and outside the MCU chip); another solution, the pickup can be a pickup in the form of a data card, and can be inserted into the pickup body (if there is no indication when mentioned, the pickup body mentioned below in this paragraph refers to Insert data card mode pickup body) and can pull out the data card from the pickup body, after inserting this data card into the pickup body, the data card and the pickup body are combined into a data card insertion mode pickup, and there is a memory on the data card, The memory on the data card can store decoding related programs and decoding related data (if the pickup adopts a direct sound production method, the pronunciation data of various sound codes should generally be stored in the memory on the data card), and the main body of the pickup has a card slot Slot (if there is a slot, the slot is generally made on the body of the pickup body), the pickup body still includes a single-chip microcomputer chip, a reading port and a pronunciation device, the connection relationship between the three (possibly these three It is already connected in the main body of the pickup, or it may be that after the data card is inserted into the main body of the pickup, the three are connected through some connections on the data card. Read the connection relationship between the terminal and the pronunciation device (the connections mentioned in this line and the above 2 lines all refer to the connection of intention, and the connection relations mentioned in this line and the above 3 lines all refer to the relationship of intention connection), read When using the code, insert the data card into the main body of the pickup; different cards can be made for different languages (the memory on the card that is only applicable to a certain language stores a decoding program and decoding data that are only applicable to the corresponding language, and audiobooks with different speech codes The codewords of the same language on the entire page can use the same card that is only applicable to this language. If the pickup adopts the direct sound production method, the pronunciation data of the various phonetic codes of this language should generally be stored in the memory on the data card. ), other code reading, pickup operation main control, pronunciation and other programs can still be placed in the memory in the pickup main body; obviously adopt the various technical characteristics of the pickup in the way of directly producing sound (such as code reading and decoding of software) Processing, hardware features, etc.) are all the same as the various technical features of the non-card mode pickup that adopts the direct sound production mode mentioned in this manual, and obviously adopt the various technical features of the data card mode pickup of the TTS sound production mode (such as Software code reading and decoding processing, hardware features, etc.) are all the same as the technical features of the non-insert-card mode pickup that adopts the TTS sound production mode mentioned in this specification sheet. The value of each codeword in the codewords on the entire page of the speech-encoded audiobook is related to the value of several pieced together information. When used, it can be used only for the entire page of the speech-encoded audiobook to be read. Insert the data card of the value of the combined information involved in the above code into the main body of the pickup to make the data card and The main body of the pickup is combined into a pickup, and the pickup reads one or more codewords of all the codewords on the entire page of the voice-coded audiobook through the reading dock, and the MCU of the pickup reads the data card Corresponding decoding program and decoding data on the value of the combination information of this language obtained from the value of one or more codewords are decoded and broadcast sentence pronunciation (generally this language in all codewords by the pickup reading part One or more codewords of the codeword, and according to the code value of the codeword, obtain the value of the combined information to decode and broadcast the sentence pronunciation). The memory type on the data card may be ROM or the like. For the data card mode pickup, the sound production method involved in the decoding can be a direct sound production method (generally, the pronunciation is pronounced by the main MCU or an external voice chip connected to the main MCU), or it can be a TTS sound production method, because For the pickup of the data card mode, it is obviously still possible to use a TTS device (such as a TTS chip) in the circuit of the pickup body to pronounce sounds, and in this way, there is no need to store all sound codes in the pickup body and the memory on the data card. Pronunciation data and pronunciation program, obviously a data card mode pickup not only adopts the direct sound production mode but also adopts the TTS sound production mode (generally, there is a pronunciation device composed of a plurality of sub-pronunciation devices in this pickup device, generally in these multiple sub-pronunciation devices Including a TTS chip with a speaker as a sub-pronunciation device, other hardware features of this pickup are similar to pickups using direct sound production or TTS sound production) and the decoding process of this pickup refers to pickups that use direct sound production and The decoding process of the pickup using the TTS sound production method is sufficient; for the pickup of the data card mode, the reading terminal can be an electromechanical conductive code reading terminal (used to read the conductive code on the entire page of the voice-coded audiobook) or photoelectric Type reading terminal (used to read the conductive code or barcode on the entire page of the voice-coded audiobook), etc., the following example is an example where the reading terminal is an electromechanical conductive code A-reading terminal, and the corresponding voice-coded audiobook The code words on the entire page can be conductive codes or barcodes, etc. For the above-mentioned pickups, the decoding-related programs and decoding-related data are generally stored on the data card, but other situations may also occur such as a part of the decoding-related programs and decoding-related data is stored on the data card, and the other part is stored in the pickup. in the main memory. For the above-mentioned scheme of inserting a data card including hardware and software in this paragraph, even more extreme, there may be no memory in the main body of the pickup (or there is a memory in the main body of the pickup, such as a memory in the MCU, but it is not used to store programs and data), And all programs and all data that pickup relates to, such as code reading program, pickup runs main control program, decodes relevant program and decodes relevant data (if pickup adopts the direct sound mode that generally also should include pronunciation program and the pronunciation of all sound codes data), etc. are all stored in the memory on the data card (and it is still possible to make different cards for different languages, that is, the memory on the card that is only applicable to a certain language stores the decoding program and decoding that is only applicable to the corresponding language. data); when in use, insert the card into the main body of the pickup so that the card and the main body of the pickup are combined into one pickup, and the pickup runs the programs on the card for code reading operations, and adjusts the decoding program and decoding data on the card to decode the broadcast statement Pronunciation; and the same as the aforementioned, this method that all programs related to the pickup and all data are placed on the data card is applicable to adopting a direct sound production mode (generally, its pronunciation is to be pronounced with the main MCU or externally connected with the main MCU voice chip to pronounce), also applicable to the pickup that adopts the TTS sound production method (such as the TTS sound production method, there should be a TTS device such as a TTS chip in the pickup main body, and there is no need to store the pronunciation program and all sounds in the memory set. The pronunciation data of sign indicating number), also can be applicable to the pick-up that not only adopts direct sound production mode but also adopts TTS sound production mode. For the above-mentioned plug-in data card mode pickup of this paragraph, what may be included in the memory set relevant to the pickup can be the memory in the single-chip microcomputer chip, the memory in the pickup main body (not in the single-chip microcomputer chip), the memory on the data card, etc. ( Of course, if some of these memories do not exist, they will not be included in the memory set), and the decoding-related programs and decoding-related data that are only applicable to one language or multiple languages can be stored in the memory set. The card insertion method of the pickup mentioned above in this paragraph is mainly to make different cards for different languages (one card is only suitable for one language), in fact, it can also be made into one card that can be used for multiple languages, that is, for example The memory on the card stores the decoding programs and decoding data that are only applicable to their respective languages (the decoding programs applicable to different languages are generally independent of each other, and the decoding data applicable to different languages are generally independent of each other) or stores data that can be A set of decoding programs and decoding data applicable to multiple languages (if the pickup adopts a direct sound production method, it is generally necessary to store the pronunciation data of the various phonetic codes of the respective languages in these multiple languages on this card), put One such card is inserted into the main body of the pickup so that the card and the pickup body are combined into one pickup, and the pickup can read codewords in multiple languages and process and decode the pronunciation of the broadcast sentence or read the value of each combined information involved in the same codeword It is the code words of different languages to broadcast the pronunciation of sentences in various languages. This kind of pickup (the card inserted in it is a card that can be applied to multiple languages) can adopt the direct sound production method or the TTS sound production method or both The direct sound production method also adopts the TTS sound production method. The data card described above in this paragraph can be regarded as supporting with the pickup main body (for example, there are decoding related programs and decoding related data that are only applicable to a certain language or applicable to multiple languages on the card, and the above paragraph is also mainly According to this description) or it can be regarded as a card that is matched with speech coded audiobooks (for example, for students’ books of different grades in the same language, only the decoding program applicable to this language is stored on the card with audiobooks of different grades) And the decoding program of each card is the same, but the decoding data stored is different (for example, the number of words in the word database is different), and the data card can neither be matched with the pickup main body nor be matched with the audiobook when selling in addition. (Note: the data card mentioned in the document of the present invention, voice coded audiobook, and the main body of the pickup in the way of inserting the data card are 3 different products, and the data card can be matched with the voice coded audiobook or with the main body of the pickup, and matching generally refers to matching for sale ; This paragraph only refers to the situation that a data card mode pickup body only inserts a data card; the single language mentioned everywhere in this paragraph, such as a certain language, can be any non-composite language or any A compound language, the multiple languages mentioned in this paragraph can be all non-compound languages or all compound languages or include both non-compound languages and compound languages)

附图6是使用机电式导电码读码头的拾音器的一个实施例的电路图,3是MCU,MCU型号选用SNC710,其配有24个IO脚,具备播放语音的功能,32K*16 BIT的数据及程序存储器,此芯片还可外接存储器;其拾音器运行主控程序和读码程序长度较少所以放在SNC710片内;发音器件为喇叭(喇叭是子发音器件),两个按键可分别作为音量循环调节键和工作唤醒休眠键(这两个按键组成按键模块);本实施例的拾音器解码时总是采用直接产音方式的;1是通断式机电读码头,由甲感应排14,甲扫描排15(只含一根扫描线)和甲电阻排16组成,如图6下面部分所示是甲感应排14和甲扫描排15的平面布置图,一般还需与连接介质(斑马条7)一起装配成读码头(空间结构参照图5),甲电阻排16上的每个电阻可取几K欧以限流,也可取消此电阻排,甲扫描排15上唯一的一个读码扫描线直接接在正电源上(也可接一个MCU的IO脚由此IO脚来输出电流,此两种方式是等同的),用于读取如图4示形式的单排码线单码段甲导电码(可定义本例的此甲导电码的码线区不使用通断代名码数,即按每一个码线触区位置旁挨一个空地布置,任一个码线触区位置有码线就表示此码线触区位置对应的BIT是1,任一个码线触区位置无码线就表示此码线触区位置对应的BIT是0,图4所示甲导电码表示的码值是1101111,图4上面1-7的数字标号是用以示意7个码线触区位置的,一个数字标号下方可能被一个码线占据的范围,就是一个码线触区位置;另,本例的此甲导电码的码线区也可定义成使用通断代名码数的,通断代名码数的原理可参看2006102005537号申请),与此拾音器配合使用的语音编码有声读物的整页面上只有一些如图4示形式的甲导电码(语音编码有声读物可如图10下半部分所示,页面上有图4示形式的甲导电码20);读码时,使读码头1与语音编码有声读物的整页面上的图4示形式的甲导电码接触,甲扫描排15通过斑马条与甲导电码的甲码盘12接触,甲感应排14通过斑马条与甲导电码的码线区13接触,电流从处于高电平的唯一的扫描线,注入到甲导电码上的甲码盘12,然后在甲导电码码线区13上各码线触区位置中有码线(BIT为1)的码线触区位置,电流就通过码线,再通过斑马条,再通过甲感应排14的相对应的感应线流回进入两片4021的并行输入脚,在甲导电码码线区13上各码线触区位置中无码线(BIT为0)的码线触区位置,就没有电流从甲感应排14上与这些无码线的码线触区位置相对应的感应线流回进入两片4021的并行输入脚,两片4021读得的甲感应排14的数据最后以串行数据形式被MCU获取,导电码的检测原理及过程还可参见2006102005537号申请对导电码的技术阐述。为了检测更可靠,可以在两片4021的并行输入脚的各脚加下拉100K电阻。此例单码段甲导电码的码线区有7个码线触区位置,如不带校验位,可表示7个有效BIT,而一般表示语句发音的拼合信息的值,至少要四,五十个BIT,为了解决码太短的问题,可以在读物的页面上需发音的语句旁,多印几个位置近靠的此种如图4的甲导电码,把表示语句发音的拼合信息的值分开放置在这几个导电码的码值里。读码时,用拾音器的读码头连续顺序读取这几个导电码,再由MCU按约定(可以在各导电码的码值里开一个或多个BIT标志表示是否需读多个码来播出语句发音)把码值组合起来得到拼合信息的值,解码播出语句发音;原理上可行,当然这有些不方便。Accompanying drawing 6 is the circuit diagram of an embodiment of the pick-up that uses electromechanical conduction code to read dock, and 3 is MCU, and MCU model selects SNC710 for use, and it is equipped with 24 IO pins, has the function of playing voice, the data of 32K*16 BIT and Program memory, this chip can also be connected to an external memory; the pickup runs the main control program and the code reading program is short in length, so it is placed in the SNC710 chip; the pronunciation device is a speaker (the speaker is a sub-pronunciation device), and the two buttons can be used as volume cycles respectively. Regulating key and work wake-up sleep key (these two buttons form key module); Always adopt the direct sound production mode when the pick-up of the present embodiment is decoded; Row 15 (containing only one scanning line) and a resistance row 16, as shown in the lower part of Fig. 6 is a planar layout of a sensing row 14 and a scanning row 15, and generally needs to be connected with a connecting medium (zebra strip 7) Assembled together into a reading dock (refer to Figure 5 for the spatial structure), each resistor on the first resistor row 16 can be a few K ohms to limit the current, and this resistor row can also be cancelled, and the only code-reading scanning line on the first scanning row 15 directly Connect to the positive power supply (you can also connect an IO pin of an MCU to output current from this IO pin, the two methods are equivalent), used to read the single-row code line single-code segment A conduction as shown in Figure 4 Code (can define the code line area of this first conduction code of this example not to use on-off code name code number, promptly press each code line touch zone position next to next to an open space layout, any code line touch zone position has code line and just represents The BIT corresponding to the touch area position of this code line is 1. If there is no code line at any touch area position of the code line, it means that the BIT corresponding to the touch area position of the code line is 0. The code value represented by the first conductive code shown in Figure 4 is 1101111. 4. The numbers 1-7 above are used to indicate the position of the 7 code line contact areas. The area below a number label that may be occupied by a code line is the position of a code line contact area; The code line area of sign indicating number also can be defined as using on-off code name code number, the principle of on-off code name code number can refer to No. 2006102005537 application), and on the whole page of the voice coding audiobook used in conjunction with this sound pick-up, there are only some as shown in Fig. 4 A conduction code in the form shown (the voice-coded audiobook can be shown in the lower part of Figure 10, and the first conduction code 20 in the form shown in Figure 4 is arranged on the page); Figure 4 on the page shows the first conductive code contact of the form, the first scanning row 15 is in contact with the first code plate 12 of the first conductive code through the zebra strip, and the first induction row 14 is in contact with the code line area 13 of the first conductive code through the zebra strip, and the current From the only scanning line at a high level, it is injected into the first code disc 12 on the first conductive code, and then there is a code line (BIT is 1) code in each code line touch area position on the first conductive code line area 13 When the line touches the area, the current passes through the code line, then through the zebra strip, and then flows back through the corresponding induction line of the first induction row 14 to enter the parallel input pins of the two 4021s, and each code on the first conductive code line area 13 In the position of the code line contact area without a code line (BIT is 0) in the position of the line contact area, there is no current flowing back into the two pieces of 4021 from the induction line corresponding to the position of the code line contact area without code line on the first induction row 14 Parallel input pins, the data of A sensor row 14 read by two 4021s are finally acquired by the MCU in the form of serial data. For the detection principle and process of the conductive code, please refer to the technical description of the conductive code in the application No. 2006102005537. In order to detect more reliably, you can add a pull-down 100K resistor to each of the parallel input pins of the two 4021s. In this example, the code line area of the single code segment A conductive code has 7 code line touch area positions. If there is no check digit, it can represent 7 effective BITs. Generally, the value of the combined information representing the pronunciation of a sentence must be at least four. Fifty BITs, in order to solve the problem that the code is too short, you can print a few more positions next to the sentence that needs to be pronounced on the page of the reading material, as shown in Figure 4. The value of is placed separately in the code values of these conductive codes. When reading the code, use the reading terminal of the pickup to read these conductive codes sequentially, and then the MCU will follow the agreement (you can open one or more BIT flags in the code value of each conductive code to indicate whether you need to read multiple codes to play) Pronunciation of the sentence) combine the code values to obtain the value of the combined information, decode and broadcast the pronunciation of the sentence; it is feasible in principle, but of course this is somewhat inconvenient.

此实施例的外接数据存储器6,是图6中的SR008(8M BIT存储空间),可以把用于拼合信息的值解码的解码程序及解码数据以及用于语句发音的诸音码的录音数据存放在外接数据存储器6上(其它的拾音器运行主控及读码等程序还是放在SNC710片内),外接数据存储器6可做成插卡形式(即把数据存储器6做在一个数据卡上),此卡可插入拾音器主体并可从拾音器主体拔出(当然这种插卡方式的与卡配合使用的拾音器主体要做有一个卡插槽,并且一般是把插槽做在拾音器主体机体上)。如做成插卡形式,即做成数据卡,可以每个语种做一种数据卡,卡上存储相应语种的拼合信息的值的解码程序及解码数据及诸音码的录音数据;假设本实施例的数据卡是只适用于汉语的数据卡,与此数据卡配套的如图10下半部分所示的有声读物的整页面上的每个如图4示形式的甲导电码码字的值有一个汉语语种的拼合信息的值的部分,这些拼合信息的值是按本说明书述的以只适用于汉语为例的编码规则来编码生成的;可把此有声读物的整页面上的所有如图4所示形式的导电码集合起来定义成诸个导电码,使用时,在拾音器主体上插入此只适用于汉语的数据卡使数据卡和拾音器主体合为一个拾音器(使外接数据存储器6与SNC710通过插槽连接起来),可由此拾音器利用读码头读取语音编码有声读物(如图10下半部分所示)的整页面上的诸个码字中的部诸码字中的汉语语种的如图4示形式的多个甲导电码(本来此拾音器能读这诸个码字中的每个导电码且可按码值播放语句发音,可把此有声读物的整页面上的诸个码字中的所有导电码定义成此拾音器能读的部诸码字),并由拾音器的MCU调用卡上的解码程序及解码数据(此解码程序是按本说明书述的以只适用于汉语为例的解码规则来解码的)及诸音码的录音数据(此诸音码的录音数据是按照本说明书第46页和第47页述的第1种方式形成的)中的一些音码(这里是单字拼音,单字拼音是元音辅音组合音码)的录音数据来对从这多个导电码码字得到的汉语语种的拼合信息的值进行解码播出语句发音(本实施例的拾音器在解码时采用整步直接发音方式,即连续播出多个音码的语音以形成语句发音,用SNC710驱动喇叭播音码的语音,发音器件是喇叭);这种形式,可以做成给每个拾音器主体配几个语种的卡(而不给语音编码有声读物配卡)。本实施例是从原理上说明外插数据卡的使用和导电码的读取使用。如提及时无特别指明有他义,本段以上提及的拾音器主体默认是指插数据卡方式拾音器主体。(图5是机电式导电码读码头在读码时的空间分解示意图,电流沿箭头路线流动,斑马条7沿虚线OX轴不导通电,沿虚线OY轴导通电,此是常识,扫描排15,感应排14是分别与两个斑马条7紧贴装配在读码头上的,20是图4所示的单排码线单码段甲导电码;图4中4个‘*’号围起来的矩形区域就是甲导电码的码线区13)Theexternal data memory 6 of this embodiment is SR008 (8M BIT storage space) among Fig. 6, can be used for the decoding procedure of the value decoding of assembling information and decoded data and the recording data of all phonetic codes for the pronunciation of sentences are stored On the external data memory 6 (programs such as other pickups running master control and code reading are still placed in the SNC710 chip), theexternal data memory 6 can be made into a plug-in card form (that is, thedata memory 6 is made on a data card), This card can be inserted into the main body of the pickup and can be pulled out from the main body of the pickup (of course, the main body of the pickup used in conjunction with the card should have a card slot, and generally the slot is made on the main body of the pickup). If it is made into a card form, that is, a data card can be made into a data card for each language, and the card stores the decoding program and decoding data of the value of the combined information of the corresponding language and the recording data of all sound codes; assuming this implementation The data card of example is the data card that is only applicable to Chinese, and the value of each conductive code word of the form shown in Figure 4 on the entire page of the audiobook as shown in the lower part of Figure 10 matched with this data card There is a part of the value of the mosaic information in the Chinese language, and the value of these mosaic information is coded and generated according to the encoding rules described in this manual, which are only applicable to Chinese as an example; The conductive codes of the form shown in Fig. 4 are assembled and defined as various conductive codes. During use, insert this data card that is only applicable to Chinese on the pickup main body so that the data card and the pickup main body are combined into a pickup (make theexternal data memory 6 and SNC710 is connected through a slot), and the pickup can use the reading dock to read the Chinese language in all the code words on the entire page of the speech coded audiobook (as shown in the lower part of Figure 10). A plurality of first conduction codes of the form shown in Figure 4 (originally this pick-up can read each conduction code in these code words and can play the sentence pronunciation by code value, can read all codes on the whole page of this audio book All the conductive codes in the word are defined as all the code words that the pickup can read), and the MCU of the pickup calls the decoding program and decoding data on the card (this decoding program is only applicable to Chinese as an example as described in this manual decoded) and some sound codes (here is Single character pinyin, single character pinyin is the recording data of vowel and consonant combined sound sign indicating number) to decode the value of the mosaic information of the Chinese language that obtains from these a plurality of conduction code codewords and broadcast sentence pronunciation (the pickup of the present embodiment when decoding The whole step direct pronunciation method is adopted, that is, the voice of multiple sound codes is broadcast continuously to form a sentence pronunciation, and the voice of the speaker broadcasting code is driven by the SNC710, and the pronunciation device is a speaker); this form can be made into a configuration for each pickup body Cards in several languages (without cards for speech-coded audiobooks). This embodiment is to explain the use of the external data card and the reading of the conduction code in principle. If there is no special meaning when it is mentioned, the main body of the pickup mentioned above in this paragraph refers to the main body of the pickup in the way of inserting a data card by default. (Figure 5 is a schematic diagram of the spatial decomposition of the electromechanical conductive code reading terminal when reading the code. The current flows along the arrow route, and thezebra strip 7 is not conducting electricity along the dotted line OX axis, but is conducting electricity along the dotted line OY axis. This is common sense, and thescanning row 15, theinduction row 14 is respectively closely assembled with twozebra strips 7 on the reading dock, and 20 is the single row code line single code segment A conductive code shown in Figure 4; 4 '*' numbers are surrounded in Figure 4 The rectangular area of is exactly the code line area 13) of first conduction code

本实施例以上述的方案中,数据存储器6做在一个数据卡上,那图6电路中除数据存储器6以外的部分,就是插数据卡方式拾音器主体,把数据卡(即数据存储器6)插入此拾音器主体就使卡与此拾音器主体合为本实施例以上提及的拾音器;所以,本实施例可被看成是拾音器的实施例。Present embodiment is with above-mentioned scheme, anddata memory 6 is done on a data card, and the part exceptdata memory 6 in that Fig. 6 circuit is exactly the pickup main body of inserting data card mode, data card (being data memory 6) is inserted This pick-up main body just makes card and this pick-up main body combine into the pick-up mentioned above of this embodiment; So, this embodiment can be regarded as the embodiment of pick-up.

对于插卡方式,还有一种变形(不是专指上面这个实例),就是把原拾音器主体内的部件(包括读码头,发音器件,单片机芯片;此单片机芯片是作为主处理芯片的单片机芯片)中的一些部件也可做在数据卡上而另一些部件仍做在拾音器主体内(即把这些部件分散布置在拾音器主体内和数据卡上,极端的,甚至可把这些部件都布置在数据卡上而拾音器主体内只有一些诸如供电电路的东西,使用时仍把卡插入拾音器主体),至于怎样把这些部件布置放在拾音器主体内和数据卡上是根据需要来定的(数据卡上至少有一个部件,拾音器主体内可没有部件),比如可把发音器件做在数据卡上,甚至主MCU,读码头等都可做在卡上,当然卡的接插口与拾音器主体的卡插槽需有这些部件的连接脚(卡插入拾音器主体的卡插槽后,使得拾音器主体内的部件与卡上的部件连接成一个完整电路),一般是把卡插槽做在拾音器主体上,卡可插入拾音器主体也可从拾音器主体拔出;可把这种有部件的数据卡称为部件式数据卡,拾音器主体的卡插槽也可相应被称为部件式卡插槽,这种拾音器主体可被称为插部件式数据卡方式拾音器主体,这种卡插入这种拾音器主体而形成的拾音器是插部件式数据卡方式拾音器(如提及时无特别指明有他义,本段以下提及的拾音器默认是指插部件式数据卡方式拾音器;如提及时无特别指明有他义,本段以下提及的拾音器主体默认是指插部件式数据卡方式拾音器主体);单片机芯片内可以有或没有存储器,单片机芯片外拾音器主体内可以有或没有存储器,部件式数据卡上仍有存储器(如卡上有单片机芯片,那卡上的存储器可以就是单片机芯片内的存储器),拾音器涉及的程序和拾音器涉及的数据存储在这些存储器里,而其中的解码有关程序及解码有关数据一般是存储在卡上的存储器里的(如拾音器采用直接产音方式那一般还应在卡上的存储器里存储诸音码的发音数据),并且这种卡可以看成是与拾音器主体配套的(比如卡上存储有只适用于某一语种或适用于多个语种的解码有关程序及解码有关数据)或可看成卡是与语音编码有声读物配套的(比如对于同一语种的不同年级的学生用的有声读物,不同年级有声读物配的卡上只存储有只适用于此语种的解码程序而且各卡的解码程序相同,但存储的解码数据不同比如单词数据库的单词数量不同,这时就应把卡与有声读物配套);这种方式的读码头仍然可以是机电式导电码读码头或光电式读码头等,相应的语音编码有声读物的整页面上的码字可以是导电码或条形码等,拾音器仍然可以是采用直接产音方式或采用TTS产音方式或既采用直接产音方式又采用TTS产音方式的;读码使用时,可把适用于某语种的部件式数据卡插入拾音器主体,这时卡与拾音器主体合为一个插部件式数据卡方式拾音器,拾音器主体内的部件与卡上的部件连接成一个完整电路,再可用此拾音器的读码头读语音编码有声读物的整页面上的诸个码字中的部诸码字中的数个码字得到此语种的拼合信息的值来解码播出语句发音(这里说的一个或多个码字中每个码字涉及的全部数个拼合信息的值中是有此语种的拼合信息的值的,可能可从这一个或多个码字得到一个或多个此语种的拼合信息的值,拾音器可调卡上的解码程序及解码数据对每个此语种的拼合信息的值解码播出语句发音)。权利要求22,23,24是与这种方式有关的,在权利要求22,23,24中有关单片机芯片与读码头和发音器件的意向连接,存储器的读取,用读码头读码,拼合信息的值,编码解码,存储器的布置等技术点的含义与权利要求10,11,12中涉及的这些技术点的含义相同。把卡插入拾音器主体使这两者合为一插部件式数据卡方式拾音器后,此拾音器在使用时仍需用到存储器集,此存储器集,是指拾音器在读码发音使用时(即在把部件式数据卡插入插部件式数据卡方式拾音器主体而形成拾音器后),作为拾音器的主处理芯片的单片机芯片(此单片机芯片本来可能在拾音器主体内或卡上)能读取的所有存储器;可能包括在存储器集里的可以有单片机芯片内的存储器,拾音器主体内(非单片机芯片内)的存储器,部件式数据卡上(非单片机芯片内)的存储器等(当然如这些存储器中有些在拾音器内不存在,那存储器集里也不包括这里提到的但实际不存在的存储器)。拾音器涉及的程序及拾音器涉及的数据存储在存储器集里,解码有关程序及解码有关数据一般是都存储在部件式数据卡上,但也可能出现其它情况比如解码有关程序及解码有关数据中的一部分存储在部件式数据卡上,另一部分存储在拾音器主体内的存储器里。对于在拾音器主体内的有些存储器,在读码使用时这些存储器能被单片机芯片读取,这些存储器属于存储器集,但可能没存储有用信息;但部件式数据卡一定有存储器存储有有用信息且此存储器一定属于本段前面述的存储器集。存储器集里可能存储有只适用于一个语种或适用于多个语种的解码有关程序及解码有关数据。如单片机芯片没位于部件式数据卡上而在拾音器主体内,那部件式数据卡上的存储器一般是存储器芯片,如单片机芯片位于部件式数据卡上,那部件式数据卡上的存储器可以是独立于单片机芯片外的存储器芯片或就是单片机芯片内的存储器或是指单片机芯片片内加上片外的存储器。(本发明文件本段以外各处提及的其前面没加‘部件式’来修饰的‘数据卡’是默认指没部件的数据卡;本说明书本段以外所提及的插卡方式拾音器是指插数据卡方式拾音器;本说明书各处提及的非插卡方式拾音器是指既不插数据卡也不插部件式数据卡方式的拾音器;如阐述的那一段落无特别指明有其他情况,本说明书本段以外的各段落有关拾音器的各技术阐述是默认只针对非插卡方式拾音器或插数据卡方式拾音器的;而插部件式数据卡方式拾音器与非插卡方式拾音器的硬件特点本质相同,插部件式数据卡方式拾音器的各项技术特征及其与语音编码有声读物的配合使用等都可参考本说明书对非插卡方式或插数据卡方式拾音器的各项技术特征及其与语音编码有声读物的配合使用的描述;本发明文件本段以外各处提及的其前面没加‘部件式’来修饰的‘卡插槽’是指数据卡的插槽)。为使叙述直观,本发明文件各处仍将这种方式的部件式数据卡插入拾音器主体形成拾音器后读码使用时单片机芯片(指作为主处理芯片的单片机芯片)用到的程序(这里说的程序一般包括读码程序,拾音器运行主控程序,发音程序,解码有关程序等)称为插部件式数据卡方式拾音器涉及的程序,将这种方式的部件式数据卡插入拾音器主体形成拾音器后读码使用时单片机芯片用到的数据(这里说的数据一般包括解码有关数据,还可能包括诸音码的发音数据)称为插部件式数据卡方式拾音器涉及的数据;部件式数据卡上的存储器里至少存储有部分拾音器涉及的程序或部分拾音器涉及的数据。本段是只述及一个插部件式数据卡方式拾音器主体只插一个部件式数据卡的情况的。本发明文件所说的部件式数据卡,语音编码有声读物,插部件式数据卡方式拾音器主体是3种不同产品。(插部件式数据卡方式拾音器可以是采用直接产音方式或TTS产音方式或既采用直接产音方式又采用TTS产音方式的;本段各处说到的单个的语种比如说到的某一语种,可以是任一非复合语种或任一复合语种,本段各处说到的多个语种,可以全是非复合语种或全是复合语种或既包含非复合语种也包含复合语种)For the card insertion method, there is another kind of deformation (not specifically referring to the above example), which is to put the components in the main body of the original pickup (including the reading port, the pronunciation device, and the single-chip microcomputer chip; this single-chip microcomputer chip is the single-chip microcomputer chip as the main processing chip) Some parts can also be made on the data card while other parts are still made in the main body of the pickup (that is, these parts are scattered in the main body of the pickup and on the data card, and in extreme cases, even all these parts can be arranged on the data card And there are only some things such as power supply circuit in the main body of the pickup, and the card is still inserted into the main body of the pickup during use), as for how to arrange these parts in the main body of the pickup and on the data card, it is determined according to the needs (there is at least one on the data card) There are no components in the main body of the pickup), for example, the audio device can be made on the data card, even the main MCU, the reading port, etc. can be made on the card, of course, the card socket and the card slot of the pickup body must have these The connection pin of the component (after the card is inserted into the card slot of the pickup body, the components in the pickup body and the components on the card are connected to form a complete circuit), generally the card slot is made on the pickup body, and the card can be inserted into the pickup body It can also be pulled out from the main body of the pickup; this kind of data card with parts can be called a component data card, and the card slot of the main body of the pickup can also be called a component card slot accordingly, and the main body of this pickup can be called a Plug-in data card pickup body, the pickup formed by inserting this card into the pickup body is a plug-in data card pickup (if there is no special meaning when it is mentioned, the pickup mentioned below in this paragraph defaults to Plug-in data card mode pickup; if there is no special meaning when it is mentioned, the main body of the pickup mentioned below in this paragraph refers to the plug-in data card mode pickup body by default); there may or may not be memory in the single-chip microcomputer chip, and the single-chip microcomputer chip There may or may not be a memory in the main body of the external pickup, and there is still a memory on the component data card (such as a single-chip microcomputer chip on the card, the memory on the card can be the memory in the single-chip microcomputer chip), the program involved in the pickup and the data storage involved in the pickup In these memories, the relevant program of decoding and the relevant data of decoding are generally stored in the memory on the card (as the pickup adopts the mode of direct sound generation that generally also should store the pronunciation data of all sound codes in the memory on the card ), and this card can be seen as supporting the main body of the pickup (for example, there are decoding-related programs and decoding-related data that are only applicable to a certain language or applicable to multiple languages stored on the card) or can be regarded as a card that is compatible with the voice Encoded audiobooks (for example, for audiobooks used by students of different grades in the same language, the audiobooks of different grades only store the decoding program applicable to this language and the decoding programs of each card are the same, but the stored The decoding data is different, such as the number of words in the word database is different, at this time the card should be matched with the audiobook); the reading dock of this mode can still be an electromechanical conductive code reading dock or a photoelectric reading dock, etc., and the corresponding voice coding has a sound The code words on the entire page of the reading material can be conductive codes or bar codes, etc., and the pickup can still adopt the direct sound production method or the TTS sound production method or both the direct sound production method and the TTS sound production method; , the part-type data card suitable for a certain language can be inserted into the main body of the pickup. At this time, the card and the main body of the pickup are combined into a pickup in the form of a plug-in data card. The parts in the main body of the pickup and the parts on the card are connected to form a complete circuit, and then The reading terminal of this pickup can be used to read the code words of some of the code words on the entire page of the voice coded audiobook to obtain the value of the splicing information of this language to decode the pronunciation of the broadcast sentence (here said Among the values of all the combined information involved in each code word in one or more code words, there is a value of the combined information of this language, and one or more values of this language may be obtained from the one or more code words The value of the combined information, the decoding program and the decoded data on the pickup adjustable card decode the value of the combined information of each language and broadcast the sentence pronunciation). Claims 22, 23, and 24 are related to this method. In claims 22, 23, and 24, the intentional connection between the single-chip microcomputer chip and the reading dock and the pronunciation device, the reading of the memory, reading codes with the reading dock, and combining information The meanings of technical points such as the value of , encoding and decoding, and arrangement of memory are the same as those involved inclaims 10, 11, and 12. After the card is inserted into the main body of the pickup to make the two into a plug-in data card mode pickup, the pickup still needs to use a memory set when in use. After the type data card is inserted into the main body of the pickup of the plug-in type data card mode to form the pickup), the single-chip microcomputer chip (this single-chip microcomputer chip may have been in the pickup main body or on the card) as the main processing chip of the pickup can read all memories; may include Can have the memory in the single-chip microcomputer chip in memory set, the memory in the pick-up main body (in the non-single-chip microcomputer chip), the memory etc. on the component type data card (not in the single-chip microcomputer chip) exists, the storage set does not include the storage mentioned here but does not actually exist). The programs related to the pickup and the data related to the pickup are stored in the memory set, and the decoding-related programs and decoding-related data are generally stored on the component data card, but other situations such as decoding-related programs and a part of the decoding-related data may also occur One part is stored on the part data card and another part is stored in the memory inside the pickup body. For some memories in the main body of the pickup, these memories can be read by the single-chip microcomputer chip when reading the code. These memories belong to the memory set, but may not store useful information; but the component data card must have a memory to store useful information and this memory Must belong to the storage set mentioned earlier in this paragraph. Decoding-related programs and decoding-related data applicable to only one language or applicable to multiple languages may be stored in the memory set. If the single-chip microcomputer chip is not located on the component data card but in the main body of the pickup, the memory on the component data card is generally a memory chip. If the single-chip microcomputer chip is located on the component data card, the memory on the component data card can be an independent The memory chip outside the single-chip microcomputer chip is either the memory in the single-chip microcomputer chip or refers to the addition of off-chip memory in the single-chip microcomputer chip. (the 'data card' that does not add 'component type' to be modified in front of it mentioned in this paragraph of the document of the present invention refers to the data card without parts by default; Refers to the pickup of the data card method; the non-card pickup mentioned in this manual refers to the pickup of neither the data card nor the component data card; if there is no other situation specified in the paragraph described, this The technical descriptions of the pickups in the paragraphs other than this paragraph of the manual are only for the non-card pickup or the data card pickup by default; the hardware features of the plug-in data card pickup and the non-card pickup are essentially the same. For the various technical features of the plug-in data card pickup and its use with voice-coded audiobooks, you can refer to this manual for the technical features of the non-plug-in or data-card pickup and its connection with voice-coded audiobooks. The description of the combined use of the reading material; the 'card slot' mentioned in this document other than this paragraph, which is not modified by 'component type' in front of it, refers to the slot of the data card). In order to make narration intuitive, the document of the present invention still inserts the part-type data card of this mode everywhere into the pick-up main body to form the program that the single-chip microcomputer chip (referring to the single-chip microcomputer chip as the main processing chip) used when using the code reading after the pickup (said here The program generally includes the code reading program, the pickup runs the main control program, the pronunciation program, and the decoding related program, etc.) is called the program involved in the pickup of the plug-in data card method. The data used by the single-chip microcomputer chip when the code is used (the data said here generally includes decoding related data, and may also include the pronunciation data of all sound codes) is called the data involved in the plug-in data card mode pickup; the memory on the component data card There are at least some programs related to the pickups or data related to some pickups stored therein. This paragraph only refers to the situation that only one component data card is inserted into the main body of the pickup in the form of a plug-in component data card. The component type data card mentioned in the document of the present invention, the voice coded audiobook, and the main body of the pickup in the form of a plug-in component type data card are three different products. (the plug-in type data card mode sound pick-up can be to adopt direct sound production mode or TTS sound production mode or not only adopt direct sound production mode but also adopt TTS sound production mode; One language can be any non-composite language or any compound language, and the multiple languages mentioned in this paragraph can be all non-composite languages or all compound languages or include both non-compound languages and compound languages)

对于非插卡方式拾音器,上段所述的那些部件(包括读码头,发音器件,单片机芯片)显然也都是在非插卡方式拾音器主体内的。For the non-plug-in card mode pickup, those parts described in the previous paragraph (comprising the reading dock, pronunciation device, single-chip microcomputer chip) are also all in the non-plug-in card mode pickup main body.

如是把解码有关程序和解码有关数据都存储在拾音器主体内,另有一个卡,这个卡上只有一个或多个部件(比如一个发音器件),在卡上不存储任何程序和任何数据,所有程序和所有数据都存储在拾音器主体内的存储器里,其它部件放在此拾音器主体内,使用时把这个卡插入拾音器主体使此卡与此拾音器主体合为一个拾音器(如拾音器采用直接产音方式,那拾音器主体内一般还应存储有诸音码的发音数据),这种方案这个卡可看成是与拾音器主体配套的,并且这种方案显然与把全部部件和全部程序和全部数据都放在拾音器主体内的非插卡方式拾音器方案是等同替换。If the decoding relevant program and decoding relevant data are all stored in the pickup main body, another card has only one or more parts (such as a pronunciation device) on this card, and does not store any program and any data on the card, and all programs And all data are stored in the memory in the main body of the pickup, and other parts are placed in the main body of the pickup. When using, insert this card into the main body of the pickup to make the card and the main body of the pickup into a pickup (such as the pickup adopts a direct sound production method, The pronunciation data that generally also should be stored with all sound codes in that pickup main body), this card of this scheme can be regarded as supporting with the pickup main body, and this scheme is obviously with all parts and all programs and all data all placed The non-insert pickup solution in the pickup body is an equivalent replacement.

对于插卡方式,如一个拾音器主体可分别配一个或多个卡,每个卡上有只适用于一个语种的解码程序及解码数据,每次插一个(或其中一个)卡来读码发音(相应语种的),那这种情况每次把一个(或其中的一个)卡插入一个拾音器主体也是使此卡与此拾音器主体合为一个拾音器(如拾音器采用直接产音方式或既采用直接产音方式又采用TTS产音方式,那这些卡上一般还应存储有诸音码的发音数据)。(本段所述适用于插数据卡方式,也适用于插部件式数据卡方式)For the card insertion method, if a pickup body can be equipped with one or more cards, each card has a decoding program and decoding data that are only applicable to one language, insert one (or one of) cards at a time to read the code and pronounce ( In this case, inserting one (or one of them) card into a pickup body each time also makes the card and the pickup body combined into a pickup (such as the pickup adopts the direct sound production mode or adopts both direct sound production Mode adopts the TTS sound production mode again, then generally also should store the pronunciation data of all sound codes on these cards). (The description in this paragraph applies to the method of inserting data cards, and also applies to the method of inserting component data cards)

对于插卡方式,如一个拾音器主体可分别配一个或多个卡,这些卡只用于存储拾音器的读码及发音及主控运行程序,而拾音器主体内存储只适用于一个语种或适用于多个语种的解码有关程序及解码有关数据,每次插一个(或其中一个)卡来读码发音,那这种情况每次把一个(或其中的一个)卡插入一个拾音器主体也是使此卡与此拾音器主体合为一个拾音器(如拾音器采用直接产音方式或既采用直接产音方式又采用TTS产音方式,那这些卡上一般还应存储有诸音码的发音数据)。(本段所述适用于插数据卡方式,也适用于插部件式数据卡方式)For the way of inserting cards, if a pickup body can be equipped with one or more cards, these cards are only used to store the code reading and pronunciation of the pickup and the main control operation program, while the storage in the pickup body is only suitable for one language or for multiple languages. For the decoding related program and decoding related data of each language, one (or one of them) card is inserted at a time to read the code and pronounce. This pick-up main body is combined into a pick-up (as pick-up adopts direct sound production mode or both adopts direct sound production mode and adopts TTS sound production mode, then generally also should store the pronunciation data of all sound codes on these cards). (The description in this paragraph applies to the method of inserting data cards, and also applies to the method of inserting component data cards)

除非述及时特别指明有其他情况,本说明书除本段和下一段外述及到插数据卡方式拾音器或插部件式数据卡方式拾音器以及它们的相关技术时,都是以每次一个拾音器主体只插一个卡使此拾音器主体与此卡形成为一个拾音器为例的;(本段以下是按插数据卡方式来阐述的,所以本段以下所说的卡是指数据卡,本段以下所说的拾音器主体是指插数据卡方式拾音器主体;但本段以下所述内容也适合于插部件式数据卡方式,即可以把本段以下所述的卡理解成部件式数据卡且把本段以下所述的拾音器主体理解成插部件式数据卡方式拾音器主体且把本段以下所述的拾音器理解成插部件式数据卡方式拾音器且把本段以下所述的卡插槽理解成部件式卡插槽),实际上,也可以是一个拾音器主体可插多个卡(相应的,此一个拾音器主体上需有多个卡插槽),一个拾音器主体插多个卡后这一个拾音器主体与多个卡合为一个拾音器,这多个卡可以全部是与拾音器主体配套的(这时这多个卡等同于一个卡),这多个卡也可以全部是与有声读物配套的(这时这多个卡等同于一个卡),也可能这多个卡中的部分是与有声读物配套的而另一部分是与拾音器主体配套的,也可能是这多个卡既不与有声读物配套也不与拾音器主体配套而是独立出售的,还可能有其它可能情况,但这些情况的拾音器(把多个卡插入拾音器主体而成的拾音器)的结构功用是等同的,这些情况的拾音器与插一个卡方式拾音器的技术特征及使用是相同的,不同在于在销售时多个(而不是一个)卡作为各自不同的产品与有声读物及拾音器主体的配套销售情况(也可能卡与有声读物和拾音器主体都不配套),而这与拾音器的技术特征无关;所以有关插多个卡的拾音器的读码发音等技术特征,与有声读物的配合使用,产品外型,与其它电子类产品集成等都可参照本说明书除本段外对插一个卡的拾音器的相关介绍。Unless otherwise specified when mentioning, except for this paragraph and the next paragraph, when this manual refers to the pickup of the plug-in data card method or the pickup of the plug-in data card method and their related technologies, it is based on the fact that each pickup body only Inserting a card to make the main body of the pickup and the card form a pickup as an example; (The following paragraph is explained by inserting a data card, so the card mentioned below in this paragraph refers to the data card, and the following paragraph in this paragraph refers to the data card. The main body of the pickup refers to the main body of the pickup in the way of inserting a data card; but the following content in this paragraph is also suitable for the method of plugging in a component data card, that is, the card described in the following paragraph can be understood as a component data card and the following in this paragraph The main body of the pickup is understood as the main body of the plug-in type data card pickup, and the pickup described below in this paragraph is understood as the pickup of the plug-in type data card, and the card slot described below in this paragraph is understood as a component type card insertion Slot), in fact, it can also be a pickup body that can insert multiple cards (correspondingly, there must be multiple card slots on the pickup body), after a pickup body inserts multiple cards, this pickup body and multiple Snap into a pickup, these multiple cards can all be matched with the main body of the pickup (at this moment, these multiple cards are equal to one card), and these multiple cards can also all be matched with audiobooks (at this time, these multiple cards are equivalent to one card). card is equivalent to one card), it is also possible that some of the multiple cards are compatible with the audiobook and another part is compatible with the pickup body, or it is possible that the multiple cards are neither compatible with the audiobook nor with the pickup body It is matched but sold independently, and there may be other possible situations, but the structure and function of the pickups in these situations (the pickups formed by inserting multiple cards into the pickup body) are the same. The technical characteristics and use are the same, the difference is that multiple (rather than one) cards are sold as separate products and matched with the audiobook and the main body of the pickup (it is also possible that the card is not matched with the audiobook and the main body of the pickup) , and this has nothing to do with the technical characteristics of the pickup; therefore, for the technical characteristics of the pickup with multiple cards, such as code reading and pronunciation, the use with audiobooks, product appearance, and integration with other electronic products, etc., you can refer to this manual except This paragraph is related to the introduction of the pickup with one card inserted.

在使用时,插入一个插数据卡方式拾音器主体的所有数据卡中每个数据卡上的存储器,都属于与拾音器(此拾音器是指把这些数据卡插入此拾音器主体而形成的一个拾音器)有关的存储器集。在使用时,插入一个插部件式数据卡方式拾音器主体的所有部件式数据卡中每个部件式数据卡上的存储器,都属于与拾音器(此拾音器是指把这些部件式数据卡插入此拾音器主体而形成的一个拾音器)有关的存储器集。权利要求书中各处提及的‘所述数个数据卡上的存储器’,其中的‘数个’是修饰‘数据卡’的,‘所述’是修饰‘数个数据卡’的,‘所述数个数据卡上的存储器’是指这数个数据卡上的所有存储器的总和(比如数个数据卡包括卡1,卡2,卡3,那数个数据卡上的存储器就是指卡1上的存储器加卡2上的存储器加卡3上的存储器),本说明书各处提到的‘数个数据卡上的存储器’(或权利要求书提到的‘前述数个数据卡上的存储器’),也是指这数个数据卡上的所有存储器的总和(与权利要求书提到的‘所述数个数据卡上的存储器’的理解相同);同理,权利要求书中各处提及的‘所述数个部件式数据卡上的存储器’,其中的‘数个’是修饰‘部件式数据卡’的,‘所述’是修饰‘数个部件式数据卡’的,‘所述数个部件式数据卡上的存储器’是指这数个部件式数据卡上的所有存储器的总和(比如数个部件式数据卡包括卡1,卡2,卡3,那数个部件式数据卡上的存储器就是指卡1上的所有存储器加卡2上的所有存储器加卡3上的所有存储器),本说明书各处提到的‘数个部件式数据卡上的存储器’(或权利要求书提到的‘前述数个部件式数据卡上的存储器’),也是指这数个部件式数据卡上的所有存储器的总和(与权利要求书提到的‘所述数个部件式数据卡上的存储器’的理解相同)。When in use, the memory on each data card in all data cards inserted into the main body of the pickup in the form of a data card is related to the pickup (this pickup refers to a pickup formed by inserting these data cards into the pickup body) storage set. When in use, the memory on each component data card in all component data cards inserted into the main body of a plug-in component data card mode all belongs to the pickup (this pickup refers to inserting these component data cards into the pickup body And form a pickup) related memory set. The 'storage on said several data cards' mentioned in the claims, wherein 'several' is used to modify 'data card', 'said' is used to modify 'several data cards',' The memory on the several data cards' refers to the sum of all the memories on the data cards (for example, several data cards include card 1, card 2, and card 3, and the memory on the several data cards refers to the card 1 plus the memory on card 2 plus the memory on card 3), the 'storage on several data cards' mentioned throughout this specification (or the 'storage on the aforementioned several data cards' mentioned in the claims) memory'), also refers to the sum of all the memories on these several data cards (same as the understanding of 'storage on the several data cards' mentioned in the claims); similarly, everywhere in the claims The reference to 'storage on said several component data cards', wherein 'several' modifies 'component data cards', 'said' modifies 'several component data cards',' The memory on said several component data cards' refers to the sum of all the memories on these several component data cards (for example, several component data cards include card 1, card 2, and card 3, and those several component data cards The memory on the data card refers to all the memory on the card 1 plus all the memory on the card 2 plus all the memory on the card 3), the "storage on several component data cards" (or right The "storage on the aforementioned several component data cards" mentioned in the claim) also refers to the sum of all the memories on the several component data cards (which is different from the "several component data cards" mentioned in the claims) The memory on the card' has the same understanding).

对拾音器来说,显然插多个数据卡的拾音器与插一个数据卡的拾音器是等同替换,至于多个数据卡与有声读物及拾音器主体在出售时如何配套不关拾音器这边的事。对拾音器来说,显然插多个部件式数据卡的拾音器与插一个部件式数据卡的拾音器是等同替换,至于多个部件式数据卡与有声读物及拾音器主体在出售时如何配套不关拾音器这边的事。另外,如某一拾音器主体在被使用时既要插数据卡又要插部件式数据卡,那此拾音器主体仍可看成是插部件式数据卡方式拾音器主体,而把要插入此拾音器主体的所有数据卡和所有部件式数据卡等同看成一个部件式数据卡,而分开的各数据卡和各部件式数据卡可能是各自与数个有声读物及此拾音器主体配套销售的,此种拾音器主体仍属于权利要求22所述的拾音器主体(也即部件式拾音器主体),可把把数个数据卡和数个部件式数据卡插入此拾音器主体后形成的拾音器看成为非插卡方式拾音器。For the pickup, it is obvious that a pickup with multiple data cards is equivalent to a pickup with one data card. As for how to match multiple data cards with audiobooks and the main body of the pickup when it is sold, it has nothing to do with the pickup. For the pickup, it is obvious that the pickup with multiple component data cards is equivalent to the pickup with one component data card. As for the matching of multiple component data cards with audiobooks and the main body of the pickup, it is not related to the pickup. Side things. In addition, if a pickup body needs to be inserted with both a data card and a component data card when being used, then the pickup body can still be regarded as a pickup body in the form of a plug-in data card, and the pickup body to be inserted into the pickup body All data cards and all component data cards are regarded as one component data card, and separate data cards and component data cards may be sold separately with several audiobooks and the pickup body, such pickup body Still belonging to the pickup body described in claim 22 (that is, the component type pickup body), the pickup formed by inserting several data cards and several component type data cards into the pickup body can be regarded as a non-card type pickup.

对于图6的实施例,可采用部件式数据卡方式,比如把外接数据存储器6和4021芯片(U6)一起做在成一个部件式数据卡,图6中除这两个元件(SR008和4021)以外的部分就是插部件式数据卡方式拾音器主体,此拾音器主体上做有部件式数据卡的插槽,使用时,把此部件式数据卡插入此拾音器主体而形成一个拾音器(部件式数据卡上的元件和拾音器主体上的元件通过插槽连接起来),此部件式数据卡上存储有解码程序及解码数据,由拾音器读有声读物(也可以是象如图10下半部分那样的有声读物)整页面上的数个导电码(导电码也是如图4所示形式,每个如图4示形式的甲导电码码字的值有一个汉语语种的拼合信息的值的部分),根据从这数个导电码得到的汉语语种的拼合信息的值来解码播放语句发音。有关图6的实施例涉及的读码等其它技术特征见199页至201页所述。For the embodiment of Fig. 6, the component data card method can be adopted, for example, theexternal data memory 6 and the 4021 chip (U6) are made together into a component data card, except for these two components (SR008 and 4021) in Fig. 6 The other part is the pickup main body of the plug-in type data card mode. The slot of the component type data card is arranged on the pickup body. During use, the component type data card is inserted into the pickup body to form a pickup (on the component type data card) The components on the pickup body are connected by slots), the component data card stores the decoding program and decoding data, and the audiobook is read by the pickup (it can also be an audiobook as shown in the lower part of Figure 10) Several conductive codes on the whole page (the conductive code is also the form shown in Figure 4, and the value of each first conductive code code word of the form shown in Figure 4 has a part of the value of the combined information of the Chinese language), according to from this The value of the combined information of the Chinese language obtained by several conductive codes is used to decode the pronunciation of the playback sentence. Other technical features such as code reading related to the embodiment of FIG. 6 are described in pages 199 to 201.

对于拾音器(非插卡方式或插数据卡方式或插部件式数据卡方式的),如其读入的码字涉及的拼合信息的值的语种是本拾音器不能解码的,当然就不处理;比如某拾音器能解码英语(语种)的拼合信息的值,如它读入的一个码字的值里有英语的拼合信息的值和汉语的拼合信息的值,那拾音器只把他能解码的英语的拼合信息的值解码播出语句发音即可,而不处理汉语的拼合信息的值(一般在各拼合信息的值里有语种标志BIT)。For the pickup (non-card-inserting mode or data card mode or plug-in data card mode), if the language of the value of the combined information involved in the code word read in cannot be decoded by the pickup, of course it will not be processed; for example, a certain The pickup can decode the value of English (language) splicing information. If the value of a code word it reads includes the value of English splicing information and the value of Chinese splicing information, then the pickup can only decode the English splicing information it can decode. The value of the information only needs to decode and broadcast the pronunciation of the sentence, instead of dealing with the value of the combined information in Chinese (generally, there is a language flag BIT in the value of each combined information).

前面已有阐述,拾音器解码涉及的硬件发音的方式可以有直接产音方式或TTS产音方式,在直接产音方式时,可用主MCU芯片来发音,也可用外接的语音芯片来发音,外接的语音芯片的一种方式是接受从主MCU解码时来的语句的全部或非全部预备发音数据(解码时采用整步直接发音方式),并调用诸音码的录音数据里的一些音码的录音数据,来播出语句发音;这种方式外接的语音芯片可以更进一步,做成专用的芯片,其内部可使用类似于TTS器件所用的波形合成,参数合成等语音产生方式,并且有专门针对提高语音质量(使语音平滑及合乎发音习惯等)的处理,接口那里可接受外界(解码用的主MCU)传来的语句的全部或非全部预备发音数据(比如按语句发音顺序排组起来的表示英语语句发音的各音标的编号信息或按语句发音顺序排组起来的表示汉语语句发音的各拼音的代号信息,而不是如TTS器件接收的语句的拼法格式数据),并将其转成语句的全部发音或非全部发音播出;这种芯片可与解码用的主MCU芯片配合使用,如采用这种芯片来发音,拾音器就等同是采用直接产音方式来发音的,拾音器解码时播放语句发音的发音方式也算作本发明文件所说的直接发音方式;并且是适用于整步直接的,本段以上所述是以解码是用整步直接发音方式为例的,这种芯片显然也可用于分步直接发音方式,每次接受从主MCU来的部分预备发音数据并发音。It has been explained above that the hardware pronunciation method involved in the pickup decoding can be a direct sound production method or a TTS sound production method. In the direct sound production method, the main MCU chip can be used for pronunciation, and an external voice chip can also be used for pronunciation. A kind of mode of voice chip is to accept all or part of the prepared pronunciation data of the sentence that comes when decoding from main MCU (adopt whole-step direct pronunciation mode during decoding), and call the recording of some sound codes in the recording data of all sound codes Data, to broadcast the pronunciation of sentences; the external voice chip in this way can go a step further and be made into a dedicated chip, which can use waveform synthesis, parameter synthesis and other voice generation methods similar to those used by TTS devices, and there are special methods for improving The processing of voice quality (making the voice smooth and in line with pronunciation habits, etc.), where the interface can accept all or part of the prepared pronunciation data of the sentence sent from the outside world (the main MCU for decoding) (such as the representation arranged in the pronunciation order of the sentence) The number information of each phonetic symbol of the English sentence pronunciation or the code information of each pinyin representing the pronunciation of the Chinese sentence arranged in the pronunciation order of the sentence, rather than the spelling format data of the sentence received by the TTS device), and convert it into a sentence All pronunciations or not all pronunciations are broadcast; this chip can be used in conjunction with the main MCU chip for decoding. If this chip is used to pronounce, the pickup is equivalent to the direct sound production method to pronounce, and the pickup will play sentences when decoding The pronunciation mode of pronunciation also counts as the said direct pronunciation mode of the document of the present invention; It can be used in a step-by-step direct pronunciation method, and each time it receives part of the prepared pronunciation data from the main MCU and pronounces it.

本说明书前面所阐述的拾音器主体,实际上是可和其它电子类产品集成在一起的,比如与MP3播放机或与复读机(比如播放磁带的那种)或与手机集成在一起,可有多种方式集成,以与手机集成为例(其它与MP3播放机等的集成与与手机集成类似),其集成方式一般可有:一:手机其内部原来就有用作手机功能控制用的主MCU,仍可用此主MCU兼作为读码解码处理用的MCU(当然要求此MCU有足够多的IO脚等来作兼用),那就可把读码头(可以是机电式导电码读码头或光电式读码头等)装在手机上,读码头与主MCU意向连接(主MCU与原来手机的元器件的连接仍可不变),主MCU能读取的存储器里存储有解码用的解码有关程序及解码有关数据(可以是只适用于一个语种或适用于多个语种的解码有关程序及解码有关数据,如采用插数据卡方式或插部件式数据卡方式,还可以是把解码有关程序及解码有关数据存储在插入手机的数据卡上或部件式数据卡上的存储器里),手机原来的按键可兼作读码发音控制(比如调节音量)的按键,如手机原来的主MCU或主MCU外接的发音芯片可播语音,那就可仍旧用手机原来的主MCU来在对拼合信息码解码时播语音(这种情况发音器件是主MCU外接的喇叭,此喇叭也是子发音器件)或主MCU外接的发音芯片来在对拼合信息码解码时播语音(这种情况发音器件是主MCU外接的带喇叭的发音芯片,带喇叭的发音芯片也是子发音器件),否则那一般应在主MCU外接语音芯片来在对拼合信息码解码时播语音(这种情况发音器件是主MCU外接的带喇叭的语音芯片,此带喇叭的语音芯片也是子发音器件,即发音器件就是一个子发音器件),手机原来其它的电路可不变,这样一来原来手机的主MCU既可完成原来通讯的功能,又可读取拼合信息码并解码播出语句发音;二:可在手机机体内,分别布置手机原来的电路和拾音模块的电路,两个电路独立(供电可公用),各有各的作主控用的单片机芯片,各有各的按键,发音的电路也分开,拾音模块这边有自己的发音器件,读码头也装在手机上并与读码解码用的MCU(也即拾音模块的MCU)意向连接,这样集成后就同时具备原来通讯的功能和读码解码播出语句发音的功能,且这两功能互不干扰;另还可有其它集成的方式。可把这些种集成方式的电子产品(本段以下所说电子产品是指其集成有MP3播放机功能或复读机功能或手机功能等且可读码解码播出语句发音的电子产品)都看成是拾音器主体,如果这样的拾音器主体在被使用时不需插数据卡,那这样的拾音器主体包括权利要求10的技术特征,所以前述任一种集成方式的使用时不用插数据卡的电子产品(电子产品可看成是拾音器主体)应属于权利要求10所述的拾音器主体的范围,如果这样的拾音器主体在被使用时需插数据卡,那这样的拾音器主体包括权利要求19的技术特征,所以前述任一种集成方式的使用时需插数据卡的电子产品(电子产品可看成是拾音器主体)应属于权利要求19所述的拾音器主体的范围。(本段所说的拾音模块包括MCU,读码头,发音器件这些部件,这也是本段以外述及的非插卡方式拾音器主体包括的部件,拾音模块在必要时还可能包括存储器)。本段前面述的各种电子产品,其在使用时是不插数据卡或插数据卡但数据卡上只有存储器没有部件(部件是指MCU,读码头,发音器件等)的;如是在使用时电子产品需插部件式数据卡(比如部件式数据卡上有读码解码发语音用的发音器件)的方式也可(在部件式数据卡上有的部件,在电子产品上就不用再有),仍可把按这样集成起来的各种电子产品看成是拾音器主体(而且是插部件式数据卡方式拾音器主体),这样的拾音器主体包括权利要求22的技术特征,所以按这样集成起来的任一种电子产品(电子产品可看成是拾音器)都应属于权利要求22所述拾音器主体的范围。本段所说的被看成拾音器主体的电子产品,如其读码发音时本身不用插数据卡和部件式数据卡,那它本身就可看成是非插卡方式拾音器(也可把这种拾音器称为非插卡方式电子产品拾音器),如读码播语音时需插数据卡,那把数据卡插入电子产品后就可看成是数据卡与电子产品合为一个插数据卡方式拾音器(也可把这种拾音器称为插数据卡方式电子产品拾音器),如读码播语音时需插部件式数据卡,那把部件式数据卡插入电子产品后就可看成是部件式数据卡与电子产品合为一个插部件式数据卡方式拾音器(也可把这种拾音器称为插部件式数据卡方式电子产品拾音器)。使用本段前面所述的各种电子产品拾音器,显然可用于读取本发明文件各处所述的有声读物的整页面上诸个码字中的其码值是共与数个拼合信息的值有关的的码字并解码播出语句发音,有声读物的整页面上能被某电子产品拾音器读码以用于解码播出语句发音的一些码字也可被定义为此有声读物整页面上的此电子产品拾音器能读的部诸码字,此电子产品拾音器可读取有声读物整页面上的诸个码字中的部诸码字中的数个码字并解码播出语句发音。显然用本段说的电子产品拾音器(即不用插卡的电子产品本身或是插有数据卡或部件式数据卡的电子产品)可读取有声读物的整页面上的一个或多个码字并可对从这一个或多个码字得到的拼合信息的值进行解码播出语句发音,且这些拾音器可以是采用直接产音方式或采用TTS产音方式或既采用直接产音方式又采用TTS产音方式的,如拾音器是采用直接产音方式或既采用直接产音方式又采用TTS产音方式的话那一般应在拾音器的存储器里存储有诸音码的发音数据,如拾音器是采用TTS产音方式的话那拾音器内应有TTS器件,如拾音器是既采用直接产音方式又采用TTS产音方式的话,拾音器内应有一个发音器件(此发音器件是一个子发音器件或由多个子发音器件组成,这一个或多个子发音器件中一般应包括一个带喇叭的TTS芯片)。电子产品拾音器显然跟本说明书述的其它没与电子类产品集成的拾音器一样也是按照解码规则来解码的(比如本说明书述的以只适用于英语为例的解码规则),且解码时可能用到直接发音方式,TTS发音方式等,对于任一非复合语种或任一复合语种,只要与电子产品拾音器有关的存储器集里(此存储器集是能被此拾音器里作为读码解码发语音主控用的MCU读取的所有存储器)存储有适用于此语种的解码程序及解码数据,那就可用此电子产品拾音器去读码并对从码字得到的此语种的拼合信息的值进行解码。本段说的数据卡可以是数个数据卡,本段说的部件式数据卡可以是数个部件式数据卡。本段所说的各电子产品的形状显然没特别限制。本段各处提到的读码头可以是机电式导电码读码头或光电式读码头等,而且如是与手机集成为电子产品的话,还可以是把手机原来就有的可以读码的器件当成读码头,比如可用手机原来有的照相头来读码,那一般就可把照相头当成读码头(照相机图象处理电路在主MCU内时,一般是照相头连线到主MCU,这种情况照相涉及的读码软件一般放在主MCU内),或把照相头加照相机图象处理集成电路当成读码头(照相机图象处理电路是做成照相机图象处理集成电路时,照相头和此集成电路都在主MCU芯片外,一般是照相头连线到此集成电路且此集成电路连线到主MCU,这种情况照相涉及的读码软件一般可放在照相机图象处理集成电路内,由照相机图象处理集成电路得到读码结果后,再由此集成电路把读码结果传给主MCU),显然以上在述两种照相方式的读码头情况时提到的主MCU也是兼作为读码播语音主控用的MCU,用照相头读的相应的语音编码有声读物整页面上的码字一般是二维条形码,也可修改照相涉及的读码软件使此种照相方式的读码头可以读导电码或一维条形码或图形码等。本段所说的发音器件都是指读码解码发语音用的发音器件,即是如本说明书其它段反复提到的拾音器的发音器件。The pickup body described above in this manual can actually be integrated with other electronic products, such as with an MP3 player or with a repeater (such as the one that plays tapes) or with a mobile phone. There are several ways of integration, taking the integration with mobile phones as an example (other integration with MP3 players is similar to integration with mobile phones), the integration methods generally include: 1: the main MCU used for mobile phone function control is originally inside the mobile phone, The main MCU can still be used as the MCU for code reading and decoding processing (of course, this MCU is required to have enough IO pins for both purposes), then the reading terminal (which can be an electromechanical conductive code reading terminal or a photoelectric reading terminal) can be used. Dock, etc.) installed on the mobile phone, read the intended connection between the dock and the main MCU (the connection between the main MCU and the original components of the mobile phone can still remain unchanged), and the memory that the main MCU can read stores the decoding-related programs and decoding-related information. Data (can be only applicable to one language or be applicable to decoding relevant program and decoding relevant data of a plurality of languages, as adopting the way of inserting data card or plug-in type data card, can also be to store decoding relevant program and decoding relevant data On the data card inserted into the mobile phone or in the memory on the component data card), the original key of the mobile phone can also be used as a key for code reading and pronunciation control (such as adjusting the volume), such as the original main MCU of the mobile phone or the pronunciation chip external to the main MCU. If you want to broadcast audio, you can still use the original main MCU of the mobile phone to broadcast audio when decoding the combined information code (in this case, the audio device is a speaker connected to the main MCU, and this speaker is also a sub-audio device) or the audio chip external to the main MCU. To broadcast the voice when decoding the combined information code (in this case, the pronunciation device is a pronunciation chip with a speaker external to the main MCU, and the pronunciation chip with a horn is also a sub-pronunciation device), otherwise the voice chip should be connected to the main MCU. Play voice when decoding the combined information code (in this case, the pronunciation device is a voice chip with a speaker external to the main MCU, and the voice chip with a speaker is also a sub-pronunciation device, that is, the pronunciation device is a sub-pronunciation device), the original other The circuit can be kept unchanged, so that the main MCU of the original mobile phone can not only complete the original communication function, but also read the combined information code and decode the pronunciation of the broadcast sentence; The circuit of the audio module, the two circuits are independent (the power supply can be shared), each has its own single-chip microcomputer chip for main control, each has its own button, and the pronunciation circuit is also separated. The pickup module has its own pronunciation device. The reading terminal is also installed on the mobile phone and is connected to the MCU for code reading and decoding (that is, the MCU of the pickup module), so that after integration, it has the original communication function and the function of reading code decoding and broadcasting sentence pronunciation, and this The two functions do not interfere with each other; other integration methods are also available. Electronic products of these types of integration (the electronic products mentioned below in this paragraph refer to electronic products integrated with MP3 player functions, repeater functions or mobile phone functions, etc., and can read codes, decode and broadcast sentence pronunciations) can be regarded as It is the main body of the pickup. If such a main body of the pickup does not need to insert a data card when being used, then such a main body of the pickup includes the technical feature ofclaim 10, so the electronic product ( The electronic product can be regarded as the pickup body) should belong to the scope of the pickup body described inclaim 10, if such a pickup body needs to insert a data card when being used, then such a pickup body includes the technical features of claim 19, so An electronic product that needs to be inserted into a data card when using any of the aforementioned integration methods (the electronic product can be regarded as the main body of the pickup) should belong to the scope of the main body of the pickup as claimed in claim 19 . (the sound pick-up module mentioned in this paragraph includes MCU, read dock, these parts of pronunciation device, and this is also the part that the non-plug-in card mode sound pickup main body mentioned outside this paragraph includes, and the sound pickup module may also include memory when necessary). The various electronic products mentioned above in this paragraph are used without a data card or inserted with a data card but there is only memory and no components on the data card (components refer to MCU, reading terminal, pronunciation device, etc.); Electronic products need to be plugged into a component data card (for example, there is a sounding device for code reading, decoding, and voice delivery on the component data card) (the components on the component data card do not need to be present on the electronic product) , the various electronic products integrated in this way can still be regarded as the main body of the pickup (and the main body of the pickup in the form of a plug-in data card), such a main body of the pickup includes the technical characteristics of claim 22, so any integrated An electronic product (the electronic product can be regarded as a pickup) should belong to the scope of the pickup body described in claim 22. The electronic product mentioned in this paragraph is regarded as the main body of the pickup, if it does not need to insert a data card and a component data card when it reads the code and pronounces it, then it itself can be regarded as a non-card mode pickup (this pickup can also be called It is a pickup for non-card-inserting electronic products). If a data card needs to be inserted when reading codes and broadcasting voice, then the data card can be regarded as a data card and electronic product after being inserted into a data card. This kind of pickup is called the electronic product pickup with data card insertion method), if a component data card needs to be inserted when reading codes and broadcasting voice, then the component data card can be regarded as a component data card and electronic product after being inserted into the electronic product. Be combined into a plug-in data card mode pickup (this pickup can also be referred to as a plug-in data card mode electronic product pickup). Using the various electronic product pickups described above in this paragraph, it can obviously be used to read the code values of the code words on the entire page of the audiobook described in various places of the present invention. Relevant codewords and decode the pronunciation of the broadcast sentence, some codewords on the entire page of the audiobook that can be read by a pickup of an electronic product for decoding the pronunciation of the broadcast sentence can also be defined as the entire page of the audiobook The electronic product pickup can read all the codewords, and the electronic product pickup can read several codewords of all codewords on the entire page of the audiobook and decode and broadcast the sentence pronunciation. Obviously, the electronic product pickup mentioned in this paragraph (that is, the electronic product itself without a card or the electronic product with a data card or a component data card inserted) can read one or more codewords on the entire page of the audiobook and The value of the combined information obtained from the one or more codewords can be decoded and the pronunciation of the sentence can be played out, and these pickups can be either direct sound generation or TTS sound generation or both direct sound generation and TTS sound generation. Sound mode, if the pickup adopts the direct sound production method or adopts both the direct sound production method and the TTS sound production method, then generally the pronunciation data of various sound codes should be stored in the memory of the pickup, such as the pickup adopts TTS sound production If the method is used, there should be a TTS device in the pickup. If the pickup adopts both the direct sound production method and the TTS sound production method, there should be a sounding device in the pickup (this sounding device is a sub-sounding device or is composed of multiple sub-sounding devices. One or more sub-pronunciation devices should generally include a TTS chip with a speaker). Electronic product pickups are obviously decoded according to the decoding rules like other pickups described in this manual that are not integrated with electronic products (for example, the decoding rules described in this manual are only applicable to English as an example), and may be used when decoding. Direct pronunciation mode, TTS pronunciation mode, etc., for any non-compound language or any compound language, as long as it is in the memory set related to the pickup of the electronic product (this memory set can be used by the pickup as the code-reading, decoding, and voice master control) All the memory read by the MCU) stores the decoding program and decoding data applicable to this language, then the pickup of this electronic product can be used to read the code and decode the value of the combined information of this language obtained from the codeword. The data card mentioned in this paragraph may be several data cards, and the component data card mentioned in this paragraph may be several component data cards. The shape of each electronic product mentioned in this paragraph is obviously not particularly limited. The reading docks mentioned in this paragraph can be electromechanical conductive code reading docks or photoelectric reading docks, etc., and if they are integrated with mobile phones into electronic products, they can also use the original code-reading devices of mobile phones as reading devices. For example, the original camera head of the mobile phone can be used to read the code, then generally the camera head can be used as a reading port (when the camera image processing circuit is in the main MCU, the camera head is usually connected to the main MCU, in this case the camera The code reading software involved is generally placed in the main MCU), or the camera head plus camera image processing integrated circuit is used as a reading terminal (when the camera image processing circuit is made into a camera image processing integrated circuit, the camera head and the integrated circuit They are all outside the main MCU chip. Generally, the camera head is connected to this integrated circuit and this integrated circuit is connected to the main MCU. In this case, the code reading software involved in photography can generally be placed in the camera image processing integrated circuit, and the After the image processing integrated circuit obtains the code reading result, the code reading result is passed to the main MCU by the integrated circuit), obviously the main MCU mentioned above when reading the dock situation of the above-mentioned two camera modes also doubles as the code reading broadcast The MCU used for voice master control, the code word on the entire page of the corresponding voice-coded audiobook read by the camera head is generally a two-dimensional barcode, and the code reading software involved in the camera can also be modified so that the camera-based reading terminal can read conductive code or one-dimensional barcode or graphic code, etc. The pronunciation device mentioned in this paragraph all refers to the pronunciation device used for code reading and decoding to send out voice, that is, the pronunciation device of the pickup repeatedly mentioned in other paragraphs of this specification sheet.

本发明文件各处所提及的‘读物’(其前面没加‘语音编码有声’六字的也没加‘有声’两个字的)概念与‘语音编码有声读物’概念是有点区别的,读物是指常见的书籍,杂志等本身(读物自身范围内是与拼合信息码无关的),如在读物(如书籍,杂志等)整页面上形成(比如印刷)有一些拼合信息码,那这读物整页面上的所有这些拼合信息码与读物就一起形成语音编码有声读物(读物整页面上的所有这些拼合信息码也就是读物整页面上的诸个码字),此读物也就是作为此语音编码有声读物主体的读物。本段说的拼合信息码(码字)可以是直接把作为有声读物主体的读物的页面当成载体平面的拼合信息码,也可以是如本说明书后面述的小块平面材料的表面上有的拼合信息码,且这一小块平面材料是粘贴在作为有声读物主体的读物的页面上的。The concept of 'reading material' mentioned in the documents of the present invention (the six words 'phonetic coded sound' and the word 'voiced' are not added in front of it) is somewhat different from the concept of 'phonetic coded audio book'. Reading material refers to common books, magazines etc. itself (in the scope of reading material itself, it has nothing to do with the combined information code), if there are some combined information codes formed (such as printing) on the entire page of reading material (such as books, magazines, etc.), then this All these splicing information codes on the entire page of the reading matter and the reading matter just form the audio-coded audiobook together (all these splicing information codes on the entire page of the reading matter are also the code words on the entire page of the reading matter), and this reading matter is just used as the voice code Encodes the reading for the body of the audiobook. The combination information code (code word) mentioned in this paragraph can be the combination information code that directly regards the page of the reading material as the main body of the audiobook as the carrier plane, or it can be the combination on the surface of the small piece of plane material as described later in this specification. Information code, and this small piece of flat material is pasted on the page of the reading that is the main body of the audiobook.

上段说的在读物整页面上形成有拼合信息码,只是有声读物产品的最终形式,实际上生产时到底采用先在数个大开张的印刷纸上形成(比如印刷)拼合信息码再印图文再装订成有声读物,先在数个大开张的印刷纸上印图文再形成拼合信息码再装订成有声读物(本句和上句说的装订是指把印有图文和形成有拼合信息码的数个大开张的印刷纸各自裁减成多个小开张的,并装订起来),先装订成读物形式再印图文再形成拼合信息码以形成有声读物等这些方式中的哪一种是没关系的(本句说的装订是指把数个还没印图文也没形成拼合信息码的大开张的印刷纸各自裁减成多个小开张的,并装订起来),这些方式互相是等同替换。As mentioned in the previous paragraph, the combined information code is formed on the entire page of the reading material, which is only the final form of the audiobook product. In fact, the combined information code is first formed (such as printed) on several large sheets of printing paper during production, and then the graphics and text are printed. Then bind it into audiobooks, first print pictures and texts on several large sheets of printing paper, then form a combined information code, and then bind them into audiobooks (the binding in this sentence and the previous sentence refers to printing pictures and texts and forming a combined information The printing papers of several large sheets of codes are cut into several small sheets and bound together), first bound into the form of reading materials, then printed with pictures and texts, and then form a combination of information codes to form audiobooks, etc. Which of these methods is It doesn't matter (the binding mentioned in this sentence refers to cutting several large sheets of printed paper that have not yet printed pictures and texts and formed a combined information code into multiple small sheets, and then bound them together). These methods are equivalent to each other. .

如提及时无特别指明有他义,本发明文件各处提及的‘语音编码有声读物的整页面’(或‘有声读物的整页面’)默认包括语音编码有声读物的所有页面(对于只有一个页面的语音编码有声读物,其所有页面就是指这一个页面),如提及时无特别指明有他义,本发明文件各处提及的‘读物的整页面’(其前面没加‘语音编码有声’六字的也没加‘有声’两个字的)默认包括读物的所有页面(对于只有一个页面的读物,其所有页面就是指这一个页面)。如提及时无特别指明有他义,本发明文件各处提及的‘语音编码有声读物的页面’,默认是指语音编码有声读物的某个具体的页面;如提及时无特别指明有他义,本发明文件各处提及的‘语音编码有声读物的页’,默认是指语音编码有声读物的某个具体的页;如提及时无特别指明有他义,本发明文件各处提及的‘读物的页面’(其前面没加‘语音编码有声’六字的也没加‘有声’两个字的),默认是指读物的某个具体的页面;如提及时无特别指明有他义,本发明文件各处提及的‘读物的页’(其前面没加‘语音编码有声’六字的也没加‘有声’两个字的),默认是指读物的某个具体的页。If there is no special indication of other meanings when mentioned, the 'full page of the audio-coded audiobook' (or 'the entire page of the audiobook') mentioned throughout the document of the present invention includes all pages of the audio-coded audiobook by default (for only one The voice coded audiobook of the page, all its pages refer to this page), if there is no special meaning when it is mentioned, the "whole page of the reading material" mentioned in the document of the present invention (without the "voice coded audio book" in front of it) 'Six-character ones without adding the word 'sound') defaults to include all pages of the reading material (for a reading material with only one page, all its pages refer to this one page). If there is no special meaning when it is mentioned, the "page of the speech-coded audiobook" mentioned in the document of the present invention refers to a specific page of the speech-coded audiobook by default; if there is no special meaning when it is mentioned, there is other meaning , the 'page of the speech-coded audiobook' mentioned in the document of the present invention refers to a specific page of the speech-coded audiobook by default; if there is no special meaning when it is mentioned, the document mentioned in the document of the present invention has other meanings "The page of the reading material" (the six characters "phonetic coding and audio" are not added in front of it, and the word "audio" is not added), the default refers to a specific page of the reading material; if it is mentioned, it has other meanings , the 'page of the reading material' mentioned in the document of the present invention (it does not add the six characters of 'speech coding and audio' and does not add the two words of 'audio' in front of it), defaults to a certain specific page of the reading material.

对于有声读物的任一页,也可说是作为此有声读物主体的读物的一页,即有声读物的这一页与读物的这一页是指物质上完全相同的东西,比如一有声读物的一个页,是一张纸,正面印有一语句‘上午好’(正面没再印其它图文),反面印有一语句‘下午好’(反面没再印其它图文),正面还印有一个码字,那此有声读物的这一页就包括这张纸和印的语句‘上午好’和‘下午好’,作为此有声读物主体的读物的页也包括这张纸和印的语句‘上午好’和‘下午好’。关于本段所说的物质上完全相同的东西,比如对一个放在桌上的苹果,那说‘这苹果’与说‘放在桌上的这苹果’就是指物质上完全相同的东西(都是指这个3两重的苹果);For any page of the audiobook, it can also be said to be a page of the reading as the main body of the audiobook, that is, this page of the audiobook and this page of the reading refer to the same thing in substance, such as an audiobook. One page is a piece of paper, with a sentence "good morning" printed on the front (no other pictures and texts printed on the front), a sentence "good afternoon" printed on the back (no other pictures and texts printed on the back), and a code printed on the front words, then this page of the audiobook includes this piece of paper and the printed sentences 'good morning' and 'good afternoon', and the page of the reading that is the main body of this audiobook also includes this piece of paper and the printed sentence 'good morning' ' and 'Good afternoon'. Regarding the materially identical things mentioned in this paragraph, for example, for an apple on the table, saying 'this apple' and 'this apple on the table' refer to the materially identical things (both refers to the 3 double apples);

由以上可推出:对于有声读物的任一页面,也可说是作为此有声读物主体的读物的一页面(或者说对于作为有声读物主体的读物的任一页面,也可说是此有声读物的一页面),即有声读物的这一页面与读物的这一页面是完全相同所指,比如一有声读物的一个页,是一张纸,正面印有一语句‘上午好’(正面没再印其它图文),反面印有一语句‘下午好’(反面没再印其它图文),正面还印有一个码字,此有声读物的这一页就包括这张纸和印的语句‘上午好’和‘下午好’,此有声读物的这一页也是作为此有声读物主体的读物的一页,所以这页的正面就是此有声读物的一页面,这页的正面也是作为此有声读物主体的读物的一页面。由此再可推出有声读物的整页面与作为此有声读物主体的读物的整页面是完全相同所指;比如一有声读物只含两个页(页1和页2),页1是在一张纸的正面印有一语句‘上午好’(正面没再印其它图文),反面印有一语句‘下午好’(反面没再印其它图文),正面还印有一个码字,那此有声读物的页1就包括这张纸和印的语句‘上午好’和‘下午好’,页2是在另一张纸的正面印有一语句‘中午好’(正面没再印其它图文),反面印有一语句‘晚上好’(反面没再印其它图文),正面和反面还各印有一个码字,那此有声读物的页2就包括这张纸和印的语句‘中午好’和‘晚上好’,那么此有声读物的整页面就包括此有声读物的页1的正面和此有声读物的页1的反面和此有声读物的页2的正面和此有声读物的页2的反面,作为此有声读物主体的读物的整页面包括此读物的页1的正面和此读物的页1的反面和此读物的页2的正面和此读物的页2的反面,而有声读物的页1与读物的页1等同,有声读物的页2与读物的页2等同,所以此有声读物的整页面与作为此有声读物主体的读物的整页面是完全相同所指。From the above, it can be deduced that for any page of the audiobook, it can also be said to be a page of the reading as the main body of the audiobook (or in other words, for any page of the reading as the main body of the audiobook, it can also be said to be a page of the audiobook). One page), that is, this page of the audiobook and this page of the reading are exactly the same. Pictures and texts), there is a sentence 'good afternoon' printed on the back (no other pictures and texts are printed on the back), and a code word is printed on the front, this page of this audio book includes this paper and the printed sentence 'good morning' and 'good afternoon', this page of this audiobook is also a page of the reading that is the subject of this audiobook, so the front of this page is a page of this audiobook, and the front of this page is also the reading that is the subject of this audiobook of a page. From this, it can be deduced that the entire page of the audiobook is exactly the same as the entire page of the reading as the main body of the audiobook; such as an audiobook only contains two pages (page 1 and page 2), andpage 1 is in a On the front side of the paper is printed a sentence "good morning" (no other pictures and texts printed on the front side), on the back side is printed a sentence "good afternoon" (no other pictures and texts printed on the back side), and there is also a code word printed on the front side, so thisaudiobook Page 1 includes this piece of paper and the printed sentences 'good morning' and 'good afternoon', andpage 2 has a sentence 'good afternoon' printed on the front of another sheet of paper (no other pictures and texts are printed on the front), and the reverse side There is a sentence "good evening" printed (no other pictures and texts are printed on the reverse side), and a code word is printed on the front and back respectively, sopage 2 of this audiobook includes this paper and the printed sentences "good afternoon" and " Good evening', then the entire page of this audiobook includes the front ofpage 1 of this audiobook and the back ofpage 1 of this audiobook and the front ofpage 2 of this audiobook and the back ofpage 2 of this audiobook, as The entire page of the reading in the main body of the audiobook includes the front ofpage 1 of the reading and the back ofpage 1 of the reading and the front ofpage 2 of the reading and the back ofpage 2 of the reading Thepage 1 of the audiobook is equivalent to thepage 2 of the audiobook, so the entire page of the audiobook is exactly the same as the entire page of the reading as the main body of the audiobook.

本发明文件所说的读物自身范围内是与拼合信息码无关的,是指读物自身范围内无码字,比如上段说的含两个页的有声读物,作为此有声读物主体的读物,包括组成页1的那张纸和印的语句‘上午好’和‘下午好’,以及组成页2的那张纸和印的语句‘中午好’和‘晚上好’,显然此读物包括的东西的范围内无码字,也即此读物自身范围内是与拼合信息码无关的。The scope of the reading material itself mentioned in the document of the present invention has nothing to do with the splicing information code. It means that there is no code word within the scope of the reading material itself, such as the audiobook containing two pages mentioned in the previous paragraph. As the main body of the audiobook, the reading material includes the constituent pages The sheet of paper and the printed sentences 'good morning' and 'good afternoon' onpage 1, and the sheet of paper and printed sentences 'good noon' and 'good evening' that make uppage 2, are obviously within the scope of what this reading material includes There is no codeword, that is to say, within the scope of this reading material itself, it has nothing to do with assembling the information code.

对于任一个码字,如处于一本有声读物的页面上,那此码字与作为此有声读物主体的读物是互相独立的关系,而且此码字既不属于此读物的页面,也不属于此读物的页,也不属于此有声读物的页面,也不属于此有声读物的页,但此码字是属于此有声读物的(此码字可以是直接把作为此有声读物主体的读物的页面当载体平面的码字,也可以是本说明书后面述的在小块平面材料上的码字且这小块平面材料是粘贴在作为此有声读物主体的读物的页面上的)。而作为有声读物主体的读物的页面上印刷的图文,显然既属于读物,也属于此有声读物(因读物是作为有声读物的主体,读物属于有声读物),也属于读物的页,也属于此有声读物的页。作为有声读物主体的读物的整页面上的诸个码字与此有声读物的整页面上的诸个码字是相同所指,且这里说的诸个码字属于此有声读物但不属于此有声读物的整页面也不属于作为此有声读物主体的读物也不属于作为此有声读物主体的读物的整页面。对于有声读物的整页面上的诸个码字中的部诸码字,也可说成作为此有声读物主体的读物的整页面上的诸个码字中的部诸码字;对于作为有声读物主体的读物的整页面上的诸个码字中的部诸码字,也可说成此有声读物的整页面上的诸个码字中的部诸码字。For any codeword, if it is on the page of an audiobook, then the codeword and the reading material as the main body of the audiobook are independent of each other, and the codeword neither belongs to the page of the reading material nor to this The page of the reading material does not belong to the page of the audiobook, nor does it belong to the page of the audiobook, but this code word belongs to the audio book (this code word can directly take the page of the reading material as the main body of the audio book as The codeword on the carrier plane can also be the codeword on the small piece of plane material described later in this specification and this small piece of plane material is pasted on the page of the reading material as the main body of the audiobook). And the pictures and texts printed on the page of the reading material as the main body of the audio book obviously belong to both the reading material and the audio book (because the reading material is the main body of the audio book, the reading material belongs to the audio book), and also belong to the page of the reading material, and also belong to this audio book. Audiobook page. The codewords on the entire page of the audiobook as the main body of the audiobook are the same as the codewords on the entire page of the audiobook, and the codewords mentioned here belong to the audiobook but not to the audiobook. The entire page of the reading does not belong to the reading that is the subject of this audiobook nor does it belong to the entire page of the reading that is the subject of this audiobook. For the part codewords in the codewords on the whole page of the audiobook, it can also be said to be the part codewords in the codewords on the whole page of the reading material as the main body of the audiobook; The part code words in the code words on the whole page of the reading material of the main body can also be said to be the part code words in the code words on the whole page of the audiobook.

以上5段提到的码字都是指拼合信息码。以上7段所述是为了本发明文件的表达方便,都是显而易见的常识或者可由优先权申请文件推出或者是优先权申请文件已述及的。The code words mentioned in the above 5 paragraphs all refer to the combination information code. The above 7 paragraphs are for the convenience of expressing the documents of the present invention, and they are all obvious common sense or can be deduced from the priority application documents or have been mentioned in the priority application documents.

如提及时没特别指明有他义,本说明书前面提及的语句,默认其含有的各句体,都是同一个语种的,如提及时没特别指明有他义,本说明书前面提及的拼合信息的值涉及的语句也默认其含有的各句体都是同一个语种的;但一个语句,其含有的各句体,也可以是不同语种的,即此语句是多语种的语句,可以把这语句看成是一复合语种的语句(比如一本读物的每单个语句中可能有英语句子和汉语句子,那就可把英语加汉语定义为一个复合语种,读物中每个语句就是这个复合语种的语句,当然读物中实际的被归属于复合语种的有些语句可能只含有一个语种的句体,是作为有不同语种的句体的语句的特殊情况,对此复合语种语句的编解码处理与对含有多个语种句体的语句的编解码处理可相同),一个复合语种可以是英语加汉语,汉语加韩语,汉语加韩语加日语或汉语(普通话)加广东话等,一个复合语种对应定义的语句集里的语句个数可有限或无限(如一个复合语种是英语加汉语,那此复合语种对应定义的语句集里任一语句可以是含有英语句体和汉语句体的,也可只含有英语句体,或只含有汉语句体);以上提到的各前面没加‘复合’来修饰的‘语种’,是指单独语种。If there is no other meaning specified when mentioning, the sentences mentioned earlier in this manual will default to the sentences contained in them in the same language. The sentences involved in the information value also default to the same language; however, each sentence body contained in a sentence can also be in different languages, that is, the sentence is a multilingual sentence, and you can put This sentence is regarded as a sentence of a compound language (for example, there may be English sentences and Chinese sentences in each single sentence of a reading, then English and Chinese can be defined as a compound language, and each sentence in the reading is this compound language. Of course, some sentences in the readings that are actually attributed to compound languages may only contain sentence bodies of one language, which is a special case of sentences with sentence bodies in different languages. The encoding and decoding processing of sentences containing multiple language sentences can be the same), a compound language can be English plus Chinese, Chinese plus Korean, Chinese plus Korean plus Japanese or Chinese (Mandarin) plus Cantonese, etc., a compound language corresponds to the defined The number of sentences in the sentence set can be limited or unlimited (such as a compound language is English plus Chinese, then any sentence in the sentence set corresponding to the definition of the compound language can contain English and Chinese sentences, or only contain English sentence style, or only contain Chinese sentence style); the above-mentioned "languages" that are not modified by "compound" in front refer to individual languages.

如一个复合语种是汉语加韩语加日语,那也可以说此复合语种包含汉语和韩语和日语这些单独语种,即此复合语种的任一语句中可能含有汉语的句体,韩语的句体,日语的句体中的一种或多种;一个复合语种可以是包含多个印欧语系的语种(比如包含英语和法语),或包含多个非印欧语系的语种(比如包含汉语和韩语和日语),或包含数个印欧语系的语种和数个非印欧语系的语种(比如包含英语和汉语),甚至一个复合语种可以是包含印欧语系和非印欧语系以外的数个单独语种(即数个既不属于印欧语系又不属于非印欧语系的单独语种),或者一个复合语种可以是包含印欧语系和非印欧语系以外的数个单独语种加上数个印欧语系的语种,或者一个复合语种可以是包含印欧语系和非印欧语系以外的数个单独语种加上数个非印殴语系的语种,或者一个复合语种可以是包含印欧语系和非印欧语系以外的数个单独语种加上数个印殴语系的语种加上数个非印殴语系的语种等。如有适用于多个语种的编码规则(或解码规则),那这多个语种中可以有复合语种。复合语种包含多个单独语种,而这‘多’的数量是没限制的。If a compound language is Chinese plus Korean plus Japanese, it can also be said that this compound language includes the individual languages of Chinese, Korean and Japanese, that is, any sentence in this compound language may contain Chinese sentence body, Korean sentence body, Japanese One or more of the sentences; a compound language can be a language that includes multiple Indo-European languages (such as including English and French), or a language that includes multiple non-Indo-European languages (such as including Chinese and Korean and Japanese ), or include several Indo-European languages and several non-Indo-European languages (such as including English and Chinese), or even a compound language can include several individual languages other than Indo-European and non-Indo-European languages ( That is, several individual languages that neither belong to the Indo-European language family nor non-Indo-European languages), or a compound language can be composed of several individual languages other than Indo-European languages and non-Indo-European languages plus several Indo-European languages languages, or a composite language can include several individual languages other than Indo-European and non-Indo-European plus several non-Indo-European languages, or a composite language can include Indo-European and non-Indo-European languages Several separate languages of the Indo-Australian language family plus several non-Indo-Australian languages and so on. If there are encoding rules (or decoding rules) applicable to multiple languages, then there may be compound languages among the multiple languages. A compound language contains multiple individual languages, and the number of this 'many' is unlimited.

(本段所述适于任一复合语种)。仍可以是对复合语种语句来进行编解码,可以用一个编码规则中有关此复合语种的编码细则来对一个复合语种编码语句集里的各语句进行编码,此复合语种编码语句集是此编码规则中有关此复合语种的编码细则定义的,此编码规则中有关此复合语种的直播方式(或TTS方式或混合方式)的编码细则对此复合语种的语句中不同非复合语种(以及各既不属于印欧语系又不属于非印欧语系的单独语种)的句体的音码定义及编码处理方式可参照本说明书述的直播方式(或TTS方式)的以只适用于英语为例的编码规则以及直播方式(或TTS方式)的以只适用于汉语为例的编码规则(复合语种语句拼合编解码时显然可以是用本说明书前面述的1类,2类,3类,4类,其它形式音码中的任意多种音码),生成的仍是表示语句发音的拼合信息的值(这里的语句是复合语种的),仍可简称其为拼合信息的值,相应的,拼合信息的值涉及的语句也是复合语种的语句;相应的,可以用一个与本段前述对复合语种编码语句集里的语句进行编码的编码规则有关此复合语种对应的解码规则来对表示复合语种解码语句集里的语句发音的拼合信息的值进行解码(实际是用此解码规则中有关此复合语种的解码细则来解码的,此复合语种编码语句集与此复合语种解码语句集可完全相同或部分相同),这里对应可以是专门对应或一般对应,此复合语种解码语句集是此解码规则中有关此复合语种的解码细则定义的,此解码规则中有关此复合语种的直播方式(或TTS方式或混合方式)的解码细则对此复合语种的语句中不同非复合语种(以及各既不属于印欧语系又不属于非印欧语系的单独语种)的句体的音码定义及解码处理方式可参照本说明书述的直播方式(或TTS方式)的以只适用于英语为例的解码规则以及直播方式(或TTS方式)的以只适用于汉语为例的解码规则(复合语种语句拼合编解码时显然可以是用本说明书前面述的1类,2类,3类,4类,其它形式音码中的任意多种音码)。(This paragraph is suitable for any compound language). It is still possible to encode and decode compound language sentences, and each sentence in a compound language coded sentence set can be coded by using the coding rules related to this compound language in a coding rule. This compound language coded sentence set is the coding rule As defined in the coding rules for this compound language, the coding rules for this compound language’s live broadcast mode (or TTS mode or mixed mode) in this coding rule are different non-compound languages (and each of which is neither Indo-European language family does not belong to non-Indo-European language family) phonetic code definition and coding processing method of the sentence body can refer to the coding rules and the The encoding rules of the live mode (or TTS mode) are only applicable to Chinese as an example (when compound language sentences are combined and coded, it is obvious that the above-mentionedType 1,Type 2,Type 3,Type 4, and other forms of audio can be used. Any multiple phonetic codes in the code), what generates is still the value of the combined information representing the pronunciation of the sentence (the statement here is of a compound language), and it can still be referred to as the value of the combined information for short. Correspondingly, the value of the combined information involves The sentence in the compound language is also a sentence in a compound language; correspondingly, a decoding rule related to the encoding rule for encoding the sentences in the compound language coded sentence set described above in this paragraph can be used to represent the compound language in the decoded sentence set. The value of the combined information of the pronunciation of the sentence is decoded (actually, it is decoded with the decoding rules about this compound language in this decoding rule, and the coded sentence set of this compound language can be completely or partially identical to the set of decoded sentences of this compound language), here Correspondence can be special correspondence or general correspondence, this complex language decoding sentence set is defined by the decoding rules of this compound language in this decoding rule, and the live broadcast mode (or TTS mode or mixed mode) of this compound language in this decoding rule Detailed rules for decoding For the definition of phonetic codes and decoding processing methods of sentences of different non-compound languages (and individual languages that neither belong to the Indo-European language family nor non-Indo-European language family) in sentences of this compound language, please refer to the description in this manual. The decoding rules of the live mode (or TTS mode) that are only applicable to English as an example and the decoding rules of the live mode (or TTS mode) that are only applicable to Chinese as an example (when compound language sentences are assembled and decoded, obviously this can be used) The above-mentionedclass 1 in the manual,class 2,class 3,class 4, any multiple phonetic codes in other forms of phonetic codes).

比如一复合语种是英语加汉语,那用于编码的可以是这4种音码:单词(英语涉及的单词),元辅音(英语涉及的元辅音),字(汉语涉及的字)和词(汉语涉及的词),然后在对此复合语种语句进行编解码时,语句中的英语句体就可用数个单词音码和数个元辅音音码(或只用数个单词音码或只用数个元辅音音码)来作拼合,语句中的汉语句体就可用数个字音码和数个词音码(或只用数个字音码或只用数个词音码)来作拼合。Such as a compound language is English plus Chinese, and that can be these 4 kinds of phonetic codes that are used for coding: word (the word that English relates to), vowel and consonant (the vowel and consonant that English relates to), word (word that Chinese relates to) and word ( Chinese related word), then when this compound language sentence is encoded and decoded, the English sentence body in the sentence can use several word phonetic codes and several vowel and consonant phonetic codes (or only use several word phonetic codes or only use several word phonetic codes Several vowel and consonant sound codes) are put together, and the Chinese sentence style in the sentence just can be done splicing with several word sound codes and several word sound codes (or only with several word sound codes or only with several word sound codes).

根据不同语种的规则,各语种的语法结构,发音方式互相有些差别,但世界上各种语种都有共性,都是可用音码来拼合成语句发音的,各语种用于编解码的音码的定义方式及编解码处理方式的本质互相是相同的,而本说明书不可能对世界上每种语种都来阐述一遍,所以,编码规则中有关某复合语种的编码细则对此复合语种的语句中各既不属于印欧语系又不属于非印欧语系的单独语种的句体的音码定义及编码处理方式可参照本说明书述的直播方式(或TTS方式)的以只适用于英语为例的编码规则以及直播方式(或TTS方式)的以只适用于汉语为例的编码规则,解码规则中有关某复合语种的解码细则对此复合语种的语句中各既不属于印欧语系又不属于非印欧语系的单独语种的句体的音码定义及解码处理方式及发音方式可参照本说明书述的直播方式(或TTS方式)的以只适用于英语为例的解码规则以及直播方式(或TTS方式)的以只适用于汉语为例的解码规则。According to the rules of different languages, the grammatical structure and pronunciation methods of each language are somewhat different from each other, but all languages in the world have common features, and they can be combined into sentences with phonetic codes. The essence of the definition method and the codec processing method are the same with each other, and it is impossible to explain every language in the world in this manual. For the phonetic code definitions and encoding processing methods of sentences of individual languages that neither belong to the Indo-European language family nor non-Indo-European language families, please refer to the live broadcast method (or TTS method) described in this manual, which only applies to English as an example Rules and live streaming (or TTS) coding rules that are only applicable to Chinese as an example, and decoding rules for a compound language in the decoding rules. The sentences in this compound language neither belong to the Indo-European language family nor belong to the non-Indo-European language family. The definition of phonetic codes and decoding processing methods and pronunciation methods of sentence styles in separate languages of the European language family can refer to the decoding rules and live broadcast methods (or TTS methods) that are only applicable to English in the live broadcast method (or TTS method) described in this manual. ) of the decoding rules that are only applicable to Chinese as an example.

本发明文件所说的编码规则中有关复合语种的编码细则是可有压缩编码处理的,因为编码规则中有关复合语种的编码细则中对复合语种语句中各非复合语种(以及各既不属于印欧语系又不属于非印欧语系的单独语种)的句体的编码处理是可参照以只适用于英语(或汉语)为例的编码规则的,而在本说明书所述的以只适用于英语(或汉语)为例的编码规则是有压缩编码处理的;本发明文件所说的解码规则中有关复合语种的解码细则是可有压缩解码处理的,因为解码规则中有关复合语种的解码细则中对复合语种语句中各非复合语种(以及各既不属于印欧语系又不属于非印欧语系的单独语种)的句体的解码处理是可参照以只适用于英语(或汉语)为例的解码规则的,而在本说明书所述的以只适用于英语(或汉语)为例的解码规则是有压缩解码处理的;在编码时,复合语种包含的各非复合语种(以及各既不属于印欧语系又不属于非印欧语系的单独语种)涉及的编码数据一般可分开放,比如把英语的单词数据库与汉语的词数据库分开放,但也可合起来;在解码时,复合语种包含的各非复合语种(以及各既不属于印欧语系又不属于非印欧语系的单独语种)涉及的解码数据一般可分开放,比如把英语的单词数据库与汉语的词数据库分开放,但也可合起来。The coding details about the compound language in the said coding rules of the present invention document can be processed by compression coding, because in the coding rules about the compound languages in the coding rules, each non-composite language in the compound language sentence (and each neither belongs to the printed word) European language family does not belong to the separate language of non-Indo-European language family), the coding processing of the sentence body can refer to the coding rules that are only applicable to English (or Chinese) as an example, and the coding rules that are only applicable to English (or Chinese) as an example encoding rule has compression coding processing; The decoding details about compound languages in the said decoding rules of the present invention document can have compression decoding processing, because in the decoding rules about compound languages in the decoding rules For the decoding processing of the sentence body of each non-compound language (and each individual language that does not belong to the Indo-European language family or non-Indo-European language family) in the compound language sentence, you can refer to the example that is only applicable to English (or Chinese) decoding rules, while the decoding rules described in this manual, which are only applicable to English (or Chinese) as an example, are processed by compression decoding; when encoding, each non-compound language contained in the compound language (and each not belonging to Indo-European languages do not belong to the separate languages of the non-Indo-European language family), and the coding data involved can generally be opened separately, for example, the English word database and the Chinese word database are opened separately, but they can also be combined; when decoding, the compound language includes The decoding data involved in each non-compound language (and each individual language that is neither Indo-European nor non-Indo-European) can generally be opened separately, such as separating the English word database from the Chinese word database, but also can be combined.

拾音器有使用的适用于复合语种的解码程序及解码数据一般是根据适用于复合语种的解码规则(比如本段前面述的解码规则)中有关此复合语种的解码细则来对表示复合语种语句发音的拼合信息的值执行解码的(与一个拾音器有关的存储器集里可能只存储有只适用于复合语种的解码程序及解码数据,也可能存储有适用于包括复合语种和非复合语种在内的多个语种的解码程序及解码数据);至于有关复合语种涉及的语音编码有声读物及拾音器的各项技术特征(比如码字的种类及形式,拼合信息的值在各码字里如何放比如放一个或多个码字里或作变换再放等,各码字在有声读物整页面上的布置即各个页面在什么位置形成码字等,拾音器的硬件特点等)都与有关非复合语种涉及的语音编码有声读物及拾音器的的这些技术特征相同。The decoding program and decoding data applicable to the compound language used by the pickup are generally based on the decoding rules applicable to the compound language (such as the decoding rules mentioned above in this paragraph) to pronounce the sentence representing the compound language. Combining the value of the information to perform decoding (the memory set related to a pickup may only store the decoding program and decoding data that are only applicable to the compound language, and may also store multiple programs that are applicable to both the compound language and the non-compound language. language decoding program and decoding data); as for the speech coding audiobooks involved in the compound language and various technical features of the pickup (such as the type and form of the codeword, how to put the value of the combined information in each codeword, such as putting one or In a plurality of codewords or make transformation and put again etc., the arrangement of each codeword on the whole page of audiobook (i.e. where each page forms a codeword, etc., the hardware characteristics of the pickup, etc.) These technical features are the same for audiobooks and pickups.

如某有声读物整页面上的某码字的值是与表示复合语种的语句发音的拼合信息的值有关的,那此码字仍属于此有声读物的诸个码字的范围,可把诸个码字中的所有这样的码字(码字的值是与表示复合语种的语句发音的拼合信息的值有关的,码字的值还可能与表示此复合语种或其它复合语种或非复合语种的语句的发音的拼合信息的值有关)中的所有码字或部分码字定义成一部诸码字,某拾音器可读取诸个码字中的这部诸码字中的数个码字而得到表示复合语种的语句发音的拼合信息的值,并解码播出语句发音。If the value of a certain codeword on the entire page of an audiobook is related to the value of the combined information representing the sentence pronunciation of a compound language, then this codeword still belongs to the scope of the codewords of the audiobook, and all the codewords can be combined All such codewords in the codeword (the value of the codeword is related to the value of the mosaic information of the sentence pronunciation representing the compound language, and the value of the codeword may also be related to representing this compound language or other compound languages or non-compound languages All codewords or part codewords are defined as a part of all codewords in the value of the combined information of the pronunciation of the sentence), and a certain pickup can read several codewords in this all codewords in the codewords to obtain Indicates the value of the mosaic information of the sentence pronunciation of the compound language, and decodes and broadcasts the sentence pronunciation.

拾音器可以是采用直接产音方式或采用TTS产音方式或既采用直接产音方式又采用TTS产音方式的,拾音器的MCU按解码规则中有关复合语种的解码细则根据从码字得到的表示复合语种语句发音的拼合信息的值来解码时播放复合语种语句发音可以采用直接发音方式或TTS发音方式(一个复合语种语句里每个非复合语种的句体的发音的处理显然同于本说明书前述的一个此非复合语种的语句的发音的处理,因为一个句体是一个句子或一个部分句子,一个语句可以是一个句子或一个部分句子,所以在发音时,任一个非复合语种的句体可等价看成是一个此非复合语种的语句来发音,比如连续播此非复合语种的数个音码的语音来形成此非复合语种的句体的发音),拾音器可以是按照某解码规则中有关此复合语种的解码细则来对复合语种的拼合信息的值进行解码的,此解码规则中有关此复合语种的解码细则可以是直播方式或TTS方式或混合方式的;如拾音器解码时有采用直接发音方式,那与拾音器有关的存储器集里一般应有用于解码时复合语种语句中的不同非复合语种句体涉及的各非复合语种的相关音码的发音数据(还可能有既不属于印欧语系又不属于非印欧语系的单独语种的音码的发音数据);如拾音器有采用TTS器件来发音,那或者有多个不同语种的TTS器件,解码时复合语种语句中的不同非复合语种的句体(还可能有各既不属于印欧语系又不属于非印欧语系的单独语种的句体)使用各自语种的TTS器件发音(此多个带喇叭的TTS器件就组成拾音器的发音器件,每个带喇叭的TTS器件也就是一个子发音器件),或者可使用一个TTS器件,此TTS器件可用于解码时复合语种语句中的不同非复合语种(还可能有各既不属于印欧语系又不属于非印欧语系的单独语种)的句体的发音,此一个带喇叭的TTS器件就是拾音器的发音器件(此一个带喇叭的TTS器件也即子发音器件,发音器件就是这一个子发音器件)。The pickup can use the direct sound production method or the TTS sound production method or both the direct sound production method and the TTS sound production method. When the value of the combination information of language sentence pronunciation is decoded, the pronunciation of compound language sentences can be played in direct pronunciation mode or TTS pronunciation mode (the processing of the pronunciation of each non-compound language sentence body in a compound language sentence is obviously the same as the foregoing description of this manual The processing of the pronunciation of a sentence in a non-compound language, because a sentence body is a sentence or a partial sentence, and a sentence can be a sentence or a partial sentence, so when pronouncing, any sentence body in a non-compound language can be equal The value is regarded as a sentence of this non-compound language to pronounce, such as continuously broadcasting the voices of several phonetic codes of this non-compound language to form the pronunciation of the sentence body of this non-compound language), the pickup can be according to relevant in a certain decoding rule The decoding rules of this compound language are used to decode the value of the combined information of the compound language. The decoding rules of this compound language in this decoding rule can be live broadcast mode or TTS mode or mixed mode; if the pickup is decoded, it uses direct pronunciation Mode, generally should have the pronunciation data of the relevant phonetic sign indicating number of each non-composite language that the different non-composite language sentence bodies in compound language sentences relate to when decoding in the memory set that is relevant to pickup Do not belong to the pronunciation data of the sound code of the independent language of non-Indo-European language family); As the pickup has the TTS device to pronounce, or there are a plurality of TTS devices of different languages, the different non-compound language in the compound language sentence when decoding Sentence body (there may also be the sentence body of each independent language that does not belong to Indo-European language family and does not belong to non-Indo-European language family) use the TTS device pronunciation of respective languages (this multiple TTS device with loudspeaker just forms the pronunciation device of pick-up, Each TTS device with loudspeaker is also a sub-pronunciation device), or a TTS device can be used, and this TTS device can be used for different non-compound languages in compound language sentences when decoding (there may also be each neither belonging to Indo-European language family nor The pronunciation of the sentence body that does not belong to the separate language of non-Indo-European language family), this TTS device with horn is exactly the pronunciation device of pickup (this TTS device with horn is also the sub-pronunciation device, and the pronunciation device is exactly this sub-pronunciation device ).

涉及复合语种时,不论拾音器是采用直接产音方式还是采用TTS产音方式还是既采用直接产音方式又采用TTS产音方式的,拾音器都仍可以是非插卡方式拾音器或插数据卡方式拾音器或插部件式数据卡方式拾音器,也可以是本说明书说的电子产品拾音器;如拾音器的MCU按照解码规则中有关某复合语种的解码细则根据从码字得到的表示此复合语种语句发音的拼合信息的值来解码时播放此复合语种语句发音是采用直接发音方式,那可以是用整步直接发音方式或分步直接发音方式,如解码时播放此复合语种语句发音是采用TTS发音方式,那可以是用整步TTS发音方式或分步TTS发音方式,如解码细则涉及的发音形式是用混合发音方式,那解码时播放此复合语种语句发音可以是用到整步直接发音方式,整步TTS发音方式,分步直接发音方式,分步TTS发音方式中的任意多种。When multiple languages are involved, no matter whether the pickup adopts the direct sound production method, the TTS sound production method, or both the direct sound production method and the TTS sound production method, the pickup can still be a non-card type pickup or a data card type pickup or The plug-in type data card mode pickup can also be the electronic product pickup mentioned in this manual; as the MCU of the pickup is according to the decoding rules about a certain compound language in the decoding rules according to the combined information obtained from the codeword to represent the pronunciation of the compound language sentence The pronunciation of this compound language sentence is to adopt the direct pronunciation method when decoding, that can be the whole step direct pronunciation method or the step-by-step direct pronunciation method, such as playing this compound language sentence pronunciation when decoding is to adopt the TTS pronunciation method, that can be Use the full-step TTS pronunciation method or the step-by-step TTS pronunciation method. If the pronunciation form involved in the decoding rules is a mixed pronunciation method, then the pronunciation of the compound language sentence can be played in the whole-step direct pronunciation method or the whole-step TTS pronunciation method when decoding. , any number of step-by-step direct pronunciation methods, step-by-step TTS pronunciation methods.

以上5段提及的有关复合语种语句的编码规则可以是只适用于一个语种(即复合语种)的,也可以是适用于包括此复合语种在内的多个语种(这多个语种中除此复合语种外的其它语种可全是复合语种或全是非复合语种或既包含复合语种也包含非复合语种)的,以上5段提及的有关复合语种语句的解码规则可以是只适用于一个语种(即复合语种)的,也可以是适用于包括此复合语种在内的多个语种(这多个语种中除此复合语种外的其它语种可全是复合语种或全是非复合语种或既包含复合语种也包含非复合语种)的。The coding rules for compound language sentences mentioned in the above 5 paragraphs can be applicable to only one language (that is, compound language), or can be applicable to multiple languages including this compound language (among these multiple languages, other than this Other languages other than compound languages can be all compound languages or all non-compound languages, or both compound languages and non-compound languages), the decoding rules for compound language sentences mentioned in the above 5 paragraphs can be only applicable to one language ( That is, compound languages), it can also be applicable to multiple languages including this compound language (among the multiple languages, other languages except this compound language can be all compound languages or all non-compound languages or both contain compound languages Also includes non-compound languages).

如编码规则是适用于数个语种的(这数个语种全是复合语种,或既包含复合语种也包含非复合语种),解码规则是适用于数个语种的(这数个语种全是复合语种,或既包含复合语种也包含非复合语种;这句的‘数个’与上句的‘数个’不一定是相同数目),那此编码规则中有关各语种的编码细则的定义以及此解码规则中有关各语种的解码细则的定义以及此解码规则中有关某一语种的解码细则与此编码规则中有关某一语种的编码细则的对应情况(这里说到的两个‘某一语种’是相同所指,此语种可以是任一非复合语种或任一复合语种)也同于本说明书述的所涉及的语种都是非复合语种时编码规则中有关各语种的编码细则的定义以及解码规则中有关各语种的解码细则的定义以及解码规则中有关某一语种的解码细则与编码规则中有关某一语种的编码细则的对应情况(这里说到的两个‘某一语种’是相同所指且是非复合语种但这里说到的两个‘某一语种’与本段第5行说到的两个‘某一语种’可能是也可能不是相同语种)。If the coding rules are applicable to several languages (these several languages are all compound languages, or contain both compound languages and non-composite languages), the decoding rules are applicable to several languages (these several languages are all compound languages , or include both compound languages and non-compound languages; the "several" in this sentence is not necessarily the same number as the "several" in the previous sentence), then the definition of the coding rules for each language in this coding rule and the decoding The definition of the decoding rules for each language in the rules and the correspondence between the decoding rules for a certain language in this decoding rule and the encoding rules for a certain language in this encoding rule (the two "a certain language" mentioned here are The same reference, this language can be any non-composite language or any compound language) is also the same as the definition of the coding rules for each language in the coding rules and the decoding rules when the languages involved in this description are all non-compound languages The definition of the decoding rules for each language and the correspondence between the decoding rules for a certain language in the decoding rules and the encoding rules for a certain language in the encoding rules (the two "a certain language" mentioned here refer to the same and It is not a compound language but the two "a certain language" mentioned here may or may not be the same language as the two "a certain language" mentioned in line 5 of this paragraph).

权利要求1,2,3,10,11,12,19,20,22或23提及的语句及拼合信息的值涉及的语句,既可以是任一非复合语种的一个语句,也可以是任一复合语种的一个语句。The sentence mentioned inclaim 1, 2, 3, 10, 11, 12, 19, 20, 22 or 23 and the sentence involved in the value of the combined information can be a sentence of any non-compound language, or any A sentence in a compound language.

只适用于一个复合语种或适用于包括数个复合语种在内的多个语种的编码规则可以有很多种,只适用于一个复合语种或适用于包括数个复合语种在内的多个语种的解码规则可以有很多种。There can be many kinds of encoding rules that are only applicable to one compound language or applicable to multiple languages including several compound languages There can be many kinds of rules.

权利要求3,12,20或23所提及的‘此编码规则中有关语种1的编码细则是含有压缩编码处理的’中的‘语种1’可以是一复合语种;权利要求3,12,20或23所提及的‘此编码规则中有关语种1的编码细则是含有压缩编码处理的’中的‘语种1’也可以为一非复合语种(比如英语)。权利要求3,12,20或23所提及的‘此解码规则中有关语种2的解码细则是含有压缩解码处理的’中的‘语种2’可以是一复合语种;权利要求3,12,20或23所提及的‘此解码规则中有关语种2的解码细则是含有压缩解码处理的’中的‘语种2’也可以为一非复合语种。Claim 3, 12, 20 or 23 mentions 'the encoding details aboutlanguage 1 in this encoding rule is to contain compression coding process' in 'language 1' can be a compound language;Claim 3, 12, 20 Or the "language 1" in "the coding details aboutlanguage 1 in this coding rule contains compression coding processing" mentioned in 23 can also be a non-compound language (such as English).Claim 3, 12, 20 or 23 mentions 'the decoding details aboutlanguage 2 in this decoding rule is to contain compression decoding process' in 'language 2' can be a compound language;Claim 3, 12, 20 Or the "language 2" in the "decoding details aboutlanguage 2 in this decoding rule includes compression decoding processing" mentioned in 23 can also be a non-compound language.

一复合语种编码语句集中的一个此复合语种的语句,其含有的多个句体可以是属于几个单独语种的(一般情况),也可以是特例,比如此复合语种的一个语句含有的数个句体都属于一个单独语种的,或者比如此复合语种是英语加汉语加韩语而此复合语种的一个语句含有一个英语句体和一个汉语句体,对这些特例语句,仍可以用某编码规则中有关此复合语种的编码细则来编码生成表示此特例语句发音的拼合信息的值,此拼合信息的值又可以被某解码规则中有关此复合语种的解码细则来解码,所以对于在读物整页面上的需发的一个单独语种的语句的发音,此语句可能会被当成一复合语种的特例语句并用某编码规则中有关此复合语种的编码细则来编码生成表示此特例语句发音的拼合信息的值,并可把此拼合信息的值(或其变换值)放在一个或多个码字里,如有声读物整页面上的码字的值是与这样的拼合信息的值有关的,那此码字也属于此有声读物整页面上诸个码字的范围,权利要求2里就可以是这样的情况,即语句1含有的句体可都是一非复合语种的且语句1可以是某复合语种的特例,权利要求3里提到的‘解码规则中有关语种2的解码细则’涉及的语种2可以是此复合语种的,权利要求3里提到的‘编码规则中有关语种1的编码细则’涉及的语种1可以是此复合语种的,对权利要求11,12也可有与权利要求2,3这样类似的情况。权利要求20中的语句1含有的句体可都是一非复合语种的且语句1可以是某复合语种的特例,权利要求20里提到的‘解码规则中有关语种2的解码细则’涉及的语种2可以是此复合语种的,权利要求20里提到的‘编码规则中有关语种1的编码细则’涉及的语种1可以是此复合语种的,权利要求23也可有与权利要求20这样类似的情况。A sentence of this compound language in a compound language coding sentence set, the multiple sentence bodies contained in it can belong to several individual languages (general situation), or it can be a special case, such as the several sentences contained in a sentence of this compound language Sentence bodies all belong to a single language, or such as the compound language is English plus Chinese plus Korean and a sentence in this compound language contains an English sentence body and a Chinese sentence body. For these special cases, a certain encoding rule can still be used The coding rules of this compound language are used to encode and generate the value of the splicing information representing the pronunciation of this special sentence, and the value of this splicing information can be decoded by the decoding rules of this compound language in a certain decoding rule, so for the entire page of the reading material The pronunciation of a sentence in a single language that needs to be issued, this sentence may be regarded as a special sentence of a compound language and encoded with the coding rules related to the compound language in a certain coding rule to generate a value representing the combined information of the pronunciation of the special sentence, And the value (or its transformed value) of this stitching information can be placed in one or more codewords, if the value of the codeword on the entire page of the audiobook is related to the value of such stitching information, then the codeword It also belongs to the scope of the codewords on the entire page of the audio book, which can be the case in claim 2, that is, the sentences contained in sentence 1 can all be in a non-compound language and sentence 1 can be in a certain compound language As a special case, the language 2 involved in the 'decoding rules for language 2 in the decoding rules' mentioned in claim 3 may be of this compound language, and the 'encoding rules for language 1 in the encoding rules' mentioned in claim 3 involve Language 1 can be this composite language, and claims 11 and 12 can also have similar situations as claims 2 and 3. The sentences contained in thesentence 1 inclaim 20 can all be of a non-compound language and thesentence 1 can be a special case of a certain compound language. The "decoding rules related tolanguage 2 in the decoding rules" mentioned inclaim 20 refer toLanguage 2 can be of this composite language, and thelanguage 1 involved in the 'encoding rules forlanguage 1 in the coding rules' mentioned inclaim 20 can be of this composite language, and claim 23 can also have something similar to claim 20 Case.

如一个复合语种是英语加汉语,那编码时定义的编码语句集(或解码时定义的解码语句集)可以包括一个只有英语句体的语句,一个只有汉语句体的语句,一个既含有英语句体也含有汉语句体的语句。If a compound language is English plus Chinese, the encoding statement set defined during encoding (or the decoding statement set defined during decoding) can include a statement with only English sentence body, a sentence with only Chinese sentence body, and a sentence with both English sentence The body also contains sentences in the Chinese body.

复合语种的编码举例和解码举例:如有一个复合语种是英语加汉语,有一个只适用于此复合语种的编码规则(本段以下提到的‘此编码规则’就是这个编码规则),有一个只适用于此复合语种的解码规则(本段以下提到的‘此解码规则’就是这个解码规则),有一个此复合语种的语句(这个语句含一个英语句子和一个汉语句子,本段以下提到的‘此语句’就是这个语句),此编码规则中用于英语句体编码的主要是单词和词根和元辅音等音码,用于汉语句体编码的主要是词音码和元音辅音组合音码,此解码规则中用于英语句体解码的也主要是单词和词根和元辅音等音码,用于汉语句体解码的也主要是词音码和元音辅音组合音码,然后编码时,对此语句(含有一个英语句子和一个汉语句子),此编码规则按照本说明书述的直播方式的以只适用于英语为例的编码规则里的编码细则来对这一个英语句子编码生成头段,单词段,时态词形选择段,陌生单词段等(可称为英语的头段,单词段,时态词形选择段,陌生单词段等),此编码规则按照本说明书述的直播方式的以只适用于汉语为例的编码规则里的编码细则来对这一个汉语句子编码生成头段,词段,陌生词段等(可称为汉语的头段,词段,陌生词段等),然后把英语的头段,单词段,时态词形选择段,陌生单词段与汉语的头段,词段,陌生词段并排起来即成表示此语句发音的拼合信息的值;在用此解码规则解码时,从这拼合信息的值里分别取出英语的头段,单词段,时态词形选择段,陌生单词段与汉语的头段,词段,陌生词段,然后此解码规则按照本说明书述的直播方式的以只适用于英语为例的解码规则里的解码细则来对英语的头段,单词段,时态词形选择段,陌生单词段等进行解码播放这个英语句子的发音(比如与拾音器有关的存储器里有英语的相关音码的发音数据),此解码规则再按照本说明书述的直播方式的以只适用于汉语为例的解码规则里的解码细则来对汉语的头段,单词段,时态词形选择段,陌生单词段等进行解码播放这个汉语句子的发音(比如与拾音器有关的存储器里有汉语的相关音码的发音数据),这样英语句子的发音和汉语句子的发音就组成了此语句的发音。可以就把以只适用于英语为例的编码规则里的编码细则和以只适用于汉语为例的编码规则里的编码细则简单并排合起来(还可能需再加一些别的辅助编码细则)即成此编码规则,可以把以只适用于英语为例的编码规则使用的编码数据和以只适用于汉语为例的编码规则使用的编码数据简单合起来即成此编码规则使用的编码数据,此编码规则中的所有编码细则就是此编码规则中有关此复合语种(英语加汉语)的编码细则;可以就把以只适用于英语为例的解码规则里的解码细则和以只适用于汉语为例的解码规则里的解码细则简单并排合起来(还可能需再加一些别的辅助解码细则)即成此解码规则,可以把以只适用于英语为例的解码规则使用的解码数据和以只适用于汉语为例的解码规则使用的解码数据简单合起来即成此解码规则使用的解码数据,此解码规则中的所有解码细则就是此解码规则中有关此复合语种(英语加汉语)的解码细则。Examples of encoding and decoding of compound languages: If there is a compound language that is English plus Chinese, there is an encoding rule that is only applicable to this compound language (the 'this encoding rule' mentioned below in this paragraph is this encoding rule), and there is a Only applicable to the decoding rules of this compound language (the 'this decoding rule' mentioned below in this paragraph is this decoding rule), there is a sentence in this compound language (this sentence contains an English sentence and a Chinese sentence, and the following paragraphs mentioned Arrived 'this sentence' is exactly this sentence), in this encoding rule, what is used for English sentence style coding is mainly word and word root and vowel consonant code, and what is used for Chinese sentence style coding is mainly word phonetic code and vowel consonant Combination phonetic sign indicating number, also mainly sound sign indicating number such as word and word root and vowel consonant for English sentence style decoding in this decoding rule, also mainly word sound sign indicating number and vowel consonant combination sound sign indicating number for Chinese sentence style decoding, then When coding, for this sentence (contains an English sentence and a Chinese sentence), the coding rule is to code and generate the English sentence according to the coding rules in the coding rules of the live broadcast method described in this manual, which are only applicable to English as an example. Head section, word section, tense word form selection section, unfamiliar word section, etc. In the live broadcast mode, the encoding rules in the encoding rules that are only applicable to Chinese as an example are used to encode this Chinese sentence to generate headers, phrases, unfamiliar phrases, etc. (which can be called Chinese headers, phrases, and unfamiliar phrases) etc.), then the head section of English, the word section, the tense word form selection section, the unfamiliar word section and the Chinese head section, the word section, and the unfamiliar word section are arranged side by side to form the value of the combined information representing the pronunciation of this sentence; When decoding with this decoding rule, take out the English header, word segment, tense word form selection segment, unfamiliar word segment and Chinese head segment, word segment, and unfamiliar word segment from the value of the combined information, and then decode The rules are based on the decoding rules in the decoding rules that are only applicable to English as an example in the live broadcast method described in this manual to decode the English head section, word section, tense word form selection section, unfamiliar word section, etc. Play this English sentence Pronunciation (for example, there are pronunciation data of English-related phonetic codes in the memory related to the pickup), the decoding rules will be used for Chinese according to the decoding rules in the decoding rules of the live broadcast mode described in this manual, which are only applicable to Chinese as an example. The head section, word segment, tense word form selection segment, unfamiliar word segment etc. are decoded and play the pronunciation of this Chinese sentence (such as the pronunciation data of the relevant phonetic code of Chinese in the storage device relevant to the pickup), so that the pronunciation of the English sentence And the pronunciation of the Chinese sentence constitutes the pronunciation of this sentence. The encoding rules in the encoding rules that are only applicable to English as an example and the encoding rules in the encoding rules that are only applicable to Chinese as an example can be simply arranged side by side (and some other auxiliary encoding rules may be added) that is To form this encoding rule, the encoding data used by the encoding rule only applicable to English and the encoding data used by the encoding rule applicable only to Chinese can be simply combined to form the encoding data used by this encoding rule. All the coding rules in the coding rules are the coding rules about this compound language (English plus Chinese) in the coding rules; you can take the decoding rules in the decoding rules that are only applicable to English as an example and the codes that are only applicable to Chinese as an example The decoding rules in the decoding rules are simply combined side by side (and some other auxiliary decoding rules may need to be added) to form this decoding rule. You can combine the decoding data used by the decoding rules that are only applicable to English as an example and those that are only applicable to The decoding data used in the decoding rule for Chinese as an example is simply combined to become the decoding data used in the decoding rule, and all the decoding rules in the decoding rule are the decoding rules in the decoding rule for the compound language (English plus Chinese).

如一个语句本身就是一个句体,本发明文件也说此语句含有这一个句体。As a sentence itself is a sentence body, the document of the present invention also says that this sentence contains this sentence body.

拼合信息的值在码字里的存在形式,除了可以是把拼合信息的值的BIT放在一个码字里或分开放在多个码字里(假设这句说的方式是方式1),或是把数个拼合信息的值的变换值放在一个码字里或分开放在多个码字里(假设这句说的方式是方式2)外,还可能有本说明书没提到的其它方式,这些其它方式应看成与前面方式1和方式2的等同替换。The existence form of the value of the combined information in the codeword, except that the BIT of the value of the combined information can be placed in one codeword or separated into multiple codewords (assuming that the method mentioned in this sentence is mode 1), or It is to put the transformation values of the values of several spliced information into one codeword or separate them into multiple codewords (assuming that the method mentioned in this sentence is method 2), there may be other methods not mentioned in this manual , these other ways should be regarded as equivalent replacements with theprevious ways 1 and 2.

本发明文件约定一本有声读物里定义的要发音的语句总是互不同的,即使是发音相同的两个语句也是被看成不同的语句,本发明文件约定在一本有声读物整页面上诸个码字中的所有码字里凭各种方式存在(也即可读码得到的)的所有拼合信息的值的BIT中(比如所有码字包含码字1和码字2和码字3,拼合信息的值1的BIT放在码字1里,拼合信息的值2的BIT放在码字2里,拼合信息的值3的变换值放在码字3里,拼合信息的值4的BIT放在码字1和码字2里,码字1和码字2和码字3里都不再有有关其它拼合信息的值的值,那此有声读物整页面上诸个码字中的所有码字里凭各种方式存在的所有拼合信息的值的BIT,就是拼合信息的值1的BIT和拼合信息的值2的BIT和拼合信息的值3的BIT和拼合信息的值4的BIT),总是不存在两个拼合信息的值的BIT是表示同一语句发音的的情况,也即没有相同的拼合信息的值;所以在一个码字里,可能放有两个拼合信息的值,其中一个拼合信息的值涉及的BIT与另一个拼合信息的值涉及的BIT完全相同(即各自解码出来是相同发音),那这两个拼合信息的值被看成是互不同的拼合信息的值;在一个码字里,可能放有一个拼合信息的值的变换值和另一个拼合信息的值本身,其中一个拼合信息的值涉及的BIT与另一个拼合信息的值涉及的BIT完全相同(即各自解码出来是相同发音),那这两个拼合信息的值被看成是互不同的拼合信息的值;在一个码字里,如放有一个拼合信息的值,并且还放有另一个与这个拼合信息的值涉及的BIT值完全相同的拼合信息的值的部分,那这两个拼合信息的值被看成是互不同的拼合信息的值;还可能有两个拼合信息的值,其中一个拼合信息的值涉及的BIT与另一个拼合信息的值涉及的BIT完全相同(即各自解码出来是相同发音),这两个拼合信息的值分别处于一本有声读物整页面上的不同码字里,那这两个拼合信息的值被看成是互不同的拼合信息的值;其它情况依此类推。本发明文件约定一本有声读物里定义的任何语句与另一本有声读物里定义的任何语句都是不同语句(不论发音是否相同都是不同语句)。The document of the present invention stipulates that the sentences to be pronounced defined in an audio book are always different from each other, and even two sentences with the same pronunciation are regarded as different sentences. In all code words in a code word, in the BIT of the value (such as all code words comprise code word 1 and code word 2 and code word 3, The BIT with value 1 of the combined information is placed in codeword 1, the BIT with value 2 of combined information is placed in codeword 2, the transformed value of combined information with value 3 is placed in codeword 3, and the BIT with value 4 of combined information is placed in codeword 3 Put in codeword 1 and codeword 2, codeword 1, codeword 2 and codeword 3 have no value about the value of other combined information, then all the codewords on the entire page of the audiobook The BIT of the value of all the merged information existing in various ways in the codeword is the BIT of the value 1 of the merged information, the BIT of the value 2 of the merged information, the BIT of the value 3 of the merged information, and the BIT of the value 4 of the merged information) , the BIT that always does not have two values of the combined information is the situation that the same sentence is pronounced, that is, there is no value of the same combined information; so in a codeword, there may be two values of the combined information, of which The BIT involved in the value of one piece of information is exactly the same as the BIT involved in the value of another piece of information (that is, they are decoded to have the same pronunciation), then the values of the two pieces of information are regarded as the values of the pieces of information that are different from each other; In a codeword, there may be a transformed value of the value of a pieced together information and the value itself of another pieced together information, wherein the BIT involved in the value of one pieced together information is exactly the same as the BIT involved in the value of another pieced together information (that is, each It is the same pronunciation when decoded), then the values of these two combined information are regarded as the values of different combined information; in a codeword, if there is a value of combined information, and another value with this The value of the combined information involves the BIT value of the part of the value of the combined information that is exactly the same, then the values of the two combined information are regarded as the values of the combined information that are different from each other; there may also be two values of the combined information, one of which is The BIT involved in the value of the pieced information is exactly the same as the BIT involved in the value of another pieced information (that is, they are decoded to have the same pronunciation), and the values of the two pieces of information are in different codewords on the entire page of an audiobook. , then the values of these two combined information are regarded as the values of different combined information; other cases can be deduced by analogy. The invention document stipulates that any sentence defined in one audiobook is a different sentence from any sentence defined in another audiobook (no matter whether the pronunciation is the same or not).

(本段和下一段各处提到的‘某语种’可以是印欧语系任一语种或非印欧语系任一语种或任一复合语种),对于本发明文件所提及的某编码规则中有关某语种的编码细则(或某只适用于此语种的编码规则)可用于对此语种的语句进行编码以生成拼合信息的值(或类似说法),其中所说的此语种的语句是属于此编码规则中有关此语种的编码细则定义的此语种的编码语句集的;对于本发明文件所提及的某解码规则中有关某语种的解码细则(或某只适用于此语种的解码规则)可用于对表示此语种的语句发音的拼合信息的值进行解码(或类似说法),其中所说的此语种的语句是属于此解码规则中有关此语种的解码细则定义的此语种的解码语句集的;对于本发明文件所提及的某解码程序及解码数据可用于对表示某语种的语句发音的拼合信息的值进行解码(或类似说法),其中所说的某语种的语句是属于某解码规则中有关此语种的解码细则定义的此语种的解码语句集的,此解码程序是按照此解码规则中有关此语种的解码细则来解码的。("a certain language" mentioned in this paragraph and the next paragraph can be any language of the Indo-European language family or any language of the non-Indo-European language family or any composite language), for a certain coding rule mentioned in the document of the present invention Encoding rules for a language (or an encoding rule that applies only to that language) can be used to encode sentences in that language to generate a value (or similar) for a flattened message, where the spoken sentence in that language belongs to the The encoding sentence set of this language defined by the encoding rules about this language in the coding rules; the decoding rules about a certain language in a certain decoding rule mentioned in the document of the present invention (or a certain decoding rule that is only applicable to this language) can be used It is used to decode (or similarly say) the value of the combined information representing the pronunciation of sentences in this language, where the spoken sentences in this language belong to the decoding sentence set of this language defined in the decoding rules for this language in this decoding rule ; For a certain decoding program mentioned in the document of the present invention and the decoding data, it can be used to decode the value of the combined information representing the pronunciation of a certain language (or a similar statement), wherein said certain language sentence belongs to a certain decoding rule For the decoding sentence set of this language defined in the decoding rules of this language, the decoding program is decoded according to the decoding rules of this language in this decoding rule.

本发明文件提及的适用于某语种(可以是只适用于此语种或适用于包括此语种在内的多个语种的)的编码规则,所说的适用于某语种是指适用于此语种的编码语句集(即此编码规则可对此编码语句集里的语句进行编码),此编码语句集是由前述编码规则中有关此语种的编码细则定义的。本发明文件提及的适用于某语种(可以是只适用于此语种或适用于包括此语种在内的多个语种的)的解码规则,所说的适用于某语种是指适用于此语种的解码语句集(即此解码规则可对表示此解码语句集里的任一语句发音的拼合信息的值进行解码),此解码语句集是由前述解码规则中有关此语种的解码细则定义的。本发明文件提及的适用于某语种(可以是只适用于此语种或适用于包括此语种在内的多个语种的)的解码程序及解码数据,所说的适用于某语种是指适用于此语种的解码语句集(即此解码程序可对表示此解码语句集里的任一语句发音的拼合信息的值进行解码),此解码语句集是由解码规则中有关此语种的解码细则定义的,此解码程序是按照此解码规则中有关此语种的解码细则来解码的。The coding rules applicable to a certain language mentioned in the document of the present invention (may be only applicable to this language or applicable to multiple languages including this language), and the said applicable to a certain language refers to the coding rules applicable to this language Coding statement set (that is, the encoding rule can encode the statements in the encoding statement set), and the encoding statement set is defined by the encoding rules about this language in the aforementioned encoding rules. The decoding rules applicable to a certain language (which may be applicable only to this language or applicable to multiple languages including this language) mentioned in the document of the present invention, said applicable to a certain language refers to the decoding rules applicable to this language The decoding sentence set (that is, the decoding rule can decode the value of the combined information representing the pronunciation of any sentence in the decoding sentence set), and the decoding sentence set is defined by the decoding rules related to this language in the aforementioned decoding rules. The decoding program and decoding data that are applicable to a certain language (which may be applicable only to this language or to multiple languages including this language) mentioned in the document of this invention, the said applicable to a certain language means applicable to The decoding sentence set of this language (that is, the decoding program can decode the value of the combined information representing the pronunciation of any sentence in this decoding sentence set), and this decoding sentence set is defined by the decoding rules related to this language in the decoding rules , the decoding program decodes according to the decoding rules for this language in the decoding rules.

如没特别指明有他义,以下5段提到的语种默认可以是印欧语系或非印欧语系任一语种,也可以是任一复合语种。Unless otherwise specified, the languages mentioned in the following five paragraphs can be either Indo-European or non-Indo-European by default, or any compound language.

对于某一语种,对于某编码规则1中有关此语种的编码细则,那一般需专门对应配套设计一种解码规则1中有关此语种的解码细则,此编码规则1中有关此语种的编码细则定义的语句集与此解码规则1中有关此语种的解码细则定义的语句集一般是相同的,此编码规则1中有关此语种的编码细则里音码的定义方式及使用的音码情况与此解码规则1中有关此语种的解码细则里音码的定义方式及使用的音码情况是相同或兼容的,此解码规则1中有关此语种的解码细则与此编码规则1中有关此语种的编码细则是专门对应的;但也可能有另一种编码规则2中有关此语种的编码细则,那一般需专门对应配套设计另一种解码规则2中有关此语种的编码细则,此编码规则2中有关此语种的编码细则定义的语句集与此解码规则2中有关此语种的解码细则定义的语句集一般是相同的,此编码规则2中有关此语种的编码细则里音码的定义方式及使用的音码情况与此解码规则2中有关此语种的解码细则里音码的定义方式及使用的音码情况是相同或兼容的,此解码规则2中有关此语种的解码细则与此编码规则2中有关此语种的编码细则是专门对应的;由编码规则1中有关此语种的编码细则编码生成的拼合信息的值可能可由解码规则2中有关此语种的解码细则解码,如这种解码(这种解码是指上句说的解码)可行的话那此解码规则2中有关此语种的解码细则与此编码规则1中有关此语种的编码细则是一般对应的;由编码规则2中有关此语种的编码细则编码生成的拼合信息的值可能可由解码规则1中有关此语种的解码细则解码,如这种解码(这种解码是指上句说的解码)可行的话那此解码规则1中有关此语种的解码细则与此编码规则1中有关此语种的编码细则是一般对应的。For a certain language, for the coding rules related to this language in acertain coding rule 1, it is generally necessary to design amatching decoding rule 1 related to this language. The coding rules related to this language in thiscoding rule 1 are defined The sentence set is generally the same as the sentence set defined in the decoding rules for this language in thisdecoding rule 1. The definition of the phonetic code and the used phonetic code in the coding rules for this language in thiscoding rule 1 are the same as this decoding The definition of the phonetic code and the phonetic code used in the decoding rules for this language inRule 1 are the same or compatible. The decoding rules for this language in thisdecoding rule 1 are the same as the coding rules for this language in thiscoding rule 1. It is specifically corresponding; but there may also be anotherencoding rule 2 related to the coding rules of this language, which generally needs to be specially designed to match the coding rules related to this language in anotherdecoding rule 2, and thisencoding rule 2 is related to The sentence set defined by the coding rules of this language is generally the same as the sentence set defined by the decoding rules of this language in thisdecoding rule 2. The phonetic code is the same or compatible with the definition of the phonetic code and the used phonetic code in the decoding rules for this language in thisdecoding rule 2. The decoding rules for this language in thisdecoding rule 2 are the same as those in thiscoding rule 2. The coding details about this language are specifically corresponding; the value of the stitching information generated by encoding the coding rules about this language inencoding rule 1 may be decoded by the decoding rules about this language indecoding rule 2, such as this decoding (this Decoding refers to the decoding in the previous sentence) If it is feasible, then the decoding rules about this language in thisdecoding rule 2 are generally corresponding to the encoding rules about this language in thisencoding rule 1; by the encoding rules about this language inencoding rule 2 The value of the merged information generated by the detailed code may be decoded by the decoding rules related to this language indecoding rule 1. If this kind of decoding (this decoding refers to the decoding mentioned in the previous sentence) is feasible, then the decoding rules related to this language inrule 1 The decoding rules generally correspond to the coding rules about this language in thiscoding rule 1.

某解码规则中有关某语种的解码细则与某编码规则中有关此语种的编码细则专门对应是指此解码规则中有关此语种的解码细则是专门用于解码用此编码规则中有关此语种的编码细则编码生成的此语种的拼合信息的值的,解码时所涉及的数据与编码时涉及数据一般是相同或兼容的,此解码规则中有关此语种的解码细则定义的解码语句集与此编码规则中有关此语种的编码细则定义的编码语句集一般是相同的,此编码规则中有关此语种的编码细则里音码的定义方式及使用的音码情况与此解码规则中有关此语种的解码细则里音码的定义方式及使用的音码情况是相同或兼容的,此解码规则中涉及此语种的各解码细则与此编码规则中涉及此语种的各编码细则是相同或对应的,如编解码涉及压缩处理的话那此解码规则中有关此语种的解码细则涉及的各压缩解码处理与此编码规则中有关此语种的编码细则涉及的各压缩编码处理一般是对应的,此解码规则中有关此语种的解码细则和此编码规则中有关此语种的编码细则一般都是直播方式或都是TTS方式或都是混合方式的;某解码规则中有关某语种的解码细则与某编码规则中有关某语种的编码细则是一般对应是指此解码规则中有关此语种的解码细则可以用于但不是专门用于解码用此编码规则中有关此语种的编码细则编码生成的此语种的拼合信息的值的,解码时所涉及的数据与编码时涉及数据一般是相同或兼容的,此解码规则中有关此语种的解码细则定义的解码语句集与此编码规则中有关此语种的编码细则定义的编码语句集是完全相同或部分相同的,此编码规则中有关此语种的编码细则里音码的定义方式及使用的音码情况与此解码规则中有关此语种的解码细则里音码的定义方式及使用的音码情况一般是相同或兼容的,此解码规则中涉及此语种的各解码细则与此编码规则中涉及此语种的各编码细则一般是相同或对应的,如编解码涉及压缩处理的话那此解码规则中有关此语种的解码细则涉及的各压缩解码处理与此编码规则中有关此语种的编码细则涉及的各压缩编码处理一般是对应的,对于此解码规则中有关此语种的解码细则与此编码规则中有关此语种的编码细则,如前者是直播方式的而后者可以是直播方式或是TTS方式或混合方式的,如前者是TTS方式的而后者可以是直播方式或是TTS方式或混合方式的,如前者是混合方式的而后者可以是直播方式或是TTS方式或混合方式的。本发明文件所说的解码与编码专门对应(或一般对应或对应)是指此解码是按某解码规则中有关某语种的解码细则来作的,此编码是按某编码规则中有关此语种的编码细则来作的,且此解码规则中有关此语种的解码细则与此编码规则中有关此语种的编码细则是专门对应(或一般对应或对应)的。The decoding rules about a certain language in a decoding rule correspond specifically to the coding rules about this language in a certain encoding rule, which means that the decoding rules about this language in this decoding rule are specially used for decoding the codes about this language in this encoding rule. For the value of the combined information of this language generated by the coding of the rules, the data involved in the decoding is generally the same or compatible with the data involved in the encoding. The set of coding sentences defined in the coding rules for this language is generally the same. The definition of the phonetic codes and the phonetic codes used in the coding rules for this language in the coding rules are the same as the decoding rules for this language in the decoding rules. The definition method of the sound code and the situation of the used sound code are the same or compatible, and the decoding rules related to this language in this decoding rule are the same or corresponding to the coding rules related to this language in this encoding rule, such as codec If compression processing is involved, the compression and decoding processes involved in the decoding rules for this language in the decoding rules generally correspond to the compression encoding processes involved in the encoding rules for this language in the encoding rules. The decoding rules and the coding rules for this language in this coding rule are generally live broadcast or TTS or mixed; the decoding rules for a certain language in a certain decoding rule are the same as the coding rules for a certain language Coding rules are general correspondences, which means that the decoding rules about this language in this decoding rule can be used but not specifically used to decode the value of the combined information of this language generated by encoding with the encoding rules about this language in this encoding rule. The data involved in encoding is generally the same or compatible, and the set of decoding sentences defined in the decoding rules for this language in this decoding rule is exactly the same as the set of encoding sentences defined in the encoding rules for this language in this encoding rule. Identical or partially identical, the definition of the phonetic code and the phonetic code used in the coding rules of this language in this coding rule are the same as the definition of the phonetic code and the phonetic code used in the decoding rules of this language in this decoding rule The situation is generally the same or compatible. The decoding rules involving this language in the decoding rules are generally the same or corresponding to the encoding rules involving this language in the encoding rules. If the encoding and decoding involve compression processing, then the decoding rules The compression and decoding processes involved in the decoding rules for this language generally correspond to the compression and encoding processes involved in the encoding rules for this language in this encoding rule. For the decoding rules for this language in this decoding rule and the encoding rules for this Regarding the encoding details of this language, if the former is live broadcast and the latter can be live broadcast or TTS or mixed, if the former is TTS and the latter can be live or TTS or mixed, such as The former is a hybrid method and the latter can be a live broadcast or a TTS method or a hybrid method. The said decoding and encoding in the document of the present invention correspond specifically (or generally correspond or correspond) to mean that the decoding is done according to the decoding rules related to a certain language in a certain decoding rule, and this encoding is done according to the relevant language in a certain encoding rule. The coding rules are done, and the decoding rules about this language in the decoding rules are specially corresponding (or generally corresponding or corresponding) to the coding rules about this language in the encoding rules.

如某解码规则中有关某语种的解码细则与某编码规则中有关此语种的编码细则对应,那对应的情况可以是专门对应或一般对应。本发明文件所说的有关解码规则与编码规则的对应(或专门对应或一般对应)是指解码规则中有关某语种的解码细则与编码规则中有关此语种的编码细则对应(或专门对应或一般对应),如果此解码规则是只适用于此语种,此编码规则也是只适用于此语种的,那可说成此解码规则与此编码规则对应(或专门对应或一般对应)。If the decoding rules about a certain language in a certain decoding rule correspond to the coding rules about this language in a certain encoding rule, the corresponding situation can be special correspondence or general correspondence. The correspondence (or special correspondence or general correspondence) of the relevant decoding rules and coding rules mentioned in the document of the present invention refers to the correspondence (or special correspondence or general correspondence) between the decoding rules of a certain language in the decoding rules and the coding rules of this language in the coding rules. Corresponding), if this decoding rule is only applicable to this language, and this encoding rule is also only applicable to this language, then it can be said that this decoding rule corresponds to this encoding rule (or specifically corresponds or generally corresponds).

本发明文件各处所说的某解码规则中有关某语种的解码细则与某编码规则中有关此语种的编码细则对应(或一般对应或专门对应),与说某解码规则与某编码规则有关此语种对应(或一般对应或专门对应)是等价的。The decoding rules about a certain language in a certain decoding rule mentioned in the documents of the present invention correspond to the coding rules about this language in a certain coding rule (or generally correspond or specifically correspond), and say that a certain decoding rule is related to a certain coding rule for this language Correspondences (or general or specific correspondences) are equivalent.

本说明书在讨论有关某语种的编解码原理时,有说由某些种音码可拼合语句集发音,且如没特别注明有他义的话那默认这里提及的此语种的语句集是泛指的(此泛指的语句集里包括的语句个数可以是大于等于1的任意数目,如把此语种的所有语句当成一个大集合,那此泛指的语句集就可以是由在这个大集合里随意地取任意个数的语句来组成的语句集,比如把大集合里的所有语句顺序编号,随意地取2个语句就可取大集合里的1号语句和3号语句,或取2号语句和6号语句等,随意地取3个语句就可取大集合里的1号语句和3号语句和5号语句,或取2号语句和5号语句和6号语句等),然后在具体涉及某编码规则中有关此语种的编码细则时定义的此语种的编码语句集就可以是本段前面提及的语句集的一个具体实例,而且存在(即使实际上不存在,理论上也是存在的)与此编码规则中有关此语种的编码细则专门对应的某解码规则中有关此语种的解码细则,其定义的此语种的解码语句集与本段前面述的编码语句集是完全相同的;比如在本说明书第11页,第12页所述及的对印欧语系任一语种(可以是英语),可由某些种音码来拼合语句集发音,提到的各语句集都是泛指的,然后在后面的以只适用于英语为例的编码规则里定义的(也即在后面的以只适用于英语为例的编码规则中有关英语的编码细则里定义的)编码语句集就是上一句提及的各语句集中的一个语句集的一个具体实例,后面的以只适用于英语为例的解码规则里定义的(也即后面的以只适用于英语为例的解码规则中有关英语的解码细则里定义的)解码语句集与上一句提及的编码语句集是相同的,此解码规则与此编码规则是专门对应的。在述及编码规则中有关某语种的编码细则时,在其涉及定义的语句集前面加有‘编码’作限定的,因为用此编码规则中有关此语种的编码细则编码得到的拼合信息的值可能由不是与此编码规则中有关此语种的编码细则专门对应的解码规则中有关此语种的解码细则来解码,且此编码规则中有关此语种的编码细则定义的编码语句集与此解码规则中有关此语种的解码细则定义的解码语句集一般不同,甚至用此编码规则中有关此语种的编码细则编码得到的拼合信息的值可以是由与此编码规则中有关此语种的编码细则没对应关系的某解码规则中有关另一语种(比如本句说的‘此语种’是一非复合语种且本句说的‘另一语种’是一复合语种)的解码细则来解码的,所以编码时涉及定义的语句集前面加有‘编码’作限定;同理,在述及解码规则中有关某语种的解码细则时,也在其涉及定义的语句集前面加有‘解码’作限定的。如提到时无特别指明有他意,本段各处提到的语种默认可以是任一非复合语种或任一复合语种。When discussing the coding and decoding principles of a certain language in this manual, it is said that certain phonetic codes can be combined to form a sentence set for pronunciation. Refers to (the number of sentences included in this general sentence set can be any number greater than or equal to 1, if all the sentences in this language are regarded as a large set, then this general sentence set can be composed of this large set A statement set composed of arbitrary numbers of statements in the set, such as numbering all the statements in the large set sequentially, and randomly taking 2 statements can be taken asstatement 1 andstatement 3 in the large set, or 2 Statement No. 6 and Statement No. 6, etc., if you randomly take 3 statements, you can choose Statement No. 1, Statement No. 3 and Statement No. 5 in the large set, or statement No. 2, statement No. 5 and statement No. 6, etc.), and then in Specifically related to the coding rules of this language in a certain coding rule, the coded sentence set of this language can be a specific example of the sentence set mentioned earlier in this paragraph, and it exists (even if it does not exist in practice, it also exists in theory. ) The decoding rules for this language in a certain decoding rule that specifically correspond to the coding rules for this language in this encoding rule, and the decoding statement set for this language defined by it is exactly the same as the encoding statement set mentioned above in this paragraph; For example, on the 11th page of this manual, the 12th page described to any language of the Indo-European language family (can be English), can be combined sentence set pronunciation by some kinds of phonetic codes, and each sentence set mentioned is a general reference. , then the encoding statement set defined in the encoding rules that are only applicable to English as an example (that is, defined in the encoding rules about English in the encoding rules that are only applicable to English as an example) is the above A specific example of a sentence set in each sentence set mentioned in a sentence, defined in the following decoding rules that are only applicable to English The decoding statement set defined in the decoding rules) is the same as the encoding statement set mentioned in the previous sentence, and this decoding rule corresponds specifically to this encoding rule. When referring to the coding rules of a certain language in the coding rules, it is limited by adding "coding" in front of the statement set involved in the definition, because the value of the combined information obtained by coding with the coding rules of the coding rules about this language may be decoded by a decoding rule for this language in a decoding rule that does not specifically correspond to the encoding rule for this language in this encoding rule, and the encoding statement set defined by the encoding rule for this language in this encoding rule is the same as that in this decoding rule The decoding sentence sets defined by the decoding rules of this language are generally different, and even the value of the combined information encoded by the encoding rules of this language in this encoding rule may not correspond to the encoding rules of this language in this encoding rule. It is decoded by the decoding rules related to another language in a certain decoding rule (for example, the "this language" mentioned in this sentence is a non-compound language and the "another language" mentioned in this sentence is a compound language), so the encoding involves Defined sentence sets are limited by 'encoding'; similarly, when referring to the decoding rules of a certain language in the decoding rules, 'decoding' is also added in front of the defined sentence sets. If there is no special intention when mentioned, the language mentioned in this paragraph can be any non-composite language or any compound language by default.

本说明书在讨论编码规则时,有说由一些种音码(比如单词音码和词根音码和元辅音音码)可拼合编码规则中有关某语种的编码细则定义的编码语句集发音,是指编码语句集中的任一语句都可由这些种音码来编码得到表示此语句发音的拼合信息的值;When discussing the coding rules in this manual, it is said that the coding sentence set pronunciation defined by the coding rules of a certain language in the coding rules that can be combined by some kinds of phonetic codes (such as word phonetic codes and root phonetic codes and vowel consonant codes) refers to Any sentence in the coded sentence set can be encoded by these kinds of phonetic codes to obtain the value of the combined information representing the pronunciation of this sentence;

本说明书在讨论解码规则时,有说由一些种音码(比如单词音码和词根音码和元辅音音码)可拼合解码规则中有关某语种的解码细则定义的解码语句集发音,是指表示此解码语句集中任一语句发音的拼合信息的值都可由此解码规则中有关此语种的解码细则解码,且解码时是用这些种音码来拼合此语句发音的;When discussing the decoding rules in this manual, it is said that the pronunciation of the decoding sentence set defined by the decoding rules about a certain language in the decoding rules can be combined by some kinds of phonetic codes (such as word phonetic codes and root phonetic codes and vowel consonant codes). Indicates that the value of the stitching information of any sentence pronunciation in the decoding sentence set can be decoded by the decoding rules related to this language in the decoding rules, and these phonetic codes are used to stitch the pronunciation of the sentence during decoding;

如一个语句既属于某编码规则中有关某语种A的编码细则定义的编码语句集,又属于某解码规则中有关某语种B的解码细则定义的解码语句集,那用此编码规则中有关语种A的编码细则编码生成的表示此语句发音的拼合信息的值,就可能可由此解码规则中有关语种B的解码细则来解码(一般要求编码规则中有关语种A的编码细则里音码的定义方式及使用的音码情况与解码规则中有关语种B的解码细则里音码的定义方式及使用的音码情况相同或兼容);本段说的语种A和语种B可以是相同语种或不同语种。If a sentence belongs to the encoding statement set defined by the encoding rules of a certain language A in a certain encoding rule, and belongs to the decoding statement set defined in the decoding rules of a certain language B in a certain decoding rule, then use the relevant language A in this encoding rule The value of the combined information representing the pronunciation of the sentence generated by the coding rules of the code may be decoded by the decoding rules of the relevant language B in the decoding rules (generally requiring the definition of the phonetic code in the coding rules of the language A in the coding rules and The phonetic code situation that uses is the same or compatible with the definition of the phonetic code and the phonetic code situation used in the decoding rules of the relevant language B in the decoding rules); the language A and language B mentioned in this paragraph can be the same language or different languages.

如没特别指明有他义,以下2段各处提到的语种默认可以是印欧语系或非印欧语系任一语种,也可以是任一复合语种。Unless otherwise specified, the languages mentioned in the following two paragraphs can be either Indo-European or non-Indo-European by default, or any compound language.

本说明书述的直播方式的以只适用于英语为例的解码规则与直播方式的以只适用于英语为例的编码规则有关英语是专门对应的,直播方式的以只适用于汉语为例的解码规则与直播方式的以只适用于汉语为例的编码规则有关汉语是专门对应的,TTS方式的以只适用于英语为例的解码规则与TTS方式的以只适用于英语为例的编码规则有关英语是专门对应的,TTS方式的以只适用于汉语为例的解码规则与TTS方式的以只适用于汉语为例的编码规则有关汉语是专门对应的;除此之外,除非提及时特别指明有他义,本发明文件其它各处提及的解码规则与编码规则有关某语种对应,默认既可以是专门对应,也可以是一般对应,在一般对应时,本发明文件提及的用此编码规则中有关此语种的编码细则编码生成的表示编码语句集中有些语句中任一语句发音的拼合信息的值,是可以被此解码规则中有关此语种的解码细则解码的,但也可能用此编码规则中有关此语种的编码细则编码生成的表示编码语句集中有些语句中任一语句发音的拼合信息的值不能被此解码规则中有关此语种的解码细则解码(前面这3句话提到的编码语句集是指此编码规则中有关此语种的编码细则定义的编码语句集)。The decoding rules of the live broadcast mode described in this manual are only applicable to English as an example and the encoding rules of the live broadcast mode are only applicable to English as an example. The rules are specifically corresponding to the encoding rules of the live broadcast method that are only applicable to Chinese as an example, and the decoding rules of the TTS method that are only applicable to English as an example are related to the encoding rules of the TTS method that are only applicable to English as an example English is specially corresponding, and the decoding rules of the TTS method that are only applicable to Chinese as an example are specifically corresponding to the encoding rules of the TTS method that are only applicable to Chinese as an example; other than that, unless otherwise specified when mentioning There are other meanings. The decoding rules mentioned elsewhere in the document of the present invention correspond to certain languages related to the encoding rules. The default can be a special correspondence or a general correspondence. The value generated by the coding rules about this language in the rules, representing the combined information of any sentence in some sentences in the coded sentence set, can be decoded by the decoding rules about this language in this decoding rule, but it is also possible to use this code The value of the combined information generated by encoding the detailed coding rules for this language in the rule to represent the pronunciation of any sentence in the coded sentence set cannot be decoded by the detailed decoding rules for this language in the decoding rule (the codes mentioned in the previous three sentences The sentence set refers to the set of coding sentences defined in the coding rules for this language in this coding rule).

对某语种,也可把本发明文件所说的某解码规则中有关此语种的解码细则与某编码规则中有关此语种的编码细则对应,说成此编码规则中有关此语种的编码细则与此解码规则中有关此语种的解码细则对应。对于只适用于一个语种的解码规则和只适用于一个语种的编码规则,也可把本发明文件所说的解码规则中有关此语种的解码细则与编码规则中有关此语种的编码细则对应,说成此解码规则与此编码规则对应,或说成此编码规则与此解码规则对应。(把本段内容中的‘对应’全部换成‘专门对应’或‘一般对应’仍然有效)To certain language, also can corresponding to the relevant coding detailed rules of this language in a certain decoding rule of the said document of the present invention and the relevant coding detailed rules of this language in certain encoding rules, be described as the relevant coding detailed rules of this language in this coding rule and this Corresponds to the decoding rules for this language in the decoding rules. For the decoding rules that are only applicable to one language and the encoding rules that are only applicable to one language, the decoding rules about this language in the said decoding rules of the document of the present invention can also be corresponding to the encoding rules about this language in the encoding rules, saying The decoding rule corresponds to the encoding rule, or the encoding rule corresponds to the decoding rule. (Replacing all the "correspondence" in this paragraph with "special correspondence" or "general correspondence" is still valid)

如提及时无特别指明确切含义,以上8段提及的编码规则可以是只适用于一个语种或适用于多个语种的;如提及时无特别指明确切含义,以上8段提及的解码规则可以是只适用于一个语种或适用于多个语种的。If the exact meaning is not specified when mentioned, the encoding rules mentioned in the above 8 paragraphs can be applicable to only one language or multiple languages; if the exact meaning is not specified when mentioned, the decoding rules mentioned in the above 8 paragraphs can be Is it applicable to only one language or applicable to multiple languages.

如一个解码规则中有关某语种的解码细则涉及的发音形式是全用直接发音方式(即对表示解码语句集中任一语句发音的拼合信息的值都用直接发音方式来进行解码),那一般是全用整步直接发音方式(即对表示解码语句集中任一语句发音的拼合信息的值都用整步直接发音方式进行解码),或全用分步直接发音方式(即对表示解码语句集中任一语句发音的拼合信息的值都用分步直接发音方式进行解码),另也可对表示解码语句集中有些语句中任一语句发音的拼合信息的值用整步直接发音方式进行解码,而对表示解码语句集中另一些语句中任一语句发音的拼合信息的值用分步直接发音方式进行解码(甚至对表示解码语句集中任一语句发音的拼合信息的值既可用整步直接发音方式也可用分步直接发音方式进行解码,甚至对表示解码语句集中任一语句发音的拼合信息的值解码时对此语句的部分用整步直接发音方式发音而对此语句的另外部分用分步直接发音方式发音),这种方式与全用整步直接发音方式(或全用分步直接发音方式)是等同替换,这种方式的解码相关处理可参考整步直接发音方式和分步直接发音方式;对于一个解码规则中有关某语种的解码细则涉及的发音形式是全用TTS发音方式的(即对表示解码语句集中任一语句发音的拼合信息的值都用TTS发音方式来进行解码),也可有类似这样的方式,且这种方式与全用整步TTS发音方式(或全用分步TTS发音方式)是等同替换,这种方式的解码相关处理可参考整步TTS发音方式和分步TTS发音方式。(本段提到的语种可以是任一非复合语种或任一复合语种)If the pronunciation forms involved in the decoding rules of a certain language in a decoding rule are all used in the direct pronunciation mode (that is, the value of the combined information representing the pronunciation of any sentence in the decoding sentence set is decoded in the direct pronunciation mode), then generally All use the full-step direct pronunciation mode (that is, decode the value of the combined information that represents any sentence pronunciation in the decoded sentence set), or use the step-by-step direct pronunciation mode (that is, represent any sentence in the decoded sentence set) The value of the combined information of a sentence pronunciation is all decoded with the step-by-step direct pronunciation mode), and the value of the combined information of any sentence pronunciation in some sentences in the decoding statement set can also be decoded with the whole step direct pronunciation mode in addition. The value of the combined information representing the pronunciation of any sentence in other sentences in the decoded sentence set is decoded with the step-by-step direct pronunciation method (even the value of the combined information representing the pronunciation of any sentence in the decoded sentence set can be directly pronounced in a whole step. Decoding in step-by-step direct pronunciation mode, even when decoding the value of the combined information representing the pronunciation of any sentence in the decoded sentence set, pronounce the part of the sentence in full-step direct pronunciation mode and the other part of the sentence in step-by-step direct pronunciation mode Pronunciation), this method is equivalent to the full-step direct pronunciation method (or all step-by-step direct pronunciation method), and the decoding related processing of this method can refer to the whole-step direct pronunciation method and the step-by-step direct pronunciation method; for The pronunciation forms involved in the decoding rules of a certain language in a decoding rule are all in the TTS pronunciation mode (that is, the values of the combined information representing the pronunciation of any sentence in the decoding sentence set are all decoded in the TTS pronunciation mode), and there may also be A method similar to this, and this method is an equivalent replacement with the full-step TTS pronunciation method (or the full-step TTS pronunciation method). The decoding related processing of this method can refer to the whole-step TTS pronunciation method and the step-by-step TTS pronunciation method. Way. (The language mentioned in this paragraph can be any non-compound language or any compound language)

对于一语种,在一个编码规则中,如果有一些编码细则是用于对此语种的一个编码语句集进行编码的,此编码规则中,还有另一些编码细则是用于对此语种的另一个编码语句集进行编码的,这里两个编码语句集可以完全相同或部分相同或完全不同,对这种情况本发明文件是默认这样做的:把这里说的一些编码细则和另一些编码细则合起来当成此编码规则中有关此语种的编码细则1,此编码细则1定义的编码语句集1是前面述的两个编码语句集的合集(对任一语句,如两个编码语句集里同时有此语句,那在合集里只留一个此语句);如在某解码规则中,有一些解码细则是与前述一些编码细则有关此语种专门对应的(这些解码细则对应定义有一个解码语句集),还有另一些解码细则是与前述另一些编码细则有关此语种专门对应的(这些解码细则也对应定义有另一个解码语句集),对这种情况本发明文件是默认这样做的:把这里说的一些解码细则和另一些解码细则合起来当成此解码规则中有关此语种的解码细则1,此解码细则1定义的解码语句集1是前面述的两个解码语句集的合集(对任一语句,如两个解码语句集里同时有此语句,那在合集里只留一个此语句);此解码规则中有关此语种的解码细则1与前述编码规则中有关此语种的编码细则1是专门对应的;如另有一编码规则2,只含有前述另一些编码细则,如另有一解码规则2,只含有前述另一些解码细则,那前述解码规则中有关此语种的解码细则1与编码规则2里这另一些编码细则是一般对应的,解码规则2里这另一些解码细则与前述编码规则中有关此语种的编码细则1是一般对应的;如前述一些解码细则是直播方式的,前述另一些解码细则是直播方式,那它们合起来就是直播方式的,如前述一些解码细则是TTS方式的,前述另一些解码细则是TTS方式,那它们合起来就是TTS方式的,如前述一些解码细则是直播方式的,前述另一些解码细则是TTS方式,那它们合起来就是混合方式的,编码细则的情况与解码细则的情况类似。总之,本发明文件默认对于在一个编码规则中有针对此语种两个或两个以上编码语句集来分别编码的情况都按以上述这样,把分别用于此语种的各编码语句集编码的编码细则都合起来当成此编码规则中有关此语种的编码细则(把各编码语句集也合起来成一编码语句集,并把此编码语句集当成这里把各编码细则合起来得到的此编码规则中有关此语种的编码细则定义的编码语句集);总之,本发明文件默认对于在一个解码规则中有针对此语种两个或两个以上解码语句集来分别解码的情况都按以上述这样,把分别用于此语种的各解码语句集解码的解码细则都合起来当成此解码规则中有关此语种的解码细则(把各解码语句集也合起来成一解码语句集,并把此解码语句集当成这里把各解码细则合起来得到的此解码规则中有关此语种的解码细则定义的解码语句集)。(本段说的解码规则中有关此语种的解码细则1,仍是指解码规则中有关此语种的所有解码细则的集合,本段说的编码规则中有关此语种的编码细则1,仍是指编码规则中有关此语种的所有编码细则的集合;本段提到的语种可以是任一非复合语种或任一复合语种)For a language, in a coding rule, if some coding details are used to encode a set of coded sentences of this language, there are other coding rules in this coding rule, which are used to encode another set of coding sentences of this language. Coding sentence set is encoded, and here two coding sentence sets can be completely identical or partly identical or completely different, and the present invention document is done like this by default in this case: combine some coding detailed rules mentioned here with other coding detailed rules As the coding rules 1 about this language in this coding rule, the coded sentence set 1 defined in this coding rule 1 is the collection of the two coded sentence sets mentioned above (for any sentence, if there is this in the two coded sentence sets simultaneously) sentence, then only one sentence is left in the collection); for example, in a certain decoding rule, there are some decoding rules that are specifically corresponding to the language related to the aforementioned coding rules (these decoding rules correspond to a set of decoding sentences defined), and There are other decoding rules that are specifically corresponding to this language related to the aforementioned other encoding rules (these decoding rules also define another set of decoding sentences correspondingly), and the document of the present invention does this by default in this case: put what is said here Some decoding rules and other decoding rules are combined as the decoding rules 1 about this language in the decoding rules, and the decoding statement set 1 defined in the decoding rules 1 is a collection of the two decoding sentences sets mentioned above (for any sentence, If there is this sentence in two sets of decoding sentences at the same time, then only one sentence is left in the collection); the decoding rule 1 in this decoding rule for this language is specially corresponding to the encoding rule 1 in the aforementioned encoding rule for this language ; If there is another encoding rule 2, which only contains the other encoding rules mentioned above, and if there is another decoding rule 2, which only contains the other decoding rules mentioned above, then in the decoding rules 1 and encoding rules 2 related to this language in the aforementioned decoding rules, this other Some coding rules are generally corresponding, and the other decoding rules in the decoding rule 2 are generally corresponding to the coding rules 1 in the aforementioned coding rules related to this language; for example, some of the aforementioned decoding rules are live broadcast, and the aforementioned other decoding rules are Live mode, then they are combined into live mode, if some of the aforementioned decoding details are in TTS mode, and other aforementioned decoding details are in TTS mode, then they are combined into TTS mode, such as some of the aforementioned decoding details are in live mode, The aforementioned other decoding rules are in the TTS mode, so they are mixed together, and the situation of the encoding rules is similar to that of the decoding rules. In a word, the document of the present invention acquiesces to the situation that there are two or more coded sentence sets for this language in one coding rule to encode separately, all according to the above, the coding of each coded sentence set used for this language respectively The detailed rules are all put together as the coding detailed rules about this language in this coding rule (each coded sentence set is also combined into a coded sentence set, and this coded sentence set is regarded as the relevant coding rules in this coding rule that each coding detailed rule is combined to obtain here) The encoding statement set defined by the encoding rules of the language); in a word, the document of the present invention defaults to the situation that two or more than two decoding statement sets of the language are decoded respectively according to the above in a decoding rule, and use respectively The decoding details of each decoding sentence set decoding in this language are all combined as the decoding rules about this language in this decoding rule (each decoding sentence set is also combined into a decoding sentence set, and this decoding sentence set is regarded as here. The decoding statement set defined by the decoding rules for this language in this decoding rule obtained by combining the rules). (The decoding rules 1 related to this language in the decoding rules mentioned in this paragraph still refer to the collection of all decoding rules related to this language in the decoding rules, and theencoding rules 1 related to this language in the encoding rules mentioned in this paragraph still refer to A collection of all coding rules for this language in the coding rules; the language mentioned in this paragraph can be any non-composite language or any compound language)

对于任一语种(此任一语种可以是任一非复合语种或任一复合语种),本发明文件说到的此语种语句集里语句个数无限(或说此语种语句集范围无限),其实际上一般是指此语句集是包括此语种的任意语句,而如单单从字母(或汉语的几千个字)的角度来说,用26个字母(或汉语的几千个字等)来组成语句,那语句个数就算有几十亿个也总是有限度的,但数目太大难以穷举所以可看成是无限的,如果此语句集是包括此语种的某个范围的任意语句且语句数量很大难以穷举那也可看成语句个数是无限的;而本发明文件说到的某语种语句集里语句个数有限,是指语句集包含的语句个数一般是几百或几千或几万,是可穷举罗列的。(本段所说的语句集是编码语句集时本段内容也成立,本段所说的语句集是解码语句集时本段内容也成立)For any language (this any language can be any non-compound language or any compound language), the number of sentences in the sentence set of this language mentioned in the document of the present invention is infinite (or the range of sentence collection of this language is infinite), and its In fact, it generally means that this sentence set is any sentence that includes this language, and as only from the perspective of letters (or thousands of words in Chinese), use 26 letters (or thousands of words in Chinese, etc.) To form sentences, the number of sentences is always limited even if there are billions, but the number is too large to be exhaustive, so it can be regarded as infinite. If the sentence set includes any sentences in a certain range of this language And the number of sentences is very large and difficult to exhaustively, it can also be regarded as the number of sentences is infinite; and the number of sentences in the sentence set of a certain language mentioned in the document of the present invention is limited, which means that the number of sentences included in the sentence set is generally several hundred. Or thousands or tens of thousands, can be listed exhaustively. (The content of this paragraph is also established when the sentence set mentioned in this paragraph is a coded sentence set, and the content of this paragraph is also established when the sentence set mentioned in this paragraph is a decoding sentence set)

以下6段各处提到的语种可以是任一非复合语种或任一复合语种。The languages mentioned in various places in the following 6 paragraphs may be any non-composite language or any compound language.

编码规则中有关某语种的编码细则定义编码语句集范围的方式可以是完整定义,基本定义,隐含定义等;完整定义是指把编码规则中有关此语种的编码细则能编码的此语种的所有语句都具体定义出来并用这些语句组成编码语句集,完整定义是被做成编码细则的,(本说明书的编码规则举例里主要是采用完整定义的);基本定义所定义的是编码规则中有关此语种的编码细则初始要编码的此语种的编码语句集(但实际可能有些不属于基本定义的编码语句集的此语种的语句也能被编码),基本定义是被做成编码细则的;隐含定义是指在编码规则中有关此语种的编码细则里没作对编码语句集的定义,但由此编码规则中有关此语种的编码细则涉及的其它信息及其它数据可间接得到此编码规则中有关此语种的编码细则涉及的此语种的编码语句集的范围,比如某编码规则中有关此语种的编码细则里不能处理陌生词且主要用于拼合语句发音的音码是词,那词数据库里词的范围就间接决定了此语种的编码语句集的范围。The coding rules about a certain language in the coding rules define the scope of the coding sentence set can be complete definition, basic definition, implicit definition, etc.; the complete definition refers to all the coding rules about the language in the coding rules that can be encoded in this language. Statements are all specifically defined and these sentences are used to form a set of coded sentences. The complete definition is made into coding rules (in the example of coding rules in this manual, the complete definition is mainly used); the basic definition is defined in the coding rules. Coding rules of language The set of coded sentences of this language to be coded initially (but in fact, some sentences of this language that do not belong to the set of coded sentences of the basic definition may also be coded), the basic definition is made into coding rules; implicit The definition means that there is no definition of the coding sentence set in the coding rules about this language in the coding rules, but the other information and other data involved in the coding rules about this language in the coding rules can be indirectly obtained from the coding rules related to this language. The range of coded sentence sets of this language involved in the coding rules of a language. For example, in a coding rule related to this language, the phonetic codes that cannot handle unfamiliar words and are mainly used to combine the pronunciation of sentences are words. The words in the word database The scope indirectly determines the scope of the coded sentence set of this language.

显然在编码规则中有关某语种的编码细则对编码语句集的范围作定义时,可以是选用完整定义,基本定义,隐含定义这3种方式中的一种。对于任一个语种,不同编码规则中有关此语种的编码细则,定义编码语句集的方式可能是不同的。Obviously, when defining the scope of the coded sentence set in the coding rules related to a certain language in the coding rules, one of the three ways of complete definition, basic definition and implicit definition can be selected. For any language, the encoding rules for this language in different encoding rules may be different in the way of defining the encoding statement set.

如编码规则中有关某语种的编码细则基本定义或隐含定义了此语种的编码语句集,那此编码规则中有关此语种的编码细则对此编码语句集里任一语句都可进行编码以生成拼合信息的值,但也可能还能对此语句集外的此语种的语句进行编码以生成拼合信息的值,比如语句‘HE WALK AND SING’If the coding rules about a certain language in the coding rules basically define or implicitly define the coding sentence set of this language, then any sentence in the coding sentence set can be encoded to generate The value of the flattened information, but may also encode sentences in this language outside the set of sentences to generate the value of the flattened information, such as the sentence 'HE WALK AND SING'

属于基本定义或隐含定义的(英语)编码语句集,而语句‘HE SING AND WALK’A set of (English) coded sentences that belong to the basic or implied definition, and the sentence 'HE SING AND WALK'

不属于此编码语句集(出现这种情况时一般基本定义或隐含定义的编码语句集里语句个数有限),但对后一语句一般也能象对前一语句那样进行编码(比如都是用单词音码来拼合,用于前一语句拼合发音的那些音码也可用于后一语句拼合发音)。(本行和以上7行是同一段落,这里英语跳行写是因为说明书电子编辑器有问题,所以这里遇到英语要跳行写)It does not belong to this coded statement set (when this happens, the number of statements in the coded statement set of the basic definition or implicit definition is generally limited), but the latter statement can generally be encoded as the previous statement (for example, both Combine with word phonetic sign indicating number, those phonetic codes that are used for the combined pronunciation of preceding sentence also can be used for the combined pronunciation of following sentence). (This line is the same paragraph as the above 7 lines. The reason for skipping lines in English here is because there is a problem with the electronic editor of the manual, so here you need to skip lines when encountering English)

类似的,解码规则中有关某语种的解码细则定义解码语句集的方式可以是完整定义,基本定义,隐含定义;完整定义是指把所有表示其发音的拼合信息的值能被解码规则中有关此语种的解码细则解码的此语种的语句都具体定义出来并由这些语句组成解码语句集(这句话里‘所有’是修饰‘此语种的语句’的,这句话里的‘其’是指‘此语种的语句’,这句话里‘表示其发音的拼合信息的值能被解码规则中有关此语种的解码细则解码’是修饰‘此语种的语句’的),完整定义是被做成解码细则的,(本说明书的解码规则举例里主要是采用完整定义的);基本定义所定义的解码语句集,是指解码规则中有关此语种的解码细则初始想用于对表示此语句集里任一语句发音的拼合信息的值都能进行解码的语句集。但实际可能有些表示不属于基本定义的解码语句集的此语种的语句的发音的拼合信息的值也能被解码,基本定义是被做成解码细则的;隐含定义是指在解码细则里没作解码语句集的定义,但由解码规则中有关此语种的解码细则涉及的其它信息及其它数据可间接得到此解码规则中有关此语种的解码细则涉及的解码语句集的范围,比如某解码规则中有关此语种的解码细则里不能处理陌生词且主要用于拼合语句发音的音码是词,那词数据库里词的范围就间接决定了解码语句集的范围。Similarly, the decoding rules for a certain language in the decoding rules can define the decoding sentence set in a complete definition, a basic definition, and an implicit definition; a complete definition means that all the values of the combined information that represent its pronunciation can be related to the decoding rules. The decoding rules of this language The sentences of this language to be decoded are all specifically defined and composed of these sentences to form a set of decoding sentences (the "all" in this sentence is to modify the "sentences of this language", and the "other" in this sentence is Refers to 'sentences in this language', in this sentence, 'indicates that the value of the combined information of its pronunciation can be decoded by the decoding rules of this language in the decoding rules' is a modification of 'sentences in this language'), and the complete definition is made into decoding rules, (the examples of decoding rules in this specification mainly use complete definitions); the decoding sentence set defined in the basic definition refers to the decoding rules of this language in the decoding rules that are initially intended to be used to represent this sentence set The set of sentences in which the value of the combined information of any sentence pronunciation can be decoded. But in fact, there may be some values that represent the pronunciation of sentences in this language that do not belong to the decoding sentence set of the basic definition can also be decoded. The basic definition is made into decoding rules; the implicit definition means that there is no The definition of the decoding sentence set, but the range of the decoding sentence set involved in the decoding rules of this language in the decoding rules can be indirectly obtained from other information and other data involved in the decoding rules of this language in the decoding rules, such as a certain decoding rule In the decoding rules for this language, unfamiliar words cannot be dealt with and the phonetic codes that are mainly used to combine the pronunciation of sentences are words. The range of words in the word database indirectly determines the range of decoding sentence sets.

显然在解码规则中有关某语种的解码细则对解码语句集的范围作定义时,可以是选用完整定义,基本定义,隐含定义这3种方式中的一种。对于任一个语种,不同解码规则中有关此语种的解码细则,定义解码语句集的方式可能是不同的。Apparently, in the decoding rules related to a certain language, when defining the scope of the decoding sentence set, one of the three methods of complete definition, basic definition and implicit definition can be selected. For any language, the decoding rules for this language in different decoding rules may be different in the way of defining the set of decoding sentences.

如解码规则中有关某语种的解码细则基本定义或隐含定义了此语种的解码语句集,那此解码规则中有关此语种的解码细则对表示此解码语句集里任一语句发音的拼合信息的值都可解码,但也可能还能对于表示此语句集外的此语种的语句的发音的拼合信息的值进行解码,比如语句‘HE WALK AND SING’属于基本定义或隐含定义的(英语)解码语句集,而语句‘HE SING AND WALK’If the decoding rules of a certain language in the decoding rules basically define or implicitly define the decoding sentence set of this language, then the decoding rules of this language in the decoding rules have a certain effect on the combined information representing the pronunciation of any sentence in the decoding sentence set The values can be decoded, but it is also possible to decode the value of the combined information representing the pronunciation of sentences in this language outside this sentence set, such as the sentence 'HE WALK AND SING' belonging to the basic definition or implicit definition (English) Decoding the set of statements while the statement 'HE SING AND WALK'

不属于此解码语句集(出现这种情况时一般基本定义或隐含定义的解码语句集里语句个数有限),但表示后一语句发音的拼合信息的值一般也能象表示前一语句发音的拼合信息的值一样被解码(比如都是用单词音码来拼合,用于前一语句拼合发音的那些音码也可用于后一语句拼合发音)。(本行和以上8行是同一段落,这里英语跳行写是因为说明书电子编辑器有问题,所以这里遇到英语要跳行写)Does not belong to this decoding sentence set (when this happens, the number of sentences in the decoding sentence set of general basic definition or implicit definition is limited), but the value of the combined information representing the pronunciation of the next sentence can generally represent the pronunciation of the previous sentence The value of the stitching information is decoded (for example, all are stitched together with word phonetic codes, and those phonetic codes that are used for the stitched pronunciation of the previous sentence can also be used for the stitched pronunciation of the latter sentence). (This line is the same paragraph as the above 8 lines. The reason for skipping lines in English here is because there is a problem with the electronic editor of the manual, so here you need to skip lines when encountering English)

本说明书在述及解码规则与编码规则对应或专门对应或一般对应时涉及的编码规则中有关某语种的编码细则定义的编码语句集,以及述及直播方式的以只适用于英语为例的编码规则,TTS方式的以只适用于英语为例的编码规则时涉及的编码规则中有关英语的编码细则定义的编码语句集,以及述及直播方式的以只适用于汉语为例的编码规则,TTS方式的以只适用于汉语为例的编码规则时涉及的编码规则中有关汉语的编码细则定义的编码语句集都默认是按以下1段所述这样把超出语句都包括进编码语句集的(除非提及处有特别注明不是把超出语句都包括进来;而除以上这些情况之外时对编码语句集的定义不一定是按照以下1段所述要把超出语句都包括进来),可把这种把超出语句都包括进编码语句集的情况称为编码时定义编码语句集是采用编码全集定义的:In this specification, when referring to the correspondence between the decoding rules and the encoding rules, or special correspondence or general correspondence, the encoding sentence set defined by the encoding rules of a certain language in the encoding rules involved, and the encoding statement that is only applicable to English as an example when referring to the live broadcast method Rules, TTS mode of encoding rules only applicable to English as an example, the encoding sentence set defined in the encoding rules related to English encoding rules, and the encoding rules that refer to live broadcast methods only applicable to Chinese as an example, TTS The coded sentence sets defined in the coding rules related to Chinese coding rules in the coded rules that are only applicable to Chinese as an example of the mode are all included in the coded sentence sets by default as described in the following paragraph 1 (unless There is a special note at the mention that does not include all the excess sentences; and in addition to the above cases, the definition of the coded statement set does not necessarily include all the excess sentences as described in the following paragraph), you can put this The case of including more than one statement into the set of coded statements is called encoding-time definition. The set of coded statements is defined using the complete set of codes:

如某编码规则中有关某语种的编码细则原来基本定义了此语种的编码语句集的范围,可能实际能编码的此语种的语句范围就等于此编码语句集范围,这时此编码语句集范围就是此编码规则中有关此语种的编码细则定义的编码语句集范围;但如有些实际能被此编码规则中有关此语种的编码细则编码的此语种的语句超出了前述基本定义的编码语句集范围(本段把此语种所有这样的超出语句称为所有超出语句),那此编码规则中有关此语种的编码细则定义的此语种的编码语句集实际是包括前述基本定义的编码语句集和所有这些超出语句的。(本段说的所有超出语句是针对编码的)。对于隐含定义的情况与基本定义的情况类似。总而言之,不管编码规则中有关某语种的编码细则定义编码语句集范围的方式是采取哪种方式(指完整定义,基本定义,隐含定义等),凡是能被此编码规则中有关此语种的编码细则编码的此语种的语句都属于此编码规则中有关此语种的编码细则定义的编码语句集;本段所说的语种可以是任一非复合语种或任一复合语种。For example, the coding rules for a certain language in a certain coding rule originally basically defined the range of the coded sentence set of this language, and the range of sentences of this language that can be actually coded is equal to the range of this coded sentence set. At this time, the range of this coded sentence set is The scope of the coding sentence set defined by the coding rules for this language in this coding rule; however, if some sentences of this language that can actually be encoded by the coding rules for this language in this coding rule exceed the range of the coding sentence set for the basic definition ( This paragraph refers to all such exceeding sentences of this language as all exceeding sentences), then the coding sentence set of this language defined in the coding rules of this language in this coding rule actually includes the coding sentence set of the aforementioned basic definition and all these exceeding sentences sentence. (All the excess statements mentioned in this paragraph are for encoding). The situation for implicit definitions is similar to that for basic definitions. All in all, regardless of the way in which the coding rules for a certain language in the coding rules define the scope of the coded sentence set (referring to complete definitions, basic definitions, implicit definitions, etc.), all codes that can be coded for this language in the coding rules The sentences of this language coded by the detailed rules belong to the coded sentence set defined by the coding rules about this language in this coding rule; the language mentioned in this paragraph can be any non-compound language or any compound language.

本说明书在述及解码规则与编码规则对应或专门对应或一般对应时涉及的解码规则中有关某语种的解码细则定义的解码语句集,以及述及直播方式的以只适用于英语为例的解码规则,TTS方式的以只适用于英语为例的解码规则时涉及的解码规则中有关英语的解码细则定义的解码语句集,以及述及直播方式的以只适用于汉语为例的解码规则,TTS方式的以只适用于汉语为例的解码规则时涉及的解码规则中有关汉语的解码细则定义的解码语句集都默认是按以下1段所述这样把超出语句都包括进解码语句集的(除非提及处有特别注明不是把超出语句都包括进来;而除以上这些情况之外时对解码语句集的定义不一定是按照以下1段所述要把超出语句都包括进来),可把这种把超出语句都包括进解码语句集的情况称为解码时定义解码语句集是采用解码全集定义的:In this manual, when referring to the correspondence between the decoding rules and the encoding rules, or special correspondence or general correspondence, the decoding statement set defined in the decoding rules of a certain language in the decoding rules, and the decoding rules that are only applicable to English as an example when referring to the live broadcast method Rules, the decoding rules that are only applicable to English as an example in the TTS mode, the decoding statement set defined in the decoding rules related to English in the decoding rules, and the decoding rules that are only applicable to Chinese as an example in the live broadcast mode, TTS In the decoding rule that is only applicable to Chinese as an example, the decoding sentence sets defined in the decoding rules related to Chinese in the decoding rules are all defaulted to include the excess sentences into the decoding sentence set as described in the following paragraph (unless There is a special note at the mention that does not include all the excess sentences; and in addition to the above cases, the definition of the decoding statement set does not necessarily include all the excess sentences as described in the following paragraph), you can put this The case of including beyond sentences into the set of decoded sentences is called decoding definition. The set of decoded sentences is defined by using the complete set of decoded:

如某解码规则中有关某语种的解码细则原来基本定义了此语种的解码语句集的范围,如表示此解码语句集中任一语句发音的拼合信息的值都能被解码,表示此解码语句集以外此语种的任一语句的发音的拼合信息的值都不能被解码,这时此解码规则中有关此语种的解码细则定义的此语种的解码语句集的范围就是前述基本定义的解码语句集范围;如果此语种的有些语句超出了前述基本定义的解码语句集范围,而表示这些超出语句中任一语句发音的拼合信息的值也能被此解码规则中有关此语种的解码细则解码(本段把此语种所有这样的超出语句称为所有超出语句),那此解码规则中有关此语种的解码细则定义的此语种的解码语句集实际是包括前述基本定义的解码语句集和所有超出语句的。(本段说的所有超出语句是针对解码的)。对于隐含定义的情况与基本定义的情况类似。总而言之,不管解码规则中有关某语种的解码细则定义解码语句集范围的方式是采取哪种方式(指完整定义,基本定义,隐含定义等),对于此语种的任一语句,如表示此语句发音的拼合信息的值能被此解码规则中有关此语种的解码细则解码,那此语句就属于此解码规则中有关此语种的解码细则定义的解码语句集。本段所说的语种可以是任一非复合语种或任一复合语种。For example, the decoding rules for a certain language in a certain decoding rule basically defined the scope of the decoding sentence set of this language. The value of the combined information of the pronunciation of any sentence in this language cannot be decoded. At this time, the range of the decoding sentence set of this language defined in the decoding rules for this language in the decoding rules is the range of the decoding sentence set in the basic definition above; If some sentences of this language are beyond the scope of the decoding sentence set of the aforementioned basic definition, the value of the combined information representing the pronunciation of any sentence in these sentences can also be decoded by the decoding rules related to this language in the decoding rules (this paragraph puts All such exceeding sentences of this language are called all exceeding sentences), then the decoding sentence set of this language defined in the decoding rules about this language in the decoding rules actually includes the decoding sentence set of the aforementioned basic definition and all exceeding sentences. (All the excess statements mentioned in this paragraph are for decoding). The situation for implicit definitions is similar to that for basic definitions. All in all, regardless of the way in which the decoding rules for a certain language in the decoding rules define the scope of the decoding sentence set (referring to complete definition, basic definition, implicit definition, etc.), for any sentence in this language, if the statement If the value of the combined pronunciation information can be decoded by the decoding rules for this language in the decoding rule, then this sentence belongs to the set of decoding sentences defined by the decoding rules for this language in the decoding rule. The language mentioned in this paragraph can be any non-compound language or any compound language.

在实际中的编解码情况可更复杂些,可能是约定所定义的编码语句集和解码语句集不包括基本定义外的语句,而且定义语句集的方式可能是除完整定义,基本定义,隐含定义这些方式之外的定义方式,而且定义的语句集里包括的语句是些什么类型也可能是各种各样的,然后解码规则与编码规则的关系也可能有很多,权利要求3,12,20或23中述及的编解码情况可以是这些种情况的,即权利要求3所述可如下段这样(下段对权利要求12,20,23也都适用):The encoding and decoding situation in practice may be more complicated. It may be that the encoding statement set and decoding statement set defined by the agreement do not include statements other than the basic definition, and the way of defining the statement set may be other than complete definition, basic definition, implicit Definition methods other than these methods are defined, and the types of sentences included in the defined sentence set may also be various, and then there may be many relationships between decoding rules and encoding rules.Claims 3, 12, The coding and decoding situations mentioned in 20 or 23 can be these kinds of situations, that is, the following paragraphs can be described in claim 3 (the following paragraphs are also applicable toclaims 12, 20, and 23):

对于此权利要求提到的编码规则和解码规则,可不用管编码规则中是怎样定义编码语句集的,也不用管解码规则中是怎样定义解码语句集的,也不用管解码规则与编码规则有什么关系(比如解码规则是专门用于对用编码规则编码生成的拼合信息的值进行解码的,那此解码规则与此编码规则就是有对应配套的关系的,除此之外,解码规则与编码规则还可能有别的关系);只要对一语句,由编码规则中有关语种A的编码细则来编码生成的表示此语句发音的拼合信息的值可由解码规则中有关语种B的解码细则来解码,此编码规则中有关语种A的编码细则有定义编码语句集(即使没明确定义也是可推出隐含存在此编码语句集的),是用一些种音码来编码的,编码时有压缩编码处理(可以是本说明书有述的或没述的压缩编码处理方式),此解码规则中有关语种B的解码细则有定义解码语句集(即使没明确定义也是可推出隐含存在此解码语句集的),是用一些种音码来解码作拼合的,解码时有压缩解码处理(可以是本说明书有述的或没述的压缩解码处理方式);此权利要求里可以语种A是任一非复合语种且同时语种B是任一非复合语种,或者可以语种A是任一非复合语种且同时语种B是任一复合语种,或者可以语种A是任一复合语种且同时语种B是任一非复合语种,或者可以语种A是任一复合语种且同时语种B是任一复合语种。For the encoding rules and decoding rules mentioned in this claim, it does not matter how the encoding statement set is defined in the encoding rules, nor how the decoding statement set is defined in the decoding rules, nor does it need to care about the relationship between the decoding rules and the encoding rules. What is the relationship (for example, the decoding rule is specially used to decode the value of the combined information generated by encoding the encoding rule, then the decoding rule and the encoding rule have a corresponding matching relationship, in addition, the decoding rule and the encoding The rules may also have other relations); as long as a sentence is coded by the encoding rules of the relevant language A in the encoding rules, the value of the combined information representing the pronunciation of this sentence can be decoded by the decoding rules of the relevant language B in the decoding rules, The coding rules about language A in this coding rule have a defined coded sentence set (even if it is not clearly defined, it can be deduced that there is this coded sentence set implicitly), and it is coded with some kinds of phonetic codes, and there is compression coding process during coding ( It can be the compression encoding processing method described or not described in this manual), and the decoding rules of language B in this decoding rule have a defined decoding sentence set (even if it is not clearly defined, it can be deduced that this decoding sentence set implicitly exists), It uses some kinds of sound codes to decode and assemble, and there is compression decoding processing during decoding (it can be the compression decoding processing method described or not described in this manual); in this claim, language A can be any non-compound language and At the same time, language B is any non-composite language, or language A can be any non-compound language and language B can be any compound language, or language A can be any compound language and language B can be any non-compound language, Alternatively, language A may be any compound language and language B may be any compound language at the same time.

本发明文件各处提及的只适用于一个语种的解码有关程序,是指一套只适用于此语种的解码程序(此解码程序是根据只适用于此语种的解码规则或适用于包括此语种在内的多个语种的解码规则中有关此语种的解码细则来生成的)。本发明文件各处提及的适用于多个语种的解码有关程序,那一般是多套分别只适用于各自语种的解码程序,或是适用于多个语种的一套解码程序(此解码程序是根据适用于这多个语种的解码规则来生成的),或是几套分别只适用于各自语种的解码程序加上适用于多个语种的一套解码程序(这句话提及的‘几’加这句话提及的‘多’,得到的数值等于前面一句话提及的‘多’的数值)等情况。The decoding-related programs that are only applicable to one language mentioned in the documents of the present invention refer to a set of decoding programs that are only applicable to this language (this decoding program is based on the decoding rules that are only applicable to this language or are applicable to include this language) generated from the decoding rules of this language in the decoding rules of multiple languages). The decoding related programs that are applicable to multiple languages mentioned in the documents of the present invention are generally multiple sets of decoding programs that are only applicable to their respective languages, or a set of decoding programs that are applicable to multiple languages (this decoding program is Generated according to the decoding rules applicable to these multiple languages), or several sets of decoding programs applicable only to their respective languages plus a set of decoding programs applicable to multiple languages (the 'several' mentioned in this sentence Add the 'many' mentioned in this sentence, and the obtained value is equal to the value of the 'many' mentioned in the previous sentence), etc.

本发明文件各处提及的只适用于一个语种的解码有关程序及解码有关数据,是指一套只适用于此语种的解码程序及解码数据(此解码程序是根据只适用于此语种或适用于包括此语种在内的多个语种的解码规则来生成的,此解码数据是此解码程序使用的)。本发明文件各处提及的适用于多个语种的解码有关程序及解码有关数据,那一般是多套分别只适用于各自语种的解码程序及解码数据,或是适用于多个语种的一套解码程序及解码数据(此解码程序是根据适用于这多个语种的解码规则来生成的,此解码数据是此解码程序使用的),或是几套分别只适用于各自语种的解码程序及解码数据加上适用于多个语种的一套解码程序及解码数据(这句话提及的‘几’加这句话提及的‘多’,得到的数值等于前面一句话提及的‘多’的数值)等情况。本段和上一段提到的一个语种可以是任一非复合语种或任一复合语种,本段和上一段提到的多个语种可能全是非复合语种或全是复合语种或既包含非复合语种也包含复合语种。The decoding-related programs and decoding-related data that are only applicable to one language mentioned in the documents of the present invention refer to a set of decoding programs and decoding data that are only applicable to this language (this decoding program is based on the generated by the decoding rules of multiple languages including this language, and this decoded data is used by this decoding program). The decoding-related programs and decoding-related data applicable to multiple languages mentioned in the document of the present invention are generally multiple sets of decoding programs and decoding data applicable to their respective languages, or one set applicable to multiple languages. Decoding program and decoding data (this decoding program is generated according to the decoding rules applicable to these multiple languages, and the decoding data is used by this decoding program), or several sets of decoding programs and decoding programs that are only applicable to their respective languages Data plus a set of decoding programs and decoding data applicable to multiple languages (the "several" mentioned in this sentence plus the "many" mentioned in this sentence, the obtained value is equal to the "many" mentioned in the previous sentence value) and so on. A language mentioned in this paragraph and the previous paragraph may be any non-compound language or any compound language, and the multiple languages mentioned in this paragraph and the previous paragraph may be all non-compound languages or all compound languages or both non-compound languages Compound languages are also included.

本说明书各处说到与拾音器有关的存储器集里的诸音码的发音数据,那此拾音器是采用直接产音方式或既采用直接产音方式又采用TTS产音方式的;如拾音器是可用直接发音方式解码多个语种的拼合信息的值时,那存储器集里一般应存储这多个语种中各自语种的诸音码的发音数据,如拾音器是只可用直接发音方式解码一个语种的拼合信息的值时,那存储器集里只需存储这一个语种的诸音码的发音数据。本段所述是显然的。本段提到的各语种可以是任一非复合语种或任一复合语种;对于一个复合语种,如果此复合语种包含多个非复合语种,那此复合语种的诸音码的发音数据一般就包含这多个非复合语种各自的相关音码的发音数据。This instruction manual mentions the pronunciation data of all sound codes in the memory set relevant to the pickup everywhere, then this pickup adopts the direct sound production mode or not only adopts the direct sound production mode but also adopts the TTS sound production mode; When the pronunciation mode decodes the value of the combined information of multiple languages, the pronunciation data of the various phonetic codes of the respective languages in the multiple languages should generally be stored in the memory set. For example, the pickup can only decode the combined information of one language in the direct pronunciation mode. When value, only need to store the pronunciation data of all phonetic codes of this one language in that memory set. What is stated in this paragraph is obvious. Each language mentioned in this paragraph can be any non-compound language or any compound language; The pronunciation data of the relevant phonetic codes of the plurality of non-compound languages.

权利要求3,12,20或23中提及的编码规则,可以是本说明书提及或没提及的适用于印欧语系任一语种(英语或法语或德语或俄语等)的其有关此语种的编码细则含有压缩编码处理且是混合方式(或其它方式)的的各种编码规则(本句说的‘各种编码规则’中任一编码规则可以是只适用于此语种或适用于包括此语种在内的多个语种),这些编码规则中各编码规则中有关此语种的混合方式(或其它方式)的编码细则的具体处理可参考本说明书述的直播方式(或TTS方式)的以只适用于英语的编码规则的阐述;权利要求3,12,20或23中提及的编码规则,可以是本说明书提及或没提及的适用于非印欧语系任一语种(汉语或韩语或日语等)的其有关此语种的编码细则含有压缩编码处理且是混合方式(或其它方式)的的各种编码规则(本句说的‘各种编码规则’中任一编码规则可以是只适用于此语种或适用于包括此语种在内的多个语种),这些编码规则中各编码规则中有关此语种的混合方式(或其它方式)的编码细则的具体处理可参考本说明书述的直播方式(或TTS方式)的以只适用于汉语的编码规则的阐述;权利要求3,12,20或23中提及的编码规则,可以是本说明书提及或没提及的适用于任一复合语种的其有关此语种的编码细则含有压缩编码处理且是混合方式(或其它方式)的的各种编码规则(本句说的‘各种编码规则’中任一编码规则可以是只适用于此语种或适用于包括此语种在内的多个语种),这些编码规则中各编码规则中有关此语种的混合方式(或其它方式)的编码细则的具体处理可参考本说明书述的直播方式(或TTS方式)的以只适用于英语的编码规则和直播方式(或TTS方式)的以只适用于汉语的编码规则的阐述。(本段提及的3个‘多个’不一定是相同数目)The encoding rule mentioned inclaim 3, 12, 20 or 23 may be any language (English or French or German or Russian) of the Indo-European language family (English or French or German or Russian, etc.) that is mentioned or not mentioned in this description. The coding rules contain various coding rules that are compressed and coded in a mixed way (or in other ways) (any coding rule in the "various coding rules" mentioned in this sentence can be only applicable to this language or be suitable for including this Multiple languages including multiple languages), the specific handling of the coding rules for the mixed mode (or other modes) of the language in each coding rule in these coding rules can refer to the live mode (or TTS mode) described in this specification. Explanation of coding rules applicable to English; the coding rules mentioned in claim 3, 12, 20 or 23 may be applicable to any language of the non-Indo-European language family (Chinese or Korean or Japanese, etc.) its encoding rules about this language contain compression encoding processing and mixed (or other) various encoding rules (any encoding rule in the "various encoding rules" mentioned in this sentence can be only applicable In this language or applicable to multiple languages including this language), the specific handling of the coding rules related to the mixed mode (or other modes) of this language in each coding rule in these coding rules can refer to the live broadcast method described in this manual (or TTS mode) with the elaboration of the encoding rules applicable only to Chinese; the encoding rules mentioned in claim 3, 12, 20 or 23 can be applicable to any compound language mentioned or not mentioned in this specification The coding rules related to this language contain various coding rules that are compressed and coded in a mixed way (or in other ways) (any coding rule in the "various coding rules" mentioned in this sentence can be only applicable to this language or applicable to multiple languages including this language), the specific handling of the coding rules related to the mixed mode (or other modes) of this language in each coding rule in these coding rules can refer to the live broadcast method (or TTS) described in this specification The encoding rules applicable only to English and the encoding rules applicable only to Chinese in the live broadcast mode (or TTS mode). (The 3 'pluralities' mentioned in this paragraph are not necessarily the same number)

权利要求3,12,20或23中提及的解码规则,可以是与本说明书提及或没提及的适用于印欧语系任一语种(英语或法语或德语或俄语等)的其有关此语种的编码细则含有压缩编码处理的任一编码规则(此编码规则可以是只适用于此语种或适用于包括此语种在内的多个语种的)有关此语种对应的本说明书提及或没提及的适用于此语种的其有关此语种的解码细则含有压缩解码处理且是混合方式(或其它方式)的的各种解码规则(这各种解码规则中任一解码规则可以是只适用于此语种或适用于包括此语种在内的多个语种的),这些解码规则中各解码规则中有关此语种的混合方式(或其它方式)的解码细则的具体处理可参考本说明书述的直播方式(或TTS方式)的以只适用于英语的解码规则的阐述;权利要求3,12,20或23中提及的解码规则,可以是与本说明书提及或没提及的适用于非印欧语系任一语种(汉语或韩语或日语等)的其有关此语种的编码细则含有压缩编码处理的任一编码规则(此编码规则可以是只适用于此语种或适用于包括此语种在内的多个语种的)有关此语种对应的本说明书提及或没提及的适用于此语种的其有关此语种的解码细则含有压缩解码处理且是混合方式(或其它方式)的的各种解码规则(这各种解码规则中任一解码规则可以是只适用于此语种或适用于包括此语种在内的多个语种的),这些解码规则中各解码规则中有关此语种的混合方式(或其它方式)的解码细则的具体处理可参考本说明书述的直播方式(或TTS方式)的以只适用于汉语的解码规则的阐述;权利要求3,12,20或23中提及的解码规则,可以是与本说明书提及或没提及的适用于任一复合语种的其有关此语种的编码细则含有压缩编码处理的任一编码规则(此编码规则可以是只适用于此语种或适用于包括此语种在内的多个语种的)有关此语种对应的本说明书提及或没提及的适用于此语种的其有关此语种的解码细则含有压缩解码处理且是混合方式(或其它方式)的的各种解码规则(这各种解码规则中任一解码规则可以是只适用于此语种或适用于包括此语种在内的多个语种的),这些解码规则中各解码规则中有关此语种的混合方式(或其它方式)的解码细则的具体处理可参考本说明书述的直播方式(或TTS方式)的以只适用于英语的解码规则和直播方式(或TTS方式)的以只适用于汉语的解码规则的阐述。(本段提及的6个‘多个’不一定是相同数目;本段提及的‘对应’是指一般对应或专门对应)The decoding rule mentioned inclaim 3, 12, 20 or 23 may be related to any language (English or French or German or Russian) of the Indo-European language family (English or French or German or Russian, etc.) mentioned or not mentioned in this specification. The coding rules of the language contain any coding rules for compression coding processing (this coding rule can be applicable only to this language or to multiple languages including this language) and this specification is mentioned or not mentioned corresponding to this language. And the decoding rules applicable to this language contain compression decoding processing and mixed (or other) various decoding rules (any decoding rule in these various decoding rules can be only applicable to this language or applicable to multiple languages including this language), the specific handling of the decoding rules related to the mixed mode (or other modes) of this language in each decoding rule in these decoding rules can refer to the live broadcast method described in this manual ( or TTS mode) to explain the decoding rules applicable only to English; the decoding rules mentioned in claims 3, 12, 20 or 23 may be applicable to non-Indo-European languages mentioned or not mentioned in this specification Any language (Chinese, Korean, or Japanese, etc.) whose coding rules about this language contain any coding rules for compression coding processing (this coding rule can be only applicable to this language or applicable to multiple languages including this language) language) related to this language corresponding to this manual mentioned or not mentioned applicable to this language, its decoding rules related to this language contain compression decoding processing and various decoding rules in a mixed way (or other ways) (this Any decoding rule in various decoding rules can be only applicable to this language or applicable to multiple languages including this language), and the mixing method (or other methods) related to this language in each decoding rule in these decoding rules The detailed processing of the decoding rules can refer to the explanation of the decoding rules applicable only to Chinese in the live mode (or TTS mode) described in this manual; the decoding rules mentioned in claim 3, 12, 20 or 23 can be the same as The coding rules applicable to any compound language mentioned or not mentioned in this specification contain any coding rules for compression coding processing (this coding rule may be only applicable to this language or applicable to the For multiple languages in this manual) the corresponding decoding rules for this language that are mentioned or not mentioned in this manual for this language contain compression decoding processing and are mixed (or other methods) Decoding rules (any decoding rule in these various decoding rules can be only applicable to this language or applicable to multiple languages including this language), and the mixing method of this language in each decoding rule in these decoding rules ( or other methods) for the specific processing of the decoding rules can refer to the decoding rules applicable only to English in the live broadcast method (or TTS method) and the decoding rules applicable only to Chinese in the live broadcast method (or TTS method) described in this manual elaborate. (The 6 "multiples" mentioned in this paragraph are not necessarily the same number; the "correspondence" mentioned in this paragraph refers to general correspondence or special correspondence)

上两段提及的其它方式,是指区别于直播方式,TTS方式,混合方式的方式,比如拾音器采用一种新的子发音器件来发音(或者拾音器采用新的子发音器件与TTS器件等一起发音),相应的拾音器解码时要采用一种新的发音方式(其它发音方式),就可把涉及这种新的发音方式的解码规则中有关某语种的解码细则和编码规则中有关某语种的编码细则以及编解码处理等定义为某其它方式的。用其它方式时的编解码原理仍可参照用直播方式,TTS方式或混合方式时的编解码原理。如果在某个编码规则和某个解码规则里都不涉及发音,此编码规则编码生成的拼合信息的值被此解码规则解码时是在解码完后才发音,这种情况一是可等同看成把解码与发音合为一个解码步骤,此解码步骤仍可看成按一解码规则中有关某语种的解码细则来解码,此解码规则中有关此语种的解码细则仍涉及发音,此解码规则中有关此语种的解码细则就是由不涉及发音的本段前面述的某个解码规则中有关此语种的解码细则加上有关发音的解码细则而成(这几句提及的发音的方式仍可以是直播方式,TTS方式,混合方式或其它方式等,前面两段涉及的内容也适合这种发音的方式),这种情况的本段内前面述的某个编码规则仍可看成是不涉及发音的;再是仍看成编码规则和解码规则里都不涉及发音,解码时是在解码完后才发音,这种情况时在权利要求3中提及的‘所述拾音器的所述单片机芯片是按一种解码规则中有关语种2的解码细则来对读码得到的所述拼合信息的值1执行解码使所述拾音器的所述发音器件播出所述语句1发音的’和权利要求12中提及的‘所述拾音器的所述单片机芯片是按所述解码规则中有关语种2的解码细则来对读码得到的所述拼合信息的值1执行解码使所述拾音器的所述发音器件播出所述语句1发音的’和权利要求20,23中提及的‘所述拾音器的所述单片机芯片是按一种解码规则中有关语种2的解码细则来对读码得到的前述拼合信息的值1执行解码使所述拾音器的所述发音器件播出前述语句1发音的’仍都成立,权利要求11里‘所述拾音器的所述单片机芯片是按一种解码规则来对读码得到的前述拼合信息的值1执行解码使所述拾音器的所述发音器件播出前述语句1发音的’仍成立,不过这些引用的话中的执行解码就是使发音器件播出语句发音的一个步骤,单片机芯片执行完解码后还要做发音处理(比如根据解码得到的结果调发音程序发音)以使发音器件播出语句发音。而权利要求2,11中提到的编码规则也可以是不涉及发音的编码规则(即编码时并不限制解码时用什么发音方式及产音方式来发音,比如可用一些单词音码来做编码,用解码规则解码时就既可用TTS发音方式,又可用直接发音方式),只要其编码生成的拼合信息的值可被解码播出语音即可;而且在权利要求2中的编码规则是不涉及发音的编码规则时,权利要求2中提到的解码规则可以是涉及发音或不涉及发音的;在权利要求11中的编码规则是不涉及发音的编码规则时,权利要求11中提到的解码规则可以是涉及发音或不涉及发音的;权利要求3,12,20,23中提到的编码规则中有关语种1的编码细则也可以是不涉及发音的编码细则(即编码时并不限制解码时用什么发音方式及产音方式来发音,比如可用一些单词音码来做编码,用解码规则解码时就既可用TTS发音方式,又可用直接发音方式),只要其编码生成的拼合信息的值可被解码播出语音即可;而且在权利要求3中的编码规则中有关语种1的编码细则是不涉及发音的编码细则时,权利要求3中提到的解码规则中有关语种2的解码细则可以是涉及发音或不涉及发音的;在权利要求12中的编码规则中有关语种1的编码细则是不涉及发音的编码细则时,权利要求12中提到的解码规则中有关语种2的解码细则可以是涉及发音或不涉及发音的;在权利要求20中的编码规则中有关语种1的编码细则是不涉及发音的编码细则时,权利要求20中提到的解码规则中有关语种2的解码细则可以是涉及发音或不涉及发音的;在权利要求23中的编码规则中有关语种1的编码细则是不涉及发音的编码细则时,权利要求23中提到的解码规则中有关语种2的解码细则可以是涉及发音或不涉及发音的;显然本段所说的其它发音方式可以有很多种,本段所说的其它方式可以有很多种。The other methods mentioned in the previous two paragraphs refer to the methods that are different from the live broadcast method, the TTS method, and the mixed method, such as the pickup adopts a new sub-pronunciation device to pronounce (or the pickup uses a new sub-pronunciation device together with the TTS device, etc. Pronunciation), a new pronunciation mode (other pronunciation modes) will be adopted when the corresponding pickup is decoded, and the decoding rules related to a certain language in the decoding rules related to this new pronunciation mode and the relevant certain language in the encoding rules Encoding rules, codec processing, etc. are defined in some other way. The coding and decoding principle when using other methods can still refer to the coding and decoding principle when using the live broadcast method, TTS method or mixed method. If pronunciation is not involved in a certain encoding rule and a certain decoding rule, when the value of the combined information generated by the encoding rule is decoded by the decoding rule, it will be pronounced after decoding. In this case, it can be regarded as Combining decoding and pronunciation into one decoding step, this decoding step can still be regarded as decoding according to the decoding rules related to a certain language in a decoding rule. The decoding rules related to this language in this decoding rule still involve pronunciation, and the relevant decoding rules in this decoding rule The decoding rules for this language are made up of the decoding rules for this language in a certain decoding rule mentioned earlier in this paragraph that does not involve pronunciation plus the decoding rules for pronunciation (the pronunciation method mentioned in these sentences can still be live broadcast method, TTS method, mixed method or other methods, etc., the content involved in the previous two paragraphs is also suitable for this pronunciation method), in this case, a certain coding rule mentioned above in this paragraph can still be regarded as not involving pronunciation ; Be still regarded as not involving pronunciation in coding rule and decoding rule again, be to just pronounce after decoding during decoding, in this case, the described single-chip microcomputer chip of ' described sound pick-up mentioned in claim 3 is according to A kind of decoding rule about language 2 in the decoding rule is to carry out decoding to the value 1 of described mosaic information obtained by reading the code, so that the said pronunciation device of the said pickup plays the pronunciation of said sentence 1' and what is proposed in claim 12 And 'the single-chip microcomputer chip of the pickup performs decoding on the value 1 of the combined information obtained by reading the code according to the decoding rules of the relevant language 2 in the decoding rules, so that the pronunciation device of the pickup can play The said single-chip microcomputer chip of said pickup of said statement 1 pronunciation and mentioned in claim 20, 23 is the value of the aforementioned combined information obtained by reading the code according to the decoding rules related to language 2 in a decoding rule 1. Execute decoding to make the pronunciation device of the pickup play out theaforementioned statement 1. The pronunciation of the "still all established, claim 11" said single-chip microcomputer chip of the pickup is obtained by reading the code according to a decoding rule Thevalue 1 of the combined information performs decoding to make the pronunciation device of the pickup play the pronunciation of theaforementioned sentence 1, which still holds true, but the execution decoding in these quoted words is a step to make the pronunciation device broadcast the pronunciation of the sentence, and the single-chip microcomputer chip executes Pronunciation processing (such as tuning the pronunciation program pronunciation according to the result obtained by decoding) will be done after decoding to make the pronunciation device broadcast the sentence pronunciation. And claim 2, the encoding rule mentioned in 11 also can be the encoding rule that does not relate to pronunciation (promptly do not limit what pronunciation mode and the sound production mode to pronounce when decoding, such as some word sound codes can be used to do encoding , when decoding with the decoding rule, both the TTS pronunciation mode and the direct pronunciation mode can be used), as long as the value of the stitching information generated by its encoding can be decoded and broadcast the voice; and the encoding rule in claim 2 does not involve When the encoding rule of pronunciation is used, the decoding rule mentioned in claim 2 can be related to pronunciation or not related to pronunciation; when the encoding rule in claim 11 is an encoding rule not related to pronunciation, the decoding rule mentioned in claim 11 The rules can be related to pronunciation or not related to pronunciation; the coding rules about language 1 in the coding rules mentioned in claim 3, 12, 20, and 23 can also be the coding rules not related to pronunciation (that is, encoding does not limit decoding What pronunciation method and sound production method are used to pronounce, for example, some word sound codes can be used for encoding, when decoding with decoding rules, both the TTS pronunciation method and the direct pronunciation method can be used), as long as the value of the combined information generated by its encoding It can be decoded and broadcast voice; and when the coding rules about language 1 in the coding rules in claim 3 are coding rules that do not involve pronunciation, the decoding rules about language 2 in the decoding rules mentioned in claim 3 Can be related to pronunciation or not related to pronunciation; When the coding rules related to language 1 in the coding rules in claim 12 are coding rules not related to pronunciation, the decoding rules related to language 2 in the decoding rules mentioned in claim 12 It can be related to pronunciation or not related to pronunciation; when the coding rules related to language 1 in the coding rules in claim 20 are coding rules not related to pronunciation, the decoding rules related to language 2 in the decoding rules mentioned in claim 20 It can be related to pronunciation or not related to pronunciation; when the coding rules related to language 1 in the coding rules in claim 23 are coding rules not related to pronunciation, the decoding rules related to language 2 in the decoding rules mentioned in claim 23 It can involve pronunciation or not; obviously there can be many other ways of pronunciation mentioned in this paragraph, and there can be many kinds of other ways mentioned in this paragraph.

对某语种,某解码规则中有关此语种的某其它方式的解码细则与某编码规则中有关此语种的此其它方式的编码细则可以是对应的(即根据后者编的拼合信息的值可由前者解码),某解码规则中有关此语种的其它方式的解码细则与某编码规则中有关此语种的直播方式(或TTS方式或混合方式)的编码细则可以是对应的(即根据后者编的拼合信息的值可由前者解码),某解码规则中有关此语种的直播方式(或TTS方式或混合方式)的解码细则与某编码规则中有关此语种的其它方式的编码细则可以是对应的(即根据后者编的拼合信息的值可由前者解码)。本段所说的对应可以是专门对应或一般对应。本段和上一段提到的语种可以是任一非复合语种或任一复合语种。For a certain language, the decoding rules of certain other ways of this language in a certain decoding rule can be corresponding to the coding rules of this other way of this language in a certain coding rule Decoding), the decoding rules for other methods of this language in a certain decoding rule may correspond to the coding rules for the live broadcast method (or TTS method or mixed method) of this language in a certain coding rule (that is, according to the combination of the latter The value of the information can be decoded by the former), and the decoding rules of the live broadcast mode (or TTS mode or mixed mode) of this language in a certain decoding rule can be corresponding to the encoding rules of other modes of this language in a certain encoding rule (that is, according to The value of the stitching information encoded by the latter can be decoded by the former). The correspondence referred to in this paragraph may be a specific correspondence or a general correspondence. The languages mentioned in this paragraph and the previous paragraph may be any non-composite language or any compound language.

以下2段提到的语种可以是任一非复合语种或任一复合语种。The languages mentioned in the following 2 paragraphs can be any non-composite language or any compound language.

对于本发明文件所说的某解码规则中有关某语种的混合方式的解码细则,其涉及的发音形式是用混合发音方式,其可能对所有其能解码的此语种的拼合信息的值中的有些拼合信息的值执行解码是采用整步直接发音方式而对所有其能解码的此语种的拼合信息的值中的有些拼合信息的值执行解码是采用整步TTS发音方式,或在对表示一个语句发音的拼合信息的值执行解码时对此语句的部分用整步直接发音方式发音而对此语句的另外部分用整步TTS发音方式发音等(甚至还可用更多发音方式,比如分步直接发音方式,分步TTS发音方式等),所以显然某解码规则中有关某语种的混合方式的解码细则的内容可参照某解码规则中有关此语种的直播方式的解码细则和某解码规则中有关此语种的TTS方式的解码细则,所以显然某编码规则中有关某语种的混合方式的编码细则的内容可参照某编码规则中有关此语种的直播方式的编码细则和某编码规则中有关此语种的TTS方式的编码细则。For the decoding rules about the mixed mode of a certain language in the certain decoding rule mentioned in the document of the present invention, the pronunciation form involved is to use the mixed pronunciation mode, and it may be able to decode some of the values of the combined information of this language. The decoding of the value of the combined information is to use the full-step direct pronunciation method, and the decoding of some values of the combined information in all the values of the combined information of this language that can be decoded is to use the full-step TTS pronunciation method, or to represent a statement When the value of the combined information of the pronunciation is decoded, the part of the sentence is pronounced with the full-step direct pronunciation mode and the other part of the sentence is pronounced with the whole-step TTS pronunciation mode (even more pronunciation modes can be used, such as step-by-step direct pronunciation method, step-by-step TTS pronunciation method, etc.), so it is obvious that the content of the decoding rules about the mixed mode of a certain language in a certain decoding rule can refer to the decoding rules about the live broadcast method of this language in a certain decoding rule and the related language in a certain decoding rule Therefore, it is obvious that the content of the coding rules related to the mixed mode of a certain language in a certain coding rule can refer to the coding rules related to the live broadcast method of this language in a certain coding rule and the TTS method related to this language in a certain coding rule coding rules.

除非提及时有指明或从上下文可推出有确切意义,本说明书各处提及的解码处理(或只适用于一个语种的解码程序,或解码规则中有关某语种的解码细则,或只适用于一个语种的某解码规则),默认其涉及的发音形式是全用直接发音方式或全用TTS发音方式或用混合发音方式的。除非提及时有指明或从上下文可推出有确切意义,本说明书各处提及的编码规则中有关某语种的编码细则或只适用于一个语种的某编码规则,默认可以是直播方式或TTS方式或混合方式的。Unless otherwise specified or deduced from the context to have an exact meaning, the decoding process mentioned in this manual (or only applicable to the decoding program of one language, or the decoding rules related to a certain language in the decoding rules, or only applicable to one language) A certain decoding rule of the language), the default pronunciation forms involved are all direct pronunciation, TTS pronunciation or mixed pronunciation. Unless specified when mentioned or deduced from the context, the encoding rules mentioned in this manual refer to the encoding rules of a certain language or a certain encoding rule only applicable to one language. The default can be live broadcast or TTS or mixed way.

对于一个拾音器,如果此拾音器含有的采用直接产音方式发音的子发音器件超过一个,那这些子发音器件被看成是组成一个发音器件,比如一个带喇叭的MCU外接的语音芯片与另一个带喇叭的MCU外接的语音芯片可合起来当成是一个发音器件;对于一个拾音器,如果此拾音器含有的采用TTS产音方式发音的子发音器件超过一个,那这些子发音器件被看成是组成一个发音器件,比如一个带喇叭的TTS芯片与另一个带喇叭的TTS芯片可合起来当成是一个发音器件。对于一个拾音器,如果此拾音器含有的采用其它产音方式(除直接产音和TTS产音方式外的)发音的子发音器件超过一个,那这些子发音器件被看成是组成一个发音器件。For a pickup, if the pickup contains more than one sub-pronunciation device that adopts a direct sound production method, then these sub-pronunciation devices are considered to form a pronunciation device, such as an MCU with a loudspeaker. The voice chip externally connected to the MCU of the speaker can be regarded as a sounding device; for a pickup, if the pickup contains more than one sub-sounding device that adopts the TTS sound production method, then these sub-sounding devices are regarded as forming a sounding device. Devices, such as a TTS chip with a speaker and another TTS chip with a speaker can be combined as a sounding device. For a pickup, if the pickup contains more than one sub-sounding device that adopts other sound production methods (except direct sound production and TTS sound production methods), then these sub-sounding devices are regarded as forming a sounding device.

在权利要求2,10,19,22中提及的拾音器,是定义其含有一个发音器件的,这些拾音器可以是采用直接产音方式或采用TTS产音方式的拾音器或是既采用直接产音方式又采用TTS产音方式的拾音器,这3种拾音器里的一个发音器件一般就是一个子发音器件,但不排除在权利要求2,10,19或22中提及的拾音器含有的一个发音器件是由多个子发音器件组成的(这里说的多个子发音器件中的各子发音器件可以是产音方式相同或不同的子发音器件,比如两个子发音器件是指一个带喇叭的MCU外接的语音芯片和一个带喇叭的TTS芯片),每个子发音器件都与单片机芯片意向连接。The pickups mentioned inclaims 2, 10, 19, and 22 are defined to contain a sounding device, and these pickups can be pickups that adopt a direct sound production method or a TTS sound production method or both adopt a direct sound production method Adopt the pickup of TTS sound production mode again, a sounding device in these 3 kinds of pickups is generally exactly a sub-sounding device, but does not exclude that a sounding device contained in the pickup mentioned inclaim 2, 10, 19 or 22 is made of A plurality of sub-pronunciation devices are formed (each sub-pronunciation device in a plurality of sub-pronunciation devices mentioned here can be the same or different sub-pronunciation devices of the sound production mode, such as two sub-pronunciation devices refer to a voice chip externally connected to the MCU with a horn and A TTS chip with a horn), and each sub-sounding device is intentionally connected with the single-chip microcomputer chip.

本说明书主要是针对直播方式拾音器和TTS方式拾音器来描述拾音器的技术特征的,而既采用直接产音方式又采用TTS产音方式的拾音器的各技术特征(包括硬件电路特征,解码处理,机械外型)等都可参照本说明书前面描述的直播方式拾音器和TTS方式拾音器的各技术特征。(本段所提及的各‘拾音器’可以是非插卡方式,插数据卡方式或插部件式数据卡方式拾音器)This manual mainly describes the technical characteristics of the pickups for the live broadcast pickup and the TTS pickup, but the technical features (including hardware circuit features, decoding processing, mechanical appearance) of the pickups that use both the direct sound production method and the TTS sound production method. type) etc. can refer to each technical feature of the live mode pickup and the TTS mode pickup described earlier in this manual. (Each 'pickup' mentioned in this paragraph can be a non-plug-in card mode, a data card mode or a plug-in data card mode pickup)

本发明文件说某拾音器是既采用直接产音方式又采用TTS产音方式的,那是指此拾音器是既采用直接产音方式又采用TTS产音方式的拾音器。本发明文件说某拾音器是采用直接产音方式的,那是指此拾音器是直播方式拾音器。本发明文件说某拾音器是采用TTS产音方式的,那是指此拾音器是TTS方式拾音器。The document of the present invention says that a certain sound pickup not only adopts the direct sound production method but also adopts the TTS sound production method, that means that this pickup is the pickup that not only adopts the direct sound production method but also adopts the TTS sound production method. The document of the present invention says that certain sound pick-up adopts the direct sound production mode, and that refers to that this sound pick-up is a live mode sound pick-up. The document of the present invention says that certain sound pick-up adopts the TTS sound production mode, and that refers to that this sound pick-up is a TTS mode sound pick-up.

以下3段各处提到的语种可以是任一非复合语种或任一复合语种。The language mentioned in each of the following 3 paragraphs can be any non-composite language or any compound language.

本说明书主要是以直播方式和TTS方式来描述编解码处理的,混合方式的编解码处理可参照本说明书前面描述的直播方式(或TTS方式)的编解码处理;如按编码规则中有关某语种的混合方式的编码细则来编码,那涉及到的拼合信息的值的形式,码字的形式,有声读物的形式等与本说明书前面述的按编码规则中有关某语种的直播方式(或TTS方式)的编码细则来编码涉及到的拼合信息的值的形式,码字的形式,有声读物的形式等是相似的。如按编码规则中有关某语种的直播方式(或TTS方式)的编码细则来编码得到的拼合信息的值在有声读物整页面上码字中可按某方式放置处置,那按编码规则中有关某语种的混合方式的编码细则来编码得到的拼合信息的值在有声读物整页面上码字中就也可按那方式放置处置。This manual mainly describes the codec processing in the live mode and TTS mode. The codec process in the mixed mode can refer to the codec process in the live mode (or TTS mode) described earlier in this manual; The encoding rules of the mixed mode are used to encode, the form of the value of the combined information involved, the form of the code word, the form of the audiobook, etc. are the same as the live broadcast method (or TTS method) of a certain language in the coding rules mentioned above in this specification. ) encoding rules to encode the form of the value of the stitched information involved, the form of the codeword, the form of the audiobook, etc. are similar. If the value of the spliced information encoded according to the coding rules of the live broadcast mode (or TTS mode) of a certain language in the coding rules can be placed in a certain way in the code words on the entire page of the audiobook, then according to the relevant certain language in the coding rules The value of the combined information obtained by encoding the encoding rules of the mixed mode of the language can also be placed and disposed in that way in the codeword on the entire page of the audiobook.

一个既采用直接产音方式又采用TTS产音方式的拾音器,解码时可能有采用某解码规则中有关某语种的混合方式的解码细则来解码,也可能有采用某解码规则中有关某语种的直播方式的解码细则来解码,也可能有采用某解码规则中有关某语种的TTS方式的解码细则来解码,而且这3种情况可能全部或有部分同时存在于此拾音器里。A pickup that adopts both the direct sound production method and the TTS sound production method may be decoded by using the decoding rules of the mixed method of a certain language in a certain decoding rule, or it may use the live broadcast of a certain language in a certain decoding rule It may be decoded by the decoding rules of the method, or it may be decoded by using the decoding rules of the TTS method of a certain language in a certain decoding rule, and all or some of these three situations may exist in the pickup at the same time.

在本说明书中,如说到拾音器是采用直接产音方式的时,如是用MCU外接喇叭来发音,是默认是MCU驱动MCU外接的喇叭发音的,而且发音器件是喇叭(此喇叭也是一个子发音器件),如是用MCU外接的带喇叭的语音芯片发音,是默认此语音芯片是与其带的喇叭一起发音的,而且发音器件是此带喇叭的语音芯片(此带喇叭的语音芯片也是一个子发音器件);在本说明书中,如说到拾音器是采用TTS产音方式的时,如是用MCU内的TTS器件与MCU外接的喇叭一起发音,是默认是MCU驱动MCU外接的喇叭发音的,而且发音器件是喇叭(此喇叭也是一个子发音器件),如是用MCU外接的带喇叭的TTS芯片发音,是默认此TTS芯片是与其带的喇叭一起发音的,而且发音器件是此带喇叭的TTS芯片(此带喇叭的TTS芯片也是一个子发音器件)。本发明文件所说的喇叭,可由喇叭电路等来作等同替换,喇叭电路是指由喇叭与一些简单的阻容等元件组成的,比如如附图1所示,由喇叭和1K电阻和8050组成一个喇叭电路。如拾音器是既采用直接产音方式又采用TTS产音方式的,此拾音器内有一个发音器件,此发音器件由数个子发音器件组成,显然子发音器件的定义与本段以上述类似,子发音器件可以是喇叭和带喇叭的语音芯片,或带喇叭的语音芯片和带喇叭的TTS芯片等。In this manual, if the pickup is used to produce sound directly, if the MCU external speaker is used to produce sound, it is the default that the MCU drives the MCU external speaker to produce sound, and the sound device is the speaker (this speaker is also a sub-voice device), if you use the voice chip with speaker externally connected to MCU to pronounce, it is default that the voice chip is pronounced together with the speaker, and the pronunciation device is the voice chip with speaker (this voice chip with speaker is also a sub-pronunciation device); in this manual, if the pickup adopts the TTS sound production method, if the TTS device in the MCU is used to make a sound together with the speaker connected to the MCU, it is the default that the MCU drives the speaker connected to the MCU to make a sound, and the sound is The device is a speaker (this speaker is also a sub-pronunciation device), if you use the TTS chip with a speaker external to the MCU to pronounce, it is default that the TTS chip is pronounced together with its speaker, and the pronunciation device is this TTS chip with a speaker ( This TTS chip with horn is also a sub-sounding device). The speaker mentioned in the document of the present invention can be equivalently replaced by a speaker circuit, etc. The speaker circuit is composed of a speaker and some simple resistance-capacitance components, such as shown in Figure 1, consisting of a speaker, a 1K resistor and an 8050 A horn circuit. If the pickup adopts both the direct sound production method and the TTS sound production method, there is a sounding device in the pickup, and the sounding device is composed of several sub-sounding devices. Obviously, the definition of the sub-sounding device is similar to the above in this paragraph. The sub-sounding device The device can be a speaker and a voice chip with a speaker, or a voice chip with a speaker and a TTS chip with a speaker, etc.

本发明文件各处述及的既采用直接产音方式又采用TTS产音方式的拾音器内有一个发音器件,此发音器件一般由数个子发音器件组成,这数个子发音器件默认是各自与MCU意向连接的。There is a pronunciation device in the pick-up that not only adopts the direct sound production mode but also adopts the TTS sound production mode mentioned in the documents of the present invention everywhere, and this pronunciation device is generally made up of several sub-sounding devices, and these several sub-sounding devices default to be respectively with the MCU intention connected.

本发明文件所说的某解码规则中有关某语种的解码细则涉及的发音形式是全用直接发音方式的含义是:用此解码规则中有关某语种的解码细则来对表示由此解码规则中有关某语种的解码细则定义的语句集中任一语句的发音的拼合信息的值进行解码都是用直接发音方式的,本说明书其它类似说法,比如某解码规则中有关某语种的解码细则涉及的发音形式是全用TTS发音方式的,某解码规则中有关某语种的解码细则涉及的发音形式是全用整步直接发音方式的,某解码规则中有关某语种的解码细则涉及的发音形式是用混合发音方式的等按此类推理解。The pronunciation forms involved in the decoding rules of a certain language in the said decoding rules of the present invention are the full meaning of the direct pronunciation mode: use the decoding rules of a certain language in the decoding rules to represent the relevant in the decoding rules The value of the combined information of the pronunciation of any sentence in the statement set defined by the decoding rules of a certain language is decoded by direct pronunciation. Other similar expressions in this manual, such as the pronunciation form involved in the decoding rules of a certain language in a certain decoding rule It uses the TTS pronunciation method entirely. The pronunciation forms involved in the decoding rules of a certain language in a certain decoding rule are all directly pronounced in full steps. The pronunciation forms involved in the decoding rules of a certain language in a certain decoding rule are mixed pronunciations. The manner and so on are understood by analogy.

对本发明文件各处所说到的解码时用直接发音方式(或类似说法比如解码时用TTS发音方式,解码时用整步TTS发音方式等),其涉及的‘解码时’默认是指在对一个拼合信息的值进行解码时。Use the direct pronunciation mode (or similar saying such as when decoding with the TTS pronunciation mode when decoding, use the whole step TTS pronunciation mode etc.) When the value of the flattened message is decoded.

本发明文件有述及编码语句集与解码语句集可完全相同或部分相同或完全不同,完全相同的情况比如编码语句集只含A,B,C三个语句,解码语句集只含A,B,C三个语句,部分相同的情况比如编码语句集只含A,B,C三个语句,解码语句集只含A,D两个语句,完全不同的情况比如编码语句集只含A,B,C三个语句,解码语句集只含D,E,F三个语句。The document of the present invention mentions that the coded sentence set and the decoded sentence set can be completely the same or partly the same or completely different. For example, the coded sentence set only contains A, B, and C three sentences, and the decoded sentence set only contains A, B. , C three sentences, part of the same situation, for example, the coded sentence set only contains A, B, C three sentences, the decoded sentence set only contains A, D two sentences, completely different situations, such as the coded sentence set only contains A, B , C three statements, the decoding statement set only contains D, E, F three statements.

除非提及时有特别指明有别的限定或可从提及处的上下文得出有别的限定,本说明书中各处提及的一个语种(或类似说法比如说某语种,某一语种等)默认可以是任一非复合语种或任一复合语种。除非提及时有特别指明有别的限定或可从提及处的上下文得出有别的限定,本说明书中各处提及的多个语种默认可以全是非复合语种或全是复合语种或既包含非复合语种也包含复合语种。除非提及时有特别指明有别的限定或可从提及处的上下文得出有别的限定,本说明书中各处提及包括某一语种在内的多个语种时,默认这多个语种中除开此某一语种外的其它语种可全是非复合语种或全是复合语种或既包含非复合语种也包含复合语种。A language (or similar expressions such as a certain language, a certain language, etc.) mentioned everywhere in this specification defaults to Can be any non-composite language or any compound language. Unless otherwise specified when mentioned or can be derived from the context of the mention, the multiple languages mentioned in this specification can be all non-compound languages or all compound languages or both. Non-compound languages also include compound languages. Unless otherwise specified in the reference or can be derived from the context of the reference, when multiple languages including a certain language are mentioned in this specification, it is assumed that the language in the multiple languages is Other languages except this certain language can be all non-composite languages or all compound languages or include both non-composite languages and compound languages.

本发明文件说到的任一编码语句集里,任两个语句都是不同的,在将多个编码语句集合起来成一个编码语集时,对任一语句,如果此语句同时存在于此多个编码语句集中的几个编码语句集里,那合起来的编码语句集里仍是只留一个此语句;本发明文件说到的任一解码语句集里,任两个语句都是不同的,在将多个解码语句集合起来成一个解码语句集时,对任一语句,如果此语句同时存在于此多个解码语句集中的几个解码语句集里,那合起来的解码语句集里仍是只留一个此语句。(本段所说的‘编码语句集合起来’中的‘编码语句集’是一个名词,本段所说的‘解码语句集合起来’中的‘解码语句集’是一个名词,本段所说的‘编码语句集中’中的‘编码语句集’是一个名词,本段所说的‘解码语句集中’中的‘解码语句集’是一个名词)In any set of coded sentences mentioned in the document of the present invention, any two sentences are all different. In several coded sentence sets in a coded sentence set, there is still only one such sentence in the combined coded sentence set; in any decoding sentence set mentioned in the document of the present invention, any two sentences are different, When multiple decoding sentences are combined into one decoding sentence set, for any sentence, if this sentence exists in several decoding sentence sets in the multiple decoding sentence sets at the same time, then the combined decoding sentence set is still Leave only one of these statements. ('encoded statement set' in the 'encoded statement set' mentioned in this paragraph is a noun, and the 'decoded statement set' in the 'decoded statement set' mentioned in this section is a noun. The 'encoded statement set' in the 'encoded statement set' is a noun, and the 'decoded statement set' in the 'decoded statement set' mentioned in this paragraph is a noun)

对于只适用于一个语种的编码规则,所说的此编码规则定义编码语句集与说此编码规则中有关此语种的编码细则定义编码语句集是等价的,所说的此编码规则完整定义编码语句集与说此编码规则中有关此语种的编码细则完整定义编码语句集是等价的,所说的此编码规则隐含定义编码语句集与说此编码规则中有关此语种的编码细则隐含定义编码语句集是等价的;对于只适用于一个语种的解码规则,所说的此解码规则定义解码语句集与说此解码规则中有关此语种的解码细则定义解码语句集是等价的,所说的此解码规则完整定义解码语句集与说此解码规则中有关此语种的解码细则完整定义解码语句集是等价的,所说的此解码规则隐含定义解码语句集与说此解码规则中有关此语种的解码细则隐含定义解码语句集是等价的。(本段所说语种可以为任一非复合语种或任一复合语种)For a coding rule that is only applicable to one language, the coded sentence set defined by the coded rule is equivalent to the coded sentence set defined in the coded rules related to this language in the coded rule, and the coded sentence set defined by the coded rule is completely defined. The statement set is equivalent to saying that the encoding rules about this language in this encoding rule completely define the encoding statement set, and the said encoding rule implicitly defines the encoding statement set and the encoding rules about this language in this encoding rule implicit Defining the encoding statement set is equivalent; for a decoding rule that is only applicable to one language, it is equivalent to saying that the decoding rule defines the decoding statement set and the decoding rules related to this language in the decoding rule define the decoding statement set, The complete definition of the decoding sentence set in the decoding rule is equivalent to the complete definition of the decoding sentence set in the decoding rules related to this language in the decoding rule, and the implicit definition of the decoding sentence set in the decoding rule is equivalent to the decoding rule The decoding rules for this language implicitly define that the sets of decoding sentences are equivalent. (The language mentioned in this paragraph can be any non-compound language or any compound language)

本发明文件所说的一个码字涉及的多个拼合信息的值,这些拼合信息的值可能是相同语种(此相同语种可能为非复合语种或复合语种)或不同语种(此不同语种可能全是非复合语种或全是复合语种或既包含非复合语种也包含复合语种),而且这些拼合信息的值中,可能既有由直播方式的编码处理来编码得到的,也有由TTS方式的编码处理来编码得到的,也有由混合方式的编码处理来编码得到的,或者这些拼合信息的值可能全是由直播方式(或全是由TTS方式或全是由混合方式)的编码处理来编码得到的,或是其它情况。The value of a plurality of splicing information that a codeword mentioned in the present invention file involves, the value of these splicing information may be the same language (this same language may be non-composite language or compound language) or different languages (this different languages may all be non-composite languages) compound languages or all are compound languages or both non-composite languages and compound languages), and the value of these combined information may be encoded by live encoding processing or TTS encoding processing obtained by coding in a mixed mode, or the values of these combined information may all be encoded in a live mode (or all in a TTS mode or all in a hybrid mode), or Is another situation.

在权利要求13,14,15,16,17,21或24中提及的解码程序及解码数据,可以是只适用于一个语种的一套解码程序及解码数据,也可以是适用于多个语种(此多个语种可能全是非复合语种或全是复合语种或既包含非复合语种也包含复合语种)的一套解码程序及解码数据The decoding program and decoding data mentioned inclaims 13, 14, 15, 16, 17, 21 or 24 may be a set of decoding programs and decoding data applicable to only one language, or may be applicable to multiple languages (The multiple languages may be all non-composite languages or all compound languages or include both non-composite languages and compound languages) a set of decoding programs and decoding data

关于本说明书述的印欧语系和非印欧语系涉及的变音情况与形同音不同情况的关系,本发明文件是这样约定的:对一个单词的某词形形态(或一个词),如其有两个明显不同的发音,或者在两个不同发音时其词意明显不同,那一般应看成是形同音不同的情况,如其有两个区别不大的发音,而且在这两个不同发音时其词意相同,那一般应看成是变音的情况;对一个单词的某词形形态(或一个词),首先涉及其是否有形同音不同的情况,如没有,那再涉及此单词的此词形形态(或此词)是否有变音的情况,如有,那对每个不同的发音情况再涉及各自是否有变音的情况。Regarding the relationship between the Indo-European language family and the non-Indo-European language family described in this specification, the relationship between the situation of inflections and the different situations of homophones, the document of the present invention stipulates that: for a certain form (or a word) of a word, if its If there are two obviously different pronunciations, or the word meanings are obviously different in the two different pronunciations, it should generally be regarded as a case of homonyms. If there are two pronunciations with little difference, and the two different When the meaning of the word is the same when pronounced, it should generally be regarded as an inflected situation; for a certain form of a word (or a word), it first involves whether there is a situation of homophony and difference, if not, then it involves the word Whether this morphological form (or this word) has the situation of inflection, if any, then relates to the situation of respectively whether there is inflection to each different pronunciation situation.

对印欧语系任一语种,解码用到的所有数据库,比如单词数据库,词根数据库,范句库等这些数据库都应归入解码数据范围。对非印欧语系任一语种,解码用到的所有数据库,比如词数据库,单字拼音表(汉语时),范句库等这些数据库都应归入解码数据范围。对任一复合语种,解码用到的所有数据库,比如单词数据库(英语的),词根数据库(英语的),词数据库(汉语的),单字拼音表(汉语的),范句库(汉语的)等这些数据库都应归入解码数据范围。For any language of the Indo-European language family, all databases used for decoding, such as word databases, root databases, sample sentence databases, etc., should be included in the scope of decoding data. For any language of the non-Indo-European language family, all databases used for decoding, such as word databases, single-character pinyin tables (in Chinese), sample sentence databases, etc., should be included in the scope of decoding data. For any compound language, all databases used for decoding, such as word database (English), word root database (English), word database (Chinese), single-character pinyin table (Chinese), sample sentence database (Chinese) These databases should be included in the scope of decoding data.

本说明书在阐述编解码技术时是按这样来说的:对于印欧语系各语种,解码数据一般是包括单词数据库和词根数据库等,但不包括诸音码的发音数据(直接发音方式时也是这样),非印欧语系各语种的解码数据也是不包括诸音码的发音数据的,复合语种的解码数据也是不包括诸音码的发音数据的。当然如是把诸音码的发音数据归入语种的解码数据也可,这与把诸音码的发音数据和语种的解码数据分开是等同替换。This instruction manual is said according to this when elaborating codec technology: for the various languages of Indo-European language family, decoding data generally comprises word database and word root database etc., but does not comprise the pronunciation data of all phonetic codes (during the direct pronunciation mode also is like this ), the decoding data of non-Indo-European languages does not include the pronunciation data of all phonetic codes, and the decoding data of compound languages does not include the pronunciation data of all phonetic codes. Of course it is also possible to classify the pronunciation data of all phonetic codes into the decoding data of the language, which is equal to replacing the pronunciation data of all the phonetic codes and the decoding data of the language.

本发明文件各处提及的解码数据,一般是包括单词数据库的数据和词根数据库的数据等,为了叙述方便,各处是按这样来说的:解码数据一般是包括单词数据库和词根数据库等。各处有提及的‘单词数据库,词根数据库等解码数据’也是按这样来说的。本段前面述是针对印欧语系各语种的,对于非印欧语系各语种和复合语种也是类似这样的。The decoded data that the document of the present invention mentions everywhere generally is to comprise the data of word database and the data of root database etc., for the convenience of narration, everywhere is said like this: decoded data generally comprises word database and root database etc. The 'word database, root database and other decoding data' mentioned everywhere are also said in this way. The above mentioned in this paragraph is for the Indo-European languages, and it is similar for the non-Indo-European languages and compound languages.

以上三段适用于直播方式或TTS方式或混合方式的编解码处理,以上3段提及的解码数据默认是指一个语种的解码数据。The above three paragraphs are applicable to the codec processing of live broadcast mode, TTS mode or mixed mode. The decoded data mentioned in the above three paragraphs refers to the decoded data of one language by default.

本说明书各处提及的‘印欧语系语种’,其‘语种’前没加限定,就是泛指印欧语系各语种,而象‘印欧语系某语种’就是指印欧语系某一语种;本说明书各处提及的‘非印欧语系语种’,其‘语种’前没加限定,就是泛指非印欧语系各语种,而象‘非印欧语系某语种’就是指非印欧语系某一语种。The "Indo-European languages" mentioned in this manual, without any limitation before "language", refers to all Indo-European languages in general, and "a certain language of the Indo-European language" refers to a certain language of the Indo-European language; The "non-Indo-European languages" mentioned in the manual, without any limitation before the "language", refers to all non-Indo-European languages in general, and "a non-Indo-European language" refers to a non-Indo-European language. one language.

显然本发明文件各处提及的‘频率编号的长度信息’与‘频率编号长度信息’是同一概念。Apparently, the 'length information of the frequency number' and 'the length information of the frequency number' mentioned in the documents of the present invention are the same concept.

本说明书在各处叙述的技术点默认是适用于语言的总概念(语言最上位概念)的(除非叙述时有特别指明有他义或可从上下文推出有他义),适用于语言的总概念即是既适用于印欧语系的各语种,也适用于非印欧语系的各语种的,这是显而易见的,而且一般也是适用于任一复合语种的除非特别指明不适用于复合语种或可从上下文推出不适用于复合语种。The technical points described in this manual are defaulted to be applicable to the general concept of language (the highest level concept of language) (unless it is specifically stated that there are other meanings or can be deduced from the context), and the general concept of language is applicable It is obvious that it is applicable to both Indo-European languages and non-Indo-European languages, and it is generally applicable to any compound language unless it is specifically stated that it is not applicable to compound languages or can be obtained from Context push does not work with compound languages.

本发明文件各处所提的码字携带某值的含义就是指可从码字得到某值(一般就是把某值或此值的变换值放入此码字里),本段所说的码字是指单个码字。The meaning that the code words mentioned in the documents of the present invention carry a certain value refers to that a certain value can be obtained from the code word (generally, a certain value or the transformation value of this value is put into this code word), and the code word mentioned in this paragraph A word refers to a single codeword.

本发明文件各处所提及的一个码字的值有一个拼合信息的值,是指此码字的值有此拼合信息的值(一般是把此拼合信息的值本身放在此码字里,或是把此拼合信息的值的变换值放在此码字里),而且此码字的值不与其它拼合信息的值有关(由此推理可知,此码字的值里没有有关其它拼合信息的值的信息)。The value of a codeword mentioned in the document of the present invention has a value of combining information, which means that the value of this codeword has the value of this combining information (generally the value of this combining information itself is placed in this codeword) , or put the transformation value of the value of this merging information in this codeword), and the value of this codeword is not related to the value of other merging information (from this reasoning, it can be seen that there is no relevant other merging in the value of this codeword information on the value of the information).

按本说明书的内容可知,本发明文件各处提及的在语音编码有声读物整页面上的诸个码字中的各码字,其码值是共与数个拼合信息的值有关的(此码字即拼合信息码),所以在本发明文件中所说的语音编码有声读物整页面上如还有另外的码字,但如其码值不是与拼合信息的值有关的,此码字就不属于本发明文件各处所说的语音编码有声读物整页面上的诸个码字的范围;比如,在有些语音编码有声读物的整页面上可能有另一些特殊的码字,比如码字带的是暂停的控制信息,如拾音器读取此码字后,就把拾音器正在播的语音暂停下来,再读一次此码字又从暂停处继续播,此种码字的值不是与拼合信息的值有关的,所以此种码字不属于本发明文件各处所说的语音编码有声读物整页面上的诸个码字的范围。According to the contents of this specification, it can be seen that the code values of each code word in the code words on the entire page of the speech coded audiobook mentioned in the document of the present invention are related to the value of several pieced information (here Code word is the combination information code), so if there are other code words on the entire page of the speech coded audiobook said in the document of the present invention, but as its code value is not related to the value of the combination information, this code word is not It belongs to the scope of codewords on the entire page of the audio-coded audiobook mentioned in the document of the present invention; for example, there may be other special codewords on the entire page of some audio-coded audiobooks, such as the codeword with Paused control information, such as after the pickup reads the codeword, it pauses the voice that the pickup is playing, reads the codeword again and continues playing from the paused point, the value of this codeword is not related to the value of the spliced information Yes, so this kind of codeword does not belong to the range of codewords on the entire page of the speech coded audiobook said in various places of the document of the present invention.

如提及时无特别指明有他义,本发明文件各处提及的码字默认是指语音编码有声读物的整页面上的诸个码字中的码字。如提及时无特别指明有他义,本发明文件各处提及的导电码,条形码,图形码或磁码默认也是属于有声读物的整页面上的诸个码字中的码字。If there is no other meaning when mentioned, the codewords mentioned in the documents of the present invention refer to the codewords in the codewords on the entire page of the speech-encoded audiobook by default. If there is no special meaning when mentioned, the conductive code, barcode, graphic code or magnetic code mentioned throughout the document of the present invention is also the codeword among the codewords on the entire page of the audiobook by default.

本发明文件各处出现的‘某’,如其是修饰其后面名词的而且此‘某’后面紧跟着没出现数量词,那这个‘某’默认是指‘某一’的意思,这也是常识;比如说‘某语句’就是指‘某一语句’。本发明文件各处出现的‘此’,如其是修饰其后面名词的而且此‘此’后面紧跟着没出现数量词,那这个‘此’默认是指‘此一’的意思,这也是常识;比如说‘此语种’就是指‘此一语种’。The 'some' appearing in the document of the present invention, if it modifies the noun behind it and there is no quantifier immediately after the 'some', then this 'some' means 'some' by default, which is also common sense; For example, 'a statement' means 'a statement'. The 'he' appearing everywhere in the document of this invention, if it modifies the nouns behind it and there is no quantifier immediately after the 'this', then the 'he' defaults to the meaning of 'this one', which is also common sense; For example, 'this language' means 'this language'.

本发明文件各处提及的某语种的码字,是指此码字的值是共与数个拼合信息的值有关的而且此码字涉及的这数个拼合信息的值都是此语种的;本发明文件各处提及的某语种的拼合信息的值,是指此拼合信息的值是表示此语种的语句发音的拼合信息的值;本发明文件各处提及的只适用于某语种的数据卡,是指此数据卡上只存储有只适用于此语种的解码程序及解码数据;本发明文件各处提及的适用于某语种的数据卡(在‘适用’前面无‘只’的),是指此数据卡上存储有适用于此语种的解码程序及解码数据(不排除此解码程序及解码数据还可能适用于别的语种),还可能存储有适用于别的语种的另外的解码程序及解码数据;本发明文件各处提及的只适用于某语种的部件式数据卡,是指此部件式数据卡上只存储有只适用于此语种的解码程序及解码数据;本发明文件各处提及的适用于某语种的部件式数据卡(在‘适用’前面无‘只’的),是指此部件式数据卡上存储有适用于此语种的解码程序及解码数据(不排除此解码程序及解码数据还可能适用于别的语种),还可能存储有适用于别的语种的另外的解码程序及解码数据;本发明文件各处提及的只适用于某语种的解码程序及解码数据,是指拾音器涉及的可用于对表示此语种的语句发音的拼合信息的值进行解码的解码程序及解码数据(但不能对别的语种的拼合信息的值进行解码);本发明文件各处提及的适用于某语种的解码程序及解码数据(在‘适用’前面无‘只’的),是指拾音器涉及的可用于对表示此语种的语句发音的拼合信息的值进行解码的解码程序及解码数据,此解码程序及解码数据还可能可用于对别的语种的拼合信息的值进行解码;本发明文件各处提及的适用于多个语种的一套解码程序及解码数据,是指拾音器涉及的可用于对这多个语种中各自语种的拼合信息的值都能解码的一套解码程序及解码数据(但不能对这多个语种外别的语种的拼合信息的值进行解码);本发明文件各处提及的只适用于某语种的解码规则,是指可用于对表示某语种的语句发音的拼合信息的值进行解码的解码规则(但此解码规则不能对别的语种的拼合信息的值进行解码);本发明文件各处提及的只适用于某语种的编码规则,是指可用于对某语种的语句进行编码以生成拼合信息的值的编码规则(但此编码规则不能对别的语种的语句进行编码)。本段各处提到的‘某语种’可以是印欧语系任一语种或非印欧语系任一语种或任一复合语种,本段各处提到的‘多个语种’可以全是非复合语种或全是复合语种或既包含非复合语种也包含复合语种。The code word of a certain language mentioned in the document of the present invention means that the value of this code word is related to the value of several pieced together information and the values of these several pieced together information involved in this code word are all of this language The value of the mosaic information of a certain language mentioned in the document of the present invention refers to that the value of this mosaic information is the value of the mosaic information representing the sentence pronunciation of this language; The data card refers to the data card that only stores the decoding program and decoding data that are only applicable to this language; the data card that is applicable to a certain language mentioned in the document of the present invention (there is no "only" before "applicable") ), which means that the decoding program and decoding data suitable for this language are stored on the data card (this decoding program and decoding data may also be applicable to other languages), and there may also be other languages applicable to other languages stored on the data card. The decoding program and decoding data; the part-type data card that is only applicable to a certain language mentioned in the documents of the present invention means that only the decoding program and decoding data that are only applicable to this language are stored on the part-type data card; The part-type data card applicable to a certain language mentioned in the invention documents (there is no "only" in front of "applicable") means that the part-type data card stores the decoding program and decoding data suitable for this language ( It is not ruled out that this decoding program and decoding data may also be applicable to other languages), and may also store other decoding programs and decoding data applicable to other languages; Program and decoding data refer to the decoding program and decoding data that can be used to decode the value of the combined information representing the sentence pronunciation of this language (but cannot decode the value of the combined information of other languages) that the pickup relates to; The decoding program and decoding data applicable to a certain language mentioned in the document (there is no "only" before "applicable") refers to the values involved in the pickup that can be used to decode the combined information representing the pronunciation of sentences in this language The decoding program and decoding data, this decoding program and decoding data may also be used to decode the value of the combined information in other languages; a set of decoding programs and decoding data applicable to multiple languages mentioned in the document of the present invention , refers to a set of decoding programs and decoding data that can be used to decode the values of the combined information of the respective languages in these multiple languages related to the pickup (but can not be used for the value of the combined information of other languages outside these multiple languages. decoding); the decoding rules that are only applicable to a certain language mentioned in the documents of the present invention refer to the decoding rules that can be used to decode the value of the combined information that represents the sentence pronunciation of a certain language (but this decoding rule cannot be used for other languages) The value of the combined information of the language is decoded); the coding rules mentioned in the documents of the present invention are only applicable to a certain language, and refer to the coding rules that can be used to encode the sentences of a certain language to generate the value of the combined information (but this Encoding rules cannot encode sentences in other languages). The "a certain language" mentioned in this paragraph can be any language of the Indo-European language family or any non-Indo-European language or any compound language, and the "multiple languages" mentioned in this paragraph can all be non-compound languages Either all of them are composite languages or they contain both non-compound languages and composite languages.

本发明文件各处提及的适用于某语种的编码规则(在‘适用’前面无‘只’的),此编码规则可以是只适用于此语种,或可以是适用于包括此语种在内的多个语种;对于本发明文件各处提及的适用于某语种的解码规则’(在‘适用’前面无‘只’的),也是同样道理,此解码规则可以是只适用于此语种,或可以是适用于包括此语种在内的多个语种。本段各处提到的‘某语种’可以是印欧语系任一语种或非印欧语系任一语种或任一复合语种。The coding rules applicable to a certain language mentioned in the document of this invention (there is no "only" in front of "applicable"), this coding rule can be only applicable to this language, or can be applicable to including this language multiple languages; the same is true for the decoding rules applicable to a certain language mentioned in the document of the present invention (there is no “only” before the word “applicable”). This decoding rule can only be applicable to this language, or Can be applicable to multiple languages including this one. The "a certain language" mentioned in this paragraph can be any language of the Indo-European language family or any language of the non-Indo-European language family or any composite language.

本发明文件各处提及的编解码规则,是指编码规则和解码规则。The encoding and decoding rules mentioned throughout the documents of the present invention refer to encoding rules and decoding rules.

导电码技术的介绍:导电码及其读码头的技术,是本发明人在另一发明申请‘一种有声读物及拾音器主体’里(申请号为2006102005537)申请的,本说明书也作个介绍:一般是在读物的页面上,由导电物质(一般是导电油墨)组成不同形状以形成不同的电路组合来表示不同的码值,一般可采用在读物的页面上印刷导电油墨的方式来形成导电码,导电码可通过拾音器的机电式导电码读码头被拾音器读取,在读码时使一个机电式导电码读码头与一个导电码接触,在导电码的码字区域内还有触区,在读码时触区用于与机电式导电码读码头接触并由码字区域内的全部触区或部分触区与机电式导电码读码头作电流交流以检测码值;一个导电码的大小可类似于一个条形码(每个导电码占的面积约为4CM*1CM);导电码一般分通断式和模数转换式两种形式,相应的机电式导电码读码头也分通断式和模数转换式两种形式读码头;用机电通断式读码头读取通断式导电码的一般过程是:读码时,使读码头与读物的页面上的导电码接触,由读码头上的一个或多个触头向一个导电码码字的一个或多个触区注入电流,电流经过导电码码字上导电物质形成的网络后,再从导电码其它各触区流回读码头,再对从导电码其它各触区回来的电流的有无情况进行检测,由检测结果得到导电码的各触区位置间的通断组合情况而得出码值;如导电码的质材的颜色与读物的页面的颜色不同的话,还可用光电式读码头读取导电码。本说明书中有关图6的拾音器的实施例涉及读码的部分,就是一个机电通断式读码头读取通断式导电码(单排码线单码段通断式导电码)的。The introduction of the conductive code technology: The conductive code and the technology of reading the dock are applied by the inventor in another invention application 'an audiobook and the main body of the pickup' (the application number is 2006102005537). This specification also makes an introduction: Generally, on the page of the reading material, different shapes are composed of conductive substances (usually conductive ink) to form different circuit combinations to represent different code values. Generally, the conductive code can be formed by printing conductive ink on the page of the reading material. The conductive code can be read by the pickup through the electromechanical conductive code reading terminal of the pickup. When reading the code, an electromechanical conductive code reading terminal is in contact with a conductive code. There is also a contact area in the code word area of the conductive code. When reading the code The time contact area is used to contact with the electromechanical conductive code reading terminal and make current exchange with the electromechanical conductive code reading terminal by all or part of the contact area in the code word area to detect the code value; the size of a conductive code can be similar to A barcode (the area occupied by each conductive code is about 4CM*1CM); the conductive code is generally divided into two forms: on-off type and analog-to-digital conversion, and the corresponding electromechanical conductive code reading terminal is also divided into on-off type and analog-to-digital conversion The general process of reading the on-off conductive code with the electromechanical on-off reading terminal is: when reading the code, make the reading terminal contact the conductive code on the page of the reading material, and one or the other on the reading terminal Multiple contacts inject current into one or more contact areas of a conductive code word. After the current passes through the network formed by the conductive material on the conductive code word, it flows back to the reading dock from other contact areas of the conductive code, and then The presence or absence of the current returned from other contact areas of the conductive code is detected, and the code value is obtained from the on-off combination of the contact area positions of the conductive code from the detection results; such as the color of the material of the conductive code and the reading material. If the color of the page is different, the conductive code can also be read by the photoelectric reading dock. The embodiment of the pickup about Fig. 6 in this specification relates to the part of code reading, is exactly that an electromechanical on-off type reading dock reads on-off type conductive code (single-row code line single code segment on-off type conductive code).

如提及时无特别指明有他义,本发明文件各处提及的其前面没带数量词的‘拾音器主体’默认是指一个拾音器主体。If there is no special meaning when mentioning, the "pickup body" mentioned in the documents of the present invention without quantifiers in front of it refers to a pickup body by default.

本发明文件中各处提及的‘有声读物’与‘语音编码有声读物’是同一概念。The 'audiobook' mentioned everywhere in the document of the present invention is the same concept as the 'speech coded audiobook'.

权利要求书中各处提及的‘所述部诸码字中的一个码字’是指部诸码字中的某一码字。'A codeword in the said codewords' mentioned everywhere in the claims refers to a certain codeword in the codewords.

在权利要求书中有多处提及的‘所述诸个码字中的某一码字’,是指语音编码有声读物的整页面上的诸个码字中的一个码字,比如诸个码字中含有4个码字(码字1,码字2,码字3和码字4),那诸个码字中的某一码字就是指码字1或码字2或码字3或码字4。'A certain codeword among the codewords' mentioned in the claims refers to one of the codewords on the entire page of the audio-coded audiobook, such as the Contain 4 codewords (codeword 1,codeword 2,codeword 3 and codeword 4) in the codeword, a certain codeword in that all codewords refers tocodeword 1 orcodeword 2 orcodeword 3 orcodeword 4.

在权利要求中各处提及的‘所述诸个码字中的某一码字’中的‘所述’是只用于修饰此‘所述’后面的‘诸个码字’的。在权利要求书中各处提及的‘所述语种的拼合信息的值’中的‘所述’是只用于修饰此‘所述’后面的‘语种’的。在权利要求书中各处提及的‘所述的一个码字的码值’中的‘所述’是只用于修饰此‘所述’后面的‘一个码字’的。在权利要求书中各处提及的‘所述数个数据卡’中的‘所述’是只用于修饰此‘所述’后面的‘数个数据卡’的。在权利要求书中各处提及的‘所述数个卡插槽’中的‘所述’是只用于修饰此‘所述’后面的‘数个卡插槽’的。在权利要求书中各处提及的‘所述数个部件式数据卡’中的‘所述’是只用于修饰此‘所述’后面的‘数个部件式数据卡’的。在权利要求书中各处提及的‘所述数个部件式卡插槽’中的‘所述’是只用于修饰此‘所述’后面的‘数个部件式卡插槽’的。在权利要求书中各处提及的‘所述一个码字’中的‘所述’是只用于修饰此‘所述’后面的‘一个码字’的。The 'said' in 'one of the codewords' mentioned in the claims is only used to modify the 'codewords' following the 'said'. The 'said' in 'the value of the combined information of the said language' mentioned in the claims is only used to modify the 'language' behind the 'said'. The 'said' in 'the code value of the said one codeword' mentioned in the claims is only used to modify the 'one codeword' following the 'said'. The 'said' in the 'several data cards' mentioned in the claims is only used to modify the 'several data cards' following the 'said'. The 'said' in 'several card slots' mentioned in the claims is only used to modify the 'several card slots' behind this 'said'. The 'said' in the 'several component data cards' mentioned in the claims is only used to modify the 'several component data cards' behind the 'said'. The 'said' in 'several component card slots' mentioned in the claims is only used to modify the 'several component card slots' behind this 'said'. The 'said' in 'the one codeword' mentioned in the claims is only used to modify the 'one codeword' following the 'said'.

本发明文件各处所说的码字被拾音器读取,默认是指每次一个码字被一个拾音器的一个读码头读取(每个拾音器只有一个读码头);如有每次多个码字被一个拾音器的多个读码头读取,那这种每次多个码字被一个拾音器的多个读码头读取的情况与每次一个码字被一个拾音器的一个读码头读取是等同替换,因这多个读码头合起来可看成是一个面积很大的读码头,这多个码字合起来可看成是一个面积很大的码字。(本段提到的‘多个码字’中的‘多’与‘多个读码头’中的‘多’数量可相同或不同,因有比如一个读码头可一次读几个码字或一个码一次被几个读码头读等情况)。如提及时无特别指明有他义,本发明文件各处提及的读码头显然默认是指单个的读码头。如提及时无特别指明有他义,本发明文件各处提及的拾音器显然默认是指单个的拾音器。The said code word in document of the present invention is read by pickup by pickup, acquiescently refers to that each code word is read by a reading dock of a pickup (each pickup has only one reading dock); Multiple reading docks of a pickup are read, and the situation that multiple codewords are read by multiple reading docks of a pickup each time is equivalent to one codeword being read by one reading dock of a pickup every time. Because of these multiple reading docks can be regarded as a very large area of reading docks together, these multiple code words can be regarded as a very large code word of an area together. (the 'many' in the 'multiple codewords' mentioned in this paragraph can be the same or different from the 'many' in the 'multiple reading terminals', because for example, one reading terminal can read several codewords or one The code is read by several reading terminals at a time, etc.). If there is no special meaning when it is mentioned, the reading terminal mentioned in the document of the present invention obviously refers to a single reading terminal by default. If there is no special meaning when it is mentioned, the pickups mentioned in the documents of the present invention refer to a single pickup by default.

在任一权利要求项中出现的‘前述’都是指本权利要求项内的前面所述。'Aforementioned' appearing in any claim refers to the preceding statement in this claim.

在权利要求书中各处提及的‘前述部诸码字’中的‘前述’与‘部诸码字’是默认搭配,即只要‘前述’后面出现‘部诸码字’,那此‘前述’就是只修饰这个‘部诸码字’的,而不修饰‘部诸码字’后面跟的其它内容,‘前述部诸码字’是指本权利要求项内前面述及的‘部诸码字’;与此类似,权利要求书中各处提及的‘前述单片机芯片’,‘前述拾音器’,‘前述数个卡插槽’,‘前述数个数据卡’,‘前述数个部件式卡插槽’,‘前述数个部件式数据卡’,‘前述语音编码有声读物’,‘前述读码头’,‘前述发音器件’,‘前述诸个码字’,‘前述读物’,‘前述某一码字’,‘前述一个码字’,‘前述拼合信息的值’,‘前述拼合信息的值1’,‘前述多个码字’,‘前述语句集’,‘前述编码语句集’,‘前述解码语句集’,‘前述语句1’,‘前述语句’,‘前述TTS器件’,‘前述拾音器主体’,‘前述编码规则’,‘前述这种编码规则’或‘前述这种解码规则’中的各提法中的‘前述’与此提法中此‘前述’后面的部分(比如‘前述’与‘编码规则’)都是默认搭配,即这些提法中的各提法中的‘前述’词是只用于修饰此提法中此‘前述’后面的部分(比如‘前述编码规则’中的‘前述’是只用于修饰此‘前述’后面的‘编码规则’的)。"Aforementioned" and "all codewords" in the "aforementioned codewords" mentioned everywhere in the claims are collocations by default, that is, as long as "partial codewords" appear behind the "aforementioned", then this' Aforesaid ' just only modifies this ' all code words ', and does not modify other content that follows behind ' all code words ', and ' all code words of the foregoing parts' refers to the ' all code words mentioned above' in the claims Code word'; similar to this, the 'aforesaid single-chip microcomputer chip', 'the aforesaid pickup', 'the aforesaid several card slots', 'the aforesaid several data cards', 'the aforesaid several components mentioned in the claims Type card slot', 'the aforementioned several component data cards', 'the aforementioned speech coded audiobook', 'the aforementioned reading dock', 'the aforementioned pronunciation device', 'the aforementioned codewords', 'the aforementioned reading material',' The aforementioned certain codeword', 'the aforementioned one codeword', 'the value of the aforementioned combined information', 'the value 1 of the aforementioned combined information', 'the aforementioned multiple codewords', 'the aforementioned sentence set', 'the aforementioned coded sentence set ', 'The aforementioned decoding statement set', 'The aforementioned statement 1', 'The aforementioned statement', 'The aforementioned TTS device', 'The aforementioned pickup body', 'The aforementioned encoding rule', 'The aforementioned encoding rule' or 'The aforementioned such The 'aforementioned' in each reference in the decoding rules' and the part after the 'aforementioned' in this formulation (such as 'aforementioned' and 'encoding rules') are default collocations, that is, each of these formulations The word "aforementioned" in the reference is only used to modify the part behind the "aforementioned" in this formulation (for example, the "aforementioned" in the "aforementioned encoding rules" is only used to modify the "encoding rules" behind the "aforementioned" ).

本发明文件各处所提及的码字的值就是码字的码值。The values of the codewords mentioned throughout the present document are the code values of the codewords.

在本发明文件中各处有提及‘码字涉及的全部数个拼合信息的值’(这里的码字是指单个码字),这种提法中的‘码字’与‘数个拼合信息的值’的关系是:码字的值是共与这数个表示如何用音码拼合成语句发音的拼合信息的值有关的(码字的值不再与别的拼合信息的值有关),所以这数个拼合信息的值是码字涉及的全部数个拼合信息的值,且一般是码字有一个拼合信息的值的全部(或部分)。(本段所提及的几个‘数个’都是指相同数目)In the document of the present invention, there are references to 'the value of all the combined information involved in the code word' (the code word here refers to a single code word), and the 'code word' and 'number of combined information' in this formulation The relation of the value ' of information is: the value of code word is to be related to the value of the information of how to piece together the pronunciation of sentence with these several representations (the value of code word is no longer relevant with the value of other combination information) , so the values of these pieces of pieced information are the values of all pieces of pieced information involved in the codeword, and generally all (or part) of the value of one piece of pieced information that the codeword has. (The several 'several' mentioned in this paragraph all refer to the same number)

在本发明文件中各处提及‘码字涉及的各拼合信息的值’(这里的码字是指单个码字),是指码字涉及的全部数个拼合信息的值中的各个拼合信息的值。In the document of the present invention, it is mentioned everywhere that 'the value of each piece of information involved in the codeword' (the codeword here refers to a single codeword) refers to each piece of information in the values of all the pieces of information that are involved in the codeword value.

在本发明文件中各处有提及‘码字涉及的拼合信息的值’(这里的码字是指单个码字),是指码字涉及的全部数个拼合信息的值中的一个拼合信息的值(如这里的数个是一个的话,那全部数个拼合信息的值中的一个拼合信息的值就是指这一个拼合信息的值)。在本发明文件中各处有类似‘码字涉及的表示如何用音码拼合成语句发音的拼合信息的值’或‘码字涉及的表示语句发音的拼合信息的值’的提法,这些提法与‘码字涉及的拼合信息的值’的意思相同;对于本发明文件中有‘码字涉及的此语种的拼合信息的值’的提法,也是按‘码字涉及的拼合信息的值’这样来理解(只是对拼合信息的值作了限定是某语种),是指码字涉及的全部数个拼合信息的值中的一个此语种的拼合信息的值(如这里的数个是一个的话,那全部数个拼合信息的值中的一个此语种的拼合信息的值就是指这一个拼合信息的值且这一个拼合信息的值是此语种的)。对于码字涉及的拼合信息的值,可能是此码字的值有此拼合信息的值的全部或部分。In the document of the present invention, there are references to "the value of the combined information involved in the codeword" (the codeword here refers to a single codeword), which refers to a combined information in the value of all the combined information involved in the codeword (if the number here is one, the value of one piece of pieced information among all the values of pieced together information refers to the value of this piece of pieced together information). In the document of the present invention, there are references similar to 'the value of the combined information of how the code word relates to the expression of the pronunciation of the sentence' or 'the value of the combined information of the expression sentence pronunciation involved in the code word', these proposals The meaning of the method is the same as that of 'the value of the combined information involved in the code word'; for the reference of 'the value of the combined information of this language that the code word involves' in the document of the present invention, it is also by the value of the combined information involved in the code word 'Understanding in this way (just limiting the value of the combined information to be a certain language), refers to the value of one of the combined information of this language in the value of all the combined information involved in the codeword (as here, several are one If so, the value of one piece of information in this language among the values of all pieces of pieced together information refers to the value of this piece of pieced together information and the value of this piece of pieced together information is in this language). For the value of the combination information involved in the codeword, it may be that the value of the codeword has all or part of the value of the combination information.

以上5段所提及的各码字都是指单个码字。Each codeword mentioned in the above 5 paragraphs refers to a single codeword.

在本发明文件中各处有提及有关‘拼合信息的值涉及的语句’(或类似提法),这种提法中的‘拼合信息的值’与‘语句’的关系是:此拼合信息的值就是表示此语句发音的拼合信息的值,所以此语句是拼合信息的值涉及的语句。In the document of the present invention, there are references to the "sentences involved in the value of the combined information" (or similar formulations), the relationship between the "value of the combined information" and the "statement" in this formulation is: the combined information The value of is the value of the combined information representing the pronunciation of this sentence, so this sentence is the sentence involved in the value of the combined information.

对于任一语种(可以是印欧语系任一语种或非印欧语系任一语种或任一复合语种),本发明文件各处所说的此语种的语句集,是指语句的集合,此语句集里的所有语句都是此语种的(此语种也可能是复合语种),语句集的范围并无特别限制,可以是有限的,比如教给幼儿班小孩的英语语句,显然语句个数一般是有限的,或甚至可以是无限的(比如英语语种的任意语句);如定义某语句集里语句个数是无限的(比如此语句集包括某一语种的任意语句),而如用几种音码可拼合此语句集发音,那此语句集里的任一语句的发音都可由这几种音码拼合成。For any language (it can be any language of the Indo-European language family or any language of the non-Indo-European language family or any compound language), the sentence set of this language mentioned in the documents of the present invention refers to the collection of sentences. All the sentences in are in this language (this language may also be a compound language), the scope of the sentence set is not particularly limited, it can be limited, such as English sentences taught to kindergarten children, obviously the number of sentences is generally limited of, or even can be infinite (such as any sentence in the English language); as defining the number of sentences in a certain sentence set is infinite (such as this sentence set includes any sentence in a certain language), and as using several phonetic codes The pronunciation of this sentence collection can be put together, and the pronunciation of any sentence in this sentence collection can be assembled by these several phonetic codes.

对于任一语种,如定义两个不同的语句集(比如都是按完整定义方式来定义的),那对这两个语句集,可分别作编码处理,可分别作解码处理;如不同编码规则中有关某语种的编码细则对应定义(比如都是按完整定义方式来定义的)的是此语种的不同语句集,那这不同编码规则中有关此语种的编码细则应看作是不同的;如不同解码规则中有关某语种的解码细则对应定义(比如都是按完整定义方式来定义的)的是此语种的不同语句集,那这不同解码规则中有关此语种的解码细则应看作是不同的。本段所说的语种可以是印欧语系任一语种或非印欧语系任一语种或任一复合语种。For any language, if two different sentence sets are defined (for example, both are defined according to the complete definition method), then the two sentence sets can be encoded and decoded separately; if different encoding rules The coding rules of a certain language in the corresponding definition (for example, are defined according to the complete definition method) are different sentence sets of this language, then the coding rules of this language in different coding rules should be regarded as different; The decoding rules for a certain language in different decoding rules correspond to different sentence sets of the language (for example, they are all defined in the way of complete definition), so the decoding rules for this language in different decoding rules should be regarded as different of. The language mentioned in this paragraph may be any language of the Indo-European language family or any language of the non-Indo-European language family or any compound language.

如提及时没特别指明有他义,本发明文件各处所提及的拼合信息的值(或表示语句发音的拼合信息的值或表示如何用音码拼合成语句发音的拼合信息的值)默认是指一个拼合信息的值。There are no other meanings when mentioning, the value of the splicing information (or the value of the splicing information representing the pronunciation of the sentence or the value of the splicing information of how to splice the pronunciation of the sentence) mentioned in the documents of the present invention is defaulted Refers to the value of a flattened message.

本发明文件各处所说的读物的整页面上有某种码字,是指在整页面上的全部页面上都有这种码字或在整页面上的部分页面上(甚至只有一个页面上)有这种码字。Some kind of code word is arranged on the whole page of reading material mentioned in document of the present invention everywhere, refers to that all pages on the whole page have this code word or on the partial page on the whole page (even only on one page) There is such a code word.

本发明文件各处提及的一个码字的值是共与数个表示如何用音码拼合成语句发音的拼合信息的值有关的(如在‘与’前没加‘共’来修饰的话,就不是本段所说情况),是指此码字的值只与这数个表示如何用音码拼合成语句发音的拼合信息的值有关,而不与其它的表示如何用音码拼合成语句发音的拼合信息的值有关的。本发明文件各处提及的一个码字的值是共与数个拼合信息的值有关的(或提及的一个码字的值是共与数个表示语句发音的拼合信息的值有关的),显然也按本段所述理解。The value of a code word mentioned in the document of the present invention everywhere is related to the value of the combination information of how to splice into a sentence pronunciation with several representations altogether (if not adding ' totally ' before ' and ' to modify, Just not the said situation of this paragraph), it means that the value of this code word is only related to the value of the information of how to piece together the sentence pronunciation with these several representations, but not with other representations how to piece together a sentence with the sound code The pronunciation is related to the value of the flattening information. The value of a code word mentioned everywhere in the document of the present invention is related to the value of several mosaic information (or the value of a code word mentioned is related to the value of several mosaic information representing sentence pronunciation) , obviously also understood as described in this paragraph.

在权利要求1,10,19,22中提及的‘前述诸个码字中的每个码字的码值是共与数个表示如何用音码拼合成语句发音的拼合信息的值有关的’,其意思是指诸个码字中的各个码字的码值各自是共与数个拼合信息的值有关的;可举个例:如诸个码字中含有3个码字(码字1,码字2,码字3),那码字1是共与数个拼合信息的值有关的,码字2也是共与数个拼合信息的值有关的,码字3也是共与数个拼合信息的值有关的。而且实际上这诸个码字中的不同码字涉及的全部数个拼合信息的值一般是互不相同的。比如仍是刚才的例:如诸个码字中含有3个码字(码字1,码字2,码字3),那码字1是共与数个拼合信息的值有关的,码字2也是共与数个拼合信息的值有关的,码字3也是共与数个拼合信息的值有关的(这3句里的3个‘数个’不一定是相同数目);而码字1涉及的全部数个拼合信息的值(也即前面说的‘那码字1是共与数个拼合信息的值有关的’一句中提及的数个拼合信息的值)与码字2涉及的全部数个拼合信息的值(也即前面说的‘那码字2是共与数个拼合信息的值有关的’一句中提及的数个拼合信息的值)与码字3涉及的全部数个拼合信息的值(也即前面说的‘那码字3是共与数个拼合信息的值有关的’一句中提及的数个拼合信息的值)互相一般不是相同所指;比如码字1涉及的全部数个拼合信息的值中只含1个拼合信息的值,而码字2涉及的全部数个拼合信息的值中只含3个拼合信息的值,那码字1涉及的全部数个拼合信息的值与码字2涉及的全部数个拼合信息的值就不是相同所指;如码字1涉及的全部数个拼合信息的值中只含2个拼合信息的值且这2个拼合信息的值分别对应语句1和语句2,而码字2涉及的全部数个拼合信息的值中只含2个拼合信息的值且这2个拼合信息的值分别对应语句3和语句4,而语句1,语句2,语句3,语句4是4个不同语句,那码字1涉及的全部数个拼合信息的值与码字2涉及的全部数个拼合信息的值就不是相同所指;如码字1涉及的全部数个拼合信息的值中只含1个拼合信息的值且这1个拼合信息的值对应语句1,而码字2涉及的全部数个拼合信息的值中只含1个拼合信息的值且这1个拼合信息的值也对应语句1,并且假设这里是把表示语句1发音的拼合信息的值分成两部分分别放在码字1和码字2里,这时码字1涉及的全部数个拼合信息的值与码字2涉及的全部数个拼合信息的值是相同所指,不过码字1和码字2的码值不同(这两个码字的值分别是表示语句1发音的拼合信息的值的不同部分);如码字1涉及的全部数个拼合信息的值中只含2个拼合信息的值且这2个拼合信息的值分别对应语句1和语句2,而码字2涉及的全部数个拼合信息的值中只含2个拼合信息的值且这2个拼合信息的值分别对应语句1和语句3,并且假设这里是把表示语句1发音的拼合信息的值分成两部分分别放在码字1和码字2里,而语句1,语句2,语句3是3个不同语句,那码字1涉及的全部数个拼合信息的值与码字2涉及的全部数个拼合信息的值就不是相同所指。本段内容可从本说明书其余内容和常识推出。Inclaim 1, 10, 19, the code value of each code word in the aforementioned code words mentioned inclaim 1, 10, 19, and 22 is related to the value of the combination information of how to use phonetic codes to form a sentence pronunciation together with several expressions ', it means that the code value of each codeword in the codewords is respectively related to the value of several pieced together information; for example: if contain 3 codewords (codewords) in allcodewords 1,codeword 2, codeword 3), thencodeword 1 is related to the value of several combined information,codeword 2 is also related to the value of several combined information, andcodeword 3 is also related to several values The value of the flattening information is related. In fact, the values of all pieces of combined information involved in different codewords among the codewords are generally different from each other. For example, still the example just now: as containing 3 codewords (codeword 1,codeword 2, codeword 3) in all codewords, thatcodeword 1 is relevant with the value of several splicing information altogether,codeword 2 is also related to the value of several combined information, andcode word 3 is also related to the value of several combined information (3 'numbers' in these 3 sentences are not necessarily the same number); andcode word 1 The values of all the combined information involved (that is, the value of the several combined information mentioned in the sentence "thatcodeword 1 is related to the values of several combined information" mentioned earlier) are related to thecode word 2. The values of all the combined information (that is, the values of the several combined information mentioned in the sentence "thatcodeword 2 is related to the values of several combined information" mentioned earlier) are related to all the numbers involved in thecodeword 3. The values of a combination of information (that is, the values of several pieces of combination information mentioned in the sentence "that codeword 3 is related to the value of several pieces of combination information" mentioned above) generally do not refer to each other; for example, codewords The value of all the combined information involved in 1 contains only one value of combined information, and the value of all the combined information involved in codeword 2 only contains 3 values of combined information, then all the values of codeword 1 involved The values of several pieces of combined information and the values of all the pieces of combined information involved in codeword 2 are not the same; for example, the values of all the pieces of combined information involved in codeword 1 only contain 2 values of combined information and these 2 The values of the pieces of combined information correspond to statement 1 and statement 2 respectively, while the values of all the pieces of combined information involved in codeword 2 only contain 2 values of combined information and the values of these 2 combined information correspond to statement 3 and statement 4 respectively , and statement 1, statement 2, statement 3, and statement 4 are four different statements, then the value of all the combined information involved in codeword 1 is not the same as the value of all the combined information involved in codeword 2 ; For example, the values of all the pieces of information involved in codeword 1 only contain the value of one piece of information and the value of this one piece of information corresponds to sentence 1, while the values of all the pieces of information involved in codeword 2 only contain Contains a value of combined information and the value of this combined information also corresponds to sentence 1, and assumes that the value of the combined information representing the pronunciation of sentence 1 is divided into two parts and placed in codeword 1 and codeword 2 respectively. The values of all the pieces of combined information involved incodeword 1 are the same as the values of all the pieces of combined information involved incodeword 2, but the code values ofcodeword 1 andcodeword 2 are different (the values of these two codewords Values are different parts of the value of the mosaic information representing the pronunciation of sentence 1); as in the value of all the mosaic information involved incodeword 1, only the value of 2 mosaic information is contained and the value of these 2 mosaic information corresponds to the sentence respectively 1 andstatement 2, and all the combined information values involved incodeword 2 only contain 2 values of combined information and the values of these 2 combined information correspond tostatement 1 andstatement 3 respectively, and it is assumed that theexpression statement 1 The value of the combined information of the pronunciation is divided into two parts and placed incodeword 1 andcodeword 2, andstatement 1,statement 2, andstatement 3 are three different statements, so the value of all the combined information involved incodeword 1 The values of all pieces of combined information related tocodeword 2 are not the same. This paragraph can be inferred from the rest of this manual and common sense.

本发明文件各处所说的‘拼合信息的值对应的语句’或‘拼合信息的值对应语句’或类似表达中的‘拼合信息的值’与‘语句’的关系是:此拼合信息的值是表示此语句发音的拼合信息的值,所以此拼合信息的值与此语句是对应的。The relationship between "the value of the combined information" and the "statement" in the "sentence corresponding to the value of the combined information" or "the statement corresponding to the value of the combined information" or similar expressions in the documents of the present invention is: the value of the combined information is Indicates the value of the mosaic information of the pronunciation of this sentence, so the value of this mosaic information corresponds to this sentence.

本发明文件各处所说的‘录音数据’是指语音录音数据。本发明文件所说的一个单词的录音数据,此录音数据是指可以产生此单词发音的那些数据的集合,比如是指一个单词发音的ADPCM格式的录音数据文件里的所有数据;此是常识。对于其它类似情况比如一个元音的录音数据等,也是类似这样的。The 'recorded data' mentioned throughout the document of the present invention refers to voice recording data. The said recording data of a word in the document of the present invention, this recording data refers to the collection of those data that can produce this word pronunciation, for example refers to all data in the recording data file of the ADPCM format of a word pronunciation; This is common sense. The same is true for other similar situations such as recording data of a vowel.

本说明书在述及在存储器里放置的某语种的诸音码的录音数据时,是针对某解码规则中有关此语种的解码细则解码发音时要用到的所有音码中的每个音码都做数个录音数据(比如ADPCM格式的;一个音码要做多个录音数据的情况比如是因为此音码要录轻读的音也要录重读的音),再把这些录音数据集合起来而成此语种的诸音码的录音数据的,本说明书各处都是以这种方式为例来阐述的;另在实际中也可以做一个或多个大的录音数据文件,此解码规则中有关此语种的解码细则解码发音时要用到的所有音码中各音码的数个录音数据都可以从这一个或多个大的录音数据文件中取得,此语种的诸音码的录音数据就是指这一个大的录音数据文件或是指这多个大的录音数据文件的集合,这后一种方式是本段内前面说的把要用到的各个音码的数个录音数据集合起来成此语种的诸音码的录音数据的方式的等同替换。When this instruction manual is referring to the recording data of the various phonetic codes of a certain language placed in the memory, it is aimed at the decoding rules about this language in a certain decoding rule. Do several recording data (such as in ADPCM format; the situation of multiple recording data for one sound code is, for example, because this sound code needs to record the sound of light reading and also record the sound of stress), and then gather these recording data. The recording data of the various phonetic codes of this language is described everywhere in this manual as an example; in addition, one or more large recording data files can also be made in practice. Several recording data of each phonetic sign indicating number in all phonetic sign indicating numbers that will be used when decoding pronunciation of this language kind can all be obtained from this one or more big recording data files, and the recording data of all sound sign indicating numbers of this language is exactly Refers to this large recording data file or refers to the collection of these large recording data files. This latter method is to gather together several recording data of each sound code to be used in this paragraph. The equivalent replacement of the mode of the recording data of all phonetic codes of this language.

本发明文件各处所说的音码的发音数据,是指音码的录音数据或用其它方式产生此音码语音所用的数据(用其它方式产生此音码语音所用的数据是指比如用语音波形的矢量技术来模拟发此音码的语音所采用的参数)等,但一般是指音码的录音数据。The pronunciation data of the said phonetic sign indicating number in document of the present invention refers to the recording data of phonetic sign indicating number or produces the used data of this phonetic sign indicating number speech in other ways (produces the used data of this phonetic sign indicating number speech in other ways and refers to such as using speech waveform The vector technology that simulates the parameters adopted by the voice of this phonetic sign indicating number) etc., but generally refers to the recording data of the phonetic sign indicating number.

本发明文件所说的从诸音码的发音数据中(比如诸音码的录音数据中)取得的一个音码的发音数据(或一个音码的录音数据),在不同应用时可能是不同的数据,比如一个元音,可在音码的录音数据中取得此元音的重读的录音数据和轻读的录音数据,那在需要用此元音重读的音时,那从诸音码的录音数据中取得的此音码的录音数据就默认是此元音的重读的录音数据,在需要用此元音轻读的音时,那从诸音码的录音数据中取得的此音码的录音数据就默认是此元音的轻读的录音数据。The pronunciation data (or the recording data of a phonetic sign indicating number) that obtains from the pronunciation data of all sound sign indicating numbers (such as in the recording data of all sound sign indicating numbers) that document of the present invention said may be different in different applications Data, such as a vowel, can obtain the stressed recording data of this vowel and the recording data of light reading in the recording data of sound sign indicating number, then when needing the sound of this vowel stress, then from the recording of all phonetic sign indicating numbers The recording data of this phonetic sign indicating number obtained in the data is just the recording data of the accented reading of this vowel by default, when needing the sound of light reading of this vowel, the recording of this phonetic sign indicating number that then obtains from the recording data of all phonetic sign indicating numbers The data is the recording data of the light reading of this vowel by default.

本发明文件所说的一个(或多个或各或这些或那些或一些)音码的发音数据,默认是指从某语种的诸音码的发音数据中取出的一个(或多个或各或这些或那些或一些)音码的发音数据;本发明文件所说的一个(或多个或各或这些或那些或一些)音码的录音数据,默认是指从某语种的诸音码的发音数据中取出的一个(或多个或各或这些或那些或一些)音码的录音数据。The pronunciation data of one (or more or each or these or those or some) phonetic codes said in the document of the present invention refers by default to one (or more or each or some) that takes out from the pronunciation data of all phonetic codes of a certain language The pronunciation data of these or those or some) phonetic codes; The recording data of one (or more or each or these or those or some) phonetic codes said in the document of the present invention refers to the pronunciation of all phonetic codes from a certain language by default The recording data of one (or more or each or these or those or some) sound codes taken out in the data.

对于某语种,本发明文件所说的此语种的诸音码的录音数据,可有多种方案,比如英语的如本说明书第16页述有3种方案,汉语的如本说明书第46页至48页述有4种方案。For a certain language, the recording data of all phonetic codes of this language said in the document of the present invention can have multiple schemes, such as English has 3 kinds of schemes as the 16th page of this description, Chinese as the 46th page of this specification to There are 4 options described on page 48.

本发明文件各处所说的某语种的诸音码的发音数据,实际是指按某解码规则中有关此语种的解码细则在解码时用到的诸音码的发音数据(此解码规则中有关此语种的解码细则总是有定义此语种的解码语句集的,而不论其用什么方式定义,甚至可以是隐含定义)。The pronunciation data of all phonetic codes of a certain language said in the document of the present invention everywhere actually refer to the pronunciation data of all phonetic codes that are used when decoding by the decoding rules about this language in a certain decoding rule (relevant to this in this decoding rule The decoding rules of a language always have a set of decoding sentences that define this language, no matter how it is defined, it can even be implicitly defined).

本发明文件各处所说的诸音码的发音数据以及诸音码的录音数据,是针对某语种的,并且可能是按本说明书第16页述的3种方案中的一种(或选定本说明书第46页至第48页述的4种方案中的一种)等方案来生成的。The pronunciation data of all phonetic codes and the recording data of all phonetic codes mentioned in the documents of the present invention are aimed at a certain language, and may be by one of the 3 schemes stated on the 16th page of this description (or select this One of the four schemes described on pages 46 to 48 of the manual) and other schemes.

如提及时无特别指明有他义,本发明文件各处所提及的存储器,默认是指在拾音器(非插卡方式拾音器或插数据卡方式拾音器或插部件式数据卡方式拾音器)使用时,能被作为此拾音器的主处理芯片的单片机芯片读取使用的存储器(对于插数据卡方式拾音器或插部件式数据卡方式拾音器,那单片机芯片能读取使用的存储器包括数据卡或部件式数据卡上的存储器)。There are other meanings if there is no special indication when mentioning, the memory mentioned in the documents of the present invention everywhere refers to when the pickup (non-insert card mode pickup or data card mode pickup or plug-in data card mode pickup) is used by default, The memory that can be read and used by the single-chip microcomputer chip as the main processing chip of this pickup (for the pickup of the plug-in data card mode or the pickup of the plug-in data card mode, the memory that the single-chip microcomputer chip can read and use includes a data card or a component data card on memory).

本发明文件各处提及的把一个拼合信息的值分散(或分开)放在多个码字里,一般是指把这个拼合信息的值的BIT值分开成多个部分并分别放在多个码字里,这是显而易见的。Dispersing (or separating) the value of a mosaic information and placing it in multiple codewords mentioned throughout the document of the present invention generally refers to separating the BIT value of the value of this mosaic information into multiple parts and placing them in multiple codewords respectively. In code words, this is obvious.

本发明文件各处提及的‘诸个码字中的全部码字或部分码字’,是以单个码字为最小单位的,比如诸个码字中含3个码字(码1,码2,码3),那诸个码字中的全部码字就是这3个码字,诸个码字中的部分码字的可能情况是(码1)或(码2)或(码3)或(码1,码2)或(码1,码3)或(码2,码3);对于本发明文件各处提及的‘部诸码字中的全部码字或部分码字’或其它类似说法,也是按此类推理解,比如部诸码字中含3个码字(码1,码2,码3),那部诸码字中的全部码字就是这3个码字,部诸码字中的部分码字的可能情况是(码1)或(码2)或(码3)或(码1,码2)或(码1,码3)或(码2,码3)。The 'all codewords or part codewords in the codewords' mentioned in the documents of the present invention take a single codeword as the smallest unit, for example, contain 3 codewords (code 1, codeword 1) in all codewords. 2, code 3), all code words in that all code words are exactly these 3 code words, the possible situation of part code word in all code words is (code 1) or (code 2) or (code 3) Or (code 1, code 2) or (code 1, code 3) or (code 2, code 3); For the whole code word or part code word in ' department all code words' mentioned in the document of the present invention everywhere or Other similar sayings are also understood by analogy, such as containing 3 code words (code 1,code 2, code 3) in all code words of the part, and all code words in the all code words of that part are exactly these 3 code words, the part The possible situation of part code word in all code words is (code 1) or (code 2) or (code 3) or (code 1, code 2) or (code 1, code 3) or (code 2, code 3) .

本发明文件各处提及的‘数个拼合信息的值的部分’,是指在这数个拼合信息的值中每个拼合信息的值里取一部分BIT,再把取到的这些部分BIT全合起来就得到数个拼合信息的值的部分;数个拼合信息的值的部分比如是指一个拼合信息的值的部分和另一拼合信息的值的部分和第3个拼合信息的值的部分,这里的数个就是指3个。(本段说的‘一个’,‘另一’,‘第3个’都是修饰其后面的‘拼合信息的值’的)The 'parts of the value of several combined information' mentioned in the document of the present invention refers to taking a part of the BIT in the value of each combined information among the values of the several combined information, and then taking all the BITs of these parts taken. Combined to get the value part of several pieced information; the value part of several pieced information refers to the value part of one pieced information, the value part of another pieced information and the value part of the third pieced information , the number here refers to 3. (The 'one', 'another', and 'third' mentioned in this paragraph all modify the 'value of combined information' behind it)

对于插数据卡方式拾音器主体,把数个数据卡插入此插数据卡方式拾音器主体而形成拾音器后,显然此拾音器主体上的部件(读码头,单片机芯片,发音器件等)也就成了此拾音器的部件,如果此拾音器主体上有存储器,那此拾音器主体上的所有存储器和这数个数据卡上的所有存储器就组成了与此拾音器有关的存储器集,且此拾音器主体上的存储器(如果此拾音器主体上的存储器存储有程序或数据的话)里和这数个数据卡上的各存储器里存储的程序就都属于此拾音器涉及的程序且存储的数据就都属于此拾音器涉及的数据(这句话里提及的程序以及数据都是在拾音器读码使用时可被MCU使用的),如果此拾音器主体上没有存储器,且这数个数据卡上的所有存储器就成了与此拾音器有关的存储器集,那这数个数据卡上的各存储器里存储的程序就都属于此拾音器涉及的程序且存储的数据就都属于此拾音器涉及的数据(这句话里提及的程序以及数据都是在拾音器读码使用时可被MCU使用的),对于插部件式数据卡方式拾音器主体,把数个部件式数据卡插入此插部件式数据卡方式拾音器主体而形成拾音器后,显然此拾音器主体上的部件和此数个部件式数据卡上的部件也就成了此拾音器的部件(本句说的部件是指读码头,单片机芯片或发音器件等),如果此拾音器主体上有存储器,那此拾音器主体上的所有存储器和这数个部件式数据卡上的所有存储器就组成了与此拾音器有关的存储器集,且此拾音器主体上的存储器(如果此拾音器主体上的存储器存储有程序或数据的话)里和这数个部件式数据卡上的各存储器里存储的程序就都属于此拾音器涉及的程序且存储的数据就都属于此拾音器涉及的数据(这句话里提及的程序以及数据都是在拾音器读码使用时可被MCU使用的),如果此拾音器主体上没有存储器,那这数个部件式数据卡上的所有存储器就成了与此拾音器有关的存储器集,且这数个部件式数据卡上的各存储器里存储的程序就都属于此拾音器涉及的程序且存储的数据就都属于此拾音器涉及的数据(这句话里提及的程序以及数据都是在拾音器读码使用时可被MCU使用的)。本段所述是显然的。For the main body of the pickup in the way of inserting data cards, after inserting several data cards into the main body of the pickup in the way of inserting data cards to form a pickup, obviously the components on the main body of the pickup (reading port, single-chip microcomputer chip, pronunciation device, etc.) also become the pickup. parts, if there is memory on the main body of the pickup, then all the memory on the main body of the pickup and all the memory on the several data cards constitute a memory set relevant to the pickup, and the memory on the main body of the pickup (if this If the memory on the main body of the pickup has programs or data stored in it) and the programs stored in the memories on the several data cards all belong to the program related to the pickup and the stored data all belong to the data related to the pickup (this sentence The programs and data mentioned in the words can be used by the MCU when the pickup is used for code reading), if there is no memory on the main body of the pickup, and all the memories on the several data cards become the memory related to the pickup set, the programs stored in each memory on these several data cards all belong to the programs involved in this pickup and the stored data all belong to the data involved in this pickup (the programs and data mentioned in this sentence are all in It can be used by the MCU when the pickup is used for code reading). For the pickup body of the plug-in data card method, after inserting several component data cards into the pickup body of the plug-in data card method to form a pickup, it is obvious that the pickup on the pickup body Parts and the parts on these several component data cards have just become the parts of the pickup (the parts said in this sentence refer to the reading dock, single-chip microcomputer chip or pronunciation device, etc.), if there is a memory on the pickup main body, then the pickup All the memories on the main body and all the memories on these several component data cards have just formed the memory set relevant to this pickup, and the memory on the pickup body (if the memory on the pickup body is stored with programs or data) The programs stored in each memory on the several component data cards belong to the program related to the pickup, and the stored data all belong to the data related to the pickup (the programs and data mentioned in this sentence are all Can be used by MCU when the pick-up is used for code reading), if there is no memory on the pick-up body, then all the memory on the several component data cards will become the memory sets relevant to this pickup, and these several component The programs stored in each memory on the data card belong to the program involved in the pickup, and the stored data belong to the data involved in the pickup (the programs and data mentioned in this sentence are available when the pickup is used for code reading. used by the MCU). What is stated in this paragraph is obvious.

显然,本发明文件各处提及的与某拾音器主体配合使用的数据卡或部件式数据卡上的存储器里总是存储有有用信息的(否则不称为与此拾音器主体配合使用的数据卡或部件式数据卡)。本段说的配合使用是指把卡插入拾音器主体来使用。Obviously, useful information is always stored in the memory on the data card used in conjunction with a certain pickup main body mentioned in the document of the present invention or the memory on the component type data card (otherwise it is not called a data card or a data card used in conjunction with this pickup main body). Component Data Card). The cooperative use mentioned in this paragraph refers to inserting the card into the main body of the pickup for use.

本发明文件各处所提及的‘部分所述拾音器涉及的程序’中的‘部分’是用于修饰‘所述拾音器涉及的程序’的,‘部分所述拾音器涉及的程序’是指拾音器涉及使用的所有程序的一部分;本发明文件各处所提及的‘部分所述拾音器涉及的数据’中的‘部分’是用于修饰‘所述拾音器涉及的数据’的,‘部分所述拾音器涉及的数据’是指拾音器涉及使用的所有数据的一部分。The "part" in the "part of the program related to the pickup" mentioned throughout the document of the present invention is used to modify the "program related to the pickup", and the "part of the program related to the pickup" means that the pickup is related to Part of all programs used; 'part' of 'part of said pickup-related data' mentioned throughout this document is used to modify 'said pickup-related data', 'part of said pickup-related 'Data' refers to all data part of the pickup involved in use.

本发明文件各处提及的拾音器涉及的数据,包括解码有关数据(解码有关数据可能是只适用于一个语种的解码程序用的解码数据,或是多套适用于各自语种的解码程序各自用的解码数据的总和,或是适用于多个语种的一套解码程序用的解码数据等情况),如拾音器的MCU需用到音码的发音数据的话(如拾音器是采用直接产音方式的或拾音器是既采用直接产音方式又采用TTS产音方式的,那拾音器的MCU一般需用到音码的发音数据),那诸音码的发音数据(可能有多个不同语种的诸音码的发音数据)也应包括在拾音器涉及的数据里,如拾音器有需用其它数据,那这些其它数据应包括在拾音器涉及的数据里。本段所说的一个语种或各自语种可以是任一非复合语种或任一复合语种;本段所说的多个语种可以全为非复合语种或全为复合语种或既包括非复合语种也包括复合语种。The data related to the sound pick-up that the document of the present invention mentions everywhere includes decoding relevant data (decoding relevant data may be only applicable to the decoding data that the decoding program of a language uses, or a plurality of sets of decoding programs that are applicable to each language are used separately The sum of the decoded data, or the decoded data for a set of decoding programs applicable to multiple languages), if the MCU of the pickup needs to use the pronunciation data of the sound code (such as the pickup adopts the direct sound production method or the pickup It adopts both the direct sound production method and the TTS sound production method, the MCU of the pickup generally needs to use the pronunciation data of the sound code), the pronunciation data of the sound codes (there may be a plurality of pronunciations of the sound codes in different languages Data) should also be included in the data related to the pickup, if the pickup needs to use other data, then these other data should be included in the data related to the pickup. One or each of the languages mentioned in this paragraph may be any non-composite language or any compound language; the multiple languages mentioned in this paragraph may be all non-composite languages or all compound languages or include both non-compound languages and compound language.

在权利要求书中各处提及的‘包括前述一个码字在内的所述部诸码字中的多个码字’,是指包括前述一个码字在内的多个码字,而且这多个码字都是所述部诸码字中的码字。'A plurality of codewords in the said codewords including the aforementioned one codeword' mentioned everywhere in the claims refers to a plurality of codewords including the aforementioned one codeword, and this A plurality of codewords are codewords in said section of codewords.

显然,在任一权利要求项中提及的‘前述读物’是指本权利要求项内前面提及的‘读物’,而不是指本权利要求项内前面提及的‘语音编码有声读物’。Obviously, the 'previous reading' mentioned in any claim refers to the 'reading' mentioned in this claim, not the 'speech-encoded audiobook' mentioned in this claim.

本说明书前面所提及的数据卡(或部件式数据卡)与拾音器主体配套或与有声读物配套,一般是指配套出售,而且是在拾音器主体与有声读物是分开出售的前提下说的;实际上拾音器主体与有声读物也可一起出售,所以各数据卡(或部件式数据卡)可以与拾音器主体和有声读物配套或不配套。The data card (or component-type data card) mentioned earlier in this manual is matched with the main body of the pickup or matched with the audiobook, which generally means that it is sold as a set, and it is said on the premise that the main body of the pickup and the audiobook are sold separately; actually The upper pickup body is also sold with the audiobook, so each data card (or component data card) may or may not be paired with the pickup body and audiobook.

如提及时无特别指明有他义,本发明文件各处提及的非复合语种默认是以官方用语为例的(如提及的非复合语种有其官方用语的话),比如提及的汉语语种就默认是以汉语普通话为例的(除非提及时有特别指明有他义);而如提及的语种是复合语种的话,除非特别指明有他义,那默认复合语种包含的多个单独语种都是以官方用语为例的(如包含的单独语种有其官方用语的话),比如一复合语种包含英语和汉语,那其中的英语是以官方英语为例的,其中的汉语是以汉语普通话为例的(除非提及时有特别指明有他义)。If there is no special indication of other meanings when mentioned, the non-compound languages mentioned in the document of the present invention default to the official language as an example (if the non-compound language mentioned has its official language), such as the Chinese language mentioned By default, Mandarin Chinese is used as an example (unless there are other meanings specified when mentioning it); and if the mentioned language is a compound language, unless it is specifically specified that there are other meanings, then the multiple individual languages included in the compound language by default are all Take the official language as an example (if the individual language included has its own official language), for example, a compound language includes English and Chinese, then the English in it is official English as an example, and the Chinese in it is Mandarin Chinese as an example (unless otherwise specified when mentioned).

一般情况,只适用于一个语种的一套解码程序及解码数据是根据只适用于一个语种的解码规则或适用于包括此语种在内的多个语种的解码规则中有关此语种的解码细则来生成的,适用于多个(此‘多个’与上一句的‘多个’不一定是相同数目)语种的一套解码程序及解码数据是根据适用于多个语种的解码规则来生成的,适用于某语种的一套解码程序及解码数据是根据只适用于此语种或适用于包括此语种在内的多个语种的解码规则来生成的;因解码规则中有关各自语种的解码细则一般涉及有用发音形式(指全用直接发音方式或全用TTS发音方式或用混合发音方式等)播放语句发音,所以解码程序及解码数据的具体内容一般也是涉及发音的。本段说的一个语种或某语种或多个语种中的任一语种,可以为非印欧语系任一语种或印欧语系任一语种或任一复合语种。In general, a set of decoding programs and decoding data applicable only to one language is generated according to the decoding rules applicable to only one language or the decoding rules applicable to multiple languages including this language. A set of decoding programs and decoding data applicable to multiple languages (the "multiple" and the "multiple" in the previous sentence are not necessarily the same number) are generated according to the decoding rules applicable to multiple languages, applicable A set of decoding programs and decoding data in a certain language is generated according to decoding rules that are only applicable to this language or applicable to multiple languages including this language; because the decoding rules of the respective languages in the decoding rules generally involve useful Pronunciation form (referring to fully using direct pronunciation mode or fully using TTS pronunciation mode or mixed pronunciation mode etc.) plays sentence pronunciation, so the specific content of decoding program and decoding data generally also relates to pronunciation. A language or a certain language or any of the multiple languages mentioned in this paragraph may be any non-Indo-European language or any Indo-European language or any compound language.

本发明文件也把一套解码程序及解码数据中的解码程序称为一套解码程序。The document of the present invention also refers to a set of decoding programs and the decoding programs in the decoded data as a set of decoding programs.

如提及时无特别指明有他义,本发明文件各处提及的前面无数量词的‘解码程序’(此‘解码程序’后不跟‘及解码数据’的),默认与‘一套解码程序’等价;如提及时无特别指明有他义,本发明文件各处提及的前面无数量词的‘解码程序及解码数据’的,默认与‘一套解码程序及解码数据’等价。If there is no special indication of other meanings when mentioned, the "decoding program" without quantifiers mentioned in the documents of the present invention (this "decoding program" is not followed by "and decoding data"), defaults to "a set of decoding programs" 'Equivalent; if there is no other meaning when it is mentioned, the "decoding program and decoding data" mentioned in the document of the present invention without quantifiers in front of it is equivalent to "a set of decoding program and decoding data" by default.

本说明书各处提及的解码程序是指只适用于一个语种或适用于多个语种的一套解码程序。解码程序可以是只适用于一个语种或适用于多个语种的,对其中一个语种(如是适用于多个语种时,这里的‘其中一个语种’就是指这多个语种中的一个,如是只适用于一个语种时,这里的‘其中一个语种’就是指这一个语种),解码程序是按适用于此语种的解码规则中有关此语种的解码细则来进行解码的;本段提到的一个语种,可以为非印欧语系任一语种或印欧语系任一语种或任一复合语种;本段提到的多个语种,可以全为非复合语种或全为复合语种或既有复合语种也有非复合语种。The decoding programs mentioned throughout this manual refer to a set of decoding programs that are only applicable to one language or applicable to multiple languages. The decoding program can be only applicable to one language or applicable to multiple languages. For one of the languages (if it is applicable to multiple languages, "one of the languages" here refers to one of the multiple languages, if it is only applicable For a language, "one of the languages" here refers to this language), the decoding program is to decode according to the decoding rules for this language in the decoding rules applicable to this language; a language mentioned in this paragraph, It can be any language of the non-Indo-European language family or any language of the Indo-European language family or any compound language; the multiple languages mentioned in this paragraph can be all non-compound languages or all compound languages or both compound languages and non-compound languages language.

本说明书中关于‘可’的用法:本说明书中各处提及到某事物可这样,那不排除此事物还可那样,本说明书中各处提及到某事物可这样或可那样(即类似的有多个选择项),那不排除此事物还可有更多的选择项;比如说解码涉及的硬件发音的方式可以是直接产音方式或TTS产音方式,不排除还可为其它产音方式。本段说的‘可’与‘可以’等价。本段所述是常识。The use of "may" in this specification: it is mentioned that something can be like this everywhere in this specification, that does not exclude that this thing can also be that way, and it is mentioned everywhere in this specification that something can be like this or that (that is, similar There are multiple options), it does not rule out that there are more options for this matter; for example, the hardware pronunciation method involved in decoding can be the direct sound production method or the TTS sound production method, and it is not ruled out that it can also be used for other products. sound mode. The 'may' and 'may' mentioned in this paragraph are equivalent. This paragraph is common sense.

本发明文件各处提及的拾音器解码播出语句发音(或各处类似提及的比如‘解码播出句子发音’等),其中有提及解码,在解码过程中有涉及到播放语句发音,但只是由拾音器内的程序发发音指示(比如调用发音程序)或传预备发音数据给MCU外接的TTS芯片等,其中提及的‘播出语句发音’是指拾音器用发音器件真正发出语音来(本说明书中有时也直接说成发音器件播出语句发音)。所以本发明文件所说的‘播放语句发音’与‘播出语句发音’有点不同。The pickup decoding and broadcasting sentence pronunciation mentioned in the documents of the present invention (or similarly mentioned everywhere such as "decoding and broadcasting sentence pronunciation" etc.), wherein there is a mention of decoding, and in the decoding process, the playback sentence pronunciation is involved. But only the program in the pickup sends the pronunciation instruction (such as calling the pronunciation program) or transmits the prepared pronunciation data to the external TTS chip of the MCU. In this instruction manual, sometimes it is directly said that the pronunciation device broadcasts the sentence pronunciation). So the said 'play sentence pronunciation' of the document of the present invention is somewhat different from 'broadcast sentence pronunciation'.

本发明文件各处提及的拾音器解码播出语句发音是指由拾音器播出一个语句发音,如一个码字被一拾音器读取后播出多个语句发音,那此播出多个语句发音包含播出其中一个语句发音的情况,所以此码字仍可被当成是此拾音器能读取以用于此拾音器播出语句发音的部诸码字中的一个码字。The pickup decoding and broadcasting sentence pronunciation mentioned in the document of the present invention refers to broadcasting a sentence pronunciation by the pickup, as a code word is read by a pickup and broadcasts a plurality of sentence pronunciations, then the broadcasting of a plurality of sentence pronunciations includes Broadcast the situation of wherein a sentence pronunciation, so this code word can still be regarded as this pick-up can read a code word in the part all code words for this pick-up to broadcast sentence pronunciation.

本发明文件各处提及的‘播放语句发音’是指MCU在解码过程中播放所解码语句的发音(本发明文件说的播放语句发音是指拾音器内的程序发发音指示或传预备发音数据给MCU外接的发音器件等,而不是真正用发音器件发音;这里说的发发音指示是指比如调用发音程序),这个提法(指‘播放语句发音’)中的‘语句’是指一个语句。The 'playing sentence pronunciation' mentioned in the document of the present invention refers to the pronunciation of the MCU playing the decoded sentence in the decoding process (the playing sentence pronunciation that the document of the present invention says refers to that the program in the pickup sends the pronunciation instruction or transmits the prepared pronunciation data to The MCU is connected to the pronunciation device, etc., instead of actually using the pronunciation device to pronounce; the pronunciation instruction mentioned here refers to calling the pronunciation program, for example), and the "sentence" in this formulation (referring to "playing sentence pronunciation") refers to a sentence.

本发明文件各处提及的语音,是泛指,并不是特指语句的发音,其既可能是语句的发音,也可能是音码的发音等,这需要根据其所处文章位置前后的语义来确定。The voice mentioned in the document of the present invention refers generally, not specifically to the pronunciation of the sentence, which may be the pronunciation of the sentence, or the pronunciation of the phonetic code, etc., which needs to be based on the semantics before and after the position of the article to make sure.

本发明文件中各处提及的‘按语句发音顺序排组起来的表示英语语句发音的各音标的编号信息’中的两个‘语句’词是指同一语句;本发明文件中各处提及的‘按语句发音顺序排组起来的表示汉语语句发音的各拼音的代号信息’中的两个‘语句’词是指同一语句。The two "sentences" words in the 'number information of each phonetic symbol representing the pronunciation of English sentences' mentioned in the document of the present invention refer to the same sentence; mentioned in the document of the present invention The two words "sentence" in the "code information of each pinyin representing the pronunciation of Chinese sentences arranged in the order of pronunciation of sentences" refer to the same sentence.

本说明书中有提及的‘可以是其有关印欧语系任一语种的编码细则有用1类音码或2类音码或其它形式音码且含有压缩编码处理且是直播方式或TTS方式或混合方式的的只适用于此语种或适用于包括此语种在内的多个语种的各种编码规则’,其中的‘含有压缩编码处理’是指其中前面提及的‘编码细则’含有压缩编码处理,其中的‘是直播方式或TTS方式或混合方式的’是指其中前面提及的‘编码细则’是直播方式或TTS方式或混合方式的,其中前面的‘其’与其中后面的‘编码规则’是相同所指,其中的‘只适用于此语种或适用于包括此语种在内的多个语种’是修饰其中的‘编码规则’的;本说明书中有提及的‘可以是其有关非印欧语系任一语种的编码细则有用3类音码或4类音码或其它形式音码且含有压缩编码处理且是直播方式或TTS方式或混合方式的的只适用于此语种或适用于包括此语种在内的多个语种的各种编码规则’也是与此类似理解。本说明书中有提及的‘可以是其有关印欧语系任一语种的解码细则有用1类音码或2类音码或其它形式音码且含有压缩解码处理且是直播方式或TTS方式或混合方式的的只适用于此语种或适用于包括此语种在内的多个语种的各种解码规则’,其中的‘含有压缩解码处理’是指其中前面提及的‘解码细则’含有压缩解码处理,其中的‘是直播方式或TTS方式或混合方式的’是指其中前面提及的‘解码细则’是直播方式或TTS方式或混合方式的,其中前面的‘其’与其中后面的‘解码规则’是相同所指,其中的‘只适用于此语种或适用于包括此语种在内的多个语种’是修饰其中的‘解码规则’的;本说明书中有提及的‘可以是其有关非印欧语系任一语种的解码细则有用3类音码或4类音码或其它形式音码且含有压缩解码处理且是直播方式或TTS方式或混合方式的的只适用于此语种或适用于包括此语种在内的多个语种的各种解码规则’也是与此类似理解。What is mentioned in this manual can be that the coding rules for any language of the Indo-European languagefamily use Class 1 orClass 2 phonetic codes or other forms of phonetic codes and contain compression coding processing and are live broadcast or TTS or mixed The coding rules that are only applicable to this language or applicable to multiple languages including this language', the "contains compression coding processing" means that the aforementioned "coding rules" contain compression coding processing , where 'is live or TTS or mixed' means that the aforementioned 'coding rules' are live or TTS or mixed, and the preceding 'its' is the same as the following 'encoding rules' ' refers to the same, and the 'only applicable to this language or applicable to multiple languages including this language' is to modify the 'coding rules'; the 'mentioned in this manual can refer to its relevant non- The coding rules of any language of the Indo-European language family have 3 types of phonetic codes or 4 types of phonetic codes or other forms of phonetic codes and contain compression coding processing and are live broadcast or TTS or mixed. Only applicable to this language or applicable to The various encoding rules' of multiple languages including this language are also understood similarly to this. What is mentioned in this manual can be that the decoding rules for any language of the Indo-European languagefamily use Class 1 orClass 2 phonetic codes or other forms of phonetic codes and contain compression decoding processing and are live broadcast or TTS or mixed The method is only applicable to this language or various decoding rules applicable to multiple languages including this language', where 'contains compression decoding processing' means that the aforementioned 'decoding rules' contain compression decoding processing , where 'is live or TTS or mixed' means that the aforementioned 'decoding rules' are live or TTS or mixed, and the preceding 'its' is the same as the following 'decoding rules' ' means the same thing, and the 'only applicable to this language or applicable to multiple languages including this language' is to modify the 'decoding rules'; The decoding rules of any language of the Indo-Europeanlanguage family use 3 types of phonetic codes or 4 types of phonetic codes or other forms of phonetic codes and contain compression decoding processing and are live broadcast or TTS or mixed. It is only applicable to this language or applicable to including The various decoding rules' of multiple languages including this language are also understood similarly to this.

本发明文件各处在述及印欧语系编解码时提及的‘有用1类音码或2类音码或其它形式音码’,是指只用1类音码中的一种或多种音码,或者只用2类音码中的一种或多种音码,或者只用其它形式音码中的一种或多种音码,或者用1类音码中的一种或多种音码和2类音码中的一种或多种音码,或者用1类音码中的一种或多种音码和其它形式音码中的一种或多种音码,或者用2类音码中的一种或多种音码和其它形式音码中的一种或多种音码,或者用1类音码中的一种或多种音码和2类音码中的一种或多种音码和和其它形式音码中的一种或多种音码;本发明文件各处在述及非印欧语系编解码时提及的‘有用3类音码或4类音码或其它形式音码’,也是按此类推理解。The documents of the present invention refer to 'useful 1 class phonetic code or 2 class phonetic codes or other form phonetic codes' when referring to the Indo-European codec everywhere, refers to only using one or more of the 1 class phonetic codes Sound code, or only use one or more sound codes in 2 class sound codes, or only use one or more sound codes in other forms of sound codes, or use one or more sound codes in 1 class sound codes One or more sound codes in sound codes and 2 class sound codes, or with one or more sound codes in 1 class sound codes and one or more sound codes in other forms of sound codes, or with 2 One or more sound codes in class sound codes and one or more sound codes in other forms of sound codes, or one or more sound codes in 1 class sound codes and one in 2 class sound codes One or more phonetic codes and one or more phonetic codes in other forms of phonetic codes; the 'useful 3 types of phonetic codes or 4 types of phonetic codes mentioned in the documents of the present invention when referring to non-Indo-European language codecs Code or other form phonetic sign indicating number ', also be understood by analogy.

本说明书中各处有提及的‘可以是与本说明书提及或没提及的适用于印欧语系任一语种(英语或法语或德语或俄语等)的其有关此语种的编码细则含有压缩编码处理的任一编码规则(此编码规则可能是只适用于此语种或适用于包括此语种在内的多个语种的)有关此语种对应的本说明书提及或没提及的适用于此语种的其有关此语种的解码细则含有压缩解码处理且是直播方式的的各种解码规则’,其中‘本说明书提及或没提及的适用于印欧语系任一语种’是修饰‘编码规则’的,其中‘其有关此语种的编码细则含有压缩编码处理’也是修饰‘编码规则’的,其中‘本说明书提及或没提及的适用于此语种’是修饰‘解码规则’的,其中‘其有关此语种的解码细则含有压缩解码处理且是直播方式的’也是修饰‘解码规则’的,其中第1个‘其’是指其中后面的‘编码规则’,第2个‘其’是指其中后面的‘解码规则’;本说明书中有与本段最前面引用的那段话类似的说法,比如‘可以是与本说明书提及或没提及的适用于非印欧语系任一语种(汉语或韩语或日语等)的其有关此语种的编码细则含有压缩编码处理的任一编码规则(此编码规则可能是只适用于此语种或适用于包括此语种在内的多个语种的)有关此语种对应的本说明书提及或没提及的适用于此语种的其有关此语种的解码细则含有压缩解码处理且是直播方式的的各种解码规则’,‘可以是与本说明书提及或没提及的适用于印欧语系任一语种(英语或法语或德语或俄语等)的其有关此语种的编码细则含有压缩编码处理的任一编码规则(此编码规则可能是只适用于此语种或适用于包括此语种在内的多个语种的)有关此语种对应的本说明书提及或没提及的适用于此语种的其有关此语种的解码细则含有压缩解码处理且是TTS方式的的各种解码规则’,‘可以是与本说明书提及或没提及的适用于非印欧语系任一语种(汉语或韩语或日语等)的其有关此语种的编码细则含有压缩编码处理的任一编码规则(此编码规则可能是只适用于此语种或适用于包括此语种在内的多个语种的)有关此语种对应的本说明书提及或没提及的适用于此语种的其有关此语种的解码细则含有压缩解码处理且是TTS方式的的各种解码规则’,以及其它类似说法也都是按此类推理解。The '' mentioned everywhere in this manual can be related to any language in the Indo-European language family (English, French, German, Russian, etc.) that is mentioned or not mentioned in this manual. Any encoding rule for encoding processing (this encoding rule may be applicable only to this language or applicable to multiple languages including this language) is applicable to this language that is mentioned or not mentioned in this manual corresponding to this language The detailed decoding rules for this language include various decoding rules for compression decoding and live broadcasting', in which 'any language mentioned or not mentioned in this manual is applicable to any language of the Indo-European language family' is a modification of 'coding rules' , where 'the encoding rules for this language include compression encoding processing' is also a modification of 'encoding rules', where 'this specification applies to this language mentioned or not mentioned' is a modification of 'decoding rules', where ' Its decoding rules for this language include compression decoding processing and live broadcasting, and it also modifies the "decoding rules". The first "its" refers to the following "encoding rules", and the second "its" refers to Among them, the following 'decoding rules'; in this specification, there is a statement similar to the paragraph quoted at the beginning of this paragraph, such as 'it can be any language that is not mentioned in this specification and is applicable to non-Indo-European languages ( Chinese or Korean or Japanese, etc.), its coding rules for this language contain any coding rules for compression coding processing (this coding rule may be applicable only to this language or to multiple languages including this language) The decoding rules applicable to this language, which are mentioned or not mentioned in this manual corresponding to this language, contain compression decoding processing and various decoding rules in live mode',' can be mentioned in this manual or The encoding rules applicable to any language of the Indo-European language family (English, French, German, Russian, etc.) that are not mentioned contain any encoding rules for compression encoding processing (this encoding rule may only be applicable to this language) Or applicable to multiple languages including this language) The decoding rules applicable to this language that are mentioned or not mentioned in this manual corresponding to this language include compression decoding processing and are in TTS mode The various decoding rules',' can be any of the coding rules applicable to any non-Indo-European language (Chinese, Korean, Japanese, etc.) that are mentioned or not mentioned in this manual and contain compression coding processing. A coding rule (this coding rule may be only applicable to this language or applicable to multiple languages including this language) related to this language, which is mentioned or not mentioned in this manual and which is applicable to this language The decoding rules of the language contain various decoding rules of compression decoding processing and TTS mode', and other similar expressions are also understood by analogy.

本说明书中有提及的‘本说明书提及或没提及的适用于印欧语系任一语种(英语或法语或德语或俄语等)的其有关此语种的编码细则含有压缩编码处理且是TTS方式的的各种编码规则’,其中‘本说明书提及或没提及的适用于印欧语系任一语种(英语或法语或德语或俄语等)’是修饰‘编码规则’的,其中‘其有关此语种的编码细则含有压缩编码处理且是TTS方式的’也是修饰‘编码规则’的,其中的‘其’是指其中后面的‘编码规则’;本说明书中有与本段内最前面引用的那段话类似的说法,比如‘本说明书提及或没提及的适用于非印欧语系任一语种(汉语或韩语或日语等)的其有关此语种的编码细则含有压缩编码处理且是TTS方式的的各种编码规则’,或比如‘本说明书提及或没提及的适用于印欧语系任一语种(英语或法语或德语或俄语等)的其有关此语种的编码细则含有压缩编码处理且是直播方式的的各种编码规则’,或比如‘本说明书提及或没提及的适用于非印欧语系任一语种(汉语或韩语或日语等)的其有关此语种的编码细则含有压缩编码处理且是直播方式的的各种编码规则’,以及其它类似说法也按此类推理解。The coding rules for any language in the Indo-European language family (English, French, German, Russian, etc.) that are mentioned in this manual or not mentioned in this manual contain compression coding processing and are TTS Various coding rules in different ways', where 'applicable to any language of the Indo-European language family (English or French or German or Russian, etc.) mentioned or not mentioned in this manual' is a modification of the 'coding rules', where 'the The coding rules for this language contain compressed coding processing and are in the TTS mode, which also modifies the "coding rules", in which "its" refers to the "coding rules" behind it; For example, the coding rules for any language (Chinese, Korean, Japanese, etc.) that are not mentioned in this specification and are applicable to any language of the non-Indo-European language family (Chinese, Korean, Japanese, etc.) contain compression coding processing and are Various encoding rules in the TTS mode', or such as 'the encoding rules applicable to any language of the Indo-European language family (English or French or German or Russian, etc.) mentioned or not mentioned in this manual contain compression Various encoding rules for encoding processing and live broadcasting', or for example, 'the encoding applicable to any language of the non-Indo-European language family (Chinese or Korean or Japanese, etc.) mentioned or not mentioned in this manual The detailed rules contain various encoding rules for compression encoding processing and live broadcast', and other similar expressions can also be understood by analogy.

本发明文件各处提及的‘有关此语种的编码细则含有压缩编码处理且是直播方式的’是固定用法,此固定用法的含义是:其中‘是直播方式的’是指这里说到的有关此语种的编码细则是直播方式的;与此类似,本发明文件各处提及的‘有关此语种的解码细则含有压缩解码处理且是直播方式的’也是固定用法,本发明文件各处提及的‘有关此语种的编码细则含有压缩编码处理且是TTS方式的’也是固定用法,本发明文件各处提及的‘有关此语种的解码细则含有压缩解码处理且是TTS方式的’也是固定用法,本发明文件各处提及的‘有关此语种的编码细则含有压缩编码处理且是混合方式的’也是固定用法,本发明文件各处提及的‘有关此语种的解码细则含有压缩解码处理且是混合方式的’也是固定用法,这些也都按本段前面所述对‘有关此语种的编码细则含有压缩编码处理且是直播方式的’的理解那样类推理解。The "coding rules about this language contain compression coding and live broadcast" mentioned in the document of the present invention is a fixed usage, and the meaning of this fixed usage is: wherein "is a live broadcast" refers to the related The encoding details of this language are in a live broadcast mode; similarly, the "decoding rules about this language contain compression decoding processing and are live broadcast" mentioned in the documents of the present invention are also fixed usages, and the documents of the present invention are mentioned everywhere. "The encoding rules for this language include compression encoding processing and is in TTS mode" is also a fixed usage, and the "decoding rules about this language include compression decoding processing and is in TTS mode" mentioned throughout the document of the present invention is also a fixed usage. The "decoding rules about this language include compression encoding processing and mixed mode" mentioned in the document of the present invention is also a fixed usage, and the "decoding rules about this language include compression decoding processing and "It is a mixed method" is also a fixed usage, and these are also understood in the same way as the understanding of "the coding rules for this language include compression coding processing and live broadcast" as mentioned earlier in this paragraph.

本发明文件各处叙述分步TTS发音方式时有提及的‘由MCU把组成语句的全部预备发音数据的各个部分预备发音数据依次分次传给TTS器件发音以形成语句的全部发音’或类似说法(比如说的‘由MCU把组成语句的非全部预备发音数据的各个部分预备发音数据依次分次传给TTS器件发音以形成语句的非全部发音’),其中‘依次’的意思是指依发音次序,其中‘分次传’的意思是指这些传的动作穿插在解码过程中。本发明文件各处叙述分步直接发音方式时有提及的类似‘由MCU按组成语句的全部预备发音数据的各个部分预备发音数据依次分次发音以形成语句的全部发音’的说法中的‘依次’的意思也是指依发音次序。Documents of the present invention have mentioned "by the MCU, each part of the prepared pronunciation data that forms the sentence is sent to the TTS device pronunciation to form the whole pronunciation of the sentence" or similar Saying (for example, 'by the MCU, each part of the preparatory pronunciation data that constitutes the sentence is sent to the TTS device in turn for pronunciation to form the non-full pronunciation of the sentence'), where the meaning of 'sequentially' refers to Pronunciation order, where 'sub-transmission' means that the actions of these transmissions are interspersed in the decoding process. The document of the present invention everywhere narrates step-by-step direct pronunciation mode when there is mentioned in the saying similar 'by MCU according to each part prepared pronunciation data of forming sentence all prepared pronunciation data to pronounce successively to form the whole pronunciation of sentence' "Sequentially" also means to follow the order of pronunciation.

本发明文件各处叙述整步TTS发音方式时有提及的‘由MCU将语句的全部或非全部预备发音数据传给TTS芯片以播出语句的全部发音或非全部发音’或类似提法,是指由MCU将语句的全部或非全部预备发音数据一次传给TTS芯片以播出语句发音。Documents of the present invention have mentioned "all or not all of the prepared pronunciation data of the sentence is sent to the TTS chip by the MCU to broadcast all or not all of the pronunciation of the sentence" or similar formulations mentioned when the whole step TTS pronunciation mode is described everywhere in the document of the present invention, It means that the MCU transmits all or part of the prepared pronunciation data of the sentence to the TTS chip at one time to broadcast the pronunciation of the sentence.

本发明文件各处叙述分步直接发音方式或分步TTS发音方式时有提及的‘可看成是由数个部分预备发音数据组成语句的全部或非全部预备发音数据的’或类似提法,是指在道理上是可由这数个部分预备发音数据组成语句的全部(或非全部)预备发音数据的,但实际上解码时并没作把这数个部分预备发音数据组成为语句的全部(或非全部)预备发音数据的动作,而是在解码过程中每得到一个部分预备发音数据就拿去发音了。Documents of the present invention describe the step-by-step direct pronunciation mode or the step-by-step TTS pronunciation mode when it is mentioned that "can be regarded as all or not all of the prepared pronunciation data of a sentence composed of several partial prepared pronunciation data" or similar formulations , means that in principle all (or not all) of the prepared pronunciation data of the sentence can be formed by these several parts of the prepared pronunciation data, but in fact, these several parts of the prepared pronunciation data are not formed into all of the sentences when decoding (or not all) the action of preparing the pronunciation data, but every time a part of the preparation pronunciation data is obtained in the decoding process, it is used for pronunciation.

本发明文件中各处提及的‘与适用于此语种的编码规则有关此语种对应的适用于此语种的解码规则中有关此语种的解码细则’,其中第1个‘适用于此语种’是修饰‘编码规则’的,其中第2个‘适用于此语种’是修饰‘解码规则’的,其中‘解码细则’是指其中‘解码规则’中有关此语种的解码细则。对于类似说法(比如仍按本段内最前面引用的那段话那样说,但把其中的‘此语种’换成‘此复合语种’),仍是按此类推理解。The "decoding rules applicable to this language in the coding rules applicable to this language" mentioned everywhere in the document of the present invention are related to this language, and the first "applicable to this language" is Modifies 'encoding rules', where the second 'applicable to this language' is modified 'decoding rules', where 'decoding rules' refers to the decoding rules for this language in the 'decoding rules'. For similar statements (for example, still say the same as the paragraph quoted at the beginning of this paragraph, but replace the "this language" with "this compound language"), it is still understood by analogy.

本发明文件中各处提及的‘与对此语种语句编码时所用编码规则有关此语种对应的解码规则中有关此语种的解码细则’或类似提法,其中‘解码细则’是指其中‘解码规则’中有关此语种的解码细则,其中‘与对此语种语句编码时所用编码规则有关此语种对应’是修饰‘解码规则’的。The "decoding rules related to the language in the decoding rules corresponding to the language corresponding to the coding rules used when encoding sentences in this language" or similar formulations mentioned in the document of the present invention, wherein the "decoding rules" refers to the "decoding rules" The decoding rules of this language in the rules', where the 'corresponding to this language is related to the coding rules used when encoding the sentences of this language' is to modify the 'decoding rules'.

在权利要求书中的各权利要求项里,每两个分号之间的文字内容是一个完整技术特征的描述,一个权利要求项里各完整技术特征之间是‘且’的关系,在两个分号之间的文字可能被数个‘,或者,’分成多个部分,每个部分表示此两个分号之间的文字描述的完整技术特征里或的一个选择(但有例外情况,见下段述),每个部分里如再需用‘或’或‘或者’的话那‘或’或‘或者’的前后就无逗号;但为了阅读方便,在权利要求书中有分段,不过仍是以每两个分号之间的文字内容为一个完整技术特征的描述,如一个段落的文字被几个分号隔成3部分,那每部分的文字内容就是一个完整技术特征的描述,那此段落就包括3个完整技术特征的描述,如一个段落的文字没被分号隔开,那此段落就是1个完整技术特征的描述。(但作为独立权利要求的权利要求项1,10,19,22中的前叙部分不适合本段所述)In each claim item in the claims, the text between each two semicolons is a description of a complete technical feature, and the relationship between the complete technical features in a claim is an 'and' relationship. The text between the two semicolons may be divided into multiple parts by several ', or,', and each part represents one of the complete technical features described in the text between the two semicolons or a selection (but there are exceptions, See the next paragraph), if you need to use 'or' or 'or' in each part, there will be no commas before and after 'or' or 'or'; but for the convenience of reading, there are paragraphs in the claims, but The text content between every two semicolons is still used as a description of a complete technical feature. For example, if a paragraph of text is divided into three parts by several semicolons, then the text content of each part is a description of a complete technical feature. Then this paragraph includes descriptions of 3 complete technical features. If the text of a paragraph is not separated by semicolons, then this paragraph is a description of 1 complete technical feature. (However, the forewords inclaims 1, 10, 19, and 22 as independent claims are not suitable for the description in this paragraph)

另外在权利要求书中有类似这样的说法:对于某事物(事物是名词),怎样1,或者,怎样2,这样的说法是指怎样1与怎样2是或的关系,即某事物可以是怎样1或某事物可以是怎样2;比如权利要求书中提到的‘对于所述诸个码字中的某一码字,此码字的载体是所述读物的一个页,或者,此码字的载体是一小块表面不导电的平面材料,此平面材料是粘贴在所述读物的一个页面上的;’就是这样;权利要求书中还有类似本段所说情况(即比如说:对于某事物,怎样1,或者,怎样2,怎样3,或者,怎样4),也按本段所说类推理解。In addition, there is a statement like this in the claims: For something (thing is a noun), how 1, or how 2, this statement refers to the relationship between how 1 and how 2, that is, how something can be 1 or how something can be 2; such as mentioned in the claims 'for a certain codeword among the said codewords, the carrier of this codeword is a page of said reading material, or, this codeword The carrier is a small piece of flat material with a non-conductive surface, and this flat material is pasted on a page of the reading material; that's it; Something, how 1, or how 2, how 3, or how 4), can also be understood by analogy in this paragraph.

本发明文件中各处提及的‘编码规则中有关某语种的直播方式的编码细则’,其中‘直播方式’是修饰‘编码细则’的,其中‘编码规则中有关某语种’也是修饰‘编码细则’的,本发明文件其它类似提法比如‘编码规则中有关某语种的TTS方式的编码细则’,‘编码规则中有关某语种的混合方式的编码细则’,‘解码规则中有关某语种的直播方式的解码细则’,‘解码规则中有关某语种的TTS方式的解码细则’,‘解码规则中有关某语种的混合方式的解码细则’等也都是按此类推理解;本发明文件各处提及的‘编码规则中有关某语种的直播方式或TTS方式或混合方式的编码细则’,其中‘直播方式或TTS方式或混合方式’是修饰‘编码细则’的,其中‘编码规则中有关某语种’也是修饰‘编码细则’的,本发明文件各处提及的‘解码规则中有关某语种的直播方式或TTS方式或混合方式的解码细则’也是按此类推理解。The "encoding rules for the live broadcast method of a certain language in the coding rules" mentioned in the document of the present invention, wherein the "live broadcast method" is a modification of the "encoding rules", wherein the "encoding rules for a certain language" is also a modification of the encoding Detailed rules', other similar references in the document of the present invention such as 'encoding rules for the TTS mode of a certain language in the encoding rules', 'encoding rules for the mixed mode of a certain language in the encoding rules', 'in the decoding rules for a certain language The decoding rules of the live broadcast mode', the 'decoding rules of the TTS mode of a certain language in the decoding rules', the 'decoding rules of the mixed mode of a certain language in the decoding rules', etc. are also understood by analogy; everywhere in the document of the present invention The "encoding rules concerning the live broadcast or TTS or mixed encoding rules of a certain language" mentioned in the "encoding rules", where "live broadcast or TTS or mixed means" is modified "encoding rules", where "in the encoding rules about a certain language "Language" also modifies the "encoding rules", and the "decoding rules in the decoding rules related to the live broadcast mode or TTS mode or mixed mode of a certain language" mentioned in the document of the present invention is also understood by analogy.

如用整步TTS发音方式,解码时涉及的预备发音数据中各音码的拼法数据(其中也可能有音码的发音信息数据)的排列顺序一般是按这些数据对应的发音在语句的全部发音中的顺序来排的(假设这种情况为情形1),本说明书各处述及整步TTS发音方式时也是按情形1来阐述的,但如这些数据(指各音码的拼法数据,还可能有音码的发音信息数据)是按某种约定的顺序在预备发音数据里排列的,显然也可,只要能让TTS器件驱动喇叭依语句的全部或非全部预备发音数据按约定播出语句发音即可,这种情况显然与情形1是等同替换。如用分步TTS发音方式,解码时涉及的部分预备发音数据中各音码的拼法数据(其中也可能有音码的发音信息数据)的排列顺序一般是按这些数据对应的发音在此部分预备发音数据对应的发音中的顺序来排的(假设这种情况为情形2),本说明书各处述及分步TTS发音方式时也是按情形2来阐述的,但如这些数据(指各音码的拼法数据,还可能有音码的发音信息数据)是按某种约定的顺序在此部分预备发音数据里排列的,显然也可,只要能让TTS器件驱动喇叭按约定播出此部分预备发音数据对应的发音即可,这种情况显然与情形2是等同替换。如用整步直接发音方式,解码时涉及的预备发音数据中各音码的发音信息数据的排列顺序一般是按这些数据对应的发音在语句的全部发音中的顺序来排的(假设这种情况为情形3),本说明书各处述及整步直接发音方式时也是按情形3来阐述的,但如这些数据(指各音码的发音信息数据)是按某种约定的顺序在预备发音数据里排列的,显然也可,只要能让MCU或外接的语音芯片驱动喇叭依语句的全部或非全部预备发音数据按约定播出语句发音即可,这种情况显然与情形3是等同替换。如用分步直接发音方式,解码时涉及的部分预备发音数据中各音码的发音信息数据的排列顺序一般是按这些数据对应的发音在此部分预备发音数据对应的发音中的顺序来排的(假设这种情况为情形4),本说明书各处述及分步直接发音方式时也是按情形4来阐述的,但如这些数据(指各音码的发音信息数据)是按某种约定的顺序在此部分预备发音数据里排列的,显然也可,只要能让MCU或外接的语音芯片驱动喇叭按约定播出此部分预备发音数据对应的发音即可,这种情况显然与情形4是等同替换。As with the full-step TTS pronunciation mode, the arrangement order of the spelling data (pronunciation information data of the phonetic codes may also be arranged therein) of each phonetic sign indicating number (wherein the pronunciation information data of the phonetic sign indicating number) is generally by the pronunciation corresponding to these data in the whole of sentence in the preliminary pronunciation data that involves in decoding The order in the pronunciation is arranged (assuming that this situation is situation 1), and this manual also sets forth bysituation 1 when referring to the whole step TTS pronunciation mode everywhere, but as these data (referring to the spelling data of each phonetic sign indicating number) , may also have the pronunciation information data of the sound sign indicating number) is arranged in the preparation pronunciation data according to a certain agreed order, obviously also can, as long as TTS device can be allowed to drive the loudspeaker according to all or not all preparation pronunciation data of sentence broadcast by agreement It is enough to pronounce the sentence, and this situation is obviously equivalent tosituation 1. As with the step-by-step TTS pronunciation mode, the arrangement order of the spelling data (the pronunciation information data of the phonetic codes may also be arranged therein) of each phonetic sign indicating number in the partial preparation pronunciation data involved in decoding is generally by the corresponding pronunciation of these data in this part The order in the pronunciation corresponding to the prepared pronunciation data is arranged (assuming that this situation is situation 2), and when this manual mentions the step-by-step TTS pronunciation mode, it is also set forth according tosituation 2, but as these data (referring to each sound The spelling data of the code, and the pronunciation information data of the sound code may also be arranged in this part of the prepared pronunciation data according to a certain agreed order. Obviously, it is also possible, as long as the TTS device can drive the speaker to play this part according to the agreement It is sufficient to prepare the pronunciation corresponding to the pronunciation data, and this case is obviously an equivalent replacement withcase 2. As with the full-step direct pronunciation mode, the arrangement order of the pronunciation information data of each phonetic sign indicating number in the preliminary pronunciation data involved in decoding is generally arranged by the pronunciation corresponding to these data in the order of the whole pronunciation of the sentence (assuming this situation For situation 3), this specification sheet also sets forth bysituation 3 when referring to the whole step direct pronunciation mode everywhere, but as these data (referring to the pronunciation information data of each phonetic sign indicating number) are in the preparation pronunciation data in the order of certain agreement Arranged in the above, it is obviously also possible, as long as the MCU or an external voice chip can drive the speaker to pronounce the sentence according to the agreement according to all or part of the prepared pronunciation data of the sentence. This situation is obviously an equivalent replacement withCase 3. As with the step-by-step direct pronunciation mode, the arrangement order of the pronunciation information data of each phonetic sign indicating number in the partial preliminary pronunciation data involved in decoding is generally arranged by the pronunciation corresponding to these data in the pronunciation corresponding to this part of the preliminary pronunciation data. (assuming that this situation is situation 4), this manual also sets forth bysituation 4 when referring to the step-by-step direct pronunciation mode everywhere, but as these data (referring to the pronunciation information data of each phonetic sign indicating number) are by certain agreement The order is arranged in this part of the prepared pronunciation data, obviously, as long as the MCU or an external voice chip can drive the speaker to play the pronunciation corresponding to this part of the prepared pronunciation data according to the agreement, this situation is obviously the same ascase 4 replace.

本发明文件各处提及的‘排组’词是指‘排列并组合’的意思。The word 'arrangement' mentioned throughout the documents of the present invention refers to the meaning of 'arrangement and combination'.

本发明文件在述及整步直接发音方式时,提及的‘MCU把各音码的发音信息数据(比如词形音标或词形拼音)按这些数据对应的发音在语句的全部发音中的顺序排组起来成预备发音数据’或类似提法,其中的‘排组起来’也就是‘排列并组合起来’的意思,其中的‘顺序’是指,比如一个音码的发音信息数据对应的发音在语句的全部发音中是第1个发音,那此音码的发音信息数据在预备发音数据中就排第1,如另一个音码的发音信息数据对应的发音在语句的全部发音中是第2个发音,那此音码的发音信息数据在预备发音数据中就排第2,这里说的音码的发音信息数据对应的发音以及语句的全部发音都是指在使用直接发音方式时发的音;本说明书各处提及的其它类似提法,比如述整步TTS发音方式时提及的‘MCU把各音码的拼法数据按这些数据对应的发音在语句的全部发音中的顺序排组起来成预备发音数据’也是按这样类推理解,这里说的TTS方式的‘这些数据对应的发音’以及‘语句的全部发音’都是指在使用TTS发音方式时发的音。When the document of the present invention refers to the direct pronunciation mode of the whole step, the 'MCU mentioned' puts the pronunciation information data (such as phonetic symbols or phonetic symbols of the word form) of each phonetic code in the order of the whole pronunciation of the sentence according to the pronunciation corresponding to these data. Arranged to form prepared pronunciation data' or similar formulations, wherein "arranged" means "arranged and combined", and the "order" refers to, for example, the pronunciation corresponding to the pronunciation information data of a phonetic code Be the 1st pronunciation in the whole pronunciation of sentence, then the pronunciation information data of this phonetic sign indicating number just arranges the 1st in the preparation pronunciation data, as the corresponding pronunciation of the pronunciation information data of another sound sign indicating number is the first in the whole pronunciation ofsentence 2 pronunciations, then the pronunciation information data of this phonetic sign indicating number just ranks the 2nd in the preparation pronunciation data, and the pronunciation corresponding to the pronunciation information data of the phonetic sign indicating number mentioned here and all pronunciations of the sentence all refer to sending out when using the direct pronunciation mode other similar formulations mentioned throughout this manual, such as 'MCU arranges the spelling data of each phonetic code in the order of the entire pronunciation of the sentence according to the pronunciation corresponding to these data, as mentioned in the whole step TTS pronunciation mode Combining them into "prepared pronunciation data" is also understood by analogy. The "pronunciation corresponding to these data" and "all pronunciation of the sentence" in the TTS mode mentioned here refer to the pronunciation when using the TTS pronunciation mode.

本发明文件在述及分步直接发音方式时,提及的‘显然一个部分预备发音数据是由数个音码的发音信息数据组成,这些数据一般是按它们对应的发音在此部分预备发音数据对应的发音中的顺序排列的’,这里说的‘这些数据’是指这数个音码的发音信息数据,这里说的对应的发音及排列都是针对音码的发音信息数据来说的,有关这里说到的‘按它们对应的发音在此部分预备发音数据对应的发音中的顺序’(如这些数据仅是一个音码的发音信息数据,那显然就不涉及排列的顺序了),比如一个音码的发音信息数据对应的发音在此部分预备发音数据对应的发音中是第1个发音,那此音码的发音信息数据在此部分预备发音数据中就排第1,如另一个音码的发音信息数据对应的发音在此部分预备发音数据对应的发音中是第2个发音,那此音码的发音信息数据在此部分预备发音数据中就排第2,这里说的音码的发音信息数据对应的发音以及部分预备发音数据对应的发音都是指在使用直接发音方式时发的音;本说明书各处提及的其它类似提法,比如分步TTS发音方式时提及的‘显然一个部分预备发音数据是由数个音码的拼法数据组成(其中也可能有音码的发音信息数据),这些数据一般是按它们对应的发音在此部分预备发音数据对应的发音中的顺序排列的’也是按这样类推理解,这里说的‘这些数据’是指这数个音码的拼法数据(其中也可能有音码的发音信息数据),这里说的对应的发音及排列都是针对音码的拼法数据或音码的发音信息数据来说的,(并且这里提的引号里的这句话里说的‘它们对应的发音’以及‘部分预备发音数据对应的发音’都是指在使用TTS发音方式时发的音)。本段所述和上一段所述都是显然的。When the document of the present invention refers to the step-by-step direct pronunciation mode, the mentioned 'obviously a part of the prepared pronunciation data is made up of the pronunciation information data of several phonetic codes, and these data are generally prepared pronunciation data in this part by their corresponding pronunciations. The sequence in the corresponding pronunciation is arranged', said 'these data' here refers to the pronunciation information data of these several phonetic codes, and the corresponding pronunciation and arrangement said here are all for the pronunciation information data of the phonetic codes, About the 'order in the pronunciation corresponding to this part of the prepared pronunciation data according to their corresponding pronunciation' mentioned here (as these data are only the pronunciation information data of a phonetic code, then obviously the order of arrangement is not involved), such as The pronunciation corresponding to the pronunciation information data of a phonetic sign indicating number is the 1st pronunciation in the corresponding pronunciation of this part preparation pronunciation data, then the pronunciation information data of this sound sign indicating number just arranges the 1st in this part preparation pronunciation data, as another sound The pronunciation corresponding to the pronunciation information data of sign indicating number is the 2nd pronunciation in the pronunciation corresponding to this part of preparation pronunciation data, then the pronunciation information data of this sound sign indicating number just arranges the 2nd in this part preparation pronunciation data, the sound sign indicating number said here The pronunciation corresponding to the pronunciation information data and the pronunciation corresponding to part of the prepared pronunciation data refer to the pronunciation when using the direct pronunciation method; Obviously a part of the prepared pronunciation data is made up of spelling data of several phonetic codes (the pronunciation information data of the phonetic codes may also be arranged therein), and these data are generally in the corresponding pronunciation of this part prepared pronunciation data by their corresponding pronunciation Sequentially arranged' is also understood by analogy, and "these data" said here refers to the spelling data of these several phonetic codes (the pronunciation information data of the phonetic codes may also be arranged therein), and the corresponding pronunciation and arrangement of said here are all It is for the spelling data of the phonetic code or the pronunciation information data of the phonetic code. Refers to the sound pronounced when using the TTS pronunciation method). Both this paragraph and the previous paragraph are obvious.

如提及时无特别指明有他义,本发明文件各处提及的MCU发音(或MCU驱动喇叭发音),默认是指MCU用发音数据(比如一些音码的发音数据,音码的发音数据是指比如象ADPCM格式这样的录音数据)用直接产音方式驱动喇叭来发语音的。If there is no special indication when mentioning that there are other meanings, the MCU pronunciation (or MCU-driven speaker pronunciation) mentioned in the documents of the present invention refers to the pronunciation data (such as the pronunciation data of some sound codes, the pronunciation data of the sound codes is Refers to the recording data such as the ADPCM format) that uses the direct sound production method to drive the speaker to make a voice.

本发明文件各处所说的表示某语种语句发音的拼合信息的值,其实际上是由某编码规则中有关此语种的编码细则来编码生成的(此编码规则可以是只适用于此语种或适用于包括此语种在内的多个语种),也可说成此拼合信息的值可以由此编码规则来编码生成;本发明文件各处所说的表示某1语种语句发音的拼合信息的值,其实际上是被某解码规则中有关某2语种的解码细则来解码的(此解码规则可以是只适用于此某2语种或适用于包括此某2语种在内的多个语种,这里某1语种与某2语种可以是相同语种也可不是相同语种),也可说成此拼合信息的值可以被此解码规则来解码。对于只适用于一个语种的编码规则,本发明文件所说的用此编码规则来编码显然等同于说成用此编码规则有关此语种的编码细则来编码;对于只适用于一个语种的解码规则,本发明文件所说的用此解码规则来解码显然等同于说成用此解码规则有关此语种的解码细则来解码。(本段提到的一个语种或某1语种或某2语种,可以为非印欧语系任一语种或印欧语系任一语种或任一复合语种)The value of the combined information that represents the pronunciation of a certain language sentence in the documents of the present invention is actually coded by the coding rules about this language in a certain coding rule (this coding rule can be only applicable to this language or applicable) In multiple languages including this language), it can also be said that the value of this mosaic information can be coded and generated by this encoding rule; In fact, it is decoded by the decoding rules related to a certain 2 languages in a certain decoding rule (this decoding rule can be only applicable to this certain 2 languages or applicable to multiple languages including this certain 2 languages, here a certain language It may or may not be the same language as a certain 2 languages), it can also be said that the value of this combined information can be decoded by this decoding rule. For the encoding rules that are only applicable to one language, encoding with this encoding rule in the documents of the present invention is obviously equivalent to encoding with the relevant encoding rules of this language in this encoding rule; for the decoding rules that are only applicable to one language, Decoding with this decoding rule in the said document of the present invention is obviously equivalent to being said to decode with the relevant decoding rules of this language of this decoding rule. (A language or a certain 1 language or a certain 2 languages mentioned in this paragraph can be any language of the non-Indo-European language family or any language of the Indo-European language family or any compound language)

本发明文件在述及插部件式数据卡方式拾音器时,提及此拾音器内单片机芯片与读码头意向连接,单片机芯片与发音器件意向连接,一般有几种情况:1是可能这几个部件(单片机芯片,读码头和发音器件)都在拾音器主体内,本来这几个部件就连接,2是这几个部件都在拾音器主体内,但没完全连接,需把数个部件式数据卡插入拾音器主体成为拾音器后,通过卡上的连线使这几个部件连接,3是这几个部件分布在拾音器主体和数个部件式数据卡,需把数个部件式数据卡插入拾音器主体成为拾音器后,使这几个部件连接。本段说的连接,都是指意向连接。本段说的拾音器和拾音器主体都是指插部件式数据卡方式的。When the document of the present invention refers to the pickup of the plug-in type data card mode, it is mentioned that the single-chip microcomputer chip in the pickup is connected with the intention of the reading dock, and the single-chip microcomputer chip is connected with the pronunciation device intention. Generally, there are several situations: 1. These parts ( The single-chip microcomputer chip, the reading port and the pronunciation device) are all in the main body of the pickup. Originally, these parts are connected. The second is that these parts are all in the main body of the pickup, but they are not fully connected. Several component data cards need to be inserted into the pickup. After the main body becomes a pickup, these parts are connected through the connection on the card. 3. These parts are distributed between the pickup body and several component data cards, and several component data cards need to be inserted into the pickup body to become a pickup. , so that these several components are connected. The connections mentioned in this paragraph all refer to intentional connections. The pickup and the pickup body mentioned in this paragraph both refer to the plug-in data card method.

本说明书中对本说明书和权利要求书的内容引用(比如有说:在权利要求10中提及的‘。。。’),其引用的内容可以是权利要求书或本说明书中的多句话或整句话,也可以是整句话的部分等。In this specification, the content of this specification and claims is cited (for example: "..." mentioned in claim 10), and the content cited can be multiple sentences in the claims or this specification or A whole sentence, or part of a whole sentence, etc.

本发明文件中各处提及的‘包括数个印欧语系语种在内的多个语种’,这里的多个语种是指一个印欧语系语种加上数个其它语种(这句说的数个其它语种,可以全是非印欧语系语种,也可以全是复合语种,也可以是既包含非印欧语系语种也包含复合语种),或多个印欧语系语种,或多个印欧语系语种加上数个其它语种(这句说的数个其它语种,可以全是非印欧语系语种,也可以全是复合语种,也可以是既包含非印欧语系语种也包含复合语种)。(本段里不同句子提及的‘数个’不一定是相同数目,当然本段说的各个‘数个’总是不大于本段说的‘多个语种’中的多个的数目)。The 'multiple languages including several Indo-European languages' mentioned everywhere in the document of the present invention, the multiple languages here refer to one Indo-European language plus several other languages (the several in this sentence Other languages can be all non-Indo-European languages, or all compound languages, or both non-Indo-European languages and compound languages), or multiple Indo-European languages, or multiple Indo-European languages plus List several other languages (the several other languages mentioned in this sentence can be all non-Indo-European languages, or all compound languages, or include both non-Indo-European languages and compound languages). (The "several" mentioned in different sentences in this paragraph are not necessarily the same number, of course, each "several" mentioned in this paragraph is always not greater than the number of multiples in the "multiple languages" mentioned in this paragraph).

本发明文件中各处提及的‘包括数个非印欧语系语种在内的多个语种’,这里的多个语种是指一个非印欧语系语种加上数个其它语种(这句说的数个其它语种,可以全是印欧语系语种,也可以全是复合语种,也可以是既包含印欧语系语种也包含复合语种),或多个非印欧语系语种,或多个非印欧语系语种加上数个其它语种(这句说的数个其它语种,可以全是印欧语系语种,也可以全是复合语种,也可以是既包含印欧语系语种也包含复合语种)。(本段里不同句子提及的‘数个’不一定是相同数目,当然本段说的各个‘数个’总是不大于本段说的‘多个语种’中的多个的数目)。The 'multiple languages including several non-Indo-European languages' mentioned everywhere in the document of the present invention, the multiple languages here refer to a non-Indo-European language plus several other languages (this sentence Several other languages, which may be all Indo-European languages, or all composite languages, or both Indo-European languages and composite languages), or multiple non-Indo-European languages, or multiple non-Indo-European languages The language of the language family plus several other languages (the several other languages mentioned in this sentence can be all Indo-European languages, or all compound languages, or include both Indo-European languages and compound languages). (The "several" mentioned in different sentences in this paragraph are not necessarily the same number, of course, each "several" mentioned in this paragraph is always not greater than the number of multiples in the "multiple languages" mentioned in this paragraph).

本发明文件中各处提及的‘包括此语种在内的多个语种’,这里的多个语种是指此语种加上数个其它语种;除非提及时特别指明这数个其它语种是些什么语种,那这里的数个其它语种,默认可以全是印欧语系语种,或全是非印欧语系语种,或全是复合语种,或既包含印欧语系语种也包含非印欧语系语种,或既包含印欧语系语种也包含复合语种,或既包含非印欧语系语种也包含复合语种,或既包含非印欧语系语种也包含印欧语系语种也包含复合语种等。"Multiple languages including this language" mentioned in this invention document, the multiple languages here refer to this language plus several other languages; unless the mention specifically specifies what these other languages are Language, then the several other languages here can be all Indo-European languages by default, or all non-Indo-European languages, or all compound languages, or include both Indo-European languages and non-Indo-European languages, or both Including Indo-European languages and compound languages, or both non-Indo-European languages and compound languages, or both non-Indo-European languages and Indo-European languages and compound languages, etc.

权利要求书中说的‘同样一个音码可以用于前述编码语句集中的不同语句的拼合发音’,对这句话举个例:比如此一个音码与其它数个音码一起可拼合成前述编码语句集中的语句1的发音,此一个音码还可与另外数个音码一起可拼合成前述编码语句集中的语句2的发音,甚至此一个音码还可用于拼合前述编码语句集中的更多语句的发音;权利要求书中说的‘同样一个音码可以用于前述解码语句集中的不同语句的拼合发音’也是与此类似理解。(本段所说编码语句集也可以是任一非复合语种的或任一复合语种的,本段所说解码语句集也可以是任一非复合语种的或任一复合语种的)Said in the claims 'the same phonetic sign indicating number can be used for the splicing pronunciation of the different sentences in the aforementioned coded sentence collection', give an example to this sentence: such as this phonetic code and other several phonetic codes can be combined into the aforementioned The pronunciation of thesentence 1 in the coded sentence collection, this phonetic sign indicating number can also be assembled into the pronunciation of thesentence 2 in the aforementioned coded sentence collection together with other several phonetic codes, even this phonetic code can also be used to put together more in the aforementioned coded sentence collection Pronunciation of multiple sentences; "the same phonetic code can be used for combined pronunciation of different sentences in the aforementioned decoding sentence set" said in the claims is similar to this understanding. (The coded sentence set mentioned in this paragraph can also be of any non-compound language or any compound language, and the decoding sentence set mentioned in this paragraph can also be of any non-compound language or any compound language)

本发明文件各处所说的‘数个’与‘一个或多个’是等价的。The 'several' and 'one or more' mentioned in various places in the present document are equivalent.

权利要求书中各处有提及‘对于前述编码语句集中任一语句,可用多个音码来拼合成这任一语句的发音’,而用于拼合编码语句集中任意一语句发音的多个音码与用于拼合编码语句集中任意另一语句(这里说的‘编码语句集中任意另一语句’是指除这任意一语句外的编码语句集中任意一语句)发音的多个音码一般是不同的(这句话里的几个‘编码语句集’都是相同所指);权利要求书中各处有提及的‘对于前述解码语句集中任一语句,可用多个音码来拼合成这任一语句的发音’,与此类似理解。(本段所说编码语句集也可以是任一非复合语种的或任一复合语种的,本段所说解码语句集也可以是任一非复合语种的或任一复合语种的)It is mentioned everywhere in the claims that "for any sentence in the aforementioned set of coded sentences, multiple phonetic codes can be combined to form the pronunciation of any sentence", and it is used to combine multiple phonetic codes for the pronunciation of any sentence in the set of coded sentences. The code is generally different from the multiple phonetic codes used to splicing any other sentence in the coded sentence set ('any other sentence in the coded sentence set' mentioned here refers to any sentence in the coded sentence set except this arbitrary sentence). (several 'encoded sentence sets' in this sentence all refer to the same); in the claims, it is mentioned that 'for any sentence in the aforementioned decoded sentence set, multiple phonetic codes can be combined to form this The pronunciation of any sentence is understood similarly to this. (The coded sentence set mentioned in this paragraph can also be of any non-compound language or any compound language, and the decoding sentence set mentioned in this paragraph can also be of any non-compound language or any compound language)

本发明文件涉及的可用多个音码来拼合成一语句的发音,这多个音码中可能有相同的音码,比如英语的语句‘HE AND HE’,如用单词音码来拼合(不考虑陌生词和时态处理),What file of the present invention relates to can be put together the pronunciation of a sentence with a plurality of phonetic codes, and identical phonetic code may be arranged in these a plurality of phonetic codes, such as English sentence ' HE AND HE ', as splicing with word phonetic code (not Consider unfamiliar words and tense handling),

那这里要用3个单词音码‘HE’,‘AND’,‘HE’来拼合(其中两个是相同的单词音码‘HE’),再比如英语的语句‘OK OK’,如用单词音码来拼合(不考虑陌生词和时态处理),那这里要用2个相同的单词音码‘OK’来拼合,而对于英语的语句‘HE READ BOOK’,Then here will use 3 word sound codes 'HE', 'AND', 'HE' to splice (two of them are the same word sound code 'HE'), and for example English sentence 'OK OK' again, if use word phonetic code to put together (not considering unfamiliar words and tense processing), then here will use 2 identical word phonetic codes 'OK' to put together, and for the English sentence 'HE READ BOOK',

如用单词音码来拼合(不考虑陌生词和时态处理),那这里是用3个不同的单词音码‘HE’,‘READ’,‘BOOK’来拼合。As splicing with word phonetic sign indicating number (do not consider strange word and tense processing), here is to use 3 different word phonetic sign indicating number ' HE', ' READ', ' BOOK' to put together.

本发明文件各处涉及的用一种或多种音码来拼合语句发音,都是指用多个音码来拼合语句发音,如是用一种音码来拼合语句发音,那这多个音码都是属于这种音码的,如是用多种音码来拼合语句发音,那这多个音码是分属于这多种音码的,比如这里多种音码是两种(比如语种是英语,两种音码是元辅音和单词),多个音码是5个,那可能这5个音码中有3个是第1种音码,另2个音码是第2种音码。(本段提及的‘多种音码’中的‘多’与‘多个音码’中的‘多’不一定是相同数目;本段所说语句可以是任一非复合语种的或任一复合语种的)The documents of the present invention relate to using one or more phonetic codes to put together the sentence pronunciation, all refer to using a plurality of phonetic codes to put together the sentence pronunciation, if a kind of phonetic code is used to put together the sentence pronunciation, then these multiple phonetic codes All belong to this kind of phonetic sign indicating number, such as use multiple sound sign indicating number to piece together sentence pronunciation, then these a plurality of sound sign indicating numbers belong to these many kinds of sound sign indicating numbers, for example here multiple sound sign indicating numbers are two kinds (such as language kind is English , two kinds of phonetic codes are vowels and consonants and words), and a plurality of phonetic codes is 5, then maybe 3 of these 5 phonetic codes are the first kind of phonetic codes, and the other 2 phonetic codes are the second kind of phonetic codes. (' many' in ' multiple phonetic codes' mentioned in this paragraph and ' many' in ' multiple phonetic codes ' are not necessarily the same number; this paragraph said sentence can be any non-compound language or any a compound language)

对于任一语种,本发明文件各处涉及的用一种或多种音码来拼合语句发音,都是指用多个音码来拼合语句发音,本发明文件所说的拼合信息的值也是指用多个音码来拼合的,所以一般情况语句集中的各语句都是超过1个单词(词)的;如果有时采用一个音码来表示语句发音,比如只有一个单词的语句,这时仅用一个单词音码就可表示其发音,显然用一个音码来表示这种语句发音涉及的编解码处理可沿用本说明书前面反复阐述的用多个音码来拼合语句发音涉及的编解码处理(还更简单些),但本发明文件约定这种用一个音码来表示这种语句发音的方式的编码值不属于本发明文件所说的拼合信息的值的范围,也不涉及本发明文件所说的拼合信息的值的范围,至于用一个音码来表示语句发音所涉及的发音方式及拾音器的结构等则仍同于用多个音码来拼合语句发音所涉及的发音方式及拾音器的结构。而对象这样单个单词(或词)的语句,如仍是用多个音码来做拼合,比如用多个元辅音音码来拼合单词(或词)的发音,那仍属于本发明的内容,这样编码得到的编码值仍属于本发明文件所说的拼合信息的值的范围,涉及的编解码处理可沿用本说明书前面反复阐述的用多个音码来拼合语句发音涉及的编解码处理。For any language, one or more phonetic codes involved in the present invention's documents to put together the sentence pronunciation all refer to a plurality of phonetic codes to put together the sentence pronunciation, and the value of the said combination information in the documents of the present invention also refers to Combine with a plurality of phonetic codes, so each sentence in the general situation sentence collection is all more than 1 word (word); If adopt a phonetic code sometimes to represent the pronunciation of a sentence, such as the sentence that has only one word, at this moment only use A word phonetic sign indicating number just can represent its pronunciation, obviously with a phonetic sign indicating number the coding and decoding process that this sentence pronunciation involves can continue to use a plurality of phonetic codes to put together the coding and decoding process that the sentence pronunciation involves in the front of this specification sheet repeatedly simpler), but the document of the present invention stipulates that this coding value of the mode of representing the pronunciation of this sentence with a phonetic code does not belong to the scope of the value of the said combination information of the document of the present invention, nor does it involve the said document of the present invention The scope of the value of the pieced together information, as to represent the pronunciation mode involved in the sentence pronunciation and the structure of the pick-up device etc. with a phonetic sign indicating number then still be the same as the pronunciation mode involved in the sentence pronunciation and the structure of the pick-up device with a plurality of sound codes. And the sentence of such single word (or word) of object, as still doing splicing with a plurality of phonetic codes, such as using a plurality of vowel-consonant sound codes to splicing the pronunciation of word (or word), that still belongs to content of the present invention, The encoded value obtained by encoding in this way still belongs to the scope of the combined information value mentioned in the document of the present invention, and the related encoding and decoding process can follow the encoding and decoding process involved in combining the pronunciation of a sentence with multiple phonetic codes repeatedly explained in the front of this specification.

本说明书中各处提及的‘直播方式的以只适用于英语为例的编码规则’都是相同所指;本说明书中各处提及的‘直播方式的以只适用于汉语为例的编码规则’都是相同所指;本说明书中各处提及的‘TTS方式的以只适用于英语为例的编码规则’都是相同所指;本说明书中各处提及的‘TTS方式的以只适用于汉语为例的编码规则’都是相同所指;本说明书中各处提及的‘直播方式的以只适用于英语为例的解码规则’都是相同所指;本说明书中各处提及的‘直播方式的以只适用于汉语为例的解码规则’都是相同所指;本说明书中各处提及的‘TTS方式的以只适用于英语为例的解码规则’都是相同所指;本说明书中各处提及的‘TTS方式的以只适用于汉语为例的解码规则’都是相同所指。The "encoding rules for live broadcast methods that are only applicable to English as an example" mentioned everywhere in this manual refer to the same; the "encoding rules for live broadcast methods that are only applicable to Chinese as an example" mentioned everywhere in this manual The rules' all refer to the same; the 'encoding rules of the TTS method that are only applicable to English as an example' mentioned in this manual all refer to the same; The encoding rules that are only applicable to Chinese as an example' all refer to the same; the 'decoding rules that only apply to English in the live broadcast mode' mentioned in this manual all refer to the same; everywhere in this manual The "decoding rules applicable only to Chinese as an example of the live broadcast method" mentioned are the same; the "decoding rules of the TTS method only applicable to English as an example" mentioned everywhere in this manual are the same Refers to; the "decoding rules of the TTS method that are only applicable to Chinese as an example" mentioned everywhere in this specification all refer to the same.

对于权利要求4,本权利要求项里提及的‘所述编码规则’,‘这种编码规则’(此‘这种编码规则’的前面没加‘前述’),‘前述这种编码规则’都是相同所指,在权利要求5,6,7,8,13,14,15,16,17,21,24里也有这样的情况。对于权利要求13,本权利要求项里提及的‘所述解码规则’,‘这种解码规则’(此‘这种解码规则’的前面没加‘前述’),‘前述这种解码规则’都是相同所指,在权利要求14,15,16,17,21,24里也有这样的情况。Forclaim 4, 'the encoding rule' mentioned in this claim item, 'this encoding rule' (the front of this 'this encoding rule' does not add 'aforementioned'), 'the foregoing encoding rule' All refer to the same, and there are such situations inclaims 5, 6, 7, 8, 13, 14, 15, 16, 17, 21, and 24. Forclaim 13, the 'decoding rule' and 'this decoding rule' mentioned in this claim (this 'decoding rule' is not preceded by 'aforementioned'), 'the aforementioned decoding rule' All refer to the same, and there are such situations inclaims 14, 15, 16, 17, 21, and 24.

本发明文件所说的产音方式(直接产音方式或TTS产音方式),是针对拾音器的硬件来说,比如用MCU或MCU外接的语音芯片或TTS器件驱动喇叭来发音;本发明文件所说的直接发音方式和TTS发音方式,以及整步直接发音方式和分步直接发音方式和整步TTS发音方式和分步TTS发音方式,都是针对解码规则,解码程序来说的,当然产音方式与发音方式是有关系的,比如拾音器是采用直接产音方式的拾音器,那解码时就要用直接发音方式而不用TTS发音方式,如拾音器是采用TTS产音方式的拾音器,那解码时就要用TTS发音方式而不用直接发音方式。The said sound production mode (direct sound production mode or TTS sound production mode) of the document of the present invention, is for the hardware of pickup, such as with MCU or MCU external speech chip or the TTS device driving loudspeaker to pronounce; The direct pronunciation method and TTS pronunciation method mentioned, as well as the whole-step direct pronunciation method and step-by-step direct pronunciation method, the whole-step TTS pronunciation method and the step-by-step TTS pronunciation method, are all for decoding rules and decoding programs. The method is related to the pronunciation method. For example, if the pickup adopts the direct sound production method, then the direct pronunciation method should be used instead of the TTS pronunciation method when decoding. For example, the pickup adopts the TTS sound production method. Use the TTS pronunciation method instead of the direct pronunciation method.

本发明文件各处提及的编码规则,其包含的所有编码细则,可以是全部成文的,或全部不成文的,或部分成文部分不成文的(显然这3种形式互相是等同替换),编码涉及使用的编码数据也可以是全部成文的,或全部不成文的,或部分成文部分不成文的(显然这3种形式互相是等同替换),比如一个只适用于一个语种(或适用于多个语种)的编码规则,其中的所有编码细则,可以是全部成文的,或全部不成文的,或部分成文部分不成文的,而且其中的某一个编码细则可以是部分成文部分不成文的,编码涉及使用的编码数据也可以是全部成文的,或全部不成文的,或部分成文部分不成文的,这里说的成文是指比如写在纸上或做成电脑WORD这类文档,这里说的不成文的编码规则(或编码细则或编码数据)是指比如是只在编码者脑子里想好的编码规则(或编码细则或编码数据),他(编码者)做编码时就可按其编码细则都是写在纸上的编码规则,或按其编码细则都在脑子里的编码规则,或按其部分编码细则写在纸上而部分编码细则是在脑子里的编码细则来做编码;同样,本发明文件各处提及的解码规则,其包含的所有解码细则,可以是全部成文的,或全部不成文的,或部分成文部分不成文的(显然这3种形式互相是等同替换),解码涉及使用的解码数据也可以是全部成文的,或全部不成文的,或部分成文部分不成文的(显然这3种形式互相是等同替换),比如一个只适用于一个语种(或适用于多个语种)的解码规则,其中的所有解码细则,可以是全部成文的,或全部不成文的,或部分成文部分不成文的,而且其中的某一个解码细则可以是部分成文部分不成文的,解码涉及使用的解码数据也可以是全部成文的,或全部不成文的,或部分成文部分不成文的,这里说的成文是指比如写在纸上或做成电脑WORD这类文档,这里说的不成文的解码规则(或解码细则或解码数据)是指比如是只在程序员脑子里想好的解码规则(或解码细则或解码数据),他(程序员)做解码时就可按其解码细则都是写在纸上的解码规则,或按其解码细则都在脑子里的解码规则,或按其部分解码细则写在纸上而部分解码细则是在脑子里的解码细则来做解码。(本段所说的一个语种可以是任一非复合语种或任一复合语种,本段所说的多个语种可以全是非复合语种或全是复合语种或既包含非复合语种也包含复合语种)The coding rules mentioned everywhere in the document of the present invention, all the coding rules contained in it, can be all written, or all unwritten, or partially written and partially unwritten (obviously, these 3 forms are equivalent to each other), coding The coded data involved in the use can also be all written, or all unwritten, or partly written and partly unwritten (obviously these three forms are equivalent to each other), for example, one is only applicable to one language (or applicable to multiple language), all the coding rules in it can be all written, or all unwritten, or partly written and partly unwritten, and one of the coding rules can be partly written and partly unwritten, and the coding involves The coded data used can also be all written, or all unwritten, or partially written and partially unwritten. The written data here refers to documents such as written on paper or made into computer WORD. The unwritten data mentioned here The coding rules (or coding rules or coding data) refer to the coding rules (or coding rules or coding data) that are only thought of in the mind of the coder, and he (the coder) can follow the coding rules when coding. It is the coding rules written on paper, or the coding rules that are all in the brain according to its coding rules, or coded according to the part of the coding rules that are written on paper and part of the coding rules are the coding rules in the mind; similarly, this The decoding rules mentioned throughout the invention document, and all the decoding rules contained in it, can be all written, all unwritten, or partially written and partially unwritten (obviously, these three forms are equivalent to each other). The decoding data used can also be all written, or all unwritten, or partially written and partially unwritten (obviously these three forms are equivalent to each other), for example, one is only applicable to one language (or is applicable to multiple languages) ) decoding rules, all the decoding rules can be all written, or all unwritten, or partly written and partly unwritten, and one of the decoding rules can be partly written and partly unwritten, decoding involves the use of The decoded data can also be all written, or all unwritten, or partially written and partially unwritten. The written here refers to documents such as written on paper or made into a computer WORD. The unwritten here Decoding rules (or decoding rules or decoding data) refer to the decoding rules (or decoding rules or decoding data) that are only thought of in the programmer's mind, and he (programmer) can follow the decoding rules when decoding. The decoding rules written on paper, or the decoding rules that are all in the brain according to its decoding rules, or decode according to the partial decoding rules written on paper and part of the decoding rules in the mind. (A language mentioned in this paragraph can be any non-compound language or any compound language, and the multiple languages mentioned in this paragraph can be all non-compound languages or all compound languages, or include both non-compound languages and compound languages)

关于本发明文件中数词‘多’的用法,在本发明文件里,在一个段落里,除非特别注明是相同所指或者可从前后意思推出是相同所指,其后面跟不同量词的‘多’(比如‘多个’中的‘多’与‘多种’中的‘多’)不是相同所指;在一个段落里,除非特别注明是相同所指或者可从前后意思推出是相同所指,如用同一用‘多’与某量词(‘个’或‘种’等)组成的数量词来修饰不同的名词,那不同名词前的此数量词不是相同所指,比如一个段落里出现几个‘多个’,且各‘多个’修饰的是不同的名词,比如此段落里出现的‘多个完整句子’中的‘多个’与此段落里出现的‘多个非完整句子’中的‘多个’不是相同所指(除非特别注明是相同所指或者可从前后意思推出是相同所指);一个段落里,如用同一用‘多’与某量词(‘个’或‘种’等)组成的数量词来修饰字型相同的名词,除非特别注明是相同所指或者可从前后意思推出是相同所指(比如一个段落里前面出现‘多个码字’,后面出现‘这多个码字’,那就可凭这个‘这’字推出此‘这多个码字’中的‘多个’是与前面出现的‘多个码字’中的‘多个’是相同所指的,这里的‘码字’就是被修饰的名词),那处于此段落内不同地方的字型相同的名词前的此数量词不是相同所指,比如一个段落里出现几个‘多种音码’,这几个‘多种音码’里的‘多种’不是相同所指的(除非特别注明是相同所指或者可从前后意思推出是相同所指);除非特别指明有他义,处在不同段落的‘多’默认不是相同所指。(本段说到的段落里出现的两个‘多’是相同所指的意思是:如一个‘多’在其取值范围内任取一个值比如3,另一个‘多’也一定取相同的值3,本段说到的两个‘多’不是相同所指的意思是:如一个‘多’在其取值范围内任取一个值比如3,另一个‘多’的取值与前面这个‘多’的取值没任何关系,可能是3也可能不是3;本段说到的段落里出现的具有相同量词的两个数量词比如两个‘多个’是相同所指或不是相同所指的意思也是类似的)。关于本发明文件中数词‘数’的用法,也是与本段以上所述‘多’的用法相同。(本段提及的发明文件包括本申请的说明书和权利要求书)Regarding the usage of the numeral 'many' in the document of the present invention, in the document of the present invention, in a paragraph, unless it is specially noted that it refers to the same reference or can be inferred from the meaning before and after, it is followed by a different quantifier' Many' (such as 'many' in 'many' and 'many' in 'multiple') are not the same referent; in a paragraph, unless it is specifically noted that it is the same referent or can be inferred from the previous and subsequent meanings Refers to, if different nouns are modified with the same quantifier consisting of 'many' and a certain quantifier ('a' or 'kind', etc.), the quantifiers before different nouns do not have the same referent, for example, how many times appear in a paragraph "multiple" and each "multiple" modify different nouns, such as the "multiple" in the "multiple complete sentences" appearing in this paragraph and the "multiple incomplete sentences" appearing in this paragraph The "plurality" in is not the same referent (unless it is specifically stated that it is the same referent or can be deduced from the previous and subsequent meanings); "kinds" etc.) to modify nouns with the same font type, unless it is specified that they refer to the same thing or can be deduced from the meaning before and after (such as "multiple code words" appearing in the front of a paragraph, followed by 'these multiple codewords', then the 'multiple' in the 'those multiple codewords' can be deduced from the 'this' word and the 'multiple' in the preceding 'multiple codewords' are refer to the same, the 'code word' here is the modified noun), the quantifier before the noun with the same font in different places in this paragraph is not the same referent, for example, there are several 'various' in a paragraph The "multiple phonetic codes" in these several "multiple phonetic codes" do not refer to the same thing (unless it is specially noted that they are the same thing or can be deduced from the meaning before and after); unless otherwise specified In other words, 'multiple' in different paragraphs does not mean the same thing by default. (The two "many" appearing in the paragraph mentioned in this paragraph are the same meaning: if one "many" takes any value such as 3 within its value range, the other "many" must also take the same The value of 3, the two "multiple" mentioned in this paragraph are not the same means: if one "multiple" can take a value such as 3 within its value range, the value of the other "multiple" is the same as the previous The value of "many" does not matter, it may be 3 or not 3; two quantifiers with the same quantifier appearing in the paragraph mentioned in this paragraph, such as two "many" are the same referent or not the same referent means similarly). Regarding the usage of the numeral 'number' in the document of the present invention, it is also the same as the usage of 'many' mentioned above in this paragraph. (The invention documents mentioned in this paragraph include the specification and claims of this application)

本发明文件提及的把一个(或数个)数据卡插入拾音器主体成为拾音器,是默认此拾音器主体是插数据卡方式拾音器主体,且是指把这一个(或数个)数据卡插入此拾音器主体使一个(或数个)数据卡和拾音器主体合为一个插数据卡方式拾音器;本发明文件提及的把一个(或数个)部件式数据卡插入拾音器主体成为拾音器,是默认此拾音器主体是插部件式数据卡方式拾音器主体,且是指把这一个(或数个)部件式数据卡插入此拾音器主体使一个(或数个)部件式数据卡和拾音器主体合为一个插部件式数据卡方式拾音器。The document of the present invention refers to inserting one (or several) data cards into the main body of the pickup to become a pickup, which means that the main body of the pickup is the main body of the pickup in the way of inserting data cards by default, and refers to inserting the one (or several) data cards into the pickup The main body makes one (or several) data cards and the main body of the pickup combine into a pickup in the form of inserting a data card; as mentioned in the document of the present invention, one (or several) component data cards are inserted into the main body of the pickup to become a pickup, which is the pickup main body by default. It is the main body of the pickup in the form of a plug-in data card, and it means that one (or several) component data cards are inserted into the pickup body so that one (or several) component data cards and the pickup body are combined into one plug-in data card. Card mode pickup.

如提及时无特别指明有他义,本发明文件各处提及的其前面没数量词的‘拾音器主体’默认是指单个的拾音器主体。If there is no special indication of other meanings when mentioned, the 'pickup body' mentioned in the documents of the present invention without a quantifier in front of it refers to a single pickup body by default.

权利要求书里提及的‘所述诸个码字中有部诸码字;前述部诸码字是所述诸个码字中的全部码字或部分码字;所述诸个码字可以是一个或多个码字;前述部诸码字可以是一个或多个码字;’,这里提及的两个‘多个’可以是相同数目或不是相同数目,而且诸个码字包含的码字个数是大于或等于部诸码字包含的码字个数的(比如如诸个码字包含多个码字,部诸码字包含多个码字,那本括号内第1句里的‘多’就大于或等于本括号内第2句里的‘多’)。权利要求书里提及的‘所述部诸码字是所述诸个码字中的全部码字或部分码字;所述诸个码字可以是一个或多个码字;所述部诸码字可以是一个或多个码字’也是按这样理解。(本发明文件各处提及的诸个码字是多少个码字与诸个码字包含多少个码字是相同表达方式,本发明文件各处提及的部诸码字是多少个码字与部诸码字包含多少个码字是相同表达方式,本发明文件各处提及的诸个码字可以是多少个码字与诸个码字可以包含多少个码字是相同表达方式,本发明文件各处提及的部诸码字可以是多少个码字与部诸码字可以包含多少个码字是相同表达方式,比如‘诸个码字包含多个码字’与‘诸个码字是多个码字’是相同意思,比如‘部诸码字包含一个码字’与‘部诸码字是一个码字’是相同意思,比如‘部诸码字可以是一个或多个码字’与‘部诸码字可以包含一个或多个码字’是相同意思)There are part codewords in the described codewords mentioned in the claims; the aforementioned codewords are all codewords or part codewords in the described codewords; the codewords can be Is one or more codewords; All codewords of the foregoing part can be one or more codewords; ', the two 'multiple' mentioned here can be the same number or not the same number, and the codewords contained The number of codewords is greater than or equal to the number of codewords that all codewords comprise (such as all codewords comprise a plurality of codewords, and all codewords of the department comprise a plurality of codewords, then in the first sentence in the parentheses The 'many' in this bracket is greater than or equal to the 'many' in the second sentence in this bracket). The 'all codewords of said part mentioned in the claims are all codewords or part codewords in said all codewords; said all codewords can be one or more codewords; said all codewords A codeword can be one or more codewords' is also understood in this way. (How many codewords and how many codewords are included in the codewords mentioned in the documents of the present invention is the same expression, how many codewords are the codewords in the codewords mentioned in the documents of the present invention It is the same expression as how many code words are included in the code words of the department, and how many code words and how many code words can be included in the various code words mentioned in the documents of the present invention are the same expression. How many codewords all codewords can be and how many codewords some codewords can contain mentioned in the invention document is the same expression, such as 'the codewords contain multiple codewords' and 'the codewords The word is a plurality of codewords' is the same meaning, such as 'partial codewords include a codeword' and 'partial codewords are a codeword' is the same meaning, such as 'partial codewords can be one or more codewords' Word 'and' all codewords can contain one or more codewords' have the same meaning)

本发明文件各处提及的诸个码字中的每个码字的码值是共与数个拼合信息的值有关的,是指这诸个码字中每一码字都有此特征:此码字的码值是共与数个拼合信息的值有关的;如在这诸个码字中某一码字的码值是共与数个拼合信息的值有关的,这诸个码字中另外任一码字的码值也是共与数个拼合信息的值有关的,那上一句提到的‘数个’和上上一句提到的‘数个’可能是相同数目也可能不是相同数目且上一句提到的‘数个拼合信息的值’与上上一句提到的‘数个拼合信息的值’可能是相同所指也可能不是相同所指。本发明文件各处提及的类似提法也有类似含义,比如部诸码字中的每个码字的码值是共与数个拼合信息的值有关的,或者提及的诸个码字(或部诸码字)中的每个码字的码值是共与数个拼合信息的值有关的,或其它类似提法,也都有与从本段第2行‘此码字的码值’至第6行‘相同所指’所述含义类似的含义。本说明书在述及诸个码字定义时,提及的‘把语音编码有声读物所有页面上的所有其码值是共与数个表示语句发音的拼合信息的值有关的码字都集合起来’,也是这样的,是指这所有要被集中起来的码字中每一码字都有此特征:此码字的码值是共与数个拼合信息的值有关的;如在这所有要被集中起来的码字中某一码字的码值是共与数个拼合信息的值有关的,这所有要被集中起来的码字中另外任一码字的码值也是共与数个拼合信息的值有关的,那上一句提到的‘数个’和上上一句提到的‘数个’可能是相同数目也可能不是相同数目且上一句提到的‘数个拼合信息的值’与上上一句提到的‘数个拼合信息的值’可能是相同所指也可能不是相同所指。The code value of each code word in the various code words mentioned in the documents of the present invention is related to the value of several pieced together information, and refers to that each code word has this feature in these various code words: The code value of this code word is related to the value of several mosaic information altogether; As the code value of a certain code word in these code words is related to the value of several mosaic information altogether, these code words The code value of any other code word in is also related to the value of several combined information, so the "several" mentioned in the previous sentence and the "several" mentioned in the previous sentence may or may not be the same number The number and the "values of several combined information" mentioned in the previous sentence may or may not be the same as the "values of several combined information" mentioned in the previous sentence. The similar references mentioned in the documents of the present invention everywhere have similar meanings, for example, the code value of each code word in all code words is related to the value of several pieces of combined information, or the mentioned code words ( or the code value of each code word in all code words) is related to the value of several splicing information altogether, or other similar formulations also all have the code value related to this code word from the 2nd row of this paragraph ' to line 6 'same referent' has a meaning similar to that stated. When referring to the definitions of various codewords in this manual, it is mentioned that "all the codewords whose code values on all pages of the audio-coded audiobook are related to the value of the combined information representing the pronunciation of the sentence are all assembled" , is also like this, which means that each codeword in all the codewords to be gathered has this feature: the code value of this codeword is related to the value of several pieces of combined information; The code value of a certain code word in the collected code words is related to the value of several splicing information, and the code value of any other code word in all the code words to be gathered is also related to several splicing information. The value of the "number" mentioned in the previous sentence and the "number" mentioned in the previous sentence may or may not be the same number, and the "number of combined information values" mentioned in the previous sentence is the same as The 'values of several combined information' mentioned in the previous sentence may or may not be the same referent.

在权利要求书中各处提及的‘同样多个’,是指这‘多个’与从本段落中这‘多个’的位置往前倒数第1个‘多个’是相同数目(相同数目的意思是指比如前一个‘多个’是2个,那后一个‘多个’也是2个,如前一个‘多个’是4个,那后一个‘多个’也是4个),比如权利要求书中提及的‘所述数个数据卡是多个数据卡且所述数个卡插槽是同样多个卡插槽’,其中后面的‘多个’与前面的‘多个’(从后面‘多个’的位置往前倒数第1个‘多个’就是前面的‘多个’)是相同数目;同样,在权利要求书中各处提及的‘同样数个’,是指这‘数个’与从本段落中这‘数个’的位置往前倒数第1个‘数个’是相同数目。(本段各引号内的内容都是权利要求书里内容的引用)The "same number" mentioned in the claims means that the "multiple" is the same number as the first "multiple" from the position of the "multiple" in this paragraph (the same The number means that for example, the previous "multiple" is 2, and the latter "multiple" is also 2; if the previous "multiple" is 4, then the latter "multiple" is also 4), For example, "the several data cards are a plurality of data cards and the plurality of card slots are the same number of card slots" mentioned in the claims, wherein the following "multiple" is the same as the front "multiple '(the first 'multiple' from the position of 'multiple' to the front is the previous 'multiple') is the same number; similarly, the 'same number' mentioned everywhere in the claims, It means that this "several" is the same number as the first "several" from the position of this "several" in this paragraph. (The content in the quotation marks in this paragraph is the content in the claims)

本说明书里有在括号里还有括号的情况,一般是对外括号内的内容有注释,此注释的内容就放在内括号内。In this manual, there are cases where there are brackets inside the brackets. Generally, there are comments on the content in the outer brackets, and the contents of the comments are placed in the inner brackets.

权利要求书中多次提及‘所述拾音器的所述单片机芯片是按一种解码规则中有关语种2的解码细则来对读码得到的前述拼合信息的值1执行解码使所述拾音器的所述发音器件播出前述语句1发音的’或类似表达,其中的‘一种’是只修饰‘解码规则’的;本发明文件各处提及的‘一种解码规则中有关此语种的解码细则’或类似表达也是这样的,其中的‘一种’是只修饰‘解码规则’的;本发明文件各处提及的‘一种编码规则中有关此语种的编码细则’或类似表达也是这样的,其中的‘一种’是只修饰‘编码规则’的。It is repeatedly mentioned in the claims that 'the single-chip microcomputer chip of the sound pickup is to decode thevalue 1 of the aforementioned combination information obtained by reading the code according to the decoding rules of therelevant language 2 in a decoding rule so that all the sound pickups Said pronunciation device broadcasts the ' or similar expression of pronunciation ofaforementioned sentence 1, and wherein ' a kind of ' only modifies ' decoding rule '; The detailed rules about the decoding of this language in the ' a kind of decoding rule mentioned everywhere in the document of the present invention ' or similar expressions are also like this, and the 'one' is only modified by 'decoding rules'; the same is true of the 'encoding rules related to this language in a coding rule' or similar expressions mentioned throughout the document of the present invention , one of the 'one' is only to modify the 'encoding rules'.

本说明书各处提及的说明书句子中某一成分修饰另一成分,所说的修饰默认是指只修饰,即某一成分只限定另一成分,而不限定另一成分之外的内容,比如有说在权利要求中各处提及的‘所述诸个码字中的某一码字’中的‘所述’是只用于修饰此‘所述’后面的‘诸个码字’的,那此‘所述’就是只用于限定此‘诸个码字’的。In the description sentences mentioned in this manual, a certain component modifies another component, and the modification mentioned refers to only modification by default, that is, a certain component only limits another component, and does not limit the content other than the other component, such as It is said that the "said" in "a certain codeword among the said codewords" mentioned in the claims is only used to modify the "codewords" behind this "said" , then the 'said' is only used to limit the 'codewords'.

如提及时无特别指明有他义,在本说明书中各处提及的第多少多少页,默认是指本说明书的第多少多少页。If there is no other meaning specified when mentioning, the number of pages mentioned in various places in this manual refers to the number of pages in this manual by default.

如有拾音器(包括本说明书前面说的电子产品拾音器),把读码头和MCU集成在一起(比如手机可读码并解码播语音,而此手机的主MCU与用于读码的照相电路是集成在一起的,这里照相电路相当于读码头),那这种情况应该是把读码头和MCU分开的等同替换。If there is a pickup (including the electronic product pickup mentioned earlier in this manual), integrate the reading dock with the MCU (for example, the mobile phone can read the code and decode the broadcast voice, and the main MCU of the mobile phone is integrated with the camera circuit for code reading. Together, the camera circuit here is equivalent to the reading dock), then this situation should be an equivalent replacement of separating the reading dock and the MCU.

本发明文件提到的‘其它电子类产品’是指MP3播放机,复读机,手机等。The 'other electronic products' mentioned in the documents of the present invention refer to MP3 players, repeaters, mobile phones and the like.

本发明文件提到的手机,包括GSM,CDMA等手机,也包括小灵通等。The mobile phone mentioned in the document of the present invention includes mobile phones such as GSM and CDMA, and also includes PHS and the like.

如与某拾音器有关的存储器集里只含一个存储器,那此存储器集里的各存储器显然就是指这一个存储器。Only contain a storer in the memory set relevant with certain pickup, then each memory store in this memory set obviously refers to this storer.

本发明文件提到的作为码字载体的小块平面材料,其面积多大并无特别限制,如果用一面积比较大的平面材料,上有一个或多个码字,把此平面材料粘贴在作为有声读物主体的读物的页面上,一般不会把读物的一页面覆盖完,此平面材料也等同是小块平面材料。The small piece of plane material mentioned in the document of the present invention as the code word carrier is not particularly limited in its area. If a plane material with a relatively large area has one or more code words on it, this plane material is pasted on as a code word carrier. On the page of the reading material of the main body of the audiobook, generally, one page of the reading material will not be covered completely, and this flat material is also equivalent to a small piece of flat material.

权利要求19中提到的‘需将前述数个数据卡插入前述拾音器主体的前述数个卡插槽里’,如是将多个数据卡插入多个卡插槽的情况(这句话里两个‘多’的数值相同),那显然是按一个数据卡与一个卡插槽这样一对一的插。权利要求22中提到的‘需将前述数个部件式数据卡插入前述拾音器主体的前述数个部件式卡插槽里’,如是将多个部件式数据卡插入多个部件式卡插槽的情况(这句话里两个‘多’的数值相同),那显然是按一个部件式数据卡与一个部件式卡插槽这样一对一的插。Mentioned in claim 19, the aforementioned several data cards need to be inserted in the aforementioned several card slots of the aforementioned pickup body', such as the case where a plurality of data cards are inserted into a plurality of card slots (two in this sentence The value of 'Multiple' is the same), it is obviously a one-to-one insertion of one data card and one card slot. As mentioned in claim 22, 'the aforementioned several component-type data cards need to be inserted into the aforementioned several component-type card slots of the aforementioned pickup body', such as inserting multiple component-type data cards into multiple component-type card slots In this case (in this sentence, the values of the two 'many' are the same), it is obviously a one-to-one insertion of a component-type data card and a component-type card slot.

如有一拾音器主体,主体内的存储器里存储有只用于某语种解码的解码程序及解码数据,此拾音器主体又可插数据卡(或部件式数据卡),数据卡(或部件式数据卡)上存储有只用于另一语种解码的解码程序及解码数据,那此拾音器主体在读前一语种的拼合信息码时,不用插卡,此拾音器主体相当于非插卡方式拾音器主体(也即非插卡方式拾音器);此拾音器主体上在插入本段前述的数据卡(或部件式数据卡)后,可用于读另一语种的拼合信息码,这时此拾音器主体相当于插数据卡方式拾音器主体(或插部件式数据卡方式拾音器主体),把本段前述的数据卡(或部件式数据卡)插入此拾音器主体而形成的拾音器就相当于是插数据卡方式拾音器(或插部件式数据卡方式拾音器)。If there is a pickup main body, the memory in the main body stores a decoding program and decoding data that are only used for decoding in a certain language, and this pickup main body can be inserted into a data card (or a component data card), a data card (or a component data card) The decoding program and decoding data that are only used for decoding in another language are stored on the computer, so the main body of the pickup does not need to insert a card when reading the combined information code of the previous language. card insertion mode pickup); the main body of this pickup can be used to read the spliced information code of another language after inserting the aforementioned data card (or component data card) in this paragraph, and at this moment, this pickup main body is equivalent to the pickup of the data card mode Main body (or plug-in type data card mode pickup body), the pickup formed by inserting the aforementioned data card (or component type data card) into the pickup body is equivalent to the plug-in data card mode pickup (or plug-in type data card mode pickup).

对于一个拾音器主体,它同时具有不用插卡就可读码解码播出语句发音,插数据卡来读码解码播出语句发音,插部件式数据卡来读码解码播出语句发音这3个特征中的全部或任意2个,那就可根据使用时的具体使用情况是这3种情况的哪一种,来把此拾音器主体当成相应的非插卡方式或插数据卡方式或插部件式数据卡方式拾音器主体;比如一个拾音器主体,既可不插卡就可读码解码播出语句发音,又可只插数据卡而形成插数据卡方式拾音器以读码解码播出语句发音,又可只插部件式数据卡而形成插部件式数据卡方式拾音器以读码解码播出语句发音,那在此拾音器主体不插卡使用时,可把此拾音器主体等同看成是非插卡方式拾音器主体且把此拾音器主体等同看成是非插卡方式拾音器,而在只把数个数据卡插入此拾音器主体而形成拾音器以使用时,可把此拾音器主体等同看成是插数据卡方式拾音器主体且把此形成的拾音器等同看成是插数据卡方式拾音器,而在只把数个部件式数据卡插入此拾音器主体而形成拾音器以使用时,可把此拾音器主体等同看成是插部件式数据卡方式拾音器主体且把此形成的拾音器等同看成是插部件式数据卡方式拾音器。For a pickup body, it also has three features: code reading, decoding and broadcasting sentence pronunciation without inserting a card, inserting a data card to read code, decoding and broadcasting sentence pronunciation, inserting a component type data card to read code, decoding and broadcasting sentence pronunciation. All or any two of them, then the main body of the pickup can be regarded as the corresponding non-plug-in card mode or plug-in data card mode or plug-in data Card mode pickup main body; for example, a pickup main body can read code, decode and broadcast sentence pronunciation without inserting a card, or only insert a data card to form a data card-inserted pickup to read code, decode and broadcast sentence pronunciation, or only insert a data card The part-type data card is used to form the plug-in data card mode pickup to read the code and decode the broadcast sentence pronunciation, then when the pickup body is not inserted into the card, the pickup body can be regarded as the non-card type pickup body and this The main body of the pickup is equivalently regarded as a non-card pickup, and when only several data cards are inserted into the pickup main body to form a pickup for use, the pickup main body can be regarded as the pickup main body of the data card insertion mode and formed The pickup is equivalently regarded as the pickup of the plug-in data card mode, and when only several component data cards are inserted into the pickup main body to form a pickup for use, the pickup main body can be regarded as the pickup main body of the plug-in component data card mode and The pickup that this forms is equated as the plug-in type data card mode pickup.

本说明书所说的插数据卡方式拾音器,是以把存储器做成数据卡的形式,插数据卡方式拾音器主体上做有插槽,使用时直接把数据卡插入插数据卡方式拾音器主体的插槽而形成拾音器为例的,本说明书涉及插数据卡的地方默认是指这种方式;至于其它连接方式(比如用软缆线把数据卡与插数据卡方式拾音器主体连接起来),本质与此相同,是本段前面述的直接插数据卡的方式的等同替换。本说明书所说的插部件式数据卡方式拾音器,是以把部件,存储器芯片等(如部件中包括存储器比如MCU内有存储器,那存储器芯片不是必需的)做成部件式数据卡的形式,插部件式数据卡方式拾音器主体上做有插槽,使用时直接把部件式数据卡插入插部件式数据卡方式拾音器主体的插槽而形成拾音器为例的,本说明书涉及插部件式数据卡的地方默认是指这种方式;至于其它连接方式(比如用软缆线把部件式数据卡与插部件式数据卡方式拾音器主体连接起来),本质与此相同,是本段前面述的直接插部件式数据卡的方式的等同替换。The data card pickup mentioned in this manual is to make the memory into a data card. There is a slot on the main body of the data card pickup. When using it, directly insert the data card into the slot on the main body of the data card pickup. Taking the pickup as an example, the place where the data card is inserted in this manual refers to this method by default; as for other connection methods (such as connecting the data card to the main body of the pickup with a data card by using a flexible cable), the essence is the same as this , which is an equivalent replacement of the method of directly inserting the data card mentioned earlier in this paragraph. The plug-in type data card mode pickup mentioned in this manual is to make parts, memory chips, etc. There is a slot on the main body of the component type data card pickup. When using it, directly insert the component type data card into the slot of the component type data card type pickup body to form a pickup as an example. This manual refers to the place where the component type data card is inserted. The default refers to this method; as for other connection methods (such as connecting the component type data card and the plug-in type data card pickup body with a flexible cable), the essence is the same as this, which is the direct plug-in type mentioned above in this paragraph. An equivalent replacement by means of a data card.

本说明书各处述及到存储器集(其前面没加‘与拾音器有关的’)时,是默认此存储器集是与拾音器有关的存储器集(此拾音器是在提到存储器集那里涉及的拾音器)。When this specification sheet mentions storage set (do not add ' relevant to the pick-up' in front of it), it is acquiescence that this storage set is the storage set relevant to the pickup (this pickup is the pickup that is mentioned there related to the storage set).

如把一个拼合信息的值的BIT分开放在多个码字里(或把数个拼合信息的值的变换值的BIT分开放在多个码字里),这多个码字又分别处于多本有声读物的整页面上,使用时需由拾音器读这多本有声读物整页面上的这多个码字,而得到拼合信息的值来解码播出语句发音,这种情况可把这多本有声读物等同看成合起来是一本有声读物。As the BIT of the value of a pieced together information is separately placed in a plurality of codewords (or the BIT of the transformation value of the value of several pieced together information is separately placed in a plurality of codewords), these multiple codewords are respectively in multiple codewords On the entire page of the audiobook, the multiple codewords on the entire page of the audiobook need to be read by the pickup to obtain the value of the combined information to decode the pronunciation of the broadcast sentence. In this case, the multiple audiobooks can be used Audiobooks are equivalent to being considered as one audiobook together.

本发明文件各处说到的从多个码字得到某值,是指读这多个码并把这多个码字的值组合起来后得到某值,比如把一个拼合信息的值放在多个码字里时,读码使用时就要读这多个码字并把这多个码字的值组合起来后得到此拼合信息的值,即从这多个码字得到此拼合信息的值,或是比如把一个拼合信息的值的变换值放在多个码字里,读码使用时要读这多个码字并把这多个码字的值组合起来后得到此拼合信息的值的变换值,并对此变换值作逆变换,这样可从这多个码字得到此拼合信息的值。Obtaining a certain value from a plurality of codewords mentioned in the document of the present invention refers to reading these multiple codes and combining the values of these multiple codewords to obtain a certain value, such as putting the value of a piece of information in multiple When reading code words, it is necessary to read these multiple code words and combine the values of these multiple code words to obtain the value of the combined information, that is, to obtain the value of the combined information from these multiple code words. , or for example, put the transformation value of a combined information value in multiple codewords. When reading the code, you need to read the multiple codewords and combine the values of the multiple codewords to get the value of the combined information. and perform inverse transformation on the transformed value, so that the value of the spliced information can be obtained from the plurality of codewords.

本发明文件有说到从一个或多个码字得到某拼合信息的值,那此拼合信息的值在这一个或多个码字里放的形式可以是各种各样的,比如直接把此拼合信息的值放在这一个或多个码字里,或是把此拼合信息的值的变换值放在这一个或多个码字里等。The document of the present invention mentions that the value of a certain combination information is obtained from one or more codewords, then the value of the combination information can be placed in various forms in the one or more codewords, such as directly putting this The value of the combined information is placed in the one or more codewords, or the transformation value of the value of the combined information is placed in the one or more codewords, etc.

本发明文件述及的拾音器,也可用手机采用的照相头来读码,那一般就可把照相头当成读码头(这种情况是照相机图象处理电路在主MCU内且一般是照相头连线到主MCU的,这种情况照相涉及的读码软件一般放在主MCU内),或把照相头加照相机图象处理集成电路当成读码头(这种情况照相机图象处理电路是做成照相机图象处理集成电路的,照相头和此集成电路都在主MCU芯片外,一般是照相头连线到此集成电路且此集成电路连线到主MCU,这种情况照相涉及的读码软件一般可放在照相机图象处理集成电路内,由照相机图象处理集成电路得到读码结果后,再由此集成电路把读码结果传给主MCU),用照相功能读的相应的有声读物整页面上的码字一般是二维条形码(且因二维条形码的BIT容量大,所以可在二维条形码上放较多拼合信息的值,由拾音器读码字并按码值里的拼合信息的值来解码播出语句发音),也可修改照相涉及的读码软件使此种照相方式的读码头可以读导电码或一维条形码或图形码等(甚至可修改照相涉及的读码软件使这些不同码种的码字都可被一个照相方式的读码头分别读)。本发明文件述及的拾音器的读码头可以是本段所述的读码头,可把这种读码头称为照相式读码头。本段所说的拾音器既可以是本说明书前面说到的电子产品拾音器(而且电子产品拾音器可以为非插卡方式拾音器或插数据卡方式拾音器或插部件式数据卡方式拾音器),也可以是本发明文件提到的除电子产品拾音器以外的拾音器(而且除电子产品拾音器以外的拾音器可以是非插卡方式拾音器或插数据卡方式拾音器或插部件式数据卡方式拾音器)。The pickup mentioned in the document of the present invention can also use the camera head that the mobile phone adopts to read codes, then generally the camera head can be used as a reading dock (this situation is that the camera image processing circuit is in the main MCU and generally the camera head is connected To the main MCU, in this case, the code reading software involved in photography is generally placed in the main MCU), or the camera head plus the camera image processing integrated circuit is used as a reading terminal (in this case, the camera image processing circuit is made into a camera image For integrated circuits, the camera head and the integrated circuit are outside the main MCU chip. Generally, the camera head is connected to the integrated circuit and the integrated circuit is connected to the main MCU. In this case, the code reading software involved in the camera can generally be Put it in the camera image processing integrated circuit, after the code reading result is obtained by the camera image processing integrated circuit, then the integrated circuit sends the code reading result to the main MCU), and the corresponding audiobook read with the camera function is on the entire page The code word is generally a two-dimensional barcode (and because the BIT capacity of the two-dimensional barcode is large, so you can put more values of the combined information on the two-dimensional barcode, the code word is read by the pickup and the value of the combined information in the code value is used. Decoding and broadcasting sentence pronunciation), can also modify the code reading software involved in photography so that the reading terminal of this photographic mode can read conductive codes or one-dimensional bar codes or graphic codes, etc. (even can modify the code reading software involved in photography to make these different codes All kinds of code words can be read respectively by the reading terminal of a photographic mode). The reading terminal of the pickup mentioned in the document of the present invention can be the reading terminal described in this paragraph, and this reading terminal can be called a photographic type reading terminal. The pickup mentioned in this paragraph can be the electronic product pickup mentioned earlier in this manual (and the electronic product pickup can be a non-plug-in card mode pickup or a data card mode pickup or a plug-in data card mode pickup), or this The pickups mentioned in the invention documents are pickups other than pickups for electronic products (and pickups other than pickups for electronic products can be non-card pickups or data card pickups or plug-in data card pickups).

显然在权利要求1,10,19,22中提到的语音编码有声读物,读物,拾音器主体,拾音器都是指一个的。Apparently, the voice coded audiobook mentioned inclaims 1, 10, 19, and 22, the reading material, the main body of the pickup, and the pickup all refer to one.

本发明文件各处在描述把一个(或数个)数据卡插入一个插数据卡方式拾音器主体时,说这一个(或数个)数据卡与此拾音器主体形成一个插数据卡方式拾音器,与说这一个(或数个)数据卡与此拾音器主体合为一个插数据卡方式拾音器是等价的;本发明文件各处在描述把一个(或数个)部件式数据卡插入一个插部件式数据卡方式拾音器主体时,说这一个(或数个)部件式数据卡与此拾音器主体形成一个插部件式数据卡方式拾音器,与说这一个(或数个)部件式数据卡与此拾音器主体合为一个插部件式数据卡方式拾音器是等价的。When describing that one (or several) data cards are inserted into a data card mode pickup body everywhere in the document of the present invention, it is said that this (or several) data cards and the pickup body form a data card mode pickup, and say This one (or several) data card and this pickup main body are combined into one plug-in data card mode pickup and are equivalent; During card mode pickup main body, say this one (or several) component type data card and this pickup main body form a plug-in component type data card mode pickup, say this one (or several) component type data card and this pickup main body combine For a plug-in data card mode pickup is equivalent.

显然,本发明文件各处提到的数据卡上存储有数据,默认是指数据卡上的存储器里存储有数据;本发明文件各处提到的部件式数据卡上存储有数据,默认是指部件式数据卡上的存储器里存储有数据。显然,本发明文件各处提到的把数据存储在数据卡上,默认是指把数据存储在数据卡上的存储器里;本发明文件各处提到的把数据存储在部件式数据卡上,默认是指把数据存储在部件式数据卡上的存储器里。显然,本发明文件各处提到的数据卡上的数据,默认是指数据卡上的存储器里的数据;本发明文件各处提到的部件式数据卡上的数据,默认是指部件式数据卡上的存储器里的数据。显然,本发明文件各处提到的拾音器主体内存储有数据,默认是指拾音器主体内的存储器里存储有数据。本发明文件各处提到的把数据存储在拾音器主体内,默认是指把数据存储在拾音器主体内的存储器里。本发明文件各处提到的拾音器主体内的数据,默认是指拾音器主体内的存储器里的数据。如与某拾音器有关的存储器集里只含一个存储器,那此存储器集里的各存储器显然就是指这一个存储器。Apparently, the data stored on the data card mentioned in the document of the present invention refers to the data stored in the memory on the data card by default; the data stored in the component data card mentioned in the document of the present invention refers to Data is stored in the memory on the component data card. Obviously, the storage of data on the data card mentioned in the document of the present invention refers to storing the data in the memory on the data card by default; The default is to store data in the memory on the component data card. Obviously, the data on the data card mentioned in the document of the present invention refers to the data in the memory on the data card by default; the data on the component data card mentioned in the document of the present invention refers to the component data by default. data in memory on the card. Obviously, data is stored in the main body of the pickup mentioned in the documents of the present invention, which means that data is stored in the memory in the main body of the pickup by default. The document of the present invention mentions that data is stored in the main body of the pickup, and by default it means that the data is stored in the memory in the main body of the pickup. The data in the pickup main body mentioned everywhere in the document of the present invention refers to the data in the memory in the pickup main body by default. Only contain a storer in the memory set relevant with certain pickup, then each memory store in this memory set obviously refers to this storer.

本发明文件各处所说的非复合语种是指印欧语系的语种或非印欧语系的语种;本发明文件各处所说的任一非复合语种是指印欧语系任一语种或非印欧语系任一语种。本发明文件约定印欧语系的任一语种都不是把世界上其它两种或两种以上现存语种合起来的语种(此印欧语系是指本发明文件所说的印欧语系),本发明文件约定非印欧语系的任一语种都不是把世界上其它两种或两种以上现存语种合起来的语种(此非印欧语系是指本发明文件所说的非印欧语系)。The non-compound language mentioned in the document of the present invention refers to the language of the Indo-European language family or the language of the non-Indo-European language family; any non-compound language mentioned in the document of the present invention refers to any language of the Indo-European language family or any language. The document of the present invention stipulates that any language of the Indo-European language family is not a combination of two or more existing languages in the world (the Indo-European language family refers to the Indo-European language family mentioned in the document of the present invention), and the document of the present invention It is agreed that any language of the non-Indo-European language family is not a language that combines two or more existing languages in the world (this non-Indo-European language family refers to the non-Indo-European language family mentioned in the document of the present invention).

本发明文件各处所说的一单独语种是指印欧语系的某一语种或非印欧语系的某一语种,或是既不属于印欧语系又不属于非印欧语系的世界上某国家或某地区使用的某一语种,或甚至可以是某一臆造语种,但不能是臆造的复合语种(比如不能是象英语加汉语这样的复合语种),世界语可被看成是臆造语种,单独语种显然不是本说明书各处说到的复合语种,任一单独语种都不是把世界上其它两种或两种以上现存语种合起来的语种。本发明文件各处提及的其前面没用‘单独’来修饰的‘语种’,默认不会是既不属于印欧语系又不属于非印欧语系的单独语种(除非从提及处的上下文可推出是),比如提及到‘一个语种’,此语种默认不会是既不属于印欧语系又不属于非印欧语系的单独语种(除非从提及处的上下文可推出是);再比如提及到‘多个语种’,这多个语种中任一语种默认都不会是既不属于印欧语系又不属于非印欧语系的单独语种(除非从提及处的上下文可推出这多个语种中有语种是既不属于印欧语系又不属于非印欧语系的单独语种);再比如提及到‘表示一语种的语句发音的拼合信息的值’,此语种默认不会是既不属于印欧语系又不属于非印欧语系的单独语种(除非从提及处的上下文可推出是)。A separate language mentioned in this document refers to a certain language of the Indo-European language family or a certain language of the non-Indo-European language family, or a certain country or a certain language in the world that neither belongs to the Indo-European language family nor does not belong to the non-Indo-European language family. A certain language used in the region, or even a fictional language, but it cannot be a fictional compound language (for example, it cannot be a compound language such as English and Chinese). Esperanto can be regarded as a fictional language, and a single language is obviously not The composite languages mentioned everywhere in this instruction manual, any single language is not a combination of other two or more existing languages in the world. The "language" that is not modified with "individual" mentioned in this invention document everywhere will not be a separate language that neither belongs to the Indo-European language family nor does not belong to the non-Indo-European language family by default (unless the context of the mention It can be deduced to be), for example, when "a language" is mentioned, this language will not be a separate language that neither belongs to the Indo-European language family nor does not belong to the non-Indo-European language family by default (unless it can be deduced from the context of the mention); then For example, when "multiple languages" is mentioned, any of these multiple languages will not be a separate language that does not belong to the Indo-European language family or non-Indo-European language family by default (unless it can be deduced from the context of the mention that this One of the multiple languages is a separate language that neither belongs to the Indo-European language family nor does not belong to the non-Indo-European language family); for another example, when it is mentioned that "the value of the combined information indicating the pronunciation of a sentence in a language", this language will not be the default A separate language that is neither Indo-European nor non-Indo-European (unless it can be inferred from the context in which it is mentioned).

本发明文件所说的现存语种是指当今在世界某地在报纸或广播或文献或日常生活中正使用的而且是被国家政府或联合国命名为某语种的语种,或是在历史上在世界某地使用过的而且是被国家政府或联合国命名为某语种的语种,比如官方英语,官方法语,官方德语,汉语普通话,官方韩语,官方日语,低地德语,中国文言文,广东话等。本说明书所说的把两种或两种以上现存语种合起来的情况,是指现在人为合起来成为比如本说明书说的复合语种,不包括在历史上因政治,迁移等原因造成的多个自然语种的融合,也不包括当今社会的语种吸收外来词这样的融合。The existing language mentioned in this invention document refers to the language that is being used in newspapers or broadcasting or documents or daily life in a certain place in the world today and is named as a certain language by the national government or the United Nations, or in a certain place in the world in history Languages that have been used and named by the national government or the United Nations, such as official English, official French, official German, Chinese Mandarin, official Korean, official Japanese, Low German, Chinese classical Chinese, Cantonese, etc. The combination of two or more existing languages mentioned in this manual refers to the combination of two or more existing languages, such as the compound languages mentioned in this manual, which does not include multiple natural languages caused by political, migration and other reasons in history. The integration of languages does not include the integration of foreign words absorbed by languages in today's society.

本发明文件各处所说的某语种的语句集,是指语句集中每个语句都是此语种的。The set of sentences of a certain language mentioned in the documents of the present invention refers to that each sentence in the set of sentences is of this language.

本发明文件各处说到的拾音器读取码字以播出语句发音(或类似说法),实际是指拾音器被人操纵以读取码字以播出语句发音,这是显然的,之所以说成拾音器读取码字以播出语句发音是为了叙述简洁。The pick-up that the document of the present invention mentions everywhere reads the code word to broadcast the sentence pronunciation (or similar sayings), actually refers to that the pickup is manipulated by people to read the code word to broadcast the sentence pronunciation, this is obvious, the reason why it is said The code word is read by a pickup to broadcast the pronunciation of the sentence for the sake of brevity.

如提及时无特别指明有他义且不能从提及处的上下文得到有他义,那本发明文件各处提及的其前面没加数量词的数据卡(或部件式数据卡)默认是指一个数据卡(或部件式数据卡)的。If there is no other meaning specified when mentioning and the other meaning cannot be obtained from the context of the mention, then the data card (or component data card) mentioned throughout the document of the present invention without a quantifier in front of it refers to a default data card (or component data card).

拾音器读一个或多个码字后,如得到多个拼合信息的值,那一般是按默认约定来顺序对这多个拼合信息的值进行解码并播出各语句发音来,但也可能是按约定(比如在码字的BIT中有标志BIT表示只播哪些语句)只对这多个拼合信息的值中的数个拼合信息的值进行解码播出数个语句发音来,这两种情况都包含播出一个语句发音的情况。(本段说的‘数个’的值显然小于本段说的‘多个’的值)After the pickup reads one or more codewords, if multiple values of the combined information are obtained, the value of the multiple combined information is generally decoded in order according to the default agreement and the pronunciation of each sentence is played, but it may also be Agreed (such as in the BIT of the code word, there is a sign BIT to indicate which sentences are only played) only the values of several pieces of information in the value of the pieces of information are decoded and several sentences are pronounced. Contains the case where the pronunciation of a sentence is played. (The value of "several" mentioned in this paragraph is obviously smaller than the value of "multiple" mentioned in this paragraph)

本说明书前面说到的拾音器播出语句发音,播出一个语句发音,播出此语句发音,或其它类似说法比如说把此语句播出来或使拾音器的发音器件播出语句发音或码字对应播的语句发音等,可以是播出一个语句的全部发音,本说明书前面说到的拾音器播出多个语句发音,或其它类似说法比如说把多个语句播出来或使拾音器的发音器件播出多个语句发音等,可以是播出多个语句中每个语句的全部发音;但实际中也可能出现是按约定(比如在码字的BIT中有标志BIT)只播某语句的发音的一部分(比如有声读物的一个码字的值有一表示一语句发音的拼合信息的值,此语句含10个单词,拾音器读此码字后,按码字里的拼合信息的值解码时,按约定只播其前面5个单词的音;或比如有声读物的一个码字的值有一表示一语句发音的拼合信息的值,此语句含3个句子,拾音器读此码字后,按码字里的拼合信息的值解码时,按约定只播其第1个句子和第3个句子的音)。The pickup mentioned earlier in this manual broadcasts the pronunciation of a sentence, broadcasts the pronunciation of a sentence, broadcasts the pronunciation of this sentence, or other similar sayings such as broadcasting this sentence or making the pronunciation device of the pickup broadcast the pronunciation of a sentence or the corresponding code word The pronunciation of the sentence to be broadcast, etc., can be to broadcast all the pronunciation of a sentence, the pickup mentioned earlier in this manual broadcasts the pronunciation of multiple sentences, or other similar sayings such as broadcasting multiple sentences or making the pronunciation device of the pickup play Pronunciation of a plurality of sentences etc., can be to broadcast the whole pronunciation of each sentence in a plurality of sentences; But in reality also may appear to be by agreement (for example in the BIT of code word, have sign BIT) only broadcast the pronunciation of certain sentence A part (for example, the value of a codeword in an audiobook has a value representing the combined information of the pronunciation of a sentence. This sentence contains 10 words. After the pickup reads the codeword, when decoding according to the value of the combined information in the codeword, according to the agreement Only broadcast the sound of its preceding 5 words; Or such as the value of a code word of audiobook has the value of the splicing information that represents the pronunciation of a sentence, this sentence contains 3 sentences, after the pickup reads this code word, press the value in the code word When the value of the combined information is decoded, only the sounds of the first sentence and the third sentence are played as agreed).

本发明文件约定象上一段说的那样把一个语句的非全部发音播出来的情况也叫做播出语句发音,即本说明书前面各处说的拾音器播出语句发音,播出一个语句发音,播出此语句发音,或其它类似说法比如说把此语句播出来或使拾音器的发音器件播出语句发音或码字对应播的语句发音等,既可以是播出此语句的全部发音,也可以是播出此语句的非全部发音;而本说明书前面各处说的拾音器播出多个语句发音,或其它类似说法比如说把多个语句播出来或使拾音器的发音器件播出多个语句发音等,既可以是播出此多个语句中每个语句的全部发音,也可以是播出此多个语句中每个语句的非全部发音,也可以是播出此多个语句中部分语句中每个语句的全部发音以及此多个语句中另一部分语句中每个语句的非全部发音。而且本段说的播出一个语句发音是本段说的播出多个语句发音的下位概念,因为播出多个语句发音包括播出一个语句发音的情况;至于本发明文件各处说的播出一个或多个语句发音,是指播出一个语句发音或播出多个语句发音,显然分别按本段内以上所述播出一个语句发音或播出多个语句发音的情况理解即可;在权利要求书里各处提到的播出语句发音,既可以是播出语句的全部发音,也可以是播出语句的非全部发音,(以上3句话说的语句都是指单个的语句)。本段说的播出语句发音可以是本说明书前面述的用MCU或语音芯片驱动喇叭来播出语句发音,也可以是本说明书前面述的用TTS器件驱动喇叭来播出语句发音或其它方式等。The document of the present invention stipulates that the situation that the non-full pronunciation of a sentence is played out is also called the broadcast sentence pronunciation as said in the previous paragraph, that is, the pickup that is said everywhere in the front of this manual broadcasts the sentence pronunciation, broadcasts a sentence pronunciation, and broadcasts the sentence pronunciation. Pronunciation of this sentence, or other similar sayings such as broadcasting this sentence or making the pronunciation device of the pickup to broadcast the pronunciation of the sentence or the sentence pronunciation of the corresponding broadcast of the code word, etc., both can be the whole pronunciation of the broadcast, or can It is to broadcast not all pronunciations of this sentence; while the pickup mentioned in the front of this manual broadcasts multiple sentence pronunciations, or other similar sayings such as broadcasting multiple sentences or making the pronunciation device of the pickup play multiple sentences Pronunciation, etc., can be to broadcast all the pronunciations of each sentence in the multiple sentences, or to broadcast not all pronunciations of each sentence in the multiple sentences, or to broadcast some of the sentences in the multiple sentences The full pronunciation of each sentence in , and the partial pronunciation of each sentence in another part of the multiple sentences. And the broadcasting of a sentence pronunciation that this paragraph says is the subordinate concept of broadcasting a plurality of sentence pronunciations that this paragraph says, because broadcasting a plurality of sentence pronunciations includes the situation of broadcasting a sentence pronunciation; Pronunciation of one or more sentences refers to the pronunciation of one sentence or the pronunciation of multiple sentences. Obviously, it is sufficient to understand the pronunciation of one sentence or the pronunciation of multiple sentences as described above in this paragraph; The pronunciation of the broadcast sentence mentioned in the claims can be the entire pronunciation of the broadcast sentence, or not all the pronunciation of the broadcast sentence, (the sentences mentioned in the above three sentences all refer to a single sentence) . The pronunciation of the broadcast sentence mentioned in this paragraph can be the pronunciation of the sentence by using the MCU or voice chip to drive the speaker as mentioned earlier in this manual, or it can be the pronunciation of the sentence by driving the speaker with a TTS device as mentioned above in this manual, or other methods, etc. .

本发明文件述及播出语句的非全部发音,实际是指播出语句的一部分发音,这样写是为了不与本发明文件所说的部分预备发音数据混淆,本发明文件所说的语句的非全部预备发音数据就是用于产生非全部发音的;如播出语句的全部发音,在整步直接发音时,是由MCU把语句的全部预备发音数据转成语句的全部发音播出,在整步TTS发音时,是由MCU把语句的全部预备发音数据传给TTS器件以播出语句的全部发音,在分步直接发音时,是由MCU把组成语句的全部预备发音数据的各部分预备发音数据依次播出音来以形成语句的全部发音播出,在分步TTS发音时,是由MCU把组成语句的全部预备发音数据的各部分预备发音数据依次传给TTS器件播出音来以形成语句的全部发音播出。如播出语句的非全部发音,在整步直接发音时,是由MCU把语句的非全部预备发音数据转成语句的非全部发音播出,在整步TTS发音时,是由MCU把语句的非全部预备发音数据传给TTS器件以播出语句的非全部发音,在分步直接发音时,是由MCU把组成语句的非全部预备发音数据的各部分预备发音数据依次播出音来以形成语句的非全部发音播出,在分步TTS发音时,是由MCU把组成语句的非全部预备发音数据的各部分预备发音数据依次传给TTS器件播出音来以形成语句的非全部发音播出。The document of the present invention refers to the non-full pronunciation of the broadcast sentence, actually refers to a part of the pronunciation of the broadcast sentence, and it is written in order not to confuse with the said part of the prepared pronunciation data of the document of the present invention. All the prepared pronunciation data are used to produce non-full pronunciations; such as the whole pronunciation of the broadcast sentence, when the whole step is directly pronounced, the MCU converts all the prepared pronunciation data of the sentence into the whole pronunciation of the sentence and broadcasts it. When TTS is pronounced, the MCU transmits all the preparatory pronunciation data of the sentence to the TTS device to broadcast all the pronunciation of the sentence. When the pronunciation is performed step by step, the MCU sends the preparatory pronunciation data of each part of the complete preparatory pronunciation data of the sentence. The pronunciation is broadcasted sequentially to form the entire pronunciation of the sentence. During the step-by-step TTS pronunciation, the MCU sends all the preliminary pronunciation data of the sentence to the TTS device to play the pronunciation to form the sentence. The full pronunciation of is broadcast. For example, if the non-full pronunciation of the sentence is broadcast, when the full-step pronunciation is directly pronounced, the MCU converts the non-full prepared pronunciation data of the sentence into the non-full pronunciation of the sentence. Not all of the preparatory pronunciation data is sent to the TTS device to broadcast the non-full pronunciation of the sentence. When the step-by-step direct pronunciation is performed, the MCU plays the pronunciation of each part of the non-full preparatory pronunciation data of the sentence to form The non-full pronunciation of sentence is broadcasted, and when step-by-step TTS is pronounced, it is to be sent to the TTS device to broadcast the pronunciation to form the non-full pronunciation broadcast of sentence by MCU sending each part of the non-full prepared pronunciation data of forming sentence to the TTS device to play out the pronunciation successively out.

因为播出语句发音时伴随着播放语句发音的动作,所以本发明文件所说的播放语句发音或类似说法也如上3段述的播出语句发音的情况,即播放语句发音可以是播放语句的全部发音或播放语句的非全部发音;本发明文件所说的播放多个语句发音或类似说法也如上3段述的播出多个语句发音的情况,即既可以是播放此多个语句中每个语句的全部发音,也可以是播放此多个语句中每个语句的非全部发音,也可以是播放此多个语句中部分语句中每个语句的全部发音以及此多个语句中另一部分语句中每个语句的非全部发音。Because the broadcast sentence pronunciation is accompanied by the action of the playback sentence pronunciation, so the said playback sentence pronunciation or similar sayings in the document of the present invention are also the situation of the broadcast sentence pronunciation described in the above 3 paragraphs, that is, the playback sentence pronunciation can be all of the playback sentence Pronunciation or the non-full pronunciation of playing sentence; The said playing multiple sentence pronunciation of the present invention file or the situation of similar saying also broadcasting a plurality of sentence pronunciation as above 3 paragraphs, promptly both can be to play each in this multiple sentence All the pronunciations of the sentence, or playing not all the pronunciations of each sentence in the multiple sentences, or playing all the pronunciations of each sentence in some of the sentences in the multiple sentences and another part of the sentences in the multiple sentences Partial pronunciation of each sentence.

显然,播出语句的全部发音与播出语句的非全部发音没什么本质不同,播放语句的全部发音与播放语句的非全部发音没什么本质不同。Obviously, there is no essential difference between the full pronunciation of the broadcast sentence and the partial pronunciation of the broadcast sentence, and there is no essential difference between the full pronunciation of the broadcast sentence and the partial pronunciation of the broadcast sentence.

因为解码时可能采用整步直接,分步直接,整步TTS,分步TTS等发音方式,所以如采用分步直接或分步TTS发音方式,如做播放语句的非全部发音时是要在解码过程中来作相应处理的(只对解出的一部分部分预备发音数据做发音处理)。如采用整步直接或整步TTS发音方式,如是播放语句的非全部发音,那既可以解码得出语句的全部预备发音数据,再把此语句的预备发音数据的一部分(即语句的非全部预备发音数据)拿去发音得到语句的非全部发音(本说明书各处是按这一种方式来述的),也可解码时只解码得出语句的非全部预备发音数据并拿去发音得到语句的非全部发音,后一种方式显然是前一种方式的等同替换。Because the pronunciation methods such as full-step direct, step-by-step direct, full-step TTS, and step-by-step TTS may be used during decoding, so if the step-by-step direct or step-by-step TTS pronunciation method is used, for example, when not all pronunciations of playback sentences are to be decoded In the process, corresponding processing is carried out (only a part of the pre-pronunciation data which is solved is processed for pronunciation). Directly or whole step TTS pronunciation mode as adopting the whole step, so the partial pronunciation of playing sentence, that both can decode and get all the preparation pronunciation data of sentence, then a part of the preparation pronunciation data of this sentence (i.e. the non-full preparation pronunciation of sentence) Pronunciation data) to get the non-full pronunciation of the sentence (this manual is described in this way everywhere), or only decode the non-full pre-pronunciation data of the sentence when decoding and take it to the pronunciation to obtain the pronunciation of the sentence Not all pronunciations, the latter way is obviously an equivalent replacement of the former way.

本发明文件提到的语句的预备发音数据,与语句的全部预备发音数据是相同所指。本发明文件提到的语句的全部预备发音数据,是指语句的预备发音数据的全部,本发明文件提到的语句的非全部预备发音数据,是指语句的预备发音数据的部分。本发明文件提到的语句的全部拼法格式数据,是指语句的拼法格式数据的全部,本发明文件提到的语句的非全部拼法格式数据,是指语句的拼法格式数据的部分。The pre-pronunciation data of the sentence mentioned in the document of the present invention refers to the same as all the pre-pronunciation data of the sentence. All the pre-pronunciation data of the sentence mentioned in the document of the present invention refers to all the pre-pronunciation data of the sentence, and the part of the pre-pronunciation data of the sentence mentioned in the document of the present invention refers to the part of the pre-pronunciation data of the sentence. All the spelling format data of the sentence mentioned in the document of the present invention refers to all of the spelling format data of the sentence, and the non-all spelling format data of the sentence mentioned in the document of the present invention refers to the part of the spelling format data of the sentence .

本发明文件所说的语句的预备发音数据,在解码用直接发音方式时和解码用TTS发音方式时是不同格式的数据。The pre-pronunciation data of the sentences mentioned in the document of the present invention are data in different formats when the direct pronunciation mode is used for decoding and when the TTS pronunciation mode is used for decoding.

如提及时无特别指明有他义也不能从提及处的上下文推出有他义,本发明文件各处提及的预备发音数据(其前面没加‘语句’来修饰的),是默认是某语句的预备发音数据。If there is no special indication when mentioning that there is other meaning and it cannot be deduced from the context of the mention that there is other meaning, the prepared pronunciation data mentioned in the document of the present invention (modified without adding "sentence" in front of it) is a certain one by default. Prepared pronunciation data for sentences.

本说明书各处有说到把一些码字定义成某拾音器能读的部诸码字,这部诸码字中每个码字是都可被此拾音器读取以用于此拾音器播出语句发音的(拾音器可以是每次读一个或多个码字来解码播出语句发音)。This instruction manual has mentioned that some codewords are defined as all codewords that can be read by a pickup, and each codeword in these codewords can be read by the pickup for the pronunciation of sentences broadcast by the pickup (the pickup can read one or more codewords at a time to decode the pronunciation of the broadcast sentence).

本发明文件所说的拾音器读数个码字后播出语句发音,可能是人用拾音器读了数个码字后拾音器自己就播出数个语句发音(播出数个语句发音包括播出语句发音的情况),也可能是人用拾音器读数个码字后拾音器再根据人的按键操作来播出数个语句发音(播出数个语句发音包括播出语句发音的情况),这些方式互相是等同替换。The said pickup device of the present invention reads several code words and broadcasts the sentence pronunciation after reading several code words, may be that after the people read several code words with the pickup device, the pickup itself just broadcasts several sentence pronunciations (broadcasting several sentence pronunciations includes broadcasting sentence pronunciation situation), it may also be that people use the pickup to read several codewords, and the pickup then broadcasts several sentence pronunciations according to people's button operations (the broadcasting of several sentence pronunciations includes the broadcasting of sentence pronunciations), these methods are equivalent to each other replace.

在权利要求书中多处提到的一个(或多个)数据卡与所述语音编码有声读物和所述拾音器主体配套,一般是指与此语音编码有声读物和此拾音器主体一起配套销售等;在权利要求书中多处提到的一个(或多个)部件式数据卡与所述语音编码有声读物和所述拾音器主体配套,一般是指与此语音编码有声读物和此拾音器主体一起配套销售等。The one (or more) data cards mentioned in multiple places in the claims are matched with the speech-encoded audiobook and the pickup body, which generally means that the speech-encoded audiobook and the pickup body are sold together; The one (or more) part-type data cards mentioned in multiple places in the claims are matched with the speech-coded audiobook and the pickup body, which generally means that the speech-coded audiobook and the pickup body are sold together. wait.

本发明文件各处涉及到的播语音以及录音数据的制作,默认都是针对单声道的,这是常识,如采用多声道的方式,那显然是单声道的等同替换。The making of broadcasting voice and recording data involved in various places of the present invention document is all aimed at monaural by default, and this is common sense, as adopting the mode of multi-channel, that is obviously the equivalent replacement of mono.

本发明文件所说的某拼合信息的值是某语种的,是指此拼合信息的值是此语种的拼合信息的值。The document of the present invention says that the value of a certain combined information is of a certain language, which means that the value of the combined information is the value of the combined information in this language.

本发明文件所说的某语种的一个编码语句集里的各语句都是此语种的。本发明文件所说的某语种的一个解码语句集里的各语句都是此语种的。本发明文件各处提到的某编码规则中有关某语种的编码细则定义的编码语句集,默认是此语种的编码语句集;本发明文件各处提到的某解码规则中有关某语种的解码细则定义的解码语句集,默认是此语种的解码语句集。Each sentence in a coded sentence set of a certain language mentioned in the document of the present invention is all in this language. Each sentence in a decoding sentence set of a certain language mentioned in the document of the present invention is all in this language. The coded statement set defined by the coding rules of a certain language in the certain coding rules mentioned in the documents of the present invention is the coded sentence set of this language by default; the decoding of a certain language in the certain decoding rules mentioned in the documents of the present invention The decoding statement set defined by the rules, the default is the decoding statement set of this language.

本说明书所说的码字上的校验位,是指码值的校验位,比如奇偶校验位等。The check bit on the code word mentioned in this specification refers to the check bit of the code value, such as a parity check bit.

一个单独语种的语句,也可以被看成是复合语种的语句,比如一个复合语种是英语加汉语,那一个英语语种的语句也可被看成是这个复合语种(英语加汉语)的语句(是作为这个复合语种语句的特例),如另一个复合语种是英语加韩语,那这一个英语语种的语句也可被看成是这一个复合语种(英语加韩语)的语句(是作为这一个复合语种语句的特例)。A sentence of a single language can also be regarded as a sentence of a compound language. For example, a compound language is English plus Chinese, and a sentence of that English language can also be regarded as a sentence of this compound language (English plus Chinese). As a special case of this compound language sentence), if another compound language is English plus Korean, then this English language sentence can also be regarded as a sentence of this compound language (English plus Korean) (as this compound language statement).

一个语句可能属于多个语种,比如有一个其含有的句体都是英语句体的语句,即此语句是英语语种的语句,此语句可能被包括在某编码规则中有关英语语种的编码细则定义的编码语句集里,而此语句也可能被包括在某编码规则中有关一复合语种(此复合语种是英语加汉语)的编码细则定义的编码语句集里即此语句也属于此复合语种;还可能出现这种情况:有的语句,既属于某官方用语,又属于某方言,而此官方用语和此方言是被定义成两个不同语种的。A sentence may belong to multiple languages. For example, if there is a sentence whose sentence body is English, that is, the sentence is a sentence in the English language. This sentence may be included in the definition of the coding rules for the English language in a certain coding rule. In the coded sentence set, and this sentence may also be included in the coded sentence set defined by the coding rules about a compound language (this compound language is English plus Chinese) in a certain coding rule, that is, this sentence also belongs to this compound language; It may happen that some sentences belong to both an official language and a certain dialect, and the official language and the dialect are defined as two different languages.

本发明文件各处提到的‘一部诸码字’或‘某部诸码字’或‘此部诸码字’,都是指一部诸码字(此部诸码字中有数个码字),而不是指部诸码字中的一个码字。The 'all codewords of a part' or 'all codewords of a certain part' or 'all codewords of this part' mentioned in the documents of the present invention all refer to a part of all codewords (there are several codewords in all codewords of this part) word), rather than referring to a codeword in all codewords.

权利要求书各处有提到的‘包括前述一个码字在内的所述部诸码字中的多个码字’,其中的‘包括前述一个码字在内’是修饰其中的‘多个码字’的。There are references to 'a plurality of codewords in the codewords including the aforementioned one codeword' mentioned throughout the claims, in which 'including the aforementioned one codeword' is used to modify the 'multiple codewords' codeword'.

如有对一个语种的语句进行编码的一些编码细则是分开处于多个不同编码规则里的,那要使用此语种语句编码用的这些编码细则时,可把这多个编码规则合起来看成是一个编码规则,那本段前面述的这些编码细则就是此合起来的编码规则中有关此语种的编码细则;如有对一个语种的拼合信息的值进行解码的一些解码细则是分开处于多个不同解码规则里的,那要使用此语种的这些解码细则时,可把这多个解码规则合起来看成是一个解码规则,那本段前面述的这些解码细则就是此合起来的解码规则中有关此语种的解码细则。If there are some encoding details for encoding sentences of a language that are separated in a plurality of different encoding rules, when these encoding rules for encoding sentences in this language are to be used, these multiple encoding rules can be combined as An encoding rule, the encoding rules mentioned above in this paragraph are the encoding rules related to this language in the combined encoding rules; In the decoding rules, when using these decoding rules of this language, these multiple decoding rules can be combined as a decoding rule, and the decoding rules mentioned above in this paragraph are related to the combined decoding rules. The decoding rules for this language.

关于本说明书所说的句体概念:因为一个语句可以是由数个完整句子组成,或由数个非完整句子组成,或由数个完整句子加数个非完整句子组成,那可把其中的各完整句子看成是句体,把其中的各非完整句子也看成是句体,这样一个语句就是由一些句体组成的;不论一个完整句子还是一个非完整句子,其成分都是同一单独语种的,比如一个句子中不可能既有汉语词又有英语单词,如语句中出现几个汉语词和几个英语单词挨在一起的情况,那这几个汉语词应看成是一个完整句子或一个非完整句子,这几个英语单词应看成是一个完整句子或一个非完整句子;本段所说的完整句子可以是不包括分句的,也可以是包括分句的,本段所说的非完整句子可以是不包括分句的,也可以是包括分句的。Regarding the sentence body concept mentioned in this manual: because a sentence can be composed of several complete sentences, or composed of several incomplete sentences, or composed of several complete sentences plus several incomplete sentences, then the Each complete sentence is regarded as a sentence body, and each non-complete sentence is also regarded as a sentence body. Such a sentence is composed of some sentence bodies; no matter a complete sentence or an incomplete sentence, its components are the same and separate For example, it is impossible to have both Chinese words and English words in a sentence. If there are several Chinese words and several English words next to each other in a sentence, then these Chinese words should be regarded as a complete sentence Or an incomplete sentence, these English words should be regarded as a complete sentence or an incomplete sentence; the complete sentence mentioned in this paragraph may not include clauses, or it may include clauses. The incomplete sentences mentioned may not include clauses or may include clauses.

本说明书各处说到一个复合语种时,述及解码时,涉及的此复合语种的诸音码的发音数据中的非复合语种(此非复合语种是此复合语种包含的)的相关音码的发音数据,是相当于一个解码规则中有关此非复合语种的解码细则用到的此非复合语种的诸音码的发音数据的,一般就是一些音码(比如元辅音音码)的录音数据的集合。When referring to a compound language everywhere in this specification, when referring to the decoding, the relevant phonetic codes of the non-compound language (this non-compound language is included by this compound language) in the pronunciation data of all phonetic codes of this compound language involved Pronunciation data is equivalent to the pronunciation data of all phonetic sign indicating numbers of this non-compound language used in the decoding rules about this non-composite language in a decoding rule, and generally is exactly the recording data of some sound sign indicating numbers (such as vowel and consonant sound sign indicating numbers) gather.

本说明书述及编码时说到的‘使用的音码情况’,是指用哪些音码来拼合编码语句集发音,此编码语句集是编码时定义的;本说明书述及解码时说到的‘使用的音码情况’,是指用哪些音码来拼合解码语句集发音,此解码语句集是解码时定义的。本说明书所说的某编码规则中有关某语种的编码细则里使用的音码情况,是指用哪些音码来拼合编码语句集发音,此编码语句集是此编码规则中有关此语种的编码细则定义的;本说明书所说的某解码规则中有关某语种的解码细则里使用的音码情况,是指用哪些音码来拼合解码语句集发音,此解码语句集是此解码规则中有关此语种的解码细则定义的。The "sound code situation used" mentioned in this manual when referring to encoding refers to which phonetic codes are used to assemble the pronunciation of the coded sentence set. This coded sentence set is defined during encoding; The used phonetic code situation' refers to which phonetic codes are used to assemble the pronunciation of the decoding sentence set, and this decoding sentence set is defined during decoding. The phonetic codes used in the coding rules of a certain language in a certain coding rule mentioned in this manual refer to which phonetic codes are used to assemble the pronunciation of the coded sentence set. This coded sentence set is the coded rules about this language in this coding rule. Defined; the phonetic codes used in the decoding rules of a certain language in a certain decoding rule mentioned in this manual refer to which phonetic codes are used to assemble the pronunciation of the decoding sentence set. This decoding sentence set is related to this language in the decoding rule. Defined by the decoding rules.

本说明书所说的‘权利要求’与‘权利要求项’是相同所指。The terms 'claims' and 'claims' mentioned in this specification refer to the same thing.

本说明书说的‘权利要求3,12,20或23’是指权利要求3或12或20或23,其它类似说发类推。The 'claim 3, 12, 20 or 23' mentioned in this specification refers to claim 3 or 12 or 20 or 23, and other similar claims are deduced by analogy.

本说明书各处说的‘多个语种可以全是非复合语种或全是复合语种或既含有非复合语种又含有复合语种’,显然此多个语种中的各语种是互不相同的,比如两个语种可以全是非复合语种,那可以这两个语种一个是英语一个是法语。It is said that 'multiple languages can be all non-composite languages or all compound languages or contain both non-composite languages and compound languages' mentioned in this manual. Obviously, the languages in these multiple languages are different from each other, for example, two The languages can be all non-compound languages, so one of the two languages can be English and the other French.

本说明书有述及的常规语言领域,就是指平常说话,报纸,广播电视,书籍,教材等日常涉及的使用语言的领域。The conventional language field mentioned in this manual refers to the field of daily use of language involved in daily speech, newspapers, radio and television, books, teaching materials, etc.

本说明书各处说的解码时涉及的发音形式,是指按某解码规则中有关某语种的解码细则对一语种的拼合信息的值进行解码,此解码规则中有关此语种的解码细则涉及的发音形式;本说明书各处说的对一语种的拼合信息的值解码时涉及的发音形式,是指按某解码规则中有关某语种的解码细则对此语种的拼合信息的值进行解码,此解码规则中有关此语种的解码细则涉及的发音形式。The pronunciation forms involved in decoding mentioned throughout this manual refer to the decoding of the value of the combined information of a language according to the decoding rules of a certain language in a certain decoding rule. The pronunciation involved in the decoding rules of this decoding rule form; the pronunciation form involved in decoding the value of a language’s combined information mentioned in this manual refers to decoding the value of this language’s combined information according to the decoding rules of a certain language in a certain decoding rule. This decoding rule The pronunciation forms involved in the decoding rules for this language.

本说明书说到的‘将其涉及的发音形式是全用TTS发音方式的适用于多个语种的解码规则中有关其中某语种的解码细则’,其中第一个‘其’字是指‘解码规则中有关其中某语种的解码细则’,其中‘其涉及的发音形式是全用TTS发音方式’是修饰‘解码规则中有关其中某语种的解码细则’的;本说明书说到的‘将按其涉及的发音形式是全用TTS发音方式的解码规则中有关一个语种的解码细则来作的解码’,其中‘其’字是指‘解码规则中有关一个语种的解码细则’,其中‘其涉及的发音形式是全用TTS发音方式’是修饰‘解码规则中有关一个语种的解码细则’的;本说明书说到的‘将用其编码得到的所有拼合信息的值都是本来用于采用TTS发音方式来解码的只适用于一个语种的编码规则’,其中‘其’字是指‘只适用于一个语种的编码规则’,其中‘用其编码得到的所有拼合信息的值都是本来用于采用TTS发音方式来解码’是修饰‘只适用于一个语种的编码规则’的;本说明书说到的‘将用其编码得到的所有拼合信息的值都是本来用于采用TTS发音方式来解码的适用于多个语种的编码规则中有关其中某语种的编码细则’,其中第1个‘其’字是指‘编码规则中有关其中某语种的编码细则’,其中‘用其编码得到的所有拼合信息的值都是本来用于采用TTS发音方式来解码’是修饰‘编码规则中有关其中某语种的编码细则’的;本说明书说到的‘将按用其编码得到的所有拼合信息的值都是本来用于采用TTS发音方式来解码的编码规则中有关一个语种的编码细则来作的编码’,其中‘其’字是指‘编码规则中有关一个语种的编码细则’,其中‘用其编码得到的所有拼合信息的值都是本来用于采用TTS发音方式来解码’是修饰‘编码规则中有关一个语种的编码细则’的;本说明书中其它与本段引用的说法类似的说法,按此类推理解。In this manual, "the pronunciation forms involved are all in the TTS pronunciation mode, and the decoding rules applicable to multiple languages are related to the decoding rules of one of the languages", in which the first word "its" refers to the "decoding rules" "decoding rules for a certain language" in the "decoding rules for a certain language" in which "the pronunciation forms involved are all in the TTS pronunciation mode" is a modification of "the decoding rules for a certain language in the decoding rules"; The pronunciation form is decoded with the decoding rules of a language in the decoding rules of the TTS pronunciation method', where the word 'its' refers to the decoding rules of a language in the decoding rules', where the pronunciation involved The form is all in the TTS pronunciation mode, which modifies the decoding rules related to a language in the decoding rules; the value of all the combined information that will be encoded by this manual is originally used to use the TTS pronunciation mode. Decoded encoding rules applicable only to one language', wherein the word 'its' refers to 'encoding rules applicable only to one language', wherein 'the value of all the combined information obtained by encoding it is originally used for TTS pronunciation Decoding by way of modifying 'encoding rules only applicable to one language'; the value of 'all combined information that will be encoded using it' mentioned in this manual is originally used for decoding by using TTS pronunciation method and is applicable to multiple languages. The coding rules for one of the languages in the coding rules for one language', where the first word 'its' refers to the coding rules for one of the languages in the coding rules', where 'the value of all the combined information obtained by using its coding Both are originally used to decode by using the TTS pronunciation method' to modify the encoding rules of a certain language in the encoding rules'; the value of 'all the combined information obtained by using its encoding' mentioned in this manual is originally used In the coding rules related to a language in the coding rules for decoding by using the TTS pronunciation method', the word "its" refers to the coding rules related to a language in the coding rules', where "all the codes obtained by using its coding The value of the combined information is originally used to decode by using the TTS pronunciation method, which is to modify the encoding rules of a language in the encoding rules; other expressions similar to those quoted in this paragraph in this manual should be understood by analogy.

显然,如一编码规则中有关某语种的编码细则含有压缩编码处理,那用此编码规则中有关某语种的编码细则来编码生成拼合信息的值时一般是有压缩编码处理的。显然,如一解码规则中有关某语种的解码细则含有压缩解码处理,那用此解码规则中有关某语种的解码细则来对拼合信息的值进行解码时一般是有压缩解码处理的。Apparently, if the encoding rules about a certain language in an encoding rule contain compressed encoding processing, then there is generally compression encoding processing when using the encoding rules about a certain language in this encoding rule to encode the value of the combined information. Obviously, if the decoding rules related to a certain language in a decoding rule include compression and decoding processing, then the value of the combined information is generally decoded by using the decoding rules related to a certain language in the decoding rule.

本发明文件所说的在数据库(比如单词数据库)里存储数据,实际是指把数据放在数据库里。Store data in database (such as word database) said in the document of the present invention, actually refer to that data is placed in database.

本发明文件各处提及的‘与一编码规则有关某语种对应的解码规则中有关此语种的解码细则’,其中‘与一编码规则有关某语种对应’是修饰其中的‘解码规则’的;本发明文件其它类似于‘与一编码规则有关某语种对应的解码规则中有关此语种的解码细则’这样的说法,也按此类推理解。The "decoding rules related to a certain language in the decoding rules corresponding to an encoding rule" mentioned throughout the document of the present invention, wherein "corresponding to a certain language related to an encoding rule" is a modification of the "decoding rule"; Other expressions in the document of the present invention that are similar to "decoding rules related to this language in the decoding rules corresponding to a certain language related to an encoding rule" can also be understood by analogy.

本发明文件所说的任一编码规则,一般都有这些共同特征:可适用于一个或多个语种,对这一个或多个语种中的每个语种都有此编码规则中有关此语种的编码细则,对这一个或多个语种中的每个语种,此编码规则中有关此语种的编码细则都有定义编码语句集(可采用不同定义方式,甚至可以是隐含定义,甚至可以是本说明书没提到的定义编码语句集的方式),此编码规则中有关此语种的编码细则一般都有压缩编码处理(可以是本说明书有述的或没述的压缩编码处理方式);除了本说明书前面所述的一些编码规则举例外,实际中还会有一些其它形式的本说明书没提及的编码规则(这些编码规则也符合本段前面述的编码规则的那些共同特征)。本发明文件所说的任一解码规则,一般都有这些共同特征:可适用于一个或多个语种,对这一个或多个语种中的每个语种都有此解码规则中有关此语种的解码细则,对这一个或多个语种中的每个语种,此解码规则中有关此语种的解码细则都有定义解码语句集(可采用不同定义方式,甚至可以是隐含定义,甚至可以是本说明书没提到的定义解码语句集的方式),此解码规则中有关此语种的解码细则一般都有压缩解码处理(可以是本说明书有述的或没述的压缩解码处理方式);除了本说明书前面所述的一些解码规则举例外,实际中还会有一些其它形式的本说明书没提及的解码规则(这些解码规则也符合本段前面述的解码规则的那些共同特征)。并且本说明书前面主要述的是这样的情况:对一个语句,是按某编码规则中有关此语句的语种的编码细则来编码生成拼合信息的值,再按某解码规则中有关此语句的语种的解码细则来对此拼合信息的值进行解码播放语句发音的且此解码规则与此编码规则有关此语种是对应的,且此编码规则中有关此语句的语种的编码细则是有压缩编码处理的,此编码规则中有关此语句的语种的编码细则有定义编码语句集,且此解码规则中有关此语句的语种的解码细则是有压缩解码处理的,此解码规则中有关此语句的语种的解码细则有定义解码语句集,(假设从前面‘对一个语句’起到这里‘定义解码语句集’止所述的这种编解码情况是情形1);但实际中可能还会有一些更复杂的情况,比如:Any encoding rule mentioned in the present invention document generally has these common characteristics: it can be applicable to one or more languages, and each language in this one or more languages has the relevant coding of this language in this encoding rule. Detailed rules, for each of the one or more languages, the coding rules related to this language in this coding rule have a definition coded sentence set (different definition methods can be used, even implicit definitions, or even this manual The method of defining the coded sentence set is not mentioned), and the coding rules of this language in this coding rule generally have compression coding processing (it can be the compression coding processing method described or not described in this specification); In addition to the examples of some encoding rules mentioned above, there are actually some other forms of encoding rules not mentioned in this specification (these encoding rules also conform to the common features of the aforementioned encoding rules in this paragraph). Any decoding rule mentioned in the document of the present invention generally has these common features: it can be applied to one or more languages, and each language in the one or more languages has the decoding related to this language in this decoding rule. Detailed rules, for each of the one or more languages, the decoding rules related to this language in the decoding rules have a definition of the decoding sentence set (different definition methods can be used, even implicit definitions, or even this manual The method of defining the decoding sentence set is not mentioned), and the decoding rules for this language in this decoding rule generally have compression decoding processing (it can be the compression decoding processing method described or not described in this manual); In addition to the examples of some decoding rules mentioned above, there are actually some other forms of decoding rules that are not mentioned in this specification (these decoding rules also conform to the common features of the above-mentioned decoding rules in this paragraph). And the main description in the front of this manual is such a situation: for a sentence, the value of the combined information is generated by encoding according to the coding rules related to the language of the sentence in a certain coding rule, and then according to the language of the sentence in a certain decoding rule The decoding rules are used to decode the value of the combined information to play the pronunciation of the sentence, and the decoding rules are related to the coding rules. The encoding rules for the language of this statement in this encoding rule define the encoding statement set, and the decoding rules for the language of this sentence in this decoding rule are processed by compression decoding, and the decoding rules for the language of this sentence in this decoding rule There is a set of decoding statements defined, (assuming that the encoding and decoding situation described above from the previous "to a statement" to "defining the set of decoding statements" here is situation 1); but in practice, there may be some more complicated situations ,for example:

1,如把一个语种的方言和官方用语当成两个不同语种,那对方言里的某些语句中的任一语句,按某编码规则中有关此方言的编码细则来编码生成的表示此语句发音的拼合信息的值,可能被某解码规则中有关此官方用语的解码细则来解码的,2,对一个非复合语种的语句,那按某编码规则中有关此非复合语种的编码细则来编码生成的拼合信息的值,可能可被某解码规则中有关某复合语种的解码细则来解码的(比如这里说的非复合语种是英语,这里说的复合语种是英语加汉语),3,对一个非复合语种的语句,那按某编码规则中有关此非复合语种的直播方式的编码细则来编码生成的拼合信息的值,可能可被某解码规则中有关某复合语种的TTS方式的解码细则来解码的(比如这里说的非复合语种是英语,这里说的复合语种是英语加汉语),4,对一个非复合语种的语句,那按某编码规则中有关某复合语种的TTS方式的编码细则来编码生成的拼合信息的值,可能可被某解码规则中有关此复合语种的直播方式的解码细则来解码的(比如这里说的非复合语种是英语,这里说的复合语种是英语加韩语),5,对一个非复合语种的语句,那按某编码规则中有关某一复合语种的直播方式的编码细则来编码生成的拼合信息的值,可能可被某解码规则中有关另一复合语种的直播方式(TTS方式)的解码细则来解码的(比如这里说的非复合语种是英语,这里说的某一复合语种是英语加汉语加韩语,这里说的另一复合语种是英语加汉语加日语)。1. If the dialect and the official language of a language are regarded as two different languages, then any sentence in some sentences in the dialect is coded according to the coding rules of the dialect in a certain coding rule to represent the pronunciation of the sentence The value of the combined information may be decoded by the decoding rules of the official language in a certain decoding rule. 2. For a statement in a non-compound language, it is encoded and generated according to the coding rules of the non-compound language in a certain coding rule The value of the combined information may be decoded by the decoding rules of a certain compound language in a certain decoding rule (for example, the non-compound language mentioned here is English, and the compound language mentioned here is English plus Chinese), 3. For a non-compound language For a compound language sentence, the value of the combined information generated by encoding according to the coding rules of the non-complex language live broadcast method in a certain coding rule may be decoded by the decoding rules of a certain compound language TTS method in a certain decoding rule (For example, the non-compound language mentioned here is English, and the compound language mentioned here is English plus Chinese), 4, for a sentence of a non-compound language, then according to the coding rules of the TTS mode of a certain compound language in a certain coding rule The value of the combined information generated by encoding may be decoded by the decoding rules of the live broadcast method of this compound language in a certain decoding rule (for example, the non-compound language mentioned here is English, and the compound language mentioned here is English plus Korean), 5. For a statement in a non-compound language, the value of the combined information generated by encoding according to the encoding rules of a certain compound language in a certain encoding rule may be converted into a certain decoding rule for another compound language. (TTS mode) decoding rules to decode (for example, the non-compound language mentioned here is English, a certain compound language mentioned here is English plus Chinese plus Korean, and another compound language mentioned here is English plus Chinese plus Japanese) .

除上段述5种情况外还会有更多的编解码情况,这些编解码情况(这些编解码情况包括上段述5种情况;本段以下各处说到的‘这些编解码情况’都是指这里说的‘这些编解码情况’)与前两段说到的情形1的本质相同,这些编解码情况中任一编解码情况涉及的编码规则也符合前两段述的编码规则的那些共同特征,这些编解码情况中任一编解码情况涉及的解码规则也符合前两段述的解码规则的那些共同特征;对这些编解码情况中的任一编解码情况,解码规则对一个拼合信息的值进行解码时对音码的定义方式及使用的音码情况,一般是要与用编码规则编码生成此拼合信息的值时对音码的定义方式及使用的音码情况相同或兼容;对这些编解码情况本发明文件就没再专门定义解码规则与编码规则对应等关系,即不论解码规则是不是专门被设计出来以用于对用编码规则编码生成的拼合信息的值进行解码的,权利要求3,12,20或23中述及的编解码情况可以是这些编解码情况中的任一个,即可以是这样:权利要求3里编码规则中有关语种1的编码规则是这些编解码情况中的任一编解码情况里涉及的编码规则中有关某语种的编码规则且此权利要求里解码规则中有关语种2的解码规则是本句内前面所述任一编解码情况里涉及的解码规则中有关某语种的解码规则(这句的两个‘某语种’可能是相同语种也可能不是相同语种);也可以是这样:权利要求12里编码规则中有关语种1的编码规则是这些编解码情况中的任一编解码情况里涉及的编码规则中有关某语种的编码规则且此权利要求里解码规则中有关语种2的解码规则是本句内前面所述任一编解码情况里涉及的解码规则中有关某语种的解码规则(这句的两个‘某语种’可能是相同语种也可能不是相同语种);也可以是这样:权利要求20里编码规则中有关语种1的编码规则是这些编解码情况中的任一编解码情况里涉及的编码规则中有关某语种的编码规则且此权利要求里解码规则中有关语种2的解码规则是本句内前面所述任一编解码情况里涉及的解码规则中有关某语种的解码规则(这句的两个‘某语种’可能是相同语种也可能不是相同语种);也可以是这样:权利要求23里编码规则中有关语种1的编码规则是这些编解码情况中的任一编解码情况里涉及的编码规则中有关某语种的编码规则且此权利要求里解码规则中有关语种2的解码规则是本句内前面所述任一编解码情况里涉及的解码规则中有关某语种的解码规则(这句的两个‘某语种’可能是相同语种也可能不是相同语种);当然这些编解码情况各自涉及的拼合信息的值在码字里的放置(比如对拼合信息的值进行变换,把拼合信息的值放多个码字里),诸个码字及部诸码字的定义,拾音器的硬件特征,读码处理,产音方式等都与本说明书前面反复述的有关前两段说到的情形1涉及的这些技术特征相同;这些编解码情况各自也是用音码来做拼合,编解码涉及的音码的选取,编解码方式,发音形式等都可参照前两段说到的情形1的处理;这些编解码情况各自涉及的拾音器也可以是非插卡方式拾音器或插数据卡方式拾音器或插部件式数据卡拾音器,而且可以是与其它电子类产品集成在一起的,这些技术特征与前两段说到的情形1涉及的这些技术特征都是一样的。In addition to the 5 situations mentioned in the previous paragraph, there will be more codec situations. The "these codec situations" mentioned here) are essentially the same as the situation 1 mentioned in the previous two paragraphs, and the coding rules involved in any of these codec situations also conform to the common features of the coding rules mentioned in the previous two paragraphs , the decoding rules involved in any of these codec cases also conform to the common features of the decoding rules described in the previous two paragraphs; for any of these codec cases, the decoding rules have When decoding, the definition mode of the phonetic code and the situation of the phonetic code used are generally the same or compatible with the definition mode of the phonetic code and the situation of the used phonetic code when encoding the value of this combination of information with encoding rules; Decoding situation The document of the present invention does not specifically define the corresponding relationship between decoding rules and encoding rules, that is, regardless of whether the decoding rules are specially designed to decode the value of the combined information generated by encoding with encoding rules, claim 3 , 12, 20 or 23 mentioned codec situations can be any one of these codec situations, that is, it can be like this: the coding rule about language 1 in the coding rules in claim 3 is any one of these codec situations The coding rules related to a certain language in the coding rules involved in a codec situation and the decoding rules related to language 2 in the decoding rules in this claim are related to a certain language in the decoding rules involved in any codec situation mentioned above in this sentence The decoding rule of the language (the two "a certain language" in this sentence may or may not be the same language); it can also be like this: the encoding rule related to language 1 in the encoding rule in claim 12 is the codec in these encoding and decoding situations The coding rules related to a certain language in the coding rules involved in any codec situation and the decoding rules related to language 2 in the decoding rules in this claim are related to the decoding rules involved in any codec situation mentioned above in this sentence The decoding rule of a certain language (the two "a certain language" in this sentence may or may not be the same language); it can also be like this: the encoding rule related to language 1 in the encoding rule in claim 20 is in these encoding and decoding situations The coding rules related to a certain language in the coding rules involved in any codec situation and the decoding rules related to language 2 in the decoding rules in this claim are among the decoding rules involved in any codec situation mentioned above in this sentence Decoding rules related to a certain language (the two "a certain language" in this sentence may or may not be the same language); it can also be like this: the coding rules related to language 1 in the coding rules in claim 23 are these encoding and decoding situations The coding rules related to a certain language in the coding rules involved in any codec situation in the claim and the decoding rules related to language 2 in the decoding rules in this claim are the decoding rules involved in any codec situation mentioned above in this sentence The decoding rules about a certain language in this sentence (the two "a certain language" in this sentence may or may not be the same language); of course, the values of the splicing information involved in each of these encoding and decoding situations are placed in the codeword (for example, for splicing transform the value of the information, and put the value of the combined information into multiple codewords), the definitions of the codewords and part of the codewords, the hardware characteristics of the pickup, the code reading processing, the sound production method, etc. The technical features involved in thesituation 1 mentioned in the first two paragraphs are the same; these encoding and decoding situations are also combined with audio codes, and the selection of audio codes involved in encoding and decoding, encoding and decoding methods, pronunciation forms, etc. can refer to The processing ofsituation 1 mentioned in the previous two paragraphs; the pickups involved in these codec situations can also be non-plug-in card pickups or data card pickups or plug-in data card pickups, and can be integrated with other electronic products Together, these technical features are the same as those involved inScenario 1 mentioned in the previous two paragraphs.

以上3段提到的一个语种可以是任一非复合语种或任一复合语种,以上3段提到的多个语种可以全是非复合语种或全是复合语种或既含有非复合语种又含有复合语种。One of the languages mentioned in the above 3 paragraphs can be any non-composite language or any compound language, and the multiple languages mentioned in the above 3 paragraphs can be all non-composite languages or all compound languages or contain both non-compound languages and compound languages .

一般情况下,由一个编码规则来编码生成一表示一语句发音的拼合信息的值,再由一解码规则来对此拼合信息的值进行解码,此编码规则一般有特征:可适用于一个或多个语种,对这一个或多个语种中的每个语种都有此编码规则中有关此语种的编码细则,对这一个或多个语种中的每个语种,此编码规则中有关此语种的编码细则都有定义编码语句集,此编码规则中有关此语句的语种的编码细则有压缩编码处理,是由此编码规则中有关此语句的语种的编码细则来编码生成此拼合信息的值的,此解码规则一般有特征:可适用于一个或多个语种,对这一个或多个语种中的每个语种都有此解码规则中有关此语种的解码细则,对这一个或多个语种中的每个语种,此解码规则中有关此语种的解码细则都有定义解码语句集,此解码规则中有关此语句的语种的解码细则有压缩解码处理,是由此解码规则中有关此语句的语种的解码细则来对此拼合信息的值进行解码的;本段内以上所述情况也符合这样:本段内前面述的拼合信息的值是由此编码规则编码生成,此拼合信息的值是由此解码规则解码的,此编码规则在编码生成此拼合信息的值时有压缩编码处理,此解码规则在对此拼合信息的值进行解码时有压缩解码处理。本段提到的一个语种可以是任一非复合语种或任一复合语种,本段提到的多个语种可以全是非复合语种或全是复合语种或既含有非复合语种又含有复合语种。In general, an encoding rule is used to encode and generate a value representing the combined information of a sentence pronunciation, and then a decoding rule is used to decode the value of the combined information. This encoding rule generally has characteristics: it can be applied to one or more language, for each of the one or more languages, there are coding rules for this language in this coding rule, and for each of the one or more languages, there are coding rules for this language in this coding rule The detailed rules have a set of defined coded sentences. The coded rules about the language of this sentence in this coded rule are compressed and encoded. The value of the combined information is encoded by the coded rules about the language of this sentence in this coded rule. Decoding rules generally have characteristics: applicable to one or more languages, each of the one or more languages has decoding rules related to this language in the decoding rules, and each of the one or more languages For each language, the decoding rules for this language in this decoding rule have a defined decoding statement set. The decoding rules for this language in this decoding rule have compression and decoding processing, which is the decoding of the language for this sentence in this decoding rule. The value of the combined information is decoded according to the detailed rules; the above-mentioned situation in this paragraph is also consistent with this: the value of the aforementioned combined information in this paragraph is encoded by this encoding rule, and the value of the combined information is decoded by this For rule decoding, this encoding rule has compression encoding processing when encoding and generating the value of the combined information, and this decoding rule has compression decoding processing when decoding the value of the combining information. A language mentioned in this paragraph can be any non-composite language or any compound language, and the multiple languages mentioned in this paragraph can be all non-compound languages or all compound languages, or contain both non-composite languages and compound languages.

本申请文件约定一个非插卡方式(或插数据卡方式或插部件式数据卡方式)的拾音器内只有一个发音器件,而这个发音器件可以是一个子发音器件,或是把多个子发音器件一起当成此发音器件(这种情况可说成此发音器件由多个子发音器件组成),本申请文件所说的子发音器件就是优先权申请文件里说的发音器件,一个子发音器件可以是一个喇叭,一个喇叭电路,一个带喇叭的语音芯片,一个带喇叭的TTS芯片等;本申请文件各处提及一个发音器件是一个喇叭或一个喇叭电路或一个带喇叭的语音芯片或一个带喇叭的TTS芯片(以及类似描述)时,是默认此发音器件就是一个子发音器件的,且此子发音器件就是这一个喇叭或一个喇叭电路或一个带喇叭的语音芯片或一个带喇叭的TTS芯片等的。本申请文件所说的子发音器件都是指拾音器中用于解码后发音的。本段所说的拾音器可为没与电子类产品集成的拾音器,也可为电子产品拾音器。This application document stipulates that there is only one sounding device in the pickup of a non-plug-in card mode (or plug-in data card mode or plug-in data card mode), and this sounding device can be a sub-sounding device, or a plurality of sub-sounding devices together As this pronunciation device (this situation can be said that this pronunciation device is made up of a plurality of sub-sounding devices), the said sub-sounding device of this application document is exactly the pronunciation device of saying in the priority application document, and a sub-sounding device can be a loudspeaker , a speaker circuit, a voice chip with a speaker, a TTS chip with a speaker, etc.; this application document mentions that a sound device is a speaker or a speaker circuit or a voice chip with a speaker or a TTS with a speaker Chip (and similar descriptions), it is defaulted that the sounding device is a sub-sounding device, and the sub-sounding device is the speaker or a speaker circuit or a voice chip with a speaker or a TTS chip with a speaker, etc. The sub-pronunciation devices mentioned in this application document all refer to those used in the pickup for pronunciation after decoding. The pickup mentioned in this paragraph can be a pickup that is not integrated with an electronic product, or a pickup for an electronic product.

如一个发音器件由多个子发音器件组成,这种情况时仍是此发音器件与MCU意向连接,且是指这多个子发音器件中每个子发音器件与MCU直接连接或间接连接。本申请文件所说的拾音器的所述单片机芯片对读码得到的拼合信息的值执行解码使拾音器的发音器件播出语句发音(或类似描述),这里的发音器件就可以是由多个子发音器件组成的,也可以就是一个子发音器件;如一个发音器件是由多个子发音器件组成的,那在拾音器读码解码时一般是在对有些拼合信息的值解码时用某一子发音器件发音,在对另外一些拼合信息的值解码时用另一子发音器件发音,一般不会在解码时多个子发音器件同时发音,但也可能有在解码时多个子发音器件同时发音的情况,本申请文件所说的拾音器的所述单片机芯片对读码得到的拼合信息的值执行解码使拾音器的发音器件播出语句发音(或类似描述),其涉及的发音情况可以是解码时没多个发音器件同时发音的发音情况,也可以是解码时有多个发音器件同时发音的发音情况。If a sounding device is composed of multiple sub-sounding devices, in this case, the sounding device is still intended to be connected to the MCU, and it means that each sub-sounding device in the multiple sounding devices is directly or indirectly connected to the MCU. The single-chip microcomputer chip of the pickup mentioned in the application documents performs decoding to the value of the combined information obtained by reading the code to make the pronunciation device of the pickup broadcast the sentence pronunciation (or similar description), and the pronunciation device here can be made of a plurality of sub-pronunciation devices Composed of, also can be exactly a sub-pronunciation device; As a pronunciation device is made up of a plurality of sub-pronunciation devices, then generally be to use certain sub-pronunciation device to pronounce when the value decoding of some combined information when the pick-up reading code is decoded, Use another sub-pronunciation device to pronounce when decoding the value of some other combined information. Generally, multiple sub-pronunciation devices will not pronounce at the same time when decoding, but there may also be a situation where multiple sub-pronunciation devices are simultaneously pronouncing when decoding. This application document The said one-chip microcomputer chip of said pickup carries out decoding to the value of the combination information that code reading obtains and makes the pronunciation device of pickup broadcast sentence pronunciation (or similar description), and the pronunciation situation that it relates to can be that a plurality of pronunciation devices do not have simultaneously when decoding The pronunciation of the pronunciation may also be the pronunciation of multiple pronunciation devices simultaneously during decoding.

权利要求书中提到的‘语句1’,‘语种1’,‘语种2’,‘拼合信息的值1’都是代指,这样做是为了写权利要求书的方便简明,权利要求书中的‘语句1’是指某一语句,‘语种1’是指某一语种,‘语种2’也是指某个语种(但同一个权利要求项里提到的语种1和语种2可能是相同语种也可能不是相同语种),‘拼合信息的值1’是代指权利要求书中提到的‘表示如何用音码拼合成此语句1发音的拼合信息的值’。The 'statement 1', 'language 1', 'language 2', and 'value 1 of combined information' mentioned in the claims are all references. This is done for the convenience and simplicity of writing the claims. In the claims "Statement 1" refers to a certain sentence, "language 1" refers to a certain language, and "language 2" also refers to a certain language (butlanguage 1 andlanguage 2 mentioned in the same claim may be the same language may not be the same language), "thevalue 1 of the combined information" refers to "the value of the combined information that expresses how to combine the pronunciation of thesentence 1 with the phonetic code" mentioned in the claims.

权利要求3中的各‘语种1’都是相同所指,权利要求3中的各‘语种2’都是相同所指;权利要求12中的各‘语种1’都是相同所指,权利要求12中的各‘语种2’都是相同所指;权利要求20中的各‘语种1’都是相同所指,权利要求20中的各‘语种2’都是相同所指;权利要求23中的各‘语种1’都是相同所指,权利要求23中的各‘语种2’都是相同所指;All the 'languages 1' inclaim 3 have the same reference, and all the 'languages 2' inclaim 3 have the same reference; all the 'languages 1' inclaim 12 have the same reference, and the claims All 'languages 2' inclaim 12 refer to the same; all 'languages 1' inclaim 20 all refer to the same, and all 'languages 2' inclaim 20 refer to the same; in claim 23 All 'languages 1' in claim 23 refer to the same, and all 'languages 2' in claim 23 refer to the same;

权利要求3中‘所述编码规则’与‘此编码规则’是相同所指,权利要求3中‘一种解码规则’与‘此解码规则’是相同所指;权利要求12中‘所述编码规则’与‘此编码规则’是相同所指,权利要求12中‘所述解码规则’与‘此解码规则’是相同所指。Inclaim 3, "the encoding rule" and "this encoding rule" refer to the same, and inclaim 3, "a decoding rule" and "this decoding rule" refer to the same; inclaim 12, "the encoding rule Rules' and 'this encoding rule' refer to the same, and 'the decoding rule' and 'this decoding rule' inclaim 12 refer to the same.

本发明文件所说的一个拼合信息的值涉及的BIT,是指表示这个拼合信息的值的全部二进制BIT。本发明文件所说的一个拼合信息的值的BIT与说的一个拼合信息的值涉及的BIT是相同含义;本发明文件所说的一个拼合信息的值的全部BIT是指一个拼合信息的值涉及的BIT的全部,本发明文件所说的一个拼合信息的值的部分BIT是指一个拼合信息的值涉及的BIT的部分。本发明文件所说的把一个拼合信息的值放在一码字里,是指把这个拼合信息的值的BIT放在这一个码字里,本发明文件所说的把数个拼合信息的值放在一码字里,是指把这数个拼合信息的值中每个拼合信息的值涉及的BIT都放在这一个码字里。The BITs involved in the value of a combined information mentioned in the document of the present invention refer to all binary BITs representing the value of the combined information. The BIT of the value of a merging information mentioned in the document of the present invention is the same meaning as the BIT involved in the value of a merging information; The whole of the BIT, the BIT part of the value of a combined information mentioned in the document of the present invention refers to the part of the BIT involved in the value of a combined information. Putting the value of a combined information in a codeword in the said document of the present invention refers to putting the BIT of the value of the combined information in this codeword, and putting the values of several combined information in the document of the present invention Putting it in one codeword refers to putting the BITs involved in each value of the combined information among the values of the combined information into the one codeword.

对于本发明文件提到的‘数个表示如何用音码拼合成语句发音的拼合信息的值’,其中的‘数个’是修饰其中的‘表示如何用音码拼合成语句发音的拼合信息的值’的,对于本发明文件提到的‘数个表示语句发音的拼合信息的值’或‘数个拼合信息的值’也是按这样理解。For the 'several values that represent how to piece together the combined information of sentence pronunciation with phonetic codes' mentioned in the document of the present invention, 'several' wherein ' represents how to piece together the combined information of sentence pronunciation with phonetic codes' for modification Value' is also understood in the same way for the 'values of several pieced information representing the pronunciation of a sentence' or 'values of several pieced information' mentioned in the document of the present invention.

关于本发明文件所说的用编码规则来编码生成拼合信息的值,其编码的执行形式,可为人用手工来编码,或用电脑程序自动编码,或是其它形式,显然这些形式互相是等同替换。With regard to the encoding rules used in the document of the present invention to encode the value of the combined information, the execution form of the encoding can be manually encoded, or automatically encoded by a computer program, or other forms. Obviously, these forms are equivalent to each other. .

如果在编码过程中就一边编码一边进行其它处理(比如加密等)操作,最后把得到的值直接放到码字里,那这种情况等同于先编码生成数个拼合信息的值,再进行其它处理(比如加密等)操作;而对码字的值中的数个拼合信息的变换值这样的值,进行解码时,一般是先对这些值作处理(比如逆变换)得到数个拼合信息的值,再解码,而如是对这些值一边作处理(比如逆变换)一般解码,那这等同于先作处理(比如逆变换)以得到数个拼合信息的值再作解码。If during the encoding process, other processing (such as encryption, etc.) is performed while encoding, and finally the obtained value is directly put into the codeword, then this situation is equivalent to first encoding to generate several values of the combined information, and then performing other operations. Processing (such as encryption, etc.) operations; and when decoding values such as the transformation values of several pieced together information in the value of the codeword, generally these values are first processed (such as inversely transformed) to obtain several pieces of pieced together information Values, and then decode, and if these values are processed (such as inverse transformation) and generally decoded, then this is equivalent to first processing (such as inverse transformation) to obtain several values of the combined information and then decoding.

对于本说明书中的某一句子,如句子中有括号来作注释,那括号里的内容也算此句子的一部分。For a certain sentence in this manual, if there are brackets in the sentence for comments, the content in the brackets is also a part of the sentence.

注:在数据库的范畴,有记录(RECORD),字段(FIELD)的概念,每个记录含有一个或多个字段的信息,比如:Note: In the category of database, there are the concepts of record (RECORD) and field (FIELD). Each record contains information of one or more fields, such as:

(职工编号    姓名    年龄)(employee number name age)

0036         张三    300036 Zhang San 30

0037         李四    270037 Li Si 27

就是两个记录(RECORD),每个记录各有三个字段(职工编号,姓名,年龄)的信息,It is two records (RECORD), each record has three fields (employee number, name, age) information,

第1个记录是‘0036    张三    30’,The first record is '0036 Zhang San 30',

第2个记录是‘0037    李四    27’;The second record is '0037 Lisi 27';

按常规是用汉语的‘记录’来称呼数据库里的‘RECORD’概念的,但本发明文件由于经常使用作动词的‘记录’这个词,所以为免混淆,本发明文件是用汉语的‘记录项’来称呼数据库里的‘RECORD’概念的,比如上面第1个记录项是‘0036    张三    30’。Conventionally, the concept of 'RECORD' in the database is called by the Chinese 'record', but the document of the present invention uses the word 'record' as a verb often, so in order to avoid confusion, the document of the present invention uses the Chinese 'record' item' to call the 'RECORD' concept in the database, for example, the first record item above is '0036 Zhang San 30'.

注:在本发明文件的说明书中,有些地方有提及的‘本段以上’,‘本段上面’,‘本段前面’,‘本段内前面’等,显然都是指本段范围内前面所述;同理,如有些地方有提及到的‘本段以下’,‘本段下面’,‘本段后面’,是指本段范围内后面所述。本说明书中有些地方提及的‘前面两段’,是不包括此‘前面两段’几个词本身所处的那段的,其它比如前面多少段这样的类似提法也是这样的。本说明书中有些地方提及的‘后面两段’,是不包括此‘后面两段’几个词本身所处的那段的,其它比如后面多少段这样的类似提法也是这样的。Note: In the description of the invention document, there are references to 'above this paragraph', 'above this paragraph', 'before this paragraph', 'before this paragraph', etc., which obviously refer to the scope of this paragraph As mentioned above; similarly, if there are references to 'below this paragraph', 'below this paragraph', and 'behind this paragraph' in some places, it refers to what is described later within the scope of this paragraph. The "first two paragraphs" mentioned in some places in this manual does not include the paragraph where the words "first two paragraphs" are located, and the same is true for other similar references such as the first few paragraphs. The "two paragraphs in the back" mentioned in some places in this specification does not include the paragraph in which the words "two paragraphs in the back" are located, and the same is true for other similar references such as several paragraphs in the back.

注:本说明书中各处所称的‘本发明文件’包括本次申请的权利要求书,说明书和说明书附图。如提及时无特别指明有其他情况,本发明文件的说明书中各处提及的‘本说明书’或‘权利要求书’或权利要求项(比如提及的‘权利要求1’)默认是指针对本发明文件的,这是显然的。Note: The "invention document" referred to everywhere in this specification includes the claims, specification and drawings of this application. If there are no other circumstances specified when mentioning, the 'this specification' or 'claims' or claim items (such as the mentioned 'claim 1') mentioned in the description of the present invention document refer to the default Invention of documents, it is obvious.

Claims (24)

Translated fromChinese
1.一种语音编码有声读物,其主体为读物,在前述读物的整页面上有诸个码字,此诸个码字属于前述语音编码有声读物,其特征在于:1. A phonetic coded audiobook, its main body is a reading material, and there are code words on the entire page of the aforementioned reading material, and these code words belong to the aforementioned phonetic coded audiobook, characterized in that:前述诸个码字中的每个码字的码值是共与数个表示如何用音码拼合成语句发音的拼合信息的值有关的。The code value of each code word in the aforementioned code words is related to the value of the combination information of how to use phonetic sign indicating numbers to piece together into sentence pronunciation altogether.2.根据权利要求1所述的语音编码有声读物,其特征在于:2. The speech-encoded audiobook of claim 1, wherein:所述诸个码字中有部诸码字;前述部诸码字是所述诸个码字中的全部码字或部分码字;所述诸个码字可以是一个或多个码字;前述部诸码字可以是一个或多个码字;All code words of part are arranged in described all code words; All code words of aforesaid part are all code words or part code words in described all code words; Described all code words can be one or more code words; All code words of aforementioned part can be one or more code words;前述部诸码字中的每个码字都可被同一个拾音器读取,前述部诸码字中的每个码字是通过前述拾音器的读码头被前述拾音器读取的,前述部诸码字中的每个码字可被前述拾音器读取以用于前述拾音器播出语句发音;Each code word in all code words of aforementioned part can be read by same pickup, each code word in all code words of aforementioned part is to be read by aforementioned pickup by the reading pier of aforementioned pickup, all code words of aforementioned part Each code word in can be read by aforesaid pickup for the aforesaid pickup to broadcast sentence pronunciation;前述拾音器是非插卡方式拾音器,或是插数据卡方式拾音器,或是插部件式数据卡方式拾音器;前述拾音器包括有一个前述读码头,一个发音器件,一个单片机芯片,此单片机芯片是前述拾音器的主处理芯片,在前述拾音器里此单片机芯片与前述读码头意向连接,在前述拾音器里此单片机芯片与前述发音器件意向连接,前述拾音器在使用时,需用到存储器集,此存储器集用于存放前述拾音器涉及的程序和前述拾音器涉及的数据,此存储器集里的各存储器可被前述拾音器的前述单片机芯片读取使用;The aforementioned pickup is a non-plug-in card mode pickup, or a plug-in data card mode pickup, or a plug-in data card mode pickup; the aforementioned pickup includes an aforementioned reading terminal, a sounding device, and a single-chip microcomputer chip, and this single-chip microcomputer chip is the aforementioned pickup. Main processing chip, in the aforementioned pickup, this single-chip microcomputer chip is connected with the aforementioned reading dock intentionally, and in the aforementioned pickup, this single-chip microcomputer chip is connected with the aforementioned pronunciation device intentionally, and the aforementioned pickup needs to use a memory set when in use, and this memory set is used to store The programs related to the aforementioned pickup and the data related to the aforementioned pickup, each memory in this memory set can be read and used by the aforementioned single-chip microcomputer chip of the aforementioned pickup;前述拾音器在被使用时,前述拾音器的前述单片机芯片利用前述拾音器的前述读码头可以读取前述部诸码字中的每个码字,前述拾音器在被使用时,前述拾音器的前述单片机芯片根据读得的前述部诸码字中的码字的码值使前述拾音器的前述发音器件播出语句发音;Aforesaid pickup is when being used, and the aforementioned single-chip microcomputer chip of aforementioned pickup utilizes the aforementioned reading pier of aforementioned pickup to be able to read each codeword in all codewords of aforementioned portion, and aforementioned pickup is when being used, and the aforementioned single-chip microcomputer chip of aforementioned pickup is based on reading The code value of the code word in all code words of the aforementioned part that gets makes the aforementioned pronunciation device of the aforementioned sound pick-up broadcast the sentence pronunciation;对于前述部诸码字中的一个码字涉及的表示如何用音码拼合成语句1发音的拼合信息的值,此表示如何用音码拼合成此语句1发音的拼合信息的值是按照一种编码规则来编码生成的,此编码规则编码生成此表示如何用音码拼合成此语句1发音的拼合信息的值时是有压缩编码处理的,此表示如何用音码拼合成此语句1发音的拼合信息的值也是拼合信息的值1;For the value of how a code word in the aforementioned part of all code words relates to the combination information of how to piece together the pronunciation of sentence 1 with phonetic sign indicating number, this expression is how to piece together the value of the combination information of this sentence 1 pronunciation with phonetic sign indicating number according to a kind of Encoding rules are generated by encoding. This encoding rule encoding generates the value of how to use phonetic codes to form the value of the combination information of the pronunciation of this statement 1. Compression coding is used to process this value. This indicates how to use phonetic codes to form the value of the pronunciation of this statement 1. The value of the merged information is also the value 1 of the merged information;对于前述拼合信息的值1涉及的前述语句1,可以是一个完整句子,或者,是一个完整句子的部分,或者,是含有多个完整句子,或者,是含有数个完整句子和数个非完整句子,或者,是含有多个非完整句子。For the aforementioned sentence 1 involved in the value 1 of the aforementioned combined information, it can be a complete sentence, or a part of a complete sentence, or contain multiple complete sentences, or contain several complete sentences and several incomplete sentences. Sentences, or, are multiple incomplete sentences.3.根据权利要求2所述的语音编码有声读物,其特征在于:3. The speech-encoded audiobook of claim 2, wherein:所述拼合信息的值1是按照所述编码规则中有关语种1的编码细则来编码生成的,此编码规则中有关语种1的编码细则是含有压缩编码处理的,此编码规则是只适用于语种1或适用于包括语种1在内的多个语种的一种编码规则,只需从所述一个码字即可得到所述拼合信息的值1或需从包括所述一个码字在内的所述部诸码字中的多个码字才能得到所述拼合信息的值1,所述拾音器的所述单片机芯片利用所述读码头可读取所述一个码字或所述多个码字而得到所述拼合信息的值1,所述拾音器的所述单片机芯片是按一种解码规则中有关语种2的解码细则来对读码得到的所述拼合信息的值1执行解码使所述拾音器的所述发音器件播出所述语句1发音的,而且此解码规则中有关语种2的解码细则是含有压缩解码处理的,此解码规则是只适用于语种2或适用于包括语种2在内的多个语种的一种解码规则。The value 1 of the spliced information is generated by encoding according to the encoding rules related to language 1 in the encoding rules. The encoding rules related to language 1 in the encoding rules include compression encoding processing. This encoding rule is only applicable to languages 1 or a coding rule applicable to multiple languages including language 1, the value of the combined information can be obtained only from the one codeword 1 or from all the codewords including the one codeword A plurality of codewords in all codewords of the above-mentioned parts just can obtain the value 1 of described mosaic information, and the described single-chip microcomputer chip of described sound pick-up utilizes described reading terminal to be able to read described a codeword or described a plurality of codewords and Obtain the value 1 of the merging information, the single-chip microcomputer chip of the pickup is to decode the value 1 of the merging information obtained by reading the code according to the decoding rules of the relevant language 2 in a decoding rule to make the pickup of the pickup The pronunciation device broadcasts the pronunciation of the sentence 1, and the decoding details of the language 2 in the decoding rule contain compression decoding processing, and the decoding rule is only applicable to the language 2 or multiple languages including the language 2. A decoding rule for each language.4.根据权利要求3所述的语音编码有声读物,其特征在于:4. The speech-encoded audiobook of claim 3, wherein:所述语种1与所述语种2是相同语种或是不同语种;The language 1 and the language 2 are the same language or different languages;所述语句1是属于所述语种1的一个语句,所述语种1是属于印欧语系的一个语种;是用所述编码规则中有关所述语种1的编码细则来编码生成所述拼合信息的值1的,且这种编码规则中有关所述语种1的编码细则是含有压缩编码处理的,这种编码规则中有关所述语种1的编码细则是直播方式的;The sentence 1 is a sentence belonging to the language 1, and the language 1 is a language belonging to the Indo-European language family; the coding rules about the language 1 in the coding rules are used to encode and generate the combined information The value is 1, and the encoding details about the language 1 in the encoding rules include compression encoding processing, and the encoding details about the language 1 in the encoding rules are live broadcast;前述这种编码规则中有关所述语种1的编码细则有定义编码涉及的编码语句集,所述语句1属于此编码语句集;The coding rules about the language 1 in the aforementioned coding rules define the coding sentence set involved in the coding, and the sentence 1 belongs to the coding sentence set;前述这种编码规则中有关所述语种1的编码细则可用于对前述编码语句集中任一语句进行编码以生成表示如何用音码拼合成此任一语句发音的拼合信息的值;The coding rules about the language 1 in the aforementioned coding rules can be used to encode any sentence in the aforementioned coding sentence set to generate a value representing how to combine phonetic codes into the pronunciation of any sentence;对于前述编码语句集中任一语句,可用多个音码来拼合成这任一语句的发音;For any sentence in the aforementioned coded sentence collection, a plurality of phonetic codes can be used to piece together the pronunciation of this arbitrary sentence;可被当成用以拼合前述编码语句集发音的音码,也即前述这种编码规则中有关所述语种1的编码细则可以使用的音码,可以是表示发音的元辅音或元音辅音组合,也可以是可以转换成发音的词根,单词,单词组合或范句,也可以是经组合后可转换为发音的字符组合或字母,也可以是特殊音码;既可以由元辅音,元音辅音组合和单词中的任一种音码来拼合前述编码语句集发音,也可以由元辅音,元音辅音组合,词根,单词和单词组合中的多种音码来拼合前述编码语句集发音,也可以由范句音码与元辅音,元音辅音组合和单词中的任一种音码来拼合前述编码语句集发音,也可以由范句音码与元辅音,元音辅音组合,词根,单词和单词组合中的多种音码来拼合前述编码语句集发音,也可以由词根和单词组合中的任一种音码来拼合前述编码语句集发音,也可以由词根和单词组合中的任一种音码与范句音码来拼合前述编码语句集发音,也可以由字符组合和字母中的一种或多种音码来拼合前述编码语句集发音,也可以由元辅音,元音辅音组合,词根,单词,单词组合和范句中的一种或多种音码与字符组合和字母中的一种或多种音码来拼合前述编码语句集发音,也可以由元辅音,元音辅音组合,词根,单词,单词组合,范句,字符组合和字母中的一种或多种音码与特殊音码来拼合前述编码语句集发音;Can be regarded as the phonetic code that is used to put together the pronunciation of the aforementioned coded sentence set, that is, the phonetic code that can be used in the coding rules about the language 1 in the aforementioned coding rules, can be the combination of vowels and consonants or vowels and consonants representing pronunciation, It can also be a root word that can be converted into pronunciation, a word, a combination of words or a model sentence, or a combination of characters or letters that can be converted into pronunciation after combination, or a special phonetic code; Combination and any kind of phonetic sign indicating number in the word to splice the aforementioned coded sentence set pronunciation, also can be by vowel and consonant, vowel-consonant combination, root, word and multiple phonetic codes in the word combination come together the aforementioned coded sentence set pronunciation, also Can put together the aforementioned coded sentence set pronunciation by model sentence phonetic sign indicating number and vowel and consonant, vowel and consonant combination and word and the multiple phonetic codes in the word combination to piece together the pronunciation of the aforementioned coded sentence set, and also can put together the pronunciation of the aforementioned coded sentence set by any phonetic code in the root and the word combination, and can also be made by any one of the root and the word combination Kind of phonetic sign indicating number and pattern sentence sound sign indicating number come together pronunciation of aforementioned coded statement set, also can come together pronunciation of aforementioned encoded statement set by one or more sound codes in character combination and letter, also can be by vowel consonant, vowel consonant combination , root, word, word combination and one or more phonetic codes in the model sentence and one or more phonetic codes in the character combination and the letter to piece together the aforementioned coded statement set pronunciation, also can be by vowel consonant, vowel consonant One or more phonetic codes and special phonetic codes in combinations, root words, words, word combinations, model sentences, character combinations and letters are used to assemble the pronunciation of the aforementioned coded sentence sets;用以拼合前述编码语句集发音的音码,同样一个音码可以用于前述编码语句集中的不同语句的拼合发音。The phonetic codes used to assemble the pronunciations of the aforementioned coded sentence sets, the same phonetic codes can be used for the combined pronunciations of different sentences in the aforementioned coded sentence sets.5.根据权利要求3所述的语音编码有声读物,其特征在于:5. The speech-encoded audiobook of claim 3, wherein:所述语种1与所述语种2是相同语种或是不同语种;The language 1 and the language 2 are the same language or different languages;所述语句1是属于所述语种1的一个语句,所述语种1是属于非印欧语系的一个语种;是用所述编码规则中有关所述语种1的编码细则来编码生成所述拼合信息的值1的,且这种编码规则中有关所述语种1的编码细则是含有压缩编码处理的,这种编码规则中有关所述语种1的编码细则是直播方式的;The sentence 1 is a sentence belonging to the language 1, and the language 1 is a language belonging to the non-Indo-European language family; the coding rules about the language 1 in the coding rules are used to encode and generate the combined information The value of 1, and the encoding details about the language 1 in the encoding rules include compression encoding processing, and the encoding details about the language 1 in the encoding rules are live broadcast;前述这种编码规则中有关所述语种1的编码细则有定义编码涉及的编码语句集,所述语句1属于此编码语句集;The coding rules about the language 1 in the aforementioned coding rules define the coding sentence set involved in the coding, and the sentence 1 belongs to the coding sentence set;前述这种编码规则中有关所述语种1的编码细则可用于对前述编码语句集中任一语句进行编码以生成表示如何用音码拼合成此任一语句发音的拼合信息的值;The coding rules about the language 1 in the aforementioned coding rules can be used to encode any sentence in the aforementioned coding sentence set to generate a value representing how to combine phonetic codes into the pronunciation of any sentence;对于前述编码语句集中任一语句,可用多个音码来拼合成这任一语句的发音;For any sentence in the aforementioned coded sentence collection, a plurality of phonetic codes can be used to piece together the pronunciation of this arbitrary sentence;可被当成用以拼合前述编码语句集发音的音码,也即前述这种编码规则中有关所述语种1的编码细则可以使用的音码,可以是表示发音的元辅音或元音辅音组合,也可以是可以转换成发音的字,词,词组合或范句,也可以是经组合后可转换为发音的偏旁部首组合或偏旁部首,也可以是特殊音码;既可以由元辅音,元音辅音组合,字和词中的任一种音码来拼合前述编码语句集发音,也可以由元辅音,元音辅音组合,字,词和词组合中的多种音码来拼合前述编码语句集发音,也可以由范句音码与元辅音,元音辅音组合,字和词中的任一种音码来拼合前述编码语句集发音,也可以由范句音码与元辅音,元音辅音组合,字,词和词组合中的多种音码来拼合前述编码语句集发音,也可以由词组合这一种音码来拼合前述编码语句集发音,也可以由词组合音码与范句音码来拼合前述编码语句集发音,也可以由偏旁部首组合和偏旁部首中的一种或多种音码来拼合前述编码语句集发音,也可以由元辅音,元音辅音组合,字,词,词组合和范句中的一种或多种音码与偏旁部首组合和偏旁部首中的一种或多种音码来拼合前述编码语句集发音,也可以由元辅音,元音辅音组合,字,词,词组合,范句,偏旁部首组合和偏旁部首中的一种或多种音码与特殊音码来拼合前述编码语句集发音;Can be regarded as the phonetic code that is used to put together the pronunciation of the aforementioned coded sentence set, that is, the phonetic code that can be used in the coding rules about the language 1 in the aforementioned coding rules, can be the combination of vowels and consonants or vowels and consonants representing pronunciation, Also can be the word that can be converted into pronunciation, word, word combination or model sentence, also can be the radical combination or radical radical that can be converted into pronunciation after being combined, also can be special phonetic sign indicating number; , vowel and consonant combination, any kind of phonetic sign indicating number in word and word to put together the aforementioned coded sentence set pronunciation, also can be by vowel and consonant, vowel and consonant combination, word, word and word combination in multiple phonetic codes to put together the aforementioned Coding statement set pronunciation, also can be by model sentence sound sign indicating number and Yuan consonant, vowel consonant combination, any kind of sound sign indicating number in word and word splicing aforementioned coding statement set pronunciation, also can be by model sentence sound sign indicating number and Yuan consonant, Combination of vowels and consonants, multiple phonetic codes in words, words and word combinations to put together the pronunciation of the aforementioned coded sentence set, also can combine the pronunciation of the aforementioned coded sentence set by the combination of words Combine the pronunciation of the aforementioned coded sentence set with the model sentence phonetic code, also can put together the pronunciation of the aforementioned coded sentence set by radical combination and one or more phonetic codes in the radical, also can be by vowel consonant, vowel consonant Combination, word, speech, word combination and one or more phonetic codes in the combination of radicals and radicals and one or more phonetic codes in radicals are put together the aforementioned coded sentence set pronunciation, also can be by unit Consonant, vowel-consonant combination, word, word, word combination, model sentence, radical combination and one or more phonetic codes and special phonetic codes in radicals and radicals are used to assemble the aforementioned coded sentence set pronunciation;用以拼合前述编码语句集发音的音码,同样一个音码可以用于前述编码语句集中的不同语句的拼合发音。The phonetic codes used to assemble the pronunciations of the aforementioned coded sentence sets, the same phonetic codes can be used for the combined pronunciations of different sentences in the aforementioned coded sentence sets.6.根据权利要求3所述的语音编码有声读物,其特征在于:6. The speech-encoded audiobook of claim 3, wherein:所述语种1与所述语种2是相同语种或是不同语种;The language 1 and the language 2 are the same language or different languages;所述语句1是属于所述语种1的一个语句,所述语种1是属于印欧语系的一个语种;是用所述编码规则中有关所述语种1的编码细则来编码生成所述拼合信息的值1的,且这种编码规则中有关所述语种1的编码细则是含有压缩编码处理的,这种编码规则中有关所述语种1的编码细则是TTS方式的;The sentence 1 is a sentence belonging to the language 1, and the language 1 is a language belonging to the Indo-European language family; the coding rules about the language 1 in the coding rules are used to encode and generate the combined information The value is 1, and the encoding details about the language 1 in this encoding rule include compressed encoding processing, and the encoding details about the language 1 in this encoding rule are in TTS mode;前述这种编码规则中有关所述语种1的编码细则有定义编码涉及的编码语句集,所述语句1属于此编码语句集;The coding rules about the language 1 in the aforementioned coding rules define the coding sentence set involved in the coding, and the sentence 1 belongs to the coding sentence set;前述这种编码规则中有关所述语种1的编码细则可用于对前述编码语句集中任一语句进行编码以生成表示如何用音码拼合成此任一语句发音的拼合信息的值;The coding rules about the language 1 in the aforementioned coding rules can be used to encode any sentence in the aforementioned coding sentence set to generate a value representing how to combine phonetic codes into the pronunciation of any sentence;对于前述编码语句集中任一语句,可用多个音码来拼合成这任一语句的发音;For any sentence in the aforementioned coded sentence collection, a plurality of phonetic codes can be used to piece together the pronunciation of this arbitrary sentence;可被当成用以拼合前述编码语句集发音的音码,也即前述这种编码规则中有关所述语种1的编码细则可以使用的音码,可以是可表示成用于TTS器件发音的发音信息数据的元辅音或元音辅音组合,也可以是可以转换为用于TTS器件发音的拼法数据或发音信息数据的词根,单词,单词组合或范句,也可以是经组合后可转换为用于TTS器件发音的拼法数据的字符组合或字母,也可以是特殊音码;既可以由元辅音,元音辅音组合和单词中的任一种音码来拼合前述编码语句集发音,也可以由元辅音,元音辅音组合,词根,单词和单词组合中的多种音码来拼合前述编码语句集发音,也可以由范句音码与元辅音,元音辅音组合和单词中的任一种音码来拼合前述编码语句集发音,也可以由范句音码与元辅音,元音辅音组合,词根,单词和单词组合中的多种音码来拼合前述编码语句集发音,也可以由词根和单词组合中的任一种音码来拼合前述编码语句集发音,也可以由词根和单词组合中的任一种音码与范句音码来拼合前述编码语句集发音,也可以由字符组合和字母中的一种或多种音码来拼合前述编码语句集发音,也可以由元辅音,元音辅音组合,词根,单词,单词组合和范句中的一种或多种音码与字符组合和字母中的一种或多种音码来拼合前述编码语句集发音,也可以由元辅音,元音辅音组合,词根,单词,单词组合,范句,字符组合和字母中的一种或多种音码与特殊音码来拼合前述编码语句集发音;Can be regarded as the sound code that is used to put together the pronunciation of the aforementioned coded sentence set, that is, the sound code that can be used in the coding rules about the language 1 in the aforementioned coding rules, can be expressed as pronunciation information for the pronunciation of the TTS device The vowel and consonant or the combination of vowel and consonant of the data can also be the root word, word, word combination or model sentence that can be converted into spelling data or pronunciation information data for TTS device pronunciation, and can also be converted into Character combinations or letters of the spelling data pronounced by the TTS device can also be special phonetic codes; both can be composed of vowels and consonants, vowel-consonant combinations and any phonetic codes in words to splice the pronunciation of the aforementioned coded sentence set, or can By vowel and consonant, vowel and consonant combination, word root, multiple phonetic sign indicating number in word and word combination come together the aforementioned coded statement set pronunciation, also can be any in the pattern sentence phonetic sign indicating number and vowel and consonant, vowel and consonant combination and word A kind of phonetic sign indicating number is put together the aforementioned coded sentence set pronunciation, also can be by model sentence phonetic sign indicating number and vowel consonant, vowel consonant combination, root, word and the multiple phonetic codes in word combination come together the aforementioned coded sentence set pronunciation, also can be by Any one of the phonetic codes in the root and the word combination is used to splice the pronunciation of the aforementioned coded sentence set, and any kind of phonetic code and the model sentence phonetic code in the root and the word combination can be used to piece together the pronunciation of the aforementioned coded sentence set. One or more phonetic codes in combination and letter come together pronunciation of aforementioned coding statement set, also can be by vowel consonant, vowel consonant combination, root, word, one or more phonetic codes in word combination and model sentence and One or more phonetic codes in character combinations and letters are used to combine the pronunciation of the aforementioned coded sentence set, and it can also be composed of vowels and consonants, vowel and consonant combinations, roots, words, word combinations, model sentences, character combinations and letters. Or multiple phonetic codes and special phonetic codes are used to piece together the aforementioned coded sentence set pronunciation;用以拼合前述编码语句集发音的音码,同样一个音码可以用于前述编码语句集中的不同语句的拼合发音。The phonetic codes used to assemble the pronunciations of the aforementioned coded sentence sets, the same phonetic codes can be used for the combined pronunciations of different sentences in the aforementioned coded sentence sets.7.根据权利要求3所述的语音编码有声读物,其特征在于:7. The speech-encoded audiobook of claim 3, wherein:所述语种1与所述语种2是相同语种或是不同语种;The language 1 and the language 2 are the same language or different languages;所述语句1是属于所述语种1的一个语句,所述语种1是属于非印欧语系的一个语种;是用所述编码规则中有关所述语种1的编码细则来编码生成所述拼合信息的值1的,且这种编码规则中有关所述语种1的编码细则是含有压缩编码处理的,这种编码规则中有关所述语种1的编码细则是TTS方式的;The sentence 1 is a sentence belonging to the language 1, and the language 1 is a language belonging to the non-Indo-European language family; the coding rules about the language 1 in the coding rules are used to encode and generate the combined information The value of 1, and the encoding details about the language 1 in this encoding rule include compression encoding processing, and the encoding details about the language 1 in this encoding rule are in TTS mode;前述这种编码规则中有关所述语种1的编码细则有定义编码涉及的编码语句集,所述语句1属于此编码语句集;The coding rules about the language 1 in the aforementioned coding rules define the coding sentence set involved in the coding, and the sentence 1 belongs to the coding sentence set;前述这种编码规则中有关所述语种1的编码细则可用于对前述编码语句集中任一语句进行编码以生成表示如何用音码拼合成此任一语句发音的拼合信息的值;The coding rules about the language 1 in the aforementioned coding rules can be used to encode any sentence in the aforementioned coding sentence set to generate a value representing how to combine phonetic codes into the pronunciation of any sentence;对于前述编码语句集中任一语句,可用多个音码来拼合成这任一语句的发音;For any sentence in the aforementioned coded sentence collection, a plurality of phonetic codes can be used to piece together the pronunciation of this arbitrary sentence;可被当成用以拼合前述编码语句集发音的音码,也即前述这种编码规则中有关所述语种1的编码细则可以使用的音码,可以是可表示成用于TTS器件发音的发音信息数据的元辅音或元音辅音组合,也可以是可以转换为用于TTS器件发音的拼法数据或发音信息数据的字,词,词组合或范句,也可以是经组合后可转换为用于TTS器件发音的拼法数据或发音信息数据的偏旁部首组合或偏旁部首,也可以是特殊音码;既可以由元辅音,元音辅音组合,字和词中的任一种音码来拼合前述编码语句集发音,也可以由元辅音,元音辅音组合,字,词和词组合中的多种音码来拼合前述编码语句集发音,也可以由范句音码与元辅音,元音辅音组合,字和词中的任一种音码来拼合前述编码语句集发音,也可以由范句音码与元辅音,元音辅音组合,字,词和词组合中的多种音码来拼合前述编码语句集发音,也可以由词组合这一种音码来拼合前述编码语句集发音,也可以由词组合音码与范句音码来拼合前述编码语句集发音,也可以由偏旁部首组合和偏旁部首中的一种或多种音码来拼合前述编码语句集发音,也可以由元辅音,元音辅音组合,字,词,词组合和范句中的一种或多种音码与偏旁部首组合和偏旁部首中的一种或多种音码来拼合前述编码语句集发音,也可以由元辅音,元音辅音组合,字,词,词组合,范句,偏旁部首组合和偏旁部首中的一种或多种音码与特殊音码来拼合前述编码语句集发音;Can be regarded as the sound code that is used to put together the pronunciation of the aforementioned coded sentence set, that is, the sound code that can be used in the coding rules about the language 1 in the aforementioned coding rules, can be expressed as pronunciation information for the pronunciation of the TTS device The vowel and consonant or vowel and consonant combination of the data can also be converted into spelling data or pronunciation information data for TTS device pronunciation, words, word combinations or model sentences, and can also be converted into use after combination The combination of radicals or radicals of the spelling data or pronunciation information data pronounced by the TTS device can also be a special phonetic code; it can be composed of vowels and consonants, combinations of vowels and consonants, any phonetic code in words and words To put together the aforementioned coded sentence set pronunciation, also can be by vowel and consonant, vowel and consonant combination, word, the multiple phonetic codes in word and word combination come together the aforementioned coded sentence set pronunciation, also can be by model sentence phonetic code and Yuan consonant, Combination of vowels and consonants, any sound code in words and words to splice together the pronunciation of the aforementioned coded sentence set, and can also be composed of model sentence phonetic codes and vowels and consonants, combinations of vowels and consonants, multiple sounds in words, words and word combinations code to put together the pronunciation of the aforementioned coded sentence set, or the pronunciation of the aforementioned coded sentence set can be put together by the word combination sound code, or the pronunciation of the aforementioned coded sentence set can be put together by the word combination sound code and the model sentence sound code, or can be made by Radical radical combination and one or more phonetic codes in radical radical combine the aforementioned coded statement set pronunciation, also can be by vowel-consonant, vowel-consonant combination, word, word, word combination and model sentence in one or A variety of phonetic codes and radical combinations and one or more phonetic codes in the radicals are used to assemble the pronunciation of the aforementioned coded sentence set, and can also be composed of vowels and consonants, vowel and consonant combinations, words, words, word combinations, model sentences , one or more phonetic codes and special phonetic codes in the combination of radicals and radicals to combine the pronunciation of the aforementioned coded sentence set;用以拼合前述编码语句集发音的音码,同样一个音码可以用于前述编码语句集中的不同语句的拼合发音。The phonetic codes used to assemble the pronunciations of the aforementioned coded sentence sets, the same phonetic codes can be used for the combined pronunciations of different sentences in the aforementioned coded sentence sets.8.根据权利要求3所述的语音编码有声读物,其特征在于:8. The speech-encoded audiobook of claim 3, wherein:所述语种1与所述语种2是相同语种或是不同语种;The language 1 and the language 2 are the same language or different languages;所述语句1是属于所述语种1的一个语句,所述语种1是属于印欧语系的一个语种或是属于非印欧语系的一个语种或是一个复合语种;是用所述编码规则中有关所述语种1的编码细则来编码生成所述拼合信息的值1的,且这种编码规则中有关所述语种1的编码细则是含有压缩编码处理的,这种编码规则中有关所述语种1的编码细则是直播方式或TTS方式或混合方式的;The sentence 1 is a sentence belonging to the language 1, and the language 1 is a language belonging to the Indo-European language family or a language belonging to a non-Indo-European language family or a compound language; The encoding rules of the language 1 are used to encode and generate the value 1 of the combined information, and the encoding rules related to the language 1 in this encoding rule include compression encoding processing, and the encoding rules related to the language 1 The encoding details are live broadcast or TTS or mixed;前述这种编码规则中有关所述语种1的编码细则有定义编码涉及的编码语句集,所述语句1属于此编码语句集;The coding rules about the language 1 in the aforementioned coding rules define the coding sentence set involved in the coding, and the sentence 1 belongs to the coding sentence set;前述这种编码规则中有关所述语种1的编码细则可用于对前述编码语句集中任一语句进行编码以生成表示如何用音码拼合成此任一语句发音的拼合信息的值;The coding rules about the language 1 in the aforementioned coding rules can be used to encode any sentence in the aforementioned coding sentence set to generate a value representing how to combine phonetic codes into the pronunciation of any sentence;对于前述编码语句集中任一语句,可用多个音码来拼合成这任一语句的发音;For any sentence in the aforementioned coded sentence collection, a plurality of phonetic codes can be used to piece together the pronunciation of this arbitrary sentence;用以拼合前述编码语句集发音的音码,同样一个音码可以用于前述编码语句集中的不同语句的拼合发音。The phonetic codes used to assemble the pronunciations of the aforementioned coded sentence sets, the same phonetic codes can be used for the combined pronunciations of different sentences in the aforementioned coded sentence sets.9.据权利要求1,2,3,4,5,6,7或8所述的语音编码有声读物,其特征在于:9. The speech coded audiobook according to claim 1, 2, 3, 4, 5, 6, 7 or 8, characterized in that:所述播出语句发音是播出语句的全部发音或播出语句的非全部发音;The pronunciation of the broadcast sentence is the entire pronunciation of the broadcast sentence or the non-full pronunciation of the broadcast sentence;所述播出所述语句1发音是播出语句1的全部发音或播出语句1的非全部发音;The broadcasting of the pronunciation of the sentence 1 is to broadcast all the pronunciations of the sentence 1 or to broadcast not all the pronunciations of the sentence 1;所述播出前述语句1发音是播出语句1的全部发音或播出语句1的非全部发音;The pronunciation of the aforementioned sentence 1 is to broadcast all the pronunciations of the sentence 1 or to broadcast the non-full pronunciation of the sentence 1;所述读物,可以是书,书籍,印有图文的卡片,报纸杂志或地图;The reading materials can be books, books, cards with pictures and texts, newspapers, magazines or maps;所述语音编码有声读物,有一个或多个页面而且只在一个页面上有码字,或者,有多个页面而且在多个页面上都有码字;所述语音编码有声读物的任一页面上或没有码字,或有一个码字,或有多个处在任意不同位置的码字;The speech-encoded audiobook has one or more pages and codewords on only one page, or has multiple pages and codewords on multiple pages; any page of the speech-encoded audiobook or there is no codeword, or there is a codeword, or there are multiple codewords in any different positions;所述诸个码字中的某一码字是处在所述读物的整页面上需播此码字对应播的数个语句发音的位置;A certain code word in described all code words is the position that needs to broadcast several sentence pronunciations that this code word corresponds to broadcast on the whole page of described reading material;所述诸个码字中的某一码字,可以是导电码,或者可以是条形码,或者可以是图形码,或者可以是磁码;对于所述诸个码字中的某一码字,此码字的载体可以是所述读物的一个页,此码字的载体平面是所述读物的一个页面,或者,此码字的载体可以是一小块平面材料,此码字的载体平面是平面材料的表面,此平面材料的表面上有此码字,此平面材料是粘贴在所述读物的一个页面上的。A certain code word in the said all code words can be a conductive code, or can be a bar code, or can be a graphic code, or can be a magnetic code; for a certain code word in the said all code words, this The carrier of the codeword can be a page of the reading material, and the carrier plane of the codeword is a page of the reading material, or the carrier of the codeword can be a small piece of planar material, and the carrier plane of the codeword is a plane On the surface of the material, there is the code word on the surface of the plane material, and the plane material is pasted on a page of the reading material.10.一种可与权利要求1所述的语音编码有声读物配合使用播出语句发音的拾音器主体,此拾音器主体也就是拾音器,此拾音器包括有一个读码头,一个发音器件,一个单片机芯片,此单片机芯片是前述拾音器的主处理芯片,此单片机芯片与前述读码头意向连接,此单片机芯片与前述发音器件意向连接,前述拾音器在使用时,需用到存储器集,此存储器集用以存放程序和数据,此存储器集里的各存储器可被前述拾音器的前述单片机芯片读取使用,前述语音编码有声读物的主体是读物,在前述读物的整页面上有诸个码字,此诸个码字属于前述语音编码有声读物,前述拾音器在被使用时,前述拾音器的前述单片机芯片可以利用前述拾音器的前述读码头来读取前述诸个码字中的部诸码字中的每个码字,其特征在于:10. A pick-up main body that can be used in conjunction with the speech coded audiobook as claimed in claim 1 to broadcast a sentence pronunciation, this pick-up main body is also a pick-up, and this pick-up includes a reading terminal, a pronunciation device, a single-chip microcomputer chip, the The single-chip microcomputer chip is the main processing chip of the aforementioned pickup, and this single-chip microcomputer chip is connected with the aforementioned reading terminal intentionally, and this single-chip microcomputer chip is connected with the aforementioned pronunciation device intentionally. Data, each memory in this memory set can be read and used by the aforementioned single-chip microcomputer chip of the aforementioned pickup, and the main body of the aforementioned audio-coded audiobook is the reading material, and there are various codewords on the entire page of the aforementioned reading material, and these codewords belong to For the speech coded audiobook, when the aforementioned pickup is in use, the aforementioned single-chip microcomputer chip of the aforementioned pickup can utilize the aforementioned reading terminal of the aforementioned pickup to read each codeword in some of the aforementioned codewords. in:前述诸个码字中的每个码字的码值是共与数个表示如何用音码拼合成语句发音的拼合信息的值有关的;前述拾音器在被使用时,由前述拾音器的前述单片机芯片根据读得的前述部诸码字中的码字的码值使前述拾音器的前述发音器件播出语句发音。The code value of each code word in the aforementioned code words is related to the value of the information of how to piece together the pronunciation of a sentence with several representations in total; According to the code value of the code word in the code words of the aforementioned parts read, the aforementioned pronunciation device of the aforementioned sound pick-up is played out the sentence pronunciation.11.根据权利要求10所述的拾音器主体,其特征在于:11. The pickup body according to claim 10, characterized in that:所述单片机芯片与所述读码头直接连接或间接连接;所述单片机芯片与所述发音器件直接连接或间接连接;The single-chip microcomputer chip is directly or indirectly connected to the reading dock; the single-chip microcomputer chip is directly or indirectly connected to the pronunciation device;所述存储器集可以是所述单片机芯片内的数个存储器,或可以是所述单片机芯片外所述拾音器主体内的数个存储器,或可以是所述单片机芯片内的数个存储器加上所述单片机芯片外所述拾音器主体内的数个存储器;所述拾音器涉及的程序及所述拾音器涉及的数据存储在所述存储器集里;Described memory set can be several memories in the single-chip microcomputer chip, or can be several memories in the pickup main body outside the described single-chip microcomputer chip, or can be several memories in the described single-chip microcomputer chip plus the described Several memories in the main body of the pickup outside the single-chip microcomputer chip; the programs related to the pickup and the data related to the pickup are stored in the memory set;所述部诸码字是所述诸个码字中的全部码字或部分码字;所述诸个码字可以是一个或多个码字;所述部诸码字可以是一个或多个码字;All codewords of described part are all codewords or part codewords in described all codewords; Described all codewords can be one or more codewords; All codewords of described part can be one or more codewords Codeword;对于所述部诸码字中的一个码字涉及的表示如何用音码拼合成语句1发音的拼合信息的值,此表示如何用音码拼合成此语句1发音的拼合信息的值是按照一种编码规则来编码生成的,此编码规则编码生成此表示如何用音码拼合成此语句1发音的拼合信息的值时是有压缩编码处理的,此表示如何用音码拼合成此语句1发音的拼合信息的值也是拼合信息的值1,只需从前述一个码字即可得到前述拼合信息的值1或需从包括前述一个码字在内的所述部诸码字中的多个码字才能得到前述拼合信息的值1,所述拾音器的所述单片机芯片利用所述读码头可读取前述一个码字或前述多个码字而得到前述拼合信息的值1,所述拾音器的所述单片机芯片是按一种解码规则来对读码得到的前述拼合信息的值1执行解码使所述拾音器的所述发音器件播出前述语句1发音的,而且此解码规则在对前述拼合信息的值1执行解码时是有压缩解码处理的;For the value of the combination information of how a code word in the code words of the said part relates to how to piece together the pronunciation of the sentence 1 with the phonetic sign indicating number, this expression is according to a This encoding rule is used to encode and generate the value of the combination information indicating how to use the phonetic codes to form the pronunciation of this statement 1. This indicates how to use the phonetic codes to form the pronunciation of the statement 1. The value of the mosaic information is also the value 1 of the mosaic information, the value 1 of the aforementioned mosaic information only needs to be obtained from the aforementioned code word or needs to be obtained from a plurality of codes in the described code words including the aforementioned code word word to obtain the value 1 of the aforementioned combination information, the single-chip microcomputer chip of the pickup can read the aforementioned codeword or the aforementioned multiple codewords and obtain the value 1 of the aforementioned combination information by using the reading dock. The single-chip microcomputer chip performs decoding on the value 1 of the aforementioned combination information obtained by reading the code according to a decoding rule to make the pronunciation device of the pickup play the pronunciation of the aforementioned sentence 1, and this decoding rule is used for the combination of the aforementioned combination information. When the value 1 performs decoding, there is compression decoding processing;对于前述拼合信息的值1涉及的前述语句1,可以是一个完整句子,或者,是一个完整句子的部分,或者,是含有多个完整句子,或者,是含有数个完整句子和数个非完整句子,或者,是含有多个非完整句子。For the aforementioned sentence 1 involved in the value 1 of the aforementioned combined information, it can be a complete sentence, or a part of a complete sentence, or contain multiple complete sentences, or contain several complete sentences and several incomplete sentences. Sentences, or, are multiple incomplete sentences.12.根据权利要求11所述的拾音器主体,其特征在于:12. The pickup body according to claim 11, characterized in that:所述拼合信息的值1是按照所述编码规则中有关语种1的编码细则来编码生成的,此编码规则中有关语种1的编码细则是含有压缩编码处理的,此编码规则是只适用于语种1或适用于包括语种1在内的多个语种的一种编码规则,所述拾音器的所述单片机芯片是按所述解码规则中有关语种2的解码细则来对读码得到的所述拼合信息的值1执行解码使所述拾音器的所述发音器件播出所述语句1发音的,而且此解码规则中有关语种2的解码细则是含有压缩解码处理的,所述解码规则是只适用于语种2或适用于包括语种2在内的多个语种的一种解码规则。The value 1 of the spliced information is generated by encoding according to the encoding rules related to language 1 in the encoding rules. The encoding rules related to language 1 in the encoding rules include compression encoding processing. This encoding rule is only applicable to languages 1 or a coding rule applicable to multiple languages including language 1, the single-chip microcomputer chip of the pickup is the combination information obtained by reading the code according to the decoding rules of the relevant language 2 in the decoding rule The value 1 performs decoding to make the pronunciation device of the pickup play the pronunciation of the sentence 1, and the decoding details about the language 2 in this decoding rule contain compression decoding processing, and the decoding rule is only applicable to the language 2 or a decoding rule applicable to multiple languages including language 2.13.根据权利要求12所述的拾音器主体,其特征在于:13. The pickup body according to claim 12, characterized in that:所述语种1与所述语种2是相同语种;The language 1 and the language 2 are the same language;所述语句1是属于所述语种1的一个语句,所述语种1是属于印欧语系的一个语种;是用所述编码规则中有关所述语种1的编码细则来编码生成所述拼合信息的值1的,且这种编码规则中有关所述语种1的编码细则是含有压缩编码处理的;The sentence 1 is a sentence belonging to the language 1, and the language 1 is a language belonging to the Indo-European language family; the coding rules about the language 1 in the coding rules are used to encode and generate the combined information The value is 1, and the coding details about the language 1 in this coding rule include compressed coding processing;前述这种编码规则中有关所述语种1的编码细则有定义编码涉及的编码语句集,所述语句1属于此编码语句集;The coding rules about the language 1 in the aforementioned coding rules define the coding sentence set involved in the coding, and the sentence 1 belongs to the coding sentence set;所述语种2是属于印欧语系的一个语种;所述单片机芯片是按所述解码规则中有关所述语种2的解码细则来对所述拼合信息的值1执行解码使所述拾音器的所述发音器件播出所述语句1发音的,这种解码规则中有关所述语种2的解码细则是含有压缩解码处理的,这种解码规则中有关所述语种2的解码细则是直播方式的;The language 2 is a language belonging to the Indo-European language family; the single-chip microcomputer chip performs decoding on the value 1 of the combined information according to the decoding rules related to the language 2 in the decoding rules to make the pickup. The pronunciation device broadcasts the pronunciation of the sentence 1, and the decoding details about the language 2 in the decoding rules contain compression decoding processing, and the decoding details about the language 2 in the decoding rules are in a live broadcast mode;前述这种解码规则中有关所述语种2的解码细则有定义解码涉及的解码语句集,所述语句1属于此解码语句集;The decoding rules about the language 2 in the aforementioned decoding rules define the set of decoding sentences involved in decoding, and the sentence 1 belongs to the set of decoding sentences;前述这种解码规则中有关所述语种2的解码细则与前述这种编码规则中有关所述语种1的编码细则是专门对应的或是一般对应的;The decoding rules about the language 2 in the aforementioned decoding rules correspond specifically or generally to the encoding rules about the language 1 in the aforementioned encoding rules;前述这种编码规则中有关所述语种1的编码细则可用于对前述编码语句集中任一语句进行编码以生成表示如何用音码拼合成此任一语句发音的拼合信息的值;所述单片机芯片可按前述这种解码规则中有关所述语种2的解码细则来对表示如何用音码拼合成前述解码语句集中任一语句发音的拼合信息的值进行解码使所述拾音器的所述发音器件播出语句发音;The coding rules about the language 1 in the aforementioned coding rules can be used to encode any sentence in the aforementioned coding sentence set to generate the value of how to use phonetic codes to splicing into this arbitrary sentence pronunciation; the single-chip microcomputer chip Can be decoded according to the decoding rules about the language 2 in the aforementioned decoding rules to represent how to splicing into the value of the splicing information of any sentence pronunciation in the aforementioned decoding sentence collection with the phonetic sign indicating number Pronunciation of sentences;对于前述编码语句集中任一语句,可用多个音码来拼合成这任一语句的发音;可被当成用以拼合前述编码语句集发音的音码,也即前述这种编码规则中有关所述语种1的编码细则可以使用的音码,可以是元辅音或元音辅音组合,也可以是词根,单词,单词组合或范句,也可以是字符组合或字母,也可以是特殊音码;For any sentence in the aforementioned coded sentence set, multiple phonetic codes can be used to piece together the pronunciation of this arbitrary sentence; The phonetic codes that can be used in the coding rules of language 1 can be vowels and consonants or combinations of vowels and consonants, or roots, words, word combinations or model sentences, or character combinations or letters, or special phonetic codes;对于前述解码语句集中任一语句,可用多个音码来拼合成这任一语句的发音;可被当成用以拼合前述解码语句集发音的音码,也即前述这种解码规则中有关所述语种2的解码细则可以使用的音码,可以是元辅音或元音辅音组合,也可以是词根,单词,单词组合或范句,也可以是字符组合或字母,也可以是特殊音码;For any sentence in the foregoing decoded sentence collection, multiple phonetic codes can be used to piece together the pronunciation of this arbitrary sentence; The sound codes that can be used in the decoding rules of language 2 can be vowels and consonants or combinations of vowels and consonants, or roots, words, word combinations or model sentences, or character combinations or letters, or special sound codes;并且前述这种解码规则中有关所述语种2的解码细则对音码的定义方式与前述这种编码规则中有关所述语种1的编码细则对音码的定义方式是相同或兼容的;And the definition mode of the decoding rules about the language 2 in the aforementioned decoding rules is the same or compatible with the definition of the phonetic codes in the coding rules about the language 1 in the aforementioned coding rules;所述存储器集里存储有用于解码发音的诸音码的发音数据;既可以由元辅音,元音辅音组合和单词中的任一种音码来拼合前述编码语句集发音,也可以由元辅音,元音辅音组合,词根,单词和单词组合中的多种音码来拼合前述编码语句集发音,也可以由范句音码与元辅音,元音辅音组合和单词中的任一种音码来拼合前述编码语句集发音,也可以由范句音码与元辅音,元音辅音组合,词根,单词和单词组合中的多种音码来拼合前述编码语句集发音,也可以由词根和单词组合中的任一种音码来拼合前述编码语句集发音,也可以由词根和单词组合中的任一种音码与范句音码来拼合前述编码语句集发音,也可以由字符组合和字母中的一种或多种音码来拼合前述编码语句集发音,也可以由元辅音,元音辅音组合,词根,单词,单词组合和范句中的一种或多种音码与字符组合和字母中的一种或多种音码来拼合前述编码语句集发音,也可以由元辅音,元音辅音组合,词根,单词,单词组合,范句,字符组合和字母中的一种或多种音码与特殊音码来拼合前述编码语句集发音;The pronunciation data of all phonetic codes that are used to decode pronunciation is stored in the described memory set; Both can be by vowel consonant, any kind of phonetic code in the vowel-consonant combination and word to piece together the aforementioned coded sentence set pronunciation, also can be by vowel consonant , vowel and consonant combination, word root, multiple phonetic codes in words and word combinations to piece together the aforementioned coded statement set pronunciation, also can be any kind of phonetic code in the model sentence phonetic code and vowel and consonant, vowel and consonant combination and word To put together the pronunciation of the aforementioned coded sentence set, it is also possible to put together the aforementioned coded sentence set pronunciation by the model sentence phonetic sign indicating number and vowel and consonant, the combination of vowel and consonant, word root, word and word combination Any kind of phonetic sign indicating number in combination comes together pronunciation of aforementioned coded sentence collection, also can come together pronunciation of aforementioned coded sentence collection by any kind of phonetic code and model sentence phonetic code in root and word combination, also can be by character combination and letter One or more phonetic codes in the above-mentioned coded sentence set pronunciation are spliced together, and one or more phonetic codes and character combinations and One or more phonetic codes in the letters are used to assemble the pronunciation of the aforementioned coded sentence set, or one or more of vowels and consonants, combinations of vowels and consonants, roots, words, word combinations, model sentences, character combinations and letters Phonetic sign indicating number and special phonetic sign indicating number come together the pronunciation of aforementioned coded sentence set;用以拼合前述编码语句集发音的音码,同样一个音码可以用于前述编码语句集中的不同语句的拼合发音;In order to piece together the phonetic sign indicating number of pronunciation of aforementioned coded sentence set, the same phonetic code can be used for the combined pronunciation of the different sentences in aforementioned coded sentence set;既可以由元辅音,元音辅音组合和单词中的任一种音码来拼合前述解码语句集发音,也可以由元辅音,元音辅音组合,词根,单词和单词组合中的多种音码来拼合前述解码语句集发音,也可以由范句音码与元辅音,元音辅音组合和单词中的任一种音码来拼合前述解码语句集发音,也可以由范句音码与元辅音,元音辅音组合,词根,单词和单词组合中的多种音码来拼合前述解码语句集发音,也可以由词根和单词组合中的任一种音码来拼合前述解码语句集发音,也可以由词根和单词组合中的任一种音码与范句音码来拼合前述解码语句集发音,也可以由字符组合和字母中的一种或多种音码来拼合前述解码语句集发音,也可以由元辅音,元音辅音组合,词根,单词,单词组合和范句中的一种或多种音码与字符组合和字母中的一种或多种音码来拼合前述解码语句集发音,也可以由元辅音,元音辅音组合,词根,单词,单词组合,范句,字符组合和字母中的一种或多种音码与特殊音码来拼合前述解码语句集发音;Both can be by vowel and consonant, vowel and consonant combination and any kind of phonetic sign indicating number in word to piece together the pronunciation of aforementioned decoded statement set, also can be by vowel and consonant, vowel and consonant combination, word root, multiple sound sign indicating numbers in word and word combination To put together the aforementioned decoding sentence set pronunciation, also can be by model sentence sound sign indicating number and vowel consonant, any kind of sound sign indicating number in the combination of vowel and consonant and word put together the aforementioned decoding sentence set pronunciation, also can be by model sentence sound sign indicating number and vowel consonant , the combination of vowels and consonants, root, word and multiple phonetic codes in the combination of words to splice the pronunciation of the aforementioned decoding sentence set, and also can splicing the pronunciation of the aforementioned decoding sentence set by any sound code in the root and the combination of words, or can By any kind of phonetic sign indicating number and model sentence phonetic sign indicating number in root and word combination, the aforementioned decoding sentence set pronunciation can be put together, and the aforementioned decoding sentence set pronunciation can also be put together by one or more sound codes in character combination and letter, also Can be by vowel consonant, vowel and consonant combination, root, word, word combination and one or more phonetic codes in the model sentence and one or more phonetic codes in the character combination and letter to piece together the aforementioned decoding statement set pronunciation, Also can be by vowel and consonant, vowel and consonant combination, word root, word, word combination, model sentence, character combination and one or more phonetic codes and special phonetic codes in the letter to piece together the aforementioned decoding sentence set pronunciation;用以拼合前述解码语句集发音的音码,同样一个音码可以用于前述解码语句集中的不同语句的拼合发音;In order to piece together the phonetic sign indicating number of pronunciation of aforementioned decoded sentence set, same sound code can be used for the combined pronunciation of the different sentences of aforementioned decoded sentence set;可按照前述这种解码规则中有关所述语种2的解码细则来对表示前述解码语句集中语句发音的拼合信息的值进行解码的解码程序及解码数据存储在所述存储器集里。The decoding program and decoding data that can decode the value of the combined information representing the pronunciation of the sentences in the decoding sentence set according to the decoding rules related to the language 2 in the aforementioned decoding rules are stored in the memory set.14.根据权利要求12所述的拾音器主体,其特征在于:14. The pickup body according to claim 12, characterized in that:所述语种1与所述语种2是相同语种;The language 1 and the language 2 are the same language;所述语句1是属于所述语种1的一个语句,所述语种1是属于非印欧语系的一个语种;是用所述编码规则中有关所述语种1的编码细则来编码生成所述拼合信息的值1的,且这种编码规则中有关所述语种1的编码细则是含有压缩编码处理的;The sentence 1 is a sentence belonging to the language 1, and the language 1 is a language belonging to the non-Indo-European language family; the coding rules about the language 1 in the coding rules are used to encode and generate the combined information The value of 1, and the encoding details of the language 1 in this encoding rule include compression encoding processing;前述这种编码规则中有关所述语种1的编码细则有定义编码涉及的编码语句集,所述语句1属于此编码语句集;The coding rules about the language 1 in the aforementioned coding rules define the coding sentence set involved in the coding, and the sentence 1 belongs to the coding sentence set;所述语种2是属于非印欧语系的一个语种;所述单片机芯片是按所述解码规则中有关所述语种2的解码细则来对所述拼合信息的值1执行解码使所述拾音器的所述发音器件播出语句发音的,这种解码规则中有关所述语种2的解码细则是含有压缩解码处理的,这种解码规则中有关所述语种2的解码细则是直播方式的;The language type 2 is a language type belonging to the non-Indo-European language family; the single-chip microcomputer chip performs decoding on the value 1 of the splicing information according to the decoding rules about the language type 2 in the decoding rules so that all of the pickups Said pronunciation device broadcasts the sentence pronunciation, the decoding detail about said language 2 in this decoding rule is to contain compression decoding processing, and the decoding detail about said language 2 in this decoding rule is the live mode;前述这种解码规则中有关所述语种2的解码细则有定义解码涉及的解码语句集,所述语句1属于此解码语句集;The decoding rules about the language 2 in the aforementioned decoding rules define the set of decoding sentences involved in decoding, and the sentence 1 belongs to the set of decoding sentences;前述这种解码规则中有关所述语种2的解码细则与前述这种编码规则中有关所述语种1的编码细则是专门对应的或是一般对应的;The decoding rules about the language 2 in the aforementioned decoding rules correspond specifically or generally to the encoding rules about the language 1 in the aforementioned encoding rules;前述这种编码规则中有关所述语种1的编码细则可用于对前述编码语句集中任一语句进行编码以生成表示如何用音码拼合成此任一语句发音的拼合信息的值;所述单片机芯片可按前述这种解码规则中有关所述语种2的解码细则来对表示如何用音码拼合成前述解码语句集中任一语句发音的拼合信息的值进行解码使所述拾音器的所述发音器件播出语句发音;The coding rules about the language 1 in the aforementioned coding rules can be used to encode any sentence in the aforementioned coding sentence set to generate the value of how to use phonetic codes to splicing into this arbitrary sentence pronunciation; the single-chip microcomputer chip Can be decoded according to the decoding rules about the language 2 in the aforementioned decoding rules to represent how to splicing into the value of the splicing information of any sentence pronunciation in the aforementioned decoding sentence collection with the phonetic sign indicating number Pronunciation of sentences;对于前述编码语句集中任一语句,可用多个音码来拼合成这任一语句的发音;可被当成用以拼合前述编码语句集发音的音码,也即前述这种编码规则中有关所述语种1的编码细则可以使用的音码,可以是元辅音或元音辅音组合,也可以是字,词,词组合或范句,也可以是偏旁部首组合或偏旁部首,也可以是特殊音码;For any sentence in the aforementioned coded sentence set, multiple phonetic codes can be used to piece together the pronunciation of this arbitrary sentence; The phonetic code that can be used in the coding rules of language 1 can be a vowel and consonant or a combination of vowels and consonants, it can also be a word, word, word combination or model sentence, it can also be a combination of radicals or radicals, or a special sound code;对于前述解码语句集中任一语句,可用多个音码来拼合成这任一语句的发音;可被当成用以拼合前述解码语句集发音的音码,也即前述这种解码规则中有关所述语种2的解码细则可以使用的音码,可以是元辅音或元音辅音组合,也可以是字,词,词组合或范句,也可以是偏旁部首组合或偏旁部首,也可以是特殊音码;For any sentence in the foregoing decoded sentence collection, multiple phonetic codes can be used to piece together the pronunciation of this arbitrary sentence; The sound code that can be used in the decoding rules of language 2 can be a vowel and consonant or a combination of vowels and consonants, it can also be a word, word, word combination or model sentence, it can also be a combination of radicals or radicals, or it can be a special sound code;并且前述这种解码规则中有关所述语种2的解码细则对音码的定义方式与前述这种编码规则中有关所述语种1的编码细则对音码的定义方式是相同或兼容的;And the definition mode of the decoding rules about the language 2 in the aforementioned decoding rules is the same or compatible with the definition of the phonetic codes in the coding rules about the language 1 in the aforementioned coding rules;所述存储器集里存储有用于解码发音的诸音码的发音数据;既可以由元辅音,元音辅音组合,字和词中的任一种音码来拼合前述编码语句集发音,也可以由元辅音,元音辅音组合,字,词和词组合中的多种音码来拼合前述编码语句集发音,也可以由范句音码与元辅音,元音辅音组合,字和词中的任一种音码来拼合前述编码语句集发音,也可以由范句音码与元辅音,元音辅音组合,字,词和词组合中的多种音码来拼合前述编码语句集发音,也可以由词组合这一种音码来拼合前述编码语句集发音,也可以由词组合音码与范句音码来拼合前述编码语句集发音,也可以由偏旁部首组合和偏旁部首中的一种或多种音码来拼合前述编码语句集发音,也可以由元辅音,元音辅音组合,字,词,词组合和范句中的一种或多种音码与偏旁部首组合和偏旁部首中的一种或多种音码来拼合前述编码语句集发音,也可以由元辅音,元音辅音组合,字,词,词组合,范句,偏旁部首组合和偏旁部首中的一种或多种音码与特殊音码来拼合前述编码语句集发音;用以拼合前述编码语句集发音的音码,同样一个音码可以用于前述编码语句集中的不同语句的拼合发音;The pronunciation data of all phonetic codes that are used to decode pronunciation is stored in the described memory set; Both can be by vowel and consonant, vowel and consonant combination, any kind of phonetic code in word and word comes together the aforementioned coded sentence set pronunciation, also can be by Yuan and consonants, vowel and consonant combinations, multiple phonetic codes in words, words and word combinations to put together the aforementioned coded sentence set pronunciation, also can be by model sentence phonetic code and vowel and consonant, vowel and consonant combinations, arbitrary in words and words A kind of phonetic sign indicating number is put together the aforementioned coded sentence set pronunciation, also can be combined by model sentence phonetic sign indicating number and vowel consonant, the multiple phonetic codes in word, word and word combination come together the aforementioned coded sentence set pronunciation, also can This kind of phonetic sign indicating number of word combination comes together pronunciation of aforementioned coded sentence set, also can come together pronunciation of aforementioned encoded sentence set by word combination phonetic sign indicating number and model sentence phonetic sign indicating number, also can be by a radical combination and radical in radical One or more phonetic codes can be used to put together the pronunciation of the aforementioned coded sentence set, and can also be composed of one or more phonetic codes and radical combinations and radicals in vowel-consonant, vowel-consonant combination, word, word, word combination and model sentence One or more phonetic codes in the radicals are used to piece together the pronunciation of the aforementioned coded sentence set, and can also be composed of vowels and consonants, vowel-consonant combinations, words, words, word combinations, model sentences, radical combinations and radicals. One or more phonetic codes and special phonetic codes are used to assemble the pronunciation of the aforementioned coded sentence collection; the phonetic codes used to piece together the pronunciation of the aforementioned coded sentence collection, and the same phonetic code can be used for the combined pronunciation of different sentences in the aforementioned coded sentence collection;既可以由元辅音,元音辅音组合,字和词中的任一种音码来拼合前述解码语句集发音,也可以由元辅音,元音辅音组合,字,词和词组合中的多种音码来拼合前述解码语句集发音,也可以由范句音码与元辅音,元音辅音组合,字和词中的任一种音码来拼合前述解码语句集发音,也可以由范句音码与元辅音,元音辅音组合,字,词和词组合中的多种音码来拼合前述解码语句集发音,也可以由词组合这一种音码来拼合前述解码语句集发音,也可以由词组合音码与范句音码来拼合前述解码语句集发音,也可以由偏旁部首组合和偏旁部首中的一种或多种音码来拼合前述解码语句集发音,也可以由元辅音,元音辅音组合,字,词,词组合和范句中的一种或多种音码与偏旁部首组合和偏旁部首中的一种或多种音码来拼合前述解码语句集发音,也可以由元辅音,元音辅音组合,字,词,词组合,范句,偏旁部首组合和偏旁部首中的一种或多种音码与特殊音码来拼合前述解码语句集发音;用以拼合前述解码语句集发音的音码,同样一个音码可以用于前述解码语句集中的不同语句的拼合发音;Both can be by vowel and consonant, vowel and consonant combination, any kind of phonetic sign indicating number in word and word to piece together the pronunciation of aforementioned decoded statement set, also can be by vowel and consonant, vowel and consonant combination, multiple in word, word and word combination Sound sign indicating number is put together the aforementioned decoding statement set pronunciation, also can be by model sentence sound sign indicating number and vowel consonant, vowel and consonant combination, any kind of sound sign indicating number in word and word is put together aforementioned decoding sentence set pronunciation, also can be by model sentence sound code and vowel-consonant, vowel-consonant combination, word, multiple phonetic codes in word and word combination come together the aforementioned decoding statement set pronunciation, also can put together the aforementioned decoding statement set pronunciation by word combination this kind of phonetic sign indicating number, also can The pronunciation of the aforementioned decoded sentence set is assembled by the word combination phonetic code and the model sentence phonetic code, and the pronunciation of the aforementioned decoded sentence set can also be assembled by one or more phonetic codes in the combination of radicals and radicals. Consonant, vowel-consonant combination, word, word, word combination and one or more phonetic codes in the model sentence and one or more phonetic codes in the radical combination and the radical radical are used to stitch together the pronunciation of the aforementioned decoding sentence set , also can be by vowel consonant, vowel and consonant combination, word, word, word combination, model sentence, one or more phonetic codes and special phonetic codes in radical combination and radicals to put together the aforementioned decoding statement set pronunciation ; In order to piece together the phonetic sign indicating number of pronunciation of aforementioned decoding sentence set, same sound sign indicating number can be used for the splicing pronunciation of the different sentences of aforementioned decoding sentence set;可按照前述这种解码规则中有关所述语种2的解码细则来对表示前述解码语句集中语句发音的拼合信息的值进行解码的解码程序及解码数据存储在所述存储器集里。The decoding program and decoding data that can decode the value of the combined information representing the pronunciation of the sentences in the decoding sentence set according to the decoding rules related to the language 2 in the aforementioned decoding rules are stored in the memory set.15.根据权利要求12所述的拾音器主体,其特征在于:15. The pickup body according to claim 12, characterized in that:所述语种1与所述语种2是相同语种;The language 1 and the language 2 are the same language;所述语句1是属于所述语种1的一个语句,所述语种1是属于印欧语系的一个语种;是用所述编码规则中有关所述语种1的编码细则来编码生成所述拼合信息的值1的,且这种编码规则中有关所述语种1的编码细则是含有压缩编码处理的;The sentence 1 is a sentence belonging to the language 1, and the language 1 is a language belonging to the Indo-European language family; the coding rules about the language 1 in the coding rules are used to encode and generate the combined information The value is 1, and the coding details about the language 1 in this coding rule include compressed coding processing;前述这种编码规则中有关所述语种1的编码细则有定义编码涉及的编码语句集,所述语句1属于此编码语句集;The coding rules about the language 1 in the aforementioned coding rules define the coding sentence set involved in the coding, and the sentence 1 belongs to the coding sentence set;所述语种2是属于印欧语系的一个语种;所述单片机芯片是按所述解码规则中有关所述语种2的解码细则来对所述拼合信息的值1执行解码使所述拾音器的所述发音器件播出语句发音的,这种解码规则中有关所述语种2的解码细则是含有压缩解码处理的,这种解码规则中有关所述语种2的解码细则是TTS方式的;The language 2 is a language belonging to the Indo-European language family; the single-chip microcomputer chip performs decoding on the value 1 of the combined information according to the decoding rules related to the language 2 in the decoding rules to make the pickup. The pronunciation device broadcasts the pronunciation of the sentence, the decoding details about the language 2 in the decoding rules contain compression decoding processing, and the decoding details about the language 2 in the decoding rules are in the TTS mode;前述这种解码规则中有关所述语种2的解码细则有定义解码涉及的解码语句集,所述语句1属于此解码语句集;The decoding rules about the language 2 in the aforementioned decoding rules define the set of decoding sentences involved in decoding, and the sentence 1 belongs to the set of decoding sentences;前述这种解码规则中有关所述语种2的解码细则与前述这种编码规则中有关所述语种1的编码细则是专门对应的或是一般对应的;The decoding rules about the language 2 in the aforementioned decoding rules correspond specifically or generally to the encoding rules about the language 1 in the aforementioned encoding rules;前述这种编码规则中有关所述语种1的编码细则可用于对前述编码语句集中任一语句进行编码以生成表示如何用音码拼合成此任一语句发音的拼合信息的值;所述单片机芯片可按前述这种解码规则中有关所述语种2的解码细则来对表示如何用音码拼合成前述解码语句集中任一语句发音的拼合信息的值进行解码使所述拾音器的所述发音器件播出语句发音;The coding rules about the language 1 in the aforementioned coding rules can be used to encode any sentence in the aforementioned coding sentence set to generate the value of how to use phonetic codes to splicing into this arbitrary sentence pronunciation; the single-chip microcomputer chip Can be decoded according to the decoding rules about the language 2 in the aforementioned decoding rules to represent how to splicing into the value of the splicing information of any sentence pronunciation in the aforementioned decoding sentence collection with the phonetic sign indicating number Pronunciation of sentences;对于前述编码语句集中任一语句,可用多个音码来拼合成这任一语句的发音;可被当成用以拼合前述编码语句集发音的音码,也即前述这种编码规则中有关所述语种1的编码细则可以使用的音码,可以是元辅音或元音辅音组合,也可以是词根,单词,单词组合或范句,也可以是字符组合或字母,也可以是特殊音码;For any sentence in the aforementioned coded sentence set, multiple phonetic codes can be used to piece together the pronunciation of this arbitrary sentence; The phonetic codes that can be used in the coding rules of language 1 can be vowels and consonants or combinations of vowels and consonants, or roots, words, word combinations or model sentences, or character combinations or letters, or special phonetic codes;对于前述解码语句集中任一语句,可用多个音码来拼合成这任一语句的发音;可被当成用以拼合前述解码语句集发音的音码,也即前述这种解码规则中有关所述语种2的解码细则可以使用的音码,可以是元辅音或元音辅音组合,也可以是词根,单词,单词组合或范句,也可以是字符组合或字母,也可以是特殊音码;For any sentence in the foregoing decoded sentence collection, multiple phonetic codes can be used to piece together the pronunciation of this arbitrary sentence; The sound codes that can be used in the decoding rules of language 2 can be vowels and consonants or combinations of vowels and consonants, or roots, words, word combinations or model sentences, or character combinations or letters, or special sound codes;并且前述这种解码规则中有关所述语种2的解码细则对音码的定义方式与前述这种编码规则中有关所述语种1的编码细则对音码的定义方式是相同或兼容的;And the definition mode of the decoding rules about the language 2 in the aforementioned decoding rules is the same or compatible with the definition of the phonetic codes in the coding rules about the language 1 in the aforementioned coding rules;既可以由元辅音,元音辅音组合和单词中的任一种音码来拼合前述编码语句集发音,也可以由元辅音,元音辅音组合,词根,单词和单词组合中的多种音码来拼合前述编码语句集发音,也可以由范句音码与元辅音,元音辅音组合和单词中的任一种音码来拼合前述编码语句集发音,也可以由范句音码与元辅音,元音辅音组合,词根,单词和单词组合中的多种音码来拼合前述编码语句集发音,也可以由词根和单词组合中的任一种音码来拼合前述编码语句集发音,也可以由词根和单词组合中的一种音码与范句音码来拼合前述编码语句集发音,也可以由字符组合和字母中的一种或多种音码来拼合前述编码语句集发音,也可以由元辅音,元音辅音组合,词根,单词,单词组合和范句中的一种或多种音码与字符组合和字母中的一种或多种音码来拼合前述编码语句集发音,也可以由元辅音,元音辅音组合,词根,单词,单词组合,范句,字符组合和字母中的一种或多种音码与特殊音码来拼合前述编码语句集发音;Both can be combined by vowel and consonant, vowel and consonant combination and any kind of phonetic sign indicating number in word to piece together the pronunciation of aforementioned coded statement set, also can be by vowel and consonant, vowel and consonant combination, word root, multiple sound sign indicating numbers in word and word combination To put together the aforementioned coded sentence set pronunciation, also can be by model sentence sound sign indicating number and Yuan consonant, vowel consonant combination and any kind of sound sign indicating number in word to put together aforementioned coded sentence set pronunciation, also can be by model sentence sound sign indicating number and Yuan consonant , vowel and consonant combination, root, multiple phonetic codes in word and word combination come together pronunciation of aforementioned coded statement set, also can come together pronunciation of aforementioned coded statement set by any kind of phonetic code in root and word combination, also can By a kind of phonetic sign indicating number and model sentence phonetic sign indicating number in the root and word combination, the pronunciation of the aforementioned coded sentence set can be put together, and the pronunciation of the aforementioned coded sentence set can also be put together by one or more phonetic codes in the character combination and letters. By vowel and consonant, vowel and consonant combination, root, word, word combination and one or more sound codes in the model sentence and one or more sound codes in the character combination and letter to piece together the aforementioned coded sentence set pronunciation, also One or more phonetic codes and special phonetic codes in the vowel and consonant, the combination of vowel and consonant, word root, word, word combination, model sentence, character combination and letter can be used to assemble the pronunciation of the aforementioned coded sentence set;用以拼合前述编码语句集发音的音码,同样一个音码可以用于前述编码语句集中的不同语句的拼合发音;In order to piece together the phonetic sign indicating number of pronunciation of aforementioned coded sentence set, the same phonetic code can be used for the combined pronunciation of the different sentences in aforementioned coded sentence set;既可以由元辅音,元音辅音组合和单词中的任一种音码来拼合前述解码语句集发音,也可以由元辅音,元音辅音组合,词根,单词和单词组合中的多种音码来拼合前述解码语句集发音,也可以由范句音码与元辅音,元音辅音组合和单词中的任一种音码来拼合前述解码语句集发音,也可以由范句音码与元辅音,元音辅音组合,词根,单词和单词组合中的多种音码来拼合前述解码语句集发音,也可以由词根和单词组合中的任一种音码来拼合前述解码语句集发音,也可以由词根和单词组合中的一种音码与范句音码来拼合前述解码语句集发音,也可以由字符组合和字母中的一种或多种音码来拼合前述解码语句集发音,也可以由元辅音,元音辅音组合,词根,单词,单词组合和范句中的一种或多种音码与字符组合和字母中的一种或多种音码来拼合前述解码语句集发音,也可以由元辅音,元音辅音组合,词根,单词,单词组合,范句,字符组合和字母中的一种或多种音码与特殊音码来拼合前述解码语句集发音;Both can be by vowel and consonant, vowel and consonant combination and any kind of phonetic sign indicating number in word to piece together the pronunciation of aforementioned decoded statement set, also can be by vowel and consonant, vowel and consonant combination, word root, multiple sound sign indicating numbers in word and word combination To put together the aforementioned decoding sentence set pronunciation, also can be by model sentence sound sign indicating number and vowel consonant, any kind of sound sign indicating number in the combination of vowel and consonant and word put together the aforementioned decoding sentence set pronunciation, also can be by model sentence sound sign indicating number and vowel consonant , the combination of vowels and consonants, root, word and multiple phonetic codes in the combination of words to splice the pronunciation of the aforementioned decoding sentence set, and also can splicing the pronunciation of the aforementioned decoding sentence set by any sound code in the root and the combination of words, or can By a kind of phonetic sign indicating number and model sentence phonetic sign indicating number in root and word combination, put together the aforementioned decoding sentence set pronunciation, also can put together the aforementioned decoding sentence set pronunciation by one or more sound codes in character combination and letter, also can By vowel and consonant, vowel and consonant combination, root, word, word combination and one or more sound codes in the model sentence and one or more sound codes in the character combination and letter to piece together the aforementioned decoding sentence set pronunciation, also One or more phonetic codes and special phonetic codes in vowels and consonants, vowel and consonant combinations, roots, words, word combinations, model sentences, character combinations and letters can be used to assemble the pronunciation of the aforementioned decoding statement set;用以拼合前述解码语句集发音的音码,同样一个音码可以用于前述解码语句集中的不同语句的拼合发音;In order to piece together the phonetic sign indicating number of pronunciation of aforementioned decoded sentence set, same sound code can be used for the combined pronunciation of the different sentences of aforementioned decoded sentence set;可按照前述这种解码规则中有关所述语种2的解码细则来对表示前述解码语句集中语句发音的拼合信息的值进行解码的解码程序及解码数据存储在所述存储器集里;According to the decoding rules about the language 2 in the above-mentioned decoding rules, the decoding program and the decoding data for decoding the value of the combined information representing the pronunciation of the sentences in the decoding sentence set are stored in the memory set;所述发音器件是TTS器件。The pronunciation device is a TTS device.16.根据权利要求12所述的拾音器主体,其特征在于:16. The pickup body of claim 12, wherein:所述语种1与所述语种2是相同语种;The language 1 and the language 2 are the same language;所述语句1是属于所述语种1的一个语句,所述语种1是属于非印欧语系的一个语种;是用所述编码规则中有关所述语种1的编码细则来编码生成所述拼合信息的值1的,且这种编码规则中有关所述语种1的编码细则是含有压缩编码处理的;The sentence 1 is a sentence belonging to the language 1, and the language 1 is a language belonging to the non-Indo-European language family; the coding rules about the language 1 in the coding rules are used to encode and generate the combined information The value of 1, and the encoding details of the language 1 in this encoding rule include compression encoding processing;前述这种编码规则中有关所述语种1的编码细则有定义编码涉及的编码语句集,所述语句1属于此编码语句集;The coding rules about the language 1 in the aforementioned coding rules define the coding sentence set involved in the coding, and the sentence 1 belongs to the coding sentence set;所述语种2是属于非印欧语系的一个语种;所述单片机芯片是按所述解码规则中有关所述语种2的解码细则来对所述拼合信息的值1执行解码使所述拾音器的所述发音器件播出语句发音的,这种解码规则中有关所述语种2的解码细则是含有压缩解码处理的,这种解码规则中有关所述语种2的解码细则是TTS方式的;The language type 2 is a language type belonging to the non-Indo-European language family; the single-chip microcomputer chip performs decoding on the value 1 of the splicing information according to the decoding rules about the language type 2 in the decoding rules so that all of the pickups Said pronunciation device broadcasts sentence pronunciation, and the decoding detail about said language 2 in this decoding rule is to contain compression decoding processing, and the decoding detail about said language 2 in this decoding rule is TTS mode;前述这种解码规则中有关所述语种2的解码细则有定义解码涉及的解码语句集,所述语句1属于此解码语句集;The decoding rules about the language 2 in the aforementioned decoding rules define the set of decoding sentences involved in decoding, and the sentence 1 belongs to the set of decoding sentences;前述这种解码规则中有关所述语种2的解码细则与前述这种编码规则中有关所述语种1的编码细则是专门对应的或是一般对应的;The decoding rules about the language 2 in the aforementioned decoding rules correspond specifically or generally to the encoding rules about the language 1 in the aforementioned encoding rules;前述这种编码规则中有关所述语种1的编码细则可用于对前述编码语句集中任一语句进行编码以生成表示如何用音码拼合成此任一语句发音的拼合信息的值;所述单片机芯片可按前述这种解码规则中有关所述语种2的解码细则来对表示如何用音码拼合成前述解码语句集中任一语句发音的拼合信息的值进行解码使所述拾音器的所述发音器件播出语句发音;The coding rules about the language 1 in the aforementioned coding rules can be used to encode any sentence in the aforementioned coding sentence set to generate the value of how to use phonetic codes to splicing into this arbitrary sentence pronunciation; the single-chip microcomputer chip Can be decoded according to the decoding rules about the language 2 in the aforementioned decoding rules to represent how to splicing into the value of the splicing information of any sentence pronunciation in the aforementioned decoding sentence collection with the phonetic sign indicating number Pronunciation of sentences;对于前述编码语句集中任一语句,可用多个音码来拼合成这任一语句的发音;可被当成用以拼合前述编码语句集发音的音码,也即前述这种编码规则中有关所述语种1的编码细则可以使用的音码,可以是元辅音或元音辅音组合,也可以是字,词,词组合或范句,也可以是偏旁部首组合或偏旁部首,也可以是特殊音码;For any sentence in the aforementioned coded sentence set, multiple phonetic codes can be used to piece together the pronunciation of this arbitrary sentence; The phonetic code that can be used in the coding rules of language 1 can be a vowel and consonant or a combination of vowels and consonants, it can also be a word, word, word combination or model sentence, it can also be a combination of radicals or radicals, or a special sound code;对于前述解码语句集中任一语句,可用多个音码来拼合成这任一语句的发音;可被当成用以拼合前述解码语句集发音的音码,也即前述这种解码规则中有关所述语种2的解码细则可以使用的音码,可以是元辅音或元音辅音组合,也可以是字,词,词组合或范句,也可以是偏旁部首组合或偏旁部首,也可以是特殊音码;For any sentence in the foregoing decoded sentence collection, multiple phonetic codes can be used to piece together the pronunciation of this arbitrary sentence; The sound code that can be used in the decoding rules of language 2 can be a vowel and consonant or a combination of vowels and consonants, it can also be a word, word, word combination or model sentence, it can also be a combination of radicals or radicals, or it can be a special sound code;并且前述这种解码规则中有关所述语种2的解码细则对音码的定义方式与前述这种编码规则中有关所述语种1的编码细则对音码的定义方式是相同或兼容的;And the definition mode of the decoding rules about the language 2 in the aforementioned decoding rules is the same or compatible with the definition of the phonetic codes in the coding rules about the language 1 in the aforementioned coding rules;既可以由元辅音,元音辅音组合,字和词中的任一种音码来拼合前述编码语句集发音,也可以由元辅音,元音辅音组合,字,词和词组合中的多种音码来拼合前述编码语句集发音,也可以由范句音码与元辅音,元音辅音组合,字和词中的任一种音码来拼合前述编码语句集发音,也可以由范句音码与元辅音,元音辅音组合,字,词和词组合中的多种音码来拼合前述编码语句集发音,也可以由词组合这一种音码来拼合前述编码语句集发音,也可以由词组合音码与范句音码来拼合前述编码语句集发音,也可以由偏旁部首组合和偏旁部首中的一种或多种音码来拼合前述编码语句集发音,也可以由元辅音,元音辅音组合,字,词,词组合和范句中的一种或多种音码与偏旁部首组合和偏旁部首中的一种或多种音码来拼合前述编码语句集发音,也可以由元辅音,元音辅音组合,字,词,词组合,范句,偏旁部首组合和偏旁部首中的一种或多种音码与特殊音码来拼合前述编码语句集发音;Both can be by vowel and consonant, the combination of vowel and consonant, any kind of phonetic sign indicating number in word and word to put together the pronunciation of aforementioned coded statement set, also can be by vowel and consonant, combination of vowel and consonant, multiple in word, word and word combination Phonetic sign indicating number is put together the aforementioned coded sentence set pronunciation, also can be by model sentence phonetic sign indicating number and vowel consonant, vowel and consonant combination, any kind of sound code in word and word is put together aforementioned coded sentence set pronunciation, also can be by model sentence sound Code and vowel consonant, vowel and consonant combination, word, multiple phonetic codes in word and word combination come together the aforementioned coded sentence set pronunciation, also can come together the aforementioned coded sentence set pronunciation by word combination this kind of phonetic sign indicating number, also can The pronunciation of the aforementioned coded sentence set is assembled by the word combination phonetic code and the model sentence phonetic code, and the pronunciation of the aforementioned encoded sentence set can also be assembled by one or more phonetic codes in the combination of radicals and radicals. Consonants, vowel-consonant combinations, words, words, word combinations and one or more phonetic codes in the combination of radicals and radicals and one or more phonetic codes in the radicals are used to stitch together the pronunciation of the aforementioned coded sentence set , also can be by vowel consonant, vowel and consonant combination, word, word, word combination, model sentence, one or more phonetic codes and special phonetic codes in radical combination and radicals to splicing the aforementioned coded sentence set pronunciation ;用以拼合前述编码语句集发音的音码,同样一个音码可以用于前述编码语句集中的不同语句的拼合发音;In order to piece together the phonetic sign indicating number of pronunciation of aforementioned coded sentence set, the same phonetic code can be used for the combined pronunciation of the different sentences in aforementioned coded sentence set;既可以由元辅音,元音辅音组合,字和词中的任一种音码来拼合前述解码语句集发音,也可以由元辅音,元音辅音组合,字,词和词组合中的多种音码来拼合前述解码语句集发音,也可以由范句音码与元辅音,元音辅音组合,字和词中的任一种音码来拼合前述解码语句集发音,也可以由范句音码与元辅音,元音辅音组合,字,词和词组合中的多种音码来拼合前述解码语句集发音,也可以由词组合这一种音码来拼合前述解码语句集发音,也可以由词组合音码与范句音码来拼合前述解码语句集发音,也可以由偏旁部首组合和偏旁部首中的一种或多种音码来拼合前述解码语句集发音,也可以由元辅音,元音辅音组合,字,词,词组合和范句中的一种或多种音码与偏旁部首组合和偏旁部首中的一种或多种音码来拼合前述解码语句集发音,也可以由元辅音,元音辅音组合,字,词,词组合,范句,偏旁部首组合和偏旁部首中的一种或多种音码与特殊音码来拼合前述解码语句集发音;Both can be by vowel and consonant, vowel and consonant combination, any kind of phonetic sign indicating number in word and word to piece together the pronunciation of aforementioned decoded statement set, also can be by vowel and consonant, vowel and consonant combination, multiple in word, word and word combination Sound sign indicating number is put together the aforementioned decoding statement set pronunciation, also can be by model sentence sound sign indicating number and vowel consonant, vowel and consonant combination, any kind of sound sign indicating number in word and word is put together aforementioned decoding sentence set pronunciation, also can be by model sentence sound code and vowel-consonant, vowel-consonant combination, word, multiple phonetic codes in word and word combination come together the aforementioned decoding statement set pronunciation, also can put together the aforementioned decoding statement set pronunciation by word combination this kind of phonetic sign indicating number, also can The pronunciation of the aforementioned decoded sentence set is assembled by the word combination phonetic code and the model sentence phonetic code, and the pronunciation of the aforementioned decoded sentence set can also be assembled by one or more phonetic codes in the combination of radicals and radicals. Consonant, vowel-consonant combination, word, word, word combination and one or more phonetic codes in the model sentence and one or more phonetic codes in the radical combination and the radical radical are used to stitch together the pronunciation of the aforementioned decoding sentence set , also can be by vowel consonant, vowel and consonant combination, word, word, word combination, model sentence, one or more phonetic codes and special phonetic codes in radical combination and radicals to put together the aforementioned decoding statement set pronunciation ;用以拼合前述解码语句集发音的音码,同样一个音码可以用于前述解码语句集中的不同语句的拼合发音;In order to piece together the phonetic sign indicating number of pronunciation of aforementioned decoded sentence set, same sound code can be used for the combined pronunciation of the different sentences of aforementioned decoded sentence set;可按照前述这种解码规则中有关所述语种2的解码细则来对表示前述解码语句集中语句发音的拼合信息的值进行解码的解码程序及解码数据存储在所述存储器集里;According to the decoding rules about the language 2 in the above-mentioned decoding rules, the decoding program and the decoding data for decoding the value of the combined information representing the pronunciation of the sentences in the decoding sentence set are stored in the memory set;所述发音器件是TTS器件。The pronunciation device is a TTS device.17.根据权利要求12所述的拾音器主体,其特征在于:17. The pickup body of claim 12, wherein:所述语句1是属于所述语种1的一个语句,所述语种1是属于印欧语系的一个语种或是属于非印欧语系的一个语种或是一个复合语种;是用所述编码规则中有关所述语种1的编码细则来编码生成所述拼合信息的值1的,且这种编码规则中有关所述语种1的编码细则是含有压缩编码处理的,这种编码规则中有关所述语种1的编码细则是直播方式或TTS方式或混合方式的;The sentence 1 is a sentence belonging to the language 1, and the language 1 is a language belonging to the Indo-European language family or a language belonging to a non-Indo-European language family or a compound language; The encoding rules of the language 1 are used to encode and generate the value 1 of the combined information, and the encoding rules related to the language 1 in this encoding rule include compression encoding processing, and the encoding rules related to the language 1 The encoding details are live broadcast or TTS or mixed;所述语句1是属于所述语种2的一个语句,所述语种2是属于印欧语系的一个语种或是属于非印欧语系的一个语种或是一个复合语种;所述单片机芯片是按所述解码规则中有关所述语种2的解码细则来对所述拼合信息的值1执行解码使所述拾音器的所述发音器件播出所述语句1发音的,这种解码规则中有关所述语种2的解码细则是含有压缩解码处理的,这种解码规则中有关所述语种2的解码细则是直播方式或TTS方式或混合方式的;The statement 1 is a statement belonging to the language 2, and the language 2 is a language belonging to the Indo-European language family or a language belonging to a non-Indo-European language family or a compound language; The decoding rules related to the language 2 in the decoding rules are used to perform decoding on the value 1 of the combined information to make the pronunciation device of the pickup play the pronunciation of the sentence 1. In this decoding rule, the related language 2 The decoding details include compression decoding processing, and the decoding details related to the language 2 in this decoding rule are live broadcast or TTS or mixed;可按照前述这种解码规则中有关所述语种2的解码细则来对表示前述解码语句集中语句发音的拼合信息的值进行解码的解码程序及解码数据存储在所述存储器集里;According to the decoding rules about the language 2 in the above-mentioned decoding rules, the decoding program and the decoding data for decoding the value of the combined information representing the pronunciation of the sentences in the decoding sentence set are stored in the memory set;所述语种1与所述语种2是相同语种或是不同语种;The language 1 and the language 2 are the same language or different languages;前述这种编码规则中有关所述语种1的编码细则有定义编码涉及的编码语句集,所述语句1属于此编码语句集;The coding rules about the language 1 in the aforementioned coding rules define the coding sentence set involved in the coding, and the sentence 1 belongs to the coding sentence set;前述这种解码规则中有关所述语种2的解码细则有定义解码涉及的解码语句集,所述语句1属于此解码语句集;The decoding rules about the language 2 in the aforementioned decoding rules define the set of decoding sentences involved in decoding, and the sentence 1 belongs to the set of decoding sentences;前述这种编码规则中有关所述语种1的编码细则可用于对前述编码语句集中任一语句进行编码以生成表示如何用音码拼合成此任一语句发音的拼合信息的值;所述单片机芯片可按前述这种解码规则中有关所述语种2的解码细则来对表示如何用音码拼合成前述解码语句集中任一语句发音的拼合信息的值进行解码使所述拾音器的所述发音器件播出语句发音;The coding rules about the language 1 in the aforementioned coding rules can be used to encode any sentence in the aforementioned coding sentence set to generate the value of how to use phonetic codes to splicing into this arbitrary sentence pronunciation; the single-chip microcomputer chip Can be decoded according to the decoding rules about the language 2 in the aforementioned decoding rules to represent how to splicing into the value of the splicing information of any sentence pronunciation in the aforementioned decoding sentence collection with the phonetic sign indicating number Pronunciation of sentences;对于前述编码语句集中任一语句,可用多个音码来拼合成这任一语句的发音;For any sentence in the aforementioned coded sentence collection, a plurality of phonetic codes can be used to piece together the pronunciation of this arbitrary sentence;对于前述解码语句集中任一语句,可用多个音码来拼合成这任一语句的发音;For any sentence in the aforementioned decoding sentence collection, a plurality of phonetic codes can be used to piece together the pronunciation of this arbitrary sentence;用以拼合前述编码语句集发音的音码,同样一个音码可以用于前述编码语句集中的不同语句的拼合发音;In order to piece together the phonetic sign indicating number of pronunciation of aforementioned coded sentence set, the same phonetic code can be used for the combined pronunciation of the different sentences in aforementioned coded sentence set;用以拼合前述解码语句集发音的音码,同样一个音码可以用于前述解码语句集中的不同语句的拼合发音。The phonetic codes used to assemble the pronunciations of the aforementioned decoded sentence sets can also be used for the combined pronunciation of different sentences in the aforementioned decoded sentence sets.18.根据权利要求10,11,12,13,14,15,16或17所述的拾音器主体,其特征在于:18. A pickup body according to claim 10, 11, 12, 13, 14, 15, 16 or 17, characterized in that:所述播出语句发音是播出语句的全部发音或播出语句的非全部发音;The pronunciation of the broadcast sentence is the entire pronunciation of the broadcast sentence or the non-full pronunciation of the broadcast sentence;所述播出所述语句1发音是播出语句1的全部发音或播出语句1的非全部发音;The broadcasting of the pronunciation of the sentence 1 is to broadcast all the pronunciations of the sentence 1 or to broadcast not all the pronunciations of the sentence 1;所述播出前述语句1发音是播出语句1的全部发音或播出语句1的非全部发音;The pronunciation of the aforementioned sentence 1 is to broadcast all the pronunciations of the sentence 1 or to broadcast the non-full pronunciation of the sentence 1;所述读物可以是书,书籍,印有图文的卡片,报纸杂志或地图;The reading materials can be books, books, cards with pictures and texts, newspapers, magazines or maps;所述语音编码有声读物,有一个或多个页面而且只在一个页面上有码字,或者,有多个页面而且在多个页面上都有码字;所述语音编码有声读物的任一页面上或没有码字,或有一个码字,或有多个处在任意不同位置的码字;The speech-encoded audiobook has one or more pages and codewords on only one page, or has multiple pages and codewords on multiple pages; any page of the speech-encoded audiobook or there is no codeword, or there is a codeword, or there are multiple codewords in any different positions;所述诸个码字中的某一码字是处在所述读物的整页面上需播此码字对应播的数个语句发音的位置;A certain code word in described all code words is the position that needs to broadcast several sentence pronunciations that this code word corresponds to broadcast on the whole page of described reading material;所述读码头是可用于读取所述语音编码有声读物的整页面上的导电码的机电式导电码读码头,或者所述读码头是可用于读取所述语音编码有声读物的整页面上的条形码或导电码的光电式读码头,或者所述读码头是可用于读取所述语音编码有声读物的整页面上的图形码的光电式图形码读码头,或者所述读码头是可用于读取所述语音编码有声读物的整页面上的磁码的磁码读码头,或者所述读码头是可用于读取所述语音编码有声读物的整页面上的二维条形码或一维条形码或导电码的照相式读码头;The reading dock is an electromechanical conductive code reading dock operable to read a conductive code on an entire page of the speech-encoded audiobook, or the reading dock is operable to read an entire page of the speech-encoded audiobook. bar code or conductive code, or the reader is an optoelectronic graphic code reader that can be used to read a graphic code on a full page of the voice-coded audiobook, or the reader is a reader that can be used to a magnetic code reading header that reads a magnetic code on an entire page of said speech-encoded audiobook, or said reading header is operable to read a two-dimensional barcode or a one-dimensional barcode on an entire page of said speech-encoded audiobook or Photo-type reading terminal of conductive code;对于所述诸个码字中的某一码字,此码字的载体是所述读物的一个页,或者,此码字的载体是一小块表面不导电的平面材料,此平面材料是粘贴在所述读物的一个页面上的;For a certain code word in the said code words, the carrier of the code word is a page of the reading material, or the carrier of the code word is a small piece of surface non-conductive planar material, and this planar material is pasted on a page of the reading;可把所述拾音器主体做成大机体带读码头形式的拾音器主体且机体的外型适于让语音编码有声读物放在机体里,使用时,将所述语音编码有声读物放入机体里使用,或者,可把所述拾音器主体做成大机体带读码头形式的拾音器主体且机体的外型适于让语音编码有声读物放在机体上,使用时,将所述语音编码有声读物放在机体上使用,或者,可把所述拾音器主体做成手持式的拾音器主体,或者,可把所述拾音器主体做成小壳体带读码头形式的拾音器主体且小壳体可放在桌面上;所述拾音器主体是没与其它电子类产品集成的,或者,所述拾音器主体是与手机集成在一起的或所述拾音器主体是与MP3播放机集成在一起的或者所述拾音器主体是与复读机集成在一起的。The main body of the pickup can be made into a pickup main body in the form of a large body with a reading dock, and the shape of the body is suitable for placing the speech-coded audiobook in the body. When in use, the speech-coded audiobook is put into the body for use, Or, the main body of the pickup can be made into a pickup main body with a reading dock in a large body and the shape of the body is suitable for allowing the speech coded audiobook to be placed on the body. When in use, the speech coded audiobook is placed on the body Use, or, described pickup main body can be made the pickup main body of hand-held, perhaps, described pickup main body can be made the pickup main body of small casing band reading dock form and small casing can be placed on the desktop; The main body of the pickup is not integrated with other electronic products, or the main body of the pickup is integrated with the mobile phone or the main body of the pickup is integrated with the MP3 player or the main body of the pickup is integrated with the repeater together.19.一种可与权利要求1所述的语音编码有声读物配合使用播出语句发音的拾音器主体,此拾音器主体在被使用时是与前述语音编码有声读物和数个数据卡配合使用的,前述拾音器主体有同样数个卡插槽,前述数个数据卡中的每个数据卡上有存储器,前述拾音器主体包括有一个读码头,一个发音器件,一个单片机芯片,在读码使用时,需将前述数个数据卡插入前述拾音器主体的前述数个卡插槽里,使前述拾音器主体与前述数个数据卡合为一个拾音器,在前述拾音器里,前述单片机芯片与前述读码头意向连接,在前述拾音器里,前述单片机芯片与前述发音器件意向连接,前述单片机芯片是前述拾音器的主处理芯片,前述拾音器在使用时,需用到存储器集,此存储器集用于存放程序和数据,此存储器集里的各存储器可被前述拾音器的前述单片机芯片读取使用,前述语音编码有声读物的主体是读物,在前述读物的整页面上有诸个码字,此诸个码字属于前述语音编码有声读物,前述拾音器在被使用时,前述拾音器的前述单片机芯片可以利用前述拾音器的前述读码头来读取前述诸个码字中的部诸码字中的每个码字,其特征在于:19. A pick-up body that can be used in conjunction with the voice-coded audiobook as claimed in claim 1 to broadcast the pronunciation of a sentence. This pickup body is used in conjunction with the aforementioned voice-coded audiobook and several data cards when used. The main body of the pickup has the same number of card slots, and there is a memory on each of the aforementioned several data cards. A plurality of data cards are inserted into the aforementioned card slots of the aforementioned pickup body, so that the aforementioned pickup body and the aforementioned plurality of data cards are engaged into a pickup. In, the aforementioned single-chip microcomputer chip is intentionally connected with the aforementioned pronunciation device, and the aforementioned single-chip microcomputer chip is the main processing chip of the aforementioned pickup. When the aforementioned pickup is in use, it needs to use a memory set. Each memory can be read and used by the aforementioned single-chip microcomputer chip of the aforementioned pickup, and the main body of the aforementioned voice-coded audiobook is the reading material, and there are various codewords on the entire page of the aforementioned reading material. When the pickup is used, the aforementioned single-chip microcomputer chip of the aforementioned pickup can utilize the aforementioned reading dock of the aforementioned pickup to read each codeword in the portion of the codewords in the aforementioned codewords, and it is characterized in that:前述诸个码字中的每个码字的码值是共与数个表示如何用音码拼合成语句发音的拼合信息的值有关的;前述拾音器在被使用时,由前述拾音器的前述单片机芯片根据读得的前述部诸码字中的码字的码值使前述拾音器的前述发音器件播出语句发音。The code value of each code word in the aforementioned code words is related to the value of the information of how to piece together the pronunciation of a sentence with several representations in total; According to the code value of the code word in the code words of the aforementioned parts read, the aforementioned pronunciation device of the aforementioned sound pick-up is played out the sentence pronunciation.20.根据权利要求19所述的拾音器主体,其特征在于:20. The pickup body of claim 19, wherein:在所述拾音器里,所述单片机芯片与所述读码头直接连接或间接连接,所述单片机芯片与所述发音器件直接连接或间接连接;In the pickup, the single-chip microcomputer chip is directly or indirectly connected to the reading dock, and the single-chip microcomputer chip is directly or indirectly connected to the pronunciation device;所述数个数据卡是一个数据卡且所述数个卡插槽是一个卡插槽,或者,所述数个数据卡是多个数据卡且所述数个卡插槽是同样多个卡插槽;The several data cards are one data card and the several card slots are one card slot, or the several data cards are multiple data cards and the several card slots are the same multiple cards slot;所述存储器集可以是所述单片机芯片内的数个存储器加上所述数个数据卡上的存储器,或可以是所述单片机芯片外所述拾音器主体内的数个存储器加上所述数个数据卡上的存储器,或可以是所述单片机芯片内的数个存储器加上所述单片机芯片外所述拾音器主体内的数个存储器加上所述数个数据卡上的存储器,或可以是所述数个数据卡上的存储器;Described memory set can be that several memories in described single-chip microcomputer chip add the memory on described several data cards, or can be that several memories in described pickup main body outside described single-chip microcomputer chip add described several The memory on the data card, or can be several memories in the single-chip microcomputer chip plus the several memories in the pickup main body outside the described single-chip microcomputer chip plus the memory on the several data cards, or can be all memory on said plurality of data cards;所述拾音器涉及的程序及所述拾音器涉及的数据存储在所述存储器集里;所述数个数据卡上的每个数据卡的存储器里至少存储有部分所述拾音器涉及的程序或部分所述拾音器涉及的数据;The program related to the pickup and the data related to the pickup are stored in the memory set; the memory of each data card on the plurality of data cards stores at least part of the program related to the pickup or part of the The data involved in the pickup;对于所述部诸码字中的一个码字涉及的表示如何用音码拼合成语句1发音的拼合信息的值,此表示如何用音码拼合成此语句1发音的拼合信息的值是按照一种编码规则中有关语种1的编码细则来编码生成的,此编码规则中有关语种1的编码细则是含有压缩编码处理的,此表示如何用音码拼合成此语句1发音的拼合信息的值也是拼合信息的值1,此编码规则是只适用于语种1或适用于包括语种1在内的多个语种的一种编码规则,只需从前述一个码字即可得到前述拼合信息的值1或需从包括前述一个码字在内的所述部诸码字中的多个码字才能得到前述拼合信息的值1,所述拾音器的所述单片机芯片利用所述读码头可读取前述一个码字或前述多个码字而得到前述拼合信息的值1,所述拾音器的所述单片机芯片是按一种解码规则中有关语种2的解码细则来对读码得到的前述拼合信息的值1执行解码使所述拾音器的所述发音器件播出前述语句1发音的,而且此解码规则中有关语种2的解码细则是含有压缩解码处理的,此解码规则是只适用于语种2或适用于包括语种2在内的多个语种的一种解码规则;For the value of the combination information of how a code word in the code words of the said part relates to how to piece together the pronunciation of the sentence 1 with the phonetic sign indicating number, this expression is according to a The coding rules related to language 1 in this coding rule are coded and generated. The coding rules related to language 1 in this coding rule contain compression coding processing. The value 1 of the spliced information, this encoding rule is only applicable to language 1 or a coding rule applicable to multiple languages including language 1, and the value 1 or It is necessary to obtain the value 1 of the aforementioned combination information from a plurality of codewords in the codewords including the aforementioned codeword, and the single-chip microcomputer chip of the pickup can read the aforementioned codeword by using the reading dock. Word or the aforementioned multiple code words to obtain the value 1 of the aforementioned combination information, the single-chip microcomputer chip of the pickup is to perform the value 1 of the aforementioned combination information obtained by reading the code according to the decoding rules of the relevant language 2 in a decoding rule Decoding enables the pronunciation device of the pickup to broadcast the pronunciation of the aforementioned sentence 1, and the decoding details of the relevant language 2 in the decoding rule contain compression and decoding processing, and this decoding rule is only applicable to the language 2 or is suitable for including the language A decoding rule for multiple languages including 2;对于前述拼合信息的值1涉及的前述语句1,可以是一个完整句子,或者,是一个完整句子的部分,或者,是含有多个完整句子,或者,是含有数个完整句子和数个非完整句子,或者,是含有多个非完整句子;For the aforementioned sentence 1 involved in the value 1 of the aforementioned combined information, it can be a complete sentence, or a part of a complete sentence, or contain multiple complete sentences, or contain several complete sentences and several incomplete sentences. Sentence, or, contains multiple incomplete sentences;所述部诸码字是所述诸个码字中的全部码字或部分码字;所述诸个码字可以是一个或多个码字;所述部诸码字可以是一个或多个码字。All codewords of described part are all codewords or part codewords in described all codewords; Described all codewords can be one or more codewords; All codewords of described part can be one or more codewords Codeword.21.根据权利要求20所述的拾音器主体,其特征在于:21. The pickup body of claim 20, wherein:所述语句1是属于所述语种1的一个语句,所述语种1是属于印欧语系的一个语种或是属于非印欧语系的一个语种或是一个复合语种;是用所述编码规则中有关所述语种1的编码细则来编码生成所述拼合信息的值1的,且这种编码规则中有关所述语种1的编码细则是含有压缩编码处理的,这种编码规则中有关所述语种1的编码细则是直播方式或TTS方式或混合方式的;The sentence 1 is a sentence belonging to the language 1, and the language 1 is a language belonging to the Indo-European language family or a language belonging to a non-Indo-European language family or a compound language; The encoding rules of the language 1 are used to encode and generate the value 1 of the combined information, and the encoding rules related to the language 1 in this encoding rule include compression encoding processing, and the encoding rules related to the language 1 The encoding details are live broadcast or TTS or mixed;所述语句1是属于所述语种2的一个语句,所述语种2是属于印欧语系的一个语种或是属于非印欧语系的一个语种或是一个复合语种;所述单片机芯片是按所述解码规则中有关所述语种2的解码细则来对所述拼合信息的值1执行解码使所述拾音器的所述发音器件播出所述语句1发音的,这种解码规则中有关所述语种2的解码细则是含有压缩解码处理的,这种解码规则中有关所述语种2的解码细则是直播方式或TTS方式或混合方式的;The statement 1 is a statement belonging to the language 2, and the language 2 is a language belonging to the Indo-European language family or a language belonging to a non-Indo-European language family or a compound language; The decoding rules related to the language 2 in the decoding rules are used to perform decoding on the value 1 of the combined information to make the pronunciation device of the pickup play the pronunciation of the sentence 1. In this decoding rule, the related language 2 The decoding details include compression decoding processing, and the decoding details related to the language 2 in this decoding rule are live broadcast or TTS or mixed;可按照前述这种解码规则中有关所述语种2的解码细则来对表示前述解码语句集中语句发音的拼合信息的值进行解码的解码程序及解码数据存储在所述存储器集里;According to the decoding rules about the language 2 in the above-mentioned decoding rules, the decoding program and the decoding data for decoding the value of the combined information representing the pronunciation of the sentences in the decoding sentence set are stored in the memory set;所述语种1与所述语种2是相同语种或是不同语种;The language 1 and the language 2 are the same language or different languages;前述这种编码规则中有关所述语种1的编码细则有定义编码涉及的编码语句集,所述语句1属于此编码语句集;The coding rules about the language 1 in the aforementioned coding rules define the coding sentence set involved in the coding, and the sentence 1 belongs to the coding sentence set;前述这种解码规则中有关所述语种2的解码细则有定义解码涉及的解码语句集,所述语句1属于此解码语句集;The decoding rules about the language 2 in the aforementioned decoding rules define the set of decoding sentences involved in decoding, and the sentence 1 belongs to the set of decoding sentences;前述这种编码规则中有关所述语种1的编码细则可用于对前述编码语句集中任一语句进行编码以生成表示如何用音码拼合成此任一语句发音的拼合信息的值;所述单片机芯片可按前述这种解码规则中有关所述语种2的解码细则来对表示如何用音码拼合成前述解码语句集中任一语句发音的拼合信息的值进行解码使所述拾音器的所述发音器件播出语句发音;The coding rules about the language 1 in the aforementioned coding rules can be used to encode any sentence in the aforementioned coding sentence set to generate the value of how to use phonetic codes to splicing into this arbitrary sentence pronunciation; the single-chip microcomputer chip Can be decoded according to the decoding rules about the language 2 in the aforementioned decoding rules to represent how to splicing into the value of the splicing information of any sentence pronunciation in the aforementioned decoding sentence collection with the phonetic sign indicating number Pronunciation of sentences;对于前述编码语句集中任一语句,可用多个音码来拼合成这任一语句的发音;For any sentence in the aforementioned coded sentence collection, a plurality of phonetic codes can be used to piece together the pronunciation of this arbitrary sentence;对于前述解码语句集中任一语句,可用多个音码来拼合成这任一语句的发音;For any sentence in the aforementioned decoding sentence collection, a plurality of phonetic codes can be used to piece together the pronunciation of this arbitrary sentence;用以拼合前述编码语句集发音的音码,同样一个音码可以用于前述编码语句集中的不同语句的拼合发音;In order to piece together the phonetic sign indicating number of pronunciation of aforementioned coded sentence set, the same phonetic code can be used for the combined pronunciation of the different sentences in aforementioned coded sentence set;用以拼合前述解码语句集发音的音码,同样一个音码可以用于前述解码语句集中的不同语句的拼合发音;In order to piece together the phonetic sign indicating number of pronunciation of aforementioned decoded sentence set, same sound code can be used for the combined pronunciation of the different sentences of aforementioned decoded sentence set;所述数个数据卡是一个数据卡且所述数个卡插槽是一个卡插槽且此数据卡是与所述语音编码有声读物配套的或此数据卡是与所述拾音器主体配套的或此数据卡是与所述语音编码有声读物和所述拾音器主体配套的或此数据卡既不与所述语音编码有声读物配套也不与所述拾音器主体配套,或者,所述数个数据卡是多个数据卡且所述数个卡插槽是同样多个卡插槽且这数个数据卡全部是与所述语音编码有声读物配套的或这数个数据卡全部是与所述拾音器主体配套的或这数个数据卡中的部分数据卡是与所述语音编码有声读物配套的而另有部分数据卡是与所述拾音器主体配套的或这数个数据卡全部是与所述语音编码有声读物和所述拾音器主体配套的或这数个数据卡中的部分数据卡是与所述语音编码有声读物和所述拾音器主体配套的或这数个数据卡既不与所述语音编码有声读物配套也不与所述拾音器主体配套;The plurality of data cards is a data card and the plurality of card slots is a card slot and the data card is associated with the speech coded audiobook or the data card is associated with the pickup body or The data card is associated with the speech-encoded audiobook and the pickup body or the data card is neither associated with the speech-encoded audiobook nor with the pickup body, or the plurality of data cards are A plurality of data cards and the plurality of card slots are the same plurality of card slots and the plurality of data cards are all matched with the speech coded audiobook or the plurality of data cards are all matched with the pickup body or some of these data cards are matched with the speech coded audiobook, while another part of the data cards are matched with the main body of the pickup, or all of these data cards are matched with the speech coded audiobook The reading material is matched with the main body of the pickup or some of the data cards in the plurality of data cards are matched with the audio-coded audiobook and the main body of the pickup, or neither of the data cards is matched with the audio-coded audiobook Nor is it matched with the main body of the pickup;所述播出语句发音是播出语句的全部发音或播出语句的非全部发音;The pronunciation of the broadcast sentence is the entire pronunciation of the broadcast sentence or the non-full pronunciation of the broadcast sentence;所述播出所述语句1发音是播出语句1的全部发音或播出语句1的非全部发音;The broadcasting of the pronunciation of the sentence 1 is to broadcast all the pronunciations of the sentence 1 or to broadcast not all the pronunciations of the sentence 1;所述播出前述语句1发音是播出语句1的全部发音或播出语句1的非全部发音;The pronunciation of the aforementioned sentence 1 is to broadcast all the pronunciations of the sentence 1 or to broadcast the non-full pronunciation of the sentence 1;所述读物可以是书,书籍,印有图文的卡片,报纸杂志或地图;The reading materials can be books, books, cards with pictures and texts, newspapers, magazines or maps;所述语音编码有声读物,有一个或多个页面而且只在一个页面上有码字,或者,有多个页面而且在多个页面上都有码字;所述语音编码有声读物的任一页面上或没有码字,或有一个码字,或有多个处在任意不同位置的码字;The speech-encoded audiobook has one or more pages and codewords on only one page, or has multiple pages and codewords on multiple pages; any page of the speech-encoded audiobook or there is no codeword, or there is a codeword, or there are multiple codewords in any different positions;所述诸个码字中的某一码字是处在所述读物的整页面上需播此码字对应播的数个语句发音的位置;A certain code word in described all code words is the position that needs to broadcast several sentence pronunciations that this code word corresponds to broadcast on the whole page of described reading material;所述读码头是可用于读取所述语音编码有声读物的整页面上的导电码的机电式导电码读码头,或者所述读码头是可用于读取所述语音编码有声读物的整页面上的条形码或导电码的光电式读码头,或者所述读码头是可用于读取所述语音编码有声读物的整页面上的图形码的光电式图形码读码头,或者所述读码头是可用于读取所述语音编码有声读物的整页面上的磁码的磁码读码头,或者所述读码头是可用于读取所述语音编码有声读物的整页面上的二维条形码或一维条形码或导电码的照相式读码头;The reading dock is an electromechanical conductive code reading dock operable to read a conductive code on an entire page of the speech-encoded audiobook, or the reading dock is operable to read an entire page of the speech-encoded audiobook. bar code or conductive code, or the reader is an optoelectronic graphic code reader that can be used to read a graphic code on a full page of the voice-coded audiobook, or the reader is a reader that can be used to a magnetic code reading header that reads a magnetic code on an entire page of said speech-encoded audiobook, or said reading header is operable to read a two-dimensional barcode or a one-dimensional barcode on an entire page of said speech-encoded audiobook or Photo-type reading terminal of conductive code;对于所述诸个码字中的某一码字,此码字的载体是所述读物的一个页,或者,此码字的载体是一小块表面不导电的平面材料,此平面材料是粘贴在所述读物的一个页面上的;For a certain code word in the said code words, the carrier of the code word is a page of the reading material, or the carrier of the code word is a small piece of surface non-conductive planar material, and this planar material is pasted on a page of the reading;可把所述拾音器主体做成大机体带读码头形式的拾音器主体且机体的外型适于让语音编码有声读物放在机体里,机体上有卡插槽,使用时,将所述语音编码有声读物放入机体里使用,或者,可把所述拾音器主体做成大机体带读码头形式的拾音器主体且机体的外型适于让语音编码有声读物放在机体上,机体上有卡插槽,使用时,将所述语音编码有声读物放在机体上使用,或者,可把所述拾音器主体做成手持式的拾音器主体,手持式的拾音器主体上有卡插槽,或者,可把所述拾音器主体做成小壳体带读码头形式的拾音器主体且小壳体可放在桌面上,小壳体上有卡插槽;所述拾音器主体是没与其它电子类产品集成的,或者,所述拾音器主体是与手机集成在一起的或所述拾音器主体是与MP3播放机集成在一起的或者所述拾音器主体是与复读机集成在一起的。The main body of the pickup can be made into a pickup main body of a large body with a dock form, and the shape of the body is suitable for allowing voice-coded audiobooks to be placed in the body. There is a card slot on the body. When in use, the voice-coded audiobook The reading material is put into the body for use, or, the pickup body can be made into a pickup body in the form of a large body with a reading dock and the shape of the body is suitable for placing the voice-coded audiobook on the body, and there is a card slot on the body. When in use, the voice coded audiobook is placed on the body for use, or the pickup main body can be made into a handheld pickup main body, and a card slot is arranged on the handheld pickup main body, or the pickup can be The main body is made into a small housing with a pickup body in the form of a dock, and the small housing can be placed on the desktop, and there is a card slot on the small housing; the main body of the pickup is not integrated with other electronic products, or, the The main body of the pickup is integrated with the mobile phone or the main body of the pickup is integrated with the MP3 player or the main body of the pickup is integrated with the repeater.22.一种可与权利要求1所述的语音编码有声读物配合使用播出语句发音的拾音器主体,此拾音器主体在被使用时是与前述语音编码有声读物和数个部件式数据卡配合使用的,前述拾音器主体有同样数个部件式卡插槽,前述数个部件式数据卡中的每个部件式数据卡上有存储器,在读码使用时,需将前述数个部件式数据卡插入前述拾音器主体的前述数个部件式卡插槽里,使前述拾音器主体与前述数个部件式数据卡合为一个拾音器,此拾音器包括有一个读码头,一个发音器件,一个单片机芯片,在此拾音器里,前述单片机芯片与前述读码头意向连接,在此拾音器里,前述单片机芯片与前述发音器件意向连接,前述单片机芯片是此拾音器的主处理芯片,前述拾音器在使用时,需用到存储器集,此存储器集用于存放程序和数据,此存储器集里的各存储器可被前述拾音器的前述单片机芯片读取使用,前述语音编码有声读物的主体是读物,在前述读物的整页面上有诸个码字,此诸个码字属于前述语音编码有声读物,前述拾音器在被使用时,前述拾音器的前述单片机芯片可以利用前述拾音器的前述读码头来读取前述诸个码字中的部诸码字中的每个码字,其特征在于:22. A pick-up body that can be used in conjunction with the speech-coded audiobook as claimed in claim 1 to play out the pronunciation of a sentence, and this pickup body is used in conjunction with the aforementioned speech-coded audiobook and several component data cards when used , the aforementioned pickup body has the same number of component-type card slots, and each component-type data card in the aforementioned plurality of component-type data cards has a memory. In the aforementioned several component-type card slots of the main body, the aforementioned pickup body and the aforementioned plurality of component-type data cards are combined into a pickup. The aforementioned single-chip microcomputer chip is connected with the aforementioned reading dock intentionally. In this pickup, the aforementioned single-chip microcomputer chip is connected with the aforementioned pronunciation device intentionally. The aforementioned single-chip microcomputer chip is the main processing chip of this pickup. The memory set is used to store programs and data. Each memory in this memory set can be read and used by the aforementioned single-chip microcomputer chip of the aforementioned sound pickup. The main body of the aforementioned voice-coded audiobook is the reading material, and there are various codewords on the entire page of the aforementioned reading material. These codewords belong to the aforementioned voice coded audiobooks, and when the aforementioned pickup is used, the aforementioned single-chip microcomputer chip of the aforementioned pickup can utilize the aforementioned reading dock of the aforementioned pickup to read each of the portions of the aforementioned codewords. A codeword is characterized in that:前述诸个码字中的每个码字的码值是共与数个表示如何用音码拼合成语句发音的拼合信息的值有关的;前述拾音器在被使用时,由前述拾音器的前述单片机芯片根据读得的前述部诸码字中的码字的码值使前述拾音器的前述发音器件播出语句发音。The code value of each code word in the aforementioned code words is related to the value of the information of how to piece together the pronunciation of a sentence with several representations in total; According to the code value of the code word in the code words of the aforementioned parts read, the aforementioned pronunciation device of the aforementioned sound pick-up is played out the sentence pronunciation.23.根据权利要求22所述的拾音器主体,其特征在于:23. The pickup body of claim 22, wherein:所述读码头和所述单片机芯片和所述发音器件布置在所述拾音器主体内和所述数个部件式数据卡上,所述读码头和所述单片机芯片和所述发音器件中至少有一个是处于所述数个部件式数据卡上的;The reading terminal, the single-chip microcomputer chip and the sounding device are arranged in the pickup body and on the plurality of component data cards, and there is at least one of the reading terminal, the single-chip microcomputer chip and the sounding device are located on the plurality of component data cards;在所述拾音器里,所述单片机芯片与所述读码头直接连接或间接连接,所述单片机芯片与所述发音器件直接连接或间接连接;In the pickup, the single-chip microcomputer chip is directly or indirectly connected to the reading dock, and the single-chip microcomputer chip is directly or indirectly connected to the pronunciation device;所述数个部件式数据卡是一个部件式数据卡且所述数个部件式卡插槽是一个部件式卡插槽,或者,所述数个部件式数据卡是多个部件式数据卡且所述数个部件式卡插槽是同样多个部件式卡插槽;The plurality of component data cards is a component data card and the plurality of component card slots is a component card slot, or, the plurality of component data cards are a plurality of component data cards and The plurality of component card slots is the same plurality of component card slots;所述单片机芯片位于拾音器主体内,所述存储器集可以是所述单片机芯片内的数个存储器加上所述数个部件式数据卡上的存储器或可以是所述单片机芯片外所述拾音器主体内的数个存储器加上所述数个部件式数据卡上的存储器或可以是所述单片机芯片内的数个存储器加上所述单片机芯片外所述拾音器主体内的数个存储器加上所述数个部件式数据卡上的存储器或可以是所述数个部件式数据卡上的存储器,或者,所述单片机芯片位于一部件式数据卡上,所述存储器集可以是所述单片机芯片内的数个存储器或可以是所述单片机芯片外所述数个部件式数据卡上的存储器或可以是所述单片机芯片内的数个存储器加上所述单片机芯片外所述数个部件式数据卡上的存储器或可以是所述单片机芯片内的数个存储器加上所述拾音器主体内的数个存储器或可以是所述单片机芯片内的数个存储器加上所述拾音器主体内的数个存储器加上所述单片机芯片外所述数个部件式数据卡上的存储器或可以是所述拾音器主体内的数个存储器加上所述单片机芯片外所述数个部件式数据卡上的存储器;The single-chip microcomputer chip is located in the pickup main body, and the memory set can be several memories in the single-chip microcomputer chip plus the storage on the several component data cards or can be outside the single-chip microcomputer chip and described in the pickup main body Several memories on the said several part-type data card or can be several memories in the said single-chip microcomputer chip plus the several memories in the said pick-up body outside the said single-chip microcomputer chip plus the said data The memory on a component data card may be the memory on the several component data cards, or the single-chip microcomputer chip is located on a component data card, and the memory set may be the data in the single-chip microcomputer chip A memory or may be the memory on the several component data cards outside the single-chip microcomputer chip or may be several memories in the single-chip microcomputer chip plus the memory on the several component data cards outside the single-chip microcomputer chip Storer or can be several memory storages in the described single-chip microcomputer chip plus several memory storages in the described pickup body or can be several memory storages in the described single-chip microcomputer chip plus the several memory storages in the described pickup main body plus all The memory on the several component data cards outside the single-chip microcomputer chip or can be the several memory storages in the pickup main body plus the memory on the several component data cards outside the single-chip microcomputer chip;所述拾音器涉及的程序及所述拾音器涉及的数据存储在所述存储器集里;所述数个部件式数据卡中的每个部件式数据卡上的存储器里至少存储有部分所述拾音器涉及的程序或部分所述拾音器涉及的数据;The program related to the pickup and the data related to the pickup are stored in the memory set; at least part of the data related to the pickup is stored in the memory on each component data card in the several component data cards. program or parts of the data referred to by the pickup;对于所述部诸码字中的一个码字涉及的表示如何用音码拼合成语句1发音的拼合信息的值,此表示如何用音码拼合成此语句1发音的拼合信息的值是按照一种编码规则中有关语种1的编码细则来编码生成的,此编码规则中有关语种1的编码细则是含有压缩编码处理的,此表示如何用音码拼合成此语句1发音的拼合信息的值也是拼合信息的值1,此编码规则是只适用于语种1或适用于包括语种1在内的多个语种的一种编码规则,只需从前述一个码字即可得到前述拼合信息的值1或需从包括前述一个码字在内的所述部诸码字中的多个码字才能得到前述拼合信息的值1,所述拾音器的所述单片机芯片利用所述读码头可读取前述一个码字或前述多个码字而得到前述拼合信息的值1,所述拾音器的所述单片机芯片是按一种解码规则中有关语种2的解码细则来对读码得到的前述拼合信息的值1执行解码使所述拾音器的所述发音器件播出前述语句1发音的,而且此解码规则中有关语种2的解码细则是含有压缩解码处理的,此解码规则是只适用于语种2或适用于包括语种2在内的多个语种的一种解码规则;For the value of the combination information of how a code word in the code words of the said part relates to how to piece together the pronunciation of the sentence 1 with the phonetic sign indicating number, this expression is according to a The coding rules related to language 1 in this coding rule are coded and generated. The coding rules related to language 1 in this coding rule contain compression coding processing. The value 1 of the spliced information, this encoding rule is only applicable to language 1 or a coding rule applicable to multiple languages including language 1, and the value 1 or It is necessary to obtain the value 1 of the aforementioned combination information from a plurality of codewords in the codewords including the aforementioned codeword, and the single-chip microcomputer chip of the pickup can read the aforementioned codeword by using the reading dock. Word or the aforementioned multiple code words to obtain the value 1 of the aforementioned combination information, the single-chip microcomputer chip of the pickup is to perform the value 1 of the aforementioned combination information obtained by reading the code according to the decoding rules of the relevant language 2 in a decoding rule Decoding enables the pronunciation device of the pickup to broadcast the pronunciation of the aforementioned sentence 1, and the decoding details of the relevant language 2 in the decoding rule contain compression and decoding processing, and this decoding rule is only applicable to the language 2 or is suitable for including the language A decoding rule for multiple languages including 2;对于前述拼合信息的值1涉及的前述语句1,可以是一个完整句子,或者,是一个完整句子的部分,或者,是含有多个完整句子,或者,是含有数个完整句子和数个非完整句子,或者,是含有多个非完整句子;For the aforementioned sentence 1 involved in the value 1 of the aforementioned combined information, it can be a complete sentence, or a part of a complete sentence, or contain multiple complete sentences, or contain several complete sentences and several incomplete sentences. Sentence, or, contains multiple incomplete sentences;所述部诸码字是所述诸个码字中的全部码字或部分码字;所述诸个码字可以是一个或多个码字;所述部诸码字可以是一个或多个码字。All codewords of described part are all codewords or part codewords in described all codewords; Described all codewords can be one or more codewords; All codewords of described part can be one or more codewords Codeword.24.根据权利要求23所述的拾音器主体,其特征在于:24. The pickup body of claim 23, wherein:所述语句1是属于所述语种1的一个语句,所述语种1是属于印欧语系的一个语种或是属于非印欧语系的一个语种或是一个复合语种;是用所述编码规则中有关所述语种1的编码细则来编码生成所述拼合信息的值1的,且这种编码规则中有关所述语种1的编码细则是含有压缩编码处理的,这种编码规则中有关所述语种1的编码细则是直播方式或TTS方式或混合方式的;The sentence 1 is a sentence belonging to the language 1, and the language 1 is a language belonging to the Indo-European language family or a language belonging to a non-Indo-European language family or a compound language; The encoding rules of the language 1 are used to encode and generate the value 1 of the combined information, and the encoding rules related to the language 1 in this encoding rule include compression encoding processing, and the encoding rules related to the language 1 The encoding details are live broadcast or TTS or mixed;所述语句1是属于所述语种2的一个语句,所述语种2是属于印欧语系的一个语种或是属于非印欧语系的一个语种或是一个复合语种;所述单片机芯片是按所述解码规则中有关所述语种2的解码细则来对所述拼合信息的值1执行解码使所述拾音器的所述发音器件播出所述语句1发音的,这种解码规则中有关所述语种2的解码细则是含有压缩解码处理的,这种解码规则中有关所述语种2的解码细则是直播方式或TTS方式或混合方式的;The statement 1 is a statement belonging to the language 2, and the language 2 is a language belonging to the Indo-European language family or a language belonging to a non-Indo-European language family or a compound language; The decoding rules related to the language 2 in the decoding rules are used to perform decoding on the value 1 of the combined information to make the pronunciation device of the pickup play the pronunciation of the sentence 1. In this decoding rule, the related language 2 The decoding details include compression decoding processing, and the decoding details related to the language 2 in this decoding rule are live broadcast or TTS or mixed;可按照前述这种解码规则中有关所述语种2的解码细则来对表示前述解码语句集中语句发音的拼合信息的值进行解码的解码程序及解码数据存储在所述存储器集里;According to the decoding rules about the language 2 in the above-mentioned decoding rules, the decoding program and the decoding data for decoding the value of the combined information representing the pronunciation of the sentences in the decoding sentence set are stored in the memory set;所述语种1与所述语种2是相同语种或是不同语种;The language 1 and the language 2 are the same language or different languages;前述这种编码规则中有关所述语种1的编码细则有定义编码涉及的编码语句集,所述语句1属于此编码语句集;The coding rules about the language 1 in the aforementioned coding rules define the coding sentence set involved in the coding, and the sentence 1 belongs to the coding sentence set;前述这种解码规则中有关所述语种2的解码细则有定义解码涉及的解码语句集,所述语句1属于此解码语句集;The decoding rules about the language 2 in the aforementioned decoding rules define the set of decoding sentences involved in decoding, and the sentence 1 belongs to the set of decoding sentences;前述这种编码规则中有关所述语种1的编码细则可用于对前述编码语句集中任一语句进行编码以生成表示如何用音码拼合成此任一语句发音的拼合信息的值;所述单片机芯片可按前述这种解码规则中有关所述语种2的解码细则来对表示如何用音码拼合成前述解码语句集中任一语句发音的拼合信息的值进行解码使所述拾音器的所述发音器件播出语句发音;The coding rules about the language 1 in the aforementioned coding rules can be used to encode any sentence in the aforementioned coding sentence set to generate the value of how to use phonetic codes to splicing into this arbitrary sentence pronunciation; the single-chip microcomputer chip Can be decoded according to the decoding rules about the language 2 in the aforementioned decoding rules to represent how to splicing into the value of the splicing information of any sentence pronunciation in the aforementioned decoding sentence collection with the phonetic sign indicating number Pronunciation of sentences;对于前述编码语句集中任一语句,可用多个音码来拼合成这任一语句的发音;For any sentence in the aforementioned coded sentence collection, a plurality of phonetic codes can be used to piece together the pronunciation of this arbitrary sentence;对于前述解码语句集中任一语句,可用多个音码来拼合成这任一语句的发音;For any sentence in the aforementioned decoding sentence collection, a plurality of phonetic codes can be used to piece together the pronunciation of this arbitrary sentence;用以拼合前述编码语句集发音的音码,同样一个音码可以用于前述编码语句集中的不同语句的拼合发音;In order to piece together the phonetic sign indicating number of pronunciation of aforementioned coded sentence set, the same phonetic code can be used for the combined pronunciation of the different sentences in aforementioned coded sentence set;用以拼合前述解码语句集发音的音码,同样一个音码可以用于前述解码语句集中的不同语句的拼合发音;In order to piece together the phonetic sign indicating number of pronunciation of aforementioned decoded sentence set, same sound code can be used for the combined pronunciation of the different sentences of aforementioned decoded sentence set;所述播出语句发音是播出语句的全部发音或播出语句的非全部发音;The pronunciation of the broadcast sentence is the entire pronunciation of the broadcast sentence or the non-full pronunciation of the broadcast sentence;所述播出所述语句1发音是播出语句1的全部发音或播出语句1的非全部发音;The broadcasting of the pronunciation of the sentence 1 is to broadcast all the pronunciations of the sentence 1 or to broadcast not all the pronunciations of the sentence 1;所述播出前述语句1发音是播出语句1的全部发音或播出语句1的非全部发音;The pronunciation of the aforementioned sentence 1 is to broadcast all the pronunciations of the sentence 1 or to broadcast the non-full pronunciation of the sentence 1;所述读物可以是书,书籍,印有图文的卡片,报纸杂志或地图;The reading materials can be books, books, cards with pictures and texts, newspapers, magazines or maps;所述语音编码有声读物,有一个或多个页面而且只在一个页面上有码字,或者,有多个页面而且在多个页面上都有码字;所述语音编码有声读物的任一页面上或没有码字,或有一个码字,或有多个处在任意不同位置的码字;The speech-encoded audiobook has one or more pages and codewords on only one page, or has multiple pages and codewords on multiple pages; any page of the speech-encoded audiobook or there is no codeword, or there is a codeword, or there are multiple codewords in any different positions;所述诸个码字中的某一码字是处在所述读物的整页面上需播此码字对应播的数个语句发音的位置;A certain code word in described all code words is the position that needs to broadcast several sentence pronunciations that this code word corresponds to broadcast on the whole page of described reading material;所述读码头是可用于读取所述语音编码有声读物的整页面上的导电码的机电式导电码读码头,或者所述读码头是可用于读取所述语音编码有声读物的整页面上的条形码或导电码的光电式读码头,或者所述读码头是可用于读取所述语音编码有声读物的整页面上的图形码的光电式图形码读码头,或者所述读码头是可用于读取所述语音编码有声读物的整页面上的磁码的磁码读码头,或者所述读码头是可用于读取所述语音编码有声读物的整页面上的二维条形码或一维条形码或导电码的照相式读码头;The reading dock is an electromechanical conductive code reading dock operable to read a conductive code on an entire page of the speech-encoded audiobook, or the reading dock is operable to read an entire page of the speech-encoded audiobook. bar code or conductive code, or the reader is an optoelectronic graphic code reader that can be used to read a graphic code on an entire page of the voice-coded audiobook, or the reader is a photoelectric reader that can be used to a magnetic code reading header that reads a magnetic code on an entire page of said speech-encoded audiobook, or said reading header is operable to read a two-dimensional barcode or a one-dimensional barcode on an entire page of said speech-encoded audiobook or Photo-type reading terminal of conductive code;对于所述诸个码字中的某一码字,此码字的载体是所述读物的一个页,或者,此码字的载体是一小块表面不导电的平面材料,此平面材料是粘贴在所述读物的一个页面上的;For a certain code word in the said code words, the carrier of the code word is a page of the reading material, or the carrier of the code word is a small piece of surface non-conductive planar material, and this planar material is pasted on a page of the reading;可把所述拾音器主体做成大机体带读码头形式的拾音器主体且机体的外型适于让语音编码有声读物放在机体里,机体上有部件式卡插槽,使用时,将所述语音编码有声读物放入机体里使用,或者,可把所述拾音器主体做成大机体带读码头形式的拾音器主体且机体的外型适于让语音编码有声读物放在机体上,机体上有部件式卡插槽,使用时,将所述语音编码有声读物放在机体上使用,或者,可把所述拾音器主体做成手持式的拾音器主体,手持式的拾音器主体上有部件式卡插槽,或者,可把所述拾音器主体做成小壳体带读码头形式的拾音器主体且小壳体可放在桌面上,小壳体上有部件式卡插槽;所述拾音器主体是没与其它电子类产品集成的,或者,所述拾音器主体是与手机集成在一起的或所述拾音器主体是与MP3播放机集成在一起的或者所述拾音器主体是与复读机集成在一起的;The main body of the pickup can be made into a pickup main body in the form of a large body with a reading dock and the shape of the body is suitable for allowing speech coded audiobooks to be placed in the body. There is a component card slot on the body. When in use, the voice The coded audiobook is put into the body for use, or the pickup body can be made into a large body with a reading dock and the shape of the body is suitable for putting the speech coded audiobook on the body, and the body has a component type Card slot, when in use, put the voice coded audiobook on the body for use, or, the pickup body can be made into a handheld pickup body, and there is a component card slot on the handheld pickup body, or , the main body of the pickup can be made into a pickup main body of a small housing with a reading dock and the small housing can be placed on the desktop, and there is a component type card slot on the small housing; the main body of the pickup is not connected with other electronic devices. Product integration, or, the main body of the pickup is integrated with the mobile phone or the main body of the pickup is integrated with the MP3 player or the main body of the pickup is integrated with the repeater;所述数个部件式数据卡是一个部件式数据卡且所述数个部件式卡插槽是一个部件式卡插槽且此部件式数据卡是与所述语音编码有声读物配套的或此部件式数据卡是与所述拾音器主体配套的或此部件式数据卡是与所述语音编码有声读物和所述拾音器主体配套的或此部件式数据卡既不与所述语音编码有声读物配套也不与所述拾音器主体配套,或者,所述数个部件式数据卡是多个部件式数据卡且所述数个部件式卡插槽是多个部件式卡插槽且这数个部件式数据卡全部是与所述语音编码有声读物配套的或这数个部件式数据卡全部是与所述拾音器主体配套的或这数个部件式数据卡中的部分部件式数据卡是与所述语音编码有声读物配套的而另有部分部件式数据卡是与所述拾音器主体配套的或这数个部件式数据卡全部是与所述语音编码有声读物和所述拾音器主体配套的或这数个部件式数据卡的部分部件式数据卡是与所述语音编码有声读物和所述拾音器主体配套的或这数个部件式数据卡既不与所述语音编码有声读物配套也不与所述拾音器主体配套。The plurality of component data cards is a component data card and the plurality of component card slots is a component card slot and the component data card is associated with the speech encoded audiobook or the component The data card is associated with the pickup body or the sub-data card is associated with the speech-encoded audiobook and the pickup body or the sub-data card is neither associated with the speech-encoded audiobook nor Matching with the pickup body, or, the several component data cards are a plurality of component data cards and the several component card slots are a plurality of component card slots and the several component data cards All of these component data cards are matched with the speech coded audiobook or all of these component data cards are matched with the pickup body or some of these component data cards are matched with the speech code audiobook The readings are matched and some component data cards are matched with the pickup body or these several component data cards are all matched with the speech coded audiobook and the pickup body or these several component data cards are matched with the pickup body Partial data cards of the card are associated with the speech-encoded audiobook and the pickup body or the several-part data cards are neither associated with the speech-encoded audiobook nor with the pickup body.
CNA2006102012899A2005-12-202006-12-13Voice coding talking book and pickup main bodyWithdrawnCN101201892A (en)

Priority Applications (1)

Application NumberPriority DateFiling DateTitle
CNA2006102012899ACN101201892A (en)2005-12-202006-12-13Voice coding talking book and pickup main body

Applications Claiming Priority (3)

Application NumberPriority DateFiling DateTitle
CN200510200842.22005-12-20
CN2005102008422005-12-20
CNA2006102012899ACN101201892A (en)2005-12-202006-12-13Voice coding talking book and pickup main body

Publications (1)

Publication NumberPublication Date
CN101201892Atrue CN101201892A (en)2008-06-18

Family

ID=39517051

Family Applications (1)

Application NumberTitlePriority DateFiling Date
CNA2006102012899AWithdrawnCN101201892A (en)2005-12-202006-12-13Voice coding talking book and pickup main body

Country Status (1)

CountryLink
CN (1)CN101201892A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN103366146A (en)*2012-04-052013-10-23华东师范大学Self-service specification reader and reading system based on two-dimensional code
CN111369978A (en)*2018-12-262020-07-03北京搜狗科技发展有限公司 A data processing method, device and device for data processing
CN112148838A (en)*2020-09-232020-12-29北京中电普华信息技术有限公司Business source object extraction method and device

Cited By (5)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN103366146A (en)*2012-04-052013-10-23华东师范大学Self-service specification reader and reading system based on two-dimensional code
CN111369978A (en)*2018-12-262020-07-03北京搜狗科技发展有限公司 A data processing method, device and device for data processing
CN111369978B (en)*2018-12-262024-05-17北京搜狗科技发展有限公司 A data processing method, a data processing device and a data processing device
CN112148838A (en)*2020-09-232020-12-29北京中电普华信息技术有限公司Business source object extraction method and device
CN112148838B (en)*2020-09-232024-04-19北京中电普华信息技术有限公司Service source object extraction method and device

Similar Documents

PublicationPublication DateTitle
RamanAuditory user interfaces: toward the speaking computer
US5091950A (en)Arabic language translating device with pronunciation capability using language pronunciation rules
CN107798123B (en)Knowledge base and establishing, modifying and intelligent question and answer methods, devices and equipment thereof
CN101414412A (en)Interaction type acoustic control children education studying device
CN103098124B (en)Method and system for text to speech conversion
CN100504733C (en) Phonetic-Shape Coded Chinese Character Input Method
CN101201892A (en)Voice coding talking book and pickup main body
WO2005121993A1 (en)Application system of multidimentional chinese learning
Liang et al.A Taiwanese text-to-speech system with applications to language learning
CN1963753A (en)Multidimensional Chinese understanding system and method applied for Chinese study
CN201111735Y (en)Interaction type acoustic control children education studying device
CN1901041A (en)Voice dictionary forming method and voice identifying system and its method
JP2005250525A (en) Kanji analysis support device, foreign language sentence processing device, and translation program
CN111326030A (en)Reading, dictation and literacy integrated learning system, device and method
CN103853705A (en)Real-time voice subtitle translation method of Chinese voice and foreign language voice of computer
ChomskySelected readings on transformational theory
SchmirlerSyntactic Features and Text Types in 20th Century Plains Cree: A Constraint Grammar Approach
CN115016654A (en) Chinese phonetic symbols, input method, phonetic lookup method, device, system and storage medium
CN110716654B (en)Chinese character input method, voice synthesis method and Chinese character input system
CN101436179A (en)Method and apparatus for converting text
CN101409021A (en)Sound music reading material and pick-up device
LemmenmeierSpoken language corpora: Approaches for facilitating linguistic research
TW480430B (en)Communication assistance system collocating with sign language keyboard, sentence generation and audio-visual feedback output
CN201017486Y (en)Portable typing type reading machine
CN103854647A (en)Chinese-foreign-language bidirectional real time voice translation wireless mobile communication device

Legal Events

DateCodeTitleDescription
C06Publication
PB01Publication
C04Withdrawal of patent application after publication (patent law 2001)
WW01Invention patent application withdrawn after publication

[8]ページ先頭

©2009-2025 Movatter.jp