Summary of the invention:
The object of the invention is to overcome the deficiencies of existing keyword encoding schemes and to make every code unique. To that end it provides a unique encoding method for Chinese and English keywords, together with a dedicated letter remote. The method combines the initial letters of Chinese pinyin or of English words with a supplemental code derived from information associated with the descriptor and a popular code that reflects how often the descriptor is clicked.
The technical solution of the invention is realized as follows. The unique encoding method for Chinese and English keywords and the dedicated letter remote comprise:
1) Character code: the character code of a descriptor is formed from initial letters. For Chinese (single characters or words, likewise below) the first pinyin letter of each character is used; for English (single words or phrases, likewise below) the initial letter of each word is used. Character codes are case-insensitive by default, so a character code is a combination of the 26 letters;
2) Supplemental code: the supplemental code is the character code of information associated with the descriptor, formed in the same way (first pinyin letters for Chinese, word initials for English). Each group of supplemental code begins with an identifier; the identifier together with the character code of the associated information constitutes the supplemental code;
3) Popular code: the popular code is generated from the click rate of the descriptor and expresses the rank of the descriptor in Arabic numerals, such as 0, 1, 2. When click rates are equal, or when entries are first loaded into the database, the popular code values of the descriptors concerned may be set arbitrarily to distinct values. The popular code begins with an identifier; the identifier and the numeric value together constitute the popular code, for example ".1", ".2", ".3";
4) Dedicated letter remote: a wireless infrared remote whose keyboard is divided into zones carrying function keys, 10 numeric keys and 26 letter keys. The 26 letter keys occupy the lower part of the keyboard, the 10 numeric keys the middle, and the function keys the upper part and one side of the numeric keys. The letters of the character code and supplemental code correspond one-to-one with the 26 letter keys, and the popular code corresponds to the numeric keys, so codes are entered by pressing the keys directly.
The supplemental code effectively reduces the rate of duplicate codes and improves the retrieval speed of descriptors.
The popular code determines the ordering among duplicate codes and, more importantly, gives a systematic indication of how often each descriptor is clicked.
Codes produced by this method can be entered for retrieval with the dedicated letter remote, an ordinary keyboard, or a remote control with a letter keypad, so the method is highly versatile. It matches the way people think in language, is easy to learn, and lets users find what they need quickly. It is particularly suited to video on demand, information query, and Internet keyword retrieval.
Embodiment:
Code generation: once a few simple settings are made, the codes can be generated automatically by computer. For example, we set up a database as shown in the coding examples, with fields such as number, song title, word count, character code, singer, singer character code, language, request count, popular code, and code. Only the basic data are entered manually (number, song title, singer, language); everything else can be generated by computer. The character code is generated from the song title, the singer's supplemental code from the singer's name, and the popular code by program from the request count. These are then combined into the required code.
The specific encoding steps are as follows:
1. Build the descriptor dictionary from Chinese, English, or mixed Chinese-English vocabulary;
2. Use a computer to generate the character code of each descriptor: the first pinyin letter of each Chinese character and the initial letter of each English word are taken, digits are kept unchanged (the handling of digits may be defined flexibly as needed), and all other characters are discarded;
3. Determine the supplemental code for the dictionary: according to the field in which the descriptors are used, manually select the main associated or indispensable content (for song titles the singer's name is the associated information; for films, the film star's name; and so on) and generate the corresponding supplemental code by the same principle;
4. Over the whole dictionary, count the click rate of each descriptor and generate the corresponding popular code;
5. Combining the above, keyword code = character code + supplemental code + popular code.
The following examples are all codes made according to the above principle. For retrieval, one may use the full code (character code + supplemental code + popular code) or a quicker combination such as character code + supplemental code, character code + popular code, supplemental code + popular code, or supplemental code + character code, to suit the retrieval needs of different fields.
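The five steps above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: Chinese text is assumed to have been converted to pinyin syllables beforehand (the conversion itself is not shown), "-" and "." are used as the identifiers as in the examples of this document, and the function names `character_code` and `keyword_code` are hypothetical.

```python
def character_code(tokens):
    """Build a character code from pre-split tokens.

    Each token is assumed to be an English word, a pinyin syllable,
    or a run of digits. Letters contribute their initial, digits are
    kept unchanged, and anything else is discarded.
    """
    parts = []
    for tok in tokens:
        first = tok[:1]
        if tok.isdigit():
            parts.append(tok)            # digits are kept as they are
        elif first.isascii() and first.isalpha():
            parts.append(first.upper())  # initial letter, case-insensitive
        # other tokens (punctuation and so on) are dropped
    return "".join(parts)


def keyword_code(title_tokens, related_tokens, popular=None,
                 supp_id="-", pop_id="."):
    """Combine character code + supplemental code + optional popular code."""
    code = character_code(title_tokens) + supp_id + character_code(related_tokens)
    if popular is not None:
        code += f"{pop_id}{popular}"
    return code


if __name__ == "__main__":
    print(keyword_code(["ai"], ["mo", "wen", "wei"], 2))
    print(keyword_code(["A", "BETTER", "DAY"], ["liu", "de", "hua"]))
```

Leaving `popular` unset corresponds to the quicker "character code + supplemental code" form; passing a rank appends the popular code to give the full code.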
The examples are taken from a database of 12,000 commonly used songs; for reasons of space only a part of it can be shown:
Coding example: keyword code = character code + supplemental code + popular code. The popular code is used to distinguish entries only when the first two parts produce a duplicate. Here the popular code is written ".0", ".1", ".2", and so on;
Application note:
"Love" by Mo Wenwei has the code A-MWW.2, which is unique; its character code A alone is shared by 15 entries in this table;
"Angel" by Tao Zhe is A-TZ.1 and "Angeline" by Tao Zhe is A-TZ.0. With character code + supplemental code alone the two codes are identical, so the popular codes ".0" and ".1" are added to tell them apart. Because ".0" is the default value it may be omitted in use, which shortens the code.
"A BETTER DAY" by Liu Dehua has the code ABD-LDH and "a brandnew day" by Mo Wenwei has the code ABD-MWW: the character codes are the same but the supplemental codes differ.
Both the character code and the supplemental code use the 26 English letters and differ only in how the letters are combined. The relation between code length, the number of descriptors that can be represented, and the frequency of duplicates is as follows:
A one-letter character code can represent 26 descriptors; duplicates are most frequent;
A two-letter character code can represent 26*26 = 676 descriptors; duplicates are frequent;
A three-letter character code can represent 26*26*26 = 17,576 descriptors; duplicates are fairly frequent;
A four-letter character code can represent 26*26*26*26 = 456,976 descriptors; duplicates are fewer;
A five-letter character code can represent 26*26*26*26*26 = 11,881,376 descriptors; duplicates are rare.
......
These figures show that four-letter character codes offer more than 450,000 combinations. Allowing for the irregular, scattered distribution of vocabulary, an entry in a lexicon of 40,000 can usually be retrieved by entering 3 or 4 letters. In our long-term tests with song libraries of 10,000 to 35,000 songs, entering 3 or 4 letters of the character code was generally enough, and entries with character codes longer than 4 letters were all easy to find. This holds all the more as the lexicon grows: the longer the character code, the fewer the duplicates and the easier the entry is to find. Duplicates are most frequent for one- and two-letter character codes. If a one-letter code such as A is shared by 1,000 entries or more, the scheme is plainly unusable without a supplemental code or a similar device.
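The capacity figures above follow from 26 raised to the code length. A small Python sketch reproduces them and adds a rough average number of entries per code; that average assumes codes are spread uniformly, which the text itself notes is not the case for real vocabulary, so it is only an optimistic estimate. The function names are illustrative.

```python
ALPHABET_SIZE = 26  # character codes use the 26 English letters


def capacity(length: int) -> int:
    """Number of distinct character codes of the given length."""
    return ALPHABET_SIZE ** length


def average_entries_per_code(lexicon_size: int, length: int) -> float:
    """Average entries sharing one code, ASSUMING a uniform spread.

    Real vocabularies cluster on some letters, so actual duplicate
    counts will be higher than this figure suggests.
    """
    return lexicon_size / capacity(length)


if __name__ == "__main__":
    for n in range(1, 6):
        print(n, capacity(n), round(average_entries_per_code(40_000, n), 4))
```

For a lexicon of 40,000 entries the uniform average falls below one entry per code at four letters, which is consistent with the observation that 3 or 4 letters are usually enough.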
The effect of supplemental code:
Many descriptors, particularly those of one or two characters, share the same code. In such cases a supplemental code for the associated content is added to tell them apart, for example a particular song by a given singer, or a particular film featuring a given star. Identification then becomes easier: the supplemental code not only solves the duplicate code problem but also makes it possible to retrieve directly by supplemental code.
The supplemental code is one of the original features of the invention.
A good coding scheme must be thought through and must consider ease of use. Besides resolving duplicates effectively, the supplemental code gives users an additional route, since they can retrieve entries by the letters of the supplemental code. Without it there would be only a single way to retrieve.
By the same reasoning, where a descriptor has more than one associated field, further supplemental codes can be added following this principle. A descriptor may thus have several supplemental codes, which increases the number of retrieval routes.
The effect of the popular code:
The popular code also serves to resolve duplicate descriptor codes; more importantly, it tells the user how popular a descriptor is. Using the popular code guarantees that each code is unique.
Where the character code and supplemental code are both identical, entries are sorted in descending order of click rate, which indicates their popularity. In general, an entry that many people have clicked before is more likely to be clicked again, so it is reasonable to place it nearer the front.
The popular code is also one of the original features of the invention. It reveals the popularity of a descriptor and indicates how likely it is to be clicked; by ordering results by popularity, the computer can achieve a "what you see is what you want" effect in human-computer interaction. The popular code is simply one part of the code; it ensures that the code is complete and free of duplicates. Because it is updated dynamically and continuously, it reflects actual popularity. Without a popular code, the ordering of descriptors could not systematically reflect how often they are clicked.
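The ranking described above might be implemented roughly as follows. This is a sketch under assumptions, not the invention's own program: entries are plain dictionaries with hypothetical fields `id`, `code`, `singer` and `requests`, and ties in request count are broken by input order rather than by the arbitrary assignment the text allows.

```python
from collections import defaultdict


def assign_popular_codes(entries, group_key):
    """Rank entries inside each duplicate group by descending request count.

    Returns a dict mapping entry id -> popular code (0 = most requested).
    Ties keep their input order because Python's sort is stable.
    """
    groups = defaultdict(list)
    for entry in entries:
        groups[group_key(entry)].append(entry)

    ranks = {}
    for group in groups.values():
        ordered = sorted(group, key=lambda e: -e["requests"])
        for rank, entry in enumerate(ordered):
            ranks[entry["id"]] = rank
    return ranks


if __name__ == "__main__":
    songs = [
        {"id": "Angel", "code": "A", "singer": "TZ", "requests": 120},
        {"id": "Angeline", "code": "A", "singer": "TZ", "requests": 123},
        {"id": "Love", "code": "A", "singer": "MWW", "requests": 11178},
    ]
    # Group by character code + supplemental code (as in Table one).
    print(assign_popular_codes(songs, lambda e: (e["code"], e["singer"])))
    # Group by character code only (as in Table two).
    print(assign_popular_codes(songs, lambda e: e["code"]))
```

Re-running the function after request counts change gives the dynamic update mentioned in the text; the choice of `group_key` decides which of the tabulated code variants the ranks correspond to.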
Descriptor retrieval:
In practice, for example when requesting a song such as Mo Wenwei's "Love", whose code is "A-MWW.2", the code can be entered exactly; the computer then finds all descriptors having this code and displays them.
Because the codes are built into the computer program, they are what the computer recognizes. If you enter only A, the computer cannot tell which A is meant and can only list all entries sharing the code A. Only a non-duplicated code such as "A-MWW.2" identifies a single entry.
Of course, in interactive use a song can be requested in several ways:
1) Fuzzy query by popularity: there is no need to enter the whole code above. Enter just A and the computer quickly filters all song titles whose code begins with A and sorts them in descending order of popularity. If there are few duplicates the results fit on one screen page; if there are several pages, "next page" can be used to browse them;
2) Fuzzy query by letter and supplemental code: enter A, then the singer identifier (written "-" above), then the supplemental code MWW. Usually entering just M or MW is enough; each additional letter eliminates another layer of duplicates.
3) Fuzzy query by supplemental code: enter the singer identifier (written "-" above), then MWW; usually M or MW is enough. This first finds Mo Wenwei and hence her song list; pressing A then gives the matching songs arranged by popularity.
4) A pinyin letter that cannot be identified may be represented by "?". For example, enter A-MW? for a fuzzy query.
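The four query modes above can be approximated with prefix matching plus a single-character wildcard. The sketch below is an illustration only: the index is a small hand-made list using codes and request counts from the tables in this document, the field names and the `search` function are hypothetical, and a query that starts with the identifier "-" is matched against the supplemental part of each code.

```python
import re


def query_to_regex(query):
    """Translate a query into a prefix regex; '?' matches any one character."""
    parts = ["." if ch == "?" else re.escape(ch) for ch in query.upper()]
    return re.compile("".join(parts))


def search(index, query, supp_id="-"):
    """Return entries whose code matches the query, most requested first."""
    pattern = query_to_regex(query)
    hits = []
    for entry in index:
        code = entry["code"]
        if query.startswith(supp_id) and supp_id in code:
            target = code[code.find(supp_id):]  # match from the supplemental code
        else:
            target = code
        if pattern.match(target):  # match() anchors at the start: prefix search
            hits.append(entry)
    return sorted(hits, key=lambda e: -e["requests"])


if __name__ == "__main__":
    index = [
        {"code": "A-MWW", "requests": 11178},
        {"code": "A-XHD", "requests": 11513},
        {"code": "A-TZ.0", "requests": 123},
        {"code": "A-TZ.1", "requests": 120},
        {"code": "ABD-LDH", "requests": 7},
        {"code": "ABD-MWW", "requests": 3},
    ]
    for q in ("A", "A-M", "-MW", "A-MW?"):
        print(q, [e["code"] for e in search(index, q)])
```

Each extra letter in the query narrows the prefix, which mirrors the remark that every additional letter removes another layer of duplicates.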
The specific coding examples are tabulated below:
Table one:
| Song title | Word count | Character code | Singer | Singer character code | Language | Popular code 1 | Request count | Code 1 |
| ANDY | 1 | A | A Du | AT | National language | | 12 | A-AT |
| AMANI | 1 | A | BE-YOND | BY | Guangdong language | | 1 | A-BY |
| ASHES | 1 | A | Cai Jianya | CJY | English | | 5 | A-CJY |
| Like | 1 | A | Chen Xiaochun | CXC | National language | | 6 | A-CXC |
| ABCDE | 1 | A | Liu Dehua | LDH | National language | | 70 | A-LDH |
| AMOUR | 1 | A | Liang Yongqi | LYQ | National language | | 5000 | A-LYQ |
| Like | 1 | A | Mo Wenwei | MWW | National language | | 11178 | A-MWW |
| AITAI | 1 | A | The little fine jade of honey snow | MXWQ | National language | | 12080 | A-MXWQ |
| Angel | 1 | A | Tao Zhe | TZ | National language | 1 | 120 | A-TZ.1 |
| Angeline | 1 | A | Tao Zhe | TZ | National language | 0 | 123 | A-TZ.0 |
| Like | 1 | A | Little brave team | XHD | National language | | 11513 | A-XHD |
| arigatou | 1 | A | Zheng Xiuwen | ZXW | Guangdong language | | 13 | A-ZXW |
| ANGEL | 1 | A | Zhao Yonghua | ZYH | National language | | 5 | A-ZYH |
| AMOUR | 1 | A | Open the schoolmate | ZYY | Guangdong language | | 17 | A-ZYY |
| Like | 1 | A | Zheng Zhihua | ZZH | National language | | 8000 | A-ZZH |
| ALL AROUND THE WORLD | 4 | AATW | Li Wen | LW | English | | 13 | AATW-LW |
| ALL ABOUT US | 3 | AAU | Xu Huixin | XHQ | National language | | 11 | AAU-XHQ |
| Like | 3 | ABA | Zhao Yonghua | ZYH | National language | | 4 | ABA-ZYH |
| Like that you are the same | 7 | ABANDYY | Xu Ruyun | XLY | National language | | 10101 | ABANDYY-XLY |
| Like me | 4 | ABAW | Zero point band | LDYD | National language | | 6262 | ABAW-LDYD |
| A BETTER DAY | 3 | ABD | Liu Dehua | LDH | National language | | 7 | ABD-LDH |
| a brandnew day | 3 | ABD | Mo Wenwei | MWW | National language | | 3 | ABD-MWW |
| Love less than flavour | 6 | ABDDZW | Black dragon | HL | National language | | 2 | ABDDZW-HL |
| Love less than flavour | 6 | ABDDZW | Jiang Yuheng | JYH | National language | | 1 | ABDDZW-JYH |
| Like not to the people | 4 | ABDR | Jiang Hui | JH | National language | | 55 | ABDR-JH |
| Like not to the people | 4 | ABDR | Ceng Xinmei | ZXM | Amoyese | | 43 | ABDR-ZXM |
| Like not regret | 4 | ABHH | Tian Zhen | TZ | National language | | 2225 | ABHH-TZ |
Table two:
| Song title | Word count | Pinyin code | Singer | Singer pinyin code | Language | Popular code 2 | Request count | Code 2 | Code 3 |
| AITAI | 1 | A | The little fine jade of honey snow | MXWQ | National language | 0 | 12080 | A.0 | A.0-MXWQ |
| Like | 1 | A | Little brave team | XHD | National language | 1 | 11513 | A.1 | A.1-XHD |
| Like | 1 | A | Mo Wenwei | MWW | National language | 2 | 11178 | A.2 | A.2-MWW |
| Like | 1 | A | Zheng Zhihua | ZZH | National language | 3 | 8000 | A.3 | A.3-ZZH |
| AMOUR | 1 | A | Liang Yongqi | LYQ | National language | 4 | 5000 | A.4 | A.4-LYQ |
| Angeline | 1 | A | Tao Zhe | TZ | National language | 5 | 123 | A.5 | A.5-TZ |
| Angel | 1 | A | Tao Zhe | TZ | National language | 6 | 120 | A.6 | A.6-TZ |
| ABCDE | 1 | A | Liu Dehua | LDH | National language | 7 | 70 | A.7 | A.7-LDH |
| AMOUR | 1 | A | Open the schoolmate | ZYY | Guangdong language | 8 | 17 | A.8 | A.8-ZYY |
| arigatou | 1 | A | Zheng Xiuwen | ZXW | Guangdong language | 9 | 13 | A.9 | A.9-ZXW |
| ANDY | 1 | A | A Du | AT | National language | 10 | 12 | A.10 | A.10-AT |
| Like | 1 | A | Chen Xiaochun | CXC | National language | 11 | 6 | A.11 | A.11-CXC |
| ASHES | 1 | A | Cai Jianya | CJY | English | 12 | 5 | A.12 | A.12-CJY |
| ANGEL | 1 | A | Zhao Yonghua | ZYH | National language | 13 | 5 | A.13 | A.13-ZYH |
| AMANI | 1 | A | BE-YOND | BY | Guangdong language | 14 | 1 | A.14 | A.14-BY |
| ALL AROUND THE WORLD | 4 | AATW | Li Wen | LW | English | | 13 | AATW | AATW-LW |
| ALL ABOUT US | 3 | AAU | Xu Huixin | XHQ | National language | | 11 | AAU | AAU-XHQ |
| Like | 3 | ABA | Zhao Yonghua | ZYH | National language | | 4 | ABA | ABA-ZYH |
| Like that you are the same | 7 | ABANDYY | Xu Ruyun | XLY | National language | | 10101 | ABANDYY | ABANDYY-XLY |
| Like me | 4 | ABAW | Zero point band | LDYD | National language | | 6262 | ABAW | ABAW-LDYD |
| A BETTER DAY | 3 | ABD | Liu Dehua | LDH | National language | 0 | 7 | ABD.0 | ABD.0-LDH |
| a brandnew day | 3 | ABD | Mo Wenwei | MWW | National language | 1 | 3 | ABD.1 | ABD.1-MWW |
| Love less than flavour | 6 | ABDDZW | Black dragon | HL | National language | 0 | 2 | ABDDZW.0 | ABDDZW.0-HL |
| Love less than flavour | 6 | ABDDZW | Jiang Yuheng | JYH | National language | 1 | 1 | ABDDZW.1 | ABDDZW.1-JYH |
| Like not to the people | 4 | ABDR | Jiang Hui | JH | National language | 0 | 55 | ABDR.0 | ABDR.0-JH |
| Like not to the people | 4 | ABDR | Ceng Xinmei | ZXM | Amoyese | 1 | 43 | ABDR.1 | ABDR.1-ZXM |
| Like not regret | 4 | ABHH | Tian Zhen | TZ | National language | | 2225 | ABHH | ABHH-TZ |
Table three:
| Song title | Word count | Character code | Singer | Singer pinyin code | Language | Popular code 1 | Popular code 2 | Request count | Code 4 | Code 5 |
| AITAI | 1 | A | The little fine jade of honey snow | MXWQ | National language | | 0 | 12080 | A-MXWQ | A-MXWQ.0 |
| Like | 1 | A | Little brave team | XHD | National language | | 1 | 11513 | A-XHD | A-XHD.1 |
| Like | 1 | A | Mo Wenwei | MWW | National language | | 2 | 11178 | A-MWW | A-MWW.2 |
| Like | 1 | A | Zheng Zhihua | ZZH | National language | | 3 | 8000 | A-ZZH | A-ZZH.3 |
| AMOUR | 1 | A | Liang Yongqi | LYQ | National language | | 4 | 5000 | A-LYQ | A-LYQ.4 |
| Angeline | 1 | A | Tao Zhe | TZ | National language | 0 | 5 | 123 | A-TZ.0 | A-TZ.5 |
| Angel | 1 | A | Tao Zhe | TZ | National language | 1 | 6 | 120 | A-TZ.1 | A-TZ.6 |
| ABCDE | 1 | A | Liu Dehua | LDH | National language | | 7 | 70 | A-LDH | A-LDH.7 |
| AMOUR | 1 | A | Open the schoolmate | ZYY | Guangdong language | | 8 | 17 | A-ZYY | A-ZYY.8 |
| arigatou | 1 | A | Zheng Xiuwen | ZXW | Guangdong language | | 9 | 13 | A-ZXW | A-ZXW.9 |
| ANDY | 1 | A | A Du | AT | National language | | 10 | 12 | A-AT | A-AT.10 |
| Like | 1 | A | Chen Xiaochun | CXC | National language | | 11 | 6 | A-CXC | A-CXC.11 |
| ASHES | 1 | A | Cai Jianya | CJY | English | | 12 | 5 | A-CJY | A-CJY.12 |
| ANGEL | 1 | A | Zhao Yonghua | ZYH | National language | | 13 | 5 | A-ZYH | A-ZYH.13 |
| AMANI | 1 | A | BE-YOND | BY | Guangdong language | | 14 | 1 | A-BY | A-BY.14 |
| ALL AROUND THE WORLD | 4 | AATW | Li Wen | LW | English | | | 13 | AATW-LW | AATW-LW |
| ALL ABOUT US | 3 | AAU | Xu Huixin | XHQ | National language | | | 11 | AAU-XHQ | AAU-XHQ |
| Like | 3 | ABA | Zhao Yonghua | ZYH | National language | | | 4 | ABA-ZYH | ABA-ZYH |
| Like that you are the same | 7 | ABANDYY | Xu Ruyun | XLY | National language | | | 10101 | ABANDYY-XLY | ABANDYY-XLY |
| Like me | 4 | ABAW | Zero point band | LDYD | National language | | | 6262 | ABAW-LDYD | ABAW-LDYD |
| A BETTER DAY | 3 | ABD | Liu Dehua | LDH | National language | | 0 | 7 | ABD-LDH | ABD-LDH.0 |
| a brandnew day | 3 | ABD | Mo Wenwei | MWW | National language | | 1 | 3 | ABD-MWW | ABD-MWW.1 |
| Love less than flavour | 6 | ABDDZW | Black dragon | HL | National language | | 0 | 2 | ABDDZW-HL | ABDDZW-HL.0 |
| Love less than flavour | 6 | ABDDZW | Jiang Yuheng | JYH | National language | | 1 | 1 | ABDDZW-JYH | ABDDZW-JYH.1 |
| Like not to the people | 4 | ABDR | Jiang Hui | JH | National language | | 0 | 55 | ABDR-JH | ABDR-JH.0 |
| Like not to the people | 4 | ABDR | Ceng Xinmei | ZXM | Amoyese | | 1 | 43 | ABDR-ZXM | ABDR-ZXM.1 |
| Like not regret | 4 | ABHH | Tian Zhen | TZ | National language | | | 2225 | ABHH-TZ | ABHH-TZ |
Compared with existing codes of the same kind, the invention has the following features and advantages:
The unique encoding method for Chinese and English keywords was developed on the basis of existing theory and, through continual improvement and testing in practice, has gained functions that other methods lack. Its main features are as follows:
1) The method is aimed mainly at encoding the descriptors of a particular industry or dictionary, but is also applicable to all Chinese characters and vocabulary.
2) Vocabulary can be encoded in advance, or encoded interactively while the dictionary is being built (provided the encoding program is already in place). Encoding requires only simple manual settings; once the raw data have been entered and proofread, the rest is done essentially by computer;
3) The method requires every code to be unique, so that the computer has a unique identifier for each entry and a request returns the intended entry directly. Some encoding methods do not effectively solve the duplicate code problem, which limits their usefulness and scalability: they may just cope with a lexicon of 20,000 entries, but at 100,000 or more there are so many duplicates that they become inconvenient or unusable;
4) The method introduces associated information as a supplemental code, which matches how people naturally think; it is simple, needs no study, aims at foolproof operation, and aids retrieval;
5) The method introduces the popular code, which improves human-computer interaction and brings the computer closer to the user's intent;
6) The method suits Chinese, English, and mixed vocabulary;
7) Each industry can set up its own code database flexibly according to its needs by following the rules of the character code; while keeping its own particular features it shares a common basis, which ensures effective exchange between industries;
8) Considering the variety of input devices in current and possible future use (keyboard, mouse, remote control, dedicated letter remote, touch screen, joystick, and so on), the method uses standard English letters and common symbols as code elements, so codes can be entered conveniently with a simple input method. Coding exists to serve applications. Some encoding methods attend only to the coding itself, give little thought to application, and propose no practical input method; they solve only the problem of storing data in the computer and overlook what matters most, which is use by people. Every encoding method ultimately serves people, and what people want is something simpler and more systematic, that is, "handy and easy to use". This method pays particular attention to ease of use in its design, takes account of the language habits of the Chinese public and the characteristics of human-computer interaction, and unifies the encoding method with the input method.
9) Used together with a corresponding input method and promoted, the method can produce good social benefit. It can be applied first to video on demand, data lookup, touch-screen retrieval, and Internet keyword search, and then to all trades and professions.
With reference to the accompanying drawing, the letter remote is a wireless infrared remote. Its keyboard is divided into zones carrying function keys 1, 10 numeric keys 2 and 26 letter keys 3. The 26 letter keys 3 are located in the lower part of the keyboard and arranged in alphabetical order row by row from top to bottom; the numeric keys 2 are located in the middle of the keyboard; the function keys 1 are located in the upper part of the keyboard and along one side of the numeric keys. The identifiers are assigned to the function keys 1.