Summary of the invention
The deficiency existing for prior art, the technical problem to be solved in the present invention is the message word that contains not clear character to be revised to quality and success ratio that the reception of raising message shows.
In order to achieve the above object, a technical scheme of the present invention has been to provide a kind of message word modification method based on dictionary, it is characterized in that, comprises the following steps:
Step 1, message common wordss are set up to message dictionary;
If contain not clear character in the message word that step 2 receives, in message dictionary, search and this reception word other identical words except not clear character; If the word finding only has one, using this word as revising word.
Preferably, in above-mentioned steps 1, also comprise: the word in message dictionary is set up to phrase storehouse by the conventional collocation of message respectively;
In above-mentioned steps 2, also comprise: the word finding if above-mentioned is greater than one, respectively by its word as an alternative, replace receiving word respectively with message in before and after adjacent, containing not clear word character or that revised, do not form phrase, in phrase storehouse, search the phrase identical with forming phrase; If the phrase finding only has one, using the alternative word containing in phrase as revising word.
Preferably, above-mentioned message dictionary can expand, and its method is: if occur at least one times, same reception word, containing failing to understand character and not being included in message dictionary, is not added into message dictionary by this reception word.
Preferably, above-mentioned phrase storehouse can be expanded, and its method is: if occur at least one times, the phrase that the word of same adjacent two receptions forms is not included in phrase storehouse, this phrase is added into phrase storehouse.
Preferably, in above-mentioned steps 2, also comprise: to revising word, indicate the character at its place, former not clear character place.
Another technical scheme of the present invention has been to provide a kind of message word update the system based on dictionary, it is characterized in that, comprises with lower module and contacts successively:
Dictionary module, function comprises: message common wordss are set up to message dictionary;
Correcting module, function comprises: if contain not clear character in the message word receiving, in message dictionary, search and this reception word other identical words except not clear character; If the word finding only has one, using this word as revising word.
Preferably, in above-mentioned dictionary module, function also comprises: the word in message dictionary is set up to phrase storehouse by the conventional collocation of message respectively;
In above-mentioned correcting module, function also comprises: the word finding if above-mentioned is greater than one, respectively by its word as an alternative, replace to receive word respectively with message in before and after adjacent, containing not clear word character or that revised, do not form phrase, in phrase storehouse, search the phrase identical with forming phrase; If the phrase finding only has one, using the alternative word containing in phrase as revising word.
Preferably, in above-mentioned dictionary module, function also comprises: if occur at least one times, same reception word, containing failing to understand character and not being included in message dictionary, is not added into message dictionary by this reception word.
Preferably, in above-mentioned dictionary module, function also comprises: if occur at least one times, the phrase that the word of same adjacent two receptions forms is not included in phrase storehouse, this phrase is added into phrase storehouse.
Preferably, in above-mentioned correcting module, function also comprises: to revising word, indicate the character at its place, former not clear character place.
The invention has the beneficial effects as follows: the present invention sets up message dictionary by message common wordss, and the word in message dictionary is set up to phrase storehouse by the conventional collocation of message respectively, by the search comparison to this dictionary and phrase storehouse, can effectively to the message word that contains not clear character, revise, the message that has improved wireless or wire transmission receives quality and the success ratio showing.And the present invention also provides dictionary and phrase storehouse to continue the means that expand in receiving message, has more improved the correction efficiency of message word.
Embodiment
For the present invention is become apparent, hereby with preferred embodiment, and coordinate accompanying drawing to be described in detail below.Should note; following examples are to provide to those of ordinary skills and realize and use of the present invention; but not in order to limit the present invention, anyly do not depart from the modification that maximum magnitude that claims of the present invention define is done, be still included in scope of patent protection of the present invention.
The embodiment of the message word modification method based on dictionary
Whole world navigation warning receiver business (being designated hereinafter simply as NAVTEX) is an important component part of GMDSS (being designated hereinafter simply as GMDSS).Radio station utilizes the CFEC mode of arrowband printergram (being designated hereinafter simply as NBDP) to use English to broadcast NAVTEX message in 518KHz frequency on the bank.Boat-carrying NAVTEX receiver receives message and shows or print.Yet due to radio interference and the reason such as radio signal is weak, receiver possibly cannot correctly receive or the partial character of the message of decoding, and has produced not clear character, expression mode is " * " character in message.According to technical protocol, the bit error rate should abandon over 30% message, waits for that receive next time.
In order to improve the readability of the NAVTEX message that has not clear character, can apply the message word modification method based on dictionary of the present invention, with the conventional dictionary of NAVTEX message, revise word or the professional abbreviation that has not clear character.And the phrase storehouse that can also utilize the commonly used collocation of word in NAVTEX message to form, to there being a plurality of correction words of ambiguity to judge selection.
The dictionary here and phrase storehouse are the special uses of NAVTEX message.First, more than 5400 Nautical English specialty common wordss included in dictionary; Secondly, conventional in NAVTEX message more than 270 abbreviation vocabulary and regular collocation have been set up.And dictionary and phrase storehouse are extendible, in receiving the process of NAVTEX message, constantly increase the word and the phrase that in new NAVTEX message, use, improve the success ratio of revising containing not clear characters word.
As shown in Figure 1, be the process flow diagram of the embodiment of the message word modification method based on dictionary of the present invention.One of its implementation is to use computing machine to add to write computer program, and the NAVTEX message receiving is operated.Refer to Fig. 1, details are as follows for each step of the method for the present embodiment.
Step S05: set up dictionary and phrase storehouse in storer.Dictionary is included and is comprised more than 270 conventional abbreviation vocabulary in more than 5400 Nautical English specialty common wordss and NAVTEX message.Dictionary can be set up by character number classification, or press alphabet sequence classification foundation again after character number classification.The conventional regular collocation in NAVTEX message of the vocabulary that comprises in dictionary is included in phrase storehouse.Go to step S10.
Step S10: receive entire chapter message.The message word of each reception is failed to understand to character proceeds to respectively step S20 or step S40 according to having or not.
Step S20: if receive word not containing not clear character, search in dictionary and have or not identical therewith word.Go to step S25.
Step S25: if without same word, in dictionary identical characters number word class in add this reception word; If have, do not add.Go to step S30.
Step S30: if upper one receives word not containing not clear character, form phrase with this reception word, search in phrase storehouse and have or not identical therewith phrase.Go to step S35.
Step S35: if without identical phrase, add the phrase of this composition in phrase storehouse; If have, do not add.Go to step S90.
Step S40: if receive word containing not clear character, search in the word of identical characters number in dictionary and this reception word other (comprise character content, character number, put in order) identical words except not clear character.According to result, go to step respectively S50, step S60 or step S70.
Step S50: if only find a word, using the word finding as revising word, and indicate the character (being former not clear character) being corrected.Go to step S90.
Step S60: if find a plurality of words, the word each being found respectively with message in before and after the word of collocation form phrase, in phrase storehouse, search respectively and have or not identical phrase.According to result, go to step respectively S63 or step S66.
Step S63: if only find a phrase, using the word wherein finding as revising word, and indicate the character (being former not clear character) being corrected.Go to step S90.
Step S66: if can not find or find a plurality of phrases, revise unsuccessfully, replace receiving the not clear character in word with " * ".Go to step S90.
Step S70: if can not find word, revise unsuccessfully, replace receiving the not clear character in word with " * ".Go to step S90.
Step S90: carry out subsequent operation.
In the present embodiment step S25, in the word of reception, all characters all clearly and are not included in message dictionary, and this situation has only occurred once just this word to be added into message dictionary.If also will consider that whether this word is conventional, can increase frequency condition, for example, in receiving 100 messages, this situation has occurred 2 times, this word is added into message dictionary.
In like manner, in the present embodiment step S35, the phrase that the word of adjacent two receptions forms is not included in phrase storehouse, and message phrase storehouse has only occurred once just this phrase to be added into this situation.If also will consider that this phrase is commonly used collocation, can increase frequency condition, for example, in receiving 1000 messages, this situation has occurred 3 times, this phrase is added into message phrase storehouse.
In the present embodiment step S30, the condition that a upper word and this reception word form phrase is, a upper word is containing not clear character, and the phrase storehouse confidence level of expansion is higher like this.Can certainly relax group entry part, go up a word not containing not clear character or revised word, can comparatively fast expand phrase storehouse like this, improve and revise efficiency.
In the present embodiment step S50 and S63, to revising word, indicate the character at its place, former not clear character place, this is for when showing or print message, to revising the character being corrected in word, can or add mark etc. by the font different from other characters, font, color and represent, for example, capital and small letter upset, italic, redness or underline.Can remind like this reader to revised message correctly judgement and understand.
In the present embodiment step S66, if find a plurality of phrases, also have a kind of mode can find out correction word, that is: in step S35, if find, have identical phrase in phrase storehouse, to the conventional frequency eigenwert of this phrase cumulative 1; In step S66, if find a plurality of phrases, select the phrase of conventional frequency eigenwert maximum, using the word wherein finding as revising word, and indicate the character (being former not clear character) being corrected.Go to step S90.
The method that the present embodiment is searched by this dictionary analysis, can revise the message word containing not clear character of part, thereby improves the readability of NAVTEX message, reduces dropping packets, guarantees the promptness of message transmissions.The use of this technology, makes the NAVTEX receiving ability under mal-condition obtain larger raising.
The embodiment of the message word update the system based on dictionary
Whole world navigation warning receiver business (being designated hereinafter simply as NAVTEX) is an important component part of GMDSS (being designated hereinafter simply as GMDSS).Radio station utilizes the CFEC mode of arrowband printergram (being designated hereinafter simply as NBDP) to use English to broadcast NAVTEX message in 518KHz frequency on the bank.Boat-carrying NAVTEX receiver receives message and shows or print.Yet due to radio interference and the reason such as radio signal is weak, receiver possibly cannot correctly receive or the partial character of the message of decoding, and has produced not clear character, expression mode is " * " character in message.According to technical protocol, the bit error rate should abandon over 30% message, waits for that receive next time.
In order to improve the readability of the NAVTEX message that has not clear character, can apply the message word modification method based on dictionary of the present invention, with the conventional dictionary of NAVTEX message, revise word or the professional abbreviation that has not clear character.And the phrase storehouse that can also utilize the commonly used collocation of word in NAVTEX message to form, to there being a plurality of correction words of ambiguity to judge selection.
The dictionary here and phrase storehouse are the special uses of NAVTEX message.First, more than 5400 Nautical English specialty common wordss included in dictionary; Secondly, conventional in NAVTEX message more than 270 abbreviation vocabulary and regular collocation have been set up.And dictionary and phrase storehouse are extendible, in receiving the process of NAVTEX message, constantly increase the word and the phrase that in new NAVTEX message, use, improve the success ratio of revising containing not clear characters word.
As shown in Figure 2, be the structural drawing of the embodiment of the message word update the system based on dictionary of the present invention.Its implementation comprises that using computing machine to add writes computer program, or the circuit that comprises programmable logic device (PLD), special IC form disposal system etc., and the NAVTEX message receiving is operated.Refer to Fig. 2, details are as follows for the modules of the system of the present embodiment.
Module 10: dictionary module, function comprises:
Function 1, sets up dictionary and phrase storehouse.Dictionary is included and is comprised more than 270 conventional abbreviation vocabulary in more than 5400 Nautical English specialty common wordss and NAVTEX message.Dictionary can be set up by character number classification, or press alphabet sequence classification foundation again after character number classification.The conventional regular collocation in NAVTEX message of the vocabulary that comprises in dictionary is included in phrase storehouse;
Function 2, if receive word not containing not clear character and exclude dictionary, is added into dictionary;
Function 3, if the word of the word receiving in message and front and back thereof collocation containing not clear character, and the phrase of collocation excludes phrase storehouse, this phrase is added into phrase storehouse.
Module 20: correcting module, function comprises:
Function 1 if receive word containing not clear character, is searched in the word of identical characters number and this reception word other (comprise character content, character number, put in order) identical words except not clear character in dictionary;
Function 2, if can not find word, revises unsuccessfully, with " * ", replaces receiving the not clear character in word;
Function 3, if only find a word, using the word finding as revising word, and indicates the character (being former not clear character) being corrected;
Function 4, if find a plurality of words, the word each being found respectively with message in before and after the word of collocation form phrase, in phrase storehouse, search respectively and have or not identical phrase;
Function 5, if can not find or find a plurality of phrases, revises unsuccessfully, with " * ", replaces receiving the not clear character in word;
Function 6, if only find a phrase, using the word wherein finding as revising word, and indicates the character (being former not clear character) being corrected.
In the present embodiment dictionary functions of modules 2, in the word of reception, all characters all clearly and are not included in message dictionary, and this situation has only occurred once just this word to be added into message dictionary.If also will consider that whether this word is conventional, can increase frequency condition, for example, in receiving 100 messages, this situation has occurred 2 times, this word is added into message dictionary.
In like manner, in the present embodiment dictionary functions of modules 3, if the word of the word receiving in message and forward and backward collocation thereof containing not clear character, and the phrase of collocation excludes phrase storehouse, only there is once just this phrase to be added into message phrase storehouse in this situation.If also will consider that this phrase is commonly used collocation, can increase frequency condition, for example, in receiving 1000 messages, this situation has occurred 3 times, this phrase is added into message phrase storehouse.
In the present embodiment dictionary functions of modules 3, the condition that forward and backward word and this reception word form phrase is, forward and backward word is containing not clear character, and the phrase storehouse confidence level of expansion is higher like this.Can certainly relax group entry part, forward and backward word, not containing failing to understand character or having revised word, can comparatively fast expand phrase storehouse like this, improves and revises efficiency.
In the present embodiment correcting module function 3 and function 6, to revising word, indicate the character at its place, former not clear character place, this is for when showing or print message, to revising the character being corrected in word, can or add mark etc. by the font different from other characters, font, color and represent, for example, capital and small letter upset, italic, redness or underline.Can remind like this reader to revised message correctly judgement and understand.
In the present embodiment correcting module function 5, if find a plurality of phrases, also have a kind of mode can find out correction word, that is: in dictionary functions of modules 3, if phrase storehouse listed in the phrase of collocation, to the conventional frequency eigenwert of this phrase cumulative 1; In correcting module function 5, if find a plurality of phrases, select the phrase of conventional frequency eigenwert maximum, using the word wherein finding as revising word, and indicate the character (being former not clear character) being corrected.
The system that the present embodiment is searched by this dictionary analysis, can revise the message word containing not clear character of part, thereby improves the readability of NAVTEX message, reduces dropping packets, guarantees the promptness of message transmissions.The use of this technology, makes the NAVTEX receiving ability under mal-condition obtain larger raising.