Summary of the invention
The method and apparatus that the application provides a kind of handwriting characteristic to extract, to solve the low problem of hand script Chinese input equipment handwriting data feature recognition accuracy.
In order to address the above problem, the application discloses a kind of method that handwriting characteristic extracts, and comprising:
The time series of writing according to handwriting data gathers handwriting data and carries out pre-service, obtains pretreated handwriting data;
According to described time series, described pretreated handwriting data is carried out to uniformly-spaced segmentation, obtain a plurality of stroke vector paragraphs;
Obtain the on-line diagnostic of described a plurality of stroke vector paragraphs, described on-line diagnostic comprises angle and the centre coordinate of described a plurality of stroke vector paragraphs;
According to described pretreated handwriting data, obtain the center of gravity of described handwriting data, according to described center of gravity, extract the off-line diagnostic of described pretreated handwriting data;
According to described on-line diagnostic and off-line diagnostic, carry out numerical value normalized, the feature using the result of described numerical value normalized as the described handwriting data gathering.
Preferably, described time series of writing according to handwriting data gathers handwriting data and carries out pre-service, and the step that obtains pretreated handwriting data comprises:
The handwriting data of collection is carried out after linear dimension Regularization according to the time series of writing, obtain each natural stroke segment length;
According to described each the natural stroke segment length obtaining, obtain the length of the handwriting data that formed by described each natural stroke section.
Preferably, the angle of described a plurality of stroke vector paragraphs comprises: the angle between the stroke vector paragraph that the angle of the angle of each stroke vector paragraph and X-axis positive dirction, each stroke vector paragraph and Y-axis positive dirction and each stroke vector paragraph are adjacent.
Preferably, it is characterized in that, described off-line diagnostic comprises projection off-line diagnostic or grid off-line diagnostic or fan-shaped off-line diagnostic or profile off-line diagnostic.
Preferably, when described off-line diagnostic is described projection off-line diagnostic, the step that the described center of gravity of described foundation is extracted the off-line diagnostic of described pretreated handwriting data comprises:
The center of gravity of described handwriting data of take is carried out divided in horizontal direction to described pretreated handwriting data and vertical direction is cut apart as cut-point, by described pretreated handwriting data He Xia subregion in subregion from divided in horizontal direction is, from vertical direction, be divided into left half region and right half region, the centre coordinate that scans respectively each stroke vector paragraph number that subregion, lower subregion, left half region and right half region occur on described;
When described off-line diagnostic is described grid off-line diagnostic, the step that the described center of gravity of described foundation is extracted the off-line diagnostic of described pretreated handwriting data comprises:
Eight directions of definition two dimensional surface, East, West, South, North, the southeast, northeast, southwest, northwest;
The center of gravity of described handwriting data of take is carried out divided in horizontal direction to described pretreated handwriting data and vertical direction is cut apart as cut-point, by described pretreated handwriting data grid and lower grid from divided in horizontal direction is, from vertical direction, be divided into left grid and right grid, the number occurring in eight directions of the centre coordinate that scans respectively each stroke vector paragraph grid, lower grid, left grid and right grid on described;
When described off-line diagnostic is described fan-shaped off-line diagnostic, the step that the described center of gravity of described foundation is extracted the fan-shaped off-line diagnostic of described pretreated handwriting data comprises:
Eight directions of definition two dimensional surface, East, West, South, North, the southeast, northeast, southwest, northwest;
The center of gravity of described handwriting data of take is the center of circle, and described pretreated handwriting data is divided into a plurality of sector regions, scans respectively the number that the centre coordinate of each stroke vector paragraph occurs in eight directions;
When described off-line diagnostic is described profile off-line diagnostic, the step that the described center of gravity of described foundation is extracted the profile off-line diagnostic of described pretreated handwriting data comprises:
Eight directions of definition two dimensional surface, East, West, South, North, the southeast, northeast, southwest, northwest;
The center of gravity of described handwriting data of take is end point, scans respectively the number that the centre coordinate of each stroke vector paragraph occurs in eight directions.
In order to address the above problem, disclosed herein as well is the device that a kind of handwriting characteristic extracts, comprising:
Acquisition module, gathers handwriting data and carries out pre-service for the time series of writing according to handwriting data, obtains pretreated handwriting data;
Cut apart module, for according to described time series, described pretreated handwriting data being carried out to uniformly-spaced segmentation, obtain a plurality of stroke vector paragraphs;
Computing module, for obtaining the on-line diagnostic of described a plurality of stroke vector paragraphs, described on-line diagnostic comprises angle and the centre coordinate of described a plurality of stroke vector paragraphs;
Extraction module, for obtain the center of gravity of described handwriting data according to described pretreated handwriting data, extracts the off-line diagnostic of described pretreated handwriting data according to described center of gravity;
Processing module, for according to described on-line diagnostic and off-line diagnostic, carries out numerical value normalized, the feature using the result of described numerical value normalized as the described handwriting data gathering.
Preferably, described acquisition module comprises: linear gauge mould preparation piece, for the handwriting data of collection is carried out after linear dimension Regularization according to the time series of writing, obtains each natural stroke segment length;
Length acquisition module, for described each the natural stroke segment length according to obtaining, obtains the length of the handwriting data that is comprised of described each natural stroke section.
Preferably, the angle of described a plurality of stroke vector paragraphs comprises: the angle between the stroke vector paragraph that the angle of the angle of each stroke vector paragraph and X-axis positive dirction, each stroke vector paragraph and Y-axis positive dirction and each stroke vector paragraph are adjacent.
Preferably, described off-line diagnostic comprises projection off-line diagnostic or grid off-line diagnostic or fan-shaped off-line diagnostic or profile off-line diagnostic.
Preferably, when described off-line diagnostic is described projection off-line diagnostic, described extraction module is when extracting the off-line diagnostic of described pretreated handwriting data according to described center of gravity:
The center of gravity of described handwriting data of take is carried out divided in horizontal direction to described pretreated handwriting data and vertical direction is cut apart as cut-point, by described pretreated handwriting data He Xia subregion in subregion from divided in horizontal direction is, from vertical direction, be divided into left half region and right half region, the centre coordinate that scans respectively each stroke vector paragraph number that subregion, lower subregion, left half region and right half region occur on described;
When described off-line diagnostic is described grid off-line diagnostic, described extraction module is when extracting the off-line diagnostic of described pretreated handwriting data according to described center of gravity:
Eight directions of definition two dimensional surface, East, West, South, North, the southeast, northeast, southwest, northwest;
The center of gravity of described handwriting data of take is carried out divided in horizontal direction to described pretreated handwriting data and vertical direction is cut apart as cut-point, by described pretreated handwriting data grid and lower grid from divided in horizontal direction is, from vertical direction, be divided into left grid and right grid, the number occurring in eight directions of the centre coordinate that scans respectively each stroke vector paragraph grid, lower grid, left grid and right grid on described;
When described off-line diagnostic is described fan-shaped off-line diagnostic, described extraction module is when extracting the fan-shaped off-line diagnostic of described pretreated handwriting data according to described center of gravity:
Eight directions of definition two dimensional surface, East, West, South, North, the southeast, northeast, southwest, northwest;
The center of gravity of described handwriting data of take is the center of circle, and described pretreated handwriting data is divided into a plurality of sector regions, scans respectively the number that the centre coordinate of each stroke vector paragraph occurs in eight directions;
When described off-line diagnostic is described profile off-line diagnostic, described extraction module is when extracting the profile off-line diagnostic of described pretreated handwriting data according to described center of gravity:
Eight directions of definition two dimensional surface, East, West, South, North, the southeast, northeast, southwest, northwest;
The center of gravity of described handwriting data of take is end point, scans respectively the number that the centre coordinate of each stroke vector paragraph occurs in eight directions.
Compared with prior art, the application comprises following advantage:
First, the application carries out uniformly-spaced segmentation according to time series to pretreated handwriting data, obtains the on-line diagnostic of a plurality of stroke vector paragraphs, and described on-line diagnostic comprises angle and the centre coordinate of a plurality of stroke vector paragraphs.By calculating angle and the centre coordinate of a plurality of stroke vector paragraphs, thereby the feature extraction that makes handwriting data has covered local characteristics and the global property of handwriting data, avoid only considering in existing method the position of handwriting data unique point, thereby caused the incomplete problem of handwriting data feature extraction.
Secondly, the application is by obtaining the center of gravity of handwriting data to pretreated handwriting data, and carry out symmetrical projection according to center of gravity, then extract local feature and the global property of the handwriting data of adjacent area, thus the too mechanical and not good problem of deformation adaptability while having avoided wide and contour mode to extract handwriting data feature.
Again, the application, by the on-line diagnostic extracting and the combination of off-line diagnostic, has obtained effective handwriting data feature, and then has guaranteed the reliability of follow-up sorter training, and significantly improved the classify accuracy of sorter, finally improved the recognition accuracy of hand script Chinese input equipment.
Embodiment
For the application's above-mentioned purpose, feature and advantage can be become apparent more, below in conjunction with the drawings and specific embodiments, the application is described in further detail.
With reference to Fig. 1, show the method that a kind of handwriting characteristic in the embodiment of the present application one extracts, comprising:
Step 101: the time series of writing according to handwriting data gathers handwriting data and carries out pre-service, obtains pretreated handwriting data.
Wherein, the time series that handwriting data is write is obtained by collecting device.
The handwriting data that a kind of handwriting data gathers after collecting device as shown in Figure 2, wherein, handwriting data is after collecting device, collect a series of data coordinates point, data coordinates point comprises abscissa value and the ordinate value of each point, and, the end mark of each stroke and the end mark of whole word.For example: the data coordinates point collecting comprises (X0, Y0), (X1, Y1), (X2, Y2) ... (Xn, Yn).The essential characteristic that includes handwriting data in a series of data coordinates points that collect, can process according to these features data of identifying the handwriting, and then extracts handwriting characteristic.
Step 102: according to described time series, described pretreated handwriting data is carried out to uniformly-spaced segmentation, obtain a plurality of stroke vector paragraphs.
According to collecting device, the seasonal effect in time series stroke of user writing is carried out accurately to uniformly-spaced segmentation, the handwriting data after segmentation is stroke vector paragraph.
Step 103: obtain the on-line diagnostic of described a plurality of stroke vector paragraphs, described on-line diagnostic comprises angle and the centre coordinate of described a plurality of stroke vector paragraphs.
The centre coordinate of stroke vector paragraph can obtain by following formula:
Wherein, Xifor the origin coordinates of stroke vector paragraph, Xi+1termination coordinate for stroke vector paragraph.
Step 104: obtain the center of gravity of described handwriting data according to described pretreated handwriting data, extract the off-line diagnostic of described pretreated handwriting data according to described center of gravity.
It should be noted that, above-mentioned steps 103 and 104 is not limited to said sequence when reality is carried out, and also can step 104 carry out before step 103, can also the two executed in parallel.
Step 105: according to described on-line diagnostic and off-line diagnostic, carry out numerical value normalized, the feature using the result of described numerical value normalized as the described handwriting data gathering.
Wherein, the scope of the result of numerical value normalized can suitably be set according to actual conditions by those skilled in the art, is preferably 0-8.
Feature refers to the special nature that a certain material possesses self, is basic sign and the sign that is different from other materials.For the handwriting characteristic of hand script Chinese input equipment, refer to handwriting mode and characteristic in shape.
The result of numerical value normalized can be carried out to the identification of word in the following manner as the feature of the handwriting data gathering.
First, by the feature of handwriting data and the template comparison of character library of extracting, the word that the characteristic matching rate of the handwriting data with extracting is large is listed, for user, selected, user selects, after correct input characters, to complete the identification of handwriting.
Wherein, the process of establishing of the template of character library comprises: to word known in dictionary, by trainer's handwriting input, set up the corresponding relation of dictionary Chinese word and handwriting, the word of trainer's handwriting input is as the template of known word.Same word can be by a plurality of trainer's handwriting inputs, and repeatedly, thereby a word can corresponding a plurality of hand-written templates.When coupling, can be by a plurality of template matches of the word of handwriting input and a plurality of words.
It should be noted that, the application has only enumerated and a kind ofly the feature of the handwriting data of extraction is carried out to word has known method for distinguishing, can adopt any mode in prior art to carry out word identification to the feature of extracted handwriting data, and the application is not limited
By the present embodiment, first, the application carries out uniformly-spaced segmentation according to time series to pretreated handwriting data, obtains the on-line diagnostic of a plurality of stroke vector paragraphs, and described on-line diagnostic comprises angle and the centre coordinate of a plurality of stroke vector paragraphs.By calculating angle and the centre coordinate of a plurality of stroke vector paragraphs, thereby the feature extraction that makes handwriting data has covered local characteristics and the global property of handwriting data, avoid only considering in existing method the position of handwriting data unique point, thereby caused the incomplete problem of handwriting data feature extraction.
Secondly, the application is by obtaining the center of gravity of handwriting data to pretreated handwriting data, and carry out symmetrical projection according to center of gravity, then extract local feature and the global property of the handwriting data of adjacent area, thus the too mechanical and not good problem of deformation adaptability while having avoided wide and contour mode to carry out the extraction of handwriting characteristic.
Again, the application, by the on-line diagnostic extracting and the combination of off-line diagnostic, has obtained effective handwriting data feature, and then has guaranteed the reliability of follow-up sorter training, and significantly improved the classify accuracy of sorter, finally improved the recognition accuracy of hand script Chinese input equipment.
With reference to Fig. 3, show the method that a kind of handwriting characteristic in the embodiment of the present application two extracts.
In the present embodiment, a kind of method that handwriting characteristic extracts, comprising:
Step 301: the time series of writing according to handwriting data gathers handwriting data and carries out pre-service, obtains pretreated handwriting data.
In the present embodiment, by collecting device, collect a series of coordinate points of handwriting data.Wherein, coordinate points comprises abscissa value and the ordinate value of each coordinate points, and the starting point coordinate of each stroke, the termination coordinate of each stroke, the end coordinate of each stroke and the end coordinate of whole word.
After collecting handwriting data, the time series of writing according to handwriting data gathers handwriting data and carries out pre-service, obtains pretreated handwriting data.
Step 302: the handwriting data of collection is carried out to linear dimension Regularization according to the time series of writing, and the regular size to 96*96, then obtains each natural stroke segment length.
Linear dimension Regularization refers to the size of unifying the handwriting data of collection by stretching, adopts the conversion such as rotation, translation to change the position of the handwriting data gathering.
Nature stroke section refer to user in writing process horizontal, vertical, skim, right-falling stroke.
By following formula, obtain each natural stroke segment length:
Wherein, Xifor the starting point abscissa value of natural stroke section, Xi+1for the termination abscissa value of natural stroke section, Yifor the starting point ordinate value of natural stroke section, Yi+1ordinate value for the termination coordinate of natural stroke section.
Can obtain each natural stroke segment length according to (1) formula, according to each the natural stroke segment length obtaining, obtain the length of the handwriting data that formed by each natural stroke section.
Can obtain by following formula the length of handwriting data:
Wherein, n is the number of point, lifor natural stroke segment length.
It should be noted that, the scope of linear dimension Regularization can suitably be set according to actual conditions by those skilled in the art, is preferably the regular size to 96*96.
Step 303: according to described time series, described pretreated handwriting data is carried out to uniformly-spaced segmentation, obtain a plurality of stroke vector paragraphs;
Can obtain stroke vector paragraph by following formula:
Wherein, lvfor stroke vector paragraph, ndfor the dimension of proper vector, ndvalue arbitrarily, as long as length that can decile handwriting data.
Step 304: obtain the on-line diagnostic of described a plurality of stroke vector paragraphs, described on-line diagnostic comprises angle and the centre coordinate of described a plurality of stroke vector paragraphs; The angle of described a plurality of stroke vector paragraphs comprises: the angle between the stroke vector paragraph that the angle of the angle of each stroke vector paragraph and X-axis positive dirction, each stroke vector paragraph and Y-axis positive dirction and each stroke vector paragraph are adjacent.
Wherein, the angular range of the angular range of each stroke vector paragraph and X-axis positive dirction and each stroke vector paragraph and Y-axis positive dirction is 0-180 degree.
The angle of a kind of stroke vector paragraph and adjacent stroke vector paragraph as shown in Figure 4.Wherein, stroke vector paragraph 0 is adjacent vector with stroke vector paragraph 1, and the angle between stroke vector paragraph is obtained by stroke vector paragraph 0 and 1 calculating of stroke vector paragraph, and the scope of the angle between the stroke vector paragraph that stroke vector paragraph is adjacent is 0-180 degree.
Step 305: obtain the center of gravity of described handwriting data according to described pretreated handwriting data, extract the off-line diagnostic of described pretreated handwriting data according to described center of gravity;
Calculate the centre coordinate of each the natural stroke segment length after linear dimension Regularization, by following formula, calculate the centre coordinate of nature stroke segment length:
Xifor the initial abscissa value of natural stroke section, Xi+1for natural stroke section is ended abscissa value; Yifor the starting point ordinate value of natural stroke section, Yi+1ordinate value for the termination coordinate of natural stroke section.
According to formula (1) and formula (4), obtain the center of gravity of handwriting data, the center of gravity formula of handwriting data:
Wherein, each meaning of parameters in above-mentioned center of gravity formula is identical with formula (1) and (4).
Preferably, described off-line diagnostic comprises projection off-line diagnostic or grid off-line diagnostic or fan-shaped off-line diagnostic or profile off-line diagnostic.
Preferably, when described off-line diagnostic is described projection off-line diagnostic, the step that the described center of gravity of described foundation is extracted the off-line diagnostic of described pretreated handwriting data comprises:
The center of gravity of described handwriting data of take is carried out divided in horizontal direction to described pretreated handwriting data and vertical direction is cut apart as cut-point, by described pretreated handwriting data He Xia subregion in subregion from divided in horizontal direction is, from vertical direction, be divided into left half region and right half region.
The projection off-line diagnostic that the center of gravity of handwriting data of take is cut-point as shown in Figure 5.Wherein, center of gravity represents with solid round dot, and the region after divided represents with grid.Then the centre coordinate that each stroke vector paragraph is scanned respectively in the upper subregion after cutting apart, lower subregion, left half region and the right half region number that subregion, lower subregion, left half region and right half region occur on described.Wherein, be divided into handwriting data in He Xia subregion, subregion and scan according to mode from left to right or mode from right to left, be divided into handwriting data in left half region and right half region and scan according to mode from top to bottom or mode from top to bottom.
When described off-line diagnostic is described grid off-line diagnostic, the step that the described center of gravity of described foundation is extracted the off-line diagnostic of described pretreated handwriting data comprises:
Eight directions of definition two dimensional surface, East, West, South, North, the southeast, northeast, southwest, northwest, eight concrete directions are as shown in Figure 6.
The center of gravity of described handwriting data of take is carried out divided in horizontal direction to described pretreated handwriting data and vertical direction is cut apart as cut-point, by described pretreated handwriting data grid and lower grid from divided in horizontal direction is, from vertical direction, be divided into left grid and right grid.Wherein, the lattice number of upper grid and lower grid is consistent, and the lattice number of left grid and right grid is also consistent; And upper grid height is consistent with lower grid height, left mesh width is also consistent with right mesh width.The number occurring in eight directions of the centre coordinate that then scans respectively each stroke vector paragraph in the upper grid after cutting apart, lower grid, left grid and right grid grid, lower grid, left grid and right grid on described.
When described off-line diagnostic is described fan-shaped off-line diagnostic, the step that the described center of gravity of described foundation is extracted the fan-shaped off-line diagnostic of described pretreated handwriting data comprises:
Eight directions of definition two dimensional surface, East, West, South, North, the southeast, northeast, southwest, northwest;
The center of gravity of described handwriting data of take is the center of circle, and described pretreated handwriting data is divided into a plurality of sector regions.For example: take center of gravity as the center of circle, center of gravity represents (as the black circle of circle centre position in Fig. 7) with solid round dot, and handwriting data is divided into 16 sector regions, as shown in Figure 7.Scanning is divided into the number that the centre coordinate of each stroke vector paragraph in 16 sector regions occurs in eight directions respectively.
When described off-line diagnostic is described profile off-line diagnostic, the step that the described center of gravity of described foundation is extracted the profile off-line diagnostic of described pretreated handwriting data comprises:
Eight directions of definition two dimensional surface, East, West, South, North, the southeast, northeast, southwest, northwest;
The center of gravity of described handwriting data of take is end point, scans respectively the number that the centre coordinate of each stroke vector paragraph occurs in eight directions, wherein, can scan eastwards from east to west or from west, and concrete scan mode the application is not limited.
It should be noted that, the center of gravity of handwriting data is the center of circle, divides a plurality of sector regions, can according to actual conditions, suitably divide sector region by those skilled in the art, is preferably 16 sector regions.
Step 306: according to described on-line diagnostic and off-line diagnostic, carry out numerical value normalized, the feature using the result of described numerical value normalized as the described handwriting data gathering.
According to described on-line diagnostic and off-line diagnostic, carry out numerical value normalized, the feature using the result of described numerical value normalized as the described handwriting data gathering.
Explanation based on said method embodiment, the application also provides the embodiment of corresponding a kind of handwriting characteristic extraction element, realizes the content described in said method embodiment.
Referring to Fig. 8, show the structured flowchart of a kind of handwriting characteristic extraction element in the embodiment of the present application four, specifically can comprise: acquisition module, for the time series of writing according to handwriting data, gather handwriting data and carry out pre-service, obtain pretreated handwriting data.
Cut apart module, for according to described time series, described pretreated handwriting data being carried out to uniformly-spaced segmentation, obtain a plurality of stroke vector paragraphs.
Computing module, for obtaining the on-line diagnostic of described a plurality of stroke vector paragraphs, described on-line diagnostic comprises angle and the centre coordinate of described a plurality of stroke vector paragraphs.
Extraction module, for obtain the center of gravity of described handwriting data according to described pretreated handwriting data, extracts the off-line diagnostic of described pretreated handwriting data according to described center of gravity.
Processing module, for according to described on-line diagnostic and off-line diagnostic, carries out numerical value normalized, the feature using the result of described numerical value normalized as the described handwriting data gathering.
Preferably, described acquisition module comprises: linear gauge mould preparation piece, for the handwriting data of collection is carried out after linear dimension Regularization according to the time series of writing, obtains each natural stroke segment length.
Length acquisition module, for described each the natural stroke segment length according to obtaining, obtains the length of the handwriting data that is comprised of described each natural stroke section.
Preferably, the angle of described a plurality of stroke vector paragraphs comprises: the angle of the angle of each stroke vector paragraph and X-axis positive dirction, each stroke vector paragraph and Y-axis positive dirction and, the angle between the stroke vector paragraph that each stroke vector paragraph is adjacent.
Preferably, described off-line diagnostic comprises projection off-line diagnostic or grid off-line diagnostic or fan-shaped off-line diagnostic or profile off-line diagnostic.
Preferably, when described off-line diagnostic is described projection off-line diagnostic, described extraction module is when extracting the off-line diagnostic of described pretreated handwriting data according to described center of gravity:
The center of gravity of described handwriting data of take is carried out divided in horizontal direction to described pretreated handwriting data and vertical direction is cut apart as cut-point, by described pretreated handwriting data He Xia subregion in subregion from divided in horizontal direction is, from vertical direction, be divided into left half region and right half region, the centre coordinate that scans respectively each stroke vector paragraph number that subregion, lower subregion, left half region and right half region occur on described.
When described off-line diagnostic is described grid off-line diagnostic, described extraction module is when extracting the off-line diagnostic of described pretreated handwriting data according to described center of gravity:
Eight directions of definition two dimensional surface, East, West, South, North, the southeast, northeast, southwest, northwest;
The center of gravity of described handwriting data of take is carried out divided in horizontal direction to described pretreated handwriting data and vertical direction is cut apart as cut-point, by described pretreated handwriting data grid and lower grid from divided in horizontal direction is, from vertical direction, be divided into left grid and right grid, the number occurring in eight directions of the centre coordinate that scans respectively each stroke vector paragraph grid, lower grid, left grid and right grid on described.
When described off-line diagnostic is described fan-shaped off-line diagnostic, described extraction module is when extracting the fan-shaped off-line diagnostic of described pretreated handwriting data according to described center of gravity:
Eight directions of definition two dimensional surface, East, West, South, North, the southeast, northeast, southwest, northwest;
The center of gravity of described handwriting data of take is the center of circle, and described pretreated handwriting data is divided into a plurality of sector regions, scans respectively the number that the centre coordinate of each stroke vector paragraph occurs in eight directions.
When described off-line diagnostic is described profile off-line diagnostic, described extraction module is when extracting the profile off-line diagnostic of described pretreated handwriting data according to described center of gravity:
Eight directions of definition two dimensional surface, East, West, South, North, the southeast, northeast, southwest, northwest;
The center of gravity of described handwriting data of take is end point, scans respectively the number that the centre coordinate of each stroke vector paragraph occurs in eight directions.
In sum, a kind of handwriting characteristic extraction element of the embodiment of the present application mainly comprises following advantage:
First, the application carries out uniformly-spaced segmentation according to time series to pretreated handwriting data, obtains the on-line diagnostic of a plurality of stroke vector paragraphs, and described on-line diagnostic comprises angle and the centre coordinate of a plurality of stroke vector paragraphs.By calculating angle and the centre coordinate of a plurality of stroke vector paragraphs, thereby the feature extraction that makes handwriting data has covered local characteristics and the global property of handwriting data, avoid only considering in existing method the position of handwriting data unique point, thereby caused the incomplete problem of handwriting data feature extraction.
Secondly, the application is by obtaining the center of gravity of handwriting data to pretreated handwriting data, and carry out symmetrical projection according to center of gravity, then extract local feature and the global property of the handwriting data of adjacent area, thus the too mechanical and not good problem of deformation adaptability while having avoided wide and contour mode to extract handwriting data feature.
Again, the application, by the on-line diagnostic extracting and the combination of off-line diagnostic, has obtained effective handwriting data feature, and then has guaranteed the reliability of follow-up sorter training, and significantly improved the classify accuracy of sorter, finally improved the recognition accuracy of hand script Chinese input equipment.
For device embodiment, because it is substantially similar to embodiment of the method, so description is fairly simple, relevant part is referring to the part explanation of embodiment of the method.
Each embodiment in this instructions all adopts the mode of going forward one by one to describe, and each embodiment stresses is the difference with other embodiment, between each embodiment identical similar part mutually referring to.
The method and apparatus that a kind of handwriting characteristic above the application being provided extracts, be described in detail, applied specific case herein the application's principle and embodiment are set forth, the explanation of above embodiment is just for helping to understand the application's method and core concept thereof; Meanwhile, for one of ordinary skill in the art, the thought according to the application, all will change in specific embodiments and applications, and in sum, this description should not be construed as the restriction to the application.