Disclosure of Invention
The invention aims to provide an end-to-end frame number identification system based on deep learning, so as to solve the technical problems of low recognition efficiency and frequent missed and wrong detections in the conventional manual recognition mode.
To achieve the above purpose, the invention adopts the following technical scheme:
an end-to-end frame number identification system based on deep learning is provided for automatically identifying the frame number, and comprises the following components:
the image input module is used for inputting an image containing the whole frame number character string;
the image feature extraction module is connected with the image input module and is used for extracting the image features corresponding to the image, obtaining a feature map corresponding to the image, and converting the feature map into a corresponding feature vector;
and the character recognition module is connected with the image feature extraction module and is used for recognizing the character type of each frame number character in the frame number character string in the image according to the feature vector and based on a preset character recognition model, and finally obtaining a character recognition result of the frame number character string.
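As a non-limiting sketch of how the above modules could be composed, assuming a PyTorch-style implementation (all class and variable names below are illustrative, not terms taken from this patent):

import torch
import torch.nn as nn

class FrameNumberRecognizer(nn.Module):
    # End-to-end recognizer: an input image passes through the feature extraction module,
    # and the resulting feature vector is fed to one character recognition unit per position.
    def __init__(self, backbone: nn.Module, heads: nn.ModuleList):
        super().__init__()
        self.backbone = backbone   # image feature extraction module
        self.heads = heads         # character recognition module (one unit per character position)

    def forward(self, image: torch.Tensor) -> list:
        feature_vector = self.backbone(image)                  # feature map -> feature vector
        return [head(feature_vector) for head in self.heads]   # one prediction vector per position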
As a preferred solution of the present invention, the end-to-end frame number recognition system performs convolution operations on the image through a convolutional neural network to obtain the feature map corresponding to the image.
As a preferred solution of the present invention, the end-to-end frame number recognition system converts the feature map corresponding to the image into the feature vector through the convolutional neural network.
As a preferable scheme of the present invention, the character recognition module includes a plurality of character recognition units, each of the character recognition units is respectively used for recognizing a character type corresponding to the frame number character at one of the designated character positions in the frame number character string,
each character recognition unit specifically comprises:
the character feature positioning subunit is used for positioning, in the feature vector, the component features corresponding to the frame number character at the designated character position, and obtaining a positioning result;
the prediction vector generation subunit is connected with the character feature positioning subunit and is used for converting the feature vector into a corresponding prediction vector according to the positioning result;
the prediction probability calculating subunit is connected with the prediction vector generating subunit and is used for calculating component values corresponding to all the components in the prediction vector based on the character recognition model;
and the character type identification subunit is connected with the prediction probability calculation subunit and is used for identifying the character type corresponding to the component corresponding to the maximum component value in the prediction vector based on the character identification model, taking the identified character type as the character type corresponding to the frame number character on the specified character position, and outputting the character type identification result of the frame number character on the specified character position.
As a preferable aspect of the present invention, the number of the character recognition units is 17, and each of the character recognition units is respectively configured to recognize the character type corresponding to the frame number character at one of the designated character positions in the frame number character string.
As a preferred scheme of the present invention, the end-to-end vehicle frame number recognition system further includes a character recognition model training module, connected to the character recognition module, for training and forming the character recognition model according to the character recognition result.
The invention also provides an end-to-end frame number identification method based on deep learning, which is realized by applying the end-to-end frame number identification system and comprises the following steps:
step S1, inputting an image containing a whole frame number character string by the end-to-end frame number identification system;
step S2, the end-to-end frame number recognition system extracts the image characteristics corresponding to the image and obtains a characteristic diagram corresponding to the image;
step S3, the end-to-end frame number recognition system converts the characteristic diagram into a corresponding characteristic vector;
and step S4, the end-to-end frame number recognition system simultaneously performs corresponding character type recognition on each character in the frame number character string in the image according to the feature vector and based on a preset character recognition model, and finally obtains a character recognition result of the frame number character string through recognition.
As a preferable scheme of the present invention, in step S4, the process of identifying the character type corresponding to each character in the frame number character string by the end-to-end frame number identification system specifically includes the following steps:
step S41, the end-to-end frame number recognition system locates the components corresponding to the frame number characters on each designated character position in the frame number character string in the feature vector based on the preset character recognition model, and obtains a plurality of locating results of the frame number characters related to each designated character position;
step S42, the end-to-end frame number identification system converts the same feature vector into a plurality of corresponding prediction vectors according to each positioning result;
step S43, the end-to-end frame number identification system calculates component values corresponding to the components in the prediction vectors based on the preset character identification model;
step S44, the end-to-end frame number recognition system recognizes, based on the preset character recognition model, a character type corresponding to the component corresponding to the maximum component value in each of the prediction vectors, uses the recognized character type as a character type corresponding to the frame number character on the corresponding designated character position, and finally obtains a character recognition result for the frame number character string by recognition.
The end-to-end frame number recognition system provided by the invention can automatically recognize the frame number characters in an input image containing the frame number character string; the recognition process is fast and efficient, the recognition accuracy is high, and the system thereby solves the technical problems of low recognition efficiency and frequent missed and wrong detections in the conventional manual recognition mode.
Detailed Description
The technical scheme of the invention is further explained below through specific embodiments in combination with the accompanying drawings.
The drawings are for illustration only, are shown in schematic form rather than the actual form, and are not to be construed as limiting the present patent; to better illustrate the embodiments of the present invention, some parts of the drawings may be omitted, enlarged, or reduced, and do not represent the size of an actual product; it will be understood by those skilled in the art that certain well-known structures in the drawings, and descriptions thereof, may be omitted.
The same or similar reference numerals in the drawings of the embodiments of the present invention correspond to the same or similar components. In the description of the present invention, it should be understood that terms such as "upper", "lower", "left", "right", "inner", and "outer", when used to indicate an orientation or positional relationship, are based on the orientation or positional relationship shown in the drawings and are used only for convenience and simplicity of description; they do not indicate or imply that the referred device or element must have a specific orientation or be constructed and operated in a specific orientation. Therefore, the terms describing positional relationships in the drawings are used for illustrative purposes only and are not to be construed as limiting the present patent; the specific meanings of these terms can be understood by those skilled in the art according to the specific situation.
In the description of the present invention, unless otherwise explicitly specified or limited, the term "connected" and the like, if used to indicate a connection relationship between components, is to be understood broadly: the connection may be fixed, detachable, or integral; it may be mechanical or electrical; it may be direct, or indirect through an intervening medium, or through one or more other components, or the components may simply interact with one another. The specific meanings of the above terms in the present invention can be understood by those skilled in the art in specific cases.
Referring to fig. 1, an end-to-end frame number recognition system based on deep learning according to an embodiment of the present invention is used for automatically recognizing a frame number, and the frame number recognition system includes:
the image input module 1 is used for inputting an image containing a whole vehicle frame number character string;
the image feature extraction module 2 is connected with the image input module 1 and is used for extracting the image features corresponding to the image, obtaining a feature map corresponding to the image, and converting the feature map into a corresponding feature vector;
and the character recognition module 3 is connected with the image feature extraction module 2 and is used for recognizing the character type of each frame number character in the frame number character string in the image according to the feature vector and based on a preset character recognition model, and finally obtaining a character recognition result of the frame number character string.
In this technical scheme, the end-to-end frame number recognition system performs convolution operations on the image through the convolutional neural network to obtain the feature map corresponding to the input image. Referring specifically to fig. 6, the convolutional neural network preferably adopts the VGGNet or ResNet network architecture existing in the prior art to extract the image features. The network architecture includes convolutional layers, ReLU layers, and batch normalization layers. The input to the convolutional neural network is an image containing the entire vehicle frame number character string, with a size of 3 × 448. The input image first passes through a convolutional layer with a 3 × 3 convolution kernel, which outputs feature maps of size 64 × 448; four further stages of convolutional feature extraction then output feature maps of sizes 64 × 224, 128 × 112, 256 × 64, and 512 × 32 in turn. Finally, the 512 × 32 feature map is compressed into a 1024-dimensional feature vector, which encodes the position and shape features of the frame number character string in the input image; this feature vector is then fed into the subsequent character recognition network to carry out character recognition of the frame number character string.
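The following is a minimal, non-limiting sketch of such a feature extraction backbone (PyTorch-style). The number of layers per stage, the strides, and the use of global average pooling before the final linear layer are assumptions made only to keep the example runnable; the patent names only the staged channel widths and the 1024-dimensional output.

import torch
import torch.nn as nn

def conv_stage(in_ch: int, out_ch: int, downsample: bool) -> nn.Sequential:
    # One stage: 3x3 convolution + batch normalization + ReLU, optionally halving the resolution.
    return nn.Sequential(
        nn.Conv2d(in_ch, out_ch, kernel_size=3, stride=2 if downsample else 1, padding=1),
        nn.BatchNorm2d(out_ch),
        nn.ReLU(inplace=True),
    )

class FeatureExtractor(nn.Module):
    def __init__(self, feature_dim: int = 1024):
        super().__init__()
        self.stages = nn.Sequential(
            conv_stage(3, 64, downsample=False),    # input image -> 64-channel feature maps
            conv_stage(64, 64, downsample=True),
            conv_stage(64, 128, downsample=True),
            conv_stage(128, 256, downsample=True),
            conv_stage(256, 512, downsample=True),  # channel widths follow the stages named above
        )
        self.pool = nn.AdaptiveAvgPool2d(1)          # collapse the spatial dimensions
        self.fc = nn.Linear(512, feature_dim)        # compress to the 1024-dimensional feature vector

    def forward(self, image: torch.Tensor) -> torch.Tensor:
        x = self.stages(image)
        x = self.pool(x).flatten(1)
        return self.fc(x)                            # shape: (batch, 1024)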
It should be noted that the convolutional neural network image feature extraction method adopted by the end-to-end frame number recognition system is an image feature extraction method existing in the prior art, and the image feature extraction method is not within the scope of the claimed invention, so the specific process of extracting the feature map of the input image by the end-to-end frame number recognition system is not described herein.
In this technical scheme, the end-to-end frame number recognition system also converts the feature map corresponding to the image into the feature vector through the convolutional neural network. The method of converting the feature map into the feature vector using the convolutional neural network exists in the prior art and is not within the scope of the claimed invention, so its detailed conversion process is not described here.
Referring to fig. 2, the character recognition module 3 includes a plurality of character recognition units 31, and each character recognition unit 31 is used for recognizing the character type corresponding to the frame number character at one designated character position in the frame number character string.
Referring to fig. 3, each character recognition unit 31 specifically includes:
the character feature positioning subunit 311 is configured to position, in the feature vector, the component features corresponding to the frame number character at the designated character position, and obtain a positioning result;
the prediction vector generation subunit 312 is connected with the character feature positioning subunit 311 and is configured to convert the feature vector into a corresponding prediction vector according to the positioning result;
the prediction probability calculation subunit 313 is connected with the prediction vector generation subunit 312 and is configured to calculate, based on the character recognition model, the component values (prediction probabilities) corresponding to the components in the prediction vector;
and the character type identification subunit 314 is connected with the prediction probability calculation subunit 313 and is configured to identify, based on the character recognition model, the character type corresponding to the component with the largest component value in the prediction vector, take the identified character type as the character type corresponding to the frame number character at the designated character position, and output the character type identification result for the frame number character at the designated character position.
It is emphasized here that each character recognition unit recognizes the frame number character at only one designated character position in the frame number character string. For example, the first character recognition unit recognizes the frame number character at the first character position in the frame number character string, the second character recognition unit recognizes the frame number character at the second character position in the frame number character string, and so on.
Since the frame number is generally composed of 17 characters, each being a letter, a digit, or drawn from a combination of letters and digits, the number of character recognition units 31 is preferably 17, and each character recognition unit 31 is used for recognizing the character type corresponding to the frame number character at one designated character position in the frame number character string.
The characters cover 36 character types, namely the 26 English letters and the 10 digits 0 through 9.
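For reference, this 36-way label set can be written down directly in code; the constant name below is illustrative only:

import string

VIN_ALPHABET = string.ascii_uppercase + string.digits   # 26 letters followed by the 10 digits, 36 classes in total
assert len(VIN_ALPHABET) == 36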
In the above technical solution, the process of recognizing the character type of the frame number character by the end-to-end frame number recognition system is detailed as follows:
Fig. 7 shows the network architecture of the convolutional neural network adopted by the end-to-end frame number recognition system provided by the present invention to recognize the character type corresponding to the frame number character at a designated character position in the frame number character string. Referring to fig. 7, the network architecture is composed of a first fully connected layer, a second fully connected layer, and a ReLU layer.
The 1024-dimensional feature vector output by the system passes through the first fully connected layer 100, the second fully connected layer 200, and the ReLU layer 300, and a 36-dimensional prediction vector is then output. The 36 components of the 36-dimensional prediction vector represent the 26 English letters and the 10 digits, respectively, and the component values corresponding to the 36 components represent the prediction probabilities that the character is the corresponding English letter or digit.
Specifically, the 1024-dimensional feature vector output by the system is fed simultaneously to the 17 character recognition units. Each character recognition unit extracts the component features of the frame number character at the designated character position it is responsible for and outputs the corresponding prediction vector, i.e. the 36-dimensional vector described above. The component values of the components in this vector are then calculated according to the preset character recognition model (that is, the prediction probability that the character is each corresponding English letter or digit is calculated), the component with the maximum component value is taken as the prediction result, and the character type corresponding to that component is output; for example, if that character type is the character "A", the unit outputs the character "A" as the frame number character at the designated character position it is responsible for.
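A minimal sketch of one such character recognition unit, and of the 17 parallel units, follows (PyTorch-style). The hidden width of 512 and the placement of the ReLU between the two fully connected layers are assumptions made here only to give a runnable example; the patent names the layers but does not fix these details.

import torch
import torch.nn as nn

NUM_POSITIONS = 17   # one character recognition unit per frame number character position
NUM_CLASSES = 36     # 26 letters + 10 digits

class CharacterRecognitionUnit(nn.Module):
    def __init__(self, feature_dim: int = 1024, hidden_dim: int = 512):
        super().__init__()
        self.fc1 = nn.Linear(feature_dim, hidden_dim)   # first fully connected layer
        self.relu = nn.ReLU(inplace=True)               # ReLU layer (placement assumed)
        self.fc2 = nn.Linear(hidden_dim, NUM_CLASSES)   # second fully connected layer

    def forward(self, feature_vector: torch.Tensor) -> torch.Tensor:
        # Returns the 36-dimensional prediction vector (unnormalized scores) for one character position.
        return self.fc2(self.relu(self.fc1(feature_vector)))

# All 17 units receive the same 1024-dimensional feature vector as input.
heads = nn.ModuleList([CharacterRecognitionUnit() for _ in range(NUM_POSITIONS)])

feature_vector = torch.randn(1, 1024)                                    # stand-in for the feature extractor output
prediction_vectors = [head(feature_vector) for head in heads]            # 17 prediction vectors, each (1, 36)
probabilities = [torch.softmax(p, dim=1) for p in prediction_vectors]    # component values as prediction probabilities
predicted_classes = [int(p.argmax(dim=1)) for p in probabilities]        # index of the largest component per position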
It should be noted that, according to the preset character recognition model, each character recognition unit can locate, within the 1024-dimensional feature vector, the component features associated with the designated character position it is responsible for, while ignoring the other feature parts of the 1024-dimensional feature vector.
Preferably, the end-to-end frame number recognition system provided by the invention further comprises a character recognition model training module 4 connected with the character recognition module 3 and used for training and forming the character recognition model according to the character recognition result.
The loss function used to train the character recognition model is calculated by the following formula:

L = -\sum_{c=1}^{M} y_c \log(p_c)

In the above formula, L is used to represent the loss function;
M is used to represent the number of character categories to which each frame number character may belong (i.e., the 26 English letters and the 10 digits);
y_c is an indicator variable representing whether character category c predicted by the system is consistent with the real character category: if so, y_c is 1, otherwise y_c is 0;
p_c is used to represent the prediction probability that the training sample belongs to character category c.
The character recognition model is optimized by an Adam optimization method in the prior art.
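A minimal training-step sketch consistent with the loss and optimizer named above follows (PyTorch-style). Summing the 17 per-position cross-entropy losses into a single objective, the 448 × 448 input shape, and the learning rate shown are assumptions made only to complete the example; the patent does not specify these details.

import torch
import torch.nn as nn

criterion = nn.CrossEntropyLoss()   # categorical cross-entropy: L = -sum_c y_c * log(p_c)

def training_step(model: nn.Module, optimizer: torch.optim.Optimizer,
                  images: torch.Tensor, labels: torch.Tensor) -> float:
    # images: e.g. (batch, 3, 448, 448); labels: (batch, 17) class indices in [0, 35].
    prediction_vectors = model(images)          # list of 17 tensors, each of shape (batch, 36)
    loss = sum(criterion(pred, labels[:, i]) for i, pred in enumerate(prediction_vectors))
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()

# For example: optimizer = torch.optim.Adam(model.parameters(), lr=1e-4)   # Adam, as named above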
The invention also provides an end-to-end frame number identification method based on deep learning, which is implemented by applying the above end-to-end frame number identification system; referring to fig. 4 and 8, the method comprises the following steps:
step S1, inputting an image containing a whole frame number character string by the end-to-end frame number identification system;
step S2, the end-to-end frame number recognition system extracts the image characteristics corresponding to the image and obtains a characteristic diagram corresponding to the image;
step S3, the end-to-end frame number recognition system converts the characteristic diagram into a corresponding characteristic vector;
and step S4, the end-to-end frame number recognition system simultaneously carries out corresponding character type recognition on each character in the frame number character string in the image according to the characteristic vector and based on a preset character recognition model, and finally obtains a character recognition result of the frame number character string through recognition.
Referring to fig. 5, in step S4, the process of the end-to-end frame number recognition system recognizing the character type corresponding to each character in the frame number character string specifically includes the following steps:
step S41, the end-to-end frame number recognition system locates the components corresponding to the frame number characters on each appointed character position in the frame number character string in the feature vector based on the preset character recognition model, and obtains a plurality of locating results of the frame number characters related to each appointed character position;
step S42, the end-to-end frame number recognition system converts the same feature vector into a plurality of corresponding prediction vectors according to each positioning result;
step S43, the end-to-end frame number recognition system calculates component values corresponding to components in the prediction vectors based on a preset character recognition model;
and step S44, the end-to-end frame number recognition system recognizes the character type corresponding to the component corresponding to the maximum component value in each prediction vector based on a preset character recognition model, uses the recognized character type as the character type corresponding to the frame number character on the corresponding designated character position in the frame number character string, and finally recognizes to obtain the character recognition result of the frame number character string.
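Steps S41 to S44 can be illustrated with the following non-limiting sketch (PyTorch-style), reusing the illustrative model and the VIN_ALPHABET constant from the earlier sketches:

import torch

@torch.no_grad()
def recognize_frame_number(model, image: torch.Tensor) -> str:
    # image: a (1, 3, H, W) tensor containing the whole frame number character string.
    prediction_vectors = model(image)                      # 17 prediction vectors, each of shape (1, 36)
    characters = []
    for pred in prediction_vectors:
        probabilities = torch.softmax(pred, dim=1)         # component values as prediction probabilities (step S43)
        class_index = int(probabilities.argmax(dim=1))     # component with the largest value (step S44)
        characters.append(VIN_ALPHABET[class_index])       # map the component back to its character type
    return "".join(characters)                             # the 17 recognized characters, in position order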
In the above technical solution, in step S41, the positioning of the components corresponding to the frame number characters at the designated character positions in the feature vector is performed by the pre-trained character recognition model. This positioning method exists in the prior art and is not within the scope of the claimed invention, so the specific process by which the system positions the components based on the convolutional neural network is not described here.
In step S42, the prediction vector is a 36-dimensional vector whose 36 components indicate the possible character categories of the frame number character at the designated character position, namely the corresponding 26 English letters or 10 digits, and the component value corresponding to each of the 36 components is the prediction probability that the character belongs to that category.
In step S42, the method by which the system locates the character features corresponding to the frame number character at the designated character position in the 1024-dimensional feature vector is a conventional positioning method existing in the prior art; this positioning method is not within the scope of the claimed invention and is therefore not described here.
In step S43, the method by which the system calculates the component values corresponding to the components in the 36-dimensional prediction vector is also a method existing in the prior art, preferably implemented with the above convolutional neural network, and the specific calculation process is not described herein.
It should be emphasized that each character recognition unit only recognizes the frame number character at its designated character position in the frame number character string; for example, the first character recognition unit only recognizes the character type corresponding to the frame number character at the first designated character position, the second character recognition unit only recognizes the character type corresponding to the frame number character at the second designated character position, and so on. As a result, the characters in the recognition result produced by the system for the frame number character string are arranged in order, and no disorder occurs.
In conclusion, the end-to-end frame number recognition system provided by the invention can automatically recognize the frame number characters in an input image containing the frame number character string; the recognition process is fast and efficient, the recognition accuracy is high, and the system solves the technical problems of low recognition efficiency and frequent missed and wrong detections in the conventional manual recognition mode.
It should be understood that the above-described embodiments are merely preferred embodiments of the invention and the technical principles applied thereto. It will be understood by those skilled in the art that various modifications, equivalents, changes, and the like can be made to the present invention. However, such variations are within the scope of the invention as long as they do not depart from the spirit of the invention. In addition, certain terms used in the specification and claims of the present application are not limiting, but are used merely for convenience of description.