Movatterモバイル変換


[0]ホーム

URL:


CN112990432B - Target recognition model training method and device and electronic equipment - Google Patents

Target recognition model training method and device and electronic equipment
Download PDF

Info

Publication number
CN112990432B
CN112990432BCN202110242083.5ACN202110242083ACN112990432BCN 112990432 BCN112990432 BCN 112990432BCN 202110242083 ACN202110242083 ACN 202110242083ACN 112990432 BCN112990432 BCN 112990432B
Authority
CN
China
Prior art keywords
training
sample
current
loss function
image
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202110242083.5A
Other languages
Chinese (zh)
Other versions
CN112990432A (en
Inventor
张梦琴
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Kingsoft Cloud Network Technology Co Ltd
Original Assignee
Beijing Kingsoft Cloud Network Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Kingsoft Cloud Network Technology Co LtdfiledCriticalBeijing Kingsoft Cloud Network Technology Co Ltd
Priority to CN202110242083.5ApriorityCriticalpatent/CN112990432B/en
Publication of CN112990432ApublicationCriticalpatent/CN112990432A/en
Application grantedgrantedCritical
Publication of CN112990432BpublicationCriticalpatent/CN112990432B/en
Activelegal-statusCriticalCurrent
Anticipated expirationlegal-statusCritical

Links

Classifications

Landscapes

Abstract

The application provides a target recognition model training method, a device and electronic equipment, which are used for acquiring a training sample set and a fitting image set, inputting samples in a current training sample subset into an initial model to obtain a first feature vector and a prediction label of each sample; extracting features of the current fitting image through a second middle layer of the initial image classification model to obtain a second feature vector corresponding to the current fitting image; calculating a feature loss function value according to the first feature vector corresponding to the positive sample and the second feature vector corresponding to the current fitting image; calculating a cross entropy loss function value according to the prediction label and the real label corresponding to each sample; and performing back propagation training based on the characteristic loss function value and the cross entropy loss function value to obtain a target recognition model. The application can train out the target recognition model which can recognize whether the image contains the target or not, and improve the recognition accuracy and recall rate of the target recognition model.

Description

Target recognition model training method and device and electronic equipment
Technical Field
The present application relates to the field of image recognition technologies, and in particular, to a method and apparatus for training a target recognition model, and an electronic device.
Background
The current image classification tasks are mainly divided into traditional image classification tasks and fine-grained image classification tasks. In an image recognition scene which only needs to recognize whether a certain target exists in an image and does not need to recognize the type, position and other detailed information of the target, if a traditional image classification task is adopted for model training, the characteristics of a key small target are easily ignored, so that the recognition capability of the model is poor; if the fine-grained classification task is used for model training, the training process and the obtained model are too complex, and the recognition efficiency is affected.
Disclosure of Invention
The application aims to provide a target recognition model training method, a device and electronic equipment, wherein a feature loss function value can be calculated through feature extraction of a fitting image, and a reverse gradient propagation training is carried out on an initial image classification model through the feature loss function value and a cross entropy loss function value, so that a target recognition model capable of recognizing whether an image contains a target can be trained, and the recognition accuracy rate and recall rate of the target recognition model are improved.
In a first aspect, an embodiment of the present application provides a method for training a target recognition model, where the method is applied to an electronic device, and the electronic device pre-stores an initial image classification model; the method comprises the following steps: acquiring a training sample set and a fitting image set; the samples in the training sample set comprise positive samples and negative samples, and the image in the fitting image set is an image with the area ratio of the target being greater than a set threshold; based on the training sample set and the fitting image set, determining a training sample subset and a current fitting image corresponding to each round of training, and executing the following operations for each round of training: inputting samples in the current training sample subset into an initial image classification model to obtain a first feature vector and a prediction label of each sample; the first feature vector is a vector output by a first middle layer of the initial image classification model; extracting features of the current fitting image through a second middle layer of the initial image classification model to obtain a second feature vector corresponding to the current fitting image; calculating a feature loss function value of the training according to a first feature vector corresponding to the positive sample in the current training sample subset and a second feature vector corresponding to the current fitting image; calculating a cross entropy loss function value of the current training according to the prediction label and the real label corresponding to each sample in the current training sample subset; and determining the total loss value of the training round based on the characteristic loss function value and the cross entropy loss function value of the training round, and performing reverse gradient propagation training on the initial image classification model according to the total loss value of the training round until the training round reaches the preset times or the total loss value converges to the preset convergence threshold value, so as to obtain the target recognition model.
Further, the initial image classification model comprises a convolutional neural network, an attention structure, a fusion module and a classifier which are connected in sequence; the fusion module is a first intermediate layer; inputting samples in the current training sample subset into an initial image classification model to obtain a first feature vector and a predictive label of each sample, wherein the step comprises the following steps: inputting samples in the current training sample subset into a convolutional neural network to obtain an original feature map corresponding to each sample; inputting the original feature map corresponding to each sample into an attention structure to obtain an attention map corresponding to each sample; inputting the original feature map and attention map corresponding to each sample into a fusion module to obtain a first feature vector corresponding to each sample; and inputting the first feature vector corresponding to each sample into a classifier to obtain a prediction label corresponding to each sample.
Further, the step of inputting the original feature map and the attention map corresponding to each sample into the fusion module to obtain the first feature vector corresponding to each sample includes: for each sample corresponding original feature map and attention map, the following operations are performed: spatially normalizing the attention map corresponding to the sample by a softmax function to obtain a value corresponding to each pixel in the attention map; and taking the value corresponding to each pixel in the attention map as a weight value, and carrying out weighted summation on the original feature map corresponding to the sample to obtain a first feature vector corresponding to the sample.
Further, the second intermediate layer is a convolutional neural network; the step of extracting the characteristics of the current fitting image through the second middle layer of the initial image classification model to obtain a second characteristic vector corresponding to the current fitting image comprises the following steps: and inputting the current fitting image into a convolutional neural network to obtain a second feature vector corresponding to the current fitting image.
Further, the step of calculating the feature loss function value of the current training according to the first feature vector corresponding to the positive sample in the current training sample subset and the second feature vector corresponding to the current fitting image includes: calculating a first feature loss function value corresponding to each positive sample according to a first feature vector corresponding to each positive sample in the current training sample subset and a second feature vector corresponding to the current fitting image; and carrying out mean value calculation on the first characteristic loss function values corresponding to the positive samples to obtain the characteristic loss function values of the training.
Further, the step of calculating the first feature loss function value corresponding to each positive sample according to the first feature vector corresponding to each positive sample in the current training sample subset and the second feature vector corresponding to the current fitting image includes: the first characteristic loss function value of the positive sample is calculated by the following formula:
Wherein L is2 A first feature loss function value representing a positive sample; the MSE () represents the mean square error function,a first feature vector corresponding to the positive sample is represented; v2 Representing a second feature vector corresponding to the currently fitted image.
Further, the step of calculating the cross entropy loss function value of the present training according to the prediction label and the real label corresponding to each sample in the current training sample subset includes: calculating a first cross entropy loss function value corresponding to each sample according to the prediction label, the real label and the cross entropy loss function corresponding to each sample in the current training sample subset; and carrying out mean value calculation on the first cross entropy loss function value corresponding to each sample to obtain the cross entropy loss function value of the training.
Further, the step of determining the total loss value of the present training based on the feature loss function value and the cross entropy loss function value of the present training includes: and summing the characteristic loss function value and the cross entropy loss function value of the training to obtain the total loss value of the training.
Further, the attention structure includes three convolution layers; each convolution layer is followed by a BN layer and a linear connection unit.
Further, the method further comprises the following steps: predicting a designated image by using a target recognition model obtained by current training every other preset training rounds; designating the image as a target related image which is not marked with labels; if the confidence of the predicted result exceeds a preset threshold, adding the specified image to a training sample set for model training.
Further, the method further comprises the following steps: acquiring an image to be identified; and inputting the image to be identified into the target identification model to obtain an identification result corresponding to the image to be identified.
In a second aspect, the embodiment of the application also provides a training device for the target recognition model, which is applied to electronic equipment, wherein the electronic equipment is pre-stored with an initial image classification model; the device comprises: the image set acquisition module is used for acquiring a training sample set and a fitting image set; the samples in the training sample set comprise positive samples and negative samples, and the image in the fitting image set is an image with the area ratio of the target being greater than a set threshold; the model training module is used for determining a training sample subset and a current fitting image corresponding to each round of training based on the training sample set and the fitting image set, and executing the following operations for each round of training: inputting samples in the current training sample subset into an initial image classification model to obtain a first feature vector and a prediction label of each sample; the first feature vector is a vector output by a first middle layer of the initial image classification model; extracting features of the current fitting image through a second middle layer of the initial image classification model to obtain a second feature vector corresponding to the current fitting image; calculating a feature loss function value of the training according to a first feature vector corresponding to the positive sample in the current training sample subset and a second feature vector corresponding to the current fitting image; calculating a cross entropy loss function value of the current training according to the prediction label and the real label corresponding to each sample in the current training sample subset; and determining the total loss value of the training round based on the characteristic loss function value and the cross entropy loss function value of the training round, and performing reverse gradient propagation training on the initial image classification model according to the total loss value of the training round until the training round reaches the preset times or the total loss value converges to the preset convergence threshold value, so as to obtain the target recognition model.
In a third aspect, an embodiment of the present application further provides an electronic device, including a processor and a memory, where the memory stores computer executable instructions executable by the processor, where the processor executes the computer executable instructions to implement the method according to the first aspect.
In a fourth aspect, embodiments of the present application also provide a computer-readable storage medium storing computer-executable instructions that, when invoked and executed by a processor, cause the processor to implement the method of the first aspect.
In the target recognition model training method provided by the embodiment of the application, a training sample set and a fitting image set are firstly obtained; the samples in the training sample set comprise positive samples and negative samples, and the image in the fitting image set is an image with the area ratio of the target being greater than a set threshold value; based on the training sample set and the fitting image set, determining a training sample subset and a current fitting image corresponding to each round of training, and executing the following operations for each round of training: inputting samples in the current training sample subset into an initial image classification model to obtain a first feature vector and a prediction label of each sample; the first feature vector is a vector output by a first middle layer of the initial image classification model; extracting features of the current fitting image through a second middle layer of the initial image classification model to obtain a second feature vector corresponding to the current fitting image; calculating a feature loss function value of the training according to a first feature vector corresponding to the positive sample in the current training sample subset and a second feature vector corresponding to the current fitting image; calculating a cross entropy loss function value of the current training according to the prediction label and the real label corresponding to each sample in the current training sample subset; and determining the total loss value of the training round based on the characteristic loss function value and the cross entropy loss function value of the training round, and performing reverse gradient propagation training on the initial image classification model according to the total loss value of the training round until the training round reaches the preset times or the total loss value converges to the preset convergence threshold value, so as to obtain the target recognition model. According to the method and the device for classifying the images, the feature loss function value can be calculated through feature extraction of the fitted images, the initial image classification model is subjected to inverse gradient propagation training through the feature loss function value and the cross entropy loss function value, the object recognition model capable of recognizing whether the images carry objects can be trained, and the recognition accuracy rate and recall rate of the object recognition model are improved.
Drawings
In order to more clearly illustrate the embodiments of the present application or the technical solutions in the prior art, the drawings that are needed in the description of the embodiments or the prior art will be briefly described, and it is obvious that the drawings in the description below are some embodiments of the present application, and other drawings can be obtained according to the drawings without inventive effort for a person skilled in the art.
FIG. 1 is a flowchart of a training method for a target recognition model according to an embodiment of the present application;
FIG. 2 is a schematic diagram of a training process of a target recognition model according to an embodiment of the present application;
FIG. 3 is a flowchart of a target recognition method according to an embodiment of the present application;
FIG. 4 is a block diagram of a training device for a target recognition model according to an embodiment of the present application;
fig. 5 is a schematic structural diagram of an electronic device according to an embodiment of the present application.
Detailed Description
The technical solutions of the present application will be clearly and completely described in connection with the embodiments, and it is apparent that the described embodiments are some embodiments of the present application, but not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the application without making any inventive effort, are intended to be within the scope of the application.
The current image classification tasks are mainly divided into traditional image classification tasks and fine-grained image classification tasks. In the traditional image classification task, no matter how large the important discrimination area in the image accounts for the proportion of the whole image, only the extraction features of the whole image are extracted at the same time, and then classification is carried out; however, in the fine-grained image classification, the distinguishable region in the image to be classified is often only in a small region in the image, so that it is generally necessary to obtain the region of the object of interest first, and then finely classify the object in a plurality of categories with small differences.
The classification of fine-grained images is classified into strong supervised learning and weak supervised learning. The strong supervision learning needs to add more marking frames to the network to perform the strong supervision learning, so that the network can learn the position information of the target, and the method is similar to the target detection task. The weak supervision learning is to judge the position of the region through the unsupervised learning by the network, and then pay special Attention to the characteristic difference of the region to identify the category of the target, and the common method is to obtain the position of the judging region through analyzing the most prominent part in the characteristic diagram based on the image classification of the Attention (Attention) mechanism.
In an image recognition scene which only needs to recognize whether a certain target exists in an image and does not need to recognize the type, position and other detailed information of the target, if a traditional image classification task is adopted for model training, the characteristics of a key small target are easily ignored, so that the recognition capability of the model is poor; if the fine-grained classification task is used for model training, the training process and the obtained model are too complex, and the recognition efficiency is affected.
Based on the above, the embodiment of the application provides a target recognition model training method, a device and electronic equipment, wherein a feature loss function value can be calculated through feature extraction of a fitting image, and a reverse gradient propagation training is carried out on an initial image classification model through the feature loss function value and a cross entropy loss function value, so that a target recognition model capable of recognizing whether a target is carried in the image can be trained, and the recognition accuracy rate and recall rate of the target recognition model are improved.
For the sake of understanding the present embodiment, first, a method for training a target recognition model disclosed in the present embodiment is described in detail.
FIG. 1 is a flowchart of a training method for a target recognition model, which is applied to an electronic device, wherein an initial image classification model is pre-stored in the electronic device; the initial image classification model may be implemented in a variety of ways, and is not specifically limited herein. The object may be a gun, a cutter, or the like, and the object recognition model trained by the object recognition model training method provided in this embodiment may quickly determine whether a certain image includes or carries the object, where the object recognition model training method specifically includes the following steps:
Step S11, a training sample set and a fitting image set are obtained.
The samples in the training sample set comprise positive samples and negative samples, wherein the positive samples are images containing targets, and the negative samples do not contain the targets; the image in the fitting image set is an image with the area ratio of the target being larger than a set threshold, for example, only contains pure samples of the target, or the area ratio of the target being larger than a certain threshold, for example, 95%, and the threshold can be adjusted according to practical situations.
And step S12, based on the training sample set and the fitting image set, determining a training sample set and a current fitting image corresponding to each round of training, and executing the following operation for each round of training until the training round reaches the preset times or the total loss value converges to the preset convergence threshold value, so as to obtain the target recognition model.
During model training, a training sample subset corresponding to current wheel training and a current fitting image are required to be determined from a training sample set and a fitting image set, for example, 20 images are selected from the training sample set to serve as samples in the training sample subset corresponding to the current wheel training, and one fitting image is randomly extracted from the fitting image set to serve as the current fitting image. And then executing the model training process of the following five steps until the training round reaches the preset times (such as 100 times) or the total loss value converges to the preset convergence threshold value, and stopping training to obtain the target recognition model.
The following five steps are performed for each round of training:
step S121, inputting samples in the current training sample subset into an initial image classification model to obtain a first feature vector and a prediction label of each sample; wherein the first feature vector is a vector output by a first intermediate layer of the initial image classification model.
The above-mentioned process of obtaining the first feature vector may include various ways, and the first intermediate layer for extracting the feature vector is different for the initial image classification models with different structures. In the embodiment of the present application, the first middle layer may be a fusion module, where after the feature map extracted by the neural network and the attention map extracted by the attention structure are fused, a first feature vector of the sample is output.
On the basis of obtaining the first feature vector of the sample, the classification result, that is, the prediction label of the sample, can be further output through the classifier, for example, the label includes Y and N, Y represents that the sample is an image containing the target, and N represents that the sample is an image not containing the target.
And step S122, extracting features of the current fitting image through a second middle layer of the initial image classification model to obtain a second feature vector corresponding to the current fitting image.
And the second middle layer is different from the first middle layer in structure position in the initial image classification model, and the second characteristic vector can be output through the second middle layer by inputting the current fitting image into the initial image classification model.
Step S123, calculating the feature loss function value of the current training according to the first feature vector corresponding to the positive sample in the current training sample subset and the second feature vector corresponding to the current fitting image.
The calculation of the feature loss function value may be performed by substituting two feature vectors into a preset feature loss function. If the positive samples are one, the first feature vector corresponding to the positive sample and the second feature vector corresponding to the current fitting image are directly substituted into a preset feature loss function to be calculated, and normally, the positive samples are multiple, the feature loss function value of each positive sample can be calculated respectively, and then the average value of the feature loss function values corresponding to the positive samples is taken as the feature loss function value of the training of the round.
Step S124, calculating the cross entropy loss function value of the training according to the prediction label and the real label corresponding to each sample in the current training sample subset.
Similarly, the calculation of the cross entropy loss function value can also be realized by adopting a preset calculation formula, and the average value of the cross entropy loss function values corresponding to a plurality of samples can be taken as the cross entropy loss function value of the training.
And step S125, determining the total loss value of the round of training based on the characteristic loss function value and the cross entropy loss function value of the round of training, and carrying out reverse gradient propagation training on the initial image classification model according to the total loss value of the round of training.
In the step, the characteristic loss function value and the cross entropy loss function value of the round of training are added to obtain the total loss value of the round of training, and then the initial image classification model is subjected to reverse gradient propagation training through the total loss value.
Through a certain number of cyclic training processes, a more ideal target recognition model can be finally obtained. According to the target recognition model training method provided by the embodiment of the application, the feature vector extraction of the fitting image is added, so that the feature loss function value can be calculated, the initial image classification model is subjected to reverse gradient propagation training through the feature loss function value and the cross entropy loss function value, the target recognition model capable of recognizing whether the image carries a target can be trained, and the recognition accuracy and recall rate of the target recognition model are improved.
In the following, a preferred embodiment is listed, and a training process of a target recognition model is realized by adding an attention mechanism, and as shown in fig. 2, in the embodiment of the present application, the initial image classification model includes a convolutional neural network, an attention structure, a fusion module and a classifier which are sequentially connected; the fusion module, that is, the first intermediate layer, may output a first feature vector of the sample.
The specific model training process is as follows:
(1) Aiming at the current training sample subset and the corresponding current fitting image thereof, simultaneously carrying out the characteristic extraction step:
the feature extraction process for the current training sample subset is as follows:
A. and inputting samples in the current training sample subset into a convolutional neural network to obtain an original feature map corresponding to each sample.
In the embodiment of the present application, a Residual Network (res net 50) is used to implement a process of extracting a feature map from a sample in a current training sample subset, and the current main convolutional neural Network may be other networks, such as VGG, res net152, and the like. Model parameters trained on an ImageNet image database are adopted in the initialization process, and only the full-connection layer of the last layer is required to be modified into a two-class problem of whether a target is carried in the current sample set in the training process. The input size of all sample data is first scaled to 224 x 224, and in the embodiment of the application, the feature map extracted by the last convolution layer of the res net50 model is extracted as the original feature map Vs of the samples in the current training sample subset.
B. Inputting the original feature map corresponding to each sample into an attention structure to obtain an attention map corresponding to each sample; the attention structure includes three convolution layers; each convolution layer is followed by a BN layer and a linear connection unit.
After the feature map Vs is obtained from the res net50, it is input to the Attention structure learning to obtain an Attention map Vatt. The Attention structure consists of three convolutions, the first using 1024 convolution kernels of size 1*1, the second using 512 convolution kernels of size 3*3, the third using 1 convolution kernel of size 1*1, with one BN layer and a modified linear element following each convolution. The BN layer functions in three main ways: the training and convergence speed of the network is increased; controlling gradient explosion to prevent gradient disappearance; overfitting is prevented.
C. And inputting the original feature map and attention map corresponding to each sample into a fusion module to obtain a first feature vector corresponding to each sample.
Specifically, for each original feature map and attention map corresponding to each sample, the following operations are performed: spatially normalizing the attention map corresponding to the sample by a softmax function to obtain a value corresponding to each pixel in the attention map; and taking the value corresponding to each pixel in the attention map as a weight value, and carrying out weighted summation on the original feature map corresponding to the sample to obtain a first feature vector corresponding to the sample.
The softmax function described above is as follows:
wherein a isi,j To take care of the values at the (i, j) position in the vant, i.e. the weight values at the (i, j) position in the original feature map, after spatial normalization;to pay attention to force Vatt The middle position is the value at (i, j).
The first eigenvector is calculated as follows:
v1 =∑i,j xi,j ai,j
wherein v is1 Representing a first feature vector corresponding to the sample; x is xi,j Representing the feature vector, a, at the position (i, j) in the original feature map Vsi,j The values at the (i, j) position in Vatt, i.e. the weight values at the (i, j) position in the original feature map, are sought after spatial normalization.
The feature extraction process for the current fitted image is as follows:
A. and inputting the current fitting image into a convolutional neural network to obtain a second feature vector corresponding to the current fitting image. The convolutional neural network is the second middle layer of the initial image classification model.
The same depth convolution neural network ResNet50 is used for extracting the characteristics of the current fitting image, at the moment, the last full connection layer of the network model is removed, and the characteristics of the last convolution layer are extracted as characteristic vectors, so that a second characteristic vector v corresponding to the current fitting image is obtained2
(2) And inputting the first feature vector corresponding to each sample into a classifier to obtain a prediction label corresponding to each sample.
Using the first eigenvector v corresponding to each sample1 To learn a two-class linear classifier for object recognition:where W and b are linear classifier parameters, and each sample is associated with a first eigenvector v1 And inputting the classifier to obtain the prediction label corresponding to each sample.
(3) The feature Loss function value corresponding to this round of training is calculated, as Loss2 in fig. 2.
In order to train the Attention structure, the embodiment of the application needs to calculate the characteristic fitting loss, namely calculate the second characteristic vector v of the fitting image2 And a first feature vector v for classification1 The fitting capability of the training method enables an attention mechanism to automatically judge the characteristics of a target area in an image, and the attention mechanism needs to be noted that as no target is contained in a negative sample during training, the characteristic loss function value corresponding to the training is calculated only for a positive sample by the following steps.
A. And calculating a first characteristic loss function value corresponding to each positive sample according to the first characteristic vector corresponding to each positive sample in the current training sample subset and the second characteristic vector corresponding to the current fitting image.
Specifically, the first characteristic loss function value of the positive sample is calculated by the following formula:
Wherein L is2 A first feature loss function value representing a positive sample; the MSE () represents the mean square error function,a first feature vector corresponding to the positive sample is represented; v2 Representing a second feature vector corresponding to the currently fitted image.
B. And carrying out mean value calculation on the first characteristic loss function values corresponding to the positive samples to obtain the characteristic loss function values of the training.
For example, the subset of the training samples of the present round includes 20 images, wherein 7 positive samples are included, and then an average value of the first feature loss function values corresponding to the 7 positive samples respectively can be calculated to obtain the feature loss function value of the training of the present round.
(4) The cross entropy Loss function value corresponding to this round of training is calculated, as Loss1 in fig. 2.
A. And calculating a first cross entropy loss function value corresponding to each sample according to the prediction label, the real label and the cross entropy loss function corresponding to each sample in the current training sample subset.
Computing predictive labelsLoss from the genuine tag y, i.e. minimize +.>And y, the formula is: />Where Cross Entropy () is the Cross Entropy loss function. From this function, a first cross entropy loss function value corresponding to each sample can be calculated.
B. And carrying out mean value calculation on the first cross entropy loss function value corresponding to each sample to obtain the cross entropy loss function value of the training.
For example, if the subset of training samples includes 20 images, an average value of the first cross entropy loss function values corresponding to the 20 samples may be calculated to obtain the cross entropy loss function value of the training sample.
(5) The total Loss value corresponding to the training round is calculated, such as the Loss total in fig. 2.
And summing the characteristic loss function value and the cross entropy loss function value of the training to obtain the total loss value of the training.
ModelThe final loss function is:therefore, the characteristic loss function value and the cross entropy loss function value of the training are summed to obtain the total loss value of the training.
(6) Back propagation training. And performing back propagation training based on the calculated total loss value of the training round.
Repeating the steps (1) - (6), and training to obtain the target recognition model.
In addition, the samples in the training sample set need to be labeled manually before training, namely, the samples are divided into positive samples and negative samples, and because the data labeling cost is high, training data are few when the preliminary image classification model is trained.
Namely: in the model training process, predicting a designated image by using a target recognition model obtained by current training every other preset training rounds; designating the image as a target related image which is not marked with labels; if the confidence of the predicted result exceeds a preset threshold, adding the specified image to a training sample set for model training.
In practical application, a certain threshold k can be set, firstly, a trained target recognition model is loaded to predict unlabeled data, images with confidence coefficient larger than the threshold k are automatically picked and added into training, and each training time is n epochs, the model automatically reselects the unlabeled data once, and the size of the threshold k is adjusted by observing the selected data quantity and the test result in the model training process. Through fine adjustment of the model, the accuracy and generalization capability of the model can be improved.
According to the target recognition model training method provided by the embodiment of the application, the attention weighted feature vector and the fitting capacity between the fitting images are calculated to directly train the attention structure while the cross entropy loss of model prediction is calculated, so that the accuracy of model recognition is improved. In addition, the semi-supervised training method for selecting unlabeled images while training is performed in the training process, so that the generalization capability of the model can be improved without increasing the labeling cost.
Further, an embodiment of the present application further provides a target recognition method, as shown in fig. 3, where the method includes the following steps:
step S302, obtaining an image to be identified;
step S304, inputting the image to be identified into the target identification model to obtain an identification result corresponding to the image to be identified.
The target recognition model is a target recognition model obtained by training the target recognition model training method described in the previous embodiment, and the image to be recognized is input into the target recognition model to obtain a recognition result corresponding to the image to be recognized, that is, a prediction label is obtained by the extraction process of the first feature vector described in the previous embodiment and the prediction of the classifier, where the prediction label can represent whether the image to be recognized is an image including the target. The specific identification process can be referred to the previous embodiment, and will not be described herein.
Based on the method embodiment, the embodiment of the application also provides a target recognition model training device which is applied to electronic equipment, wherein the electronic equipment pre-stores an initial image classification model; referring to fig. 4, the apparatus includes:
an image set acquisition module 41, configured to acquire a training sample set and a fitting image set; the samples in the training sample set comprise positive samples and negative samples, and the image in the fitting image set is an image with the area ratio of the target being greater than a set threshold; the model training module 42 is configured to determine, based on the training sample set and the fitting image set, a training sample subset and a current fitting image corresponding to each training round, and perform the following operations for each training round, until the training round reaches a preset number of times or the total loss value converges to a preset convergence threshold, and obtain the target recognition model.
The model training module 42 includes: the feature extraction and identification module 421, the loss value calculation module 422 and the back propagation training module 423, where the feature extraction and identification module 421 is configured to input samples in the current training sample subset into the initial image classification model to obtain a first feature vector and a prediction label of each sample; the first feature vector is a vector output by a first middle layer of the initial image classification model; extracting features of the current fitting image through a second middle layer of the initial image classification model to obtain a second feature vector corresponding to the current fitting image; the loss value calculation module 422 is configured to calculate a feature loss function value of the current training according to a first feature vector corresponding to the positive sample in the current training sample subset and a second feature vector corresponding to the current fitting image; calculating a cross entropy loss function value of the current training according to the prediction label and the real label corresponding to each sample in the current training sample subset; determining a total loss value of the present round of training based on the characteristic loss function value and the cross entropy loss function value of the present round of training; the back propagation training module 423 is configured to perform back gradient propagation training on the initial image classification model according to the total loss value of the current training.
Further, the initial image classification model comprises a convolutional neural network, an attention structure, a fusion module and a classifier which are connected in sequence; the fusion module is a first intermediate layer; the feature extraction and identification module 421 is further configured to: inputting samples in the current training sample subset into a convolutional neural network to obtain an original feature map corresponding to each sample; inputting the original feature map corresponding to each sample into an attention structure to obtain an attention map corresponding to each sample; inputting the original feature map and attention map corresponding to each sample into a fusion module to obtain a first feature vector corresponding to each sample; and inputting the first feature vector corresponding to each sample into a classifier to obtain a prediction label corresponding to each sample.
Further, the feature extraction and identification module 421 is further configured to: for each sample corresponding original feature map and attention map, the following operations are performed: spatially normalizing the attention map corresponding to the sample by a softmax function to obtain a value corresponding to each pixel in the attention map; and taking the value corresponding to each pixel in the attention map as a weight value, and carrying out weighted summation on the original feature map corresponding to the sample to obtain a first feature vector corresponding to the sample.
Further, the second intermediate layer is a convolutional neural network; the feature extraction and identification module 421 is further configured to: and inputting the current fitting image into a convolutional neural network to obtain a second feature vector corresponding to the current fitting image.
Further, the loss value calculation module 422 is further configured to: calculating a first feature loss function value corresponding to each positive sample according to a first feature vector corresponding to each positive sample in the current training sample subset and a second feature vector corresponding to the current fitting image; and carrying out mean value calculation on the first characteristic loss function values corresponding to the positive samples to obtain the characteristic loss function values of the training.
Further, the loss value calculation module 422 is further configured to: the first characteristic loss function value of the positive sample is calculated by the following formula:
wherein L is2 A first feature loss function value representing a positive sample; the MSE () represents the mean square error function,a first feature vector corresponding to the positive sample is represented; v2 Representing a second feature vector corresponding to the currently fitted image.
Further, the loss value calculation module 422 is further configured to: calculating a first cross entropy loss function value corresponding to each sample according to the prediction label, the real label and the cross entropy loss function corresponding to each sample in the current training sample subset; and carrying out mean value calculation on the first cross entropy loss function value corresponding to each sample to obtain the cross entropy loss function value of the training.
Further, the loss value calculation module 422 is further configured to: and summing the characteristic loss function value and the cross entropy loss function value of the training to obtain the total loss value of the training.
Further, the attention structure includes three convolution layers; each convolution layer is followed by a BN layer and a linear connection unit.
Further, the model training module 42 is further configured to: in the model training process, predicting a designated image by using a target recognition model obtained by current training every other preset training rounds; designating the image as a target related image which is not marked with labels; if the confidence of the predicted result exceeds a preset threshold, adding the specified image to a training sample set for model training.
Further, the device further comprises: the image recognition module is used for acquiring an image to be recognized; and inputting the image to be identified into the target identification model to obtain an identification result corresponding to the image to be identified.
The implementation principle and the generated technical effects of the object recognition model training device provided by the embodiment of the application are the same as those of the object recognition model training method, and for the sake of brief description, reference may be made to corresponding contents in the embodiment of the object recognition model training method where the embodiment of the object recognition model training device is not mentioned.
An embodiment of the present application further provides an electronic device, as shown in fig. 5, which is a schematic structural diagram of the electronic device, where the electronic device includes a processor 51 and a memory 50, where the memory 50 stores computer executable instructions that can be executed by the processor 51, and the processor 51 executes the computer executable instructions to implement the above method.
In the embodiment shown in fig. 5, the electronic device further comprises a bus 52 and a communication interface 53, wherein the processor 51, the communication interface 53 and the memory 50 are connected by the bus 52.
The memory 50 may include a high-speed random access memory (RAM, random Access Memory), and may further include a non-volatile memory (non-volatile memory), such as at least one magnetic disk memory. The communication connection between the system network element and at least one other network element is achieved via at least one communication interface 53 (which may be wired or wireless), and the internet, wide area network, local network, metropolitan area network, etc. may be used. Bus 52 may be an ISA (Industry Standard Architecture ) bus, a PCI (Peripheral Component Interconnect, peripheral component interconnect standard) bus, or EISA (Extended Industry Standard Architecture ) bus, among others. The bus 52 may be classified as an address bus, a data bus, a control bus, or the like. For ease of illustration, only one bi-directional arrow is shown in FIG. 5, but not only one bus or type of bus.
The processor 51 may be an integrated circuit chip with signal processing capabilities. In implementation, the steps of the above method may be performed by integrated logic circuits of hardware in the processor 51 or by instructions in the form of software. The processor 51 may be a general-purpose processor, including a central processing unit (Central Processing Unit, CPU for short), a network processor (Network Processor, NP for short), etc.; but also digital signal processors (Digital Signal Processor, DSP for short), application specific integrated circuits (Application Specific Integrated Circuit, ASIC for short), field-programmable gate arrays (Field-Programmable Gate Array, FPGA for short) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components. A general purpose processor may be a microprocessor or the processor may be any conventional processor or the like. The steps of the method disclosed in connection with the embodiments of the present application may be embodied directly in the execution of a hardware decoding processor, or in the execution of a combination of hardware and software modules in a decoding processor. The software modules may be located in a random access memory, flash memory, read only memory, programmable read only memory, or electrically erasable programmable memory, registers, etc. as well known in the art. The storage medium is located in a memory and the processor 51 reads information in the memory and in combination with its hardware performs the steps of the method of the previous embodiment.
The embodiment of the application also provides a computer readable storage medium, which stores computer executable instructions that, when being called and executed by a processor, cause the processor to implement the above method, and the specific implementation can refer to the foregoing method embodiment and will not be described herein.
The method, the apparatus and the computer program product of the electronic device for training the target recognition model provided by the embodiments of the present application include a computer readable storage medium storing program codes, and the instructions included in the program codes may be used to execute the method described in the foregoing method embodiment, and specific implementation may refer to the method embodiment and will not be repeated herein.
The relative steps, numerical expressions and numerical values of the components and steps set forth in these embodiments do not limit the scope of the present application unless it is specifically stated otherwise.
The functions, if implemented in the form of software functional units and sold or used as a stand-alone product, may be stored in a non-volatile computer readable storage medium executable by a processor. Based on this understanding, the technical solution of the present application may be embodied essentially or in a part contributing to the prior art or in a part of the technical solution, in the form of a software product stored in a storage medium, comprising several instructions for causing a computer device (which may be a personal computer, a server, a network device, etc.) to perform all or part of the steps of the method according to the embodiments of the present application. And the aforementioned storage medium includes: a U-disk, a removable hard disk, a Read-Only Memory (ROM), a random access Memory (RAM, random Access Memory), a magnetic disk, or an optical disk, or other various media capable of storing program codes.
In the description of the present application, it should be noted that the directions or positional relationships indicated by the terms "center", "upper", "lower", "left", "right", "vertical", "horizontal", "inner", "outer", etc. are based on the directions or positional relationships shown in the drawings, are merely for convenience of describing the present application and simplifying the description, and do not indicate or imply that the devices or elements referred to must have a specific orientation, be configured and operated in a specific orientation, and thus should not be construed as limiting the present application. Furthermore, the terms "first," "second," and "third" are used for descriptive purposes only and are not to be construed as indicating or implying relative importance.
Finally, it should be noted that: the above examples are only specific embodiments of the present application, and are not intended to limit the scope of the present application, but it should be understood by those skilled in the art that the present application is not limited thereto, and that the present application is described in detail with reference to the foregoing examples: any person skilled in the art may modify or easily conceive of the technical solution described in the foregoing embodiments, or perform equivalent substitution of some of the technical features, while remaining within the technical scope of the present disclosure; such modifications, changes or substitutions do not depart from the spirit and scope of the technical solutions of the embodiments of the present application, and are intended to be included in the scope of the present application. Therefore, the protection scope of the present application shall be subject to the protection scope of the claims.

Claims (14)

The model training module is used for determining a training sample subset corresponding to each round of training and a current fitting image based on the training sample set and the fitting image set, and executing the following operations for each round of training: inputting samples in the current training sample subset into the initial image classification model to obtain a first feature vector and a prediction label of each sample; the first feature vector is a vector output by a first middle layer of the initial image classification model; extracting features of the current fitting image through a second middle layer of the initial image classification model to obtain a second feature vector corresponding to the current fitting image; calculating a feature loss function value of the current training according to a first feature vector corresponding to the positive sample in the current training sample subset and a second feature vector corresponding to the current fitting image; calculating a cross entropy loss function value of the current training according to the prediction label and the real label corresponding to each sample in the current training sample subset; and determining the total loss value of the round of training based on the characteristic loss function value and the cross entropy loss function value of the round of training, and performing reverse gradient propagation training on the initial image classification model according to the total loss value of the round of training until the training round reaches the preset times or the total loss value converges to the preset convergence threshold value, so as to obtain the target recognition model.
CN202110242083.5A2021-03-042021-03-04Target recognition model training method and device and electronic equipmentActiveCN112990432B (en)

Priority Applications (1)

Application NumberPriority DateFiling DateTitle
CN202110242083.5ACN112990432B (en)2021-03-042021-03-04Target recognition model training method and device and electronic equipment

Applications Claiming Priority (1)

Application NumberPriority DateFiling DateTitle
CN202110242083.5ACN112990432B (en)2021-03-042021-03-04Target recognition model training method and device and electronic equipment

Publications (2)

Publication NumberPublication Date
CN112990432A CN112990432A (en)2021-06-18
CN112990432Btrue CN112990432B (en)2023-10-27

Family

ID=76352849

Family Applications (1)

Application NumberTitlePriority DateFiling Date
CN202110242083.5AActiveCN112990432B (en)2021-03-042021-03-04Target recognition model training method and device and electronic equipment

Country Status (1)

CountryLink
CN (1)CN112990432B (en)

Families Citing this family (65)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN113421212B (en)*2021-06-232023-06-02华侨大学 A medical image enhancement method, device, equipment and medium
CN113505820B (en)*2021-06-232024-02-06北京阅视智能技术有限责任公司Image recognition model training method, device, equipment and medium
CN113435348A (en)*2021-06-292021-09-24上海商汤智能科技有限公司Vehicle type identification method and training method, device, equipment and storage medium
CN113191461B (en)*2021-06-292021-09-17苏州浪潮智能科技有限公司Picture identification method, device and equipment and readable storage medium
CN113435525B (en)*2021-06-302024-06-21平安科技(深圳)有限公司Classification network training method, device, computer equipment and storage medium
CN113486804B (en)*2021-07-072024-02-20科大讯飞股份有限公司Object identification method, device, equipment and storage medium
CN113657406B (en)*2021-07-132024-04-23北京旷视科技有限公司Model training and feature extraction method and device, electronic equipment and storage medium
CN113254435B (en)*2021-07-152021-10-29北京电信易通信息技术股份有限公司Data enhancement method and system
CN113822427B (en)*2021-07-292025-08-15腾讯科技(深圳)有限公司Model training method, image matching method, device and storage medium
CN113962280A (en)*2021-08-122022-01-21京东科技控股股份有限公司Classification model training method and device, emotion data classification method and related equipment
CN113642481A (en)*2021-08-172021-11-12百度在线网络技术(北京)有限公司 Identification method, training method, device, electronic device and storage medium
CN113468108B (en)*2021-09-062021-11-12辰风策划(深圳)有限公司Enterprise planning scheme intelligent management classification system based on characteristic data identification
CN113762508B (en)*2021-09-062024-10-22京东鲲鹏(江苏)科技有限公司Training method, device, equipment and medium for image classification network model
CN113808021B (en)*2021-09-172024-06-28北京金山云网络技术有限公司Image processing and model training method and device and electronic equipment
CN113850219B (en)*2021-09-302024-07-02广州文远知行科技有限公司Data collection method, device, vehicle and storage medium
CN113807316B (en)*2021-10-082023-12-12南京恩博科技有限公司Training method and device of smoke concentration estimation model, electronic equipment and medium
CN113963236A (en)*2021-11-022022-01-21北京奕斯伟计算技术有限公司Target detection method and device
CN113989899B (en)*2021-11-082025-10-03北京百度网讯科技有限公司 Method, device and storage medium for determining feature extraction layer in face recognition model
CN114220041A (en)*2021-11-122022-03-22浙江大华技术股份有限公司Target recognition method, electronic device, and storage medium
CN114091594A (en)*2021-11-152022-02-25北京市商汤科技开发有限公司 Model training method and device, equipment and storage medium
CN116152938B (en)*2021-11-182025-02-18腾讯科技(深圳)有限公司Method, device and equipment for training identity recognition model and transferring electronic resources
CN114399671A (en)*2021-11-302022-04-26际络科技(上海)有限公司Target identification method and device
CN114417959B (en)*2021-12-062022-12-02浙江大华技术股份有限公司Correlation method for feature extraction, target identification method, correlation device and apparatus
CN114186097A (en)*2021-12-102022-03-15北京百度网讯科技有限公司 Method and apparatus for training a model
CN114241374B (en)*2021-12-142022-12-13百度在线网络技术(北京)有限公司Training method of live broadcast processing model, live broadcast processing method, device and equipment
CN114494794A (en)*2021-12-162022-05-13苏州安智汽车零部件有限公司Image semi-automatic labeling model training method for automatic driving
CN114266308B (en)*2021-12-212025-06-13浙江网商银行股份有限公司 Detection model training method and device, image detection method and device
CN114255381B (en)*2021-12-232023-05-12北京瑞莱智慧科技有限公司Training method of image recognition model, image recognition method, device and medium
CN114419391A (en)*2021-12-272022-04-29北京三快在线科技有限公司 Target image recognition method and device, electronic device and readable storage medium
CN114548213B (en)*2021-12-292025-07-22浙江大华技术股份有限公司Model training method, image recognition method, terminal device and computer medium
CN114332538B (en)*2021-12-302025-07-11中国农业银行股份有限公司 Image classification model training method, image classification method, device and storage medium
CN114004963B (en)*2021-12-312022-03-29深圳比特微电子科技有限公司Target class identification method and device and readable storage medium
CN114565016B (en)*2022-01-242025-04-22有米科技股份有限公司 Training of label recognition model, method and device for recognizing image labels
CN114445662B (en)*2022-01-252025-07-11南京理工大学 A robust image classification method and system based on label embedding
CN114496118B (en)*2022-01-262025-06-24郑州安图生物工程股份有限公司 Drug sensitivity result identification method, device, electronic device and readable storage medium
CN114565082B (en)*2022-03-012025-09-05浙江工业大学 A multi-scale deep learning recognition method for sequence data
CN114612717B (en)*2022-03-092023-05-26四川大学华西医院 AI model training label generation method, training method, use method and equipment
CN114677255A (en)*2022-03-172022-06-28北京中交兴路信息科技有限公司Method and device for identifying vehicle body in truck picture, storage medium and terminal
CN114722826B (en)*2022-04-072024-02-02平安科技(深圳)有限公司Model training method and device, electronic equipment and storage medium
CN114827460A (en)*2022-04-152022-07-29武汉理工大学Cloud deck image following method and device based on brushless motor control and electronic equipment
CN114581838B (en)*2022-04-262022-08-26阿里巴巴达摩院(杭州)科技有限公司 Image processing method, device and cloud device
CN114782996A (en)*2022-05-102022-07-22西华师范大学 Image recognition processing method, device, electronic device and storage medium
CN114648680B (en)*2022-05-172022-08-16腾讯科技(深圳)有限公司 Training method, device, equipment and medium for image recognition model
CN115034327B (en)*2022-06-222024-08-13支付宝(杭州)信息技术有限公司External data application, user identification method, device and equipment
CN115100717B (en)*2022-06-292024-07-19腾讯科技(深圳)有限公司Training method of feature extraction model, and recognition method and device of cartoon object
CN114866162B (en)*2022-07-112023-09-26中国人民解放军国防科技大学 Signal data enhancement method and system and communication radiation source identification method and system
CN115082740B (en)*2022-07-182023-09-01北京百度网讯科技有限公司Target detection model training method, target detection device and electronic equipment
CN116091797A (en)*2022-07-252023-05-09网易(杭州)网络有限公司Image similarity determination method and training method and device for model of image similarity determination method
CN115424294A (en)*2022-07-272022-12-02浙江大华技术股份有限公司Training method of wearing detection model, wearing detection method and related equipment
CN115063753B (en)*2022-08-172023-05-12苏州魔视智能科技有限公司Safety belt wearing detection model training method and safety belt wearing detection method
CN117034219B (en)*2022-09-092024-07-19腾讯科技(深圳)有限公司Data processing method, device, equipment and readable storage medium
CN115375978B (en)*2022-10-272023-02-03北京闪马智建科技有限公司Behavior information determination method and apparatus, storage medium, and electronic apparatus
CN115858836A (en)*2022-12-272023-03-28吉林大学 Image retrieval method and device, device, and computer-readable storage medium
CN116127067B (en)*2022-12-282023-10-20北京明朝万达科技股份有限公司Text classification method, apparatus, electronic device and storage medium
CN116137061B (en)*2023-04-202023-08-01北京睿芯通量科技发展有限公司Training method and device for quantity statistical model, electronic equipment and storage medium
CN116912618A (en)*2023-06-162023-10-20平安科技(深圳)有限公司Image classification model training method and device, electronic equipment and storage medium
CN116935363B (en)*2023-07-042024-02-23东莞市微振科技有限公司Cutter identification method, cutter identification device, electronic equipment and readable storage medium
CN117171559A (en)*2023-08-112023-12-05深圳数联天下智能科技有限公司Training method and related device for human body activity type recognition model
CN117058100B (en)*2023-08-142024-10-18阿里巴巴达摩院(杭州)科技有限公司 Image recognition method, electronic device and computer readable storage medium
CN116958787A (en)*2023-08-172023-10-27中国人民财产保险股份有限公司Training method of image recognition model, image recognition method and related equipment
CN117058493B (en)*2023-10-132024-02-13之江实验室 A security defense method, device and computer equipment for image recognition
CN118097595B (en)*2024-02-282024-10-15小米汽车科技有限公司Deceleration strip identification method and device, storage medium and vehicle
CN118506113B (en)*2024-07-192024-10-01武汉数聚速达网络科技有限责任公司Image recognition model training method and system based on deep learning
CN119810562B (en)*2025-01-102025-09-26北京交通大学Training method for railway intrusion target classification model, electronic equipment and storage medium
CN120510456B (en)*2025-07-212025-09-12苏州大学Fine-grained target detection and identification method and device and readable storage medium

Citations (3)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN108681774A (en)*2018-05-112018-10-19电子科技大学Based on the human body target tracking method for generating confrontation network negative sample enhancing
WO2019184124A1 (en)*2018-03-302019-10-03平安科技(深圳)有限公司Risk-control model training method, risk identification method and apparatus, and device and medium
CN111046959A (en)*2019-12-122020-04-21上海眼控科技股份有限公司Model training method, device, equipment and storage medium

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN110008842A (en)*2019-03-092019-07-12同济大学A kind of pedestrian's recognition methods again for more losing Fusion Model based on depth

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
WO2019184124A1 (en)*2018-03-302019-10-03平安科技(深圳)有限公司Risk-control model training method, risk identification method and apparatus, and device and medium
CN108681774A (en)*2018-05-112018-10-19电子科技大学Based on the human body target tracking method for generating confrontation network negative sample enhancing
CN111046959A (en)*2019-12-122020-04-21上海眼控科技股份有限公司Model training method, device, equipment and storage medium

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
少量样本下基于孪生CNN的SAR目标识别;王博威;潘宗序;胡玉新;马闻;;雷达科学与技术(06);全文*

Also Published As

Publication numberPublication date
CN112990432A (en)2021-06-18

Similar Documents

PublicationPublication DateTitle
CN112990432B (en)Target recognition model training method and device and electronic equipment
Bendale et al.Towards open set deep networks
CN111814902A (en) Target detection model training method, target recognition method, device and medium
US20210097344A1 (en)Target identification in large image data
CN113469088A (en)SAR image ship target detection method and system in passive interference scene
CN104504366A (en)System and method for smiling face recognition based on optical flow features
CN110135505B (en)Image classification method and device, computer equipment and computer readable storage medium
CN110096938A (en)A kind for the treatment of method and apparatus of action behavior in video
CN110909784B (en)Training method and device of image recognition model and electronic equipment
CN111325067A (en) Identification method, device and electronic equipment of illegal video
CN114821282A (en)Image detection model and method based on domain confrontation neural network
CN114067401B (en) Target detection model training and identity verification method and device
CN111815582A (en) A two-dimensional code region detection method with improved background prior and foreground prior
CN114821022A (en)Credible target detection method integrating subjective logic and uncertainty distribution modeling
CN118781471B (en)Model training method, object recognition device and electronic equipment
CN111539456A (en)Target identification method and device
CN113902041A (en) Method and device for training and authentication of target detection model
CN117523218A (en)Label generation, training of image classification model and image classification method and device
CN112101114A (en)Video target detection method, device, equipment and storage medium
CN112784691B (en)Target detection model training method, target detection method and device
CN112446311A (en)Object re-recognition method, electronic device, storage medium and device
CN110490058B (en)Training method, device and system of pedestrian detection model and computer readable medium
Yatbaz et al.Run-time introspection of 2d object detection in automated driving systems using learning representations
CN118447366A (en)Contraband identification model construction method and contraband identification method
CN112990145B (en)Group-sparse-based age estimation method and electronic equipment

Legal Events

DateCodeTitleDescription
PB01Publication
PB01Publication
SE01Entry into force of request for substantive examination
SE01Entry into force of request for substantive examination
GR01Patent grant
GR01Patent grant

[8]ページ先頭

©2009-2025 Movatter.jp