Movatterモバイル変換


[0]ホーム

URL:


CN111179217A - A multi-scale target detection method in remote sensing images based on attention mechanism - Google Patents

A multi-scale target detection method in remote sensing images based on attention mechanism
Download PDF

Info

Publication number
CN111179217A
CN111179217ACN201911229726.1ACN201911229726ACN111179217ACN 111179217 ACN111179217 ACN 111179217ACN 201911229726 ACN201911229726 ACN 201911229726ACN 111179217 ACN111179217 ACN 111179217A
Authority
CN
China
Prior art keywords
attention
feature
remote sensing
target
features
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201911229726.1A
Other languages
Chinese (zh)
Inventor
于瑞国
汪嫱
王臣汉
李雪威
喻梅
姜汉
高洁
刘志强
应翔
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tianjin University
Original Assignee
Tianjin University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tianjin UniversityfiledCriticalTianjin University
Priority to CN201911229726.1ApriorityCriticalpatent/CN111179217A/en
Publication of CN111179217ApublicationCriticalpatent/CN111179217A/en
Pendinglegal-statusCriticalCurrent

Links

Images

Classifications

Landscapes

Abstract

Translated fromChinese

本发明公开一种基于注意力机制的遥感图像多尺度目标检测方法,针对遥感图像背景复杂、小目标众多,目标尺寸多样化的特点,采用特征金字塔网络进行不同尺度目标的预测。为了更加准确地关注到目标所在区域,在网络中引入注意力机制,有效地提高目标检测精度。其中,包括全局空间注意力模块和像素特征注意力模块。全局空间注意力模块通过从浅层特征中提取空间位置相关信息,并与深层特征融合,强化深层特征的位置表达能力。像素特征注意力模块以多尺度卷积核产生与输入同尺寸的特征图,并以通道注意力的方式为每层特征图赋予权值,得到细节信息表现良好的像素级注意力图在不增加计算量的同时增加了感受野。

Figure 201911229726

The invention discloses a multi-scale target detection method in remote sensing images based on an attention mechanism. Aiming at the characteristics of complex backgrounds, numerous small targets and diverse target sizes in remote sensing images, a feature pyramid network is used to predict targets of different scales. In order to pay more attention to the area where the target is located, an attention mechanism is introduced into the network to effectively improve the target detection accuracy. Among them, the global spatial attention module and the pixel feature attention module are included. The global spatial attention module enhances the position expression ability of deep features by extracting spatial position-related information from shallow features and fusing them with deep features. The pixel feature attention module uses a multi-scale convolution kernel to generate a feature map of the same size as the input, and assigns weights to each layer of feature maps in the form of channel attention, and obtains a pixel-level attention map with good detailed information performance without increasing computation. while increasing the receptive field.

Figure 201911229726

Description

Attention mechanism-based remote sensing image multi-scale target detection method
Technical Field
The invention belongs to the field of artificial intelligence, deep learning and computer vision, relates to a target detection and salient feature extraction technology, and particularly relates to a remote sensing image target detection technology based on an attention mechanism.
Background
In the related art, the target detection technology mainly aims at natural images, and the methods are mainly divided into two types: one is a target detection method of two-stage, which is widely applied. The Two-stage method is represented by the method of fast R-CNN, FPN and the like, and the target detection is divided into Two stages, namely region generation and target classification. Firstly, feature extraction is carried out by utilizing a pre-trained CNN network, then a certain number of candidate regions are generated through a region generation network RPN, and then fine-grained object classification and frame regression are carried out on the candidate regions. The Two-stage method can obtain a relatively accurate detection result, but has the disadvantage that the real-time detection speed cannot be achieved because a large number of candidate frames with different scales need to be generated.
The other is a target detection method of one-stage. The One-stage method can directly extract features in the network to predict the classification and the position of the object, and greatly improves the detection speed although some accuracy is lost. For example, represented by YOLO, the image to be detected is scaled to a uniform size, and in order to detect targets at different positions, the image is equally divided into grids, and if the center of a certain target falls in a grid cell, the grid cell is responsible for predicting the target. The detection speed of the YOLO is improved by about 10 times compared with that of FasterR-CNN. However, the detection accuracy of the one-stage method is not as good as that of the two-stage method, and the size of the mesh division has a large influence on the detection result.
In the aspect of target detection of the remote sensing image, the remote sensing image has the characteristics of complex background, diversified angles, different sizes, a large number of small targets and the like, so that the difficulty is relatively high. However, the target detection of the existing remote sensing image still adopts the same method as the natural image, and no solution is provided for the problems that the network cannot accurately notice key image areas and the like due to complicated backgrounds in the remote sensing image.
Disclosure of Invention
The invention aims to overcome the defects in the prior art, provides a remote sensing image multi-scale target detection method based on an attention mechanism, and can effectively optimize the problem that the target is difficult to accurately position in a complicated background when the traditional detection method aiming at a natural image faces the remote sensing image.
The purpose of the invention is realized by the following technical scheme:
a remote sensing image multi-scale target detection method based on an attention mechanism comprises the following steps:
firstly, extracting features of different scales from a remote sensing image through a feature pyramid network, extracting shallow features from a lower layer network, and extracting deep features from a higher layer network;
step two, introducing a global spatial attention module (GSA), extracting shallow features from the lower layer network, and optimizing deep features by using attention information of the shallow features;
thirdly, the characteristics extracted from the last layer pass through a pixel characteristic attention module (PFA), so that the receptive field is enlarged, and the loss of detail information is reduced;
step four, the feature maps with different scales generated in the step one are sent to a candidate frame generation network (RPN) to generate candidate frames;
mapping the target in the candidate frame to feature maps with different scales, deforming the candidate frame on each feature map to a fixed size through ROI posing, and extracting the features of the related target;
and step six, classifying the candidate frames and performing frame regression.
Further, the step one specifically comprises the following steps: a characteristic pyramid network is adopted, the characteristic pyramid network comprises a bottom-up part and a top-down part, the bottom-up part is a convolution and pooling process, and the resolution is continuously reduced; taking each block as a stage, wherein the total number of the stages is five; the top-down process performs up-sampling from a feature map ofstage 5, and then performs horizontal connection with a feature map with the same size generated from bottom to top; and then, carrying out convolution on each fusion result by adopting a convolution kernel of 3x3 to eliminate the aliasing effect of up-sampling.
Further, the second step is the operation carried out in the process of extracting the features from bottom to top in the first step, firstly, the features of the shallow layer are subjected to global average pooling and global maximum pooling, and then the two attention maps are fused by pixel-by-pixel addition to generate a global attention map; and finally multiplying the global attention map by the deep feature map.
Furthermore, step three is to introduce a pixel feature attention module afterstage 5 from bottom to top in step one, and endow each layer of the feature map with a weight value in a channel attention mode; the global information and the detail information are fused, and the positioning capability of the network to the target area is improved.
And further, the step four is that the candidate box generation network extracts candidate boxes with different sizes by processing the extracted convolution feature maps with different scales by using anchors, and searches for a predefined number of areas possibly containing the target.
Compared with the prior art, the technical scheme of the invention has the following beneficial effects:
1. the remote sensing image multi-scale target detection method based on the attention mechanism provides a new idea for target detection of remote sensing images, the network can be more accurately positioned to the position of a target by introducing the attention mechanism, and the method has good generalization and good effect on target detection of natural images.
And 2, the GSA module effectively reduces the information loss of small targets, improves the training speed, accelerates the convergence of the network and can achieve better detection effect on targets with different sizes.
And 3, the PFA module increases the reception field, reduces the false detection of the network on the premise of not increasing the calculated amount, and effectively improves the accuracy of target detection.
Drawings
Fig. 1 is a schematic view of the overall structure of the present invention.
FIG. 2 is a diagram of a global spatial attention module.
FIG. 3 is a schematic diagram of a pixel feature attention module.
Fig. 4a and 4b are graphs of target detection results of the FPN method and the inventive method (AMOD), respectively.
Detailed Description
The invention is described in further detail below with reference to the figures and specific examples. It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.
The invention provides a remote sensing image multi-scale target detection model based on an attention mechanism, which is shown in figure 1 and is an overall structural schematic diagram of the invention, wherein two modules are shown in figures 2 and 3, and the overall processing steps are as follows:
and S1.1, adopting ResNet101 as a feature extraction network, extracting shallow features and deep features through convolution and pooling, wherein the shallow features are high in resolution and are positioned at the lower layer of the extraction network, and the deep features are low in resolution and are positioned at the upper layer of the extraction network.
Step S2.1: by extracting the global spatial attention of the shallow layer and fusing the global spatial attention with the deep layer features, the problem of small object information loss caused by low resolution of the deep layer features is optimized. For F ∈ RH×W×CRespectively carrying out global average pooling and global maximum pooling on the dimension C to obtain F' epsilon RH×W×1Then fused by means of pixel-by-pixel addition, as shown in formula (1):
Figure BDA0002303204660000031
wherein F is a shallow feature map, FgsaFor the output result of global spatial attention, sigmoid is sigmoid activation function, AvgPool is global average pooling, MaxPool is global maximum pooling,
Figure BDA0002303204660000032
is added pixel by pixel.
Step S2.2: downsampling the shallow features to the same size as the deep features, and multiplying by them, i.e. transferring more accurate position information to the deep features, as shown in equation (2):
Figure BDA0002303204660000033
wherein down represents down sampling, F' is a deep feature map, FgsaIs a global attention map for shallow features,
Figure BDA0002303204660000034
is pixel-by-pixel multiplication.
Step S3.1: the channel attention branch of the pixel feature attention is calculated. In order to obtain global information of each channel of the feature map, the feature map is subjected to global average pooling, the size of the feature map is changed to be 1 × 1 × C, then 1 × 1 convolution is performed to integrate the feature map, and the calculation formula is shown as (3):
MG(F)=σ(f1×1(AvgPool(F))) (3)
wherein f is1×1Representing a convolution of 1x1, AvgPool for global mean pooling, σ for relu activation function, MG(F) Is the output of the channel attention branch.
Step S3.2: the convolution branch of the pixel feature attention is calculated. The feature map is convolved by 1 × 1, as shown in equation (4):
MC(F)=f1×1(F) (4)
wherein f is1×1Representing a convolution with a kernel of 1, MC(F) Is the output of the convolution branch.
Step S3.3: and calculating the multi-scale convolution branch of the attention of the pixel feature. A convolution kernel of 5x5 with a step size of 2 is applied to the input to obtain a feature map of halved size. Then, using 5x5, convolution kernels withstep size 1 and convolution kernels withsize 3 andstep sizes 2 and 1, respectively, we get the feature maps asinput sizes 1/2 and 1/4, respectively. And finally, upsampling the smaller feature map, fusing the smaller feature map and the smaller feature map together, and then, upsampling again to restore the size of the original feature map, wherein the formula is shown as (5):
Figure BDA0002303204660000041
where up represents upsampling, f3×3And f5×5Respectively denoted as convolutions with convolution kernels of 3 and 5,
Figure BDA0002303204660000042
for pixel-by-pixel addition, Mm(F) Is the output of the multi-scale convolution branch.
Step S3.4: inputting the feature map F into feature maps obtained from different branches of the pixel feature attention module for fusion, as shown in formulas (6) and (7):
Figure BDA0002303204660000043
Fout=M(F) (7)
where sigma is the relu activation function,
Figure BDA0002303204660000044
in order to add the pixels one by one,
Figure BDA0002303204660000045
for pixel-by-pixel multiplication, MG(F)、MC(F)、Mm(F) Respectively obtaining results of a feature graph F through a global channel attention branch, a convolution branch and a multi-scale convolution branch, wherein F is the result of the feature graph FoutIs the output of the pixel feature attention module.
Step S3.5: the top-down process starts with the feature map ofstage 5 being upsampled and then cross-concatenated with the same size feature map generated from the bottom-up. And performing convolution on each fusion result by adopting a convolution kernel of 3x3 to generate feature maps with different scales.
Step S4.1: the feature maps of different scales are convoluted by adopting a convolution kernel of 3x3, and a plurality of regions with different length-width ratios of different scales are learned by taking the central point of each region of the feature maps as the center, and the regions are called anchors. And calculating the intersection ratio IOU of each anchor and the true value, wherein the IOU is called positive anchor which is more than 0.5 as shown in an equation (8), and the IOU is called negative anchor otherwise. The area size set by the method anchor is 322,642,1282,2242,2562And the length-width ratios of the three different components are {1:1,1:2,2:1 }.
Figure BDA0002303204660000046
Wherein, box represents the extracted candidate frame, and gt is the real frame of each target.
Step S4.2: and mapping the candidate frame to the feature map by using the candidate frame generated by the RPN network in the step S4.1 and the feature map with different scales generated in the step S3.
Step S5: and deforming the candidate frame in the feature map in the step S4.2 to a fixed size, and then performing full-connection operation to perform prediction. For each candidate frame, selecting a stage for prediction according to the size of the candidate frame, as shown in formula (9):
Figure BDA0002303204660000051
where w and h represent the width and height of the candidate box, respectively. Parameter k0Is a positive integer, and represents that w × h is 1282In the case of (2), the target stage to which the candidate box should be mapped is generally fixed to 4 in the experiment. Reference area 128 is determined by the size of the data set. k is the stage intended to detect the target.
Step S5.2: the classification of the specific category is performed using softmax, as shown in equation (10):
Lcls(pi,pi*)=-log[pi*pi+(1-pi*)(1-pi)](10)
wherein p isiPredicting the probability of being the target, p, for the Anchori*Labels for true values, Lcls(pi,pi*) Is a classification loss function.
Step S5.3: the accurate position of the object is obtained by performing bounding box regression with L1 loss, as shown in formula (11):
Figure BDA0002303204660000052
wherein t isi={tx,ty,tw,thIs a vector representing the 4 coordinates of the predicted box, ti*Is the coordinate vector, p, of the real box corresponding to posivetachoriPredicting the probability of being the target, p, for the Anchori*Labels for true values, Lreg(ti,ti*) Is the bounding box regression loss function.
Step S5.4: performing combined training of classification and bounding box regression, wherein the total loss function is shown as formula (12):
Figure BDA0002303204660000053
wherein p isiPredicting the probability of being the target, p, for the Anchori*Tag for true value, ti={tx,ty,tw,thIs a vector representing the 4 coordinates of the predicted box, ti*Is the coordinate vector of the real frame corresponding to the positive anchor, Lcls(pi,pi*) As a function of classification loss, Lreg(ti,ti*) As a function of the regression loss of the bounding box, λ is the balance weight, NclsIs the size of mini-batch,NregIs the number of anchors.
In order to verify that the method has better generalization, the invention performs experiments on the remote sensing data set NWPU (comprising 10 target classes), the RSOD (comprising 4 target classes) and the natural data set PASCAL VOC07 (comprising 20 target classes). The remote sensing data set is as follows 8: a ratio of 2 randomly partitions the training set and the test set.
The Average Precision (AP) of each category and the average precision mean (mAP) of all categories are used as evaluation criteria to quantitatively evaluate the detection effect, with larger values being better. As shown in Table 1, the experimental results show that the model provided by the invention achieves the best detection effect for both remote sensing images and natural images. On the NWPU data set, the detection effect of small targets such as storefront, ship and the like and longer targets such as bridge and harbor is greatly improved, and is improved by 3.75% compared with the FPN method; the detection effect of each category on the RSOD data set is improved to different degrees; on the PASCAL VOC07, the SSD300 and the YOLO3 adopt PASCAL VOCs 07 and 12 as training sets and test sets, and the other models only adopt PASCAL VOC07 as the training sets, so that the AMOD model can achieve the best effect under fewer training sets. The model provided by the invention is 0.8% higher than YOLO3 and 1.3% higher than FPN.
TABLE 1 comparison of the detection results of different methods on NWPU-VHR 10, RSOD and PASCAL VOC07 data sets
Figure BDA0002303204660000061
In table 1, by comparing the maps of the target detection of the most advanced method at present, it can be known that, for both the remote sensing image and the natural image, the multi-scale target detection method of the remote sensing image based on the attention mechanism provided by the present invention can effectively improve the accuracy of target detection. Especially, on the NWPU data set, the improvement is 9.2 percent compared with that of fast R-CNN and 5.2 percent compared with that ofYOLO 3.
In the training speed, the remote sensing image multi-scale target detection method based on the attention mechanism can be used for positioning the target area more quickly, so that the training speed is improved, and the convergence is accelerated. As shown in table 2, the improved AMOD method has a significant improvement in speed compared to the reference FPN method. Training speed was increased 37.5% on NWPU data set, 20% on RSOD and 35% on PASCAL VOC 07.
TABLE 2 comparison of training speeds of different methods on NWPU, RSOD, PASCAL VOC07 data sets
Number of FPN iterationsNumber of AMOD iterations
NWPU4000025000
RSOD5000040000
PASCAL VOC 07200000130000
The partial remote sensing image detection results are shown in fig. 4a and 4b, and it can be seen that the FPN method does not detect all bridges and tennis courts, but the improved AMOD method of the present invention increases the receptive field, reduces missing detection, and effectively improves the target detection accuracy due to the addition of the pixel feature attention module.
The present invention is not limited to the above-described embodiments. The foregoing description of the specific embodiments is intended to describe and illustrate the technical solutions of the present invention, and the above specific embodiments are merely illustrative and not restrictive. Those skilled in the art can make many changes and modifications to the invention without departing from the spirit and scope of the invention as defined in the appended claims.

Claims (5)

1. A remote sensing image multi-scale target detection method based on an attention mechanism is characterized by comprising the following steps:
firstly, extracting features of different scales from a remote sensing image through a feature pyramid network, extracting shallow features from a low-level network, and extracting deep features from a high-level network;
introducing a global spatial attention module (GSA) to extract attention information of shallow features from a lower layer network and optimize deep features;
thirdly, the characteristics extracted from the last layer pass through a pixel characteristic attention module (PFA), so that the receptive field is enlarged, and the loss of detail information is reduced;
step four, the feature maps with different scales generated in the step one are sent to a candidate frame generation network (RPN) to generate candidate frames;
mapping the target in the candidate frame to feature maps with different scales, deforming the candidate frame on each feature map to a fixed size through ROI posing, and extracting the features of the related target;
and step six, classifying the candidate frames and performing frame regression.
2. The method for detecting the multi-scale target of the remote sensing image based on the attention mechanism is characterized in that the first step specifically comprises the following steps: a characteristic pyramid network is adopted, the characteristic pyramid network comprises a bottom-up part and a top-down part, the bottom-up part is a convolution and pooling process, and the resolution is continuously reduced; taking each block as a stage, wherein the total number of the stages is five; the top-down process performs up-sampling from a feature map of stage 5, and then performs horizontal connection with a feature map with the same size generated from bottom to top; and then, carrying out convolution on each fusion result by adopting a convolution kernel of 3x3 to eliminate the aliasing effect of up-sampling.
3. The method for detecting the multi-scale target of the remote sensing image based on the attention mechanism is characterized in that the second step is an operation performed in the process of extracting the features from bottom to top in the first step, the features of the shallow layer are subjected to global average pooling and global maximum pooling, and then the two attention diagrams are fused by pixel-by-pixel addition to generate a global attention diagram; and finally multiplying the global attention map by the deep feature map.
4. The method for detecting the multi-scale target of the remote sensing image based on the attention mechanism is characterized in that in the third step, a pixel feature attention module is introduced after the stage 5 from the bottom to the top in the first step, and each layer of a feature map is endowed with a weight value in a channel attention mode; the global information and the detail information are fused, and the positioning capability of the network to the target area is improved.
5. The method for detecting the multi-scale target of the remote sensing image based on the attention mechanism as claimed in claim 1, wherein the fourth step is that the candidate frame generation network extracts candidate frames with different sizes by processing the extracted convolution feature maps with different scales by using anchors, and searches for a predefined number of areas which may contain the target.
CN201911229726.1A2019-12-042019-12-04 A multi-scale target detection method in remote sensing images based on attention mechanismPendingCN111179217A (en)

Priority Applications (1)

Application NumberPriority DateFiling DateTitle
CN201911229726.1ACN111179217A (en)2019-12-042019-12-04 A multi-scale target detection method in remote sensing images based on attention mechanism

Applications Claiming Priority (1)

Application NumberPriority DateFiling DateTitle
CN201911229726.1ACN111179217A (en)2019-12-042019-12-04 A multi-scale target detection method in remote sensing images based on attention mechanism

Publications (1)

Publication NumberPublication Date
CN111179217Atrue CN111179217A (en)2020-05-19

Family

ID=70650174

Family Applications (1)

Application NumberTitlePriority DateFiling Date
CN201911229726.1APendingCN111179217A (en)2019-12-042019-12-04 A multi-scale target detection method in remote sensing images based on attention mechanism

Country Status (1)

CountryLink
CN (1)CN111179217A (en)

Cited By (63)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN111652152A (en)*2020-06-042020-09-11上海眼控科技股份有限公司Crowd density detection method and device, computer equipment and storage medium
CN111738110A (en)*2020-06-102020-10-02杭州电子科技大学 Vehicle target detection method in remote sensing images based on multi-scale attention mechanism
CN111798921A (en)*2020-06-222020-10-20武汉大学RNA binding protein prediction method and device based on multi-scale attention convolution neural network
CN111814726A (en)*2020-07-202020-10-23南京工程学院 A detection method for detecting robot visual objects
CN111815677A (en)*2020-07-102020-10-23中山大学新华学院 Target tracking method, device, terminal device and readable storage medium
CN111860398A (en)*2020-07-282020-10-30河北师范大学 Remote sensing image target detection method, system and terminal device
CN111914804A (en)*2020-08-182020-11-10中科弘云科技(北京)有限公司Multi-angle rotation remote sensing image small target detection method
CN111914917A (en)*2020-07-222020-11-10西安建筑科技大学Target detection improved algorithm based on feature pyramid network and attention mechanism
CN111914726A (en)*2020-07-282020-11-10联芯智能(南京)科技有限公司 Pedestrian detection method based on multi-channel adaptive attention mechanism
CN111931684A (en)*2020-08-262020-11-13北京建筑大学 A weak and small target detection method based on discriminative features of video satellite data
CN111950586A (en)*2020-07-012020-11-17银江股份有限公司 A Target Detection Method Introducing Bidirectional Attention
CN112016472A (en)*2020-08-312020-12-01山东大学Driver attention area prediction method and system based on target dynamic information
CN112016569A (en)*2020-07-242020-12-01驭势科技(南京)有限公司Target detection method, network, device and storage medium based on attention mechanism
CN112084868A (en)*2020-08-102020-12-15北京航空航天大学Target counting method in remote sensing image based on attention mechanism
CN112101189A (en)*2020-09-112020-12-18北京航空航天大学 SAR image target detection method and test platform based on attention mechanism
CN112132216A (en)*2020-09-222020-12-25平安国际智慧城市科技股份有限公司Vehicle type recognition method and device, electronic equipment and storage medium
CN112163580A (en)*2020-10-122021-01-01中国石油大学(华东)Small target detection algorithm based on attention mechanism
CN112183203A (en)*2020-08-262021-01-05北京工业大学Real-time traffic sign detection method based on multi-scale pixel feature fusion
CN112183269A (en)*2020-09-182021-01-05哈尔滨工业大学(深圳)Target detection method and system suitable for intelligent video monitoring
CN112215207A (en)*2020-11-102021-01-12中国人民解放军战略支援部队信息工程大学Remote sensing image airplane target detection method combining multi-scale and attention mechanism
CN112232232A (en)*2020-10-202021-01-15城云科技(中国)有限公司Target detection method
CN112307984A (en)*2020-11-022021-02-02安徽工业大学Safety helmet detection method and device based on neural network
CN112418345A (en)*2020-12-072021-02-26苏州小阳软件科技有限公司Method and device for quickly identifying fine-grained small target
CN112560907A (en)*2020-12-022021-03-26西安电子科技大学Limited pixel infrared unmanned aerial vehicle target detection method based on mixed domain attention
CN112651326A (en)*2020-12-222021-04-13济南大学Driver hand detection method and system based on deep learning
CN112686304A (en)*2020-12-292021-04-20山东大学Target detection method and device based on attention mechanism and multi-scale feature fusion and storage medium
CN112784779A (en)*2021-01-282021-05-11武汉大学Remote sensing image scene classification method based on feature pyramid multilevel feature fusion
CN112800964A (en)*2021-01-272021-05-14中国人民解放军战略支援部队信息工程大学Remote sensing image target detection method and system based on multi-module fusion
CN113011308A (en)*2021-03-152021-06-22山东大学Pedestrian detection method introducing attention mechanism
CN113033448A (en)*2021-04-022021-06-25东北林业大学Remote sensing image cloud-removing residual error neural network system, method and equipment based on multi-scale convolution and attention and storage medium
CN113052188A (en)*2021-03-262021-06-29大连理工大学人工智能大连研究院Method, system, equipment and storage medium for detecting remote sensing image target
CN113223044A (en)*2021-04-212021-08-06西北工业大学Infrared video target detection method combining feature aggregation and attention mechanism
CN113239825A (en)*2021-05-192021-08-10四川中烟工业有限责任公司High-precision tobacco beetle detection method in complex scene
CN113255699A (en)*2021-06-102021-08-13浙江华睿科技有限公司Small target object image detection method and device, electronic equipment and storage medium
CN113327226A (en)*2021-05-072021-08-31北京工业大学Target detection method and device, electronic equipment and storage medium
CN113420838A (en)*2021-08-202021-09-21中国科学院空天信息创新研究院SAR and optical image classification method based on multi-scale attention feature fusion
CN113610024A (en)*2021-08-132021-11-05天津大学 A multi-strategy deep learning remote sensing image small target detection method
CN113793472A (en)*2021-09-152021-12-14应急管理部沈阳消防研究所 Image-based fire detector pose estimation method based on feature deep aggregation network
CN113989645A (en)*2021-11-082022-01-28华东师范大学 Object detection method for large-scale aerial remote sensing images
CN114022682A (en)*2021-11-052022-02-08天津大学Weak and small target detection method based on attention secondary feature fusion mechanism
CN114119582A (en)*2021-12-012022-03-01安徽大学Synthetic aperture radar image target detection method
CN114220015A (en)*2021-12-212022-03-22一拓通信集团股份有限公司Improved YOLOv 5-based satellite image small target detection method
CN114241277A (en)*2021-12-222022-03-25中国人民解放军国防科技大学 Attention-guided multi-feature fusion camouflage target detection method, device, device and medium
CN114332620A (en)*2021-12-302022-04-12杭州电子科技大学 A method for vehicle target recognition in airborne images based on feature fusion and attention mechanism
CN114494728A (en)*2022-02-102022-05-13北京工业大学 A small target detection method based on deep learning
CN114565860A (en)*2022-03-012022-05-31安徽大学 A Multi-dimensional Reinforcement Learning Synthetic Aperture Radar Image Target Detection Method
CN114648802A (en)*2022-05-192022-06-21深圳市海清视讯科技有限公司Method, device and equipment for identifying facial expressions of users
CN114663654A (en)*2022-05-262022-06-24西安石油大学Improved YOLOv4 network model and small target detection method
CN114708511A (en)*2022-06-012022-07-05成都信息工程大学 Remote sensing image target detection method based on multi-scale feature fusion and feature enhancement
CN114926747A (en)*2022-05-312022-08-19常州大学Remote sensing image directional target detection method based on multi-feature aggregation and interaction
CN115115938A (en)*2022-07-252022-09-27中国人民解放军战略支援部队航天工程大学 A method for detecting salient objects in remote sensing images
CN115410102A (en)*2022-08-312022-11-29重庆大学 SAR Image Aircraft Target Detection Method Based on Joint Attention Mechanism
CN115410089A (en)*2022-06-282022-11-29北京理工大学 Optical Remote Sensing Small-Scale Object Detection Method Based on Adaptive Local Context Embedding
CN115424104A (en)*2022-08-192022-12-02西安电子科技大学Target detection method based on feature fusion and attention mechanism
CN115565077A (en)*2022-09-292023-01-03哈尔滨天枢问道技术有限公司Remote sensing image small target detection algorithm based on spatial feature integration
CN115641484A (en)*2022-10-312023-01-24西安电子科技大学Variable-scale remote sensing image target detection method based on ternary feature fusion
CN116229272A (en)*2023-03-142023-06-06中国人民解放军陆军军事交通学院镇江校区 A high-precision remote sensing image detection method and system based on representative point representation
CN116403115A (en)*2023-06-072023-07-07江西啄木蜂科技有限公司Large-format remote sensing image target detection method
CN116740370A (en)*2023-05-182023-09-12北京理工大学Complex target recognition method based on deep self-attention transformation network
CN117789153A (en)*2024-02-262024-03-29浙江驿公里智能科技有限公司Automobile oil tank outer cover positioning system and method based on computer vision
CN118608399A (en)*2024-06-142024-09-06铜陵学院 A method, system, device and medium for fusion of infrared image and visible light image
CN118608938A (en)*2024-05-222024-09-06云南数科林业规划设计有限公司 A Geographical Environment Information Management System
CN118644666A (en)*2024-08-142024-09-13江西师范大学 An image processing method and system for remote sensing target detection scenes

Citations (7)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN109284670A (en)*2018-08-012019-01-29清华大学 A pedestrian detection method and device based on multi-scale attention mechanism
CN109614985A (en)*2018-11-062019-04-12华南理工大学 A target detection method based on densely connected feature pyramid network
CN110084195A (en)*2019-04-262019-08-02西安电子科技大学Remote Sensing Target detection method based on convolutional neural networks
CN110084210A (en)*2019-04-302019-08-02电子科技大学The multiple dimensioned Ship Detection of SAR image based on attention pyramid network
CN110084249A (en)*2019-04-242019-08-02哈尔滨工业大学The image significance detection method paid attention to based on pyramid feature
US20190266712A1 (en)*2018-02-242019-08-29United States Of America As Represented By The Administrator Of The NasaSystem and method for imaging underwater environments using fluid lensing
CN110197182A (en)*2019-06-112019-09-03中国电子科技集团公司第五十四研究所Remote sensing image semantic segmentation method based on contextual information and attention mechanism

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US20190266712A1 (en)*2018-02-242019-08-29United States Of America As Represented By The Administrator Of The NasaSystem and method for imaging underwater environments using fluid lensing
CN109284670A (en)*2018-08-012019-01-29清华大学 A pedestrian detection method and device based on multi-scale attention mechanism
CN109614985A (en)*2018-11-062019-04-12华南理工大学 A target detection method based on densely connected feature pyramid network
CN110084249A (en)*2019-04-242019-08-02哈尔滨工业大学The image significance detection method paid attention to based on pyramid feature
CN110084195A (en)*2019-04-262019-08-02西安电子科技大学Remote Sensing Target detection method based on convolutional neural networks
CN110084210A (en)*2019-04-302019-08-02电子科技大学The multiple dimensioned Ship Detection of SAR image based on attention pyramid network
CN110197182A (en)*2019-06-112019-09-03中国电子科技集团公司第五十四研究所Remote sensing image semantic segmentation method based on contextual information and attention mechanism

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
应翔, IEEE ACCESS*

Cited By (93)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN111652152A (en)*2020-06-042020-09-11上海眼控科技股份有限公司Crowd density detection method and device, computer equipment and storage medium
CN111738110A (en)*2020-06-102020-10-02杭州电子科技大学 Vehicle target detection method in remote sensing images based on multi-scale attention mechanism
CN111798921A (en)*2020-06-222020-10-20武汉大学RNA binding protein prediction method and device based on multi-scale attention convolution neural network
CN111798921B (en)*2020-06-222022-08-05武汉大学RNA binding protein prediction method and device based on multi-scale attention convolution neural network
CN111950586B (en)*2020-07-012024-01-19银江技术股份有限公司Target detection method for introducing bidirectional attention
CN111950586A (en)*2020-07-012020-11-17银江股份有限公司 A Target Detection Method Introducing Bidirectional Attention
CN111815677A (en)*2020-07-102020-10-23中山大学新华学院 Target tracking method, device, terminal device and readable storage medium
CN111814726A (en)*2020-07-202020-10-23南京工程学院 A detection method for detecting robot visual objects
CN111814726B (en)*2020-07-202023-09-22南京工程学院Detection method for visual target of detection robot
CN111914917A (en)*2020-07-222020-11-10西安建筑科技大学Target detection improved algorithm based on feature pyramid network and attention mechanism
CN112016569A (en)*2020-07-242020-12-01驭势科技(南京)有限公司Target detection method, network, device and storage medium based on attention mechanism
CN111914726B (en)*2020-07-282024-05-07联芯智能(南京)科技有限公司 Pedestrian detection method based on multi-channel adaptive attention mechanism
CN111914726A (en)*2020-07-282020-11-10联芯智能(南京)科技有限公司 Pedestrian detection method based on multi-channel adaptive attention mechanism
CN111860398A (en)*2020-07-282020-10-30河北师范大学 Remote sensing image target detection method, system and terminal device
CN112084868A (en)*2020-08-102020-12-15北京航空航天大学Target counting method in remote sensing image based on attention mechanism
CN111914804A (en)*2020-08-182020-11-10中科弘云科技(北京)有限公司Multi-angle rotation remote sensing image small target detection method
CN111931684B (en)*2020-08-262021-04-06北京建筑大学 A weak and small target detection method based on discriminative features of video satellite data
CN111931684A (en)*2020-08-262020-11-13北京建筑大学 A weak and small target detection method based on discriminative features of video satellite data
CN112183203A (en)*2020-08-262021-01-05北京工业大学Real-time traffic sign detection method based on multi-scale pixel feature fusion
CN112183203B (en)*2020-08-262024-05-28北京工业大学Real-time traffic sign detection method based on multi-scale pixel feature fusion
CN112016472B (en)*2020-08-312023-08-22山东大学Driver attention area prediction method and system based on target dynamic information
CN112016472A (en)*2020-08-312020-12-01山东大学Driver attention area prediction method and system based on target dynamic information
CN112101189A (en)*2020-09-112020-12-18北京航空航天大学 SAR image target detection method and test platform based on attention mechanism
CN112101189B (en)*2020-09-112022-09-30北京航空航天大学 SAR image target detection method and test platform based on attention mechanism
CN112183269A (en)*2020-09-182021-01-05哈尔滨工业大学(深圳)Target detection method and system suitable for intelligent video monitoring
CN112183269B (en)*2020-09-182023-08-29哈尔滨工业大学(深圳) A target detection method and system suitable for intelligent video surveillance
CN112132216B (en)*2020-09-222024-04-09平安国际智慧城市科技股份有限公司Vehicle type recognition method and device, electronic equipment and storage medium
CN112132216A (en)*2020-09-222020-12-25平安国际智慧城市科技股份有限公司Vehicle type recognition method and device, electronic equipment and storage medium
CN112163580B (en)*2020-10-122022-05-03中国石油大学(华东)Small target detection algorithm based on attention mechanism
CN112163580A (en)*2020-10-122021-01-01中国石油大学(华东)Small target detection algorithm based on attention mechanism
CN112232232B (en)*2020-10-202022-09-27城云科技(中国)有限公司Target detection method
CN112232232A (en)*2020-10-202021-01-15城云科技(中国)有限公司Target detection method
CN112307984A (en)*2020-11-022021-02-02安徽工业大学Safety helmet detection method and device based on neural network
CN112307984B (en)*2020-11-022023-02-17安徽工业大学 Safety helmet detection method and device based on neural network
CN112215207A (en)*2020-11-102021-01-12中国人民解放军战略支援部队信息工程大学Remote sensing image airplane target detection method combining multi-scale and attention mechanism
CN112560907B (en)*2020-12-022024-05-28西安电子科技大学Finite pixel infrared unmanned aerial vehicle target detection method based on mixed domain attention
CN112560907A (en)*2020-12-022021-03-26西安电子科技大学Limited pixel infrared unmanned aerial vehicle target detection method based on mixed domain attention
CN112418345B (en)*2020-12-072024-02-23深圳小阳软件有限公司Method and device for quickly identifying small targets with fine granularity
CN112418345A (en)*2020-12-072021-02-26苏州小阳软件科技有限公司Method and device for quickly identifying fine-grained small target
CN112651326A (en)*2020-12-222021-04-13济南大学Driver hand detection method and system based on deep learning
CN112686304A (en)*2020-12-292021-04-20山东大学Target detection method and device based on attention mechanism and multi-scale feature fusion and storage medium
CN112800964B (en)*2021-01-272021-10-22中国人民解放军战略支援部队信息工程大学 Remote sensing image target detection method and system based on multi-module fusion
CN112800964A (en)*2021-01-272021-05-14中国人民解放军战略支援部队信息工程大学Remote sensing image target detection method and system based on multi-module fusion
CN112784779A (en)*2021-01-282021-05-11武汉大学Remote sensing image scene classification method based on feature pyramid multilevel feature fusion
CN113011308A (en)*2021-03-152021-06-22山东大学Pedestrian detection method introducing attention mechanism
CN113052188A (en)*2021-03-262021-06-29大连理工大学人工智能大连研究院Method, system, equipment and storage medium for detecting remote sensing image target
CN113052188B (en)*2021-03-262024-07-26大连理工大学人工智能大连研究院Remote sensing image target detection method, system, equipment and storage medium
CN113033448A (en)*2021-04-022021-06-25东北林业大学Remote sensing image cloud-removing residual error neural network system, method and equipment based on multi-scale convolution and attention and storage medium
CN113223044A (en)*2021-04-212021-08-06西北工业大学Infrared video target detection method combining feature aggregation and attention mechanism
CN113327226A (en)*2021-05-072021-08-31北京工业大学Target detection method and device, electronic equipment and storage medium
CN113239825A (en)*2021-05-192021-08-10四川中烟工业有限责任公司High-precision tobacco beetle detection method in complex scene
CN113255699B (en)*2021-06-102022-01-18浙江华睿科技股份有限公司Small target object image detection method and device, electronic equipment and storage medium
CN113255699A (en)*2021-06-102021-08-13浙江华睿科技有限公司Small target object image detection method and device, electronic equipment and storage medium
CN113610024A (en)*2021-08-132021-11-05天津大学 A multi-strategy deep learning remote sensing image small target detection method
CN113610024B (en)*2021-08-132022-08-12天津大学 A multi-strategy deep learning remote sensing image small target detection method
CN113420838B (en)*2021-08-202021-11-02中国科学院空天信息创新研究院SAR and optical image classification method based on multi-scale attention feature fusion
CN113420838A (en)*2021-08-202021-09-21中国科学院空天信息创新研究院SAR and optical image classification method based on multi-scale attention feature fusion
CN113793472A (en)*2021-09-152021-12-14应急管理部沈阳消防研究所 Image-based fire detector pose estimation method based on feature deep aggregation network
CN114022682A (en)*2021-11-052022-02-08天津大学Weak and small target detection method based on attention secondary feature fusion mechanism
CN113989645A (en)*2021-11-082022-01-28华东师范大学 Object detection method for large-scale aerial remote sensing images
CN114119582B (en)*2021-12-012024-04-26安徽大学Synthetic aperture radar image target detection method
CN114119582A (en)*2021-12-012022-03-01安徽大学Synthetic aperture radar image target detection method
CN114220015A (en)*2021-12-212022-03-22一拓通信集团股份有限公司Improved YOLOv 5-based satellite image small target detection method
CN114241277A (en)*2021-12-222022-03-25中国人民解放军国防科技大学 Attention-guided multi-feature fusion camouflage target detection method, device, device and medium
CN114332620B (en)*2021-12-302025-05-27杭州电子科技大学 Airborne image vehicle target recognition method based on feature fusion and attention mechanism
CN114332620A (en)*2021-12-302022-04-12杭州电子科技大学 A method for vehicle target recognition in airborne images based on feature fusion and attention mechanism
CN114494728B (en)*2022-02-102024-06-07北京工业大学Small target detection method based on deep learning
CN114494728A (en)*2022-02-102022-05-13北京工业大学 A small target detection method based on deep learning
CN114565860A (en)*2022-03-012022-05-31安徽大学 A Multi-dimensional Reinforcement Learning Synthetic Aperture Radar Image Target Detection Method
CN114648802B (en)*2022-05-192022-08-23深圳市海清视讯科技有限公司User facial expression recognition method, device and equipment
CN114648802A (en)*2022-05-192022-06-21深圳市海清视讯科技有限公司Method, device and equipment for identifying facial expressions of users
CN114663654A (en)*2022-05-262022-06-24西安石油大学Improved YOLOv4 network model and small target detection method
CN114926747B (en)*2022-05-312025-04-29常州大学 A method for directional target detection in remote sensing images based on multi-feature aggregation and interaction
CN114926747A (en)*2022-05-312022-08-19常州大学Remote sensing image directional target detection method based on multi-feature aggregation and interaction
CN114708511A (en)*2022-06-012022-07-05成都信息工程大学 Remote sensing image target detection method based on multi-scale feature fusion and feature enhancement
CN114708511B (en)*2022-06-012022-08-16成都信息工程大学 Remote sensing image target detection method based on multi-scale feature fusion and feature enhancement
CN115410089A (en)*2022-06-282022-11-29北京理工大学 Optical Remote Sensing Small-Scale Object Detection Method Based on Adaptive Local Context Embedding
CN115115938A (en)*2022-07-252022-09-27中国人民解放军战略支援部队航天工程大学 A method for detecting salient objects in remote sensing images
CN115115938B (en)*2022-07-252025-04-15中国人民解放军战略支援部队航天工程大学 A method for salient object detection in remote sensing images
CN115424104A (en)*2022-08-192022-12-02西安电子科技大学Target detection method based on feature fusion and attention mechanism
CN115410102A (en)*2022-08-312022-11-29重庆大学 SAR Image Aircraft Target Detection Method Based on Joint Attention Mechanism
CN115565077A (en)*2022-09-292023-01-03哈尔滨天枢问道技术有限公司Remote sensing image small target detection algorithm based on spatial feature integration
CN115641484A (en)*2022-10-312023-01-24西安电子科技大学Variable-scale remote sensing image target detection method based on ternary feature fusion
CN116229272B (en)*2023-03-142023-10-31中国人民解放军陆军军事交通学院镇江校区 A high-precision remote sensing image detection method and system based on representative point representation
CN116229272A (en)*2023-03-142023-06-06中国人民解放军陆军军事交通学院镇江校区 A high-precision remote sensing image detection method and system based on representative point representation
CN116740370A (en)*2023-05-182023-09-12北京理工大学Complex target recognition method based on deep self-attention transformation network
CN116403115B (en)*2023-06-072023-08-22江西啄木蜂科技有限公司Large-format remote sensing image target detection method
CN116403115A (en)*2023-06-072023-07-07江西啄木蜂科技有限公司Large-format remote sensing image target detection method
CN117789153B (en)*2024-02-262024-05-03浙江驿公里智能科技有限公司Automobile oil tank outer cover positioning system and method based on computer vision
CN117789153A (en)*2024-02-262024-03-29浙江驿公里智能科技有限公司Automobile oil tank outer cover positioning system and method based on computer vision
CN118608938A (en)*2024-05-222024-09-06云南数科林业规划设计有限公司 A Geographical Environment Information Management System
CN118608399A (en)*2024-06-142024-09-06铜陵学院 A method, system, device and medium for fusion of infrared image and visible light image
CN118644666A (en)*2024-08-142024-09-13江西师范大学 An image processing method and system for remote sensing target detection scenes

Similar Documents

PublicationPublication DateTitle
CN111179217A (en) A multi-scale target detection method in remote sensing images based on attention mechanism
CN112396002B (en)SE-YOLOv 3-based lightweight remote sensing target detection method
WO2021244079A1 (en)Method for detecting image target in smart home environment
CN115100136B (en) Workpiece category and pose estimation method based on YOLOv4-tiny model
CN112884742B (en) A multi-target real-time detection, recognition and tracking method based on multi-algorithm fusion
CN111126472A (en)Improved target detection method based on SSD
CN111079739B (en)Multi-scale attention feature detection method
CN114565824B (en)Single-stage rotating ship detection method based on full convolution network
CN116168240B (en)Arbitrary-direction dense ship target detection method based on attention enhancement
CN113888461B (en) Small hardware defect detection method, system and equipment based on deep learning
CN110163836A (en)Based on deep learning for the excavator detection method under the inspection of high-altitude
CN111368660A (en) A single-stage semi-supervised image human object detection method
CN111476084A (en) A Deep Learning-Based Dynamic Parking Space Status Recognition Method in Parking Lot
CN114743023B (en) An image detection method of wheat spider based on RetinaNet model
CN116645592B (en) A crack detection method and storage medium based on image processing
CN115082778B (en)Multi-branch learning-based homestead identification method and system
CN114332921A (en) Pedestrian detection method based on Faster R-CNN network based on improved clustering algorithm
CN114463624A (en) A method and device for detecting illegal buildings applied to urban management and supervision
CN117576095A (en) A metal surface defect detection and classification method for multi-scale learning tasks
CN111950451A (en) Multi-category target recognition method based on multi-scale prediction CNN and Loongson chip
CN115063890A (en)Human body posture estimation method based on two-stage weighted mean square loss function
CN111310609A (en)Video target detection method based on time sequence information and local feature similarity
CN114359709A (en)Target detection method and device for remote sensing image
CN117197097A (en) A method for detecting power equipment components based on infrared images
CN112926681B (en) A method and device for target detection based on deep convolutional neural network

Legal Events

DateCodeTitleDescription
PB01Publication
PB01Publication
SE01Entry into force of request for substantive examination
SE01Entry into force of request for substantive examination
RJ01Rejection of invention patent application after publication
RJ01Rejection of invention patent application after publication

Application publication date:20200519


[8]ページ先頭

©2009-2025 Movatter.jp