Multi-modal ship image classification method based on multi-sequence Mamba
Technical Field
The invention belongs to the field of data identification, and particularly relates to a ship image identification method.
Background
The classification of ship images is one of the key technologies in the field of remote sensing image processing and analysis, and focuses on automatically classifying and identifying the scenes contained in ship images. With the rapid development of remote sensing technology, large amounts of high-resolution ship image data can be acquired, covering scenes such as cities, forests, farmland and oceans. Classifying these data is of great significance for resource management, environmental monitoring, disaster response and other applications.
Traditional ship classification methods rely mainly on manually designed features, which limits both accuracy and efficiency. In recent years, with the development of deep learning, convolutional neural networks (CNNs) have shown strong automatic feature learning capability in ship image classification tasks and have markedly improved classification accuracy and efficiency. However, existing methods still extract insufficient features from complex ship images, and their accuracy remains low.
Disclosure of Invention
The invention provides a multi-modal ship image classification method based on multi-sequence Mamba, which aims to solve the problems of insufficient feature extraction and low accuracy in existing classification methods.
The technical scheme of the invention is as follows:
A multi-modal ship image classification method based on multi-sequence Mamba inputs a natural light image and an infrared image of the same ship into a multi-modal classification model simultaneously to obtain a classification result;
the multi-modal classification model comprises a sequence conversion module, a cross-attention Mamba calculation module, an alternate-traversal Mamba calculation module, a spectral-spatial state fusion module and a classification module;
the sequence conversion module converts the input natural light image and the input infrared image into a corresponding natural light token sequence and infrared token sequence respectively;
the cross-attention Mamba calculation module obtains two groups of first enhancement features based on the natural light token sequence and the infrared token sequence;
the alternate-traversal Mamba calculation module obtains two groups of second enhancement features based on the two groups of first enhancement features;
the spectral-spatial state fusion module obtains a fusion feature based on the two groups of second enhancement features;
and the classification module obtains the classification result based on the fusion feature.
As a further improvement of the multi-modal ship image classification method based on multi-sequence Mamba, the sequence conversion module is an image block embedding module;
the natural light token sequence is obtained by applying the convolution operation of the sequence conversion module to the input natural light image and passing the result through the activation function of the sequence conversion module;
the infrared token sequence is obtained in the same way from the input infrared image.
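For example, denoting the input natural light image by $X_v$, the input infrared image by $X_r$, the activation function by $\sigma$ and the convolution operation by $\mathrm{Conv}$ (notation introduced here purely for illustration), the two token sequences may be written as:

$$S_v = \sigma\big(\mathrm{Conv}(X_v)\big), \qquad S_r = \sigma\big(\mathrm{Conv}(X_r)\big).$$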
As a further improvement of the multi-modal ship image classification method based on multi-sequence Mamba, given the natural light token sequence and the infrared token sequence, the processing procedure of the cross-attention Mamba calculation module is as follows:
step A-1, calculating a cross-attention score for each of the two token sequences, using the normalization operation, the activation function and the first linear layer of the cross-attention Mamba calculation module;
step A-2, performing forward-order SSM calculation on the two token sequences, using the normalization operation, the second linear layer of the cross-attention Mamba calculation module and the forward-order structured state space model;
step A-3, arranging the two token sequences in reverse order and performing reverse-order SSM calculation on the reversed sequences, using the second linear layer of the cross-attention Mamba calculation module and the reverse-order structured state space model;
step A-4, calculating the two groups of first enhancement features according to the cross-attention scores, the forward-order SSM calculation results and the reverse-order SSM calculation results.
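For illustration, with $S_v$ and $S_r$ the two token sequences, $\mathrm{Norm}$ the normalization operation, $\sigma$ the activation function, $\mathrm{Linear}_1$ and $\mathrm{Linear}_2$ the two linear layers, and $\mathrm{SSM}_f$ and $\mathrm{SSM}_b$ the forward-order and reverse-order structured state space models, steps A-1 to A-4 may for example take the following form, in which the cross-wise computation of the scores and the elementwise combination in step A-4 are illustrative assumptions:

$$A_v = \sigma\big(\mathrm{Linear}_1(\mathrm{Norm}(S_r))\big), \qquad A_r = \sigma\big(\mathrm{Linear}_1(\mathrm{Norm}(S_v))\big)$$
$$U_v = \mathrm{SSM}_f\big(\mathrm{Linear}_2(\mathrm{Norm}(S_v))\big), \qquad U_r = \mathrm{SSM}_f\big(\mathrm{Linear}_2(\mathrm{Norm}(S_r))\big)$$
$$V_v = \mathrm{SSM}_b\big(\mathrm{Linear}_2(\bar{S}_v)\big), \qquad V_r = \mathrm{SSM}_b\big(\mathrm{Linear}_2(\bar{S}_r)\big)$$
$$F_v = A_v \odot (U_v + V_v), \qquad F_r = A_r \odot (U_r + V_r)$$

where $\bar{S}_v$ and $\bar{S}_r$ are the reversed sequences, the reverse-order outputs are restored to the original order before the combination, and $F_v$, $F_r$ denote the two first enhancement features.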
As a further improvement of the multi-modal ship image classification method based on multi-sequence Mamba, given the two groups of first enhancement features output by the cross-attention Mamba calculation module, the processing procedure of the alternate-traversal Mamba calculation module is as follows:
step B-1, performing traversal ordering on the two groups of first enhancement features to obtain a combined feature sequence;
step B-2, performing attention score calculation on the combined feature sequence, using the normalization operation, the activation function and the first linear layer of the alternate-traversal Mamba calculation module;
step B-3, performing feature extraction on the combined feature sequence through the forward-order SSM and the reverse-order SSM respectively, using the second linear layer of the alternate-traversal Mamba calculation module, the forward-order structured state space model and the reverse-order structured state space model;
step B-4, calculating an enhanced feature according to the attention score and the two SSM feature-extraction results;
step B-5, splitting the enhanced feature into the two groups of second enhancement features by reversing the traversal order used in step B-1.
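For illustration, writing the two first enhancement features as $F_v$ and $F_r$ and their token-wise interleaving as $G$ (the interleaved traversal order and the elementwise combination below are illustrative assumptions), steps B-1 to B-5 may take the form:

$$G = \big[F_v^{(1)}, F_r^{(1)}, F_v^{(2)}, F_r^{(2)}, \ldots\big]$$
$$H = \sigma\big(\mathrm{Linear}_1(\mathrm{Norm}(G))\big)$$
$$P = \mathrm{SSM}_f\big(\mathrm{Linear}_2(\mathrm{Norm}(G))\big), \qquad Q = \mathrm{SSM}_b\big(\mathrm{Linear}_2(\mathrm{Norm}(\bar{G}))\big)$$
$$E = H \odot (P + Q)$$

where $\bar{G}$ is $G$ in reverse order and $Q$ is restored to the original order before the combination; de-interleaving $E$ yields the two second enhancement features $\hat{F}_v$ and $\hat{F}_r$.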
As a further improvement of the multi-modal ship image classification method based on multi-sequence Mamba, given the two groups of second enhancement features output by the alternate-traversal Mamba calculation module, the spectral-spatial state fusion module fuses them into a fusion feature using a normalization operation, a first fully connected layer, a second fully connected layer, an activation function and a probability output function.
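For illustration, with $\hat{F}_v$ and $\hat{F}_r$ the two second enhancement features, $\mathrm{FC}_1$ and $\mathrm{FC}_2$ the two fully connected layers and $\mathrm{Softmax}$ the probability output function, one possible form of the fusion (the additive pooling of the two features and the weighted-sum fusion rule are illustrative assumptions) is:

$$z = \sigma\big(\mathrm{FC}_1(\mathrm{Norm}(\hat{F}_v + \hat{F}_r))\big)$$
$$w = \mathrm{Softmax}\big(\mathrm{FC}_2(z)\big)$$
$$F_{\mathrm{fuse}} = w_1 \odot \hat{F}_v + w_2 \odot \hat{F}_r$$

where $F_{\mathrm{fuse}}$ denotes the fusion feature.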
As a further improvement of the multi-modal ship image classification method based on multi-sequence Mamba, the classification module passes the fusion feature obtained by the spectral-spatial state fusion module through the fully connected layer of the classification module and a probability output function to obtain a probability vector; each element of this vector represents the probability that the ship in the currently input images belongs to the corresponding category, and the category with the highest probability is taken as the classification result.
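For illustration, with $\mathrm{FC}$ the fully connected layer of the classification module and $\mathrm{Softmax}$ the probability output function, the classification step may be written as:

$$y = \mathrm{Softmax}\big(\mathrm{FC}(F_{\mathrm{fuse}})\big)$$

where the element of $y$ with the highest value indicates the predicted ship category.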
As a further improvement of the multi-modal ship image classification method based on multi-sequence Mamba, the training process of the multi-modal classification model is as follows:
constructing a training set in which the i-th sample consists of a natural light image, an infrared image and the true category of the ship, the natural light image and the infrared image having the same height, width and number of channels and showing the same ship, and Z denoting the number of samples;
inputting the samples of the training set into the multi-modal classification model, computing a cross-entropy loss between the classification result and the true category, and optimizing the network model by back-propagating this loss.
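For illustration, with $y_i$ the probability vector predicted for the i-th sample and $c_i$ its true category, the cross-entropy loss used for back-propagation can be written as:

$$\mathcal{L} = -\frac{1}{Z}\sum_{i=1}^{Z} \log y_i[c_i].$$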
Compared with the prior art, the invention has the following beneficial effects:
According to the invention, natural light images and infrared images, which differ considerably from each other, are used as model inputs. The two sets of features are first fused through the cross-attention Mamba calculation, so that each modality learns features of the other; they are then further fused through the alternate-traversal Mamba calculation to extract richer image representations; finally, the spectral-spatial state fusion combines and analyses the two sets of features, yielding a richer image representation and improving the accuracy of image classification.
Drawings
FIG. 1 is a schematic diagram of a framework of a ship classification model according to the present invention;
FIG. 2 is a schematic diagram of the cross-attention Mamba calculation module;
FIG. 3 is a schematic diagram of the alternate-traversal Mamba calculation module;
FIG. 4 is a schematic diagram of the spectral-spatial state fusion module.
Detailed Description
The technical scheme of the invention is described in detail below with reference to the accompanying drawings. It will be apparent that the described embodiments are only some, but not all, embodiments of the invention.
A multi-modal ship image classification method based on multi-sequence Mamba inputs a natural light image and an infrared image of the same ship into a multi-modal classification model simultaneously to obtain a classification result.
As shown in FIG. 1, the multi-modal classification model includes a sequence conversion module, a cross-attention Mamba calculation module, an alternate-traversal Mamba calculation module, a spectral-spatial state fusion module, and a classification module.
The sequence conversion module is used for respectively converting the input natural light image and the input infrared image into a corresponding natural light token sequence and an infrared token sequence.
In this embodiment, the sequence conversion module is an image block embedding module;
the natural light token sequence is obtained by applying the convolution operation of the sequence conversion module to the input natural light image and passing the result through the activation function of the sequence conversion module;
the infrared token sequence is obtained in the same way from the input infrared image.
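As a concrete illustration of this embodiment, the sequence conversion can be sketched as follows; the patch size, embedding dimension, image size and the SiLU activation are assumptions made only for this sketch.

```python
import torch
import torch.nn as nn

class PatchEmbed(nn.Module):
    """Sequence conversion module: converts an image into a token sequence (illustrative sketch)."""
    def __init__(self, in_channels=3, embed_dim=64, patch_size=8):
        super().__init__()
        # Non-overlapping image block embedding implemented as a strided convolution.
        self.conv = nn.Conv2d(in_channels, embed_dim, kernel_size=patch_size, stride=patch_size)
        self.act = nn.SiLU()  # assumed activation; the text only states "an activation function"

    def forward(self, x):                      # x: (B, C, H, W)
        x = self.act(self.conv(x))             # (B, D, H/p, W/p)
        return x.flatten(2).transpose(1, 2)    # token sequence: (B, L, D)

embed = PatchEmbed()
natural_light = torch.randn(1, 3, 64, 64)      # input natural light image
infrared = torch.randn(1, 3, 64, 64)           # input infrared image
S_v, S_r = embed(natural_light), embed(infrared)
print(S_v.shape, S_r.shape)                    # two token sequences of shape (1, 64, 64)
```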
The cross-attention Mamba calculation module obtains two groups of first enhancement features based on the natural light token sequence and the infrared token sequence.
As shown in FIG. 2, the processing procedure of the cross-attention Mamba calculation module is as follows.
Step A-1, a cross-attention score is calculated for each of the two token sequences, using the normalization operation, the activation function and the first linear layer of the cross-attention Mamba calculation module.
Step A-2, forward-order SSM calculation is performed on the two token sequences, using the normalization operation, the second linear layer of the cross-attention Mamba calculation module and the forward-order structured state space model.
Step A-3, the two token sequences are arranged in reverse order, and reverse-order SSM calculation is performed on the reversed sequences, using the second linear layer of the cross-attention Mamba calculation module and the reverse-order structured state space model.
Step A-4, the two groups of first enhancement features are calculated according to the cross-attention scores, the forward-order SSM calculation results and the reverse-order SSM calculation results; one group of first enhancement features is obtained for each modality.
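The following simplified sketch illustrates one possible realization of the cross-attention Mamba calculation module; the toy linear-recurrence scan stands in for the structured state space model, and the cross-wise computation of the attention scores and the multiplicative combination in step A-4 are assumptions of this sketch.

```python
import torch
import torch.nn as nn

def ssm_scan(x, decay=0.9):
    """Toy stand-in for a structured state space model: the linear recurrence
    h_t = decay * h_{t-1} + x_t, returned at every time step. x: (B, L, D)."""
    h = torch.zeros_like(x[:, 0])
    out = []
    for t in range(x.size(1)):
        h = decay * h + x[:, t]
        out.append(h)
    return torch.stack(out, dim=1)

class CrossAttentionMambaBlock(nn.Module):
    def __init__(self, dim=64):
        super().__init__()
        self.norm = nn.LayerNorm(dim)
        self.linear1 = nn.Linear(dim, dim)   # first linear layer (attention score branch)
        self.linear2 = nn.Linear(dim, dim)   # second linear layer (SSM branch)
        self.act = nn.SiLU()

    def forward(self, s_v, s_r):             # s_v, s_r: (B, L, D) token sequences
        # Step A-1: cross-attention scores (assumed: each score comes from the other modality).
        a_v = self.act(self.linear1(self.norm(s_r)))
        a_r = self.act(self.linear1(self.norm(s_v)))
        # Step A-2: forward-order SSM on each sequence.
        u_v = ssm_scan(self.linear2(self.norm(s_v)))
        u_r = ssm_scan(self.linear2(self.norm(s_r)))
        # Step A-3: reverse-order SSM on the flipped sequences, flipped back afterwards.
        v_v = ssm_scan(self.linear2(self.norm(s_v.flip(1)))).flip(1)
        v_r = ssm_scan(self.linear2(self.norm(s_r.flip(1)))).flip(1)
        # Step A-4: combine scores with both SSM results (assumed multiplicative gating).
        f_v = a_v * (u_v + v_v)
        f_r = a_r * (u_r + v_r)
        return f_v, f_r                      # two groups of first enhancement features

block = CrossAttentionMambaBlock()
f_v, f_r = block(torch.randn(1, 64, 64), torch.randn(1, 64, 64))
print(f_v.shape, f_r.shape)
```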
The alternate-traversal Mamba calculation module obtains two groups of second enhancement features based on the two groups of first enhancement features.
As shown in FIG. 3, the processing procedure of the alternate-traversal Mamba calculation module is as follows.
Step B-1, traversal ordering is performed on the two groups of first enhancement features to obtain a combined feature sequence.
Step B-2, attention score calculation is performed on the combined feature sequence, using the normalization operation, the activation function and the first linear layer of the alternate-traversal Mamba calculation module.
Step B-3, feature extraction is performed on the combined feature sequence through the forward-order SSM and the reverse-order SSM respectively, using the second linear layer of the alternate-traversal Mamba calculation module, the forward-order structured state space model and the reverse-order structured state space model.
Step B-4, an enhanced feature is calculated according to the attention score and the two SSM feature-extraction results.
Step B-5, the enhanced feature is split into the two groups of second enhancement features by reversing the traversal order used in step B-1.
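A corresponding sketch of the alternate-traversal Mamba calculation module is given below; the token-wise interleaving used as the traversal order, the toy linear-recurrence scan and the multiplicative combination in step B-4 are assumptions of this sketch.

```python
import torch
import torch.nn as nn

def ssm_scan(x, decay=0.9):
    """Toy stand-in for a structured state space model (simple linear recurrence)."""
    h = torch.zeros_like(x[:, 0])
    out = []
    for t in range(x.size(1)):
        h = decay * h + x[:, t]
        out.append(h)
    return torch.stack(out, dim=1)

class AlternateTraversalMambaBlock(nn.Module):
    def __init__(self, dim=64):
        super().__init__()
        self.norm = nn.LayerNorm(dim)
        self.linear1 = nn.Linear(dim, dim)
        self.linear2 = nn.Linear(dim, dim)
        self.act = nn.SiLU()

    def forward(self, f_v, f_r):                      # first enhancement features: (B, L, D)
        B, L, D = f_v.shape
        # Step B-1: traversal ordering - interleave the two features token by token.
        g = torch.stack((f_v, f_r), dim=2).reshape(B, 2 * L, D)
        # Step B-2: attention score over the combined sequence.
        h = self.act(self.linear1(self.norm(g)))
        # Step B-3: forward-order and reverse-order SSM feature extraction.
        p = ssm_scan(self.linear2(self.norm(g)))
        q = ssm_scan(self.linear2(self.norm(g.flip(1)))).flip(1)
        # Step B-4: combine the score with both SSM results (assumed multiplicative gating).
        e = h * (p + q)
        # Step B-5: undo the traversal order, splitting into the two second enhancement features.
        e = e.reshape(B, L, 2, D)
        return e[:, :, 0], e[:, :, 1]

block = AlternateTraversalMambaBlock()
f2_v, f2_r = block(torch.randn(1, 64, 64), torch.randn(1, 64, 64))
print(f2_v.shape, f2_r.shape)
```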
The spectral-spatial state fusion module obtains a fusion feature based on the two groups of second enhancement features.
As shown in FIG. 4, the spectral-spatial state fusion module fuses the two groups of second enhancement features into the fusion feature using a normalization operation, a first fully connected layer, a second fully connected layer, an activation function and a probability output function.
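The spectral-spatial state fusion can be sketched as follows; turning the two features into per-modality weights, the mean pooling and the weighted-sum fusion rule are assumptions of this sketch.

```python
import torch
import torch.nn as nn

class SpectralSpatialFusion(nn.Module):
    """Illustrative sketch: the two second enhancement features are mapped to per-modality
    weights by two fully connected layers and a probability output function, then fused."""
    def __init__(self, dim=64):
        super().__init__()
        self.norm = nn.LayerNorm(dim)
        self.fc1 = nn.Linear(dim, dim)
        self.fc2 = nn.Linear(dim, 2)      # one weight per modality (assumption)
        self.act = nn.SiLU()

    def forward(self, f2_v, f2_r):                        # (B, L, D) each
        summary = self.norm(f2_v + f2_r).mean(dim=1)      # (B, D) joint descriptor
        w = torch.softmax(self.fc2(self.act(self.fc1(summary))), dim=-1)  # (B, 2)
        return w[:, 0:1, None] * f2_v + w[:, 1:2, None] * f2_r            # fusion feature (B, L, D)

fusion = SpectralSpatialFusion()
fused = fusion(torch.randn(1, 64, 64), torch.randn(1, 64, 64))
print(fused.shape)
```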
The classification module obtains the classification result based on the fusion feature: the fusion feature is passed through the fully connected layer of the classification module and a probability output function to obtain a probability vector, each element of which represents the probability that the ship in the currently input images belongs to the corresponding category; the category with the highest probability is taken as the classification result.
Further, the training process of the multi-modal classification model is as follows:
A training set is constructed in which the i-th sample consists of a natural light image, an infrared image and the true category of the ship; the natural light image and the infrared image have the same height, width and number of channels, the ship shown in the two images of the same sample is the same ship, and Z denotes the number of samples.
The samples of the training set are input into the multi-modal classification model, a cross-entropy loss is computed between the classification result and the true category, and the network model is optimized by back-propagating this loss with an Adam optimizer.
After training, natural light images and infrared images of the same ship are input into the model at the same time, and classification results are obtained.
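For completeness, a minimal training-step sketch is given below; the stand-in backbone, class count, image size and learning rate are placeholders chosen only so that the snippet runs on its own, and would in practice be replaced by the modules described above.

```python
import torch
import torch.nn as nn

class MultiModalClassifier(nn.Module):
    """Placeholder model: in practice the backbone would be the sequence conversion,
    cross-attention Mamba, alternate-traversal Mamba and fusion modules sketched above."""
    def __init__(self, num_classes=10, dim=64):
        super().__init__()
        self.backbone = nn.Sequential(nn.Flatten(), nn.Linear(2 * 3 * 64 * 64, dim))
        self.head = nn.Linear(dim, num_classes)

    def forward(self, x_v, x_r):
        features = self.backbone(torch.cat((x_v, x_r), dim=1))
        return self.head(features)           # logits; softmax is applied inside the loss

model = MultiModalClassifier()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-4)
criterion = nn.CrossEntropyLoss()

x_v = torch.randn(8, 3, 64, 64)              # batch of natural light images
x_r = torch.randn(8, 3, 64, 64)              # batch of infrared images of the same ships
labels = torch.randint(0, 10, (8,))          # true ship categories

optimizer.zero_grad()
loss = criterion(model(x_v, x_r), labels)    # cross-entropy between prediction and truth
loss.backward()                               # back-propagate the loss
optimizer.step()                              # Adam update
print(float(loss))
```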
It should be noted that it will be apparent to those skilled in the art that the present invention is not limited to the details of the above-described exemplary embodiments, but may be embodied in other specific forms without departing from the spirit or essential characteristics thereof. The scope of the invention is indicated by the appended claims rather than by the foregoing description.