Disclosure of Invention
The invention provides a facial thermal infrared-to-visible light image conversion method based on prior information, comprising the steps of: obtaining a facial thermal infrared image to be converted and the corresponding paired visible light image, and inputting the facial thermal infrared image into a trained prior-information-based facial thermal infrared-to-visible light generation network model to obtain a synthesized facial visible light image. The prior-information-based facial thermal infrared-to-visible light generation network model comprises a face parsing map condition network module, a spatial feature transformation mapping layer, an attention module, a generator network module, and a discriminator;
the process for training the prior-information-based facial thermal infrared-to-visible light generation network model comprises the following steps:
S1, acquiring a thermal infrared-visible light image dataset, and classifying and labeling the images in the dataset by skin color information;
S2, preprocessing the paired images in the dataset, and inputting the preprocessed images into the prior-information-based facial thermal infrared-to-visible light generation network model;
S3, extracting features from the preprocessed face parsing map with the face parsing map condition network module to obtain facial prior information features;
S4, performing feature extraction and encoding on the preprocessed facial thermal infrared image with the generator network module to obtain encoded facial feature information;
S5, inputting the facial prior information features and the encoded facial feature information into the spatial feature transformation mapping layer to generate a pair of modulation parameters;
S6, passing the facial feature information through multiple residual transformation layers and a decoder to obtain the corresponding synthesized facial visible light image, and inputting the synthesized image into the discriminator for adversarial training;
S7, inputting the synthesized facial visible light image and the facial thermal infrared image into the encoder and two MLP mapping layers to obtain the corresponding facial thermal infrared image features and synthesized facial visible light image features;
S8, optimizing the model parameters with an Adam optimizer, and outputting the optimal parameters when the model's loss function reaches its minimum, thereby obtaining the optimal prior-information-based facial thermal infrared-to-visible light generation network model.
Preferably, the generator network module comprises an encoder Genc, a converter, and a decoder Gdec. The encoder Genc consists mainly of 3 CIR layers, each composed of a convolution, InstanceNorm normalization, and a ReLU activation function; the input image passes through the encoder for feature extraction. The converter comprises 9 residual blocks, each consisting of a spatial feature transformation mapping layer (STL) followed by CIR layers; the residual blocks mainly enhance the feature maps extracted by the encoder. The decoder Gdec comprises two CTIR layers, one Reflect (boundary padding) layer, and one convolution layer, where each CTIR layer consists of a deconvolution, InstanceNorm normalization, and a ReLU activation function; the decoder performs upsampling, gradually reconstructing the learned facial features back to the original image size.
Preferably, the spatial feature transformation mapping layer (STL) processes two input facial features: the facial prior information features generated by the face parsing map condition module, and the feature output GF of each layer of the generator network. The facial prior information features pass through two separate convolution branches to obtain a pair of modulation parameters α and β; the facial feature output GF of the generator network is first element-wise multiplied by α and then added to β, yielding the output of the STL. In this way the STL spatially maps and transforms the facial feature information inside the generator network, adaptively optimizing the quality of the generated face image.
Preferably, the attention module performs contrastive learning between the facial thermal infrared input image and the synthesized facial visible light image; the process comprises the following steps:
S71, extracting multi-layer features of the facial thermal infrared image and of the synthesized facial visible light image: the two images pass through the encoder Genc and a two-layer MLP network H_l to obtain the facial feature maps F_H ∈ R^(C×H×W) and F_V ∈ R^(C×H×W), respectively;
S72, applying reshape and transpose operations to the facial thermal infrared feature map to obtain the two-dimensional matrices Q_H ∈ R^(HW×C) and V_H ∈ R^(HW×C);
S73, constructing the global attention contrastive loss from the two-dimensional matrices Q_H ∈ R^(HW×C) and V_H ∈ R^(HW×C).
Furthermore, the model provides two attention methods, global attention and local attention, each used to build a contrastive learning loss; in this embodiment, global attention is used. The process of constructing the global attention contrastive loss comprises: multiplying Q_H by its transpose K_H ∈ R^(C×HW) to obtain a matrix, and applying a Softmax normalization to each row of the matrix to obtain the global attention matrix A_global ∈ R^(HW×HW); computing the entropy value Hs of each row of the global attention matrix according to the entropy formula; sorting the rows of the global attention matrix in ascending order of entropy; routing the value features V_H ∈ R^(HW×C) of the source-domain facial thermal infrared image and V_V ∈ R^(HW×C) of the target-domain synthesized facial visible light image according to the sorted matrix; and finally constructing the global contrastive loss from the routed value features V_H and V_V.
Further, the process of constructing the local attention contrastive loss comprises: local attention uses a k×k window of fixed size that slides over the source-domain facial thermal infrared image with a stride of 1, which strengthens spatial information interaction within a local region. Q_H is multiplied by its local transpose K_local ∈ R^(C×k^2) (taken per spatial position) to obtain a matrix, and a Softmax normalization is applied to each row to obtain the local attention matrix A_local ∈ R^(HW×k^2). The entropy value Hs of each row of A_local is computed, the rows are sorted in ascending order of entropy, and the values V_H-local of the source-domain facial thermal infrared image and V_V-local of the target-domain synthesized facial visible light image are routed according to the sorted matrix, thereby constructing the multi-layer local contrastive loss.
Preferably, the loss function of the model is:
L = λ1·L_ConH + λ2·L_ConG(H) + λ3·L_Pcl + λ4·L_Gm + λ5·L_gan
where λ1, λ2, λ3, λ4, and λ5 are the weighting hyperparameters of the contrastive learning loss, the identity-preserving contrastive learning loss, the pixel-level consistency loss, the gradient enhancement loss, and the generative adversarial loss, respectively.
The invention has the beneficial effects that:
The invention uses the face parsing map as prior information to guide the generator network in learning the local texture information of the face image: the face parsing map features serve as a prior condition from which the spatial feature transformation mapping layer (STL) generates a pair of modulation parameters, and the facial features of the generator network are affine-transformed according to these parameters, adaptively optimizing the quality of the generated face image, which helps suppress artifacts in the generated face image and improves local texture detail. The invention also designs a facial gradient enhancement loss for model training: gradient maps are extracted from the synthesized facial visible light image and the ground-truth facial visible light image (GT), and the gradient enhancement loss sharpens the facial details of the synthesized image while preserving better facial contour information.
Detailed Description
The embodiments of the present invention are described below clearly and completely with reference to the accompanying drawings. It is apparent that the described embodiments are only some, not all, of the embodiments of the invention. All other embodiments obtained by those skilled in the art based on these embodiments without inventive effort fall within the scope of the invention.
A facial thermal infrared-to-visible light image conversion method based on prior information, as shown in figs. 1 and 2, comprises: first preparing a paired facial thermal infrared-visible light dataset and the corresponding facial prior dataset (face parsing maps), while classifying the images by skin color and extracting the corresponding labels; preprocessing the data; constructing the prior-information-based facial thermal infrared-to-visible light generation network model, comprising a face parsing map condition network (FPCN), a spatial feature transformation mapping layer (STL), a generator network module G, a discriminator D, and so on; training the face generation network model, applying the attention operations to the source-domain and target-domain images to select salient anchors and positive and negative samples; training and optimizing with the combined loss functions and an Adam optimizer, updating the network parameters until training is complete and the optimal model is obtained; and inputting a facial thermal infrared image into the optimal generation model to obtain the synthesized facial visible light image.
The facial thermal infrared-to-visible light image conversion method based on prior information comprises: obtaining a facial thermal infrared image to be converted and the corresponding paired visible light image, and inputting the facial thermal infrared image into the trained prior-information-based facial thermal infrared-to-visible light generation network model to obtain the synthesized facial visible light image. The model comprises a face parsing map condition network module, a spatial feature transformation mapping layer, an attention module, a generator network module, and a discriminator.
The process for training the prior-information-based facial thermal infrared-to-visible light generation network model comprises the following steps:
S1, acquiring a thermal infrared-visible light image dataset, and classifying and labeling the images in the dataset by skin color information;
S2, preprocessing the paired images in the dataset, and inputting the preprocessed images into the prior-information-based facial thermal infrared-to-visible light generation network model;
S3, extracting features from the preprocessed face parsing map with the face parsing map condition network module to obtain facial prior information features;
S4, performing feature extraction and encoding on the preprocessed facial thermal infrared image with the generator network module to obtain encoded facial feature information;
S5, inputting the facial prior information features and the encoded facial feature information into the spatial feature transformation mapping layer to generate a pair of modulation parameters;
S6, passing the facial feature information through multiple residual transformation layers and a decoder to obtain the corresponding synthesized facial visible light image, and inputting the synthesized image into the discriminator for adversarial training;
S7, inputting the synthesized facial visible light image and the facial thermal infrared image into the encoder and two MLP mapping layers to obtain the corresponding facial thermal infrared image features and synthesized facial visible light image features;
S8, optimizing the model parameters with an Adam optimizer, and outputting the optimal parameters when the model's loss function reaches its minimum, thereby obtaining the optimal prior-information-based facial thermal infrared-to-visible light generation network model.
Another embodiment for training the prior-information-based facial thermal infrared-to-visible light generation network model, as shown in fig. 1, comprises:
S1, acquiring a thermal infrared-visible light image dataset, setting the number of iterations, and classifying and labeling the images in the dataset by skin color information;
S2, preprocessing the paired images in the dataset, and inputting the preprocessed images into the prior-information-based facial thermal infrared-to-visible light generation network model;
S3, extracting features from the preprocessed face parsing map with the face parsing map condition network module to obtain facial prior information features;
S4, performing feature extraction and encoding on the preprocessed facial thermal infrared image with the generator network module to obtain encoded facial feature information;
S5, inputting the facial prior information features and the encoded facial feature information into the spatial feature transformation mapping layer to generate a pair of modulation parameters;
S6, passing the facial feature information through multiple residual transformation layers and a decoder to obtain the corresponding synthesized facial visible light image, and inputting the synthesized image into the discriminator for adversarial training;
S7, inputting the synthesized facial visible light image and the facial thermal infrared image into the encoder and two MLP mapping layers to obtain the corresponding facial thermal infrared image features and synthesized facial visible light image features;
S8, optimizing the model parameters with an Adam optimizer, backpropagating, and incrementing the iteration count by 1; comparing the current iteration count with the preset iteration count: if they are equal, outputting the optimal parameters to obtain the optimal prior-information-based facial thermal infrared-to-visible light generation network model; otherwise, returning to step S3.
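The training procedure above can be summarized in a short PyTorch-style sketch. All module and loss names (fpcn, generator, discriminator, g_loss_fn, d_loss_fn) are hypothetical stand-ins for the components described in this embodiment, and the Adam settings are common GAN defaults rather than values given in the source:

```python
# Minimal training-loop sketch for steps S1-S8 (illustrative only).
import torch

def train(fpcn, generator, discriminator, g_loss_fn, d_loss_fn,
          dataloader, num_iterations, lr=2e-4, betas=(0.5, 0.999)):
    opt_g = torch.optim.Adam(
        list(generator.parameters()) + list(fpcn.parameters()), lr=lr, betas=betas)
    opt_d = torch.optim.Adam(discriminator.parameters(), lr=lr, betas=betas)
    it = 0
    while True:
        for ir_img, vis_img, parsing_map, skin_label in dataloader:   # S2
            prior_feat = fpcn(parsing_map)            # S3: facial prior features
            fake_vis = generator(ir_img, prior_feat)  # S4-S6: encode, STL, decode
            # S6: discriminator update on real vs. synthesized images
            loss_d = d_loss_fn(discriminator, vis_img, fake_vis.detach(), skin_label)
            opt_d.zero_grad(); loss_d.backward(); opt_d.step()
            # S7-S8: generator update with the combined loss, then backprop
            loss_g = g_loss_fn(discriminator, ir_img, vis_img, fake_vis, skin_label)
            opt_g.zero_grad(); loss_g.backward(); opt_g.step()
            it += 1                                   # S8: iteration counter
            if it == num_iterations:                  # stop at the preset count
                return generator
```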
Acquiring the paired infrared-visible light image dataset comprises capturing corresponding facial thermal infrared and visible light datasets with a dual-modality thermal infrared/visible light camera, at a size of 256×256. The faces are then aligned: the RetinaFace face detection algorithm locates 5 facial key points, and the faces are cropped to obtain the paired facial thermal infrared-visible light dataset. The cropped and uncropped facial thermal-visible light data are classified by skin color and the corresponding labels are extracted for subsequent model training, and the corresponding facial prior inputs (the face parsing map dataset) are generated with a recent face parsing model.
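A minimal sketch of this preparation pipeline follows. detect_five_landmarks, align_crop, and parse_model are hypothetical stand-ins for the RetinaFace detector, the five-point alignment step, and the face parsing model, since the concrete tooling is not specified here:

```python
# Data-preparation sketch (illustrative); helper functions are hypothetical.
from pathlib import Path
import numpy as np
from PIL import Image

def prepare_pair(ir_path: Path, vis_path: Path,
                 parse_model, detect_five_landmarks, align_crop):
    ir = np.array(Image.open(ir_path).resize((256, 256)))   # 256x256 capture size
    vis = np.array(Image.open(vis_path).resize((256, 256)))
    pts = detect_five_landmarks(vis)     # 5 key points (eyes, nose, mouth corners)
    ir_face = align_crop(ir, pts)        # crop both modalities with the same points
    vis_face = align_crop(vis, pts)
    parsing_map = parse_model(vis_face)  # face parsing map = prior information
    return ir_face, vis_face, parsing_map
```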
In this embodiment, as shown in fig. 4, the face parsing map condition network module (FPCN) takes the face parsing map as input and processes it with 3 convolution layers, using 1×1 and 3×3 convolution kernels to extract facial features.
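A minimal sketch of such a 3-layer FPCN; the number of parsing classes and the channel widths are assumptions, not values from the source:

```python
# FPCN sketch: 3 convolution layers over the face parsing map,
# mixing 1x1 and 3x3 kernels as described.
import torch.nn as nn

class FPCN(nn.Module):
    def __init__(self, in_ch=19, mid_ch=64, out_ch=128):  # 19 parsing classes assumed
        super().__init__()
        self.body = nn.Sequential(
            nn.Conv2d(in_ch, mid_ch, kernel_size=1), nn.ReLU(inplace=True),
            nn.Conv2d(mid_ch, mid_ch, kernel_size=3, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(mid_ch, out_ch, kernel_size=1),
        )

    def forward(self, parsing_map):
        return self.body(parsing_map)   # facial prior information features
```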
The spatial feature transformation mapping layer (STL), shown in fig. 3, has two inputs: the facial prior information features generated by the FPCN module, and the feature output GF of each layer of the generator network. The facial prior information features pass through multi-layer convolution operations and a sigmoid activation function to obtain a pair of modulation parameters α and β. The facial feature output GF of the generator network is element-wise multiplied by α and then added to β, yielding the output of the STL. In this way the STL spatially maps and transforms the facial feature information inside the generator network, adaptively optimizing the quality of the generated face image.
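A minimal STL sketch; the channel widths, and whether the sigmoid ends both branches, are assumptions left open by the text:

```python
# STL sketch: two small convolution branches map the FPCN prior features
# to modulation parameters alpha and beta; the generator feature GF is
# modulated as GF * alpha + beta.
import torch.nn as nn

class STL(nn.Module):
    def __init__(self, prior_ch=128, feat_ch=256):
        super().__init__()
        self.alpha = nn.Sequential(nn.Conv2d(prior_ch, feat_ch, 3, padding=1),
                                   nn.Conv2d(feat_ch, feat_ch, 3, padding=1),
                                   nn.Sigmoid())
        self.beta = nn.Sequential(nn.Conv2d(prior_ch, feat_ch, 3, padding=1),
                                  nn.Conv2d(feat_ch, feat_ch, 3, padding=1),
                                  nn.Sigmoid())

    def forward(self, gf, prior_feat):
        a = self.alpha(prior_feat)    # modulation parameter alpha
        b = self.beta(prior_feat)     # modulation parameter beta
        return gf * a + b             # point-multiply first, then add
```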
The generator network module G mainly comprises the encoder Genc, a converter consisting of 9 residual blocks, and the decoder Gdec.

In this embodiment, the encoder Genc consists essentially of 3 CIR layers, each composed of a convolution, InstanceNorm normalization, and a ReLU activation function. A Reflect (boundary padding) layer is inserted before the 3 CIR layers; it pads the image symmetrically up, down, left, and right along its edges so that the subsequent convolutions preserve information at the image borders. The main function of the encoder Genc is to extract facial features.

In this embodiment, the converter is composed of 9 residual blocks. Each residual block consists of a spatial feature transformation mapping layer followed by a CIR layer, and mainly performs feature enhancement. With the face parsing map as prior information, the generator features are mapped and transformed by the modulation parameters produced by the spatial feature transformation mapping layer, which greatly improves the texture details of the generated face image.

In this embodiment, the decoder Gdec consists, in order, of two CTIR layers, one Reflect layer, and one convolution layer. Each CTIR layer consists of a deconvolution, InstanceNorm normalization, and a ReLU activation function. The decoder gradually reconstructs the features into a facial visible light image of the original size.
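Putting the three parts together, a condensed generator sketch under the same assumptions; it reuses the STL class from the sketch above, and the channel widths, the final Tanh, and the spatial size of prior_feat are assumptions:

```python
# Generator sketch: Reflect + 3 CIR layers (Genc), 9 residual blocks with
# STL (converter), 2 CTIR layers + Reflect + conv (Gdec).
import torch.nn as nn

def CIR(in_ch, out_ch, k=3, s=1, p=1):   # Conv + InstanceNorm + ReLU
    return nn.Sequential(nn.Conv2d(in_ch, out_ch, k, s, p),
                         nn.InstanceNorm2d(out_ch), nn.ReLU(inplace=True))

def CTIR(in_ch, out_ch):                 # Deconv + InstanceNorm + ReLU
    return nn.Sequential(
        nn.ConvTranspose2d(in_ch, out_ch, 3, stride=2, padding=1, output_padding=1),
        nn.InstanceNorm2d(out_ch), nn.ReLU(inplace=True))

class ResidualSTLBlock(nn.Module):       # STL followed by a CIR layer
    def __init__(self, ch, prior_ch):
        super().__init__()
        self.stl = STL(prior_ch, ch)     # STL class from the sketch above
        self.cir = CIR(ch, ch)
    def forward(self, x, prior_feat):
        return x + self.cir(self.stl(x, prior_feat))

class Generator(nn.Module):
    def __init__(self, prior_ch=128):
        super().__init__()
        self.pad = nn.ReflectionPad2d(3)                 # boundary padding
        self.enc = nn.Sequential(CIR(3, 64, 7, 1, 0),    # Genc: 3 CIR layers
                                 CIR(64, 128, 3, 2, 1), CIR(128, 256, 3, 2, 1))
        self.blocks = nn.ModuleList(ResidualSTLBlock(256, prior_ch) for _ in range(9))
        self.dec = nn.Sequential(CTIR(256, 128), CTIR(128, 64),   # Gdec
                                 nn.ReflectionPad2d(3), nn.Conv2d(64, 3, 7), nn.Tanh())

    def forward(self, ir_img, prior_feat):
        # prior_feat is assumed to match the converter's spatial resolution
        h = self.enc(self.pad(ir_img))
        for blk in self.blocks:          # converter: feature enhancement
            h = blk(h, prior_feat)
        return self.dec(h)               # reconstructed to the original size
```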
In this embodiment, the discriminator uses the PatchGAN structure, which outputs an N×N matrix in which each element judges one image patch; it mainly captures differences in local receptive field information and retains high resolution and high detail in assessing image definition. Each time, a 70×70 image block of the original picture is judged for authenticity; the final output is a 30×30 matrix, and the mean of this matrix is taken as the True/False output.
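A sketch of such a discriminator, following the common 70×70 PatchGAN layout (the layer widths are conventional assumptions); for a 256×256 input, the patch map comes out 30×30 as described:

```python
# 70x70 PatchGAN discriminator sketch: outputs a patch decision map
# whose mean serves as the real/fake score.
import torch.nn as nn

class PatchGAN(nn.Module):
    def __init__(self, in_ch=3):
        super().__init__()
        def block(i, o, s):              # Conv + InstanceNorm + LeakyReLU
            return nn.Sequential(nn.Conv2d(i, o, 4, s, 1),
                                 nn.InstanceNorm2d(o), nn.LeakyReLU(0.2, True))
        self.net = nn.Sequential(
            nn.Conv2d(in_ch, 64, 4, 2, 1), nn.LeakyReLU(0.2, True),
            block(64, 128, 2), block(128, 256, 2), block(256, 512, 1),
            nn.Conv2d(512, 1, 4, 1, 1))  # N x N patch decisions

    def forward(self, x):
        patch_map = self.net(x)          # 30x30 for a 256x256 input
        return patch_map.mean()          # average -> True/False score
```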
As shown in fig. 5, the process of contrastive learning between the target image and the input image using the attention module comprises:
Step 1, extracting multi-layer features of the source-domain and target-domain images: the source-domain image (the facial thermal infrared input image) and the target-domain image (the synthesized facial visible light image) each pass through the encoder Genc and a two-layer MLP network H_l. The facial thermal infrared input image is encoded by Genc into multi-layer feature maps; L of the encoded feature layers are selected, each with S spatial positions, and the features of each layer are projected into a shared embedding space by the 2-layer MLP, where different spatial positions in different layers represent different image patches. The mapped patch features are {y_l}_L = {H_l(Genc^l(x))}_L, where y_l denotes the output features of the l-th selected layer, l ∈ {1, 2, ..., L}. The target-domain image is handled correspondingly: among the encoded and mapped patches of the target-domain synthesized facial visible light image, one patch is taken as the anchor, the patch at the corresponding position of the source-domain facial thermal infrared input image is taken as the positive sample, and patches at other positions of the same facial thermal infrared input image are taken as negative samples.
Step 2, the attention module in contrastive learning. The attention module mainly addresses how to choose the positions of the positive and negative samples for contrastive learning: constraining image patches at random positions may be unsuitable, because some positions of the image contain little source-domain saliency information. Only patches carrying the salient information of the domain should be selected, and a contrastive loss built in this way better guarantees cross-domain consistency. The model provides two attention methods, global attention and local attention, for constructing the contrastive learning loss; in this embodiment, global attention is used.
Step 2.1, the source-domain facial thermal infrared input image and the target-domain synthesized facial visible light image pass through the encoder Genc and the two-layer MLP network H_l to obtain the feature maps F_H ∈ R^(C×H×W) and F_V ∈ R^(C×H×W), respectively. In this embodiment the global attention method is used. First, reshape and transpose operations are applied to the facial thermal infrared features to obtain the two-dimensional matrices Q_H ∈ R^(HW×C) and V_H ∈ R^(HW×C); Q_H is multiplied by its transpose K_H ∈ R^(C×HW) to obtain a matrix, and a Softmax normalization is applied to each row to obtain the global attention matrix A_global ∈ R^(HW×HW). Entropy can serve as an index of feature importance, so the importance of features is measured by the entropy value Hs of each row of A_global. With i and j indexing the rows and columns of A_global, the entropy Hs is computed as:

Hs(i) = -Σ_j A_global(i, j) · log A_global(i, j)

After computing the entropy of each row of A_global, the rows are sorted in ascending order of entropy and the N rows with the smallest entropy are selected as the reduced global attention matrix A_global-s ∈ R^(N×HW). This matrix routes the value features V_H ∈ R^(HW×C) of the source-domain facial thermal infrared image and V_V ∈ R^(HW×C) of the target-domain synthesized facial visible light image.

As shown in fig. 5, A_global-s is applied to the features of the facial thermal infrared input image and the synthesized facial visible light image, and the corresponding value features V_H and V_V are routed to form the anchors, positive samples, and negative samples. The positive and negative samples lie in the source-domain facial thermal infrared input image and the anchors lie in the target-domain synthesized facial visible light image, from which the corresponding global contrastive loss is established.
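A sketch of this entropy-based routing for a single feature layer; the small epsilon inside the logarithm is a numerical-stability assumption:

```python
# Global attention routing sketch (step 2.1): the entropy of each
# attention row selects the N most salient query positions, whose value
# features are routed from both domains to build the contrastive pairs.
import torch
import torch.nn.functional as F

def route_global(feat_h, feat_v, n_select):
    # feat_h, feat_v: (C, H, W) features of the thermal-IR / synthesized images
    C, H, W = feat_h.shape
    q_h = feat_h.reshape(C, H * W).t()          # Q_H in R^(HW x C)
    v_h = q_h                                   # V_H shares the reshaped features
    v_v = feat_v.reshape(C, H * W).t()          # V_V in R^(HW x C)
    attn = F.softmax(q_h @ q_h.t(), dim=1)      # A_global in R^(HW x HW)
    ent = -(attn * (attn + 1e-8).log()).sum(dim=1)   # entropy Hs per row
    idx = ent.argsort()[:n_select]              # N smallest-entropy rows
    return v_h[idx], v_v[idx]                   # routed value features
```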
Step 2.2, the local attention method. The difference is that local attention uses a k×k window of fixed size that slides over the source-domain facial thermal infrared input image with a stride of 1, which strengthens spatial information interaction and connection within a local region. The facial thermal infrared feature map first passes through reshape and transpose to obtain the two-dimensional matrices Q_H ∈ R^(HW×C) and V_H ∈ R^(HW×C). Unlike global attention, Q_H is multiplied by its local transpose K_local ∈ R^(C×k^2) (taken per spatial position) to obtain a matrix, and a Softmax normalization is applied to each row to obtain the local attention matrix A_local ∈ R^(HW×k^2). The importance of features is again measured by the entropy value Hs of each row of A_local; as with global attention, the N rows with the smallest entropy are selected in ascending order to form the reduced local attention matrix A_local-s, which then routes the values V_H-local of the source-domain facial thermal infrared image and V_V-local of the target-domain synthesized facial visible light image, finally constructing the corresponding multi-layer local contrastive loss.
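A corresponding sketch of the local variant, using a stride-1 unfold to gather each position's k×k neighborhood (zero padding at the borders is an assumption):

```python
# Local attention sketch (step 2.2): each query attends only to its
# k x k neighborhood, giving A_local in R^(HW x k^2); entropy-based
# row selection proceeds as in the global case.
import torch
import torch.nn.functional as F

def route_local(feat_h, feat_v, k, n_select):
    C, H, W = feat_h.shape
    q = feat_h.reshape(C, H * W).t()                         # Q_H: (HW, C)
    # Local keys: the k*k neighbors of every position (stride-1 window).
    keys = F.unfold(feat_h.unsqueeze(0), k, padding=k // 2)  # (1, C*k*k, HW)
    keys = keys.reshape(C, k * k, H * W).permute(2, 0, 1)    # (HW, C, k*k)
    attn = F.softmax(torch.bmm(q.unsqueeze(1), keys).squeeze(1), dim=1)  # (HW, k*k)
    ent = -(attn * (attn + 1e-8).log()).sum(dim=1)           # entropy per row
    idx = ent.argsort()[:n_select]
    v_h = q[idx]                                             # routed V_H-local
    v_v = feat_v.reshape(C, H * W).t()[idx]                  # routed V_V-local
    return v_h, v_v
```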
The contrastive loss function L_Con is established from the source-domain facial thermal infrared image and the target-domain synthesized facial visible light image:

L_Con(V, H+, H-) = -log( exp(V·H+/τ) / ( exp(V·H+/τ) + Σ_{n=1}^{N-1} exp(V·H-_n/τ) ) )

where τ = 0.07 is a temperature hyperparameter, V is the anchor from the target-domain synthesized facial visible light image, and H+ and H- are the positive sample and the N-1 negative samples from the source-domain facial thermal infrared image, respectively. This contrastive loss is denoted L_ConH.
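A sketch of this loss for one anchor; the L2 normalization of the embeddings is an assumption borrowed from common contrastive-learning setups:

```python
# Contrastive (InfoNCE-style) loss sketch matching the formula above:
# v is the anchor, h_pos the positive, h_negs the N-1 negatives, tau = 0.07.
import torch
import torch.nn.functional as F

def contrastive_loss(v, h_pos, h_negs, tau=0.07):
    # v: (C,), h_pos: (C,), h_negs: (N-1, C)
    v, h_pos = F.normalize(v, dim=0), F.normalize(h_pos, dim=0)
    h_negs = F.normalize(h_negs, dim=1)
    logits = torch.cat([(v * h_pos).sum().view(1),   # positive similarity
                        h_negs @ v]) / tau           # negative similarities
    # cross-entropy with target 0 == -log softmax(logits)[positive]
    return F.cross_entropy(logits.unsqueeze(0), torch.zeros(1, dtype=torch.long))
```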
A positive sample and N-1 negative samples are likewise selected within the target-domain synthesized facial visible light image itself. Similar to an identity preservation loss, this keeps the features and structure of G(H) close to those of H and prevents the generator from altering the synthesized face image excessively. This loss is denoted L_ConG(H).
The corresponding facial gradient maps ∇I_G(H) and ∇I_V are extracted from the target-domain synthesized facial visible light image I_G(H) and the ground-truth facial visible light image I_V (GT), and the gradient enhancement loss L_Gm is constructed from them. The gradient enhancement loss reduces facial artifacts and preserves better facial contour information.
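A sketch of one plausible form of L_Gm, assuming Sobel gradient maps compared under an L1 distance (the source names the loss but not the operator):

```python
# Gradient enhancement loss sketch: depthwise Sobel gradients of the
# synthesized and ground-truth images compared with an L1 distance.
import torch
import torch.nn.functional as F

def gradient_map(img):                   # img: (B, C, H, W)
    sobel_x = torch.tensor([[-1., 0., 1.], [-2., 0., 2.], [-1., 0., 1.]])
    sobel_y = sobel_x.t()
    kx = sobel_x.view(1, 1, 3, 3).repeat(img.size(1), 1, 1, 1)
    ky = sobel_y.view(1, 1, 3, 3).repeat(img.size(1), 1, 1, 1)
    gx = F.conv2d(img, kx, padding=1, groups=img.size(1))
    gy = F.conv2d(img, ky, padding=1, groups=img.size(1))
    return torch.sqrt(gx ** 2 + gy ** 2 + 1e-8)   # gradient magnitude

def gradient_loss(fake_vis, real_vis):   # L_Gm
    return F.l1_loss(gradient_map(fake_vis), gradient_map(real_vis))
```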
Considering that many thermal infrared-visible light datasets provide corresponding paired images in both domains, the unsupervised algorithm can be extended by enforcing an additional constraint that minimizes the L1 distance between the synthesized image and the real visible light image. This supervised loss is complementary to the unsupervised losses of CUT, and the additional regularization supplements the unsupervised image synthesis algorithm. This loss is called the pixel-level consistency loss and is denoted L_Pcl, where I_V is the corresponding ground-truth facial visible light image (GT):

L_Pcl = ||I_G(H) - I_V||_1
For the adversarial loss between the generator G and the discriminator D, the facial thermal infrared input image is denoted I_H, and the pseudo visible light image produced by the face generation network model is denoted I_G(H). The skin color label condition is denoted Z: the skin color information is divided into one-hot encoded representations, and during training the class labels and the image data are concatenated (cat operation) as input. The face parsing map (the facial prior information condition) is denoted P, and ξ denotes the minimum L1 loss between the facial visible light image synthesized by the generator network and the real facial visible light image. The generator network learns the mapping conditioned on the facial prior information and the face label condition, G: {I_H, Z, P} → I_G(H), so its generative adversarial loss is:

L_gan = E[log D(I_V | Z)] + E[log(1 - D(G(I_H | Z, P) | Z))]

and the generator objective is G* = arg min_G max_D L_gan + ξ.
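A sketch of this conditional adversarial training in its non-saturating BCE form (an assumption consistent with the log terms above); z_map is the one-hot skin label broadcast to image resolution before the cat operation:

```python
# Conditional adversarial loss sketch: the skin-color condition Z is
# concatenated with the image before the discriminator. D must be built
# with the extra condition channels in its input.
import torch
import torch.nn.functional as F

def d_loss(D, real_vis, fake_vis, z_map):
    real_score = D(torch.cat([real_vis, z_map], dim=1))
    fake_score = D(torch.cat([fake_vis.detach(), z_map], dim=1))
    return (F.binary_cross_entropy_with_logits(real_score, torch.ones_like(real_score)) +
            F.binary_cross_entropy_with_logits(fake_score, torch.zeros_like(fake_score)))

def g_adv_loss(D, fake_vis, real_vis, z_map):
    fake_score = D(torch.cat([fake_vis, z_map], dim=1))
    adv = F.binary_cross_entropy_with_logits(fake_score, torch.ones_like(fake_score))
    return adv + F.l1_loss(fake_vis, real_vis)   # xi: L1 to the real image
```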
The total loss function of the model is:
L = λ1·L_ConH + λ2·L_ConG(H) + λ3·L_Pcl + λ4·L_Gm + λ5·L_gan
where λ1, λ2, λ3, λ4, and λ5 are the weighting hyperparameters of the contrastive learning loss, the identity-preserving contrastive learning loss, the pixel-level consistency loss, the gradient enhancement loss, and the generative adversarial loss, respectively.
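Assembled as a weighted sum; the λ values below are placeholders, since the source does not specify them:

```python
# Total loss sketch: weighted sum of the five terms (placeholder lambdas).
def total_loss(l_con_h, l_con_gh, l_pcl, l_gm, l_gan,
               lambdas=(1.0, 1.0, 1.0, 1.0, 1.0)):
    l1, l2, l3, l4, l5 = lambdas
    return l1 * l_con_h + l2 * l_con_gh + l3 * l_pcl + l4 * l_gm + l5 * l_gan
```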
Within the contrastive learning framework, the invention uses the face parsing map as prior knowledge to guide the generator network in learning the local texture information of the face image. In particular, a spatial feature transformation mapping layer (STL) is introduced that generates a pair of modulation parameters from the face parsing map features as the prior condition; the facial features of the generator network are affine-transformed according to these modulation parameters, adaptively optimizing the generation of the face image. Meanwhile, the invention designs a facial gradient enhancement loss that reduces facial artifacts. In addition, the invention adds skin color label condition information to the input facial thermal infrared image and the paired visible light image, so that the generated image restores the corresponding skin color information as faithfully as possible.
While the foregoing describes embodiments, aspects, and advantages of the present invention, it will be understood that the above embodiments are merely exemplary; any changes, substitutions, and alterations made without departing from the spirit and principles of the invention fall within its scope.