Disclosure of Invention
The invention aims to provide a sparse new view angle synthesis method based on multi-scale feature fusion, which can greatly improve the processing efficiency and synthesis quality of sparse multi-view angle data, remarkably reduce the dependence of a model on the number of input views, simultaneously preserve important details and global structures in an image, and improve the fidelity and accuracy of new view angle generation results.
A sparse new view image synthesis method based on multi-scale feature fusion comprises the following steps:
S1, multi-scale reference point generation and feature sampling:
generating reference points of different scales according to the resolution of an input image, sampling on a corresponding multi-scale feature map by using the reference points, extracting features of each scale and splicing to obtain initial feature information;
S2, multi-receptive-field residual feature extraction:
performing depthwise convolution with different kernel sizes on the sampled initial feature information by means of multi-receptive-field convolution and a residual connection mechanism; the multiple receptive fields acquire information at different scales, while the residual connections preserve the original features and fuse the features at each scale, finally producing the multi-receptive-field residual features;
S3, feature aggregation and image generation based on an attention network:
taking the fused multi-scale depth features (i.e. the residual features) as input, processing them layer by layer through the self-attention modules of a GPNR model, and gradually aggregating and aligning the multi-view information by capturing the global dependencies among different views, thereby completing the deep fusion and expression of the features and generating a new view image for the sparse scene;
S4, pre-training the new view angle synthesis model:
pre-training the model on a large-scale general dataset containing a large number of dense images of different scenes, iterating repeatedly over different objects from multiple datasets so as to cover a wide variety of visual scenes and object types, and optimizing the model weights through loss computation and back propagation, so that the model learns general visual features and high-level semantic features suitable for the sparse new view synthesis task;
S5, fine-tuning of the pre-trained model:
further optimizing the pre-trained model through transfer learning: transferring its parameters to a new scene, fine-tuning on the new scene data under sparse view angles, and updating the model parameters through back propagation, which accelerates training and rendering of the new scene and makes the model better suited to the specific sparse scene;
The new view angle image can be generated by using the finally formed model.
In step S1, reference points of several scales are first generated according to the resolution of the input image. The reference points are key points used to mark pixel positions in the feature maps. An automatic selection method is adopted: the image is divided into grids of fixed size, the grid centre points are taken as reference points, and after normalizing their coordinates, the reference point sets of the different scales are merged. Feature sampling is then performed on the corresponding multi-scale feature maps (C1, C2, C3): the normalized reference point coordinates are used to locate index positions in the flattened feature matrix, and the feature values extracted at these positions are mapped back to the multi-scale structure. The sampled multi-scale features are finally spliced and fused into a unified feature representation C. This process is expressed as:
C=[Sample(C1),Sample(C2),Sample(C3)]
Wherein Sample (·) represents multi-scale feature sampling based on a reference point, and C contains the fused multi-scale features, providing abundant multi-scale information for subsequent processing.
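For illustration only, a minimal JAX sketch of this reference point generation and sampling procedure is given below; the helper names (generate_reference_points, sample_features), the toy resolutions and the nearest-index gather used for sampling are assumptions made for the example, not the exact implementation of the invention.

```python
import jax.numpy as jnp

def generate_reference_points(H, W):
    # Grid-centre reference points for one scale, normalized to [0, 1]
    # (matches "grid center points are used as the reference points").
    xs = (jnp.arange(W) + 0.5) / W
    ys = (jnp.arange(H) + 0.5) / H
    gx, gy = jnp.meshgrid(xs, ys)                         # (H, W) each
    return jnp.stack([gx.ravel(), gy.ravel()], axis=-1)   # (H*W, 2)

def sample_features(feat_flat, ref_pts, H, W):
    # feat_flat: (H*W, C) flattened feature map of this scale.
    # Map normalized coordinates to flat indices and gather feature values.
    col = jnp.clip((ref_pts[:, 0] * W).astype(jnp.int32), 0, W - 1)
    row = jnp.clip((ref_pts[:, 1] * H).astype(jnp.int32), 0, H - 1)
    return feat_flat[row * W + col]                       # (num_points, C)

# Example with hypothetical shapes for the three scales C1, C2, C3.
H, W, C = 32, 32, 64
shapes = [(2 * H, 2 * W), (H, W), (H // 2, W // 2)]
feats = [jnp.ones((h * w, C)) for (h, w) in shapes]       # stand-ins for C1, C2, C3
sampled = [sample_features(f, generate_reference_points(h, w), h, w)
           for f, (h, w) in zip(feats, shapes)]
C_fused = jnp.concatenate(sampled, axis=0)   # C = [Sample(C1), Sample(C2), Sample(C3)]
```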
In step S2, the specific steps of extracting the residual features of the multiple receptive fields are as follows:
First, the multi-scale features C are linearly transformed through a fully connected layer: different weights and biases are applied to each channel, the input features are re-weighted and combined by matrix operations, and the feature relations among the channels are adjusted and recombined, producing a linearly transformed feature representation (i.e. the feature information):
FC1(C)=W·C+b
where C is the multi-scale feature generated in step S1, W and b are parameters obtained after training to help the model fit the data better, and FC1 represents the first fully connected layer (Fully Connected Layer).
Depthwise convolution is then applied to the linearly transformed features using multi-receptive-field convolution: the features are divided along the channel dimension, and convolution kernels of different sizes, corresponding to different receptive fields, perform independent depthwise convolution on each part. The specific formula is:
Conv(x)=[Conv3×3(x1),Conv5×5(x2),Convk×k(x3)]
where x1, x2, x3 are the three parts of the input feature x divided along the channel dimension (the channel dimension corresponds to the number of features extracted by the network); x1, x2, x3 are convolved with kernels of sizes 3×3, 5×5 and k×k respectively to generate features at different scales. Conv denotes the convolution operation and k is a user-defined value, set to 7 in the invention;
After the convolution at each scale, a residual connection is added: the convolved features are fused with the original input features, whose channels remain unchanged, and the two feature representations are combined through the residual connection operation:
x=[Conv(x),identity(x)]
Where identity (x) indicates that the input feature x is passed directly, without any manipulation, i.e. the input feature is preserved.
The fused multi-scale features are then projected back to the original channel number: a fully connected layer applies a linear transformation with a weight matrix and bias, producing a feature representation with the same number of channels as the input features:
F=FC2([x1,x2,x3])
Where FC2 represents a second fully connected layer that maps the spliced multi-scale features back to the target channel number.
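The following minimal JAX sketch illustrates one possible form of the multi-receptive-field residual block of step S2 (FC1, three-way channel split, 3×3/5×5/7×7 depthwise convolutions, residual concatenation and FC2); the parameter dictionary, initialisation and tensor shapes are illustrative assumptions, not the exact implementation.

```python
import jax
import jax.numpy as jnp

def depthwise_conv(x, kernel):
    # x: (B, H, W, C); kernel: (k, k, 1, C) -> one filter per channel.
    return jax.lax.conv_general_dilated(
        x, kernel, window_strides=(1, 1), padding='SAME',
        dimension_numbers=('NHWC', 'HWIO', 'NHWC'),
        feature_group_count=x.shape[-1])

def multi_rf_residual_block(x, params):
    # params is a hypothetical dict of weights; W1/b1 and W2/b2 are the two
    # fully connected layers FC1 and FC2, k3/k5/k7 the depthwise kernels.
    h = x @ params['W1'] + params['b1']                       # FC1
    x1, x2, x3 = jnp.split(h, 3, axis=-1)                     # channel split
    y = jnp.concatenate([depthwise_conv(x1, params['k3']),    # 3x3 receptive field
                         depthwise_conv(x2, params['k5']),    # 5x5 receptive field
                         depthwise_conv(x3, params['k7'])],   # 7x7 receptive field (k = 7)
                        axis=-1)
    y = jnp.concatenate([y, h], axis=-1)                      # residual: [Conv(x), identity(x)]
    return y @ params['W2'] + params['b2']                    # FC2 maps back to target channels

# Hypothetical shapes: B=2, 16x16 patches, C=48 channels (divisible by 3).
B, H, W, C = 2, 16, 16, 48
key = jax.random.PRNGKey(0)
params = {
    'W1': jax.random.normal(key, (C, C)) * 0.02, 'b1': jnp.zeros(C),
    'k3': jax.random.normal(key, (3, 3, 1, C // 3)) * 0.02,
    'k5': jax.random.normal(key, (5, 5, 1, C // 3)) * 0.02,
    'k7': jax.random.normal(key, (7, 7, 1, C // 3)) * 0.02,
    'W2': jax.random.normal(key, (2 * C, C)) * 0.02, 'b2': jnp.zeros(C),
}
out = multi_rf_residual_block(jnp.ones((B, H, W, C)), params)   # (B, H, W, C)
```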
In step S3, the fused multi-scale features are input into GPNR (Generalizable Patch-based Neural Rendering) for training. The GPNR model receives the fused multi-view features as input and generates the feature expression of the target view using neural rendering. By processing the input features, GPNR can effectively reconstruct and complete the information missing in the sparse scene and finally output a high-quality new view image under sparse conditions, providing complete and consistent feature support for subsequent synthesis and presentation. Reference: Suhail, M., Esteves, C., Sigal, L., Makadia, A., 2022a. Generalizable patch-based neural rendering, in: European Conference on Computer Vision, Springer, pp. 156–174.
In step S5, the weights and biases of the pre-trained model are loaded to initialize the new model, i.e. they serve as the initial parameters for the new scene, and the model is then quickly fine-tuned on the new scene data under sparse view angles. During fine-tuning, the parameters θ0 of the pre-trained model are used as initial values and the model parameters θ are updated by minimizing a weighted loss function expressed as:
L(θ)=Lpre(θ)+λ·Lnew(θ)
where Lpre(θ) represents the loss of the pre-trained model, Lnew(θ) represents the loss term on the new scene data, and λ weights the two terms. Under a small number of camera view angles, the parameter θ is updated by gradient descent and the model quickly adapts to the characteristics of the new scene, which accelerates training and rendering while improving the quality of new view synthesis.
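A minimal sketch of this fine-tuning update is shown below, assuming a toy parameter tree and placeholder loss terms loss_pre and loss_new standing in for the two components of the weighted loss; it is a plain gradient-descent step, not the full training pipeline.

```python
import jax
import jax.numpy as jnp

# Placeholder loss terms: loss_pre stands in for the pre-trained model's loss
# and loss_new for the penalty on the new sparse-scene data (both are toy
# assumptions, not the actual terms of the invention).
def loss_pre(theta, batch):
    pred = batch['x'] @ theta['W'] + theta['b']
    return jnp.mean((pred - batch['y']) ** 2)

def loss_new(theta, batch):
    return sum(jnp.sum(p ** 2) for p in jax.tree_util.tree_leaves(theta))

def finetune_step(theta, batch, lr=2e-5, lam=0.01):
    # theta is initialised from the pre-trained parameters theta_0 and updated
    # by gradient descent on the weighted loss L(theta) = Lpre + lam * Lnew.
    grads = jax.grad(lambda p: loss_pre(p, batch) + lam * loss_new(p, batch))(theta)
    return jax.tree_util.tree_map(lambda w, g: w - lr * g, theta, grads)

theta0 = {'W': jnp.zeros((8, 3)), 'b': jnp.zeros(3)}      # pre-trained weights (toy)
batch = {'x': jnp.ones((4, 8)), 'y': jnp.ones((4, 3))}    # new-scene data (toy)
theta = finetune_step(theta0, batch)
```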
The beneficial effects are that:
Compared with the prior art, the invention has the following advantages:
The invention provides a sparse new view synthesis method based on multi-scale feature fusion, which combines multi-scale feature processing with a residual mechanism to extract depth features. Compared with the prior art, the method makes full use of the multi-scale features and effectively combines them with the residual structure. Introducing multi-receptive-field convolution and residual connections into the multi-scale feature fusion strengthens the feature expression capability, allows the model to capture depth information more accurately, and improves the rendering quality and global consistency in sparse view scenes. In addition, key layers of the pre-trained model are selectively unfrozen for fine-tuning, so that the model quickly adapts to sparse view scenes, markedly improving training efficiency and adaptation capability.
Detailed Description
The invention is further described below with reference to specific examples and figures:
Example 1
Task definition
Assume that a set of sparsely sampled multi-view images is given as input, where each image corresponds to a different camera view angle, and the camera pose, including the intrinsic and extrinsic camera matrices, is accurately computed with the COLMAP tool. The camera pose provides the position and orientation of the camera in three-dimensional space and determines the view angle of the input image; by adjusting the camera pose, images observed from different angles and directions can be generated. The invention aims to achieve new view reconstruction from sparse input images by means of neural rendering and to markedly improve the reconstruction quality.
The invention discloses a sparse new view angle synthesis method based on multi-scale feature fusion, which comprises the following steps:
(1) Multi-scale reference point generation and feature sampling
First, multi-scale spatial shapes are generated according to the resolution (H, W) of the input image, for the subsequent segmentation and reshaping of the multi-scale features.
Defining the spatial shape of the multi-scale feature map as:
(Hi,Wi)=[(H×2,W×2),(H,W),(H/2,W/2)]
These spatial shapes (Hi,Wi) describe the size of each scale feature map, high resolution, medium resolution, low resolution, respectively.
For each feature map scale, reference points are generated by a uniform grid for spatial localization of the multi-scale features. The reference point coordinates xi and yi are defined as:
xi=[0.5,1.5,2.5,…,Wi-0.5],yi=[0.5,1.5,2.5,…,Hi-0.5]
A two-dimensional grid is generated from xi and yi and flattened to one dimension, giving, for scale i, the x and y coordinates of the reference points:
(x,y), x∈{0.5,1.5,…,Wi−0.5}, y∈{0.5,1.5,…,Hi−0.5}
All reference point coordinates are normalized to [0,1] to obtain the normalized coordinates:
x̂=x/Wi, ŷ=y/Hi
The reference points of all scales are then combined to form a unified reference point set. Let Nscales=3 correspond to the three scales; the final set of reference points is:
P={P1,P2,P3}
where Pi (i=1, 2, 3) denotes the normalized reference points of the i-th scale.
Next, the starting index of each scale is calculated to locate the position of each scale's features within the flattened feature matrix. The starting indices are computed as:
L0=0, Li=Li-1+Hi×Wi
The flattened features are then mapped back to the multi-scale feature maps using these reference points and the starting indices. Given the flattened feature matrix C∈R^(B×N×C), B is the batch size, N is the total number of reference points over all scales, and C is the number of channels.
For each scale i, extracting a corresponding feature segment:
starti=Li-1
endi=Li
Ci=C[:,starti:endi,:]
Then, multi-scale feature fusion is carried out on the feature maps of all scales to generate a multi-scale feature pyramid:
Pyramid={C1,C2,C3}
and then input into a feature extraction network for subsequent multi-receptive field feature extraction and fusion.
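As a concrete illustration of the starting-index computation and per-scale slicing above, a short JAX sketch follows; the resolutions, batch size and channel count are hypothetical.

```python
import jax.numpy as jnp

# Hypothetical resolutions for the three scales (2H x 2W, H x W, H/2 x W/2).
H, W, C, B = 16, 16, 32, 1
spatial_shapes = [(2 * H, 2 * W), (H, W), (H // 2, W // 2)]

# Starting index of each scale inside the flattened feature matrix:
# L_0 = 0, L_i = L_{i-1} + H_i * W_i.
lengths = [h * w for (h, w) in spatial_shapes]
starts = [0]
for n in lengths:
    starts.append(starts[-1] + n)

feat = jnp.ones((B, sum(lengths), C))        # flattened features, shape (B, N, C)

# Extract the feature segment of each scale and reshape it back to (B, H_i, W_i, C).
pyramid = [feat[:, starts[i]:starts[i + 1], :].reshape(B, h, w, C)
           for i, (h, w) in enumerate(spatial_shapes)]
```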
(2) Multi-receptive field residual feature extraction
A multi-receptive-field residual feature extraction module is applied to the multi-scale feature pyramid to further extract and fuse depth features. First, the input features are linearly transformed using a fully connected layer:
Xi=FC1(Xi)
For each scale feature Xi (i=1, 2, 3), Xi is first divided evenly into two parts along the channel dimension:
Xi=[Xi1,Xi2]
Here, Xi1 and Xi2 denote the two sub-tensors after the split, each containing half of the channels of Xi, i.e. the split is made evenly along the channel dimension.
A depth separable convolution of different kernel sizes is then applied to each sub-tensor:
Yi1=Conv3×3(Xi1)
Yi2=Conv5×5(Xi2)
where Convk×k represents a depthwise separable convolution with a kernel size of k×k.
The convolved features are then stitched in the channel dimension:
Yi=[Yi1,Yi2]
note that the stitching operation is performed here along the channel dimension.
An activation function (GELU) and a batch normalization operation are then applied: the GELU activation function is first applied to the spliced features to enhance the nonlinear expression capability, followed by batch normalization:
Zi=BN(GELU(Yi))
where the activation function GELU is defined as:
GELU(x)=x·Φ(x)
with Φ(·) the cumulative distribution function of the standard normal distribution. The batch normalization is mathematically expressed as:
BN(Yi)=γi·(Yi−μi)/√(σi²+ε)+βi
where μi denotes the mean of the i-th scale feature, σi² denotes its variance, ε is a small constant for numerical stability, and γi and βi are trainable parameters with initial values γi=1, βi=0.
The processed features are then stitched with the initial input features in the channel dimension (residual connection):
Z′i=[Zi,Xi]
finally, the features of all dimensions are spliced along the channel dimension to form a final feature representation:
Z=[Z′1,Z′2]
At this time, Z∈R^(B×H×W×2C), where B is the batch size, H and W are the height and width of the feature map, respectively, and C is the number of channels. The last layer is the second fully connected layer, which maps the features back to the output dimension:
Zout=FC2(Z)
the second fully-connected layer functions to map the high-dimensional features of the network to the desired output dimensions for subsequent feature aggregation tasks.
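The sketch below mirrors the Example 1 variant of this module: a two-way channel split with 3×3 and 5×5 depthwise convolutions, GELU activation, batch normalization with initial values γ=1 and β=0, and the residual concatenation Z′i=[Zi,Xi]; the kernel values and tensor shapes are illustrative assumptions.

```python
import jax
import jax.numpy as jnp

def depthwise_conv(x, kernel):
    # Depthwise separable convolution; kernel shape (k, k, 1, channels).
    return jax.lax.conv_general_dilated(
        x, kernel, (1, 1), 'SAME',
        dimension_numbers=('NHWC', 'HWIO', 'NHWC'),
        feature_group_count=x.shape[-1])

def batch_norm(x, gamma=1.0, beta=0.0, eps=1e-5):
    # Batch normalization with the initial values gamma=1, beta=0 from the text.
    mu = jnp.mean(x, axis=(0, 1, 2), keepdims=True)
    var = jnp.var(x, axis=(0, 1, 2), keepdims=True)
    return gamma * (x - mu) / jnp.sqrt(var + eps) + beta

def example1_block(x, k3, k5):
    # Split X_i into two halves along the channel dimension, apply 3x3 / 5x5
    # depthwise convolutions, GELU and BN, then concatenate with the input
    # as the residual connection Z'_i = [Z_i, X_i] (channels double to 2C).
    x1, x2 = jnp.split(x, 2, axis=-1)
    y = jnp.concatenate([depthwise_conv(x1, k3), depthwise_conv(x2, k5)], axis=-1)
    z = batch_norm(jax.nn.gelu(y))
    return jnp.concatenate([z, x], axis=-1)

B, H, W, C = 1, 16, 16, 32
key = jax.random.PRNGKey(0)
z_prime = example1_block(jnp.ones((B, H, W, C)),
                         jax.random.normal(key, (3, 3, 1, C // 2)) * 0.02,
                         jax.random.normal(key, (5, 5, 1, C // 2)) * 0.02)
```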
(3) Transformer-based multi-view feature aggregation model
In new view synthesis, the GPNR model achieves new view image synthesis under sparse view conditions through Transformer-based multi-view feature aggregation. Its core idea is to use a multi-head self-attention mechanism to capture the global relations among different view angles and to gradually aggregate depth and feature information in combination with geometric consistency modeling. Reference: Suhail, M., Esteves, C., Sigal, L., Makadia, A., 2022a. Generalizable patch-based neural rendering, in: European Conference on Computer Vision, Springer, pp. 156–174.
The method inputs the obtained depth features into the GPNR network, which performs multi-view feature aggregation and new view image generation:
Foutput=GPNR(Zout)
Where Foutput is a feature representation used to predict the target view, which can be used to further generate an image of the new view.
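GPNR itself is described in the cited paper; purely as an illustration of the kind of attention-based multi-view aggregation relied on here, a single-head self-attention sketch over per-view feature vectors is given below. The projection matrices, mean pooling step and dimensions are assumptions and do not reproduce the GPNR architecture.

```python
import jax
import jax.numpy as jnp

def attention_aggregate(view_feats, Wq, Wk, Wv):
    # view_feats: (num_views, D) fused features, one row per reference view.
    # Single-head self-attention capturing global dependencies between views,
    # followed by mean pooling into one target-view feature.
    q, k, v = view_feats @ Wq, view_feats @ Wk, view_feats @ Wv
    attn = jax.nn.softmax(q @ k.T / jnp.sqrt(q.shape[-1]), axis=-1)
    aggregated = attn @ v                      # (num_views, D)
    return aggregated.mean(axis=0)             # pooled target-view feature

D, V = 64, 6                                   # hypothetical: 6 input views
key = jax.random.PRNGKey(0)
Wq, Wk, Wv = (jax.random.normal(key, (D, D)) * 0.02 for _ in range(3))
f_target = attention_aggregate(jnp.ones((V, D)), Wq, Wk, Wv)
```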
(4) Generating a target image
The target feature representation output by GPNR is input into a multi-layer perceptron (MLP) to predict the color of the target ray:
ĉ=MLP(Foutput)
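A minimal sketch of such an MLP color head is shown below; the two-layer structure, hidden width and sigmoid output range are illustrative assumptions.

```python
import jax
import jax.numpy as jnp

def mlp_color_head(f, params):
    # Two-layer MLP predicting an RGB color in [0, 1] for one target ray.
    h = jax.nn.relu(f @ params['W1'] + params['b1'])
    return jax.nn.sigmoid(h @ params['W2'] + params['b2'])

D = 64
key = jax.random.PRNGKey(0)
params = {'W1': jax.random.normal(key, (D, 128)) * 0.02, 'b1': jnp.zeros(128),
          'W2': jax.random.normal(key, (128, 3)) * 0.02, 'b2': jnp.zeros(3)}
rgb = mlp_color_head(jnp.ones(D), params)      # predicted ray color, shape (3,)
```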
(5) Loss function design
In order to improve model performance, GPNR designs fine loss and regularization loss based on color supervision, enhances the fitting capability to the target scene color and texture, and simultaneously suppresses the risk of over-fitting:
The fine loss Lfine measures the color difference between the image predicted by the model and the real image and is defined as:
Lfine=(1/N)Σi‖p̂i−pi‖²
where p̂i is the predicted pixel color value, pi is the true pixel color value, and N is the total number of pixels in the image.
The regularization loss Lreg reduces the risk of overfitting by limiting the magnitude of the model weights and is defined as:
Lreg=λΣ‖w‖²
where λ is the regularization coefficient, set here to 0.01, and w denotes a trainable weight of the model.
The final total loss function is:
L=α·Lfine+Lreg
where α is a weight factor, set to 100 in the invention, to balance the relative importance of the fine loss and the regularization loss.
The loss design effectively improves the new view angle image synthesis quality of the model under the sparse view angle condition, and simultaneously ensures the training stability and generalization capability of the model.
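The loss terms above can be sketched as follows; the tensor shapes are toy values, while λ=0.01 and α=100 follow the text.

```python
import jax
import jax.numpy as jnp

def fine_loss(pred, target):
    # Mean squared color error over the N pixels of the rendered image.
    return jnp.mean(jnp.sum((pred - target) ** 2, axis=-1))

def reg_loss(params, lam=0.01):
    # L2 penalty on all trainable weights, lambda = 0.01 as in the text.
    return lam * sum(jnp.sum(w ** 2) for w in jax.tree_util.tree_leaves(params))

def total_loss(pred, target, params, alpha=100.0):
    # Weighted combination with alpha = 100 as in the text.
    return alpha * fine_loss(pred, target) + reg_loss(params)

pred = jnp.full((1024, 3), 0.5)                # predicted pixel colors (toy)
target = jnp.full((1024, 3), 0.6)              # ground-truth pixel colors (toy)
params = {'w': jnp.ones((8, 8))}               # toy parameter tree
loss = total_loss(pred, target, params)
```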
(6) Fine-tuning of the pre-trained model
The pre-trained GPNR model, a general model obtained by pre-training on a large-scale dataset, is taken as the starting point, and the invention then fine-tunes it in the target scene using a small number of images with known view angles and the corresponding geometric information. During fine-tuning, the weights of part of the high-level feature aggregation modules and of the geometric consistency modeling in the pre-trained model are frozen, and only the parameters of the low-level feature extraction module are trained to adapt to the color and texture characteristics of the target scene, by minimizing the weighted loss:
L(θ)=Lpre(θ)+λ·Lnew(θ)
where Lpre(θ) represents the loss of the pre-trained model and Lnew(θ) represents the loss term on the new scene data. Under a small number of camera view angles, the parameter θ is updated by gradient descent and the model quickly adapts to the characteristics of the new scene, which accelerates training and rendering while improving the quality of new view synthesis for the scene.
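A minimal sketch of such partial freezing is given below; grouping parameters by a name prefix ('low_level' vs. 'high_level') is an illustrative assumption, and in practice the split would follow the actual module structure of the pre-trained model.

```python
import jax.numpy as jnp

def finetune_frozen(params, grads, lr=2e-5, trainable_prefix='low_level'):
    # Update only the low-level feature-extraction parameters; parameters of the
    # high-level aggregation / geometry modules keep their pre-trained values.
    return {name: (w - lr * g if name.startswith(trainable_prefix) else w)
            for (name, w), g in zip(params.items(), grads.values())}

params = {'low_level/W': jnp.ones((4, 4)), 'high_level/W': jnp.ones((4, 4))}
grads = {'low_level/W': jnp.ones((4, 4)), 'high_level/W': jnp.ones((4, 4))}
new_params = finetune_frozen(params, grads)    # only 'low_level/W' changes
```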
Example 2
As shown in fig. 1-2, the invention provides a sparse new view angle synthesis method based on multi-scale feature fusion, which comprises the following steps:
Step 1, pre-training of the new view angle synthesis model
Preparing diversified image data sets, preprocessing, training the model on a large-scale general data set, measuring errors by using a loss function, updating weights by back propagation, repeating training until convergence, combining iterative training of multiple data sets to improve generalization capability, and finally storing optimized model weights to provide general features and initial parameters for a sparse new view angle synthesis task.
Step 2, multi-scale reference point generation and feature sampling
Firstly, a multi-scale reference point set is generated according to the resolution of the input sparse view images, and feature sampling is performed. In this process, multi-scale spatial shapes, e.g. (2H, 2W), (H, W), (H/2, W/2), provide a resolution basis for the feature maps of the different scales. A two-dimensional reference point grid is generated with a grid generation method, and the reference point coordinates are normalized into the range [0,1] to form the spatial information that guides sampling. Then, based on the reference points, feature sampling is carried out on the feature maps of different resolutions, extracting multi-scale features at high, medium and low resolution. These features are fused into a unified feature representation through a splicing operation, providing a foundation for subsequent processing.
Step 3, extracting residual characteristics of multiple receptive fields
In a multi-scale feature fusion stage, the invention designs a feature processing module based on a multi-receptive field convolution and residual error mechanism. The multi-receptive field convolution module extracts local and global context information by using depth convolution operations of different convolution kernels such as 3×3, 5×5, k×k and the like aiming at high, medium and low resolution features. After each convolution operation, the information integrity of the input features is maintained by adding residual connections and fused with the convolved features. The fused multi-scale features are further processed through an activation function and a dimension reduction operation, and finally remodeled into unified feature representation, so that the method has more abundant expression capability and lays a solid foundation for generating new view angle images.
Step 4, training a Transformer model based on multi-view feature aggregation
In the training stage based on multi-view feature aggregation, the GPNR model is introduced, with a Transformer as its core module, to progressively achieve efficient aggregation and fusion of multi-view information. The fused multi-scale features are first fed into a self-attention mechanism to extract the global correlations among features from different view angles. Then, through geometric modeling, the multi-view features are further integrated along the epipolar dimension, capturing depth consistency information along the ray directions. In the stage of fusing the reference view features, the invention adopts a weighted summation mechanism that generates the final target ray features according to the attention weight distribution, which are used to reconstruct the features of the target view.
Step 5, fine-tuning of the pre-trained model:
In the fine-tuning stage of the pre-trained model, the method uses an optimization approach based on transfer learning to accelerate training and rendering of new scenes. In the pre-training stage, the GPNR model is trained on large-scale multi-view image data and learns to fuse and reconstruct multi-view features. For a specific sparse view scene, the model parameters are further updated by fine-tuning so that they better fit the specific scene data. During fine-tuning, two loss functions are used, namely the fine prediction loss Lfine and the regularization loss Lreg, so that model overfitting is avoided. In practice, only a small number of parameters are fine-tuned, which greatly increases training speed while markedly improving the quality of the generated results.
Step 6, outputting the rendered new view angle image:
And finally, combining the target ray characteristics generated by the processing, and inputting the target ray characteristics into a rendering module to generate a new view angle image. Specifically, the model reconstructs features under sparse viewing angle conditions based on color prediction of the target rays, forms pixel values under the target viewing angle, and combines all rays to generate a complete new viewing angle image. According to the invention, through the combination of multi-scale feature fusion and multi-view information modeling, high-quality generation of new view images is realized under the sparse view condition, the generalization capability and the generation efficiency of the model are remarkably improved, the density requirement of view sampling is reduced, and the wide practical application value is realized.
In this embodiment, step 1 only needs to be executed once, steps 2 to 4 form an iterative process, and the new view angle image is finally obtained through step 5. The stopping condition of the iteration is either 20 hours of training on an NVIDIA GeForce RTX 2070 GPU, or the overall loss function reaching a threshold of 3×10−3 within 1000 iterations.
Experimental results
1. Data set
The experiments employ the Co3D dataset, which contains a rich set of categories and objects and is therefore widely used for multi-view image generation and rendering tasks. In the experimental design, we selected a variety of scenes, each using six images taken at different angles for training, rendering and evaluation. Each rendering generates three images at different view angles, and the rendering results are then compared with the real images. See: Park, E., Yang, J., Yumer, E., Ceylan, D., Berg, A.C., 2017. Transformation-grounded image generation network for novel 3D view synthesis, in: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3500–3509.
To ensure the accuracy of image generation, we employ COLMAP tools to generate the camera pose of the image. The camera pose determines from what angle the scene is viewed and directly affects the viewing angle and quality of the rendered image. By adjusting the camera pose, we have realized generating images from multiple perspectives, thereby testing the generalization ability and rendering performance of the model on multi-perspective generation.
2. Experimental setup
The experiments are developed on the JAX framework and trained with the Adam optimizer, with the initial learning rate set to η0=2×10−5. The training time for each scene is approximately 20 hours to ensure optimal performance under limited resource conditions. We run the experiments on an NVIDIA GeForce RTX 2070 GPU with the batch size set to 8. In each scene, six images at uniformly distributed view angles are selected as input, and three images at new view angles are generated.
The experiments select Peak Signal-to-Noise Ratio (PSNR), Structural Similarity Index Measure (SSIM) and Learned Perceptual Image Patch Similarity (LPIPS) as the main evaluation metrics for measuring the quality of the generated images. These metrics comprehensively reflect the accuracy, structural similarity and visual quality of the generated images relative to the real images.
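For reference, PSNR can be computed as in the short sketch below (images assumed to be scaled to [0, 1]); SSIM and LPIPS are normally taken from standard library implementations.

```python
import jax.numpy as jnp

def psnr(pred, target, max_val=1.0):
    # Peak Signal-to-Noise Ratio in dB for images scaled to [0, max_val].
    mse = jnp.mean((pred - target) ** 2)
    return 10.0 * jnp.log10(max_val ** 2 / mse)

print(psnr(jnp.full((64, 64, 3), 0.5), jnp.full((64, 64, 3), 0.55)))  # ~26 dB
```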
3. Performance comparison
We compare the baseline model, the complete model of the invention, and recent state-of-the-art methods, as follows:
GPNR [Suhail, M., Esteves, C., Sigal, L., Makadia, A., 2022a. Generalizable patch-based neural rendering, in: European Conference on Computer Vision, Springer, pp. 156–174.]: a neural radiance field model based on local patch features, which adopts a patch-based neural rendering method to decompose a complex three-dimensional scene into many small patches for rendering.
LFNR [Suhail, M., Esteves, C., Sigal, L., Makadia, A., 2022b. Light field neural rendering, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 8269–8279.]: a neural rendering model based on a light field representation, aimed at sparse view angles; by introducing the light field formulation it directly models the relation between view angles and rays, avoids the complex volume rendering computation of conventional NeRF, and achieves faster rendering together with better new view synthesis.
WAH [Bao, Y., Li, Y., Huo, J., Ding, T., Liang, X., Li, W., Gao, Y., 2023. Where and how: Mitigating confusion in neural radiance fields from sparse inputs, in: Proceedings of the 31st ACM International Conference on Multimedia, pp. 2180–2188.]: a window-aware hashing acceleration method for neural radiance fields, focused on optimizing training efficiency and inference performance in sparse sampling scenarios. By introducing a window-aware hashing mechanism, WAH effectively captures local features while avoiding the resource waste of conventional global hashing methods under sparse view conditions, thereby achieving efficient scene modeling and fast new view rendering.
(1) Comparison between the method of the invention and other methods in different scenes
Table 1 Comparison of the performance of different methods in different scenes
Comparing the characteristics and targets of all methods, every metric of the proposed method is superior to those of the other comparison methods in all scenes, and the rendered images are more realistic and sharper. As shown in Table 1, the method of the invention achieves a significant performance improvement in the various scenes, performing especially well on the PSNR and SSIM metrics. Compared with the other methods (LFNR, WAH, GPNR), the method of the invention improves the PSNR by 4-14 dB on average and reaches higher SSIM values, while the LPIPS value is markedly reduced, which shows clear advantages in detail preservation, structure restoration and perceptual quality of the rendered images. These results demonstrate the excellent performance of the method in image rendering tasks, achieving higher image quality and consistency in a variety of scenes.
As shown in FIG. 3, in the three scenes of plants, books and bears, the method of the invention is compared with GPNR, LFNR and WAH in terms of the rendered images. In the plant scene, the proposed method clearly shows the texture and edges of the leaves, while the other methods are blurrier; with LFNR and WAH in particular, the leaf details and the true texture of the background can hardly be seen. In the book scene, the proposed method accurately restores the details of the cover text and pages, while the other methods fall short in detail and color saturation: GPNR renders detail and color poorly and the text is not clear enough, LFNR is severely blurred and distorted, and the WAH rendering loses almost all details and is hard to recognize. In the bear scene, the proposed method reproduces the real texture of the fur and accurately restores the complete photo frame in the background, whose edges and content are clearly visible. These experimental results show that the method has clear advantages in improving image rendering quality; in particular, with sparse inputs it better preserves global information while improving local detail quality.
Although embodiments of the present invention have been described in connection with the accompanying drawings, various modifications and variations may be made by those skilled in the art without departing from the spirit and scope of the invention, and such modifications and variations fall within the scope of the invention as defined by the appended claims.