Background
Novel view synthesis refers to rendering an image at a target pose, given one or more source images together with their source poses and the target pose, and generally involves three-dimensional understanding of a scene. Novel view synthesis has wide application in fields such as 3D reconstruction and AR/VR. In recent years, deep learning models, especially convolutional neural networks, neural radiance fields, and three-dimensional Gaussian splatting, have played a key role in novel view synthesis; they can learn complex scene representations and illumination models to generate more realistic images. For example, three-dimensional Gaussian splatting (3D Gaussian Splatting) uses a set of Gaussian functions (typically 3D Gaussian distributions) to represent objects in a scene. The scene is decomposed into a number of Gaussian ellipsoids, each with its own center position, color, orientation, and size. At rendering time, these ellipsoids are rasterized into pixels, and the color value of each pixel is determined by an integration or sampling process, producing the final image. Compared with traditional volume rendering or mesh-based rendering, three-dimensional Gaussian splatting can achieve real-time rendering speeds.
With the rapid development of three-dimensional Gaussian splatting, novel view synthesis has made remarkable breakthroughs in rendering quality, efficiency, and interactivity, strongly promoting a broad expansion of vision-industry applications. These applications cover digital humans, autonomous-driving scene simulation, three-dimensional content generation, rapid UAV mapping, large-scale scene reconstruction, three-dimensional language fields, and the like. Typically, such commercial applications require multi-view scene pictures as input, with scene reconstruction then performed on a high-performance server. Although the training efficiency of novel view synthesis applications has improved significantly, reconstruction quality remains limited, because initialization relies on the sparse point cloud estimated by the Structure-from-Motion (SfM) technique during the camera calibration phase. Structure-from-Motion is a family of methods that automatically recover the three-dimensional structure of a scene and the camera's intrinsic and extrinsic parameters from multiple images or video sequences, typically estimating scene depth and the camera trajectory based on geometric constraints and motion models. The SfM workflow comprises feature detection, feature matching, relative pose estimation, triangulation, global optimization, dense reconstruction, and other steps. Major challenges faced by current SfM pipelines include handling occlusion, illumination changes, low-texture regions, and image noise, all of which can lead to feature-matching errors and reconstruction inaccuracies. This reliance on the efficiency and accuracy of SfM matching often means that users of service-oriented applications must wait for long periods.
Furthermore, supporting multi-view pictures captured in an arbitrary manner by arbitrary devices is of great importance for novel view synthesis applications, in particular for scenarios where the service runs on common mobile devices. Camera pose estimation methods based on Structure-from-Motion face serious challenges when the input is a sparse set of photos without pose annotations. In a sparse-view setting there may not be enough matchable texture, which directly affects the accuracy of camera pose estimation.
To address the low efficiency and inaccurate camera pose estimation of the SfM-based initialization stage of three-dimensional Gaussian splatting, two mainstream improvement strategies currently exist:
1) Employ a more accurate and robust camera pose estimation method. The core of this direction is to adopt deep neural networks based on the vision Transformer architecture to improve the accuracy and robustness of camera pose estimation. With its strong sequence-modeling capability and parallel-computing advantages, the Transformer exhibits excellent performance in handling complex spatial relationships and long-range dependencies. Specifically, by introducing a self-attention mechanism, the Transformer can effectively capture the relationships between elements in the input sequence, thereby achieving accurate camera pose estimation without explicit geometric constraints. In addition, the training process of a deep neural network automatically learns rich feature representations, which further enhances robustness to occlusion, illumination changes, and dynamic scenes and improves the reliability of the estimation results.
2) Treat the camera pose as part of the iteratively trained parameters. This strategy incorporates camera pose parameters into the iterative training process, treating them as learnable variables rather than fixed prior knowledge. The core idea is to optimize the camera pose parameters with a gradient-descent algorithm by minimizing loss functions such as reprojection error or photometric error. During training, the model continuously adjusts the camera pose until it converges to the optimal estimate.
The prior art, as analyzed, has mainly the following drawbacks:
1) The Structure-from-Motion technique can recover the motion trajectory of a camera and the three-dimensional structure of a scene from a series of images. In the process, the camera poses are deduced, and the generated scene point cloud is the key basis for scene initialization in three-dimensional Gaussian splatting training, thus influencing the final training result. Although deep neural network methods can improve the accuracy of camera pose prediction under sparse views, directly replacing the SfM step of three-dimensional Gaussian splatting with them inevitably harms the accuracy of scene initialization and blurs the reconstruction result.
2) Jointly iterating camera pose parameters and the three-dimensional Gaussian splatting scene can, in theory, provide a more accurate optimization path, but in practice the deep coupling between the two often makes the optimization process abnormally complex. This excessive coupling not only makes it difficult for the model to reach the global optimum, but can also cause training instability and similar problems, limiting further performance improvements. More importantly, incorporating the camera pose parameters into training means that every iteration must recompute and adjust them; the number of training iterations can increase significantly, the overall training time is prolonged, and both compute consumption and time efficiency are severely challenged.
In summary, in the technical field of novel view synthesis, camera pose information is an essential part of scene training. The prior art either presumes that camera pose information is known, or infers camera positions from dense views by applying Structure-from-Motion algorithms. This process carries a high time cost, chiefly because the global matching computation is time-consuming. Moreover, when facing view occlusion or highly repetitive texture in sparse scenes, SfM algorithms often struggle to accurately recover the camera pose, resulting in inaccurate positioning. In addition, although the point cloud preliminarily generated by SfM serves as the starting point of neural-field training for constructing the scene model, its quality is limited by the uncertainty of the above pose estimation.
Detailed Description
Various exemplary embodiments of the present invention will now be described in detail with reference to the accompanying drawings. It should be noted that: the relative arrangement of the components and steps, numerical expressions and numerical values set forth in these embodiments do not limit the scope of the present invention unless it is specifically stated otherwise.
The following description of at least one exemplary embodiment is merely exemplary in nature and is in no way intended to limit the invention, its application, or uses.
Techniques, methods, and apparatus known to one of ordinary skill in the relevant art may not be discussed in detail, but are intended to be part of the specification where appropriate.
In all examples shown and discussed herein, any specific values should be construed as merely illustrative, and not a limitation. Thus, other examples of exemplary embodiments may have different values.
It should be noted that: like reference numerals and letters denote like items in the following figures, and thus once an item is defined in one figure, no further discussion thereof is necessary in subsequent figures.
Three-dimensional Gaussian splatting is an important innovation in the field of novel view synthesis and has the potential to reshape the production pipeline of digital content. The invention solves the problem of scene training from sparse, pose-free views and can effectively expand the general capability and the range of hardware scenarios of novel view synthesis technology, thereby promoting the development of visual applications such as digital humans, autonomous-driving scene simulation, three-dimensional content generation, rapid UAV mapping, large-scale scene reconstruction, and three-dimensional language fields, and helping bring novel view synthesis applications to common mobile devices.
In general, referring to fig. 1, the provided three-dimensional Gaussian splatting optimization method for pose-free input mainly comprises four parts: camera density-ray prediction, three-dimensional Gaussian scene initialization (the three-dimensional Gaussian initial training scene), visual-hull redundancy elimination, and camera pose regularization.
Camera density-ray prediction is an improvement on camera ray prediction. The core idea of camera ray prediction is that neural network learning generally benefits from an over-parameterized, distributed representation; therefore, rather than using a compact camera representation such as the intrinsic and extrinsic matrices, the possible orientation of the camera is represented by a bundle of rays. Each ray originates at an image pixel, converges at the camera center, and is represented in Plücker coordinates. The neural network recovers the position and direction of the rays from a set of images as faithfully as possible, so that the camera intrinsics and extrinsics can be solved from the convergence position of the rays, for example by least-squares optimization. The camera density-ray prediction of the present invention augments the rays with density information to provide a prior for scene initialization, as described in detail later.
Three-dimensional Gaussian scene initialization: conventionally, the three-dimensional Gaussian points are initialized from the sparse scene point cloud produced by Structure-from-Motion. To obtain a more accurate pose estimation method, an end-to-end training process is formed in which the over-parameterized information of the camera density rays provides a prior for scene initialization, so that camera pose estimation and initialization of the three-dimensional Gaussian scene are integrated. The three-dimensional Gaussian points are sampled along the rays according to the density relationship, rather than by conventional uniform random sampling.
For visual-hull redundancy elimination, depth estimation information is obtained from the input multi-view pictures, the visual hull of the scene is constructed, and spurious initialization point-cloud positions outside the hull are then eliminated. During training of the three-dimensional Gaussian scene, the visual hull provides rough structural information about the scene objects and is used to reject unreasonable Gaussian point-cloud distributions, thereby suppressing the generation of floaters and maintaining scene consistency.
During three-dimensional Gaussian scene training, a regularization term on the camera pose parameters is added to the loss, and the camera pose parameters are adjusted to achieve a more accurate scene reconstruction.
In the invention, during initialization of the three-dimensional Gaussian scene, sparse pictures (images) without pose annotations are input, and their depth information and pose information are first estimated for joint optimization. Specifically, referring to fig. 2, the provided three-dimensional Gaussian splatting optimization method for pose-free input comprises the following steps:
Step S210: for an input image, predict the ray-bundle distribution using a ray prediction model to obtain the distributional characteristics of the ray bundle, including the moment of each ray, the direction of each ray, and the volume density of each ray.
Camera density-ray prediction is a proxy task for camera pose estimation that outputs a large number of ray distribution parameters. The plain camera-ray method only recovers the camera pose from multiple pictures; unlike a Structure-from-Motion algorithm, it does not yield an initialization point cloud. In the embodiment of the invention, the constructed ray representation is improved: the Plücker-coordinate ray features are extended to ray features with density. Based on the characteristics of volume rendering in three-dimensional Gaussian splatting, rays carrying volume-density information are predicted, providing a basis for sampling an initial three-dimensional Gaussian training scene from the ray bundle.
FIG. 3 shows the dense ray prediction model, built on a vision Transformer, in which the camera pose parameters λ are encoded as a bundle of rays $\{l_i\}$. Here $p_i$ denotes any point on the i-th ray and $d_i$ denotes its direction vector. In the prediction model, the spatial position of each ray is represented in Plücker coordinates. To enable initialization of the three-dimensional Gaussian scene, in one embodiment a key parameter, the ray density ρ, is introduced. Each ray is thus represented in an extended form $l_i = [d_i; m_i; \rho_i]$, where $d_i$ and $m_i$ are the direction and moment of the ray, with $m_i = p_i \times d_i$, and $\rho_i$ is the spatial volume-density information carried by the augmented ray. Specifically, the input image I is split into patches to form a sequence S, and the sequence together with the positional encoding Φ(·) is fed into the vision Transformer model V to predict the ray direction d, the moment m, and the density ρ of each patch, expressed as:

$$\hat{R} = \{l_i\}_{i=1}^{N} = \{[d_i; m_i; \rho_i]\}_{i=1}^{N} = V\big(\{y_i + \Phi(x_i)\}_{i=1}^{N}\big)$$
where $\hat{R}$ denotes the predicted ray-bundle distribution parameters, comprising the parameters of the whole set of rays; $x_i$ denotes a patch position, $y_i$ denotes the patch input features, $\Phi(x_i)$ denotes the positional encoding corresponding to the patch, and N denotes the number of patches. Hereinafter, $R$ is also used in place of $\hat{R}$ to refer to the ray-bundle distribution unless the context indicates otherwise.
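To make the density-augmented Plücker representation concrete, below is a minimal Python sketch that builds the extended rays $l_i = [d_i; m_i; \rho_i]$ for the patch centers of an image from assumed pinhole camera parameters. The function name, the pinhole assumptions, and the NumPy layout are illustrative only; in the invention these quantities are predicted by the vision Transformer rather than computed from known cameras.

```python
import numpy as np

def plucker_rays_with_density(K, R, t, patch_centers, density):
    """Build the per-patch ray representation l_i = [d_i; m_i; rho_i].

    K: (3,3) camera intrinsics; R, t: world-to-camera rotation/translation;
    patch_centers: (N,2) pixel coordinates of patch centers;
    density: (N,) volume density associated with each patch ray.
    Assumes a pinhole camera; only illustrates the representation itself.
    """
    cam_center = -R.T @ t                       # camera center in world coordinates
    ones = np.ones((patch_centers.shape[0], 1))
    pix_h = np.hstack([patch_centers, ones])    # homogeneous pixel coordinates (N,3)
    dirs_cam = (np.linalg.inv(K) @ pix_h.T).T   # back-project pixels to camera space
    dirs_world = (R.T @ dirs_cam.T).T           # rotate directions into world space
    d = dirs_world / np.linalg.norm(dirs_world, axis=1, keepdims=True)
    m = np.cross(cam_center[None, :], d)        # Plücker moment: m_i = p_i x d_i
    return np.hstack([d, m, density[:, None]])  # (N,7): [d_i; m_i; rho_i]
```

Because the moment is independent of which point on the ray is chosen, the camera center itself is used as $p_i$ in this sketch.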
In summary, the ray prediction model is used to obtain the distributional characteristics of the ray bundle, including the moment of each ray, its direction, and its volume density.
Step S220: calculate the camera pose based on the distributional characteristics of the ray bundle.
Camera pose estimation is based on the distribution of a large number of rays: the camera poses associated with the various views are computed by least-squares optimization. For example, for the ray bundle $\hat{R}$ predicted from an input picture I, the camera-ray prediction method solves the camera's intrinsic and extrinsic parameters λ, which include the specific pose parameters θ, by least squares.
Because three-dimensional Gaussian splatting typically takes multiple pictures from multiple views as input, the camera pose calculation considers the joint optimal solution over the ray bundles of pictures sharing the same view; thanks to the distributed ray representation, these bundles can be merged into a single bundle for the least-squares optimization.
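As one way to make the least-squares recovery concrete, the sketch below estimates the camera center as the point through which all predicted rays pass, by solving the stacked linear system $[d_i]_\times c = -m_i$ in the least-squares sense. The function names are assumptions for illustration; recovering the full intrinsics and rotation contained in λ would require additional steps not shown here.

```python
import numpy as np

def skew(v):
    """Skew-symmetric matrix [v]_x such that skew(v) @ u = v x u."""
    return np.array([[0, -v[2], v[1]],
                     [v[2], 0, -v[0]],
                     [-v[1], v[0], 0]])

def camera_center_from_rays(d, m):
    """Least-squares camera center from Plücker rays (d_i, m_i).

    Every ray passes through the camera center c, so m_i = c x d_i,
    i.e. [d_i]_x @ c = -m_i. Stacking all rays gives an overdetermined
    linear system solved in the least-squares sense.
    d: (N,3) unit directions, m: (N,3) moments.
    """
    A = np.concatenate([skew(di) for di in d], axis=0)  # (3N, 3)
    b = np.concatenate([-mi for mi in m], axis=0)       # (3N,)
    c, *_ = np.linalg.lstsq(A, b, rcond=None)
    return c                                            # estimated camera center
```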
In one embodiment, to further improve the accuracy of pose prediction and to cope with the prediction uncertainty caused by missing viewpoints in sparse views, the predicted ray directions and moments can be further refined. For example, by means of a diffusion model, the rays are perturbed with noise and then denoised, yielding richer ray-bundle distribution characteristics and therefore more accurate ray prediction under sparse views.
In summary, in step S220, based on the camera-ray method, the proxy task of ray prediction is used to recover the camera pose information. Considering how volume rendering is used in three-dimensional Gaussian scene modeling, the prediction task is extended with the key dimension of density, so that density information relevant to the initial scene is output alongside the camera pose. The density carried by the rays essentially represents the inverse of the volume-rendering process, i.e., scene structure is derived back from density. The Gaussian point cloud of the scene is initialized from this density information, providing the basis for training a high-precision three-dimensional Gaussian scene.
Step S230: for the ray-bundle distribution, sample based on the volume density of the rays to obtain an initial spatial distribution of the three-dimensional Gaussian point cloud concentrated in the visual center region.
In this step, under the density-aware camera ray prediction framework, the three-dimensional Gaussian scene is initialized, that is, the distribution of Gaussians in three-dimensional space is initialized by ray-density sampling.
The process begins with the proxy task: camera ray prediction yields a large set of ray parameters $\hat{R} = \{l_i\}$, where each ray $l_i$ is defined by a direction vector $d_i$ and a moment vector $m_i$ in Plücker coordinates. By resolving these ray parameters, the ray positions can be reconstructed in three-dimensional space, and the volume density ρ of the rays reveals the point-cloud distribution characteristics of the scene. This strategy exploits the ray-regression proxy task used by camera-ray pose estimation and captures prior information closely related to the camera pose by analyzing how the ray bundle is distributed in scene space.
Specifically, starting from the image pixels, samples are drawn along the ray paths $\{l_i\}_{i=1}^{M}$ that pass through the camera center (M is the number of ray paths) to serve as the starting point-cloud positions for three-dimensional Gaussian splatting scene reconstruction. Each ray $l_i$ maps from three-dimensional world space $\mathbb{R}^3$ to two-dimensional image space $\mathbb{R}^2$. By sampling the ray bundle, an initial point-cloud layout concentrated in the visual center region is obtained.
FIG. 4 is a schematic diagram of density-based ray sampling for constructing the initial spatial distribution of the three-dimensional Gaussian point cloud, which specifically comprises the following steps:
Step S41: map the ray parameters $l_i = (d_i, m_i)$ in Plücker coordinates back to actual positions in three-dimensional space, obtaining the distribution of rays $\{l_i\}$ in three-dimensional space.
Step S42: initialize a set of voxel grids $\{v_k\}$ in three-dimensional space and divide each ray into segments according to a set rule. Each voxel grid $v_k$ may contain several rays $\{l_i\}$, and each ray also carries its density distribution along its path, $\{\rho_i(z)\}_{z \in [0, L]}$, where k is the voxel-grid index, i is the index of a ray intersecting the voxel grid, z is the path parameter, L is the grid width, and $\{l_i\}$ denotes the bundle of rays intersecting the voxel grid.
Step S43: initialize a three-dimensional Gaussian point at the center of each voxel grid based on this information, with the size of the Gaussian point determined by integrating the densities of the rays inside the voxel, thereby forming the initial spatial distribution of the three-dimensional Gaussian point cloud. Concretely, a three-dimensional Gaussian point $g_k$ is initialized at the center of each voxel grid $v_k$ according to the ray-density information, and its size $s_k$ is determined from the integral of the ray densities within the voxel (e.g., $s_k \propto \sum_{i} \int_{0}^{L} \rho_i(z)\,dz$), yielding the initial spatial distribution of the three-dimensional Gaussian point cloud $\{g_k\}_{k=1}^{K}$, where K is the number of Gaussian kernels in the scene.
In the subsequent scene density-control training phase, these Gaussian points undergo further optimization: small Gaussian points $g_k$ are pruned, merged, and updated by gradient descent into larger Gaussian points, ultimately forming a finer three-dimensional scene representation. This process not only strengthens the geometric structure of the scene but also improves the accuracy of scene density prediction.
In summary, this method initializes the scene point cloud from the Transformer-predicted ray-density distribution and samples the initial three-dimensional Gaussian points by voxel-grid integration, forming the initial spatial distribution of the three-dimensional Gaussian point cloud. It reduces the number of iterations of the density-control algorithm in subsequent scene training and improves the accuracy of the rendering results.
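The sketch below is a hedged illustration of the density-weighted voxel-grid initialization of steps S41 to S43: ray samples are binned into a voxel grid, the per-voxel density integral is accumulated, and a Gaussian center with a density-derived size is produced for each occupied voxel. The names, the shared depth sampling, and the simple proportional size rule are assumptions for illustration rather than the exact procedure of the invention.

```python
import numpy as np

def init_gaussians_from_rays(origins, dirs, densities, z_vals,
                             grid_min, grid_max, grid_res):
    """Initialize Gaussian centers and sizes by integrating ray densities in a voxel grid.

    origins:   (M, 3) ray origins in world space
    dirs:      (M, 3) unit ray directions
    densities: (M, S) predicted volume density rho_i(z) at S samples per ray
    z_vals:    (S,)   sample depths shared by all rays
    grid_min, grid_max, grid_res: bounds and resolution of the voxel grid {v_k}
    Returns voxel-center positions g_k and density-integral sizes s_k.
    """
    dz = np.gradient(z_vals)                                   # segment lengths for the integral
    pts = origins[:, None, :] + z_vals[None, :, None] * dirs[:, None, :]  # (M, S, 3) samples
    norm = (pts - grid_min) / (grid_max - grid_min)            # normalize into [0, 1)
    valid = np.all((norm >= 0) & (norm < 1), axis=-1)          # keep samples inside the grid
    idx = np.clip((norm * grid_res).astype(int), 0, grid_res - 1)

    accum = np.zeros((grid_res,) * 3)
    contrib = densities * dz[None, :]                          # rho_i(z) * dz
    np.add.at(accum,
              (idx[..., 0][valid], idx[..., 1][valid], idx[..., 2][valid]),
              contrib[valid])                                  # per-voxel density integral

    occupied = np.argwhere(accum > 0)
    cell = (grid_max - grid_min) / grid_res
    centers = grid_min + (occupied + 0.5) * cell               # Gaussian centers at voxel centers
    sizes = accum[occupied[:, 0], occupied[:, 1], occupied[:, 2]]  # sizes from integrated density
    return centers, sizes
```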
Step S240: obtain a visual hull reflecting the structural information of the scene objects through view-frustum projection and object-mask computation.
To further improve the quality and robustness of three-dimensional scene training, in one embodiment a visual hull is introduced: a three-dimensional visual hull is constructed from two-dimensional depth and segmentation information to eliminate redundancy, and it incorporates prior knowledge of the structure of the objects in the input images. The visual hull is computed by view-frustum projection and object masking; compared with a sparse point cloud generated by sampling alone, it better preserves the structural consistency required for multi-view reconstruction and can reject invalid point-cloud data that would otherwise lead to unreasonable Gaussian distributions. FIG. 5 is a schematic diagram of the visual hull combined with ray-bundle density information to initialize the three-dimensional scene, forming a scene density grid suitable for initialization.
It is worth mentioning that the computational cost of the visual hull is relatively low: it is generated from a small number of key mask views using only a segmentation model, so it can easily be integrated into the initialization process, providing efficient and accurate starting conditions for three-dimensional scene reconstruction.
In one embodiment, during three-dimensional scene reconstruction, constructing the three-dimensional grid in combination with the visual hull comprises the following steps:
Step S51: extract depth information and segmentation masks for the two-dimensional pictures from a semantic segmentation model, obtaining $\{(I_i, D_i, M_i)\}_{i=1}^{N}$, where $I_i$ denotes the i-th input image, $D_i$ and $M_i$ are its depth map and segmentation mask, respectively, and N is the number of images.
Step S52: acquire the camera intrinsic matrix K and extrinsic matrix E of each picture to determine the transformation between the camera coordinate system and the world coordinate system.
Step S53: initialize the scene voxel grid $\{v_k\}$ and project it onto the two-dimensional image plane using the camera matrices K and E, obtaining the corresponding two-dimensional projection grid.
Step S54: filter out projections that do not belong to the reconstruction target by applying the object mask $M_i$ to the two-dimensional projection grid, eliminating non-target content and forming, in three-dimensional space, the voxel grid filtered by the visual hull. This process ensures the accuracy and effectiveness of the three-dimensional reconstruction while reducing the complexity of subsequent processing.
In sum, the visual-hull-based redundant point-cloud elimination technique effectively accelerates model training and markedly suppresses unreasonable visual elements such as floaters. The visual hull significantly improves the efficiency and precision of the three-dimensional scene initialization stage, reduces unnecessary computation, lays a solid foundation for the subsequent Gaussian point-cloud optimization and scene density control, and ensures the clarity and consistency of the reconstruction result.
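The following sketch illustrates the frustum-projection and mask-filtering idea of steps S51 to S54: voxel centers are projected into each view with the camera matrices K and E, and voxels whose projections fall outside the object mask are discarded. The function and the strict all-views intersection rule are assumptions for illustration; in practice a vote over a few key mask views may be used instead.

```python
import numpy as np

def visual_hull_filter(voxels, masks, Ks, Es):
    """Keep only voxels whose projections fall inside the object mask in every view.

    voxels: (V, 3) voxel-center world coordinates
    masks:  list of (H, W) binary object masks M_i
    Ks, Es: per-view intrinsic (3, 3) and extrinsic (3, 4) matrices
    """
    keep = np.ones(len(voxels), dtype=bool)
    hom = np.hstack([voxels, np.ones((len(voxels), 1))])        # (V, 4) homogeneous points
    for M, K, E in zip(masks, Ks, Es):
        cam = (E @ hom.T).T                                     # world -> camera coordinates
        uvw = (K @ cam.T).T                                     # camera -> image plane
        in_front = uvw[:, 2] > 1e-6
        uv = uvw[:, :2] / np.clip(uvw[:, 2:3], 1e-6, None)      # perspective divide
        u = np.round(uv[:, 0]).astype(int)
        v = np.round(uv[:, 1]).astype(int)
        h, w = M.shape
        inside = (u >= 0) & (u < w) & (v >= 0) & (v < h) & in_front
        hit = np.zeros(len(voxels), dtype=bool)
        hit[inside] = M[v[inside], u[inside]] > 0               # projection lands on the object
        keep &= hit
    return voxels[keep]                                         # voxels retained by the visual hull
```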
Step S250: with minimization of the set loss function as the optimization objective, perform three-dimensional Gaussian splatting scene training based on the initial spatial distribution of the three-dimensional Gaussian point cloud and the visual hull to obtain a three-dimensional scene reconstruction model, where the loss function contains a regularization term on the camera pose parameters.
In this step, three-dimensional scene training starts from the initial spatial distribution of the three-dimensional Gaussian point cloud and the visual hull and continues until the set loss criterion is met. During training, the loss function contains a regularization term on the camera pose parameters; by adjusting the pose parameters during three-dimensional Gaussian scene training, a more accurate scene reconstruction can be achieved.
Correction of the camera parameters becomes particularly important when dealing with scenes captured from multiple views, because of scale differences that may exist between views. In addition, illumination inconsistencies between views, or slight motion blur from handheld capture, degrade the final correction. Therefore, the invention optimizes the camera extrinsics and the three-dimensional model simultaneously during scene training. In this process, the camera parameters participate in the adjustment together with the Gaussian attributes to achieve finer control. In addition, the invention applies a regularization condition $\lambda \cdot \|E - E_0\|$ to the camera parameters, where $E_0$ is the camera extrinsic matrix of the initial pose predicted by the camera-ray method, which also serves as the global camera pose for scene training, E is the camera extrinsic matrix of the current view, and λ is the coefficient of the regularization term. This prevents the optimized camera pose from drifting excessively and improves alignment quality throughout three-dimensional Gaussian splatting scene training.
In one embodiment, the loss function is set to:
$$\mathcal{G}^{*}, \{E_j^{*}\} = \arg\min_{\mathcal{G},\, \{E_j\}} \sum_{j \in V} \left( \sum_{i=1}^{N} \left\| \mathcal{R}(\mathcal{G}, E_j) - I_i^{j} \right\| + \lambda \left\| E_j - E_j^{0} \right\| \right)$$

The loss function represents the joint optimization over the three-dimensional Gaussian scene and the camera pose parameters. Here $\mathcal{G}^{*}$ denotes the optimized three-dimensional Gaussian scene and $E_j^{*}$ the optimized camera extrinsic matrix at the current view j; $\mathcal{G}$ is the initial three-dimensional Gaussian scene to be optimized during training, and $E_j$ is the camera extrinsic matrix to be optimized at view j. $\mathcal{R}(\mathcal{G}, E_j)$ renders the three-dimensional Gaussian scene $\mathcal{G}$ into a 2D image under the pose parameters of view j. $I_i^{j}$ denotes the i-th real input image at view j, and N denotes the number of input images associated with view j. For the regularization term, λ is its coefficient, $E_j^{0}$ is the initial camera extrinsic matrix associated with view j, and V denotes the set of views of the input images.
During training, a regularization term on the camera pose is thus added, and the true camera pose is optimized through the neural network. Because the camera-ray method provides an initial value for the camera pose parameters, only a small amount of constraint and training cost is needed to make the camera pose estimate more accurate and robust, yielding a more accurate scene reconstruction from sparse views.
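A minimal PyTorch-style sketch of the training objective with the pose regularization term is given below. The rendering function, the L1 photometric term, and the parameter names are assumptions used for illustration; the actual rasterizer and loss weighting follow the three-dimensional Gaussian splatting training pipeline.

```python
import torch

def training_loss(render_fn, gaussians, cam_extrinsics, init_extrinsics, images, lam=0.01):
    """Photometric loss plus the pose regularization term lambda * ||E_j - E_j^0||.

    render_fn:       differentiable rasterizer, render_fn(gaussians, E_j) -> rendered image
    gaussians:       learnable three-dimensional Gaussian scene parameters
    cam_extrinsics:  dict {view j: learnable (3, 4) extrinsic tensor E_j}
    init_extrinsics: dict {view j: fixed (3, 4) tensor E_j^0 from camera-ray prediction}
    images:          dict {view j: list of ground-truth images captured at view j}
    """
    loss = 0.0
    for j, E_j in cam_extrinsics.items():
        rendered = render_fn(gaussians, E_j)                      # R(G, E_j)
        for gt in images[j]:
            loss = loss + torch.abs(rendered - gt).mean()         # photometric term (L1 here)
        loss = loss + lam * torch.norm(E_j - init_extrinsics[j])  # keep E_j close to E_j^0
    return loss
```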
After training is completed, a three-dimensional scene reconstruction model is obtained and can be applied to novel view synthesis; for pose-free input images, the model can synthesize new views from multiple viewpoints.
In summary, compared with the prior art, the invention has the following advantages:
1) For the emerging three-dimensional Gaussian splatting method in the field of novel view synthesis, the invention provides a scene initialization strategy oriented to sparse, pose-free input that efficiently achieves accurate camera pose estimation and joint scene initialization. By combining the advantages of the camera-ray method, the invention provides critical initialization scene information for three-dimensional Gaussian splatting training while ensuring the efficiency and accuracy of camera pose estimation relative to Structure-from-Motion, markedly improving the quality and detail richness of the final three-dimensional structure.
2) The invention provides an end-to-end implementation for three-dimensional Gaussian splatting with sparse, pose-free input. On the premise of guaranteeing scene synthesis quality and consistency, it greatly improves the efficiency of initializing the three-dimensional Gaussian splatting scene. Integrating the camera-ray method into the three-dimensional Gaussian splatting training process resolves the instability of camera pose prediction caused by insufficient information and repeated textures in sparse views. The estimation accuracy and efficiency of the camera-ray method under sparse views are better than those of current Structure-from-Motion.
3) The invention fully exploits the ray-prediction proxy task, adds the ray-density dimension, and thereby predicts the density information of the three-dimensional spatial structure. Camera pose prediction and three-dimensional Gaussian scene initialization are combined to initialize the three-dimensional Gaussian point cloud, ensuring the accuracy and rendering quality of the trained scene.
4) The invention uses a visual depth-segmentation model to acquire the depth information of the pictures and re-projects the two-dimensional scene into a three-dimensional visual hull, thereby suppressing redundant point clouds during training, reducing unreasonable floaters, and lightening the computational burden of training.
5) Combined with the camera-pose-optimizing training scheme, the inconsistency of camera poses across multi-view pictures is further corrected, improving the accuracy of camera pose estimation under sparse views. Meanwhile, the regularization term restrains excessive drift from the initial camera pose prediction, and updates can be terminated early to avoid excessive computational cost, so that the alignment quality of the whole three-dimensional Gaussian splatting training process is optimized with little computation.
6) Multiple simulation tests show that, whether the input images carry pose information or not, the method can synthesize the expected multi-view pictures, improving both computational efficiency and the quality of the synthesized images.
The present invention may be a system, method, and/or computer program product. The computer program product may include a computer readable storage medium having computer readable program instructions embodied thereon for causing a processor to implement aspects of the present invention.
The computer readable storage medium may be a tangible device that can hold and store instructions for use by an instruction execution device. The computer readable storage medium may be, for example, but not limited to, an electronic storage device, a magnetic storage device, an optical storage device, an electromagnetic storage device, a semiconductor storage device, or any suitable combination of the foregoing. More specific examples (a non-exhaustive list) of the computer-readable storage medium would include the following: portable computer disks, hard disks, random access memory (RAM), read-only memory (ROM), erasable programmable read-only memory (EPROM or flash memory), static random access memory (SRAM), portable compact disc read-only memory (CD-ROM), digital versatile discs (DVD), memory sticks, floppy disks, mechanical encoding devices such as punch cards or raised-in-groove structures having instructions stored thereon, and any suitable combination of the foregoing. Computer-readable storage media, as used herein, are not to be construed as transitory signals per se, such as radio waves or other freely propagating electromagnetic waves, electromagnetic waves propagating through waveguides or other transmission media (e.g., optical pulses through fiber optic cables), or electrical signals transmitted through wires.
The computer readable program instructions described herein may be downloaded from a computer readable storage medium to a respective computing/processing device or to an external computer or external storage device over a network, such as the internet, a local area network, a wide area network, and/or a wireless network. The network may include copper transmission cables, fiber optic transmissions, wireless transmissions, routers, firewalls, switches, gateway computers and/or edge servers. The network interface card or network interface in each computing/processing device receives computer readable program instructions from the network and forwards the computer readable program instructions for storage in a computer readable storage medium in the respective computing/processing device.
The computer program instructions for carrying out operations of the present invention may be assembly instructions, instruction set architecture (ISA) instructions, machine-related instructions, microcode, firmware instructions, state setting data, or source or object code written in any combination of one or more programming languages, including an object-oriented programming language such as Smalltalk, C++, or Python, and conventional procedural programming languages such as the "C" language or similar programming languages. The computer readable program instructions may be executed entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer, or entirely on the remote computer or server. In the case of a remote computer, the remote computer may be connected to the user's computer through any kind of network, including a local area network (LAN) or a wide area network (WAN), or may be connected to an external computer (for example, through the Internet using an Internet service provider). In some embodiments, aspects of the present invention are implemented by personalizing electronic circuitry, such as programmable logic circuitry, field programmable gate arrays (FPGAs), or programmable logic arrays (PLAs), with state information of the computer readable program instructions, which circuitry can execute the computer readable program instructions.
Various aspects of the present invention are described herein with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems) and computer program products according to embodiments of the invention. It will be understood that each block of the flowchart illustrations and/or block diagrams, and combinations of blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer-readable program instructions.
These computer readable program instructions may be provided to a processor of a general purpose computer, special purpose computer, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions/acts specified in the flowchart and/or block diagram block or blocks. These computer readable program instructions may also be stored in a computer readable storage medium that can direct a computer, programmable data processing apparatus, and/or other devices to function in a particular manner, such that the computer readable medium having the instructions stored therein includes an article of manufacture including instructions which implement the function/act specified in the flowchart and/or block diagram block or blocks.
The computer readable program instructions may also be loaded onto a computer, other programmable data processing apparatus, or other devices to cause a series of operational steps to be performed on the computer, other programmable apparatus or other devices to produce a computer implemented process such that the instructions which execute on the computer, other programmable apparatus or other devices implement the functions/acts specified in the flowchart and/or block diagram block or blocks.
The flowcharts and block diagrams in the figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods and computer program products according to various embodiments of the present invention. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of instructions, which comprises one or more executable instructions for implementing the specified logical function(s). In some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each block of the block diagrams and/or flowchart illustration, and combinations of blocks in the block diagrams and/or flowchart illustration, can be implemented by special purpose hardware-based systems which perform the specified functions or acts, or combinations of special purpose hardware and computer instructions. It is well known to those skilled in the art that implementation by hardware, implementation by software, and implementation by a combination of software and hardware are all equivalent.
The foregoing description of embodiments of the invention has been presented for purposes of illustration and description, and is not intended to be exhaustive or limited to the embodiments disclosed. Many modifications and variations will be apparent to those of ordinary skill in the art without departing from the scope and spirit of the various embodiments described. The terminology used herein was chosen in order to best explain the principles of the embodiments, the practical application, or the technical improvements in the marketplace, or to enable others of ordinary skill in the art to understand the embodiments disclosed herein. The scope of the invention is defined by the appended claims.