CN120182509A - Method, device, storage medium and equipment for indoor scene reconstruction - Google Patents

Method, device, storage medium and equipment for indoor scene reconstruction

Info

Publication number
CN120182509A
Authority
CN
China
Prior art keywords
point cloud
data
color
point
image
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202510653377.5A
Other languages
Chinese (zh)
Other versions
CN120182509B (en)
Inventor
赵吴凡
张帅
华彤延
洪忠铖
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hong Kong University Of Science And Technology Guangzhou
Original Assignee
Hong Kong University Of Science And Technology Guangzhou
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hong Kong University Of Science And Technology Guangzhou
Priority to CN202510653377.5A
Publication of CN120182509A
Application granted
Publication of CN120182509B
Legal status: Active
Anticipated expiration

Abstract


The present invention provides a method, an apparatus, a storage medium and a device for indoor scene reconstruction, comprising: obtaining laser radar point cloud information and a panoramic RGB image; preprocessing the panoramic RGB image to obtain a six-sided cube image; projecting the laser radar point cloud information onto the six-sided cube image in combination with the internal and external parameters of a panoramic camera, extracting the color information of each point in the point cloud information, and generating color point cloud data; generating an RGB-D data sequence according to the distance between each point in the color point cloud data and the panoramic camera; inputting the six-sided cube image into an object segmentation model to divide the indoor scene into a plurality of independent object areas; inputting each independent object area into a vision-language model to obtain a semantic label of each independent object; projecting and aligning each independent object containing the semantic label with the RGB-D sequence to obtain aligned point cloud data; and inputting the aligned point cloud data into a neural kernel surface reconstruction model to obtain a reconstructed indoor scene.

Description

Method, device, storage medium and equipment for reconstructing indoor scene
Technical Field
The present invention relates to the field of image processing, and in particular, to a method, an apparatus, a storage medium, and a device for indoor scene reconstruction.
Background
In recent years, under the cooperative driving of artificial intelligence, robotics and space intelligence, indoor perception and three-dimensional reconstruction technology gradually become hot fields of academic research and engineering practice. The rise of the multi-mode sensor data fusion technology provides a new solution for the perception and reconstruction of indoor scenes. Based on sensors such as lidar and cameras, conventional three-dimensional reconstruction techniques typically generate the geometry of a scene through the processing of point cloud data or images.
However, as the application scenarios of three-dimensional reconstruction technology continue to expand, existing three-dimensional reconstruction techniques face several problems: first, point cloud information carrying only geometric information is difficult to meet the requirements of high-level applications such as intelligent robot navigation and task planning; second, the processing and computation of large-scale point cloud data are highly complex and insufficient for real-time use; and finally, the use of a single sensor limits the detail expression of the scene and the diversity of applicable scenes.
Disclosure of Invention
In view of this, the present invention provides a method, a device, a storage medium and equipment for indoor scene reconstruction, in which point cloud data and a panoramic RGB image are fused to construct image information carrying both depth information and color information; based on the fused image information, an object segmentation model and a vision-language model are used to semantically annotate the objects in the scene, so that an indoor scene containing semantic information is constructed, which can provide accurate and rich data support for three-dimensional modeling or path planning scenarios.
In a first aspect, the present invention provides a method for indoor scene reconstruction, including:
acquiring laser radar point cloud information and a panoramic RGB image;
preprocessing the panoramic RGB image to obtain a hexahedral cube image;
Projecting the laser radar point cloud information to a hexahedral cube image by combining internal parameters and external parameters of a panoramic camera, extracting color information of each point in the point cloud information, and generating color point cloud data;
Generating an RGB-D data sequence according to the distance between each point in the color point cloud data and the panoramic camera;
inputting the hexahedral cube images into an object segmentation model, and dividing an indoor scene into a plurality of independent object areas;
inputting each independent object area into a vision-language model to obtain semantic tags of each independent object;
Performing projection alignment on each independent object containing the semantic tag and the RGB-D sequence to obtain aligned point cloud data;
And inputting the aligned point cloud data into a neural kernel surface reconstruction model to obtain a reconstructed indoor scene.
Further, the method for reconstructing the indoor scene further comprises the following steps:
searching corresponding independent object results in the reconstructed indoor scene according to the received user instruction;
The searched independent object result is sent to a user side;
If the independent object result fed back by the user side is inconsistent with the user instruction, adding a supplementary semantic tag to the independent object area indicated by the user instruction.
Further, the object segmentation model is an instance segmentation model.
Further, the visual-language model is a contrast language-image pre-training model or Grounding DINO model.
Further, the preprocessing is performed on the panoramic RGB image to obtain a hexahedral cube image, which specifically includes:
the panoramic RGB image is converted into a hexahedral cube image by adopting an equidistant columnar projection mode.
Further, the step of projecting the laser radar point cloud information to a hexahedral cube image by combining the internal parameters and the external parameters of the panoramic camera, extracting color information of each point in the point cloud information, and generating color point cloud data includes the following steps:
the following steps are performed for any one target point in the laser point cloud information:
Step S201, correcting the coordinates of the target point according to the external parameters of the panoramic camera to obtain corrected target point coordinates;
step S202, rotating the corrected target point coordinates according to the internal parameters of the panoramic camera to obtain target point mapping coordinates;
Step S203, combining the width and the height of the panoramic RGB image to obtain the projection point coordinates of the target point on the hexahedral cube;
Step S204, the color data of the projection point is recorded as the color data of the target point;
Step S205, repeating steps S201-S204 until each target point in the laser point cloud information determines color data, and generating color point cloud data.
Further, the generating the RGB-D data sequence according to the distance between each point in the color point cloud data and the panoramic camera includes the following steps:
Generating a depth map according to the distance between each point in the color point cloud data and the panoramic camera;
And combining the depth map with the color point cloud data to obtain an RGB-D data sequence.
Further, the combining the depth map with color point cloud data to obtain an RGB-D data sequence further includes:
And for each target point of the color point cloud data, if the depth value of the target point is smaller than the historical depth value, updating the color characteristics of the target point.
The invention also provides a device for reconstructing an indoor scene, which comprises:
the image acquisition module is used for acquiring laser radar point cloud information and panoramic RGB images;
the panoramic hexahedral module is used for preprocessing the panoramic RGB image to obtain a hexahedral cube image;
the point cloud data color extraction module is used for projecting the laser radar point cloud information to a hexahedral cube image in combination with the internal parameters and the external parameters of the panoramic camera, extracting the color information of each point in the point cloud information and generating color point cloud data;
the depth parameter combination module is used for generating an RGB-D data sequence according to the distance between each point in the color point cloud data and the panoramic camera;
The object segmentation module is used for inputting the hexahedral cube images into an object segmentation model and dividing an indoor scene into a plurality of independent object areas;
the semantic annotation module is used for inputting each independent object region into the vision-language model to obtain semantic tags of each independent object;
The data fusion module is used for carrying out projection alignment on each independent object containing the semantic tag and the RGB-D sequence to obtain aligned point cloud data;
And the scene reconstruction module is used for inputting the aligned point cloud data into the neural kernel surface reconstruction model to obtain a reconstructed indoor scene.
In a third aspect, the present invention also provides a computer readable storage medium having stored thereon a computer program which, when executed by a processor, implements the steps of the method of any one of the first aspects of indoor scene reconstruction.
In a fourth aspect, the present invention also provides a computer device comprising a memory storing a computer program and a processor, which when executing the computer program, performs the method of any one of the indoor scene reconstruction of the first aspect.
The technical scheme has the advantages that the laser radar point cloud information and the panoramic RGB image are fused, accurate alignment of the handheld laser radar data and the panoramic camera data is achieved, and an open-vocabulary three-dimensional scene graph is built through the algorithm, so that unseen objects can be identified and semantically rich labels can be generated, breaking through the limitation of predefined categories and laying a technical foundation for open semantic segmentation. In the aspect of scene reconstruction, a neural kernel surface reconstruction model is adopted to reconstruct the point cloud of each object instance with high precision in both geometry and texture, so that the complex structure and details of the scene are effectively captured while color and texture information is faithfully restored. By means of object-level instance segmentation and scene graph construction with real-time updating, the method and the device can adapt to changes of a dynamic scene, such as the movement, addition or removal of objects, and the understanding and modeling capability of the dynamic environment is remarkably improved.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below.
FIG. 1 is a schematic diagram of a method for indoor scene reconstruction according to an embodiment of the present application;
FIG. 2 is a schematic diagram of data acquisition in an indoor scene reconstruction method according to an embodiment of the present application;
FIG. 3 is a schematic diagram of combining panoramic RGB image with lidar point cloud information in an embodiment of the present application;
FIG. 4 is a schematic flow chart of a portion of a scene reconstruction in a method for reconstructing an indoor scene according to an embodiment of the present application;
FIG. 5 is a schematic diagram of a semantic tag labeling flow in a method for indoor scene reconstruction according to an embodiment of the present application;
FIG. 6 is a schematic diagram of object segmentation in a method for indoor scene reconstruction according to an embodiment of the present application;
fig. 7 is a schematic diagram of an apparatus for indoor scene reconstruction according to an embodiment of the present application.
Detailed Description
The following description of the embodiments of the present invention will be made clearly and completely with reference to the accompanying drawings, in which it is apparent that the embodiments described are only some embodiments of the present invention, but not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention. In order to more specifically describe the present invention, the method, apparatus, storage medium and device for indoor scene reconstruction provided by the present invention are specifically described below with reference to the accompanying drawings.
Unless defined otherwise, technical or scientific terms used in the present disclosure should be given the ordinary meaning as understood by one of ordinary skill in the art to which the present application belongs. The terms "first," "second," and the like, as used herein, do not denote any order, quantity, or importance, but rather are used to distinguish one element from another. Likewise, the terms "a," "an," or "the" and similar terms do not denote a limitation of quantity, but rather denote the presence of at least one. The word "comprising" or "comprises", and the like, means that the element or article preceding the word is meant to encompass the element or article listed thereafter and equivalents thereof without excluding other elements or articles. The terms "connected" or "connected," and the like, are not limited to physical or mechanical connections, but may include electrical connections, whether direct or indirect. "upper", "lower", "left", "right", etc. are used merely to indicate relative positional relationships, which can be changed accordingly when the absolute position of the object to be described is changed.
In recent years, the rise of the multi-mode sensor data fusion technology provides a new solution for the perception and reconstruction of indoor scenes. By combining high-precision point cloud data of the laser radar with rich visual information of the camera, researchers can introduce semantic-level understanding while retaining geometric information. However, how to efficiently fuse multi-modal data and achieve end-to-end high precision and efficiency in scene segmentation, semantic annotation, and three-dimensional reconstruction is still a challenging research topic.
Deep learning semantic segmentation technologies based on laser radar (LiDAR) point clouds, such as the RandLA-Net model (a point cloud semantic segmentation model based on random sampling), the KPConv model (Kernel Point Convolution, a kernel point convolution model) and the SparseConvNet model (Submanifold Sparse Convolutional Network, a sparse convolution model), realize object recognition and semantic segmentation in a scene by modeling the geometric characteristics of point cloud data. These methods perform well on sparse point clouds, and can capture the local geometric characteristics of the point cloud and generate three-dimensional semantic information. However, they have significant limitations: they depend on geometric characteristics and lack semantic-level understanding, cannot segment or identify new objects of unknown types or in complex scenes, have difficulty handling object changes in dynamic scenes, and cannot update semantic information in real time. In addition, because the laser radar point cloud data volume is huge, directly processing large-scale point cloud data consumes considerable computing resources and memory, making it difficult to meet the requirements of real-time applications. Segmentation methods based on point cloud data alone are therefore limited in open-scene and multi-modal tasks.
Semantic segmentation technologies based on RGB images (such as Mask R-CNN, DeepLab and Vision Transformer) rely on convolutional neural networks or visual Transformers to achieve excellent results in image feature extraction and instance segmentation, so that various object types can be identified and high-resolution segmentation results can be generated. However, such methods depend only on two-dimensional images, have limited capability for modeling three-dimensional geometric information, and have difficulty handling the spatial structure and depth information in point cloud data. In addition, these technologies are heavily dependent on annotated data, cannot identify new categories outside a closed-set vocabulary, and have weak generalization capability; they lack the ability to dynamically update semantic information in dynamic scenes, cannot adapt to scene changes in real time, and therefore struggle to support complex scene planning tasks, such as robot navigation and grasping tasks that require spatial semantic relations.
Furthermore, the prior art also includes multi-modal semantic segmentation technologies (such as BEVFusion, FusionMLP and MaskFusion), which realize semantic segmentation and understanding of three-dimensional scenes by fusing the advantages of laser radar point cloud information and RGB image data. By utilizing the semantic information in the image data and the geometric information of the laser radar point cloud data, the scene can be described more comprehensively and the precision of semantic segmentation is remarkably improved. However, multi-modal data fusion increases the computational complexity and places higher requirements on hardware resources, which limits the application of multi-modal semantic segmentation technology in real-time tasks.
Based on the consideration of the prior art, the invention provides a method for reconstructing an indoor scene, which obtains an RGB-D image by fusing point cloud data and a panoramic RGB image, and performs object segmentation and semantic annotation by combining an instance segmentation model and a visual-language model to obtain the indoor scene with a semantic tag.
The embodiment of the application provides an application scene of a method for reconstructing an indoor scene, which comprises terminal equipment provided by the embodiment, wherein the terminal equipment comprises, but is not limited to, a smart phone and computer equipment, and the computer equipment can be at least one of a desktop computer, a portable computer, a laptop computer, a mainframe computer, a tablet computer and the like. The terminal device receives indoor point cloud data sent by the laser radar and panoramic RGB images sent by the panoramic camera, and constructs a three-dimensional scene containing semantic information, and referring to a schematic diagram of an indoor scene reconstruction method shown in fig. 1, for specific processes, please refer to an embodiment of the indoor scene reconstruction method.
Step S101, laser radar point cloud information and a panoramic RGB image are acquired.
The laser radar point cloud data refer to a vector set of an indoor scene in a three-dimensional coordinate system, which is acquired by using a laser radar, and is used for providing three-dimensional geometric structure information of the indoor scene, in combination with a data acquisition schematic diagram in the indoor scene reconstruction method shown in fig. 2. Panoramic RGB image refers to a 360-degree panoramic RGB image of an indoor scene acquired by a panoramic camera, the RGB image is an image which generates various colors based on different intensity combinations of three basic colors of Red (Red), green (Green) and Blue (Blue), each pixel point in the RGB image is usually composed of three components, the brightness values of the three colors of Red, green and Blue are respectively represented, and the value range of each component in the RGB image is 0-255.
In this embodiment, as shown in fig. 2, laser radar point cloud information is acquired by a handheld laser radar to provide the three-dimensional geometric structure information of the indoor scene, and 360-degree panoramic RGB images of the indoor scene are acquired by a panoramic camera to supplement semantic information and texture details. In this embodiment, the handheld laser radar may be a smart L1 laser radar, and the panoramic camera may be an Insta360 panoramic camera. In this embodiment, the handheld laser radar and the panoramic camera are designed as an integrated unit, so that the relative pose between them remains fixed during data acquisition.
Step S102, preprocessing the panoramic RGB image to obtain a hexahedral cube image.
Specifically, considering that the panoramic RGB image is severely stretched and distorted in the polar regions (such as the top and bottom of the panoramic RGB image) due to its projection mode, which blurs or deforms image details and is unfavorable for the subsequent indoor scene reconstruction process, the panoramic RGB image needs to be converted into a hexahedral cube image, which is closer to a conventional planar image and exhibits little distortion. It should be noted that, in this embodiment, the panoramic RGB image may be converted into the hexahedral cube image by using an equidistant cylindrical (equirectangular) projection. Specifically, each pixel of the panoramic RGB image is described by spherical coordinates (θ, φ), where θ represents the longitude of the pixel on the sphere and φ represents its latitude; each such pixel is converted into a pixel on the corresponding face of the hexahedral cube image according to its viewing direction.
The equidistant columnar projection refers to projecting each part of the stereoscopic image on a plane in an equidistant mode, so that the size of the image on the projection plane corresponds to the size of the stereoscopic image one by one. The purpose of equidistant columnar projection is to show a larger stereoscopic image on a smaller projection plane while keeping the shape and scale of the image unchanged. Each surface of the hexahedral cube image after equidistant columnar projection is closer to the perspective of a conventional plane image, and the middle area has almost no distortion, so that the method is more beneficial to the subsequent indoor scene object perception.
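To make the conversion concrete, the following Python sketch samples one cube face from an equirectangular panorama using the standard longitude/latitude lookup. It is a minimal illustration rather than the patented implementation; the face naming, face orientation conventions and output resolution are assumptions.

```python
import numpy as np

def cube_face_from_equirect(pano, face, size=512):
    """Sample one face of a cube map from an equirectangular panorama.

    pano: H x W x 3 uint8 array (equirectangular RGB panorama).
    face: one of "front", "right", "back", "left", "up", "down" (assumed naming).
    """
    H, W = pano.shape[:2]
    # Pixel grid of the target face in [-1, 1]; image y axis points down.
    a = np.linspace(-1.0, 1.0, size)
    x, y = np.meshgrid(a, -a)

    # Viewing direction for every face pixel (unit cube, normalised later).
    if face == "front":
        dirs = np.stack([x, y, np.ones_like(x)], axis=-1)
    elif face == "right":
        dirs = np.stack([np.ones_like(x), y, -x], axis=-1)
    elif face == "back":
        dirs = np.stack([-x, y, -np.ones_like(x)], axis=-1)
    elif face == "left":
        dirs = np.stack([-np.ones_like(x), y, x], axis=-1)
    elif face == "up":
        dirs = np.stack([x, np.ones_like(x), -y], axis=-1)
    else:  # "down"
        dirs = np.stack([x, -np.ones_like(x), y], axis=-1)

    dx, dy, dz = dirs[..., 0], dirs[..., 1], dirs[..., 2]
    lon = np.arctan2(dx, dz)                                  # longitude in [-pi, pi]
    lat = np.arcsin(dy / np.linalg.norm(dirs, axis=-1))       # latitude in [-pi/2, pi/2]

    # Standard equirectangular lookup: longitude -> column, latitude -> row.
    u = ((lon + np.pi) / (2 * np.pi) * (W - 1)).astype(int)
    v = ((np.pi / 2 - lat) / np.pi * (H - 1)).astype(int)
    return pano[v, u]
```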
Step S103, combining the internal parameters and the external parameters of the panoramic camera, projecting the laser radar point cloud information to a hexahedral cube image, extracting the color information of each point in the point cloud information, and generating color point cloud data.
Specifically, in combination with the schematic diagram of combining the panoramic RGB image and the laser radar point cloud information shown in fig. 3, in this embodiment the panoramic camera and the laser radar are integrally arranged; the internal parameters of the panoramic camera include its focal length and principal point coordinates, and the external parameters of the panoramic camera are the relative pose between the laser radar and the panoramic camera. Using these parameters, the point cloud data are accurately projected onto the hexahedral cube image, the color information of each point in the point cloud information is extracted, and color point cloud data are generated. The color point cloud data include the color information of the point cloud data in addition to its three-dimensional structure information.
The embodiment projects laser radar point cloud information to a hexahedral cube image, extracts color information of each point in the point cloud information, and generates color point cloud data by the following method:
The following operations are performed for any one target point of the laser point cloud information:
step S201, correcting the coordinates of the target point according to the external parameters of the panoramic camera to obtain corrected target point coordinates.
Specifically, the coordinates of the target point are corrected according to the external parameters of the panoramic camera as P_c = T · P, where P_c denotes the corrected target point coordinates, P denotes the coordinates of the target point, and T denotes the external parameters of the panoramic camera, representing the relative pose between the laser radar and the panoramic camera.
Step S202, rotating the corrected target point coordinates according to the internal parameters of the panoramic camera to obtain target point mapping coordinates.
Specifically, the corrected target point coordinates are rotated according to the internal parameters of the panoramic camera as p = K · P_c, where p denotes the target point mapping coordinates and K denotes the intrinsic matrix of the panoramic camera, in which f_x is the focal length of the panoramic camera in the horizontal direction, f_y is the focal length in the vertical direction, c_x is the abscissa of the image principal point, and c_y is the ordinate of the image principal point.
Step S203, combining the width and the height of the panoramic RGB image to obtain the projection point coordinates of the target point on the hexahedral cube.
Specifically, the projection point coordinates of the target point on the hexahedral cube are obtained by combining the width and the height of the panoramic RGB image as u = W · (θ + π) / (2π) and v = H · (π/2 − φ) / π, where (u, v) are the coordinates of the projection point on the hexahedral cube, W is the width of the panoramic RGB image, H is the height of the panoramic RGB image, and (θ, φ) are the spherical coordinates of the panoramic RGB image corresponding to the target point mapping coordinates.
Step S204, the color data of the projection point is recorded as the color data of the target point.
Specifically, the color data of the projection point is recorded as the color data of the target point, i.e. C(P) = I(u, v), where C(P) denotes the color data of the target point and I(u, v) denotes the color data of the projection point.
Step S205, repeating steps S201-S204 until each target point in the laser point cloud information determines color data, and generating color point cloud data.
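A compact sketch of steps S201-S204 is given below. It assumes the extrinsics are supplied as a 4x4 homogeneous matrix and that colors are read from the equirectangular panorama via the spherical lookup described above; the exact formulation used in the patent may differ.

```python
import numpy as np

def colorize_point_cloud(points, pano, T_ext):
    """Assign a color to each lidar point by projecting it into the panorama.

    points: N x 3 lidar coordinates.
    pano:   H x W x 3 panoramic RGB image.
    T_ext:  4 x 4 extrinsic matrix (relative pose between lidar and camera).
    Returns an N x 6 array of (x, y, z, r, g, b).
    """
    H, W = pano.shape[:2]

    # Step S201: correct the point coordinates with the camera extrinsics.
    homo = np.hstack([points, np.ones((len(points), 1))])
    cam = (T_ext @ homo.T).T[:, :3]

    # Steps S202/S203: map each corrected point to panorama pixel coordinates
    # through its spherical angles (longitude/latitude).
    r = np.linalg.norm(cam, axis=1) + 1e-9
    lon = np.arctan2(cam[:, 0], cam[:, 2])
    lat = np.arcsin(cam[:, 1] / r)
    u = ((lon + np.pi) / (2 * np.pi) * (W - 1)).astype(int)
    v = ((np.pi / 2 - lat) / np.pi * (H - 1)).astype(int)

    # Step S204: record the projected pixel's color as the point's color.
    colors = pano[v, u]
    return np.hstack([points, colors])
```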
Step S104, according to the distance between each point in the color point cloud data and the panoramic camera, an RGB-D data sequence is generated.
Specifically, according to the distance between each point in the color point cloud data and the panoramic camera, the generation of the RGB-D data sequence comprises the following steps:
Step S301, generating a depth map according to the distance between each point in the color point cloud data and the panoramic camera;
step S302, combining the depth map with color point cloud data to obtain an RGB-D data sequence.
The step S302 of combining the depth map with the color point cloud data to obtain the RGB-D data sequence includes the following specific steps:
For any depth image pixel coordinate (u, v) in the depth map, the pixel is back-projected into spatial coordinates using the panoramic camera parameters; the nearest neighbor point of these spatial coordinates is found in the color point cloud data, and the RGB value of that nearest neighbor point is matched to the depth image pixel coordinate (u, v). Thereby, a four-tuple (R, G, B, D) can be generated by combining the pixel coordinates of the depth image with the RGB values, where (R, G, B) is the RGB value of the nearest neighbor point and D is the distance value of the depth image pixel. When every depth image pixel coordinate in the depth map has been assigned the RGB value of its nearest neighbor point in the color point cloud data, the corresponding four-tuples are generated; the four-tuples of all depth image pixels in the depth map then form a four-channel image or sequence array, which is sorted by frame number to obtain the RGB-D data sequence.
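The following sketch illustrates one plausible way to build such a four-channel RGB-D frame, using a KD-tree for the nearest-neighbor search. The back-projection function is a placeholder standing in for the panoramic camera model, which the patent does not spell out here.

```python
import numpy as np
from scipy.spatial import cKDTree

def rgbd_from_depth_and_points(depth, colored_points, back_project):
    """Build an H x W x 4 RGB-D frame from a depth map and a colored point cloud.

    depth:          H x W array of distances to the panoramic camera.
    colored_points: N x 6 array (x, y, z, r, g, b) of colored point cloud data.
    back_project:   callable (u, v, d) -> 3D point, using the camera parameters.
    """
    H, W = depth.shape
    tree = cKDTree(colored_points[:, :3])          # nearest-neighbor search structure
    rgbd = np.zeros((H, W, 4), dtype=np.float32)

    for v in range(H):
        for u in range(W):
            d = depth[v, u]
            if d <= 0:                              # no point projected here
                continue
            p = back_project(u, v, d)               # pixel -> spatial coordinates
            _, idx = tree.query(p)                  # nearest point in the colored cloud
            rgbd[v, u, :3] = colored_points[idx, 3:6]   # its RGB value
            rgbd[v, u, 3] = d                           # depth value of the pixel
    return rgbd
```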
It should be noted that, when generating the depth map according to the distance between the color point cloud data and the panoramic camera, the sparse point cloud lacks sufficient density and resolution, which causes erroneous projections: the problem of "missing projection" (leak) caused by the sparsity of the point cloud map, the problem of false projection of occluded points, and the lack of point cloud support for fully occluded object surfaces, so that the occlusion relationship cannot be determined accurately. To address this, a depth buffer (Depth Buffer) is used to compare the depth value of the current point with the value stored in the depth buffer, and the color or feature of the pixel is updated only if the depth of the current point is smaller than the recorded depth value. For the occlusion problem, a 5×5 minimum filter is used for screening, which handles errors or noise introduced by projecting the sparse point cloud onto the image frame, thereby removing outliers or wrongly projected points.
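A minimal sketch of the depth-buffer test and the 5×5 minimum filter is shown below, assuming the points have already been projected to pixel coordinates; it illustrates the mechanism only and is not the patented implementation.

```python
import numpy as np
from scipy.ndimage import minimum_filter

def project_with_depth_buffer(uv, depths, colors, shape):
    """Z-buffered splatting of colored points followed by a 5x5 minimum filter.

    uv:     N x 2 integer pixel coordinates of the projected points.
    depths: N distances of the points to the panoramic camera.
    colors: N x 3 RGB values of the points.
    shape:  (H, W) of the target image.
    """
    H, W = shape
    depth_buf = np.full((H, W), np.inf, dtype=np.float32)
    color_buf = np.zeros((H, W, 3), dtype=np.uint8)

    for (u, v), d, c in zip(uv, depths, colors):
        if 0 <= u < W and 0 <= v < H and d < depth_buf[v, u]:
            # Update only when the point is closer than the recorded depth,
            # so occluded points do not overwrite visible surfaces.
            depth_buf[v, u] = d
            color_buf[v, u] = c

    # 5x5 minimum filter over the depth buffer to suppress leaked background
    # depths and outlier projections caused by point-cloud sparsity.
    filtered = minimum_filter(depth_buf, size=5)
    filtered[np.isinf(filtered)] = 0.0              # pixels never hit by any point
    return filtered, color_buf
```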
The data processing process not only realizes the accurate alignment of the point cloud information and the image data, but also lays a foundation for the subsequent semantic segmentation and three-dimensional reconstruction.
Step S105, inputting the hexahedral cube image into an object segmentation model, and dividing the indoor scene into a plurality of independent object regions.
Specifically, the object segmentation model of the present embodiment may be an instance segmentation model (Segment Anything Model, SAM model for short), which divides the indoor scene into a plurality of independent object regions by generating binary semantic masks for the hexahedral cube image. It should be noted that the SAM model can segment objects in any image through various interactive cues (such as points, boxes, text or masks) without fine-tuning for a specific task, thereby realizing object segmentation.
Further, the binarized semantic mask is projected to a three-dimensional point cloud space through internal parameters and external parameters of the camera, wherein the internal parameters are used for calculating projection coordinates, and the external parameters are used for describing the pose relation between the laser radar and the panoramic camera, so that the generation of the multi-mode data and object-level instance point cloud is realized.
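As an illustration of this segmentation step, the sketch below uses the automatic mask generator from the publicly released segment-anything package; the checkpoint path and model variant are placeholders, and the patent does not mandate this particular API.

```python
from segment_anything import sam_model_registry, SamAutomaticMaskGenerator

def segment_cube_face(image_rgb, checkpoint="sam_vit_h.pth"):
    """Split one cube-face image into independent object regions with SAM.

    image_rgb:  H x W x 3 uint8 RGB image (one face of the hexahedral cube image).
    checkpoint: path to a downloaded SAM checkpoint (placeholder name).
    Returns a list of binary masks, one per independent object region.
    """
    sam = sam_model_registry["vit_h"](checkpoint=checkpoint)
    mask_generator = SamAutomaticMaskGenerator(sam)
    results = mask_generator.generate(image_rgb)
    # Each result carries a boolean "segmentation" mask plus area/score metadata.
    return [r["segmentation"] for r in results]
```

The returned masks are the binary semantic masks that are subsequently projected into the three-dimensional point cloud space using the camera parameters.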
And S106, inputting each independent object area into a vision-language model to obtain semantic tags of each independent object.
Specifically, in this embodiment, in combination with the object segmentation schematic diagram of the indoor scene reconstruction method shown in fig. 6, the vision-language model is a contrastive language-image pre-training model (Contrastive Language-Image Pre-training, CLIP model for short), which performs feature extraction on the RGB image region of each independent object area, thereby generating open-vocabulary semantic tags.
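The labeling step can be illustrated with the publicly released CLIP package as follows; the candidate label list and model variant are assumptions, since open-vocabulary labeling in practice would score each region against a larger prompt set.

```python
import torch
import clip

def label_region(region_image, candidate_labels, device="cuda"):
    """Assign an open-vocabulary label to one cropped object region with CLIP.

    region_image:      PIL.Image of the independent object region.
    candidate_labels:  list of free-form text labels to score against.
    """
    model, preprocess = clip.load("ViT-B/32", device=device)
    image = preprocess(region_image).unsqueeze(0).to(device)
    text = clip.tokenize(candidate_labels).to(device)

    with torch.no_grad():
        logits_per_image, _ = model(image, text)           # image-text similarity
        probs = logits_per_image.softmax(dim=-1).squeeze(0)

    best = int(probs.argmax())
    return candidate_labels[best], float(probs[best])        # label and its score
```

For example, label_region(crop, ["a chair", "a table", "a red round object"]) returns the best-matching label together with its softmax score.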
Further, before step S106, a nearest point priority policy may be further set to perform an operation of eliminating projection errors on each independent object area, so as to ensure geometric accuracy of the object point cloud.
Specifically, for each semantic instance, a corresponding three-dimensional point cloud subset is extracted based on the projection relation or the preliminary segmentation result and used as the input region for subsequent processing. Considering the various depth deviations that image back-projection or multi-view synthesis may introduce, an error estimate between the observed distance and the ideal position of each point is established. For candidate points falling in the same pixel region or spatial neighborhood, a nearest-point priority strategy is adopted: only the point with the minimum distance to the panoramic camera is retained as the valid observation. By excluding redundant projection points that are farther away or occluded, geometric artifacts and ghosting caused by multi-view re-projection can be effectively reduced. The screened point cloud is used as a high-precision geometric representation and is input into the vision-language model for semantic label prediction, which improves recognition precision and robustness, as illustrated in the sketch below.
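A minimal sketch of the nearest-point priority strategy, assuming the candidate points of an instance have already been grouped by the pixel (or spatial cell) they project into:

```python
import numpy as np

def filter_instance_points(points, pixel_ids, cam_center):
    """Nearest-point-priority filtering of an instance's candidate points.

    points:     M x 3 candidate 3D points projected into the instance's mask.
    pixel_ids:  M integer ids of the pixel (or spatial cell) each point falls in.
    cam_center: 3-vector, position of the panoramic camera.
    Keeps, for every pixel id, only the point closest to the camera.
    """
    dist = np.linalg.norm(points - cam_center, axis=1)
    keep = {}
    for i, (pid, d) in enumerate(zip(pixel_ids, dist)):
        if pid not in keep or d < dist[keep[pid]]:
            keep[pid] = i            # retain the nearest observation only
    return points[sorted(keep.values())]
```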
Furthermore, considering situations where semantic objects are complex, objects are strongly combined, and a single independent semantic tag is not descriptive enough and may be ambiguous, in this embodiment the open-vocabulary semantic tag can label the category of the object (such as a chair or a table) and can also be supplemented with an attribute description (such as a red round object), providing multiple candidate labels for a complex object. For the supplementary attribute description, the specific procedure is as follows:
step S1061, searching for a corresponding independent object result in the reconstructed indoor scene according to the received user instruction.
Step S1062, the searched independent object result is sent to the user side.
Step S1063, if the result of the independent object searched by the user side is inconsistent with the user instruction, adding a supplementary semantic tag to the independent object area indicated by the user instruction. The user instruction is an object to be searched by a user adopting natural language description, a specific object description can be extracted from the user instruction through a natural language recognition model, and corresponding independent object results can be searched by traversing semantic tags of each independent object in the reconstructed indoor scene according to the object description.
The supplementary semantic tags include at least one of a color attribute tag, a shape attribute tag and a position attribute tag. The color attribute tag indicates the color attribute of the independent object, such as red, green or yellow; the shape attribute tag indicates the shape attribute of the independent object, such as round, rectangular or heart-shaped; and the position attribute tag indicates the specific position of the independent object in the reconstructed scene, such as upper left, the northeast corner, or between object A and object B. A supplementary semantic tag formed from at least one of the color, shape and position attribute tags can remedy the situation where the original tag in the reconstructed indoor scene is ambiguous or insufficiently specific. For example, the original tag may be "chair", but chairs of various colors and shapes exist at different positions in the reconstructed indoor scene; in this case, the independent object results of all chairs in the reconstructed indoor scene are fed back to the user side, and if the independent object results returned to the user side are inconsistent with the user instruction (for example, the user instruction asks for a chair with a red heart-shaped backrest), a supplementary semantic tag (such as "red heart-shaped backrest") is added to the independent object area indicated by the user instruction.
In addition, for the open-vocabulary semantic tags, when the user searches for a specific object at a later stage, a confidence evaluation can be generated in combination with the semantic tags, thereby improving the accuracy of the semantic annotation. The confidence evaluation refers to the accuracy of the returned result when the user searches for a specific object at a later stage.
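One plausible way to implement the search and confidence evaluation described above is to reuse CLIP's text encoder and rank the stored semantic tags against the user instruction by cosine similarity; this is an illustrative sketch, not a procedure prescribed by the patent.

```python
import torch
import clip

def search_objects(query, object_labels, device="cuda", top_k=3):
    """Rank reconstructed objects against a natural-language user instruction.

    query:         free-form description taken from the user instruction.
    object_labels: list of semantic tags (possibly with supplementary attributes).
    Returns the top_k (label index, similarity) pairs.
    """
    model, _ = clip.load("ViT-B/32", device=device)
    with torch.no_grad():
        q = model.encode_text(clip.tokenize([query]).to(device))
        t = model.encode_text(clip.tokenize(object_labels).to(device))
        q = q / q.norm(dim=-1, keepdim=True)
        t = t / t.norm(dim=-1, keepdim=True)
        sims = (q @ t.T).squeeze(0)                 # cosine similarity as confidence
    order = sims.argsort(descending=True)[:top_k]
    return [(int(i), float(sims[i])) for i in order]
```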
And step S107, performing projection alignment on each independent object containing the semantic tag and the RGB-D sequence to obtain aligned point cloud data.
And S108, inputting the aligned point cloud data into a neural kernel surface reconstruction model to obtain a reconstructed indoor scene.
Specifically, as shown in the scene reconstruction flow chart in fig. 4, in the scene reconstruction stage a neural kernel surface reconstruction model (Neural Kernel Surface Reconstruction, NKSR for short) is adopted to perform high-precision three-dimensional geometric reconstruction on the segmented object point clouds. The NKSR model uses a neural network to learn local geometric features in the point cloud data and predicts the three-dimensional surface of the object from these features. It should be noted that the NKSR model combines convolutional neural networks (CNN) with kernel methods (Kernel Methods) and, by learning the local geometric information of the point cloud, can reconstruct object surfaces efficiently and accurately, providing high-quality basic data for subsequent applications such as virtual reality, indoor navigation and scene simulation.
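For reference, the sketch below follows the call pattern published in the NKSR project's documentation for reconstructing one segmented object point cloud; the exact function signatures may differ between library versions, and the normals are assumed to have been estimated beforehand (for example from local neighborhoods).

```python
import torch
import nksr

def reconstruct_instance(xyz, normals, colors, device="cuda:0"):
    """Surface reconstruction of one segmented object point cloud with NKSR.

    xyz, normals, colors: N x 3 float tensors (points, normals, RGB in [0, 1]).
    The call pattern follows the NKSR project's published example and is an
    assumption here; verify against the installed library version.
    """
    device = torch.device(device)
    xyz, normals, colors = xyz.to(device), normals.to(device), colors.to(device)

    reconstructor = nksr.Reconstructor(device)
    field = reconstructor.reconstruct(xyz, normals)               # implicit surface
    field.set_texture_field(nksr.fields.PCNNField(xyz, colors))   # attach color texture
    mesh = field.extract_dual_mesh(mise_iter=2)                   # textured triangle mesh
    return mesh                                                   # vertices/faces/colors in mesh.v, mesh.f, mesh.c
```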
Based on the scheme, the handheld laser radar is combined with the panoramic camera, and the three-dimensional geometric information of the laser radar point cloud information and the texture and color information of the panoramic camera are accurately aligned, so that the limitation of a single mode is broken through, and standardized RGB-D data are generated. The multi-mode fusion mode effectively improves the comprehensiveness and detail expression capability of scene perception, and provides rich data support for subsequent semantic segmentation, three-dimensional modeling and path planning. In addition, the embodiment combines the SAM model and the CLIP model, realizes semantic segmentation of open vocabulary, can automatically generate semantic tags and instance segmentation results without additional data labeling or model training, breaks through category limitation compared with the traditional pre-defined category segmentation technology, has strong universality and adaptability, can identify unknown objects in complex scenes and endow the semantic tags, and greatly reduces model development cost and labeling difficulty.
Further, in this embodiment, the SAM model and the CLIP model are combined to detect objects and generate specific labels without being limited by predefined categories, but label ambiguity may sometimes cause unstable or inconsistent detection results. In this regard, referring to the semantic tag labeling flow schematic diagram shown in fig. 5, in this embodiment a Grounding DINO model (an open-set, text-prompted object detection model) may further be used to label semantic tags for each independent object region, and a class file is introduced as the predefined class label set of the object detection model, so as to ensure the consistency and repeatability of the segmentation results; the class file can be defined according to actual requirements, improving the stability of the segmentation results.
In order to better illustrate the advantages of the indoor scene reconstruction method in practical application, the indoor gas pipeline wiring and indoor robot navigation are described by using the method.
(1) Indoor gas pipeline wiring example:
The method comprises the steps of collecting point cloud data and a 360-degree panorama of a kitchen space through a handheld laser radar and a panoramic camera, converting the panorama into a six-sided perspective view, generating a depth map according to the distance between the point cloud and the panoramic camera, generating RGB-D data, combining SAM and CLIP to realize open vocabulary semantic segmentation, and automatically identifying and marking the positions and shapes of objects such as walls, ceilings, floors, furniture and equipment which possibly obstruct wiring. And generating a semantic tag through the CLIP, adding detailed attribute description for the segmented object, and performing high-precision three-dimensional reconstruction on the point cloud by utilizing NKSR algorithm to generate a kitchen three-dimensional scene model with rich geometric details and real textures.
On the basis, the optimal path of the gas pipeline is automatically planned by combining with the pipeline wiring rule, the bracket fixing points are designed, dangerous areas are marked, and the safety and the rationality of wiring are ensured. The indoor scene reconstruction method remarkably reduces the data acquisition and modeling cost while improving the design efficiency of the gas wiring, has the advantages of high convenience and intelligence, and provides powerful technical support for construction, maintenance and safety evaluation of the gas wiring.
(2) Indoor robot navigation scene:
Point cloud data of an indoor scene collected by the laser radar and a 360-degree panoramic image collected by the panoramic camera are used; the panoramic image is converted into a six-sided cube image, a depth map is generated according to the distance between the point cloud and the panoramic camera, RGB-D data are generated, and semantic tags are generated by the segmentation foundation model SAM together with CLIP, so that the system can accurately perceive and analyze the environment and build a three-dimensional map with semantics. Based on the semantic three-dimensional map, the robot can avoid obstacles and adjust its navigation path, planning an optimal path with algorithms such as A* or Dijkstra. This is effectively applied to fields such as home service, commercial logistics and intelligent inspection, and has the technical advantages and social value of intelligence, flexibility and efficiency.
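For the path-planning step, a minimal Dijkstra sketch on a 2D occupancy grid derived from the semantic map is shown below; the grid construction and any semantic cost weighting are assumptions, since the text only names the algorithms.

```python
import heapq

def dijkstra(grid, start, goal):
    """Shortest path on a 2D occupancy grid (0 = free, 1 = obstacle).

    A minimal sketch for the navigation step; a semantic map would additionally
    let specific object classes raise or forbid cell costs.
    """
    H, W = len(grid), len(grid[0])
    dist = {start: 0}
    prev = {}
    heap = [(0, start)]
    while heap:
        d, cur = heapq.heappop(heap)
        if cur == goal:
            break
        if d > dist.get(cur, float("inf")):
            continue                                  # stale heap entry
        y, x = cur
        for ny, nx in ((y + 1, x), (y - 1, x), (y, x + 1), (y, x - 1)):
            if 0 <= ny < H and 0 <= nx < W and grid[ny][nx] == 0:
                nd = d + 1
                if nd < dist.get((ny, nx), float("inf")):
                    dist[(ny, nx)] = nd
                    prev[(ny, nx)] = cur
                    heapq.heappush(heap, (nd, (ny, nx)))
    # Reconstruct the path from goal back to start.
    path, node = [], goal
    while node != start:
        path.append(node)
        node = prev.get(node)
        if node is None:
            return []                                 # goal unreachable
    path.append(start)
    return path[::-1]
```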
It should be understood that, although the steps in the flowchart of fig. 1 are shown in order as indicated by the arrow, the steps are not necessarily performed in order as indicated by the arrow. The steps are not strictly limited to the order of execution unless explicitly recited herein, and the steps may be executed in other orders. Moreover, at least some of the steps in fig. 1 may include a plurality of sub-steps or sub-stages, which are not necessarily performed at the same time, but may be performed at different times, and the order in which the sub-steps or stages are performed is not necessarily sequential, but may be performed in turn or alternately with at least some of the other steps or sub-steps of other steps.
The embodiment of the present invention describes the method for reconstructing an indoor scene in detail, and the method disclosed in the present invention can be implemented by using various types of devices, so that the present invention also discloses an apparatus for reconstructing an indoor scene, and a specific embodiment is given below with reference to fig. 7.
The image acquisition module 501 is used for acquiring laser radar point cloud information and panoramic RGB images;
A panorama-to-hexahedral module 502, configured to pre-process the panoramic RGB image to obtain a hexahedral cube image;
the point cloud data color extraction module 503 is configured to combine the internal parameters and the external parameters of the panoramic camera, project the laser radar point cloud information to a hexahedral cube image, extract color information of each point in the point cloud information, and generate color point cloud data;
a depth parameter combining module 504, configured to generate an RGB-D data sequence according to the distance between each point in the color point cloud data and the panoramic camera;
The object segmentation module 505 is configured to input the hexahedral cube image to an object segmentation model, and divide an indoor scene into a plurality of independent object regions;
the semantic annotation module 506 is configured to input each independent object region into a vision-language model to obtain a semantic label of each independent object;
The data fusion module 507 is configured to perform projection alignment on each independent object containing the semantic tag and the RGB-D sequence, so as to obtain aligned point cloud data;
The scene reconstruction module 508 is configured to input the aligned point cloud data into the neural kernel surface reconstruction model to obtain a reconstructed indoor scene.
The device for reconstructing the indoor scene can be fully referred to the above limitation of the method, and will not be repeated here. Each of the modules in the above-described apparatus may be implemented in whole or in part by software, hardware, and combinations thereof. The above modules may be embedded in hardware or may be independent of the processor of the terminal device, or may be stored in software in the memory of the terminal device, so that the processor invokes and executes the operations corresponding to the above modules.
In one embodiment, the present invention also provides a computer readable storage medium having stored thereon a computer program which, when executed by a processor, implements the steps of the method of indoor scene reconstruction described above.
The computer readable storage medium may be an electronic memory such as a flash memory, an EEPROM (electrically erasable programmable read-only memory), an EPROM (erasable programmable read-only memory), a hard disk, or a ROM. Optionally, the computer readable storage medium comprises a non-transitory computer readable medium (non-transitory computer-readable storage medium). The computer readable storage medium has storage space for program code to perform any of the method steps described above. These program code can be read from or written to one or more computer program products, which can be compressed in a suitable form.
In one embodiment, the present invention provides a computer device comprising a memory storing a computer program and a processor executing the method of indoor scene reconstruction described above.
The computer device includes a memory, a processor, and one or more computer programs, wherein the one or more computer programs may be stored in the memory and configured to be executed by the one or more processors, and one or more application programs configured to perform the method of indoor scene reconstruction described above.
The processor may include one or more processing cores. The processor connects various parts of the overall computer device using various interfaces and lines, and performs the various functions of the computer device and processes data by running or executing instructions, programs, code sets or instruction sets stored in the memory and by invoking data stored in the memory. Optionally, the processor may be implemented in hardware in at least one of digital signal processing (Digital Signal Processing, DSP), field-programmable gate array (Field-Programmable Gate Array, FPGA) and programmable logic array (Programmable Logic Array, PLA). The processor may integrate one or a combination of a central processing unit (Central Processing Unit, CPU), a graphics processing unit (Graphics Processing Unit, GPU) and a modem. The CPU mainly handles the operating system, the user interface, application programs and the like; the GPU is used for rendering and drawing display content; and the modem is used for handling wireless communication. It can be understood that the modem may not be integrated into the processor and may instead be implemented by a separate communication chip.
The Memory may include random access Memory (Random Access Memory, RAM) or Read-Only Memory (ROM). The memory may be used to store instructions, programs, code sets, or instruction sets. The memory may include a stored program area and a stored data area, wherein the stored program area may store instructions for implementing an operating system, instructions for implementing at least one function (such as a touch function, a sound playing function, an image playing function, etc.), instructions for implementing the various method embodiments described above, and the like. The storage data area may also store data created by the terminal device in use, etc.
The foregoing embodiments are merely for illustrating the technical solution of the present invention, but not for limiting the same, and although the present invention has been described in detail with reference to the foregoing embodiments, it will be understood by those skilled in the art that modifications may be made to the technical solution described in the foregoing embodiments or equivalents may be substituted for parts of the technical features thereof, and that such modifications or substitutions do not depart from the spirit and scope of the technical solution of the embodiments of the present invention in essence.

Claims (11)

1. A method for indoor scene reconstruction, comprising:
acquiring laser radar point cloud information and a panoramic RGB image;
preprocessing the panoramic RGB image to obtain a hexahedral cube image;
projecting the laser radar point cloud information onto the hexahedral cube image in combination with internal parameters and external parameters of a panoramic camera, extracting color information of each point in the point cloud information, and generating color point cloud data;
generating an RGB-D data sequence according to the distance between each point in the color point cloud data and the panoramic camera;
inputting the hexahedral cube image into an object segmentation model to divide the indoor scene into a plurality of independent object areas;
inputting each independent object area into a vision-language model to obtain a semantic tag of each independent object;
performing projection alignment on each independent object containing a semantic tag and the RGB-D sequence to obtain aligned point cloud data; and
inputting the aligned point cloud data into a neural kernel surface reconstruction model to obtain a reconstructed indoor scene.

2. The method for indoor scene reconstruction according to claim 1, further comprising:
searching for a corresponding independent object result in the reconstructed indoor scene according to a received user instruction;
sending the searched independent object result to a user side; and
if the independent object result fed back by the user side is inconsistent with the user instruction, adding a supplementary semantic tag to the independent object area indicated by the user instruction.

3. The method for indoor scene reconstruction according to claim 1, wherein the object segmentation model is an instance segmentation model.

4. The method for indoor scene reconstruction according to claim 3, wherein the vision-language model is a contrastive language-image pre-training model or a Grounding DINO model.

5. The method for indoor scene reconstruction according to claim 4, wherein preprocessing the panoramic RGB image to obtain a hexahedral cube image specifically comprises:
converting the panoramic RGB image into the hexahedral cube image by equidistant cylindrical (equirectangular) projection.

6. The method for indoor scene reconstruction according to claim 5, wherein projecting the laser radar point cloud information onto the hexahedral cube image in combination with the internal parameters and external parameters of the panoramic camera, extracting the color information of each point in the point cloud information, and generating color point cloud data comprises the following steps, performed for any target point in the laser point cloud information:
step S201, correcting the coordinates of the target point according to the external parameters of the panoramic camera to obtain corrected target point coordinates;
step S202, rotating the corrected target point coordinates according to the internal parameters of the panoramic camera to obtain target point mapping coordinates;
step S203, obtaining the projection point coordinates of the target point on the hexahedral cube in combination with the width and the height of the panoramic RGB image;
step S204, recording the color data of the projection point as the color data of the target point;
step S205, repeating steps S201-S204 until color data has been determined for every target point in the laser point cloud information, thereby generating the color point cloud data.

7. The method for indoor scene reconstruction according to claim 6, wherein generating the RGB-D data sequence according to the distance between each point in the color point cloud data and the panoramic camera comprises:
generating a depth map according to the distance between each point in the color point cloud data and the panoramic camera; and
combining the depth map with the color point cloud data to obtain the RGB-D data sequence.

8. The method for indoor scene reconstruction according to claim 7, wherein combining the depth map with the color point cloud data to obtain the RGB-D data sequence further comprises:
for each target point of the color point cloud data, if the depth value of the target point is smaller than the historical depth value, updating the color feature of the target point.

9. An apparatus for indoor scene reconstruction, comprising:
an image acquisition module for acquiring laser radar point cloud information and a panoramic RGB image;
a panorama-to-hexahedron module for preprocessing the panoramic RGB image to obtain a hexahedral cube image;
a point cloud data color extraction module for projecting the laser radar point cloud information onto the hexahedral cube image in combination with internal parameters and external parameters of a panoramic camera, extracting color information of each point in the point cloud information, and generating color point cloud data;
a depth parameter combination module for generating an RGB-D data sequence according to the distance between each point in the color point cloud data and the panoramic camera;
an object segmentation module for inputting the hexahedral cube image into an object segmentation model to divide the indoor scene into a plurality of independent object areas;
a semantic annotation module for inputting each independent object area into a vision-language model to obtain a semantic tag of each independent object;
a data fusion module for performing projection alignment on each independent object containing a semantic tag and the RGB-D sequence to obtain aligned point cloud data; and
a scene reconstruction module for inputting the aligned point cloud data into a neural kernel surface reconstruction model to obtain a reconstructed indoor scene.

10. A computer-readable storage medium on which a computer program is stored, wherein the computer program, when executed by a processor, implements the steps of the method for indoor scene reconstruction according to any one of claims 1 to 8.

11. A computer device comprising a memory and a processor, the memory storing a computer program, wherein the processor, when executing the computer program, performs the method for indoor scene reconstruction according to any one of claims 1 to 8.
CN202510653377.5A | 2025-05-21 | 2025-05-21 | Method, device, storage medium and equipment for reconstructing indoor scene | Active | CN120182509B (en)

Priority Applications (1)

Application Number | Priority Date | Filing Date | Title
CN202510653377.5A CN120182509B (en) | 2025-05-21 | 2025-05-21 | Method, device, storage medium and equipment for reconstructing indoor scene

Applications Claiming Priority (1)

Application Number | Priority Date | Filing Date | Title
CN202510653377.5A CN120182509B (en) | 2025-05-21 | 2025-05-21 | Method, device, storage medium and equipment for reconstructing indoor scene

Publications (2)

Publication Number | Publication Date
CN120182509A (en) | 2025-06-20
CN120182509B (en) | 2025-08-12

Family

ID=96039800

Family Applications (1)

Application Number | Title | Priority Date | Filing Date
CN202510653377.5A Active CN120182509B (en) | 2025-05-21 | 2025-05-21 | Method, device, storage medium and equipment for reconstructing indoor scene

Country Status (1)

Country | Link
CN (1) | CN120182509B (en)


Citations (5)


Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
CN115731240A (en)* | 2021-08-26 | 2023-03-03 | 广州视源电子科技股份有限公司 | A segmentation method, device, electronic equipment and storage medium
US20240119697A1 (en)* | 2021-11-16 | 2024-04-11 | Google Llc | Neural Semantic Fields for Generalizable Semantic Segmentation of 3D Scenes
CN118505911A (en)* | 2024-03-27 | 2024-08-16 | 南京信息工程大学 | Indoor scene reconstruction method, system and storage medium
CN118840486A (en)* | 2024-07-05 | 2024-10-25 | 三星电子(中国)研发中心 | Three-dimensional reconstruction method and device of scene, electronic equipment and storage medium
CN119992082A (en)* | 2024-12-30 | 2025-05-13 | 东南大学 | A semantic segmentation method and system for scene-level synthetic point cloud enhancement based on diffusion model

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
WUFAN ZHAO et al.: "Rotation-Aware Building Instance Segmentation From High-Resolution Remote Sensing Images", IEEE Geoscience and Remote Sensing Letters, 17 August 2022 (2022-08-17), pages 1-5 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
CN120526066A (en)* | 2025-07-25 | 2025-08-22 | 兰笺(苏州)科技有限公司 | Panoramic image three-dimensional reconstruction method based on 3DGS
CN120526066B (en)* | 2025-07-25 | 2025-10-03 | 兰笺(苏州)科技有限公司 | Panoramic image three-dimensional reconstruction method based on 3DGS

Also Published As

Publication number | Publication date
CN120182509B (en) | 2025-08-12

Similar Documents

Publication | Publication Date | Title
Zhang et al. | A review of deep learning-based semantic segmentation for point cloud
TW202034215A (en) | Mapping object instances using video data
CN110458939A (en) | Indoor scene modeling method based on perspective generation
CN120182509B (en) | Method, device, storage medium and equipment for reconstructing indoor scene
CN114638866B (en) | A point cloud registration method and system based on local feature learning
CN114758337A (en) | A semantic instance reconstruction method, apparatus, device and medium
CN112927353A (en) | Three-dimensional scene reconstruction method based on two-dimensional target detection and model alignment, storage medium and terminal
CN115719436A (en) | Model training method, target detection method, device, equipment and storage medium
US12347141B2 (en) | Method and apparatus with object pose estimation
CN117576303A (en) | Three-dimensional image generation method, device, equipment and storage medium
Li et al. | Polarmesh: A star-convex 3d shape approximation for object pose estimation
Liang et al. | Material augmented semantic segmentation of point clouds for building elements
Akagic et al. | Computer vision with 3d point cloud data: Methods, datasets and challenges
Chang et al. | EI-MVSNet: Epipolar-guided multi-view stereo network with interval-aware label
Lyu et al. | 3DOPFormer: 3D occupancy perception from multi-camera images with directional and distance enhancement
CN112001247A (en) | Multi-target detection method, equipment and storage device
Qian et al. | Context-aware transformer for 3d point cloud automatic annotation
Jang et al. | Two-phase approach for monocular object detection and 6-dof pose estimation
CN113191462A (en) | Information acquisition method, image processing method and device and electronic equipment
Wong et al. | Factored neural representation for scene understanding
Lin et al. | 6D object pose estimation with pairwise compatible geometric features
Meng et al. | 3D indoor scene geometry estimation from a single omnidirectional image: A comprehensive survey
Wang et al. | Aerial-terrestrial Image Feature Matching: An Evaluation of Recent Deep Learning Methods
Tang et al. | Implicit guidance and explicit representation of semantic information in points cloud: a survey
Jiao et al. | NEHand: Enhancing Hand Pose Estimation in the Wild through Synthetic and Motion Capture Datasets

Legal Events

Date | Code | Title | Description
PB01 | Publication
SE01 | Entry into force of request for substantive examination
GR01 | Patent grant
