Movatterモバイル変換


[0]ホーム

URL:


CN119379954B - High-fidelity radiation field reconstruction method and system based on polarization normal estimation - Google Patents

High-fidelity radiation field reconstruction method and system based on polarization normal estimation
Download PDF

Info

Publication number
CN119379954B
CN119379954BCN202411962980.3ACN202411962980ACN119379954BCN 119379954 BCN119379954 BCN 119379954BCN 202411962980 ACN202411962980 ACN 202411962980ACN 119379954 BCN119379954 BCN 119379954B
Authority
CN
China
Prior art keywords
polarization
fidelity
radiation field
field reconstruction
image sequence
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202411962980.3A
Other languages
Chinese (zh)
Other versions
CN119379954A (en
Inventor
胡德文
谭臻
冯镭
张礼廉
李明
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
National University of Defense Technology
Original Assignee
National University of Defense Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by National University of Defense TechnologyfiledCriticalNational University of Defense Technology
Priority to CN202411962980.3ApriorityCriticalpatent/CN119379954B/en
Publication of CN119379954ApublicationCriticalpatent/CN119379954A/en
Application grantedgrantedCritical
Publication of CN119379954BpublicationCriticalpatent/CN119379954B/en
Activelegal-statusCriticalCurrent
Anticipated expirationlegal-statusCritical

Links

Classifications

Landscapes

Abstract

The invention discloses a high-fidelity radiation field reconstruction method and a system based on polarization normal estimation, wherein the method comprises the steps of extracting polarization degree and polarization angle from a polarized image sequence and extracting position codes from an RGB image sequence; inputting a multichannel tensor consisting of a polarized image sequence, polarization degree, polarization angle and position code into a trained attention mechanism image segmentation network to obtain an estimation result of a polarization normal vector, extracting a camera pose matrix from an RGB image sequence by adopting a motion structure recovery algorithm SfM and establishing a sparse point cloud, and realizing high-fidelity radiation field reconstruction according to the established sparse point cloud by using an improved DN-Splatter algorithm guided by the estimation result of the polarization normal vector. The invention aims to accelerate the convergence process of three-dimensional Gaussian ellipsoid optimization, so that the generated grid surface is smoother, and the problems of slow convergence of the existing radiation field reconstruction and unsmooth surface during grid generation are effectively solved.

Description

High-fidelity radiation field reconstruction method and system based on polarization normal estimation
Technical Field
The invention belongs to the field of high-precision three-dimensional model reconstruction, and particularly relates to a high-fidelity radiation field reconstruction method and system based on polarization normal estimation.
Background
The three-dimensional reconstruction is to reconstruct an object or a scene through a single or a plurality of two-dimensional images, is an important research direction in the field of computer vision at present, and has wide application in aspects such as medical diagnosis, cultural relic reconstruction, navigation positioning and the like. The traditional three-dimensional reconstruction is realized by a plurality of geometric calculation steps such as camera parameter estimation, dense point cloud reconstruction, surface reconstruction and the like according to the multi-view geometric principle. By taking reference to the ability of humans to reconstruct and perceive three-dimensional scenes based on brain rather than geometric solutions, it extends very naturally to the process of simulating the construction of three-dimensional space from two-dimensional images using deep learning methods. The domestic research has a certain foundation, and mainly focuses on improving the overall accuracy and rendering rate of three-dimensional reconstruction, but the accuracy of the three-dimensional Gaussian splatter algorithm is not high due to lack of geometric constraint on fitting surfaces. The method mainly comprises the following steps of generating artifacts and blurring in the surface optimization process, poor fitting consistency, low complex scene optimization convergence speed and the like. In view of the different polarization states exhibited by light on the surface of different objects, measurement of the polarization profile is also one of the specific applications of polarized light. The polarization normal vector calculation based on the physical model is difficult to overcome noise interference caused by complex scenes and illumination, and meanwhile, a good solution is not available for processing pi-ambiguity. With the development of deep learning technology, the image segmentation network is widely applied to polarization normal vector extraction, and object-level polarization normal vector estimation is extended to scene level at present, but the generalization of the network is not ideal, and error estimation is easy to occur to complex scenes.
Disclosure of Invention
Aiming at the problems in the prior art, the invention provides a high-fidelity radiation field reconstruction method and a high-fidelity radiation field reconstruction system based on polarization normal estimation, which aim to accelerate the convergence process of three-dimensional Gaussian ellipsoid optimization, so that the generated grid surface is smoother, and the problems of slow convergence of the existing radiation field reconstruction and unsmooth surface when the grid is generated are effectively solved.
In order to solve the technical problems, the invention adopts the following technical scheme:
a high-fidelity radiation field reconstruction method based on polarization normal estimation comprises the following steps:
s1, extracting polarization degree and polarization angle from a polarized image sequence, and extracting position codes from an RGB image sequence;
S2, inputting a multichannel tensor consisting of a polarized image sequence, polarization degree, polarization angle and position code into a trained attention mechanism image segmentation network to obtain an estimation result of a polarization normal vector, wherein the attention mechanism image segmentation network is an improved network of an image segmentation network Unet ++, an attention gate is added in residual connection of the image segmentation network Unet +, and a function expression of the attention gate is as follows:
,
,
,
wherein,For the output characteristics of the attention gate,For the input features of the attention gate,In order to add the attention output,For a nonlinear activated and resampled output,In the form of a linear transformation matrix,For the function to be activated by the ReLU,For inputting featuresIs used for the weight coefficient of the (c),To input gating signalIs used for the weight coefficient of the (c),In order to input the gate control signal,AndIn order for the offset to be a function of,The function is activated for Sigmoid,For the output of the upper layer,Is a parameterAndIs a collection of (3);
S3, extracting a camera pose matrix from the RGB image sequence by adopting a motion structure recovery algorithm SfM and establishing a sparse point cloud;
s4, replacing the normal vector of the original DN-Splatter algorithm with the estimation result of the polarization normal vector to obtain an improved DN-Splatter algorithm, and realizing high-fidelity radiation field reconstruction according to the established sparse point cloud by utilizing the improved DN-Splatter algorithm.
Optionally, the extracting the function expression of the polarization degree and the polarization angle from the polarized image sequence in the step S1 is:
,
,
,
,
,
wherein,In order to be of a degree of polarization,Is the polarization angle, the polarization degreeFor characterizing the proportion of the intensity of the polarized light in the beam to the total intensity, the angle of polarizationFor characterizing the angle between the amounts of linear polarization,AndIs the vector of Stokes,Is the polarization intensity in the direction of 0 degrees,Is the polarized intensity in the direction of 45 degrees,Is the polarized intensity in the 90 degree direction.
Optionally, the extracting a functional expression of the position code from the RGB image sequence in step S1 is:
,
wherein,For the purpose of the position coding,AndThe focal lengths of the cameras in the x and y directions,AndThe coordinates of the principal point of the camera in the x and y directions,AndIs the coordinates of the real world point in the camera coordinate system.
Optionally, step S2 is preceded by a step of training the attention mechanism image segmentation network using a polarized image sequence sample with a polarized normal vector label, wherein the calculated functional expression of the polarized normal vector label is:
,
wherein,Is the normal vector of polarization, which is the polarization vector,Curved surface upper point being three-dimensional Gaussian ellipsoidIs provided with a height of (1),Curved surface upper point being three-dimensional Gaussian ellipsoidIs provided with x-axis and y-axis coordinates,As a function of the angle of incidence,Is the azimuth angle of the incident space, and has:
,
,
,
wherein,In order to be of a refractive index,In order to be of a degree of polarization,As a signed arc-tangent function,AndIs the vector of Stokes,Is the polarization intensity in the direction of 0 degrees,Is the polarized intensity in the direction of 45 degrees,Is the polarized intensity in the direction of 90 degrees,Is 135 degree polarized intensity.
Optionally, when the improved DN-Splatter algorithm is used for realizing high-fidelity radiation field reconstruction according to the established sparse point cloud, the adopted loss function is obtained by weighted summation of luminosity error loss and Gaussian smoothness loss, and the function expression of the luminosity error loss is as follows:
,
wherein,In order to account for the loss of luminosity errors,As the coefficient of the light-emitting diode,In order for the frame to be a target frame,The frame to be reconstructed is then processed to obtain,Is a structural similarity measure, and the computational function expression of the structural similarity measure is:
,
wherein,Representation ofIs a measure of the structural similarity of (a),AndRespectively isIs used for the average value of (a),AndRespectively isIs a function of the variance of (a),Is thatIs used to determine the covariance of (1),AndIs a constant for avoiding the phenomenon of zero removal.
Optionally, the gaussian smoothness loss is expressed as a function of:
,
wherein,In order to be a loss of gaussian smoothness,In the case of a pixel which is a pixel,Is a pixelDepth to neighborhoodIs used for the gradient of (a),For the transpose operation,Is a pixelIntensity with neighborhoodIs a gradient of (a).
In addition, the invention also provides a high-fidelity radiation field reconstruction system based on the polarization normal estimation, which comprises a microprocessor and a memory which are connected with each other, wherein the microprocessor is configured to execute the high-fidelity radiation field reconstruction method based on the polarization normal estimation.
Furthermore, the invention provides a computer readable storage medium having stored therein a computer program configured to perform the high fidelity radiation field reconstruction method based on polarization normal estimation by a processor.
Furthermore, the invention provides a computer program product comprising a computer program configured to execute the high-fidelity radiation field reconstruction method based on polarization normal estimation by a processor.
Compared with the prior art, the method has the advantages that firstly, in order to improve the precision and the robustness in the three-dimensional reconstruction of scenes and objects, the method introduces polarization information into the model reconstruction process, as the polarization state of light changes when light interacts with the surfaces of different materials, the polarized image can realize the normal vector calculation of pixel level through processing, and the method effectively solves the problem of polarized normal vector estimation by means of a deep learning networkThe fuzzy problem ensures the estimation precision of the normal vector of the multiple objects at the scene level. Secondly, the method of the invention provides priori constraint for radiation field optimization and reconstruction by using the normal vector of polarization estimation, accelerates the convergence process of three-dimensional Gaussian ellipsoid optimization, leads the generated grid surface to be smoother, induces Gaussian balls to tend to be flattened in the normal direction, more smoothly fits the plane, improves the optimization convergence rate and the overall accuracy of radiation field reconstruction, has the advantages of strong instantaneity, smooth fitting and high accuracy, and can effectively solve the problems of slow convergence of the existing radiation field reconstruction and unsmooth surface when generating grids.
Drawings
FIG. 1 is a schematic diagram of a basic flow of a method according to an embodiment of the present invention.
Fig. 2 is a schematic diagram of the working principle of the method according to the embodiment of the invention.
Fig. 3 is a schematic diagram of mapping two-dimensional image coordinates to three-dimensional coordinates according to an embodiment of the present invention.
Fig. 4 is a schematic diagram of a network structure of an attention mechanism image segmentation network according to an embodiment of the present invention.
Fig. 5 is a schematic diagram of a network structure of an attention gate according to an embodiment of the present invention.
Fig. 6 is a schematic diagram of an embodiment of the present invention for improving the high-fidelity radiation field reconstruction of DN-Splatter algorithm.
Detailed Description
In order to make the present invention better understood by those skilled in the art, the following description will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings, and it is apparent that the described embodiments are only some embodiments of the present invention, not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.
The technical problems to be solved by the high-fidelity radiation field reconstruction method based on polarization normal estimation of the invention are that the precision of scene-level polarization normal vector estimation is low and the radiation field reconstruction surface is not smooth, as shown in fig. 1 and 2, the high-fidelity radiation field reconstruction method based on polarization normal estimation of the embodiment comprises the following steps:
s1, extracting polarization degree and polarization angle from a polarized image sequence, and extracting position codes from an RGB image sequence;
S2, inputting a multichannel tensor consisting of a polarized image sequence, a polarization degree, a polarization angle and a position code into a trained attention mechanism image segmentation network to obtain an estimation result of a polarization normal vector;
S3, extracting a camera pose matrix from the RGB image sequence by adopting a motion structure recovery algorithm SfM and establishing a sparse point cloud;
s4, replacing the normal vector of the original DN-Splatter algorithm with the estimation result of the polarization normal vector to obtain an improved DN-Splatter algorithm, and realizing high-fidelity radiation field reconstruction according to the established sparse point cloud by utilizing the improved DN-Splatter algorithm.
Calculating polarization degree and polarization angle information based on Stokes vectors through the input polarized images in the four polarization directions, and extracting the function expression of the polarization degree and the polarization angle from the polarized image sequence in the step S1 of the embodiment is as follows:
,
,
,
,
,
wherein,In order to be of a degree of polarization,Is the polarization angle, the polarization degreeFor characterizing the proportion of the intensity of the polarized light in the beam to the total intensity, the angle of polarizationFor characterizing the angle between the amounts of linear polarization,AndIs the vector of Stokes,Is the polarization intensity in the direction of 0 degrees,Is the polarized intensity in the direction of 45 degrees,Is the polarized intensity in the direction of 90 degrees,Is the coordinates of the pixel points
Since the position information of the observation direction affects the measured polarization information, the two-dimensional image coordinates are mapped to three dimensions through the camera internal parameters, and normalized to form a position code, fig. 3 is a schematic diagram of the principle of mapping the two-dimensional image coordinates to the three-dimensional coordinates in the embodiment, where O is the origin,Is a two-dimensional image coordinate and,Is a three-dimensional coordinate. In step S1 of this embodiment, the functional expression for extracting the position code from the RGB image sequence is:
,
wherein,For the purpose of the position coding,AndThe focal lengths of the cameras in the x and y directions,AndThe coordinates of the principal point of the camera in the x and y directions,AndIs the coordinates of the real world point in the camera coordinate system.
Inputting the multichannel tensor consisting of the original polarized image, the polarization degree, the polarization angle and the position code into a trained attention mechanism image segmentation network to obtain an estimation result of the polarization normal vector. Considering the difficulty in extracting normal vectors of multiple objects in a complex scene, referring to fig. 4, in this embodiment, attention gates are added in residual connection of a picture segmentation network Unet ++ with dense jump connection (triangles in fig. 4 are added attention gates, other structures are the same as Unet ++, Xi,j in each circular picture frame represents features), so that feature information of upper and lower layers of depth can be communicated, generalization of the network and interactivity between features are enhanced, the network can learn features of different depths by itself, parameter quantity is controlled to a certain extent, and overfitting is prevented under the condition of small sample training. The input to the attention gate is the upsampling feature of the extension node and the corresponding feature from the current encoder. The upsampling feature is used as a gating signal to suppress irrelevant areas in the task. The newly designed network effectively reduces the angle error of the normal vector estimation, improves the scene generalization of the normal vector estimation, and provides accurate priori constraint for the three-dimensional reconstruction task. FIG. 5 is a schematic diagram of the network structure of the attention gate in the present embodiment, assuming that the current encoder characteristics of the input areThe up-sampling feature of the extended node is thatThe functional expression of the attention gate is:
,
,
,
wherein,To pay attention toThe output characteristics of the gate are such that,For the input features of the attention gate,In order to add the attention output,For a nonlinear activated and resampled output,In the form of a linear transformation matrix,For the function to be activated by the ReLU,For inputting featuresIs used for the weight coefficient of the (c),To input gating signalIs used for the weight coefficient of the (c),In order to input the gate control signal,AndIn order for the offset to be a function of,The function is activated for Sigmoid,For the output of the upper layer,Is a parameterAndIs a set of (3). Resampling means that the output of the last node and the weight matrixElement-wise multiplication to achieve enhancement of the feature.
The embodiment further includes a step of training the attention mechanism image segmentation network by using a polarized image sequence sample with a polarized normal vector label, wherein the expression of the calculation function of the polarized normal vector label is:
,
wherein,Is the normal vector of polarization, which is the polarization vector,Curved surface upper point being three-dimensional Gaussian ellipsoidIs provided with a height of (1),Curved surface upper point being three-dimensional Gaussian ellipsoidIs provided with x-axis and y-axis coordinates,As a function of the angle of incidence,Is the azimuth angle of the incident space, and has:
,
,
,
wherein,In order to be of a refractive index,In order to be of a degree of polarization,As a signed arc-tangent function,AndIs the vector of Stokes,Is the polarization intensity in the direction of 0 degrees,Is the polarized intensity in the direction of 45 degrees,Is the polarized intensity in the direction of 90 degrees,Is 135 degree polarized intensity.
Referring to fig. 2, step S3 of this embodiment includes extracting a camera pose matrix from an RGB image sequence by using a motion structure restoration algorithm SfM, and creating a sparse point cloud, where the motion structure restoration algorithm SfM is an existing algorithm (see literature :Snavely, N., Seitz, S. M.,&Szeliski, R. (2006). Photo Tourism: Exploring Photo Collections In 3D. ACM Transactions on Graphics, 25(3), 835-846.),, which includes two major key links, feature extraction and matching and incremental three-dimensional reconstruction, inputting a group of images, extracting feature points from each input image by using a feature extraction and matching algorithm, and performing descriptor matching.
In this embodiment, the estimation result of the polarization normal vector is substituted for the original DN-Splatter algorithm in step S4 (see document :Bhat, S. F., Birkl, R., Wofk, D., Wonka, P.,&Müller, M. (2023). DN-Splatter: Depth and Normal Priors for Gaussian Splatting and Meshing. arXiv. arXiv:2403.17822), in which the normal vector is modified by DN-Splatter algorithm, and the high-fidelity radiation field reconstruction is realized by using the modified DN-Splatter algorithm according to the established sparse point cloud, fig. 6 is a schematic diagram of the high-fidelity radiation field reconstruction of the modified DN-Splatter algorithm in this embodiment, the estimation result of the polarization normal vector is substituted for the normal vector, the constraint of expanding the corresponding gaussian ellipsoid in the normal vector direction is given according to the polarization normal vector of the pixel point, the actual surface is attached in a flattened form as much as possible, the gaussian ellipsoid optimization variable mainly includes information such as size, color, pose of the triaxial direction, and the like, and the normal vector constraint regularizes the gaussian position of the edge perception loss in the optimization process, and the gaussian local smoothing and direction correction of the fitting surface.
As an optional implementation manner, when the embodiment uses the modified DN-Splatter algorithm to implement high-fidelity radiation field reconstruction according to the established sparse point cloud, the adopted loss function is obtained by weighted summation of luminosity error loss and gaussian smoothness loss, and the function expression of luminosity error loss is as follows:
,
wherein,In order to account for the loss of luminosity errors,As the coefficient of the light-emitting diode,In order for the frame to be a target frame,The frame to be reconstructed is then processed to obtain,Is a structural similarity measure, and the computational function expression of the structural similarity measure is:
,
wherein,Representation ofIs a measure of the structural similarity of (a),AndRespectively isIs used for the average value of (a),AndRespectively isIs a function of the variance of (a),Is thatIs used to determine the covariance of (1),AndIs a constant for avoiding the phenomenon of zero removal.
In this embodiment, the functional expression of the gaussian smoothness loss is:
,
wherein,In order to be a loss of gaussian smoothness,In the case of a pixel which is a pixel,Is a pixelDepth to neighborhoodIs used for the gradient of (a),For the transpose operation,Is a pixelIntensity with neighborhoodIs a gradient of (a).
In summary, the high-fidelity radiation field reconstruction method based on polarization normal estimation in the embodiment has the following characteristics that 1) the method of the embodiment firstly provides a new method for guiding three-dimensional radiation field reconstruction by using the normal vector of polarization estimation, thereby accelerating the optimal convergence rate of the radiation field and greatly improving the fitting precision of the three-dimensional reconstruction surface. 2) The method improves the prior scene-level polarization normal vector estimation network, adopts the attention gate to enhance the learning of the target area in Unet ++ jump connection, leads the attention connection between adjacent convolution layers to better process the relationship between the local normal vector estimation and the whole scene segmentation. 3) According to the method, an accurate normal vector diagram can be estimated only through a polarized image of a single view, and the step of initializing a Gaussian direction by using the normal direction of the estimated initial SfM point cloud for monocular normal estimation is omitted. 4) The method of the embodiment can enable grids generated by reconstructing the three-dimensional radiation field to be smoother, and can be better suitable for downstream robot navigation tasks.
In addition, the embodiment also provides a high-fidelity radiation field reconstruction system based on polarization normal estimation, which comprises a microprocessor and a memory which are connected with each other, wherein the microprocessor is configured to execute the high-fidelity radiation field reconstruction method based on the polarization normal estimation.
Furthermore, the present embodiment provides a computer readable storage medium having stored therein a computer program configured to execute the high-fidelity radiation field reconstruction method based on polarization normal estimation by a processor.
Furthermore, the present embodiment provides a computer program product comprising a computer program configured to execute the high-fidelity radiation field reconstruction method based on polarization normal estimation by a processor.
It will be appreciated by those skilled in the art that embodiments of the present application may be provided in the form of a method, system, or computer program product. Accordingly, the present application may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, the present application may take the form of a computer program product embodied on one or more computer-readable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein. The present application is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems) and computer program products according to embodiments of the application. It will be understood that each flow and/or block of the flowchart illustrations and/or block diagrams, and combinations of flows and/or blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks. These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks. These computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
The above description is only a preferred embodiment of the present invention, and the protection scope of the present invention is not limited to the above examples, and all technical solutions belonging to the concept of the present invention belong to the protection scope of the present invention. It should be noted that modifications and adaptations to the present invention may occur to one skilled in the art without departing from the principles of the present invention and are intended to be within the scope of the present invention.

Claims (9)

Translated fromChinese
1.一种基于偏振法向估计的高保真辐射场重建方法,其特征在于,包括下述步骤:1. A high-fidelity radiation field reconstruction method based on polarization normal estimation, characterized in that it comprises the following steps:S1,从偏振图像序列提取出偏振度和偏振角,从RGB图像序列提取出位置编码;S1, extracts polarization degree and polarization angle from polarization image sequence, and extracts position code from RGB image sequence;S2,将偏振图像序列、偏振度、偏振角以及位置编码组成的多通道张量输入训练好的注意力机制图像分割网络得到偏振法向量的估计结果;所述注意力机制图像分割网络为图像分割网络Unet++的改进网络,且该改进网络为在图像分割网络Unet+的残差连接中添加了注意力门,所述注意力门的函数表达式为:S2, inputting the multi-channel tensor consisting of the polarization image sequence, polarization degree, polarization angle and position encoding into the trained attention mechanism image segmentation network to obtain the estimation result of the polarization normal vector; the attention mechanism image segmentation network is an improved network of the image segmentation network Unet++, and the improved network adds an attention gate to the residual connection of the image segmentation network Unet+, and the function expression of the attention gate is: , , ,其中,为注意力门的输出特征,为注意力门的输入特征,为加法注意力输出,为非线性激活与重采样后的输出,为线性变换矩阵,为ReLU激活函数,为输入特征的权重系数,为输入门控信号的权重系数,为输入门控信号,为偏置,为Sigmoid激活函数,为上一层输出,为参数的集合;in, is the output feature of the attention gate, is the input feature of the attention gate, is the additive attention output, is the output after nonlinear activation and resampling, is the linear transformation matrix, is the ReLU activation function, For input features The weight coefficient of The input gate signal The weight coefficient of is the input gating signal, and is the bias, is the Sigmoid activation function, is the output of the previous layer, For parameters , and A collection of;S3,将RGB图像序列采用运动结构恢复算法SfM提取相机位姿矩阵并建立稀疏点云;S3, using the motion structure recovery algorithm SfM to extract the camera pose matrix of the RGB image sequence and build a sparse point cloud;S4,将偏振法向量的估计结果替代原始DN-Splatter算法的法向量得到改进DN-Splatter算法,利用改进DN-Splatter算法根据建立的稀疏点云实现高保真辐射场重建。S4, the estimated result of the polarization normal vector replaces the normal vector of the original DN-Splatter algorithm to obtain an improved DN-Splatter algorithm, and the improved DN-Splatter algorithm is used to achieve high-fidelity radiation field reconstruction based on the established sparse point cloud.2.根据权利要求1所述的基于偏振法向估计的高保真辐射场重建方法,其特征在于,步骤S1中从偏振图像序列提取出偏振度和偏振角的函数表达式为:2. The high-fidelity radiation field reconstruction method based on polarization normal estimation according to claim 1 is characterized in that the function expression of extracting the polarization degree and polarization angle from the polarization image sequence in step S1 is: , , , , ,其中,为偏振度,为偏振角,偏振度用于表征光束中偏振光的光强度占总光强的比例,偏振角用于表征线偏振量之间的夹角,为Stokes矢量,为0度方向的偏振强度,为45度方向的偏振强度,为90度方向的偏振强度。in, is the degree of polarization, is the polarization angle, degree of polarization It is used to characterize the ratio of the intensity of polarized light in a beam to the total intensity of light, the polarization angle Used to characterize the angle between linear polarization quantities, , and is the Stokes vector, is the polarization intensity in the 0 degree direction, is the polarization intensity in the 45 degree direction, is the polarization intensity in the 90 degree direction.3.根据权利要求1所述的基于偏振法向估计的高保真辐射场重建方法,其特征在于,步骤S1中从RGB图像序列提取出位置编码的函数表达式为:3. The high-fidelity radiation field reconstruction method based on polarization normal estimation according to claim 1 is characterized in that the function expression for extracting the position code from the RGB image sequence in step S1 is: ,其中,为位置编码,分别为相机在x、y方向的焦距,分别为相机在x、y方向的像主点坐标,为真实世界点在相机坐标系下的坐标。in, is the position code, and are the focal lengths of the camera in the x and y directions, and are the coordinates of the principal point of the camera in the x and y directions, , and is the coordinate of the real-world point in the camera coordinate system.4.根据权利要求1所述的基于偏振法向估计的高保真辐射场重建方法,其特征在于,步骤S2之前还包括利用带有偏振法向量标签的偏振图像序列样本来训练注意力机制图像分割网络的步骤,其中偏振法向量标签的计算函数表达式为:4. The high-fidelity radiation field reconstruction method based on polarization normal estimation according to claim 1 is characterized in that before step S2, it also includes a step of using polarization image sequence samples with polarization normal vector labels to train an attention mechanism image segmentation network, wherein the calculation function expression of the polarization normal vector label is: ,其中,为偏振法向量,为三维高斯椭球的曲面上点的高度,为三维高斯椭球的曲面上点的x轴和y轴坐标,为入射角,为入射空间的方位角,且有:in, is the polarization normal vector, is a point on the surface of a three-dimensional Gaussian ellipsoid Height, is a point on the surface of a three-dimensional Gaussian ellipsoid The x-axis and y-axis coordinates, is the incident angle, is the azimuth of the incident space, and: , , ,其中,为折射率,为偏振度,为带符号的反正切函数,为Stokes矢量,为0度方向的偏振强度,为45度方向的偏振强度,为90度方向的偏振强度,为135度方向的偏振强度。in, is the refractive index, is the degree of polarization, is the signed inverse tangent function, , and is the Stokes vector, is the polarization intensity in the 0 degree direction, is the polarization intensity in the 45 degree direction, is the polarization intensity in the 90 degree direction, is the polarization intensity in the 135 degree direction.5.根据权利要求4所述的基于偏振法向估计的高保真辐射场重建方法,其特征在于,利用改进DN-Splatter算法根据建立的稀疏点云实现高保真辐射场重建时,所采用的损失函数为光度误差损失、高斯平滑度损失两者加权求和得到,光度误差损失的函数表达式为:5. According to the high-fidelity radiation field reconstruction method based on polarization normal estimation of claim 4, it is characterized in that when the high-fidelity radiation field reconstruction is realized according to the established sparse point cloud using the improved DN-Splatter algorithm, the loss function adopted is the weighted sum of the photometric error loss and the Gaussian smoothness loss, and the function expression of the photometric error loss is: ,其中,为光度误差损失,为系数,为目标帧,为重构帧,为结构相似性度量,且结构相似性度量的计算函数表达式为:in, is the photometric error loss, is the coefficient, is the target frame, To reconstruct the frame, is the structural similarity measure, and the calculation function expression of the structural similarity measure is: ,其中,表示的结构相似性度量,分别为的均值,分别为的方差,的协方差,为用于避免出现除零的现象的常数。in, express The structural similarity measure of and They are The mean of and They are The variance of for The covariance of and A constant used to avoid division by zero.6.根据权利要求5所述的基于偏振法向估计的高保真辐射场重建方法,其特征在于,高斯平滑度损失的函数表达式为:6. The high-fidelity radiation field reconstruction method based on polarization normal estimation according to claim 5, characterized in that the function expression of Gaussian smoothness loss is: ,其中,为高斯平滑度损失,为像素,为像素与邻域的深度的梯度,为转置操作,为像素与邻域的强度的梯度。in, is the Gaussian smoothness loss, is a pixel, Pixel Depth of neighborhood The gradient of is the transpose operation, Pixel Strength with neighbors gradient.7.一种基于偏振法向估计的高保真辐射场重建系统,包括相互连接的微处理器和存储器,其特征在于,所述微处理器被配置以执行权利要求1~6中任意一项所述基于偏振法向估计的高保真辐射场重建方法。7. A high-fidelity radiation field reconstruction system based on polarization normal estimation, comprising a microprocessor and a memory connected to each other, characterized in that the microprocessor is configured to execute the high-fidelity radiation field reconstruction method based on polarization normal estimation as described in any one of claims 1 to 6.8.一种计算机可读存储介质,该计算机可读存储介质中存储有计算机程序,其特征在于,该计算机程序被配置以通过处理器执行权利要求1~6中任意一项所述基于偏振法向估计的高保真辐射场重建方法。8. A computer-readable storage medium storing a computer program, wherein the computer program is configured to execute the high-fidelity radiation field reconstruction method based on polarization normal estimation as described in any one of claims 1 to 6 through a processor.9.一种计算机程序产品,包括计算机程序,其特征在于,该计算机程序被配置以通过处理器执行权利要求1~6中任意一项所述基于偏振法向估计的高保真辐射场重建方法。9. A computer program product, comprising a computer program, characterized in that the computer program is configured to execute the high-fidelity radiation field reconstruction method based on polarization normal estimation according to any one of claims 1 to 6 through a processor.
CN202411962980.3A2024-12-302024-12-30High-fidelity radiation field reconstruction method and system based on polarization normal estimationActiveCN119379954B (en)

Priority Applications (1)

Application NumberPriority DateFiling DateTitle
CN202411962980.3ACN119379954B (en)2024-12-302024-12-30High-fidelity radiation field reconstruction method and system based on polarization normal estimation

Applications Claiming Priority (1)

Application NumberPriority DateFiling DateTitle
CN202411962980.3ACN119379954B (en)2024-12-302024-12-30High-fidelity radiation field reconstruction method and system based on polarization normal estimation

Publications (2)

Publication NumberPublication Date
CN119379954A CN119379954A (en)2025-01-28
CN119379954Btrue CN119379954B (en)2025-03-11

Family

ID=94336069

Family Applications (1)

Application NumberTitlePriority DateFiling Date
CN202411962980.3AActiveCN119379954B (en)2024-12-302024-12-30High-fidelity radiation field reconstruction method and system based on polarization normal estimation

Country Status (1)

CountryLink
CN (1)CN119379954B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN120411386B (en)*2025-07-022025-08-29中国电建集团华东勘测设计研究院有限公司Photovoltaic bracket three-dimensional model construction method and system based on aerial image

Citations (2)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
WO2015154601A1 (en)*2014-04-082015-10-15中山大学Non-feature extraction-based dense sfm three-dimensional reconstruction method
CN118570380A (en)*2024-05-282024-08-30清华大学 Monocular polarization 3D reconstruction method, device and electronic equipment based on deep learning

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
WO2021071992A1 (en)*2019-10-072021-04-15Boston Polarimetrics, Inc.Systems and methods for augmentation of sensor systems and imaging systems with polarization
CN115876124A (en)*2023-01-092023-03-31中国科学院深圳先进技术研究院High-light-reflection surface three-dimensional reconstruction method and device based on polarized structured light camera

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
WO2015154601A1 (en)*2014-04-082015-10-15中山大学Non-feature extraction-based dense sfm three-dimensional reconstruction method
CN118570380A (en)*2024-05-282024-08-30清华大学 Monocular polarization 3D reconstruction method, device and electronic equipment based on deep learning

Also Published As

Publication numberPublication date
CN119379954A (en)2025-01-28

Similar Documents

PublicationPublication DateTitle
CN108416840B (en) A 3D scene dense reconstruction method based on monocular camera
Cheng et al.Gaussianpro: 3d gaussian splatting with progressive propagation
Liu et al.Mvsgaussian: Fast generalizable gaussian splatting reconstruction from multi-view stereo
CN107240129A (en)Object and indoor small scene based on RGB D camera datas recover and modeling method
CN112785692B (en) A single-view multi-person reconstruction method based on deep UV prior
CN119379954B (en)High-fidelity radiation field reconstruction method and system based on polarization normal estimation
CN113850900B (en)Method and system for recovering depth map based on image and geometric clues in three-dimensional reconstruction
Liu et al.High-quality textured 3D shape reconstruction with cascaded fully convolutional networks
CN114359509A (en)Multi-view natural scene reconstruction method based on deep learning
CN115511759B (en)Point cloud image depth completion method based on cascading feature interaction
CN103530907A (en)Complicated three-dimensional model drawing method based on images
CN118247418B (en) A method for reconstructing neural radiation fields using a small number of blurred images
CN117745932A (en)Neural implicit curved surface reconstruction method based on depth fusion constraint
CN118887346A (en) 3D reconstruction method of Gaussian radiation field based on geometric distillation of pre-trained optical flow model
Wang et al.Learning-based multi-view stereo: A survey
CN118485779A (en) Sparse view indoor scene reconstruction method based on mutual information optimization neural radiance field
CN120298613A (en) A method for 3D reconstruction of global terrain of asteroids under illumination variation
CN119991937A (en) A single-view 3D human body reconstruction method based on Gaussian surface elements
Cao et al.Nerf-based polarimetric multi-view stereo
CN118864768A (en) A method for variable-scale 3D reconstruction of asteroids based on multi-image detection range fusion
Liu et al.GS-SDF: LiDAR-Augmented Gaussian Splatting and Neural SDF for Geometrically Consistent Rendering and Reconstruction
Zhu et al.Dynamic scene reconstruction: Recent advance in real-time rendering and streaming
CN114581494B (en) Method and device for estimating face optical flow based on neural non-rigid registration
ShaEnd-to-end three-dimensional reconstruction of transparent objects with multiple optimization strategies under limited constraints
Huang et al.IDiff-NeRF: single-view 3D human body reconstruction utilizing identity-based diffusion within implicit neural network framework: Y. Huang et al.

Legal Events

DateCodeTitleDescription
PB01Publication
PB01Publication
SE01Entry into force of request for substantive examination
SE01Entry into force of request for substantive examination
GR01Patent grant
GR01Patent grant

[8]ページ先頭

©2009-2025 Movatter.jp