Movatterモバイル変換

Jump to content

Geometry processing

From Wikipedia, the free encyclopedia

Research topic in computational geometry

Polygon Mesh Processing by Mario Botsch et al. is a textbook on the topic of Geometry Processing.^[1]

Geometry processing is an area of research that uses concepts fromapplied mathematics,computer science andengineering to design efficientalgorithms for the acquisition,reconstruction,analysis, manipulation, simulation and transmission of complex 3D models. As the name implies, many of the concepts, data structures, and algorithms are directly analogous tosignal processing andimage processing. For example, whereimage smoothing might convolve an intensity signal with a blur kernel formed using theLaplace operator,geometric smoothing might be achieved by convolving asurface geometry with a blur kernel formed using theLaplace-Beltrami operator.

Applications of geometry processing algorithms already cover a wide range of areas frommultimedia,entertainment and classicalcomputer-aided design, to biomedical computing,reverse engineering, andscientific computing.^[1]

Geometry processing is a common research topic atSIGGRAPH, the premiercomputer graphics academic conference, and the main topic of the annualSymposium on Geometry Processing.

Geometry processing as a life cycle

A mesh of a cactus showing the Gaussian Curvature at each vertex, using the angle defect method

Geometry processing involves working with ashape, usually in 2D or 3D, although the shape can live in a space of arbitrary dimensions. The processing of a shape involves three stages, which is known as its life cycle. At its "birth," a shape can be instantiated through one of three methods: amodel, amathematical representation, or ascan. After a shape is born, it can be analyzed and edited repeatedly in a cycle. This usually involves acquiring different measurements, such as the distances between the points of the shape, the smoothness of the shape, or itsEuler characteristic. Editing may involve denoising, deforming, or performingrigid transformations. At the final stage of the shape's "life," it is consumed. This can mean it is consumed by a viewer as a rendered asset in a game or movie, for instance. The end of a shape's life can also be defined by a decision about the shape, like whether or not it satisfies some criteria. Or it can even befabricated in the real world, through a method such as 3D printing or laser cutting.

Discrete Representation of a Shape

Like any other shape, the shapes used in geometry processing have properties pertaining to theirgeometry andtopology. The geometry of a shape concerns the position of the shape'spoints in space,tangents,normals, andcurvature. It also includes the dimension in which the shape lives (ex. $R^{2}$ or $R^{3}$ ). Thetopology of a shape is a collection of properties that do not change even after smooth transformations have been applied to the shape. It concerns dimensions such as the number ofholes andboundaries, as well as theorientability of the shape. One example of a non-orientable shape is theMobius strip.

In computers, everything must be discretized. Shapes in geometry processing are usually represented astriangle meshes, which can be seen as agraph. Each node in the graph is a vertex (usually in $R^{3}$ ), which has a position. This encodes the geometry of the shape. Directed edges connect these vertices into triangles, which by the right hand rule, then have a direction called the normal. Each triangle forms a face of the mesh. These are combinatoric in nature and encode the topology of the shape. In addition to triangles, a more general class ofpolygon meshes can also be used to represent a shape. More advanced representations likeprogressive meshes encode a coarse representation along with a sequence of transformations, which produce a fine or high resolution representation of the shape once applied. These meshes are useful in a variety of applications, including geomorphs, progressive transmission, mesh compression, and selective refinement.^[2]

A mesh of the famous Stanford bunny. Shapes are usually represented as a mesh, a collection of polygons that delineate the contours of the shape.

Properties of a shape

Euler Characteristic

One particularly important property of a 3D shape is itsEuler characteristic, which can alternatively be defined in terms of itsgenus. The formula for this in the continuous sense is $\chi =2c-2h-b$ , where $c {\displaystyle c}$ is the number of connected components, $h {\displaystyle h}$ is number of holes (as in donut holes, seetorus), and $b {\displaystyle b}$ is the number of connected components of the boundary of the surface. A concrete example of this is a mesh of apair of pants. There is one connected component, 0 holes, and 3 connected components of the boundary (the waist and two leg holes). So in this case, the Euler characteristic is -1. To bring this into the discrete world, the Euler characteristic of a mesh is computed in terms of its vertices, edges, and faces. $\chi =|V|-|E|+|F|$ .

This image shows a mesh of a pair of pants, with Euler characteristic -1. This is explained by the equation to compute the characteristic: 2c - 2h - b. The mesh has 1 connected component, 0 topological holes, and 3 boundaries (the waist hole and each leg hole): 2 - 0 - 3 = -1.

Surface reconstruction

Poisson reconstruction from surface points to mesh

A triangle mesh is constructed out of apoint cloud. Sometimes shapes are initialized only as "point clouds," a collection of sampled points from the shape's surface. Often, these point clouds need to be converted to meshes.

Depending on how a shape is initialized or "birthed," the shape might exist only as a nebula of sampled points that represent its surface in space. To transform the surface points into a mesh, the Poisson reconstruction^[3] strategy can be employed. This method states that theindicator function, a function that determines which points in space belong to the surface of the shape, can actually be computed from the sampled points. The key concept is that gradient of the indicator function is0 everywhere, except at the sampled points, where it is equal to the inward surface normal. More formally, suppose the collection of sampled points from the surface is denoted by $S {\displaystyle S}$ , each point in the space by $p_{i}$ , and the corresponding normal at that point by $n_{i}$ . Then the gradient of the indicator function is defined as:

$\triangledown g={\begin{cases}{\textbf {n}}_{i},&\forall p_{i}\in S\\0,&{\text{otherwise}}\end{cases}}$

The task of reconstruction then becomes avariational problem. To find the indicator function of the surface, we must find a function $\chi$ such that $\lVert \triangledown \chi -{\textbf {V}}\rVert$ is minimized, where ${\textbf {V}}$ is the vector field defined by the samples. As a variational problem, one can view the minimizer $\chi$ as a solution ofPoisson's equation.^[3] After obtaining a good approximation for $\chi$ and a value $\sigma$ for which the points $(x,y,z)$ with $\chi (x,y,z)=\sigma$ lie on the surface to be reconstructed, themarching cubes algorithm can be used to construct atriangle mesh from the function $\chi$ , which can then be applied in subsequent computer graphics applications.

Registration

Point to point registration

An animation depicting registration of a partial mesh onto a complete mesh, with piecewise constant approximation of the projection function

Point to plane registration

An animation depicting the same registration procedure as above, but with piecewise linear approximation of the projection function. Note that it converges much faster.

One common problem encountered in geometry processing is how to merge multiple views of a single object captured from different angles or positions. This problem is known asregistration. In registration, we wish to find an optimalrigid transformation that will align surface $X {\displaystyle X}$ with surface $Y {\displaystyle Y}$ . More formally, if $P_{Y}(x)$ is the projection of a pointx from surface $X {\displaystyle X}$ onto surface $Y {\displaystyle Y}$ , we want to find the optimal rotation matrix $R {\displaystyle R}$ and translation vector $t {\displaystyle t}$ that minimize the following objective function:

$\int _{x\in X}||Rx+t-P_{Y}(x)||^{2}dx$

While rotations are non-linear in general, small rotations can be linearized as skew-symmetric matrices. Moreover, the distance function $x-P_{Y}(x)$ is non-linear, but is amenable to linear approximations if the change in $X {\displaystyle X}$ is small. An iterative solution such asIterative Closest Point (ICP) is therefore employed to solve for small transformations iteratively, instead of solving for the potentially large transformation in one go. In ICP,n random sample points from $X {\displaystyle X}$ are chosen and projected onto $Y {\displaystyle Y}$ . In order to sample points uniformly at random across the surface of the triangle mesh, the random sampling is broken into two stages: uniformly sampling points within a triangle; and non-uniformly sampling triangles, such that each triangle's associated probability is proportional to its surface area.^[4] Thereafter, the optimal transformation is calculated based on the difference between each $x {\displaystyle x}$ and its projection. In the following iteration, the projections are calculated based on the result of applying the previous transformation on the samples. The process is repeated until convergence.

Smoothing

When shapes are defined or scanned, there may be accompanying noise, either to a signal acting upon the surface or to the actual surface geometry. Reducing noise on the former is known asdata denoising, while noise reduction on the latter is known assurface fairing. The task of geometric smoothing is analogous to signal noise reduction, and consequently employs similar approaches.

The pertinent Lagrangian to be minimized is derived by recording the conformity to the initial signal ${\bar {f}}$ and the smoothness of the resulting signal, which approximated by the magnitude of the gradient with a weight $\lambda$ :

${\mathcal {L}}(f)=\int _{\Omega }\|f-{\bar {f}}\|^{2}+\lambda \|\nabla f\|^{2}dx$ .

Taking a variation $\delta f$ on ${\mathcal {L}}$ emits the necessary condition

$0=\delta {\mathcal {L}}(f)=\int _{\Omega }\delta f(\mathbf {I} +\lambda \nabla ^{2})f-\delta f{\bar {f}}dx$ .

By discretizing this onto piecewise-constant elements with our signal on the vertices we obtain

${\begin{aligned}\sum _{i}M_{i}\delta f_{i}{\bar {f}}_{i}&=\sum _{i}M_{i}\delta f_{i}\sum _{j}(\mathbf {I} +\lambda \nabla ^{2})f_{j}=\sum _{i}\delta f_{i}\sum _{j}(M+\lambda M\nabla ^{2})f_{j},\end{aligned}}$

A noisy sphere being iteratively smoothed

where our choice of $\nabla ^{2}$ is chosen to be $M^{-1}\mathbf {L}$ for the cotangent Laplacian $\mathbf {L}$ and the $M^{-1}$ term is to map the image of the Laplacian from areas to points. Because the variation is free, this results in a self-adjoint linear problem to solve with a parameter $\lambda$ : ${\bar {f}}=(M+\lambda \mathbf {L} )f.$ When working with triangle meshes one way to determine the values of the Laplacian matrix $L {\displaystyle L}$ is through analyzing the geometry of connected triangles on the mesh.

$L_{ij}={\begin{cases}{\frac {1}{2}}(\cot(\alpha _{ij})+\cot(\beta _{ij}))&{\text{edge ij exists}}\\-\sum \limits _{i\neq j}L_{ij}&i=j\\0&{\text{otherwise}}\end{cases}}$

Where $\alpha _{ij}$ and $\beta _{ij}$ are the angles opposite the edge $(i,j)$ ^[5]Themass matrix M as an operator computes the local integral of a function's value and is often set for a mesh with m triangles as follows:

$M_{ij}={\begin{cases}{\frac {1}{3}}\sum \limits _{t=1}^{m}{\begin{cases}Area(t)&{\text{if triangle t contains vertex i}}\\0&{\text{otherwise}}\end{cases}}&{\text{if i=j}}\\0&{\text{otherwise}}\end{cases}}$

Parameterization

Occasionally, we need to flatten a 3D surface onto a flat plane. This process is known asparameterization. The goal is to find coordinatesu andv onto which we can map the surface so that distortions are minimized. In this manner, parameterization can be seen as an optimization problem. One of the major applications of mesh parameterization istexture mapping.

Mass springs method

The Tutte Embedding shows non-smooth parameterizations on the side of the beetle.

One way to measure the distortion accrued in the mapping process is to measure how much the length of the edges on the 2D mapping differs from their lengths in the original 3D surface. In more formal terms, the objective function can be written as:

${\underset {U}{\text{min}}}\sum _{ij\in E}||u_{i}-u_{j}||^{2}$

Where $E {\displaystyle E}$ is the set of mesh edges and $U {\displaystyle U}$ is the set of vertices. However, optimizing this objective function would result in a solution that maps all of the vertices to a single vertex in theuv-coordinates. Borrowing an idea from graph theory, we apply theTutte Mapping and restrict the boundary vertices of the mesh onto aunit circle or otherconvex polygon. Doing so prevents the vertices from collapsing into a single vertex when the mapping is applied. The non-boundary vertices are then positioned at thebarycentric interpolation of their neighbours. The Tutte Mapping, however, still suffers from severe distortions as it attempts to make the edge lengths equal, and hence does not correctly account for the triangle sizes on the actual surface mesh.

Least-squares conformal mappings

A comparison of the Tutte Embedding and Least-Squares-Conformal-Mapping parameterization. Notice how the LSCM parameterization is smooth on the side of the beetle.

Another way to measure the distortion is to consider thevariations on theu andv coordinate functions. The wobbliness and distortion apparent in the mass springs methods are due to high variations in theu andv coordinate functions. With this approach, the objective function becomes theDirichlet energy onu andv:

${\underset {u,v}{\text{min}}}\int _{S}||\nabla u||^{2}+||\nabla v||^{2}dA$

There are a few other things to consider. We would like to minimize the angle distortion topreserve orthogonality. That means we would like $\nabla u=\nabla v^{\perp }$ . In addition, we would also like the mapping to have proportionally similar sized regions as the original. This results to setting the Jacobian of theu andv coordinate functions to 1.

${\begin{bmatrix}{\dfrac {\partial u}{\partial x}}&{\dfrac {\partial u}{\partial y}}\\[1em]{\dfrac {\partial v}{\partial x}}&{\dfrac {\partial v}{\partial y}}\end{bmatrix}}=1$

Putting these requirements together, we can augment the Dirichlet energy so that our objective function becomes:^[6]^[7]

${\underset {u,v}{\text{min}}}\int _{S}{\frac {1}{2}}||\nabla u||^{2}+{\frac {1}{2}}||\nabla v||^{2}-\nabla u\cdot \nabla v^{\perp }$

To avoid the problem of having all the vertices mapped to a single point, we also require that the solution to the optimization problem must have a non-zero norm and that it is orthogonal to the trivial solution.

Deformation

An example of as-rigid-as-possible deformation

Deformation is concerned with transforming some rest shape to a new shape. Typically, these transformations are continuous and do not alter the topology of the shape. Modern mesh-based shape deformation methods satisfy user deformation constraints at handles (selected vertices or regions on the mesh) and propagate these handle deformations to the rest of shape smoothly and without removing or distorting details. Some common forms of interactive deformations are point-based, skeleton-based, and cage-based.^[8] In point-based deformation, a user can apply transformations to small set of points, called handles, on the shape. Skeleton-based deformation defines askeleton for the shape, which allows a user to move the bones and rotate the joints. Cage-based deformation requires a cage to be drawn around all or part of a shape so that, when the user manipulates points on the cage, the volume it encloses changes accordingly.

Point-based deformation

Handles provide a sparse set of constraints for the deformation: as the user moves one point, the others must stay in place.

A rest surface ${\hat {S}}$ immersed in $\mathbb {R} ^{3}$ can be described with a mapping ${\hat {x}}:\Omega \rightarrow \mathbb {R} ^{3}$ , where $\Omega$ is a 2D parametric domain. The same can be done with another mapping $x {\displaystyle x}$ for the transformed surface $S {\displaystyle S}$ . Ideally, the transformed shape adds as little distortion as possible to the original. One way to model this distortion is in terms of displacements $d=x-{\hat {x}}$ with a Laplacian-based energy.^[9] Applying the Laplace operator to these mappings allows us to measure how the position of a point changes relative to its neighborhood, which keeps the handles smooth. Thus, the energy we would like to minimize can be written as:

$\min _{\textbf {d}}\int _{\Omega }||\Delta {\textbf {d}}||^{2}dA$ .

While this method is translation invariant, it is unable to account for rotations. The As-Rigid-As-Possible deformation scheme^[10] applies a rigid transformation $x_{i}=R{\hat {x_{i}}}+t$ to each handle i, where $R\in SO(3)\subset \mathbb {R} ^{3}$ is arotation matrix and $t\in \mathbb {R} ^{3}$ is a translation vector. Unfortunately, there's no way to know the rotations in advance, so instead we pick a “best” rotation that minimizes displacements. To achieve local rotation invariance, however, requires a function ${\textbf {R}}:\Omega \rightarrow SO(3)$ which outputs the best rotation for every point on the surface. The resulting energy, then, must optimize over both ${\textbf {x}}$ and ${\textbf {R}}$ :

$\min _{{\textbf {x,R}}\in SO(3)}\int _{\Omega }||\nabla {\textbf {x}}-{\textbf {R}}\nabla {\hat {\textbf {x}}}||^{2}dA$

Note that the translation vector is not present in the final objective function because translations have constant gradient.

Inside-Outside Segmentation

While seemingly trivial, in many cases, determining the inside from the outside of a triangle mesh is not an easy problem. In general, given a surface $S {\displaystyle S}$ we pose this problem as determining a function $isInside(q)$ which will return $1 {\displaystyle 1}$ if the point $q {\displaystyle q}$ is inside $S {\displaystyle S}$ , and $0 {\displaystyle 0}$ otherwise.

In the simplest case, the shape is closed. In this case, to determine if a point $q {\displaystyle q}$ is inside or outside the surface, we can cast a ray $r {\displaystyle r}$ in any direction from a query point, and count the number of times $count_{r}$ it passes through the surface. If $q {\displaystyle q}$ was outside $S {\displaystyle S}$ then the ray must either not pass through $S {\displaystyle S}$ (in which case $count_{r}=0$ ) or, each time it enters $S {\displaystyle S}$ it must pass through twice, because S is bounded, so any ray entering it must exit. So if $q {\displaystyle q}$ is outside, $count_{r}$ is even. Likewise if $q {\displaystyle q}$ is inside, the same logic applies to the previous case, but the ray must intersect $S {\displaystyle S}$ one extra time for the first time it leaves $S {\displaystyle S}$ . So:

$isInside_{r}(q)=\left\{{\begin{array}{ll}1&count_{r}\ is\ odd\\0&count_{r}\ is\ even\\\end{array}}\right.$

Now, oftentimes we cannot guarantee that the $S {\displaystyle S}$ is closed. Take the pair of pants example from the top of this article. This mesh clearly has a semantic inside-and-outside, despite there being holes at the waist and the legs.

Approximating inside-outside segmentation by shooting rays from a query point for varying number of rays

The naive attempt to solve this problem is to shoot many rays in random directions, and classify $q {\displaystyle q}$ as being insideif and only if most of the rays intersected $S {\displaystyle S}$ an odd number of times. To quantify this, let us say we cast $k {\displaystyle k}$ rays, $r_{1},r_{2},\dots ,r_{k}$ . We associate a number $rayTest(q)={\frac {1}{k}}\sum _{i=1}^{k}isInside_{r_{i}}(q)$ which is the average value of $isInside_{r}$ from each ray. Therefore:

$isInside(q)=\left\{{\begin{array}{ll}1&rayTest(q)\geq 0.5\\0&rayTest(q)<0.5\\\end{array}}\right.$

In the limit of shooting many, many rays, this method handles open meshes, however it in order to become accurate, far too many rays are required for this method to be computationally ideal. Instead, a more robust approach is the Generalized Winding Number.^[11] Inspired by the 2Dwinding number, this approach uses thesolid angle at $q {\displaystyle q}$ of each triangle in the mesh to determine if $q {\displaystyle q}$ is inside or outside. The value of the Generalized Winding Number at $q {\displaystyle q}$ , $wn(q)$ is proportional to the sum of the solid angle contribution from each triangle in the mesh:

$wn(q)={\frac {1}{4\pi }}\sum _{t\in F}solidAngle(t)$

For a closed mesh, $wn(q)$ is equivalent to the characteristic function for the volume represented by $S {\displaystyle S}$ . Therefore, we say:

$isInside(q)=\left\{{\begin{array}{ll}1&wn(q)\geq 0.5\\0&wn(q)<0.5\\\end{array}}\right.$

Because $wn(q)$ is aharmonic function, it degrades gracefully, meaning the inside-outside segmentation would not change much if we poked holes in a closed mesh. For this reason, the Generalized Winding Number handles open meshes robustly. The boundary between inside and outside smoothly passes over holes in the mesh. In fact, in the limit, the Generalized Winding Number is equivalent to the ray-casting method as the number of rays goes to infinity.

Applications

Computer-aided design (CAD)
3DSurface Reconstruction,e.g. range scanners in airport security, autonomous vehicles, medical scanner data reconstruction
Image-to-world Registration,e.g.Image-guided surgery
Architecture,e.g. creating,reverse engineering
Physics simulations
Computer gamese.g.collision detection
Geologic modelling
Visualization (graphics)e.g.Information visualizations,mathematical visualizations
Texture mapping
Modelling biological systemse.g. muscle and bone modelling, real-time hand tracking

See also

References

^^a ^bBotsch, Mario; Kobbelt, Leif; Pauly, Mark; Alliez, Pierre (2010).Polygon Mesh Processing.CRC Press.ISBN 9781568814261.
^Hugues Hoppe."Progressive Meshes"(PDF).
^^a ^b"Poisson surface reconstruction".hhoppe.com. Retrieved2017-01-26.
^Szymon Rusinkiewicz, Marc Levoy."Efficient Variants of the ICP Algorithm"(PDF).
^"Chris Tralie : Laplacian Meshes".www.ctralie.com. Retrieved2017-03-16.
^Desbrun, Mathieu (2002)."Intrinsic Parameterizations of Surface Meshes"(PDF).Eurographics.21.
^Levy, Bruno (2002)."Least squares conformal maps for automatic texture atlas generation"(PDF).ACM Transactions on Graphics.21 (3):362–371.doi:10.1145/566654.566590. Archived fromthe original(PDF) on 2017-03-15. Retrieved2017-03-14.
^Jacobson, Alec; Baran, Ilya; Popović, Jovan;Sorkine, Olga (2011)."Bounded Biharmonic Weights for Real-Time Deformation"(PDF).ACM Transactions on Graphics.30 (4): 1.doi:10.1145/2010324.1964973.
^Marc, Alexa (2003). "Differential coordinates for local mesh morphing and deformation".The Visual Computer.19 (2):105–114.doi:10.1007/s00371-002-0180-0.S2CID 6847571.
^Sorkine, Olga; Alexa, Marc (2007)."As-Rigid-As-Possible Surface Modeling"(PDF).Proceedings of EUROGRAPHICS/ACM SIGGRAPH Symposium on Geometry Processing:109–116.
^Jacobson, Alec; Ladislav, Kavan;Sorkine-Hornung, Olga (2013)."Robust Inside-Outside Segmentation using Generalized Winding Numbers"(PDF).ACM Transactions on Graphics.32 (4): 1.doi:10.1145/2461912.2461916.S2CID 207202533.

Retrieved from "https://en.wikipedia.org/w/index.php?title=Geometry_processing&oldid=1315318629"

Hidden categories:

[8]ページ先頭

©2009-2025 Movatter.jp