Einstein notation

From Wikipedia, the free encyclopedia

Shorthand notation for tensor operations

In mathematics, especially the usage of linear algebra in mathematical physics and differential geometry, Einstein notation (also known as the Einstein summation convention or Einstein summation notation) is a notational convention that implies summation over a set of indexed terms in a formula, thus achieving brevity. As part of mathematics it is a notational subset of Ricci calculus; however, it is often used in physics applications that do not distinguish between tangent and cotangent spaces. It was introduced to physics by Albert Einstein in 1916.^[1]

Introduction

Statement of convention

According to this convention, when an index variable appears twice in a single term and is not otherwise defined (see Free and bound variables), it implies summation of that term over all the values of the index. So where the indices can range over the set {1, 2, 3}, $y=\sum _{i=1}^{3}x^{i}e_{i}=x^{1}e_{1}+x^{2}e_{2}+x^{3}e_{3}$ is simplified by the convention to: $y=x^{i}e_{i}$

The upper indices are not exponents but are indices of coordinates, coefficients or basis vectors. That is, in this context $x^{2}$ should be understood as the second component of $x$ rather than the square of $x$ (this can occasionally lead to ambiguity). The upper index position in $x^{i}$ is because, typically, an index occurs once in an upper (superscript) and once in a lower (subscript) position in a term (see § Application below). Typically, $(x^{1}, x^{2}, x^{3})$ would be equivalent to the traditional $(x, y, z)$.
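As a minimal sketch of what the convention abbreviates, the implied sum $y=x^{i}e_{i}$ can be written as an explicit loop. The component values and the choice of the standard basis of $\mathbb{R}^{3}$ are illustrative assumptions, not part of the convention. Python lists are 0-indexed, so the index values 1, 2, 3 correspond to positions 0, 1, 2.

```python
# Components x^i (the upper index labels a component; it is not an exponent).
x = [2, 5, 7]

# Basis vectors e_i, assumed here to be the standard basis of R^3.
e = [
    [1, 0, 0],
    [0, 1, 0],
    [0, 0, 1],
]

# y = x^i e_i: the repeated index i is summed over all of its values.
y = [0, 0, 0]
for i in range(3):
    for k in range(3):  # k runs over the entries of the vector e_i
        y[k] += x[i] * e[i][k]

print(y)  # [2, 5, 7]
```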

In general relativity, a common convention is that

the Greek alphabet is used for space and time components, where indices take on values 0, 1, 2, or 3 (frequently used letters are μ, ν, ...),
the Latin alphabet is used for spatial components only, where indices take on values 1, 2, or 3 (frequently used letters are i, j, ...).

In general, indices can range over any indexing set, including an infinite set. This should not be confused with a typographically similar convention used to distinguish between tensor index notation and the closely related but distinct basis-independent abstract index notation.

An index that is summed over is a summation index, in this case "i". It is also called a dummy index since any symbol can replace "i" without changing the meaning of the expression (provided that it does not collide with other index symbols in the same term).

An index that is not summed over is a free index and should appear only once per term. If such an index does appear, it usually also appears in every other term in an equation. An example of a free index is the "i" in the equation $v_{i}=a_{i}b_{j}x^{j}$, which is equivalent to the equation $v_{i}=\sum _{j}(a_{i}b_{j}x^{j})$.
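The distinction between free and dummy indices can be illustrated with a short sketch of $v_{i}=a_{i}b_{j}x^{j}$. The numerical values are arbitrary assumptions chosen for illustration: the dummy index becomes a loop variable that is summed away, while the free index labels the components of the result.

```python
a = [1, 2, 3]  # a_i
b = [4, 5, 6]  # b_j
x = [1, 0, 2]  # x^j
n = 3

# v_i = a_i b_j x^j: j is repeated, so it is summed over;
# i is free, so it labels the components of the result.
v = [sum(a[i] * b[j] * x[j] for j in range(n)) for i in range(n)]

# Renaming the dummy index j -> k does not change the value.
v_renamed = [sum(a[i] * b[k] * x[k] for k in range(n)) for i in range(n)]

print(v)  # [16, 32, 48]
```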

Application

Einstein notation can be applied in slightly different ways. Typically, each index occurs once in an upper (superscript) and once in a lower (subscript) position in a term; however, the convention can be applied more generally to any repeated indices within a term.^[2] When dealing with covariant and contravariant vectors, where the position of an index indicates the type of vector, the first case usually applies; a covariant vector can only be contracted with a contravariant vector, corresponding to summation of the products of coefficients. On the other hand, when there is a fixed coordinate basis (or when not considering coordinate vectors), one may choose to use only subscripts; see § Superscripts and subscripts versus only subscripts below.

Vector representations

Superscripts and subscripts versus only subscripts

In terms of covariance and contravariance of vectors,

upper indices represent components of contravariant vectors (vectors),
lower indices represent components of covariant vectors (covectors).

They transform contravariantly or covariantly, respectively, with respect to change of basis.

In recognition of this fact, the following notation uses the same symbol both for a vector or covector and its components, as in: ${\begin{aligned}v&=e_{i}v^{i}={\begin{bmatrix}e_{1}&e_{2}&\cdots &e_{n}\end{bmatrix}}{\begin{bmatrix}v^{1}\\v^{2}\\\vdots \\v^{n}\end{bmatrix}}\\w&=w_{i}e^{i}={\begin{bmatrix}w_{1}&w_{2}&\cdots &w_{n}\end{bmatrix}}{\begin{bmatrix}e^{1}\\e^{2}\\\vdots \\e^{n}\end{bmatrix}}\end{aligned}}$

where $v$ is the vector and $v^{i}$ are its components (not the $i$th covector $v$), $w$ is the covector and $w_{i}$ are its components. The basis vector elements $e_{i}$ are each column vectors, and the covector basis elements $e^{i}$ are each row covectors. (See also § Abstract description; duality, below and the examples.)

In the presence of a non-degenerate form (an isomorphism $V\to V^{*}$, for instance a Riemannian metric or Minkowski metric), one can raise and lower indices.

A basis gives such a form (via the dual basis), hence when working on $\mathbb{R}^{n}$ with a Euclidean metric and a fixed orthonormal basis, one has the option to work with only subscripts.

However, if one changes coordinates, the way that coefficients change depends on the variance of the object, and one cannot ignore the distinction; see Covariance and contravariance of vectors.

Mnemonics

In the above example, vectors are represented as n × 1 matrices (column vectors), while covectors are represented as 1 × n matrices (row covectors).

When using the column vector convention:

"Upper indices go up to down; lower indices go left to right."
"Covariant tensors are row vectors that have indices that are below (co-row-below)."
Covectors are row vectors: ${\begin{bmatrix}w_{1}&\cdots &w_{k}\end{bmatrix}}.$ Hence the lower index indicates which column you are in.
Contravariant vectors are column vectors: ${\begin{bmatrix}v^{1}\\\vdots \\v^{k}\end{bmatrix}}.$ Hence the upper index indicates which row you are in.

Abstract description

The virtue of Einstein notation is that it represents the invariant quantities with a simple notation.

In physics, a scalar is invariant under transformations of basis. In particular, a Lorentz scalar is invariant under a Lorentz transformation. The individual terms in the sum are not. When the basis is changed, the components of a vector change by a linear transformation described by a matrix. This led Einstein to propose the convention that repeated indices imply the summation is to be done.

As for covectors, they change by the inverse matrix. This is designed to guarantee that the linear function associated with the covector, the sum above, is the same no matter what the basis is.
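The invariance of the contracted sum can be checked numerically. The sketch below assumes an arbitrary invertible 2 × 2 matrix $M$ acting on vector components, with its inverse acting on covector components; the helper name `contract` and all values are hypothetical, and exact fractions are used to avoid rounding effects.

```python
from fractions import Fraction

def contract(w, v):
    """Return the scalar w_i v^i (sum over the repeated index i)."""
    return sum(w[i] * v[i] for i in range(len(v)))

# Assumed matrix M^i_j describing how vector components change.
M = [[Fraction(2), Fraction(1)],
     [Fraction(1), Fraction(1)]]

# Inverse of a 2x2 matrix via the adjugate formula.
det = M[0][0] * M[1][1] - M[0][1] * M[1][0]
M_inv = [[M[1][1] / det, -M[0][1] / det],
         [-M[1][0] / det, M[0][0] / det]]

v = [Fraction(3), Fraction(4)]   # vector components v^j
w = [Fraction(5), Fraction(-1)]  # covector components w_i

# v'^i = M^i_j v^j: vector components change by M.
v_new = [sum(M[i][j] * v[j] for j in range(2)) for i in range(2)]
# w'_j = w_i (M^-1)^i_j: covector components change by the inverse.
w_new = [sum(w[i] * M_inv[i][j] for i in range(2)) for j in range(2)]

print(contract(w, v), contract(w_new, v_new))  # 11 11
```

The individual products $w_{i}v^{i}$ differ before and after the change, but their sum does not.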

The value of the Einstein convention is that it applies to other vector spaces built from $V$ using the tensor product and duality. For example, $V\otimes V$, the tensor product of $V$ with itself, has a basis consisting of tensors of the form $\mathbf {e} _{ij}=\mathbf {e} _{i}\otimes \mathbf {e} _{j}$. Any tensor $\mathbf {T}$ in $V\otimes V$ can be written as: $\mathbf {T} =T^{ij}\mathbf {e} _{ij}.$

$V^{*}$, the dual of $V$, has a basis $\mathbf {e} ^{1},\mathbf {e} ^{2},\ldots ,\mathbf {e} ^{n}$ which obeys the rule $\mathbf {e} ^{i}(\mathbf {e} _{j})=\delta _{j}^{i}$, where $\delta$ is the Kronecker delta. As $\operatorname {Hom} (V,W)=V^{*}\otimes W$, the row/column coordinates on a matrix correspond to the upper/lower indices on the tensor product.

Common operations in this notation

In Einstein notation, the usual element reference $A_{mn}$ for the $m$-th row and $n$-th column of matrix $A$ becomes ${A^{m}}_{n}$. The following operations can then be written in Einstein notation.

Inner product

The inner product of two vectors is the sum of the products of their corresponding components, with the indices of one vector lowered (see § Raising and lowering indices): $\langle \mathbf {u} ,\mathbf {v} \rangle =\langle \mathbf {e} _{i},\mathbf {e} _{j}\rangle u^{i}v^{j}=u_{j}v^{j}$ In the case of an orthonormal basis, we have $u^{j}=u_{j}$, and the expression simplifies to: $\langle \mathbf {u} ,\mathbf {v} \rangle =\sum _{j}u^{j}v^{j}=u_{j}v^{j}$
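A minimal sketch of the general (non-orthonormal) case follows. The Gram matrix `g`, standing for $\langle \mathbf {e} _{i},\mathbf {e} _{j}\rangle$, and the component values are assumptions chosen for illustration. It computes the double sum directly and then again by first lowering the index of $u$.

```python
# Assumed Gram matrix g_ij = <e_i, e_j> of a non-orthonormal basis of R^2.
g = [[2, 1],
     [1, 3]]
u = [1, 2]   # u^i
v = [3, -1]  # v^j
n = 2

# <u, v> = g_ij u^i v^j: both i and j are repeated, so both are summed.
inner = sum(g[i][j] * u[i] * v[j] for i in range(n) for j in range(n))

# Lowering the index first: u_j = g_ij u^i, then <u, v> = u_j v^j.
u_lower = [sum(g[i][j] * u[i] for i in range(n)) for j in range(n)]
inner_lowered = sum(u_lower[j] * v[j] for j in range(n))

print(inner, inner_lowered)  # 5 5
```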

Vector cross product

In three dimensions, the cross product of two vectors with respect to a positively oriented orthonormal basis, meaning that $\mathbf {e} _{1}\times \mathbf {e} _{2}=\mathbf {e} _{3}$, can be expressed as: $\mathbf {u} \times \mathbf {v} =\varepsilon _{\,jk}^{i}u^{j}v^{k}\mathbf {e} _{i}$

Here, $\varepsilon _{\,jk}^{i}=\varepsilon _{ijk}$ is the Levi-Civita symbol. Since the basis is orthonormal, raising the index $i$ does not alter the value of $\varepsilon _{ijk}$, when treated as a tensor.
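The double sum over $j$ and $k$ can be spelled out as in the sketch below. The helper `levi_civita` is a hypothetical name, and its closed-form expression is one of several equivalent ways to evaluate the symbol; indices are 0-based here, and the vectors are arbitrary example values.

```python
def levi_civita(i, j, k):
    """Levi-Civita symbol for 0-based indices i, j, k in {0, 1, 2}."""
    # The product of pairwise differences is +2 for even permutations,
    # -2 for odd permutations, and 0 whenever two indices coincide.
    return (j - i) * (k - i) * (k - j) // 2

u = [1, 2, 3]
v = [4, 5, 6]

# (u x v)^i = eps^i_jk u^j v^k, summing over the repeated indices j and k.
cross = [
    sum(levi_civita(i, j, k) * u[j] * v[k]
        for j in range(3) for k in range(3))
    for i in range(3)
]

print(cross)  # [-3, 6, -3]
```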

Matrix-vector multiplication

The product of a matrix $A_{ij}$ with a column vector $v_{j}$ is: $\mathbf {u} _{i}=(\mathbf {A} \mathbf {v} )_{i}=\sum _{j=1}^{N}A_{ij}v_{j}$ which is equivalent to $u^{i}={A^{i}}_{j}v^{j}$

This is a special case of matrix multiplication.
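As a small illustration with assumed example values, $u^{i}={A^{i}}_{j}v^{j}$ sums over the repeated column index $j$ and leaves the row index $i$ free:

```python
A = [[1, 2],
     [3, 4]]  # A^i_j: i is the row, j is the column
v = [5, 6]    # v^j

# u^i = A^i_j v^j: sum over the repeated index j; i remains free.
u = [sum(A[i][j] * v[j] for j in range(2)) for i in range(2)]

print(u)  # [17, 39]
```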

Matrix multiplication

The matrix product of two matrices $A_{ij}$ and $B_{jk}$ is: $\mathbf {C} _{ik}=(\mathbf {A} \mathbf {B} )_{ik}=\sum _{j=1}^{N}A_{ij}B_{jk}$

which is equivalent to ${C^{i}}_{k}={A^{i}}_{j}{B^{j}}_{k}$
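A sketch with non-square matrices (the values are assumptions chosen for illustration) shows that the summed index $j$ disappears from the result, while the two free indices $i$ and $k$ give its shape:

```python
A = [[1, 2, 3],
     [4, 5, 6]]  # A^i_j, shape 2 x 3
B = [[1, 0],
     [0, 1],
     [1, 1]]     # B^j_k, shape 3 x 2

# C^i_k = A^i_j B^j_k: j is summed; i and k are free.
C = [[sum(A[i][j] * B[j][k] for j in range(3)) for k in range(2)]
     for i in range(2)]

print(C)  # [[4, 5], [10, 11]]
```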

Trace

For a square matrix ${A^{i}}_{j}$, the trace is the sum of the diagonal elements, hence the sum over a common index ${A^{i}}_{i}$.
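In code, setting both indices equal and summing is a single loop over the diagonal; the matrix below is an arbitrary example:

```python
A = [[1, 2, 3],
     [4, 5, 6],
     [7, 8, 9]]

# A^i_i: the same index appears in upper and lower position, so it is summed.
trace = sum(A[i][i] for i in range(3))

print(trace)  # 15
```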

Outer product

The outer product of the column vector $u^{i}$ by the row vector $v_{j}$ yields an m × n matrix $A$: ${A^{i}}_{j}=u^{i}v_{j}={(uv)^{i}}_{j}$

Since $i$ and $j$ represent two different indices, there is no summation and the indices are not eliminated by the multiplication.
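The absence of a repeated index means the code contains no sum at all, only one entry per pair $(i, j)$. The vectors below are assumed example values with m = 3 and n = 2:

```python
u = [1, 2, 3]  # u^i, m = 3
v = [4, 5]     # v_j, n = 2

# A^i_j = u^i v_j: no index is repeated, so nothing is summed.
A = [[u[i] * v[j] for j in range(len(v))] for i in range(len(u))]

print(A)  # [[4, 5], [8, 10], [12, 15]]
```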

Raising and lowering indices

Given a tensor, one can raise an index or lower an index by contracting the tensor with the metric tensor, $g_{\mu \nu }$. For example, taking the tensor ${T^{\alpha }}_{\beta }$, one can lower an index: $g_{\mu \sigma }{T^{\sigma }}_{\beta }=T_{\mu \beta }$

Or one can raise an index: $g^{\mu \sigma }{T_{\sigma }}^{\alpha }=T^{\mu \alpha }$
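The following sketch lowers the first index of a mixed tensor and then raises it again. It assumes the Minkowski metric with signature (−, +, +, +), which is one common choice and not fixed by the text above, and an arbitrary illustrative tensor; because this metric is diagonal with entries ±1, its inverse has the same components.

```python
# Assumed Minkowski metric g_{mu nu} with signature (-, +, +, +).
g = [[-1, 0, 0, 0],
     [0, 1, 0, 0],
     [0, 0, 1, 0],
     [0, 0, 0, 1]]
g_inv = g  # g^{mu nu}: this particular metric is its own inverse
n = 4

# Illustrative mixed tensor T^sigma_beta with entries 4*sigma + beta.
T_mixed = [[4 * s + b for b in range(n)] for s in range(n)]

# Lower the first index: T_{mu beta} = g_{mu sigma} T^sigma_beta.
T_lower = [[sum(g[m][s] * T_mixed[s][b] for s in range(n))
            for b in range(n)] for m in range(n)]

# Raise it again: g^{mu sigma} T_{sigma beta} recovers T^mu_beta.
T_back = [[sum(g_inv[m][s] * T_lower[s][b] for s in range(n))
           for b in range(n)] for m in range(n)]

print(T_lower[0])  # [0, -1, -2, -3]
```

With this metric, lowering an index only flips the sign of the components whose lowered index is 0; a general metric would mix components.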

Notes

This applies only for numerical indices. The situation is the opposite for abstract indices. Then, vectors themselves carry upper abstract indices and covectors carry lower abstract indices, as per the example in the introduction of this article. Elements of a basis of vectors may carry a lower numerical index and an upper abstract index.

References

[1] Einstein, Albert (1916). "The Foundation of the General Theory of Relativity". Annalen der Physik. 354 (7): 769. Bibcode:1916AnP...354..769E. doi:10.1002/andp.19163540702. Archived from the original (PDF) on 2006-08-29. Retrieved 2006-09-03.
[2] "Einstein Summation". Wolfram Mathworld. Retrieved 13 April 2011.

Bibliography

Kuptsov, L. P. (2001) [1994], "Einstein rule", Encyclopedia of Mathematics, EMS Press.

External links

The Wikibook General Relativity has a page on the topic of: Einstein Summation Notation

Rawlings, Steve (2007-02-01). "Lecture 10 – Einstein Summation Convention and Vector Identities". Oxford University. Archived from the original on 2017-01-06. Retrieved 2008-07-02.
"Vector Calculation in Index Notation (Einstein's Summation Convention)" (PDF).
"Understanding NumPy's einsum". Stack Overflow.

Retrieved from "https://en.wikipedia.org/w/index.php?title=Einstein_notation&oldid=1316620091"
