Einstein notation

From Free net encyclopedia

For other topics related to Einstein, see Einstein (disambiguation).

In mathematics, especially in applications of linear algebra to physics, the Einstein notation or Einstein summation convention is a notational convention useful when dealing with coordinate formulae.

According to this convention, when an index variable appears twice in a single term, once in an upper and once in a lower position, it implies that we are summing over all of its possible values. In typical applications, these are 1,2,3 (for calculations in Euclidean space), or 0,1,2,3 or 1,2,3,4 (for calculations in Minkowski space), but they can have any range, even (in some applications) an infinite set. Abstract index notation is an improvement of Einstein notation.

In general relativity, the Greek alphabet and the Roman alphabet are used to distinguish whether summing over 1,2,3 or 0,1,2,3 (usually Roman, i, j, ... for 1,2,3 and Greek, μ, ν, ... for 0,1,2,3). As in sign conventions, the convention used in practice varies: Roman and Greek may be reversed.

Sometimes (as in general relativity), the index is required to appear once as a superscript and once as a subscript; in other applications, all indices are subscripts. See Dual vector space and Tensor product.

It is important to keep in mind that no new physical laws or ideas result from using Einstein notation; rather, it merely helps in identifying relationships and symmetries often 'hidden' by more conventional notation.

Contents

Introduction

In mechanics and engineering, vectors in 3D space are often described in relation to orthogonal unit vectors i, j and k.

<math>\mathbf{u} = u_x \mathbf{i} + u_y \mathbf{j} + u_z \mathbf{k}</math>

If the basis vectors i, j, and k are instead expressed as e1, e2, and e3, a vector can be expressed in terms of a summation:

<math>\mathbf{u} = u_1 \mathbf{e}_1 + u_2 \mathbf{e}_2 + u_3 \mathbf{e}_3
  = \sum_{i = 1}^3 u_i \mathbf{e}_i</math>

In Einstein notation, an index that is repeated twice in an equation implies a summation, and the summation symbol need not be included.

This allows a concise algebraic presentation of vector and tensor equations. For example,

<math>\mathbf{u} \cdot \mathbf{v} = \sum_{i = 1}^3 u_i \mathbf{e}_i \cdot
  \sum_{j = 1}^3 v_j \mathbf{e}_j = u_i \mathbf{e}_i \cdot v_j
  \mathbf{e}_j </math>

or equivalently:

<math>\mathbf{u} \cdot \mathbf{v}
 =  \sum_{i = 1}^3 \sum_{j = 1}^3 u_i v_j ( \mathbf{e}_i \cdot \mathbf{e}_j ) 
 =  u_i v_j ( \mathbf{e}_i \cdot \mathbf{e}_j ) </math>

where

<math> \mathbf{e}_i \cdot
  \mathbf{e}_j = \delta_{ij} </math>

and <math>\ \delta_{ij}</math> is the Kronecker delta, which is equal to 1 when i = j, and 0 otherwise. It logically follows that this allows one j in the equation to be converted to an i, or one i to be converted to a j. Then,

<math>\mathbf{u} \cdot \mathbf{v} = u_i v_j\delta_{ij}= u_i v_i = u_j v_j </math>

For the cross product,

<math> \mathbf{u} \times \mathbf{v}= \sum_{j = 1}^3 u_j \mathbf{e}_j \times
  \sum_{k = 1}^3 v_k \mathbf{e}_k = u_j \mathbf{e}_j \times v_k
  \mathbf{e}_k = u_j v_k (\mathbf{e}_j \times \mathbf{e}_k ) = \epsilon_{ijk} \mathbf{e}_i u_j v_k
  </math>

where <math> \mathbf{e}_j \times \mathbf{e}_k = \epsilon_{ijk} \mathbf{e}_i</math> and <math>\ \epsilon_{ijk}</math> is the Levi-Civita symbol defined by:

<math>\epsilon_{ijk} =

\left\{ \begin{matrix} +1 & \mbox{if } (i,j,k) \mbox{ is } (1,2,3), (2,3,1) \mbox{ or } (3,1,2)\\ -1 & \mbox{if } (i,j,k) \mbox{ is } (3,2,1), (1,3,2) \mbox{ or } (2,1,3)\\ 0 & \mbox{otherwise: }i=j \mbox{ or } j=k \mbox{ or } k=i \end{matrix} \right. </math>

which recovers

<math> \mathbf{u} \times \mathbf{v} = (u_2 v_3 - u_3 v_2) \mathbf{e}_1 + (u_3 v_1 - u_1 v_3) \mathbf{e}_2 + (u_1 v_2 - u_2 v_1) \mathbf{e}_3</math>

from

<math> \mathbf{u} \times \mathbf{v}= \epsilon_{ijk} \mathbf{e}_i u_j v_k = \sum_{i = 1}^3 \sum_{j = 1}^3 \sum_{k = 1}^3 \epsilon_{ijk} \mathbf{e}_i u_j v_k
  </math>.

Additionally, if <math> \mathbf{w} = \mathbf{u} \times \mathbf{v}</math>, then <math> \mathbf{w} = \epsilon_{ijk} \mathbf{e}_i u_j v_k </math> and <math>\ w_i = \epsilon_{ijk} u_j v_k </math>. This also highlights that when an index appears once on both sides of the equation, this implies a system of equations instead of a summation:

<math>

\begin{matrix} w_1 = \epsilon_{1jk} u_j v_k\\ w_2 = \epsilon_{2jk} u_j v_k\\ w_3 = \epsilon_{3jk} u_j v_k \end{matrix} </math>

Alternatively, this could be expressed as

<math>
\mathbf{u} \times \mathbf{v}= \mathbf{u} \cdot \epsilon \cdot \mathbf{v}

</math>

but, this isn't the notation Einstein used.

Abstract definitions

In the traditional usage, one has in mind a vector space V  with finite dimension n, and a specific basis of V. We can write the basis vectors as e1, e2, ..., en. Then if v is a vector in V, it has coordinates v1, ..., vn relative to this basis.

The basic rule is:

v = vi ei.

In this expression, it was assumed that the term on the right side was to be summed as i  goes from 1 to n, because the index i does not appear on both sides of the expression. (Or, using Einstein's convention, because the index i  appeared twice.)

The i is known as a dummy index since the result is not dependent on it; thus we could also write, for example:

v = vj ej.

An index that is not summed over is a free index and should be found in each term of the equation or formula.

In contexts where the index must appear once as a subscript and once as a superscript, the basis vectors ei retain subscripts but the coordinates become vi with superscripts. Then the basic rule is:

v = vi ei.

The value of the Einstein convention is that it applies to other vector spaces built from V  using the tensor product and duality. For example, <math>V\otimes V</math>, the tensor product of V  with itself, has a basis consisting of tensors of the form <math>\mathbf{e}_{ij} = \mathbf{e}_i \otimes \mathbf{e}_j</math>. Any tensor T in <math>V\otimes V</math> can be written as:

<math>\mathbf{T} = T^{ij}\mathbf{e}_{ij}</math>.

V*, the dual of V, has a basis e1, e2, ..., en which obeys the rule

<math>\mathbf{e}^i (\mathbf{e}_j) = \delta_{i}^j</math>.

Here δ is the Kronecker delta, so <math>\delta_{i}^j</math> is 1 if i =j  and 0 otherwise.

Examples

Einstein summation is clarified with the help of a few simple examples. Consider four-dimensional spacetime, where indices run from 0 to 3:

<math>a^\mu b_\mu = a^0 b_0 + a^1 b_1 + a^2 b_2 + a^3 b_3</math>
<math>a^{\mu\nu} b_\mu = a^{0\nu} b_0 + a^{1\nu} b_1 + a^{2\nu} b_2 + a^{3\nu} b_3.</math>

The above example is one of contraction, a common tensor operation. The tensor <math> a^{\mu\nu}b_{\alpha}</math> becomes a new tensor by summing over the first upper index and the lower index. Typically the resulting tensor is renamed with the contracted indices removed:

<math>s^{\nu} = a^{\mu\nu}b_{\mu}.</math>

For a familiar example, consider the dot product of two vectors a and b. The dot product is defined simply as summation over the indices of a and b:

<math>\mathbf{a}\cdot\mathbf{b} = a^{\alpha}b_{\alpha} = a^0 b_0 + a^1 b_1 + a^2 b_2 + a^3 b_3,</math>

which is our familiar formula for the vector dot product. Remember it is sometimes necessary to change the components of a in order to lower its index; however, this is not necessary in Euclidean space, or any space with a metric equal to its inverse metric (e.g., flat spacetime).

Miscellanea

In some fields, Einstein notation is referred to simply as index notation.

When an index is repeated three or more times, it means that there is a mistake somewhere.

See also

de:Einsteinsche Summenkonvention fr:Convention de sommation d'Einstein it:Notazione di Einstein he:הסכם הסכימה של איינשטיין nl:Einstein-sommatieconventie pl:Konwencja sumacyjna ru:Соглашение Эйнштейна sl:Einsteinov zapis zh:愛因斯坦記號