Recent from talks
Nothing was collected or created yet.
Minkowski space
View on Wikipedia
In physics, Minkowski space (or Minkowski spacetime) (/mɪŋˈkɔːfski, -ˈkɒf-/[1]) is the main mathematical description of spacetime in the absence of gravitation. It combines inertial space and time manifolds into a four-dimensional model.
The model helps show how a spacetime interval between any two events is independent of the inertial frame of reference in which they are recorded. Mathematician Hermann Minkowski developed it from the work of Hendrik Lorentz, Henri Poincaré, and others, and said it "was grown on experimental physical grounds".
Minkowski space is closely associated with Einstein's theories of special relativity and general relativity and is the most common mathematical structure by which special relativity is formalized. While the individual components in Euclidean space and time might differ due to length contraction and time dilation, in Minkowski spacetime, all frames of reference will agree on the total interval in spacetime between events.[nb 1] Minkowski space differs from four-dimensional Euclidean space insofar as it treats time differently from the three spatial dimensions.
In 3-dimensional Euclidean space, the isometry group (maps preserving the regular Euclidean distance) is the Euclidean group. It is generated by rotations, reflections and translations. When time is appended as a fourth dimension, the further transformations of translations in time and Lorentz boosts are added, and the group of all these transformations is called the Poincaré group. Minkowski's model follows special relativity, where motion causes time dilation changing the scale applied to the frame in motion and shifts the phase of light.
Minkowski space is a pseudo-Euclidean space equipped with an isotropic quadratic form called the spacetime interval or the Minkowski norm squared. An event in Minkowski space for which the spacetime interval is zero is on the null cone of the origin, called the light cone in Minkowski space. Using the polarization identity the quadratic form is converted to a symmetric bilinear form called the Minkowski inner product, though it is not a geometric inner product. Another misnomer is Minkowski metric,[2] but Minkowski space is not a metric space.
The group of transformations for Minkowski space that preserves the spacetime interval (as opposed to the spatial Euclidean distance) is the Lorentz group (as opposed to the Galilean group).
History
[edit]| Part of a series on |
| Spacetime |
|---|
Complex Minkowski spacetime
[edit]In his second relativity paper in 1905, Henri Poincaré showed[3] how, by taking time to be an imaginary fourth spacetime coordinate ict, where c is the speed of light and i is the imaginary unit, Lorentz transformations can be visualized as ordinary rotations of the four-dimensional Euclidean sphere. The four-dimensional spacetime can be visualized as a four-dimensional space, with each point representing an event in spacetime. The Lorentz transformations can then be thought of as rotations in this four-dimensional space, where the rotation axis corresponds to the direction of relative motion between the two observers and the rotation angle is related to their relative velocity.
To understand this concept, one should consider the coordinates of an event in spacetime represented as a four-vector (t, x, y, z). A Lorentz transformation is represented by a matrix that acts on the four-vector, changing its components. This matrix can be thought of as a rotation matrix in four-dimensional space, which rotates the four-vector around a particular axis.
Rotations in planes spanned by two space unit vectors appear in coordinate space as well as in physical spacetime as Euclidean rotations and are interpreted in the ordinary sense. The "rotation" in a plane spanned by a space unit vector and a time unit vector, while formally still a rotation in coordinate space, is a Lorentz boost in physical spacetime with real inertial coordinates. The analogy with Euclidean rotations is only partial since the radius of the sphere is actually imaginary, which turns rotations into rotations in hyperbolic space (see hyperbolic rotation).
This idea, which was mentioned only briefly by Poincaré, was elaborated by Minkowski in a paper in German published in 1908 called "The Fundamental Equations for Electromagnetic Processes in Moving Bodies".[4] He reformulated Maxwell equations as a symmetrical set of equations in the four variables (x, y, z, ict) combined with redefined vector variables for electromagnetic quantities, and he was able to show directly and very simply their invariance under Lorentz transformation. He also made other important contributions and used matrix notation for the first time in this context. From his reformulation, he concluded that time and space should be treated equally, and so arose his concept of events taking place in a unified four-dimensional spacetime continuum.
Real Minkowski spacetime
[edit]In a further development in his 1908 "Space and Time" lecture,[5] Minkowski gave an alternative formulation of this idea that used a real time coordinate instead of an imaginary one, representing the four variables (x, y, z, t) of space and time in the coordinate form in a four-dimensional real vector space. Points in this space correspond to events in spacetime. In this space, there is a defined light-cone associated with each point, and events not on the light cone are classified by their relation to the apex as spacelike or timelike. It is principally this view of spacetime that is current nowadays, although the older view involving imaginary time has also influenced special relativity.
In the English translation of Minkowski's paper, the Minkowski metric, as defined below, is referred to as the line element. The Minkowski inner product below appears unnamed when referring to orthogonality (which he calls normality) of certain vectors, and the Minkowski norm squared is referred to as "sum" (a word choice that might be attributable to language translation).
Minkowski's principal tool is the Minkowski diagram, and he uses it to define concepts and demonstrate properties of Lorentz transformations (e.g., proper time and length contraction) and to provide geometrical interpretation to the generalization of Newtonian mechanics to relativistic mechanics. For these special topics, see the referenced articles, as the presentation below will be principally confined to the mathematical structure (Minkowski metric and from it derived quantities and the Poincaré group as symmetry group of spacetime) following from the invariance of the spacetime interval on the spacetime manifold as consequences of the postulates of special relativity, not to specific application or derivation of the invariance of the spacetime interval. This structure provides the background setting of all present relativistic theories, barring general relativity for which flat Minkowski spacetime still provides a springboard as curved spacetime is locally Lorentzian.
Minkowski, aware of the fundamental restatement of the theory which he had made, said
The views of space and time which I wish to lay before you have sprung from the soil of experimental physics, and therein lies their strength. They are radical. Henceforth, space by itself and time by itself are doomed to fade away into mere shadows, and only a kind of union of the two will preserve an independent reality.
— Hermann Minkowski, 1908, 1909[5]
Though Minkowski took an important step for physics, Albert Einstein saw its limitation:
At a time when Minkowski was giving the geometrical interpretation of special relativity by extending the Euclidean three-space to a quasi-Euclidean four-space that included time, Einstein was already aware that this is not valid, because it excludes the phenomenon of gravitation. He was still far from the study of curvilinear coordinates and Riemannian geometry, and the heavy mathematical apparatus entailed.[6]
For further historical information see references Galison (1979), Corry (1997) and Walter (1999).
Causal structure
[edit]
Where v is velocity, x, y, and z are Cartesian coordinates in 3-dimensional space, c is the constant representing the universal speed limit, and t is time, the four-dimensional vector v = (ct, x, y, z) = (ct, r) is classified according to the sign of c2t2 − r2. A vector is timelike if c2t2 > r2, spacelike if c2t2 < r2, and null or lightlike if c2t2 = r2. This can be expressed in terms of the sign of η(v, v), also called scalar product, as well, which depends on the signature. The classification of any vector will be the same in all frames of reference that are related by a Lorentz transformation (but not by a general Poincaré transformation because the origin may then be displaced) because of the invariance of the spacetime interval under Lorentz transformation.
The set of all null vectors at an event[nb 2] of Minkowski space constitutes the light cone of that event. Given a timelike vector v, there is a worldline of constant velocity associated with it, represented by a straight line in a Minkowski diagram.
Once a direction of time is chosen,[nb 3] timelike and null vectors can be further decomposed into various classes. For timelike vectors, one has
- future-directed timelike vectors whose first component is positive (tip of vector located in causal future (also called the absolute future) in the figure) and
- past-directed timelike vectors whose first component is negative (causal past (also called the absolute past)).
Null vectors fall into three classes:
- the zero vector, whose components in any basis are (0, 0, 0, 0) (origin),
- future-directed null vectors whose first component is positive (upper light cone), and
- past-directed null vectors whose first component is negative (lower light cone).
Together with spacelike vectors, there are 6 classes in all.
An orthonormal basis for Minkowski space necessarily consists of one timelike and three spacelike unit vectors. If one wishes to work with non-orthonormal bases, it is possible to have other combinations of vectors. For example, one can easily construct a (non-orthonormal) basis consisting entirely of null vectors, called a null basis.
Vector fields are called timelike, spacelike, or null if the associated vectors are timelike, spacelike, or null at each point where the field is defined.
Properties of time-like vectors
[edit]Time-like vectors have special importance in the theory of relativity as they correspond to events that are accessible to the observer at (0, 0, 0, 0) with a speed less than that of light. Of most interest are time-like vectors that are similarly directed, i.e. all either in the forward or in the backward cones. Such vectors have several properties not shared by space-like vectors. These arise because both forward and backward cones are convex, whereas the space-like region is not convex.
Scalar product
[edit]The scalar product of two time-like vectors u1 = (t1, x1, y1, z1) and u2 = (t2, x2, y2, z2) is
Positivity of scalar product: An important property is that the scalar product of two similarly directed time-like vectors is always positive. This can be seen from the reversed Cauchy–Schwarz inequality below. It follows that if the scalar product of two vectors is zero, then one of these, at least, must be space-like. The scalar product of two space-like vectors can be positive or negative as can be seen by considering the product of two space-like vectors having orthogonal spatial components and times either of different or the same signs.
Using the positivity property of time-like vectors, it is easy to verify that a linear sum with positive coefficients of similarly directed time-like vectors is also similarly directed time-like (the sum remains within the light cone because of convexity).
Norm and reversed Cauchy inequality
[edit]The norm of a time-like vector u = (ct, x, y, z) is defined as
The reversed Cauchy inequality is another consequence of the convexity of either light cone.[7] For two distinct similarly directed time-like vectors u1 and u2 this inequality is or algebraically,
From this, the positive property of the scalar product can be seen.
Reversed triangle inequality
[edit]For two similarly directed time-like vectors u and w, the inequality is[8] where the equality holds when the vectors are linearly dependent.
The proof uses the algebraic definition with the reversed Cauchy inequality:[9]
The result now follows by taking the square root on both sides.
Mathematical structure
[edit]It is assumed below that spacetime is endowed with a coordinate system corresponding to an inertial frame. This provides an origin, which is necessary for spacetime to be modeled as a vector space. This addition is not required, and more complex treatments analogous to an affine space can remove the extra structure. However, this is not the introductory convention and is not covered here.
For an overview, Minkowski space is a 4-dimensional real vector space equipped with a non-degenerate, symmetric bilinear form on the tangent space at each point in spacetime, here simply called the Minkowski inner product, with metric signature either (+ − − −) or (− + + +). The tangent space at each event is a vector space of the same dimension as spacetime, 4.
Tangent vectors
[edit]
In practice, one need not be concerned with the tangent spaces. The vector space structure of Minkowski space allows for the canonical identification of vectors in tangent spaces at points (events) with vectors (points, events) in Minkowski space itself. See e.g. Lee (2003, Proposition 3.8.) or Lee (2012, Proposition 3.13.) These identifications are routinely done in mathematics. They can be expressed formally in Cartesian coordinates as[10] with basis vectors in the tangent spaces defined by
Here, p and q are any two events, and the second basis vector identification is referred to as parallel transport. The first identification is the canonical identification of vectors in the tangent space at any point with vectors in the space itself. The appearance of basis vectors in tangent spaces as first-order differential operators is due to this identification. It is motivated by the observation that a geometrical tangent vector can be associated in a one-to-one manner with a directional derivative operator on the set of smooth functions. This is promoted to a definition of tangent vectors in manifolds not necessarily being embedded in Rn. This definition of tangent vectors is not the only possible one, as ordinary n-tuples can be used as well.
A tangent vector at a point p may be defined, here specialized to Cartesian coordinates in Lorentz frames, as 4 × 1 column vectors v associated to each Lorentz frame related by Lorentz transformation Λ such that the vector v in a frame related to some frame by Λ transforms according to v → Λv. This is the same way in which the coordinates xμ transform. Explicitly,
This definition is equivalent to the definition given above under a canonical isomorphism.
For some purposes, it is desirable to identify tangent vectors at a point p with displacement vectors at p, which is, of course, admissible by essentially the same canonical identification.[11] The identifications of vectors referred to above in the mathematical setting can correspondingly be found in a more physical and explicitly geometrical setting in Misner, Thorne & Wheeler (1973). They offer various degrees of sophistication (and rigor) depending on which part of the material one chooses to read.
Metric signature
[edit]The metric signature refers to which sign the Minkowski inner product yields when given space (spacelike to be specific, defined further down) and time basis vectors (timelike) as arguments. Further discussion about this theoretically inconsequential but practically necessary choice for purposes of internal consistency and convenience is deferred to the hide box below. See also the page treating sign convention in Relativity.
In general, but with several exceptions, mathematicians and general relativists prefer spacelike vectors to yield a positive sign, (− + + +), while particle physicists tend to prefer timelike vectors to yield a positive sign, (+ − − −). Authors covering several areas of physics, e.g. Steven Weinberg and Landau and Lifshitz ((− + + +) and (+ − − −), respectively) stick to one choice regardless of topic. Arguments for the former convention include "continuity" from the Euclidean case corresponding to the non-relativistic limit c → ∞. Arguments for the latter include that minus signs, otherwise ubiquitous in particle physics, go away. Yet other authors, especially of introductory texts, e.g. Kleppner & Kolenkow (1978), do not choose a signature at all, but instead, opt to coordinatize spacetime such that the time coordinate (but not time itself!) is imaginary. This removes the need for the explicit introduction of a metric tensor (which may seem like an extra burden in an introductory course), and one need not be concerned with covariant vectors and contravariant vectors (or raising and lowering indices) to be described below. The inner product is instead effected by a straightforward extension of the dot product from over to This works in the flat spacetime of special relativity, but not in the curved spacetime of general relativity, see Misner, Thorne & Wheeler (1973, Box 2.1, "Farewell to i c t ") (who, by the way use (− + + +) ). MTW also argues that it hides the true indefinite nature of the metric and the true nature of Lorentz boosts, which are not rotations. It also needlessly complicates the use of tools of differential geometry that are otherwise immediately available and useful for geometrical description and calculation – even in the flat spacetime of special relativity, e.g. of the electromagnetic field.
Terminology
[edit]Mathematically associated with the bilinear form is a tensor of type (0,2) at each point in spacetime, called the Minkowski metric.[nb 4] The Minkowski metric, the bilinear form, and the Minkowski inner product are all the same object; it is a bilinear function that accepts two (contravariant) vectors and returns a real number. In coordinates, this is the 4×4 matrix representing the bilinear form.
For comparison, in general relativity, a Lorentzian manifold L is likewise equipped with a metric tensor g, which is a nondegenerate symmetric bilinear form on the tangent space TpL at each point p of L. In coordinates, it may be represented by a 4×4 matrix depending on spacetime position. Minkowski space is thus a comparatively simple special case of a Lorentzian manifold. Its metric tensor is in coordinates with the same symmetric matrix at every point of M, and its arguments can, per above, be taken as vectors in spacetime itself.
Introducing more terminology (but not more structure), Minkowski space is thus a pseudo-Euclidean space with total dimension n = 4 and signature (1, 3) or (3, 1). Elements of Minkowski space are called events. Minkowski space is often denoted R1,3 or R3,1 to emphasize the chosen signature, or just M. It is an example of a pseudo-Riemannian manifold.
Then mathematically, the metric is a bilinear form on an abstract four-dimensional real vector space V, that is, where η has signature (+, -, -, -), and signature is a coordinate-invariant property of η. The space of bilinear maps forms a vector space which can be identified with , and η may be equivalently viewed as an element of this space. By making a choice of orthonormal basis , can be identified with the space . The notation is meant to emphasize the fact that M and are not just vector spaces but have added structure. .
An interesting example of non-inertial coordinates for (part of) Minkowski spacetime is the Born coordinates. Another useful set of coordinates is the light-cone coordinates.
Pseudo-Euclidean metrics
[edit]The Minkowski inner product is not an inner product, since it has non-zero null vectors. Since it is not a definite bilinear form it is called indefinite.
The Minkowski metric η is the metric tensor of Minkowski space. It is a pseudo-Euclidean metric, or more generally, a constant pseudo-Riemannian metric in Cartesian coordinates. As such, it is a nondegenerate symmetric bilinear form, a type (0, 2) tensor. It accepts two arguments up, vp, vectors in TpM, p ∈ M, the tangent space at p in M. Due to the above-mentioned canonical identification of TpM with M itself, it accepts arguments u, v with both u and v in M.
As a notational convention, vectors v in M, called 4-vectors, are denoted in italics, and not, as is common in the Euclidean setting, with boldface v. The latter is generally reserved for the 3-vector part (to be introduced below) of a 4-vector.
The definition [12] yields an inner product-like structure on M, previously and also henceforth, called the Minkowski inner product, similar to the Euclidean inner product, but it describes a different geometry. It is also called the relativistic dot product. If the two arguments are the same, the resulting quantity will be called the Minkowski norm squared. The Minkowski inner product satisfies the following properties.
- Linearity in the first argument
- Symmetry
- Non-degeneracy
The first two conditions imply bilinearity.
The most important feature of the inner product and norm squared is that these are quantities unaffected by Lorentz transformations. In fact, it can be taken as the defining property of a Lorentz transformation in that it preserves the inner product (i.e. the value of the corresponding bilinear form on two vectors). This approach is taken more generally for all classical groups definable this way in classical group. There, the matrix Φ is identical in the case O(3, 1) (the Lorentz group) to the matrix η to be displayed below.
Orthogonality
[edit]
Minkowski space is constructed so that the speed of light will be the same constant regardless of the reference frame in which it is measured. This property results from the relation of the time axis to a space axis. Two events u and v are orthogonal when the bilinear form is zero for them: η(v, w) = 0.
When both u and v are both space-like, then they are perpendicular, but if one is time-like and the other space-like, then the relation is hyperbolic orthogonality. The relation is preserved in a change of reference frames and consequently the computation of light speed yields a constant result. The change of reference frame is called a Lorentz boost and in mathematics it is a hyperbolic rotation. Each reference frame is associated with a hyperbolic angle, which is zero for the rest frame in Minkowski space. Such a hyperbolic angle has been labelled rapidity since it is associated with the speed of the frame.
Minkowski metric
[edit]From the second postulate of special relativity, together with homogeneity of spacetime and isotropy of space, it follows that the spacetime interval between two arbitrary events called 1 and 2 is:[13] This quantity is not consistently named in the literature. The interval is sometimes referred to as the square root of the interval as defined here.[14][15]
The invariance of the interval under coordinate transformations between inertial frames follows from the invariance of provided the transformations are linear. This quadratic form can be used to define a bilinear form via the polarization identity. This bilinear form can in turn be written as where [η] is a matrix associated with η. While possibly confusing, it is common practice to denote [η] with just η. The matrix is read off from the explicit bilinear form as and the bilinear form with which this section started by assuming its existence, is now identified.
For definiteness and shorter presentation, the signature (− + + +) is adopted below. This choice (or the other possible choice) has no (known) physical implications. The symmetry group preserving the bilinear form with one choice of signature is isomorphic (under the map given here) with the symmetry group preserving the other choice of signature. This means that both choices are in accord with the two postulates of relativity. Switching between the two conventions is straightforward. If the metric tensor η has been used in a derivation, go back to the earliest point where it was used, substitute η for −η, and retrace forward to the desired formula with the desired metric signature.
Standard basis
[edit]A standard or orthonormal basis for Minkowski space is a set of four mutually orthogonal vectors {e0, e1, e2, e3} such that and for which when
These conditions can be written compactly in the form
Relative to a standard basis, the components of a vector v are written (v0, v1, v2, v3) where the Einstein notation is used to write v = vμ eμ. The component v0 is called the timelike component of v while the other three components are called the spatial components. The spatial components of a 4-vector v may be identified with a 3-vector v = (v1, v2, v3).
In terms of components, the Minkowski inner product between two vectors v and w is given by
and
Here lowering of an index with the metric was used.
There are many possible choices of standard basis obeying the condition Any two such bases are related in some sense by a Lorentz transformation, either by a change-of-basis matrix , a real 4 × 4 matrix satisfying or Λ, a linear map on the abstract vector space satisfying, for any pair of vectors u, v,
Then if two different bases exist, {e0, e1, e2, e3} and {e′0, e′1, e′2, e′3}, can be represented as or . While it might be tempting to think of and Λ as the same thing, mathematically, they are elements of different spaces, and act on the space of standard bases from different sides.
Raising and lowering of indices
[edit]
Technically, a non-degenerate bilinear form provides a map between a vector space and its dual; in this context, the map is between the tangent spaces of M and the cotangent spaces of M. At a point in M, the tangent and cotangent spaces are dual vector spaces (so the dimension of the cotangent space at an event is also 4). Just as an authentic inner product on a vector space with one argument fixed, by Riesz representation theorem, may be expressed as the action of a linear functional on the vector space, the same holds for the Minkowski inner product of Minkowski space.[17]
Thus if vμ are the components of a vector in tangent space, then ημν vμ = vν are the components of a vector in the cotangent space (a linear functional). Due to the identification of vectors in tangent spaces with vectors in M itself, this is mostly ignored, and vectors with lower indices are referred to as covariant vectors. In this latter interpretation, the covariant vectors are (almost always implicitly) identified with vectors (linear functionals) in the dual of Minkowski space. The ones with upper indices are contravariant vectors. In the same fashion, the inverse of the map from tangent to cotangent spaces, explicitly given by the inverse of η in matrix representation, can be used to define raising of an index. The components of this inverse are denoted ημν. It happens that ημν = ημν. These maps between a vector space and its dual can be denoted η♭ (eta-flat) and η♯ (eta-sharp) by the musical analogy.[18]
Contravariant and covariant vectors are geometrically very different objects. The first can and should be thought of as arrows. A linear function can be characterized by two objects: its kernel, which is a hyperplane passing through the origin, and its norm. Geometrically thus, covariant vectors should be viewed as a set of hyperplanes, with spacing depending on the norm (bigger = smaller spacing), with one of them (the kernel) passing through the origin. The mathematical term for a covariant vector is 1-covector or 1-form (though the latter is usually reserved for covector fields).
One quantum mechanical analogy explored in the literature is that of a de Broglie wave (scaled by a factor of Planck's reduced constant) associated with a momentum four-vector to illustrate how one could imagine a covariant version of a contravariant vector. The inner product of two contravariant vectors could equally well be thought of as the action of the covariant version of one of them on the contravariant version of the other. The inner product is then how many times the arrow pierces the planes.[16] The mathematical reference, Lee (2003), offers the same geometrical view of these objects (but mentions no piercing).
The electromagnetic field tensor is a differential 2-form, which geometrical description can as well be found in MTW.
One may, of course, ignore geometrical views altogether (as is the style in e.g. Weinberg (2002) and Landau & Lifshitz 2002) and proceed algebraically in a purely formal fashion. The time-proven robustness of the formalism itself, sometimes referred to as index gymnastics, ensures that moving vectors around and changing from contravariant to covariant vectors and vice versa (as well as higher order tensors) is mathematically sound. Incorrect expressions tend to reveal themselves quickly.
Coordinate free raising and lowering
[edit]Given a bilinear form the lowered version of a vector can be thought of as the partial evaluation of that is, there is an associated partial evaluation map
The lowered vector is then the dual map Note it does not matter which argument is partially evaluated due to the symmetry of
Non-degeneracy is then equivalent to injectivity of the partial evaluation map, or equivalently non-degeneracy indicates that the kernel of the map is trivial. In finite dimension, as is the case here, and noting that the dimension of a finite-dimensional space is equal to the dimension of the dual, this is enough to conclude the partial evaluation map is a linear isomorphism from to This then allows the definition of the inverse partial evaluation map, which allows the inverse metric to be defined as where the two different usages of can be told apart by the argument each is evaluated on. This can then be used to raise indices. If a coordinate basis is used, the metric η−1 is indeed the matrix inverse to η .
Formalism of the Minkowski metric
[edit]The present purpose is to show semi-rigorously how formally one may apply the Minkowski metric to two vectors and obtain a real number, i.e. to display the role of the differentials and how they disappear in a calculation. The setting is that of smooth manifold theory, and concepts such as convector fields and exterior derivatives are introduced.
A full-blown version of the Minkowski metric in coordinates as a tensor field on spacetime has the appearance
Explanation: The coordinate differentials are 1-form fields. They are defined as the exterior derivative of the coordinate functions xμ. These quantities evaluated at a point p provide a basis for the cotangent space at p. The tensor product (denoted by the symbol ⊗) yields a tensor field of type (0, 2), i.e. the type that expects two contravariant vectors as arguments. On the right-hand side, the symmetric product (denoted by the symbol ⊙ or by juxtaposition) has been taken. The equality holds since, by definition, the Minkowski metric is symmetric.[19] The notation on the far right is also sometimes used for the related, but different, line element. It is not a tensor. For elaboration on the differences and similarities, see Misner, Thorne & Wheeler (1973, Box 3.2 and section 13.2.)
Tangent vectors are, in this formalism, given in terms of a basis of differential operators of the first order, where p is an event. This operator applied to a function f gives the directional derivative of f at p in the direction of increasing xμ with xν, ν ≠ μ fixed. They provide a basis for the tangent space at p.
The exterior derivative df of a function f is a covector field, i.e. an assignment of a cotangent vector to each point p, by definition such that for each vector field X. A vector field is an assignment of a tangent vector to each point p. In coordinates X can be expanded at each point p in the basis given by the ∂/∂xν | p . Applying this with f = xμ, the coordinate function itself, and X = ∂/ ∂xν , called a coordinate vector field, one obtains
Since this relation holds at each point p, the dxμ|p provide a basis for the cotangent space at each p and the bases d xμ|p and ∂/∂xν |p are dual to each other, at each p. Furthermore, one has for general one-forms on a tangent space α, β and general tangent vectors a, b. (This can be taken as a definition, but may also be proved in a more general setting.)
Thus when the metric tensor is fed two vectors fields a, b, both expanded in terms of the basis coordinate vector fields, the result is where aμ, bν are the component functions of the vector fields. The above equation holds at each point p, and the relation may as well be interpreted as the Minkowski metric at p applied to two tangent vectors at p.
As mentioned, in a vector space, such as modeling the spacetime of special relativity, tangent vectors can be canonically identified with vectors in the space itself, and vice versa. This means that the tangent spaces at each point are canonically identified with each other and with the vector space itself. This explains how the right-hand side of the above equation can be employed directly, without regard to the spacetime point the metric is to be evaluated and from where (which tangent space) the vectors come from.
This situation changes in general relativity. There one has where now η → g(p), i.e., g is still a metric tensor but now depending on spacetime and is a solution of Einstein's field equations. Moreover, a, b must be tangent vectors at spacetime point p and can no longer be moved around freely.
Chronological and causality relations
[edit]Let x, y ∈ M. Here,
- x chronologically precedes y if y − x is future-directed timelike. This relation has the transitive property and so can be written x < y.
- x causally precedes y if y − x is future-directed null or future-directed timelike. It gives a partial ordering of spacetime and so can be written x ≤ y.
Suppose x ∈ M is timelike. Then the simultaneous hyperplane for x is {y : η(x, y) = 0}. Since this hyperplane varies as x varies, there is a relativity of simultaneity in Minkowski space.
Generalizations
[edit]A Lorentzian manifold is a generalization of Minkowski space in two ways. The total number of spacetime dimensions is not restricted to be 4 (2 or more) and a Lorentzian manifold need not be flat, i.e. it allows for curvature.
Complexified Minkowski space
[edit]Complexified Minkowski space is defined as Mc = M ⊕ iM.[20] Its real part is the Minkowski space of four-vectors, such as the four-velocity and the four-momentum, which are independent of the choice of orientation of the space. The imaginary part, on the other hand, may consist of four pseudovectors, such as angular velocity and magnetic moment, which change their direction with a change of orientation. A pseudoscalar i is introduced, which also changes sign with a change of orientation. Thus, elements of Mc are independent of the choice of the orientation.
The inner product-like structure on Mc is defined as u ⋅ v = η(u, v) for any u,v ∈ Mc. A relativistic pure spin of an electron or any half spin particle is described by ρ ∈ Mc as ρ = u + is, where u is the four-velocity of the particle, satisfying u2 = 1 and s is the 4D spin vector,[21] which is also the Pauli–Lubanski pseudovector satisfying s2 = −1 and u ⋅ s = 0.
Generalized Minkowski space
[edit]Minkowski space refers to a mathematical formulation in four dimensions. However, the mathematics can easily be extended or simplified to create an analogous generalized Minkowski space in any number of dimensions. If n ≥ 2, n-dimensional Minkowski space is a vector space of real dimension n on which there is a constant Minkowski metric of signature (n − 1, 1) or (1, n − 1). These generalizations are used in theories where spacetime is assumed to have more or less than 4 dimensions. String theory and M-theory are two examples where n > 4. In string theory there appear conformal field theories with 1 + 1 spacetime dimensions.
de Sitter space can be formulated as a submanifold of generalized Minkowski space as can the model spaces of hyperbolic geometry (see below).
Curvature
[edit]As a flat spacetime, the three spatial components of Minkowski spacetime always obey the Pythagorean Theorem. Minkowski space is a suitable basis for special relativity, a good description of physical systems over finite distances in systems without significant gravitation. However, in order to take gravity into account, physicists use the theory of general relativity, which is formulated in the mathematics of differential geometry of differential manifolds. When this geometry is used as a model of spacetime, it is known as curved spacetime.
Even in curved spacetime, Minkowski space is still a good description in an infinitesimal region surrounding any point (barring gravitational singularities).[nb 5] More abstractly, it can be said that in the presence of gravity spacetime is described by a curved 4-dimensional manifold for which the tangent space to any point is a 4-dimensional Minkowski space. Thus, the structure of Minkowski space is still essential in the description of general relativity.
Geometry
[edit]The meaning of the term geometry for the Minkowski space depends heavily on the context. Minkowski space is not endowed with Euclidean geometry, and not with any of the generalized Riemannian geometries with intrinsic curvature, those exposed by the model spaces in hyperbolic geometry (negative curvature) and the geometry modeled by the sphere (positive curvature). The reason is the indefiniteness of the Minkowski metric. Minkowski space is, in particular, not a metric space and not a Riemannian manifold with a Riemannian metric. However, Minkowski space contains submanifolds endowed with a Riemannian metric yielding hyperbolic geometry.
Model spaces of hyperbolic geometry of low dimension, say 2 or 3, cannot be isometrically embedded in Euclidean space with one more dimension, i.e. or respectively, with the Euclidean metric , preventing easy visualization.[nb 6][22] By comparison, model spaces with positive curvature are just spheres in Euclidean space of one higher dimension.[23] Hyperbolic spaces can be isometrically embedded in spaces of one more dimension when the embedding space is endowed with the Minkowski metric .
Define to be the upper sheet () of the hyperboloid in generalized Minkowski space of spacetime dimension This is one of the surfaces of transitivity of the generalized Lorentz group. The induced metric on this submanifold, the pullback of the Minkowski metric under inclusion, is a Riemannian metric. With this metric is a Riemannian manifold. It is one of the model spaces of Riemannian geometry, the hyperboloid model of hyperbolic space. It is a space of constant negative curvature .[24] The 1 in the upper index refers to an enumeration of the different model spaces of hyperbolic geometry, and the n for its dimension. A corresponds to the Poincaré disk model, while corresponds to the Poincaré half-space model of dimension
Preliminaries
[edit]In the definition above is the inclusion map and the superscript star denotes the pullback. The present purpose is to describe this and similar operations as a preparation for the actual demonstration that actually is a hyperbolic space.
| Behavior of tensors under inclusion, pullback of covariant tensors under general maps and pushforward of vectors under general maps |
|---|
|
Behavior of tensors under inclusion: Pullback of tensors under general maps: It is defined by where the subscript star denotes the pushforward of the map F, and X1, X2, ..., Xk are vectors in TpM. (This is in accord with what was detailed about the pullback of the inclusion map. In the general case here, one cannot proceed as simply because F∗X1 ≠ X1 in general.) The pushforward of vectors under general maps: Further unwinding the definitions, the pushforward F∗: TMp → TNF(p) of a vector field under a map F: M → N between manifolds is defined by where f is a function on N. When M = Rm, N= Rn the pushforward of F reduces to DF: Rm → Rn, the ordinary differential, which is given by the Jacobian matrix of partial derivatives of the component functions. The differential is the best linear approximation of a function F from Rm to Rn. The pushforward is the smooth manifold version of this. It acts between tangent spaces, and is in coordinates represented by the Jacobian matrix of the coordinate representation of the function. The corresponding pullback is the dual map from the dual of the range tangent space to the dual of the domain tangent space, i.e. it is a linear map, |
Hyperbolic stereographic projection
[edit]
In order to exhibit the metric, it is necessary to pull it back via a suitable parametrization. A parametrization of a submanifold S of a manifold M is a map U ⊂ Rm → M whose range is an open subset of S. If S has the same dimension as M, a parametrization is just the inverse of a coordinate map φ: M → U ⊂ Rm. The parametrization to be used is the inverse of hyperbolic stereographic projection. This is illustrated in the figure to the right for n = 2. It is instructive to compare to stereographic projection for spheres.
Stereographic projection σ: Hn
R → Rn and its inverse σ−1: Rn → Hn
R are given by
where, for simplicity, τ ≡ ct. The (τ, x) are coordinates on Mn+1 and the u are coordinates on Rn.
Let and let
If then it is geometrically clear that the vector intersects the hyperplane once in point denoted
One has or
By construction of stereographic projection one has
This leads to the system of equations
The first of these is solved for λ and one obtains for stereographic projection
Next, the inverse σ−1(u) = (τ, x) must be calculated. Use the same considerations as before, but now with one gets but now with λ depending on u. The condition for P lying in the hyperboloid is or leading to
With this λ, one obtains
Pulling back the metric
[edit]One has and the map
The pulled back metric can be obtained by straightforward methods of calculus;
One computes according to the standard rules for computing differentials (though one is really computing the rigorously defined exterior derivatives), and substitutes the results into the right hand side. This yields
| Detailed outline of computation |
|---|
|
One has and With this one may write from which Summing this formula one obtains Similarly, for τ one gets yielding Now add this contribution to finally get |
This last equation shows that the metric on the ball is identical to the Riemannian metric h2(n)
R in the Poincaré ball model, another standard model of hyperbolic geometry.
| Alternative calculation using the pushforward |
|---|
|
The pullback can be computed in a different fashion. By definition, In coordinates, One has from the formula for σ–1 Lastly, and the same conclusion is reached. |
See also
[edit]Remarks
[edit]- ^ This makes spacetime distance an invariant.
- ^ Translate the coordinate system so that the event is the new origin.
- ^ This corresponds to the time coordinate either increasing or decreasing when the proper time for any particle increases. An application of T flips this direction.
- ^ For comparison and motivation of terminology, take a Riemannian metric, which provides a positive definite symmetric bilinear form, i. e. an inner product proper at each point on a manifold.
- ^ This similarity between flat space and curved space at infinitesimally small distance scales is foundational to the definition of a manifold in general.
- ^ There is an isometric embedding into according to the Nash embedding theorem (Nash (1956)), but the embedding dimension is much higher, n = (m/2)(m + 1)(3m + 11) for a Riemannian manifold of dimension m.
Notes
[edit]- ^ "Minkowski" Archived 2019-06-22 at the Wayback Machine. Random House Webster's Unabridged Dictionary.
- ^ Lee 1997, p. 31
- ^ Poincaré 1905–1906, pp. 129–176 Wikisource translation: On the Dynamics of the Electron
- ^ Minkowski 1907–1908, pp. 53–111 *Wikisource translation: s:Translation:The Fundamental Equations for Electromagnetic Processes in Moving Bodies
- ^ a b Minkowski 1908–1909, pp. 75–88 Various English translations on Wikisource: "Space and Time"
- ^ Cornelius Lanczos (1972) "Einstein's Path from Special to General Relativity", pages 5–19 of General Relativity: Papers in Honour of J. L. Synge, L. O'Raifeartaigh editor, Clarendon Press, see page 11
- ^ See Schutz's proof p 148, also Naber p. 48
- ^ Schutz p. 148, Naber p. 49
- ^ Schutz p. 148
- ^ Lee 1997, p. 15
- ^ Lee 2003, See Lee's discussion on geometric tangent vectors early in chapter 3.
- ^ Giulini 2008 pp. 5, 6
- ^ Sean M. Carroll (2019). Spacetime and Geometry (illustrated, herdruk ed.). Cambridge University Press. p. 7. ISBN 978-1-108-48839-6.
- ^ Sard 1970, p. 71
- ^ Minkowski, Landau & Lifshitz 2002, p. 4
- ^ a b Misner, Thorne & Wheeler 1973
- ^ Lee 2003. One point in Lee's proof of the existence of this map needs modification (Lee deals with Riemannian metrics.). Where Lee refers to positive definiteness to show the injectivity of the map, one needs instead appeal to non-degeneracy.
- ^ Lee 2003, The tangent-cotangent isomorphism p. 282
- ^ Lee 2003
- ^ Y. Friedman, A Physically Meaningful Relativistic Description of the Spin State of an Electron, Symmetry 2021, 13(10), 1853; https://doi.org/10.3390/sym13101853 Archived 2023-08-13 at the Wayback Machine
- ^ Jackson, J.D., Classical Electrodynamics, 3rd ed.; John Wiley \& Sons: Hoboken, NJ, US, 1998
- ^ Lee 1997, p. 66
- ^ Lee 1997, p. 33
- ^ Lee 1997
References
[edit]- Corry, L. (1997). "Hermann Minkowski and the postulate of relativity". Arch. Hist. Exact Sci. 51 (4): 273–314. doi:10.1007/BF00518231. ISSN 0003-9519. S2CID 27016039.
- Catoni, F.; et al. (2008). The Mathematics of Minkowski Space-Time. Frontiers in Mathematics. Basel: Birkhäuser Verlag. doi:10.1007/978-3-7643-8614-6. ISBN 978-3-7643-8613-9. ISSN 1660-8046.
- Galison, P. L. (1979). R McCormach; et al. (eds.). Minkowski's Space–Time: from visual thinking to the absolute world. Historical Studies in the Physical Sciences. Vol. 10. Johns Hopkins University Press. pp. 85–121. doi:10.2307/27757388. JSTOR 27757388.
- Giulini D The rich structure of Minkowski space, https://arxiv.org/abs/0802.4345v1.
- Kleppner, D.; Kolenkow, R. J. (1978) [1973]. An Introduction to Mechanics. London: McGraw-Hill. ISBN 978-0-07-035048-9.
- Landau, L.D.; Lifshitz, E.M. (2002) [1939]. The Classical Theory of Fields. Course of Theoretical Physics. Vol. 2 (4th ed.). Butterworth–Heinemann. ISBN 0-7506-2768-9.
- Lee, J. M. (2003). Introduction to Smooth manifolds. Springer Graduate Texts in Mathematics. Vol. 218. ISBN 978-0-387-95448-6.
- Lee, J. M. (2012). Introduction to Smooth manifolds. Springer Graduate Texts in Mathematics. ISBN 978-1-4419-9981-8.
- Lee, J. M. (1997). Riemannian Manifolds – An Introduction to Curvature. Springer Graduate Texts in Mathematics. Vol. 176. New York · Berlin · Heidelberg: Springer Verlag. ISBN 978-0-387-98322-6.
- Minkowski, Hermann (1907–1908), [The Fundamental Equations for Electromagnetic Processes in Moving Bodies], Nachrichten von der Gesellschaft der Wissenschaften zu Göttingen, Mathematisch-Physikalische Klasse: 53–111
- Published translation: Carus, Edward H. (1918). "Space and Time". The Monist. 28 (288): 288–302. doi:10.5840/monist19182826.
- Wikisource translation: The Fundamental Equations for Electromagnetic Processes in Moving Bodies
- Minkowski, Hermann (1908–1909), [Space and Time], Physikalische Zeitschrift, 10: 75–88 Various English translations on Wikisource: Space and Time.
- Misner, Charles W.; Thorne, Kip. S.; Wheeler, John A. (1973), Gravitation, W. H. Freeman, ISBN 978-0-7167-0344-0.
- Naber, G. L. (1992). The Geometry of Minkowski Spacetime. New York: Springer-Verlag. ISBN 978-0-387-97848-2.
- Nash, J. (1956). "The Imbedding Problem for Riemannian Manifolds". Annals of Mathematics. 63 (1): 20–63. doi:10.2307/1969989. JSTOR 1969989. MR 0075639.
- Penrose, Roger (2005). "18 Minkowskian geometry". Road to Reality : A Complete Guide to the Laws of the Universe. Alfred A. Knopf. ISBN 9780679454434.
- Poincaré, Henri (1905–1906), "Sur la dynamique de l'électron" [On the Dynamics of the Electron], Rendiconti del Circolo Matematico di Palermo, 21: 129–176, Bibcode:1906RCMP...21..129P, doi:10.1007/BF03013466, hdl:2027/uiug.30112063899089, S2CID 120211823 Wikisource translation: On the Dynamics of the Electron
- Robb A A: Optical Geometry of Motion; a New View of the Theory of Relativity Cambridge 1911, (Heffers). http://www.archive.org/details/opticalgeometryoOOrobbrich.
- Robb A A: Geometry of Time and Space, 1936 Cambridge Univ Press http://www.archive.org/details/geometryoftimean032218mbp.
- Sard, R. D. (1970). Relativistic Mechanics - Special Relativity and Classical Particle Dynamics. New York: W. A. Benjamin. ISBN 978-0805384918.
- Shaw, R. (1982). "§ 6.6 Minkowski space, § 6.7,8 Canonical forms pp 221–242". Linear Algebra and Group Representations. Academic Press. ISBN 978-0-12-639201-2.
- Walter, Scott A. (1999). "Minkowski, Mathematicians, and the Mathematical Theory of Relativity". In Goenner, Hubert; et al. (eds.). The Expanding Worlds of General Relativity. Boston: Birkhäuser. pp. 45–86. ISBN 978-0-8176-4060-6.
- Weinberg, S. (2002), The Quantum Theory of Fields, vol. 1, Cambridge University Press, ISBN 978-0-521-55001-7.
External links
[edit]
Media related to Minkowski diagrams at Wikimedia Commons
- Animation clip on YouTube visualizing Minkowski space in the context of special relativity.
- The Geometry of Special Relativity: The Minkowski Space – Time Light Cone
- Minkowski space at PhilPapers
Minkowski space
View on GrokipediaIntroduction
Definition and Basic Concept
Minkowski space is a four-dimensional real vector space, denoted as , equipped with a bilinear form that has a signature of (1,3) or (3,1), providing the mathematical framework for modeling flat spacetime in special relativity.This structure distinguishes it from Euclidean space, where the metric is positive definite; instead, Minkowski space uses an indefinite metric, known as a pseudo-metric or pseudo-Riemannian metric, which allows for the incorporation of time as a dimension on equal footing with spatial ones while preserving the invariance of the speed of light.
As the affine space underlying special relativity, it represents the arena in which physical events occur, with points corresponding to spacetime coordinates and the pseudo-metric enabling the measurement of intervals between them.
The basic notation for a position four-vector in Minkowski space is , where is the speed of light, runs from 0 to 3, and the coordinates blend time (scaled by ) with the three spatial dimensions.
Named after the German mathematician Hermann Minkowski, who introduced this geometric interpretation of spacetime in his 1908 lecture "Space and Time," the space assumes familiarity with basic vector and tensor concepts but defines the pseudo-metric as the indefinite quadratic form that yields spacetime intervals of arbitrary sign.
This setup leads to a causal structure where light cones delineate the boundaries between future, past, and spacelike-separated events.
Role in Special Relativity
Minkowski space provides the geometric framework for special relativity by treating spacetime as a four-dimensional pseudo-Euclidean manifold where the spacetime interval between events is invariant under Lorentz transformations. The interval is given by the line element which remains unchanged regardless of the inertial frame chosen to measure the coordinates .[1] This invariance encodes the core principle that the laws of physics are the same in all inertial frames, unifying space and time into a single entity where separations are classified as timelike, spacelike, or null based on the sign of .[5] The physical interpretation of this interval is particularly evident in the concept of proper time for timelike paths, which measures the time elapsed along a worldline as experienced by an observer moving with the system. For such paths, the proper time satisfies , representing the invariant "length" of the timelike curve in spacetime.[1] This distinguishes Minkowski geometry from Newtonian physics, where time is absolute and simultaneity is frame-independent; in special relativity, the relativity of simultaneity arises because events simultaneous in one frame may not be in another, leading to phenomena like time dilation.[6] The geometry resolves apparent paradoxes, such as the twin paradox, where the traveling twin ages less due to their worldline having a shorter proper time compared to the stationary twin's straight path in spacetime diagrams.[4] Lorentz invariance of the metric is formalized by the condition that any Lorentz transformation matrix satisfies , where is the Minkowski metric tensor with signature .[7] This ensures that the inner product between four-vectors is preserved, maintaining the causal structure of spacetime. Light cones emerge as the boundaries defined by null intervals (), delineating possible causal influences between events.[1] In applications, Minkowski space underpins the formulation of relativistic electromagnetism through the antisymmetric field strength tensor , which transforms covariantly under Lorentz transformations to unify electric and magnetic fields into a single entity.[1] Similarly, particle trajectories are described as worldlines parameterized by proper time, enabling the relativistic mechanics of point particles via the action principle.[5][8]Historical Development
Origins in Special Relativity
In 1905, Albert Einstein published his seminal paper "On the Electrodynamics of Moving Bodies," which laid the foundation for special relativity by introducing two key postulates: the principle of relativity, stating that the laws of physics are the same in all inertial frames, and the constancy of the speed of light in vacuum, independent of the motion of the source.[9] These postulates resolved inconsistencies between Newtonian mechanics and Maxwell's equations for electromagnetism, but Einstein's initial formulation treated space and time as separate entities.[9] Building on Einstein's work, Hermann Minkowski, Einstein's former mathematics professor, developed a geometric interpretation of special relativity during his lectures in 1907 and 1908, reimagining spacetime as a unified four-dimensional manifold where space and time coordinates are interdependent.[2] In a landmark address titled "Space and Time" delivered at the 80th Assembly of German Natural Scientists and Physicians in Cologne on September 21, 1908, Minkowski articulated this vision, famously declaring: "Henceforth space by itself, and time by itself, are doomed to fade away into mere shadows, and only a kind of union of the two will preserve an independent reality."[2] This formulation transformed relativity from an algebraic theory into a geometric one, emphasizing the invariance of spacetime intervals. Minkowski's ideas quickly influenced the development of relativistic mechanics among prominent physicists. Max Planck, for instance, praised the four-dimensional approach as a turning point in relativity, incorporating it into his work on relativistic dynamics and energy-momentum relations by 1909.[10] Others, including Max von Laue and Arnold Sommerfeld, adopted the spacetime framework to reformulate mechanics, enabling more elegant treatments of relativistic effects like mass-energy equivalence.[11] Einstein initially resisted Minkowski's geometric spacetime, dismissing it in 1907 as "superfluous learnedness" and preferring his operational, non-geometric approach.[2] However, by 1916, as he finalized the field equations of general relativity, Einstein fully embraced the spacetime manifold, recognizing its indispensability for extending special relativity to curved geometries and gravitational phenomena.[12]Evolution to Real and Complex Forms
Following Hermann Minkowski's foundational 1908 lecture, the real form of Minkowski space underwent standardization in the 1910s and 1920s, particularly through the contributions of David Hilbert, who integrated it into his axiomatic framework for physics to emphasize the (3,1) signature suited to physical applications in special relativity.[13] Hilbert's work around 1915 highlighted the geometric structure of spacetime as a rigorous basis for relativistic mechanics, promoting its adoption beyond initial geometric interpretations. In parallel, Minkowski himself introduced the complex form of spacetime in 1908, employing imaginary quantities—such as substituting for time—to enable analytic continuation and symmetrize the quadratic form into a Euclidean-like expression , facilitating deeper mathematical analysis of Lorentz transformations.[14] This complexification laid groundwork for later applications, including Roger Penrose's twistor theory in the 1960s and computations in quantum field theory where complex contours aid in evaluating propagators.[15] Key developments accelerated its dissemination: Arnold Sommerfeld adopted and refined Minkowski's framework in his 1909 textbook on electrodynamics, streamlining calculations for relativistic effects and championing its practical utility among physicists.[16] By 1927, Paul Dirac incorporated Minkowski space into his relativistic quantum mechanics, formulating the Dirac equation within this four-dimensional arena to reconcile quantum rules with special relativity, predicting phenomena like electron spin. Signature conventions sparked debate in the early decades, with variations like (-,+,+,+) initially prevalent in relativity texts, but the (+,-,-,-) form gaining dominance in particle physics by the 1950s due to its alignment with positive energy eigenvalues for massive particles and convenience in quantum calculations.[17] This evolution culminated in modern usage, where real Minkowski space provides the flat background for quantum field theory, serving as the embedding arena for Feynman diagrams that visualize particle interactions and scattering amplitudes in perturbative expansions.[18]Mathematical Formulation
Vector Space Structure
Minkowski space is formally defined as a four-dimensional real vector space, denoted , equipped with the standard operations of vector addition and scalar multiplication.[17] For any two vectors and scalar , the vector addition is given by and scalar multiplication by .[17] This structure underlies the mathematical formulation introduced by Hermann Minkowski in his 1908 address, where space and time coordinates are unified into a single continuum.[2] As an affine space, it extends the vector space by incorporating points (events) that can be translated via affine transformations, allowing displacements between points to be represented as vectors in .[19] At each point in Minkowski space, the tangent space is isomorphic to , providing a local vector space of directions.[1] This isomorphism holds globally due to the flat geometry, enabling consistent identification of tangent vectors across the space.[19] A key example is the four-velocity vector , where are the coordinates of a worldline and is the proper time along it, representing the tangent vector to the trajectory of a particle.[1] Minkowski space possesses a differentiable structure, making it a smooth manifold diffeomorphic to , which admits tensor fields and supports calculus on its vector spaces.[19] Due to its flatness, parallel transport of vectors along any curve is path-independent and preserves the vector unchanged, in contrast to curved spacetimes where holonomy effects arise.[20] This trivial parallel transport simplifies computations and highlights the affine nature of the space.[20] The underlying vector space structure is endowed with a metric that renders it pseudo-Euclidean, but the bare affine properties suffice for its foundational formulation.[17]Metric Tensor and Signature
The metric tensor provides the pseudo-Riemannian structure to Minkowski space, distinguishing it from ordinary Euclidean space by introducing an indefinite inner product that encodes the distinction between timelike and spacelike separations. In a standard coordinate basis with coordinates (often setting ), the metric tensor is diagonal and given by , corresponding to the (3,1) signature where three eigenvalues are positive (spacelike) and one is negative (timelike). This form defines a bilinear form on vectors and as , which is symmetric but indefinite due to the single negative eigenvalue, allowing for both positive and negative squared lengths. An alternative convention, with (1,3) signature, flips the signs and is prevalent in particle physics literature for consistency with quantum field theory conventions where the energy-momentum relation appears naturally positive-definite for massive particles.[21][22] The line element, which measures infinitesimal spacetime intervals, is expressed as . In the (3,1) signature, this expands to , quantifying the invariant "distance" between events in a way that is preserved under Lorentz transformations. This pseudo-metric classifies Minkowski space as a flat pseudo-Euclidean space of dimension 4 with signature (3,1), where the geometry supports hyperbolic rather than elliptic structure due to the indefinite form. The isometry group of this space, consisting of transformations that preserve the metric, is the Poincaré group, which includes Lorentz transformations and translations, forming the symmetry group of special relativity. Historically, Hermann Minkowski introduced the metric in his 1908 address, using the form , which aligns with the (1,3) signature and emphasizes the positive sign for the time component to reflect proper time as real. Subsequent developments saw a divergence in conventions: while general relativity texts often adopted the (3,1) signature for alignment with positive-definite spatial distances, modern particle physics standardized on the (1,3) signature to facilitate calculations in quantum field theory, such as the Klein-Gordon equation where the d'Alembertian operator takes a natural form. This choice does not alter physical predictions, as it merely rescales signs consistently throughout derivations, but it influences notational preferences across subfields.[23][24]Coordinates and Basis Vectors
In Minkowski space, the standard coordinate system consists of inertial coordinates, typically denoted as , where is the time coordinate (often with ) and are the Cartesian spatial coordinates associated with an inertial observer. These coordinates parameterize points in the four-dimensional spacetime, enabling the representation of events and trajectories in a flat, pseudo-Euclidean geometry.[1] The corresponding basis vectors are the coordinate basis vectors , with serving as the timelike basis vector and (for ) as the spacelike ones. These form an orthonormal set with respect to the Minkowski metric, satisfying , where is the metric tensor in this signature. In a coordinate-free perspective, the metric is defined as a smooth, non-degenerate symmetric bilinear form on the tangent space at each point of the manifold , independent of any specific coordinate choice, allowing computations in abstract vector spaces without explicit coordinates.[25][19] Raising and lowering of indices is performed using the metric tensor: the covariant components of a vector are given by , where summation over repeated indices is implied (Einstein convention). For the timelike component, this yields , reflecting the indefinite signature. The standard basis properties include the orthogonality relations encoded in the metric and the use of the Kronecker delta for mixed-index expressions, such as in the identity tensor, which remains invariant under basis changes.[26][27] Under a change of coordinates , contravariant vectors transform linearly via the Jacobian matrix: This ensures that tensorial quantities, including those computed in the standard basis, maintain their geometric meaning across inertial frames related by Lorentz transformations.[1]Vector Properties and Inner Product
Timelike, Spacelike, and Null Vectors
In Minkowski space equipped with the metric tensor of signature , vectors are classified according to the sign of their squared norm . A nonzero vector is timelike if , spacelike if , and null (or lightlike) if . These classifications carry direct physical significance in special relativity. Timelike vectors represent possible worldlines of massive particles, along which the proper time interval is real and the coordinate speed remains subluminal (). Null vectors describe the propagation of light rays or massless particles, such as photons, traveling precisely at the speed of light . Spacelike vectors correspond to separations between events that are causally disconnected, meaning no signal or particle can travel between them without exceeding .[28][28][28] A canonical example is the four-velocity of a massive particle, which is always timelike and normalized such that . Null vectors form the boundaries of the light cones at each spacetime point, separating timelike from spacelike regions.[29] This vector classification is invariant under Lorentz transformations, as the metric tensor itself is preserved, ensuring the sign of is frame-independent. Due to the indefinite nature of the metric, orthogonality () behaves differently than in Euclidean spaces: notably, no two timelike vectors can be orthogonal, even though their norms share the same sign.[27]Lorentz Inner Product
The Lorentz inner product on Minkowski space , also known as the Minkowski inner product, is defined by the bilinear form , where is the Minkowski metric tensor with components in the standard basis and summation over repeated indices is implied.[5] In explicit component form using Cartesian-like coordinates (with ), this expands to where denotes the standard Euclidean dot product on the spatial components.[1] This inner product is symmetric, satisfying , and bilinear, meaning and similarly for the second argument, for scalars .[5] Unlike the Euclidean inner product on , which is positive definite, the Lorentz inner product is indefinite due to the mixed signature of the metric, allowing to take positive, negative, or zero values.[1] Consequently, the Cauchy-Schwarz inequality in its standard form does not hold generally, as the "norms" may be imaginary or undefined for certain vectors.[30] For the standard orthonormal basis vectors (with , , etc.), the inner product evaluates to , reproducing the metric components: and for spatial indices .[17] For timelike vectors where , the proper length (or proper time in units ) involves a sign flip: , with giving the invariant length along the worldline.[31] The sign of provides a basis for classifying vectors as timelike, spacelike, or null.[1]Norms and Inequalities
In Minkowski space equipped with the Lorentz metric of signature , the norm of a nonzero vector is defined as . For timelike vectors, where , the norm is the positive value , which represents the proper time interval associated with the displacement for future-directed timelike vectors.[32] The Lorentz inner product gives rise to inequalities that reverse the familiar Euclidean forms due to the indefinite metric signature. For two future-directed timelike vectors and , the reversed Cauchy-Schwarz inequality states with equality if and only if and are parallel (i.e., one is a positive scalar multiple of the other).[33] This reversal arises because the "angle" between such vectors satisfies , reflecting the hyperbolic geometry of timelike directions.[33] A sketch of the algebraic proof via signature proceeds as follows: Without loss of generality, normalize so that . Then . Since and are both future timelike, their difference cannot enter the opposite light cone; the assumption (i.e., ) would imply , but combined with the light cone structure, this leads to a contradiction unless the vectors align, confirming the bound.[34] The reversed triangle inequality follows directly for future-directed timelike vectors and : with equality if and only if and are parallel.[32] To see this, expand using the reversed Cauchy-Schwarz inequality (with ), and take square roots after multiplying by -1 (preserving the inequality direction since both sides are negative).[32] These inequalities have key applications in bounding proper times along worldlines in special relativity. For instance, the reversed triangle inequality implies that the proper time along a geodesic (straight timelike path) between two events is maximized compared to any broken path connecting them, providing the foundation for resolving the twin paradox: the inertial twin's proper time exceeds that of the accelerating twin, with equality only for collinear inertial segments.[35]Causal and Chronological Structure
Light Cones
In Minkowski space, the light cone associated with an event at point is defined as the locus of all points such that the four-vector is null, meaning its Minkowski inner product with itself vanishes: .[17] This surface divides the spacetime into regions accessible by signals traveling at or below the speed of light. The future light cone consists of those points where is null and the time component satisfies , while the past light cone includes points with . The structure of the light cone manifests as a conical surface in standard inertial coordinates. For an event at the origin with , the equation describing the cone is , where is the speed of light, representing the propagation of light rays along null directions from the apex.[2] This cone is generated by the family of null vectors emanating from , forming a double cone symmetric about the time axis in the mostly plus metric signature. From the perspective of the tangent space at the origin, the light cone is the set of all vectors satisfying , which corresponds to the null hypersurface in the four-dimensional vector space.[17] This local description highlights the cone's role as the boundary between timelike and spacelike separations. Physically, the light cone delineates the maximum speed at which signals or influences can propagate, set by , ensuring that no causal connection exceeds this limit in special relativity.[2] This structure underpins the causal boundaries in Minkowski spacetime, originating from the invariant nature of the speed of light.Causality Relations
In Minkowski space, the causal structure defines a partial order on events based on the possibility of signal transmission between them. An event causally precedes another event , denoted , if is a future-directed timelike or null vector, meaning lies in the future light cone of .[17] This relation captures the fundamental constraint of special relativity that no information or matter can propagate faster than light, ensuring that influences from can reach via timelike or null paths.[36] A vector is future-directed if it is timelike or null and has a positive time component in the standard inertial coordinates, corresponding to progression toward increasing time.[36] Conversely, the anticausal relation involves the past light cone, where causally precedes if is future-directed timelike or null. The light cones thus partition spacetime into causally connected regions, with events outside each other's light cones being spacelike separated and incomparable under the causal order.[17] The flat geometry of Minkowski space ensures global hyperbolicity, a strong causality condition that guarantees the existence of Cauchy surfaces—spacelike hypersurfaces that intersect every inextendible timelike curve exactly once—allowing for a well-posed initial value formulation of physical laws.[37] This property arises from the absence of curvature and horizons, preventing closed timelike curves and ensuring the causal order is acyclic and predictable. (Hawking and Ellis, 1973) The spacetime interval between two events classifies their separation and determines causal connectability. In the mostly-plus signature , the interval is given by where is the time difference and the spatial displacement (with in natural units). Events are timelike separated if , null if , and spacelike if ; causal precedence requires timelike or null separation with the appropriate future-directed orientation.[17]Chronology and Closed Timelike Curves
In Minkowski space, the chronological structure arises from the relation of chronological precedence, which defines a strict partial order on events. An event chronologically precedes an event (denoted ) if lies in the chronological future of , specifically the interior of the future light cone of . This means there exists a future-directed timelike curve connecting to , ensuring strict timelike separation with the interval vector satisfying and the time component positive in inertial coordinates.[38] This relation is irreflexive () and transitive, but not total, as spacelike-separated events are incomparable. The chronological future and past of an event partition the spacetime minus the light cone, enforcing a consistent timeline order without cycles in the standard flat metric. As part of the broader causal framework, chronological precedence excludes null separations, focusing solely on timelike paths that allow for proper time elapsed along observers' worldlines.[36] Minkowski space satisfies the chronology condition, meaning it contains no closed timelike curves (CTCs). A CTC would be a smooth, piecewise future-directed timelike curve with and everywhere, potentially enabling causal paradoxes like information loops. However, in flat spacetime, such curves cannot exist. To see this, consider inertial coordinates where the curve starts at . Any future-directed timelike 4-velocity satisfies (with ), with equality only in the null limit. Thus, the total time displacement is , where is the proper length. For closure, , leading to a contradiction unless , which is trivial. Thus, all nontrivial timelike curves have positive time displacement and cannot close.[36] A related argument uses the exponential map along timelike geodesics, which in Minkowski space is simply the affine translation for tangent vector with . For future-directed , the time coordinate component increases strictly with the affine parameter, ensuring geodesics extend indefinitely without looping back. Non-geodesic timelike curves, being longer in proper time than geodesics between endpoints, inherit this acyclicity.[36] In contrast to general relativity, where CTCs can emerge in nontrivial topologies like wormholes or rotating universes (e.g., Gödel metric), Minkowski space represents the trivial flat case devoid of horizons, compact regions, or identifications that could permit such violations. The absence of CTCs underscores the global hyperbolicity of Minkowski space, where Cauchy surfaces evolve uniquely forward and backward.[39] The Poincaré group, as the full symmetry group of Minkowski space, preserves the chronological structure. Lorentz transformations and translations map light cones to light cones and timelike vectors to timelike vectors of the same type (future to future), thereby maintaining the precedence relation and preventing any induced cycles.[19]Geometric Interpretations
Hyperbolic Geometry Embedding
Minkowski space provides a natural embedding for hyperbolic geometry through the hyperboloid model, where the upper sheet of a two-sheeted hyperboloid represents hyperbolic space . Specifically, in Minkowski space with metric signature , the embedding is given by the set of points satisfying and .[40] For , this models . The Minkowski metric induces a positive definite Riemannian metric on this hyperboloid, which is the standard hyperbolic metric of constant negative curvature . When restricted to the tangent space of the hyperboloid, the indefinite Lorentzian metric becomes definite and matches the geometry of hyperbolic space, allowing geodesics on the hyperboloid to correspond to hyperbolic geodesics.[40] Timelike vectors in Minkowski space can serve as normals to these spacelike hypersurfaces. The Lorentz group , which preserves the Minkowski inner product, acts as the group of isometries on the hyperboloid, transitively mapping any point to any other while preserving the induced hyperbolic structure.[41] The hyperbolic distance between two points on the hyperboloid is given by where is the inverse hyperbolic cosine, providing a direct measure of geodesic separation in the embedded space.[40] This embedding finds application in cosmology, where open Friedmann-Robertson-Walker (FRW) models with negative spatial curvature () feature hyperbolic spatial sections of constant curvature, realizable via the hyperboloid in Minkowski space despite the overall flatness of Minkowski spacetime itself.[42]Stereographic Projections
Stereographic projections provide a means to map the hyperboloid model of hyperbolic geometry, embedded in Minkowski space, onto a bounded region of Euclidean space, facilitating visualization and computation. In the context of Minkowski space with metric , the upper sheet of the hyperboloid represents the 3-dimensional hyperbolic space . The stereographic projection from this hyperboloid to is a perspective projection from the point on the lower sheet onto the hyperplane .[43] The explicit formula for the projected coordinates of a point on the hyperboloid is given by This maps the hyperboloid bijectively onto the open unit ball , known as the Poincaré ball model.[43] Under this projection, the hyperbolic metric on the hyperboloid pulls back to the Poincaré metric on the ball: where is the Euclidean metric.[43] This form arises from the conformal nature of the stereographic map, which preserves angles while distorting distances, ensuring that geodesics in the hyperboloid project to circular arcs orthogonal to the boundary sphere . A key advantage of this projection is its ability to represent the infinite extent of hyperbolic space within a finite Euclidean disk or ball, making it particularly useful for computational geometry, computer graphics, and studying isometries, as the boundary corresponds to the ideal points at infinity. The conformality ensures that local geometric properties like angles are directly transferable, though global distances require the scaling factor .Metric Pullbacks and Embeddings
In differential geometry, the Minkowski metric on the ambient spacetime can be pulled back to a submanifold via an immersion or embedding , yielding an induced metric on . This pullback operation defines the geometry on the submanifold by restricting the ambient Lorentzian structure to the tangent spaces of . Specifically, for tangent vectors , the induced metric is given by where is the differential of at .[44] A prominent example arises for timelike curves in Minkowski space, where the pullback metric measures proper time along the curve. For a timelike curve parametrized by an affine parameter , the proper time satisfies , where is the line element pulled back from the ambient metric. This yields the arc length functional for timelike paths as , which extremizes under geodesic motion.[45] For spacelike submanifolds, such as hypersurfaces in Minkowski space, the pullback induces a positive-definite Riemannian metric on the submanifold. A hypersurface is spacelike if the tangent space at each point is spacelike, meaning the restriction of to has signature , transforming the Lorentzian ambient geometry into a Euclidean-like structure suitable for defining spatial distances. This induced metric equips with a complete Riemannian geometry when is closed or properly embedded.[36] Embeddings of submanifolds into Minkowski space preserve the flat ambient metric while inducing the pullback structure, allowing Minkowski space itself to serve as a model for flat limits of pseudo-Riemannian geometries. Conversely, Minkowski space can be realized as a flat submanifold embedded in higher-dimensional pseudo-Euclidean spaces, where the pullback recovers the standard Lorentzian metric . For instance, embeds isometrically into via the identity, with the induced metric matching the original.[46]Generalizations
Complex Minkowski Space
Complex Minkowski space is constructed by complexifying the real four-dimensional Minkowski space, yielding the vector space endowed with the standard Lorentzian metric extended bilinearly to complex coordinates . This extension allows the metric to act on complex vectors via the sesquilinear form, preserving the signature (3,1) in a formal sense, though the complex structure introduces holomorphic properties not present in the real case. The space provides a natural arena for analytic continuations in relativistic theories, where real Minkowski spacetime embeds as a real subspace. A key feature is the Hermitian inner product defined on , given by where the overline denotes complex conjugation. This form is indefinite and Hermitian, meaning , and it facilitates the study of complex null cones and lightlike structures in a holomorphic framework. The complex metric enables the definition of holomorphic functions and vector fields, supporting advanced geometric constructions such as those in twistor theory.[47] One important property is the Wick rotation, which involves the analytic continuation of the time coordinate, transforming the Minkowski metric into the positive-definite Euclidean metric . This rotation maps the complex Minkowski space to complex Euclidean space, avoiding oscillatory integrals in Minkowski signature and ensuring convergence in path integral formulations. In quantum field theory (QFT), this technique underpins the Osterwalder-Schrader reconstruction theorem, allowing Euclidean path integrals to be analytically continued back to Lorentzian spacetime for physical correlation functions.[48] Applications of complex Minkowski space abound in QFT, where it enables the analytic continuation of Green's functions and scattering amplitudes from Euclidean to physical regions, resolving issues with unitarity and causality. In twistor theory, the space serves as the fundamental domain, with twistors corresponding to null lines (α-planes) equipped with the incidence relation satisfying , linking complex spacetime points to projective twistor space . This framework unifies massless fields and provides a holomorphic approach to quantization and scattering in gauge theories.[49]Higher Dimensions and Generalized Metrics
Minkowski space generalizes naturally to arbitrary dimensions beyond the standard four-dimensional case of special relativity, providing a flat spacetime model with one timelike dimension and spacelike dimensions, denoted as or depending on the metric signature convention. In this framework, the underlying manifold is the Cartesian product , equipped with a pseudo-Riemannian metric of Lorentzian signature, where the timelike direction distinguishes causal structure from purely spatial geometries. This generalization maintains the core principles of special relativity, such as the invariance of the speed of light, while allowing for mathematical and physical applications in diverse contexts.[50] The line element for the generalized Minkowski metric in dimensions takes the form where is the time coordinate, (with ) are the spatial coordinates, and is the Euclidean metric on . This metric induces an inner product on the tangent space that classifies vectors as timelike, spacelike, or null based on the sign of , generalizing the interval classification from lower dimensions. Specific examples illustrate its utility: in dimensions, the space simplifies to a two-dimensional plane with metric , often used to model basic relativistic phenomena like wave propagation or the Unruh effect without the complexities of higher spatial degrees of freedom.[50] In contrast, ten-dimensional Minkowski space with signature (9,1) serves as the flat background for superstring theory, where the extra dimensions accommodate the vibrational modes of fundamental strings before compactification to effective four-dimensional physics.[51] Generalized metrics extend the standard form through conformal rescalings, where the metric transforms as for a positive scalar function , preserving angles between vectors while altering lengths and areas.[52] Such transformations maintain the causal structure, as null geodesics (light cones) remain invariant under conformal mappings, ensuring that the light cone at each point retains its defining role in separating timelike from spacelike separations.[52] This property is crucial in applications like conformal field theories on Minkowski backgrounds, where the enlarged symmetry group acts non-singularly on the compactified space.[53] In higher dimensions, the causal relations—such as the ordering of events along timelike paths—generalize analogously, with light cones becoming higher-dimensional cones whose apex defines local causality, independent of the specific dimensionality .Extensions to Curved Spacetimes
In general relativity, Minkowski space is extended to curved spacetimes by modeling the universe as a four-dimensional pseudo-Riemannian manifold equipped with a Lorentzian metric tensor of signature (3,1), which generalizes the flat, constant Minkowski metric . This framework allows the geometry of spacetime to vary, reflecting the influence of mass and energy, while preserving the causal structure of special relativity on small scales. The line element becomes , where is a smooth, non-degenerate symmetric tensor field that is indefinite, enabling timelike, spacelike, and null intervals analogous to those in flat space.[54] The equivalence principle, a cornerstone of this extension, asserts that locally, in a freely falling reference frame, the laws of physics reduce to those of special relativity in Minkowski space, implying that gravitational effects can be transformed away over sufficiently small regions. This local flatness is mathematically realized through the tangent space at each point of the manifold, which is a vector space equipped with a metric isometric to the Minkowski metric, allowing coordinate charts where approximates to first order. Albert Einstein formalized this generalization in his 1916 paper, introducing the variable metric as the fundamental object encoding curvature, with the geodesic equation describing the paths of freely falling particles, where are the Christoffel symbols derived from . Curvature in this extended framework is quantified by the Riemann curvature tensor , which vanishes in Minkowski space but generally nonzero in curved spacetimes, measuring the failure of parallel transport to preserve vectors along closed loops. Solutions to Einstein's field equations , where is the Einstein tensor derived from the metric, yield specific curved metrics such as the Schwarzschild solution for spherically symmetric, non-rotating masses, which asymptotically approaches Minkowski space at large distances. This structure ensures that while global topology and curvature can lead to phenomena like black holes and gravitational waves, the local physics remains governed by Minkowski-like relations, bridging special and general relativity.[36]References
- https://en.wikisource.org/wiki/Translation:Space_and_Time