Hubbry Logo
Gauge theoryGauge theoryMain
Open search
Gauge theory
Community hub
Gauge theory
logo
8 pages, 0 posts
0 subscribers
Be the first to start a discussion here.
Be the first to start a discussion here.
Gauge theory
Gauge theory
from Wikipedia

In physics, a gauge theory is a type of field theory in which the Lagrangian, and hence the dynamics of the system itself, does not change under local transformations according to certain smooth families of operations (Lie groups). Formally, the Lagrangian is invariant under these transformations.

The term "gauge" refers to any specific mathematical formalism to regulate redundant degrees of freedom in the Lagrangian of a physical system. The transformations between possible gauges, called gauge transformations, form a Lie group—referred to as the symmetry group or the gauge group of the theory. Associated with any Lie group is the Lie algebra of group generators. For each group generator there necessarily arises a corresponding field (usually a vector field) called the gauge field. Gauge fields are included in the Lagrangian to ensure its invariance under the local group transformations (called gauge invariance). When such a theory is quantized, the quanta of the gauge fields are called gauge bosons. If the symmetry group is non-commutative, then the gauge theory is referred to as non-abelian gauge theory, the usual example being the Yang–Mills theory.

Many powerful theories in physics are described by Lagrangians that are invariant under some symmetry transformation groups. When they are invariant under a transformation identically performed at every point in the spacetime in which the physical processes occur, they are said to have a global symmetry. Local symmetry, the cornerstone of gauge theories, is a stronger constraint. In fact, a global symmetry is just a local symmetry whose group's parameters are fixed in spacetime (the same way a constant value can be understood as a function of a certain parameter, the output of which is always the same).

Gauge theories are important as the successful field theories explaining the dynamics of elementary particles. Quantum electrodynamics is an abelian gauge theory with the symmetry group U(1) and has one gauge field, the electromagnetic four-potential, with the photon being the gauge boson. The Standard Model is a non-abelian gauge theory with the symmetry group U(1) × SU(2) × SU(3) and has a total of twelve gauge bosons: the photon, three weak bosons and eight gluons.

Gauge theories are also important in explaining gravitation in the theory of general relativity. Its case is somewhat unusual in that the gauge field is a tensor, the Lanczos tensor. Theories of quantum gravity, beginning with gauge gravitation theory, also postulate the existence of a gauge boson known as the graviton. Gauge symmetries can be viewed as analogues of the principle of general covariance of general relativity in which the coordinate system can be chosen freely under arbitrary diffeomorphisms of spacetime. Both gauge invariance and diffeomorphism invariance reflect a redundancy in the description of the system. An alternative theory of gravitation, gauge theory gravity, replaces the principle of general covariance with a true gauge principle with new gauge fields.

Historically, these ideas were first stated in the context of classical electromagnetism and later in general relativity. However, the modern importance of gauge symmetries appeared first in the relativistic quantum mechanics of electrons – quantum electrodynamics, elaborated on below. Today, gauge theories are useful in condensed matter, nuclear and high energy physics among other subfields.

History

[edit]

The concept and the name of gauge theory derives from the work of Hermann Weyl in 1918.[1] Weyl, in an attempt to generalize the geometrical ideas of general relativity to include electromagnetism, conjectured that Eichinvarianz or invariance under the change of scale (or "gauge") might also be a local symmetry of general relativity. After the development of quantum mechanics, Weyl, Vladimir Fock[2] and Fritz London replaced the simple scale factor with a complex quantity and turned the scale transformation into a change of phase, which is a U(1) gauge symmetry. This explained the electromagnetic field effect on the wave function of a charged quantum mechanical particle. Weyl's 1929 paper introduced the modern concept of gauge invariance[3] subsequently popularized by Wolfgang Pauli in his 1941 review.[4] In retrospect, James Clerk Maxwell's formulation, in 1864–65, of electrodynamics in "A Dynamical Theory of the Electromagnetic Field" suggested the possibility of invariance, when he stated that any vector field whose curl vanishes—and can therefore normally be written as a gradient of a function—could be added to the vector potential without affecting the magnetic field. Similarly unnoticed, David Hilbert had derived the Einstein field equations by postulating the invariance of the action under a general coordinate transformation. The importance of these symmetry invariances remained unnoticed until Weyl's work.

Inspired by Pauli's descriptions of connection between charge conservation and field theory driven by invariance, Chen Ning Yang sought a field theory for atomic nuclei binding based on conservation of nuclear isospin.[5]: 202  In 1954, Yang and Robert Mills generalized the gauge invariance of electromagnetism, constructing a theory based on the action of the (non-abelian) SU(2) symmetry group on the isospin doublet of protons and neutrons.[6] This is similar to the action of the U(1) group on the spinor fields of quantum electrodynamics.

The Yang–Mills theory became the prototype theory to resolve some of the confusion in elementary particle physics. This idea later found application in the quantum field theory of the weak force, and its unification with electromagnetism in the electroweak theory. Gauge theories became even more attractive when it was realized that non-abelian gauge theories reproduced a feature called asymptotic freedom. Asymptotic freedom was believed to be an important characteristic of strong interactions. This motivated searching for a strong force gauge theory. This theory, now known as quantum chromodynamics, is a gauge theory with the action of the SU(3) group on the color triplet of quarks. The Standard Model unifies the description of electromagnetism, weak interactions and strong interactions in the language of gauge theory.

In the 1970s, Michael Atiyah began studying the mathematics of solutions to the classical Yang–Mills equations. In 1983, Atiyah's student Simon Donaldson built on this work to show that the differentiable classification of smooth 4-manifolds is very different from their classification up to homeomorphism.[7] Michael Freedman used Donaldson's work to exhibit exotic R4s, that is, exotic differentiable structures on Euclidean 4-dimensional space. This led to an increasing interest in gauge theory for its own sake, independent of its successes in fundamental physics. In 1994, Edward Witten and Nathan Seiberg invented gauge-theoretic techniques based on supersymmetry that enabled the calculation of certain topological invariants[8][9] (the Seiberg–Witten invariants). These contributions to mathematics from gauge theory have led to a renewed interest in this area.

The importance of gauge theories in physics is exemplified in the success of the mathematical formalism in providing a unified framework to describe the quantum field theories of electromagnetism, the weak force and the strong force. This theory, known as the Standard Model, accurately describes experimental predictions regarding three of the four fundamental forces of nature, and is a gauge theory with the gauge group SU(3) × SU(2) × U(1). Modern theories like string theory, as well as general relativity, are, in one way or another, gauge theories.

See Jackson and Okun[10] for early history of gauge and Pickering[11] for more about the history of gauge and quantum field theories.

Description

[edit]

Global and local symmetries

[edit]

Global symmetry

[edit]

In physics, the mathematical description of any physical situation usually contains excess degrees of freedom; the same physical situation is equally well described by many equivalent mathematical configurations. For instance, in Newtonian dynamics, if two configurations are related by a Galilean transformation (an inertial change of reference frame) they represent the same physical situation. These transformations form a group of "symmetries" (in mathematical terms, "automorphisms") of the theory, and a physical situation corresponds not to an individual mathematical configuration but to a class of configurations related to one another by this symmetry group (i.e. an orbit of configurations under the action of the automorphism group).

This idea can be generalized to include local as well as global symmetries, analogous to much more abstract "changes of coordinates" in a situation where there is no preferred "inertial" coordinate system that covers the entire physical system. A gauge theory is a mathematical model that has symmetries of this kind, together with a set of techniques for making physical predictions consistent with the symmetries of the model.

Example of global symmetry

[edit]

When a quantity occurring in the mathematical configuration is not just a number but has some geometrical significance, such as a velocity or an axis of rotation, its representation as numbers arranged in a vector or matrix is also changed by a coordinate transformation. For instance, if one description of a pattern of fluid flow states that the fluid velocity in the neighborhood of (x = 1, y = 0) is 1 m/s in the positive x direction, then a description of the same situation in which the coordinate system has been rotated clockwise by 90 degrees states that the fluid velocity in the neighborhood of (x = 0, y = −1) is 1 m/s in the negative y direction. The coordinate transformation has affected both the coordinate system used to identify the location of the measurement and the basis in which its value is expressed. As long as this transformation is performed globally (affecting the coordinate basis in the same way at every point), the effect on values that represent the rate of change of some quantity along some path in space and time as it passes through point P is the same as the effect on values that are truly local to P.

Local symmetry

[edit]
Use of fiber bundles to describe local symmetries
[edit]

In order to adequately describe physical situations in more complex theories, it is often necessary to introduce a "coordinate basis" for some of the objects of the theory that do not have this simple relationship to the coordinates used to label points in space and time. (In mathematical terms, the theory involves a fiber bundle in which the fiber at each point of the base space consists of possible coordinate bases for use when describing the values of objects at that point.) In order to spell out a mathematical configuration, one must choose a particular coordinate basis at each point (a local section of the fiber bundle) and express the values of the objects of the theory (usually "fields" in the physicist's sense) using this basis. Two such mathematical configurations are equivalent (describe the same physical situation) if they are related by a transformation of this abstract coordinate basis (a change of local section, or gauge transformation).

In most gauge theories, the set of possible transformations of the abstract gauge basis at an individual point in space and time is a finite-dimensional Lie group. The simplest such group is U(1), which appears in the modern formulation of quantum electrodynamics (QED) via its use of complex numbers. QED is generally regarded as the first, and simplest, physical gauge theory. The set of possible gauge transformations of the entire configuration of a given gauge theory also forms a group, the gauge group of the theory. An element of the gauge group can be parameterized by a smoothly varying function from the points of spacetime to the (finite-dimensional) Lie group, such that the value of the function and its derivatives at each point represents the action of the gauge transformation on the fiber over that point.

A gauge transformation with constant parameter at every point in space and time is analogous to a rigid rotation of the geometric coordinate system; it represents a global symmetry of the gauge representation. As in the case of a rigid rotation, this gauge transformation affects expressions that represent the rate of change along a path of some gauge-dependent quantity in the same way as those that represent a truly local quantity. A gauge transformation whose parameter is not a constant function is referred to as a local symmetry; its effect on expressions that involve a derivative is qualitatively different from that on expressions that do not. (This is analogous to a non-inertial change of reference frame, which can produce a Coriolis effect.)

Gauge fields

[edit]

The "gauge covariant" version of a gauge theory accounts for this effect by introducing a gauge field (in mathematical language, an Ehresmann connection) and formulating all rates of change in terms of the covariant derivative with respect to this connection. The gauge field becomes an essential part of the description of a mathematical configuration. A configuration in which the gauge field can be eliminated by a gauge transformation has the property that its field strength (in mathematical language, its curvature) is zero everywhere; a gauge theory is not limited to these configurations. In other words, the distinguishing characteristic of a gauge theory is that the gauge field does not merely compensate for a poor choice of coordinate system; there is generally no gauge transformation that makes the gauge field vanish.

When analyzing the dynamics of a gauge theory, the gauge field must be treated as a dynamical variable, similar to other objects in the description of a physical situation. In addition to its interaction with other objects via the covariant derivative, the gauge field typically contributes energy in the form of a "self-energy" term. One can obtain the equations for the gauge theory by:

  • starting from a naïve ansatz without the gauge field (in which the derivatives appear in a "bare" form);
  • listing those global symmetries of the theory that can be characterized by a continuous parameter (generally an abstract equivalent of a rotation angle);
  • computing the correction terms that result from allowing the symmetry parameter to vary from place to place; and
  • reinterpreting these correction terms as couplings to one or more gauge fields, and giving these fields appropriate self-energy terms and dynamical behavior.

This is the sense in which a gauge theory "extends" a global symmetry to a local symmetry, and closely resembles the historical development of the gauge theory of gravity known as general relativity.

Physical experiments

[edit]

Gauge theories used to model the results of physical experiments engage in:

  • limiting the universe of possible configurations to those consistent with the information used to set up the experiment, and then
  • computing the probability distribution of the possible outcomes that the experiment is designed to measure.

We cannot express the mathematical descriptions of the "setup information" and the "possible measurement outcomes", or the "boundary conditions" of the experiment, without reference to a particular coordinate system, including a choice of gauge. One assumes an adequate experiment isolated from "external" influence that is itself a gauge-dependent statement. Mishandling gauge dependence calculations in boundary conditions is a frequent source of anomalies, and approaches to anomaly avoidance classifies gauge theories[clarification needed].

Continuum theories

[edit]

The two gauge theories mentioned above, continuum electrodynamics and general relativity, are continuum field theories. The techniques of calculation in a continuum theory implicitly assume that:

  • given a completely fixed choice of gauge, the boundary conditions of an individual configuration are completely described
  • given a completely fixed gauge and a complete set of boundary conditions, the least action determines a unique mathematical configuration and therefore a unique physical situation consistent with these bounds
  • fixing the gauge introduces no anomalies in the calculation, due either to gauge dependence in describing partial information about boundary conditions or to incompleteness of the theory.

Determination of the likelihood of possible measurement outcomes proceed by:

  • establishing a probability distribution over all physical situations determined by boundary conditions consistent with the setup information
  • establishing a probability distribution of measurement outcomes for each possible physical situation
  • convolving these two probability distributions to get a distribution of possible measurement outcomes consistent with the setup information

These assumptions have enough validity across a wide range of energy scales and experimental conditions to allow these theories to make accurate predictions about almost all of the phenomena encountered in daily life: light, heat, and electricity, eclipses, spaceflight, etc. They fail only at the smallest and largest scales due to omissions in the theories themselves, and when the mathematical techniques themselves break down, most notably in the case of turbulence and other chaotic phenomena.

Quantum field theories

[edit]

Other than these classical continuum field theories, the most widely known gauge theories are quantum field theories, including quantum electrodynamics and the Standard Model of elementary particle physics. The starting point of a quantum field theory is much like that of its continuum analog: a gauge-covariant action integral that characterizes "allowable" physical situations according to the principle of least action. However, continuum and quantum theories differ significantly in how they handle the excess degrees of freedom represented by gauge transformations. Continuum theories, and most pedagogical treatments of the simplest quantum field theories, use a gauge fixing prescription to reduce the orbit of mathematical configurations that represent a given physical situation to a smaller orbit related by a smaller gauge group (the global symmetry group, or perhaps even the trivial group).

More sophisticated quantum field theories, in particular those that involve a non-abelian gauge group, break the gauge symmetry within the techniques of perturbation theory by introducing additional fields (the Faddeev–Popov ghosts) and counterterms motivated by anomaly cancellation, in an approach known as BRST quantization. While these concerns are in one sense highly technical, they are also closely related to the nature of measurement, the limits on knowledge of a physical situation, and the interactions between incompletely specified experimental conditions and incompletely understood physical theory.[citation needed] The mathematical techniques that have been developed in order to make gauge theories tractable have found many other applications, from solid-state physics and crystallography to low-dimensional topology.

Classical gauge theory

[edit]

Classical electromagnetism

[edit]

In electrostatics, one can either discuss the electric field, E, or its corresponding electric potential, V. Knowledge of one makes it possible to find the other, except that potentials differing by a constant, , correspond to the same electric field. This is because the electric field relates to changes in the potential from one point in space to another, and the constant C would cancel out when subtracting to find the change in potential. In terms of vector calculus, the electric field is the gradient of the potential, . Generalizing from static electricity to electromagnetism, we have a second potential, the vector potential A, with

The general gauge transformations now become not just but

where f is any twice continuously differentiable function that depends on position and time. The electromagnetic fields remain the same under the gauge transformation.

Example: scalar O(n) gauge theory

[edit]
The remainder of this section requires some familiarity with classical or quantum field theory, and the use of Lagrangians.
Definitions in this section: gauge group, gauge field, interaction Lagrangian, gauge boson.

The following illustrates how local gauge invariance can be "motivated" heuristically starting from global symmetry properties, and how it leads to an interaction between originally non-interacting fields.

Consider a set of non-interacting real scalar fields, with equal masses m. This system is described by an action that is the sum of the (usual) action for each scalar field

The Lagrangian (density) can be compactly written as

by introducing a vector of fields

The term is the partial derivative of along dimension .

It is now transparent that the Lagrangian is invariant under the transformation

whenever G is a constant matrix belonging to the n-by-n orthogonal group O(n). This is seen to preserve the Lagrangian, since the derivative of transforms identically to and both quantities appear inside dot products in the Lagrangian (orthogonal transformations preserve the dot product).

This characterizes the global symmetry of this particular Lagrangian, and the symmetry group is often called the gauge group; the mathematical term is structure group, especially in the theory of G-structures. Incidentally, Noether's theorem implies that invariance under this group of transformations leads to the conservation of the currents

where the Ta matrices are generators of the SO(n) group. There is one conserved current for every generator.

Now, demanding that this Lagrangian should have local O(n)-invariance requires that the G matrices (which were earlier constant) should be allowed to become functions of the spacetime coordinates x.

In this case, the G matrices do not "pass through" the derivatives, when G = G(x),

The failure of the derivative to commute with "G" introduces an additional term (in keeping with the product rule), which spoils the invariance of the Lagrangian. In order to rectify this we define a new derivative operator such that the derivative of again transforms identically with

This new "derivative" is called a (gauge) covariant derivative and takes the form

where g is called the coupling constant; a quantity defining the strength of an interaction. After a simple calculation we can see that the gauge field A(x) must transform as follows

The gauge field is an element of the Lie algebra, and can therefore be expanded as

There are therefore as many gauge fields as there are generators of the Lie algebra.

Finally, we now have a locally gauge invariant Lagrangian

Pauli uses the term gauge transformation of the first type to mean the transformation of , while the compensating transformation in is called a gauge transformation of the second type.

Feynman diagram of scalar bosons interacting via a gauge boson

The difference between this Lagrangian and the original globally gauge-invariant Lagrangian is seen to be the interaction Lagrangian

This term introduces interactions between the n scalar fields just as a consequence of the demand for local gauge invariance. However, to make this interaction physical and not completely arbitrary, the mediator A(x) needs to propagate in space. That is dealt with in the next section by adding yet another term, , to the Lagrangian. In the quantized version of the obtained classical field theory, the quanta of the gauge field A(x) are called gauge bosons. The interpretation of the interaction Lagrangian in quantum field theory is of scalar bosons interacting by the exchange of these gauge bosons.

Yang–Mills Lagrangian for the gauge field

[edit]

The picture of a classical gauge theory developed in the previous section is almost complete, except for the fact that to define the covariant derivatives D, one needs to know the value of the gauge field at all spacetime points. Instead of manually specifying the values of this field, it can be given as the solution to a field equation. Further requiring that the Lagrangian that generates this field equation is locally gauge invariant as well, one possible form for the gauge field Lagrangian is

where the are obtained from potentials , being the components of , by

and the are the structure constants of the Lie algebra of the generators of the gauge group. This formulation of the Lagrangian is called a Yang–Mills action. Other gauge invariant actions also exist (e.g., nonlinear electrodynamics, Born–Infeld action, Chern–Simons model, theta term, etc.).

In this Lagrangian term there is no field whose transformation counterweighs the one of . Invariance of this term under gauge transformations is a particular case of a priori classical (geometrical) symmetry. This symmetry must be restricted in order to perform quantization, the procedure being denominated gauge fixing, but even after restriction, gauge transformations may be possible.[12]

The complete Lagrangian for the gauge theory is now

Example: electrodynamics

[edit]

As a simple application of the formalism developed in the previous sections, consider the case of electrodynamics, with only the electron field. The bare-bones action that generates the electron field's Dirac equation is

The global symmetry for this system is

The gauge group here is U(1), just rotations of the phase angle of the field, with the particular rotation determined by the constant θ.

"Localising" this symmetry implies the replacement of θ by θ(x). An appropriate covariant derivative is then

Identifying the "charge" e (not to be confused with the mathematical constant e in the symmetry description) with the usual electric charge (this is the origin of the usage of the term in gauge theories), and the gauge field A(x) with the four-vector potential of the electromagnetic field results in an interaction Lagrangian

where is the electric current four vector in the Dirac field. The gauge principle is therefore seen to naturally introduce the so-called minimal coupling of the electromagnetic field to the electron field.

Adding a Lagrangian for the gauge field in terms of the field strength tensor exactly as in electrodynamics, one obtains the Lagrangian used as the starting point in quantum electrodynamics.

Mathematical formalism

[edit]

Gauge theories are usually discussed in the language of differential geometry. Mathematically, a gauge is just a choice of a (local) section of some principal bundle. A gauge transformation is just a transformation between two such sections.

Although gauge theory is dominated by the study of connections (primarily because it's mainly studied by high-energy physicists), the idea of a connection is not central to gauge theory in general. In fact, a result in general gauge theory shows that affine representations (i.e., affine modules) of the gauge transformations can be classified as sections of a jet bundle satisfying certain properties. There are representations that transform covariantly pointwise (called by physicists gauge transformations of the first kind), representations that transform as a connection form (called by physicists gauge transformations of the second kind, an affine representation)—and other more general representations, such as the B field in BF theory. There are more general nonlinear representations (realizations), but these are extremely complicated. Still, nonlinear sigma models transform nonlinearly, so there are applications.

If there is a principal bundle P whose base space is space or spacetime and structure group is a Lie group, then the sections of P form a principal homogeneous space of the group of gauge transformations.

Connections (gauge connection) define this principal bundle, yielding a covariant derivative ∇ in each associated vector bundle. If a local frame is chosen (a local basis of sections), then this covariant derivative is represented by the connection form A, a Lie algebra-valued 1-form, which is called the gauge potential in physics. This is evidently not an intrinsic but a frame-dependent quantity. The curvature form F, a Lie algebra-valued 2-form that is an intrinsic quantity, is constructed from a connection form by

where d stands for the exterior derivative and stands for the wedge product. ( is an element of the vector space spanned by the generators , and so the components of do not commute with one another. Hence the wedge product does not vanish.)

Infinitesimal gauge transformations form a Lie algebra, which is characterized by a smooth Lie-algebra-valued scalar, ε. Under such an infinitesimal gauge transformation,

where is the Lie bracket.

One nice thing is that if , then where D is the covariant derivative

Also, , which means transforms covariantly.

Not all gauge transformations can be generated by infinitesimal gauge transformations in general. An example is when the base manifold is a compact manifold without boundary such that the homotopy class of mappings from that manifold to the Lie group is nontrivial. See instanton for an example.

The Yang–Mills action is now given by

where is the Hodge star operator and the integral is defined as in differential geometry.

A quantity which is gauge-invariant (i.e., invariant under gauge transformations) is the Wilson loop, which is defined over any closed path, γ, as follows:

where χ is the character of a complex representation ρ and represents the path-ordered operator.

The formalism of gauge theory carries over to a general setting. For example, it is sufficient to ask that a vector bundle have a metric connection; when one does so, one finds that the metric connection satisfies the Yang–Mills equations of motion.

Quantization of gauge theories

[edit]

Gauge theories may be quantized by specialization of methods which are applicable to any quantum field theory. However, because of the subtleties imposed by the gauge constraints (see section on Mathematical formalism, above) there are many technical problems to be solved which do not arise in other field theories. At the same time, the richer structure of gauge theories allows simplification of some computations: for example Ward identities connect different renormalization constants.

Methods and aims

[edit]

The first gauge theory quantized was quantum electrodynamics (QED). The first methods developed for this involved gauge fixing and then applying canonical quantization. The Gupta–Bleuler method was also developed to handle this problem. Non-abelian gauge theories are now handled by a variety of means. Methods for quantization are covered in the article on quantization.

The main point to quantization is to be able to compute quantum amplitudes for various processes allowed by the theory. Technically, they reduce to the computations of certain correlation functions in the vacuum state. This involves a renormalization of the theory.

When the running coupling of the theory is small enough, then all required quantities may be computed in perturbation theory. Quantization schemes intended to simplify such computations (such as canonical quantization) may be called perturbative quantization schemes. At present some of these methods lead to the most precise experimental tests of gauge theories.

However, in most gauge theories, there are many interesting questions which are non-perturbative. Quantization schemes suited to these problems (such as lattice gauge theory) may be called non-perturbative quantization schemes. Precise computations in such schemes often require supercomputing, and are therefore less well-developed currently than other schemes.

Anomalies

[edit]

Some of the symmetries of the classical theory are then seen not to hold in the quantum theory; a phenomenon called an anomaly. Among the most well known are:

Pure gauge

[edit]

A pure gauge is the set of field configurations obtained by a gauge transformation on the null-field configuration, i.e., a gauge transform of zero. So it is a particular "gauge orbit" in the field configuration's space.

Thus, in the abelian case, where , the pure gauge is just the set of field configurations for all f(x).

See also

[edit]

References

[edit]

Bibliography

[edit]
[edit]
Revisions and contributorsEdit on WikipediaRead on Wikipedia
from Grokipedia
Gauge theory is a class of quantum field theories in physics that describe the fundamental forces of nature through the principle of local gauge invariance, where the equations of motion remain unchanged under arbitrary local transformations of the fields at each point in spacetime. These theories introduce gauge fields as mediators of interactions, ensuring the invariance by compensating for the local changes in matter fields. The prototypical example is quantum electrodynamics (QED), an Abelian gauge theory based on the U(1) symmetry group, which successfully describes electromagnetism. The historical development of gauge theory began with Hermann Weyl's 1918 proposal to unify gravity and electromagnetism via local scale invariance, though this initial formulation faced challenges with quantum mechanics. A pivotal advancement occurred in 1954 when Chen Ning Yang and Robert Mills generalized the concept to non-Abelian gauge symmetries, inspired by attempts to describe the strong nuclear force while conserving isotopic spin. This Yang-Mills theory provided the framework for modern particle physics, overcoming early issues like lack of renormalizability through subsequent insights, including spontaneous symmetry breaking introduced by Peter Higgs and others in the 1960s. In the of , gauge theory unifies three of the four fundamental interactions—the electromagnetic, weak, and strong forces—via a non-Abelian gauge structure with the SU(3)c × SU(2)L × U(1)Y, where SU(3)c governs (QCD) for the strong force, SU(2)L × U(1)Y describes the electroweak sector, and gravity remains outside this framework as . Key achievements include the prediction and discovery of the W and Z bosons in 1983, confirming the electroweak theory developed by , , and , for which they shared the . Gauge theories have been rigorously tested experimentally, with quantum corrections matching observations to high precision, underscoring their role as the cornerstone of contemporary high-energy physics.

Introduction

Definition and Core Principles

Gauge theory is a type of in which the Lagrangian remains invariant under transformations of the fields, where the transformation parameters can vary with position in . These symmetries, also known as gauge symmetries, distinguish gauge theories from those with global symmetries, where parameters are constant across . Gauge symmetries introduce redundancies in the description of physical states, meaning that multiple field configurations can represent the same physical situation. To preserve the invariance of the theory under these local transformations, gauge fields are introduced, which mediate the interactions and compensate for the variations in the transformation parameters. A fundamental principle is that all physical observables must be gauge-invariant, ensuring that measurable quantities do not depend on the choice of gauge. A basic example arises in with a complex ψ\psi, whose Lagrangian is initially invariant under global phase rotations ψψeiα\psi \to \psi e^{i\alpha}. Promoting this to a local transformation ψψeiα(x)\psi \to \psi e^{i\alpha(x)}, where α(x)\alpha(x) depends on position xx, requires introducing a gauge field AμA_\mu (the electromagnetic potential) to restore invariance through covariant derivatives Dμ=μieAμD_\mu = \partial_\mu - i e A_\mu. Gauge theories originated from efforts to unify fundamental forces within the frameworks of relativity and , with first proposing local phase invariance in 1918. They provide the standard model description for electromagnetic, weak, and strong nuclear forces, enabling a unified understanding of particle interactions through these principles.

Historical Development

The origins of gauge theory trace back to efforts in the early to unify fundamental forces through local symmetries in . In 1918, proposed a of Einstein's by introducing local , or "gauge invariance," to incorporate alongside . This approach posited that the could vary under local scaling transformations, with the electromagnetic potential serving as a compensating field to maintain invariance. However, Weyl's theory encountered a critical flaw: it implied that atomic lengths, such as spectral lines, would change over time or in varying electromagnetic fields, an effect not observed experimentally, leading Einstein to critique it as unphysical. The concept gained new life with the advent of in the late 1920s. In 1926, recognized that the for a in an remains invariant under local phase transformations of the wave function, with the adjusting accordingly. independently highlighted this phase invariance in 1927, drawing parallels to Weyl's earlier scale gauge but reinterpreting it for quantum contexts without the problematic length changes. Weyl himself revised his ideas in 1929, abandoning scale invariance in favor of local phase (or "U(1)") invariance, which aligned gauge transformations with the newly developed quantum theory of . Post-World War II advancements extended gauge principles beyond abelian groups like U(1). In 1954, Chen Ning Yang and Mills generalized the framework to non-abelian groups, motivated by the need to describe strong nuclear interactions via isotopic spin () symmetry in pion-nucleon . Their theory introduced self-interacting gauge fields, but initial applications to massive particles required breaking gauge invariance, which was seen as a drawback at the time. During the 1960s and 1970s, gauge theories became central to , particularly in unifying the electromagnetic and weak forces. Sheldon Glashow proposed an SU(2) × U(1) gauge model in 1961 to merge these interactions, predicting neutral currents. and independently developed the full electroweak theory in 1967–1968, incorporating via the to generate particle masses while preserving gauge invariance at high energies. This model successfully predicted the W and Z bosons, confirmed experimentally in the 1980s. The transition from classical to quantum gauge theories was solidified by proving their renormalizability, a feat achieved by Gerardus 't Hooft and Martinus Veltman in 1971 for non-abelian cases. Parallel to electroweak unification, the strong nuclear force was described through (QCD), a non-Abelian gauge theory invariant under the SU(3)c color symmetry group. QCD emerged in 1973, with key contributions from , Harald Fritzsch, and others in formulating the theory, alongside the independent discovery of by , , and David Politzer. resolved challenges in describing quark confinement and enabled perturbative calculations at high energies (short distances), confirming QCD's viability for the strong interaction. These advancements integrated QCD into the framework alongside the electroweak sector. The discovery of was awarded the . Key milestones include Nobel Prizes recognizing these contributions: in 1979, Glashow, Salam, and Weinberg for the electroweak unification theory; and in 1999, 't Hooft and Veltman for elucidating the quantum structure of electroweak interactions through . These developments marked the shift of gauge theory from a classical unification attempt to the cornerstone of the .

Symmetries in Gauge Theory

Global Symmetries

In physics, a global refers to a transformation of the fields in a theory that leaves the action or Lagrangian invariant, where the transformation parameters are constant across . Such symmetries arise when the laws of physics remain unchanged under uniform shifts, rotations, or other operations applied everywhere simultaneously. According to Noether's theorem, every continuous global symmetry of the action corresponds to a conserved quantity, manifested through a conserved current in the theory. For instance, a global U(1) phase symmetry, where a complex scalar field ϕ\phi transforms as ϕeiαϕ\phi \to e^{i\alpha} \phi with constant α\alpha, implies conservation of the associated charge, such as electric charge in electromagnetism. The theorem establishes that the divergence of the Noether current JμJ^\mu vanishes, μJμ=0\partial_\mu J^\mu = 0, leading to a conserved charge Q=d3xJ0Q = \int d^3x \, J^0. Conceptually, for an infinitesimal transformation δϕ=ϵK(ϕ)\delta \phi = \epsilon K(\phi) with constant ϵ\epsilon, the current takes the form JμL(μϕ)δϕJ^\mu \approx \frac{\partial \mathcal{L}}{\partial (\partial_\mu \phi)} \delta \phi, ensuring conservation on-shell. Examples of global symmetries include like translational invariance, which yields momentum conservation via , and rotational invariance, which conserves . In , internal global symmetries such as the SU(2) symmetry treat protons and neutrons (or up and down quarks) as degenerate states under strong interactions, approximately conserving isospin in the limit of equal masses. Global symmetries serve as a foundational , often undergoing spontaneous breaking in physical systems, where the does not respect the symmetry of the Lagrangian, resulting in massless Goldstone bosons—one for each broken generator—as dictated by Goldstone's theorem.

Local Symmetries and Gauge Invariance

In gauge theories, local symmetries extend the of global symmetries by allowing transformation parameters to vary with position and time in . For a field such as a Dirac fermion ψ in the Abelian U(1) group, the local transformation takes the form ψ(x) → ψ'(x) = e^{i α(x)} ψ(x), where α(x) is an arbitrary smooth function of the coordinates x. This contrasts with global symmetries, where α is constant across all space, and requires additional structure to preserve the invariance of the theory's action under such position-dependent changes. To maintain gauge invariance under these local transformations, the ordinary partial derivative ∂_μ acting on the matter field must be replaced by a covariant derivative that compensates for the variation in the transformation parameter. The covariant derivative is defined as D_μ = ∂_μ - i g A_μ, where g is the coupling constant and A_μ is a vector gauge field introduced to "absorb" the local phase shift. Under the local U(1) transformation, D_μ ψ transforms in the same way as ψ itself, ensuring that terms like \bar{ψ} γ^μ D_μ ψ in the Lagrangian remain invariant. The gauge field A_μ itself must transform to preserve this covariance, specifically A'_μ(x) = A_μ(x) + (1/g) ∂_μ α(x) in the Abelian case. This transformation rule ensures that the combination D_μ ψ is covariantly transformed, highlighting the interconnected between the matter fields and the gauge fields. Conceptually, local symmetries introduce a in the description of physical configurations, as different gauge choices correspond to the same physics, and this is resolved by dynamical gauge fields that mediate interactions between particles. Physically, this framework implies that fundamental forces emerge as mechanisms for of fields across to uphold the local , exemplified by the electromagnetic force arising from U(1) gauge invariance in , where photons serve as the gauge bosons exchanging momentum to maintain phase coherence.

Classical Gauge Theories

Abelian Gauge Theories

Abelian gauge theories are gauge theories constructed from Abelian Lie groups, such as the U(1), characterized by vanishing that result in commuting group transformations. In these theories, the gauge symmetry arises from local phase transformations of the form ψeiqΛ(x)ψ\psi \to e^{i q \Lambda(x)} \psi for a charged field ψ\psi, where Λ(x)\Lambda(x) is an arbitrary smooth function and qq is the charge, ensuring that physical observables remain invariant under such redundant descriptions. The prototypical example of an Abelian gauge theory is classical electromagnetism, where the U(1) symmetry governs the interaction of electromagnetic fields with charged matter. Maxwell's equations emerge from the gauge-invariant Lagrangian density L=14FμνFμν14jμAμ,\mathcal{L} = -\frac{1}{4} F_{\mu\nu} F^{\mu\nu} - \frac{1}{4} j^\mu A_\mu, with the field strength tensor defined as Fμν=μAννAμF_{\mu\nu} = \partial_\mu A_\nu - \partial_\nu A_\mu, where AμA_\mu is the gauge potential (vector potential) and jμj^\mu represents classical currents from matter sources. Varying this action with respect to AμA_\mu yields the inhomogeneous Maxwell equations μFμν=jν\partial_\mu F^{\mu\nu} = j^\nu, while the Bianchi identity λFμν+μFνλ+νFλμ=0\partial_\lambda F_{\mu\nu} + \partial_\mu F_{\nu\lambda} + \partial_\nu F_{\lambda\mu} = 0 gives the homogeneous ones, all in natural units where ϵ0=μ0=c=1\epsilon_0 = \mu_0 = c = 1. For relativistic scalar fields, coupling occurs through the covariant derivative Dμ=μ+iqAμD_\mu = \partial_\mu + i q A_\mu, leading to the interaction term in the Lagrangian (Dμϕ)(Dμϕ)(D_\mu \phi)^* (D^\mu \phi). The gauge freedom in Abelian theories manifests as the transformation AμAμ+μΛA_\mu \to A_\mu + \partial_\mu \Lambda, under which FμνF_{\mu\nu} remains unchanged, preserving the physical fields E\mathbf{E} and B\mathbf{B}. To handle this redundancy in calculations, specific gauges are chosen; the Lorenz gauge μAμ=0\partial_\mu A^\mu = 0 simplifies wave equations for propagating fields, while the gauge A=0\nabla \cdot \mathbf{A} = 0 facilitates non-relativistic treatments by separating longitudinal and transverse components. In continuum formulations, in vacuum read E=ρ\nabla \cdot \mathbf{E} = \rho, B=0\nabla \cdot \mathbf{B} = 0, ×E=tB\nabla \times \mathbf{E} = -\partial_t \mathbf{B}, and ×B=j+tE\nabla \times \mathbf{B} = \mathbf{j} + \partial_t \mathbf{E}, describing free-field propagation at speed cc. In media, these become D=ρf\nabla \cdot \mathbf{D} = \rho_f, B=0\nabla \cdot \mathbf{B} = 0, ×E=tB\nabla \times \mathbf{E} = -\partial_t \mathbf{B}, and ×H=jf+tD\nabla \times \mathbf{H} = \mathbf{j}_f + \partial_t \mathbf{D}, where D=ϵE\mathbf{D} = \epsilon \mathbf{E} and B=μH\mathbf{B} = \mu \mathbf{H} incorporate material permittivities ϵ\epsilon and permeabilities μ\mu, with free charges ρf\rho_f and currents jf\mathbf{j}_f driving the fields. Gauge invariance ensures the consistency of these equations under local phase transformations when to charged matter.

Non-Abelian Gauge Theories

Non-Abelian gauge theories generalize the framework of gauge theories to Lie groups that are non-commutative, such as the special unitary groups SU(N) for N ≥ 2, where the generators of the satisfy [T^a, T^b] = i f^{abc} T^c with non-vanishing f^{abc}. These encode the non-Abelian nature, leading to self-interactions among the gauge fields that are absent in Abelian theories like . In contrast to Abelian cases, where gauge fields are neutral, the gauge bosons in non-Abelian theories transform non-trivially under the gauge group, effectively carrying "charge" and mediating interactions among themselves through non-linear terms. The foundational example is Yang-Mills theory, formulated by Yang and Mills as a pure non-Abelian gauge theory invariant under local transformations of the group. The Lagrangian density for the gauge fields A_μ^a (with a labeling the adjoint representation) is given by L=14FμνaFaμν,\mathcal{L} = -\frac{1}{4} F_{\mu\nu}^a F^{a\mu\nu}, where the non-Abelian field strength tensor incorporates both the abelian-like commutator of derivatives and the self-coupling term: Fμνa=μAνaνAμa+gfabcAμbAνc.F_{\mu\nu}^a = \partial_\mu A_\nu^a - \partial_\nu A_\mu^a + g f^{abc} A_\mu^b A_\nu^c. Here, g is the coupling constant, and the indices run over the group's dimension. The gauge transformations under which this theory is invariant take the matrix-valued form A_μ = A_μ^a T^a → U A_μ U^\dagger + (i/g) (∂_μ U) U^\dagger, where U(x) = exp(i θ^a(x) T^a) is a space-time-dependent group element, with T^a the generators normalized such that Tr(T^a T^b) = (1/2) δ^{ab}. This non-linear transformation rule reflects the charged nature of the gauge fields, enabling triple and quartic self-interactions derived from the field's commutator structure. Classical solutions to the Yang-Mills equations reveal rich topological structures, including instantons and monopoles, which arise as finite-action configurations due to the non-Abelian topology. Instantons, first constructed explicitly for SU(2) in , are self-dual solutions F_{μν}^a = \pm \tilde{F}_{μν}^a that saturate the action bound and carry non-trivial , serving as topological defects. Magnetic monopoles emerge in theories with broken , such as when a in the acquires a ; these are smooth, soliton-like solutions with finite energy and magnetic charge quantized in units determined by the group's topology. Independently discovered by Polyakov and 't Hooft, monopoles exist in SU(2) gauge theories with a Higgs triplet, where the long-range field is embedded in a non-singular core. A illustrative classical example is the gauged O(n) nonlinear sigma model, where an n-component scalar field φ with constraint |φ|^2 = v^2 couples to non-Abelian gauge fields in the adjoint representation, as in the SU(2) Georgi-Glashow model for n=3. The Lagrangian includes the gauge kinetic term, the scalar kinetic term with covariant derivative D_μ φ = ∂_μ φ - i g [A_μ, φ], and a symmetry-breaking potential, yielding non-linear equations that support monopole solutions in the limit of weak coupling. This setup highlights the interplay between gauge self-interactions and scalar dynamics without invoking quantum effects.

Mathematical Framework

Gauge Fields and Potentials

In gauge theories, the fundamental mathematical structure is provided by a principal fiber bundle over a manifold, where the fibers are copies of the gauge group GG, and the connection on this bundle encodes the gauge interactions. The base space is the manifold MM, typically a , while the total space PP consists of points (x,g)(x, g) with xMx \in M and gGg \in G, modulo right actions by the group. This geometric framework allows for a covariant description of gauge fields, independent of a specific coordinate choice on the base manifold. The gauge potential, denoted AμA_\mu, is a Lie algebra-valued one-form on the manifold, taking values in the g\mathfrak{g} associated to the gauge group GG. Mathematically, A=AμdxμΩ1(M)gA = A_\mu \, dx^\mu \in \Omega^1(M) \otimes \mathfrak{g}, where Ω1(M)\Omega^1(M) denotes the space of one-forms on MM. This potential represents the connection that defines along paths in the bundle: for a γ\gamma in MM, the around γ\gamma is given by the path-ordered exponential of the of AA along γ\gamma, which transports fibers from one point to another while preserving the bundle structure. In local trivializations, AμA_\mu transforms under gauge transformations g(x)Gg(x) \in G as Aμg1Aμg+ig1μgA_\mu \to g^{-1} A_\mu g + i g^{-1} \partial_\mu g, ensuring the connection's . The field strength tensor FμνF_{\mu\nu}, or more generally the curvature two-form F=12FμνdxμdxνΩ2(M)gF = \frac{1}{2} F_{\mu\nu} \, dx^\mu \wedge dx^\nu \in \Omega^2(M) \otimes \mathfrak{g}, measures the non-integrability of the connection and quantifies the "twisting" of the bundle. For Abelian gauge groups, where [g,g]=0[\mathfrak{g}, \mathfrak{g}] = 0, it simplifies to the exterior derivative F=dAF = dA. In the non-Abelian case, the curvature includes a nonlinear term accounting for the group's non-commutativity: F=dA+AAF = dA + A \wedge A, or in components, Fμν=μAννAμ+[Aμ,Aν]F_{\mu\nu} = \partial_\mu A_\nu - \partial_\nu A_\mu + [A_\mu, A_\nu]. This form arises naturally from the structure equation of connections on principal bundles and captures the self-interaction of gauge fields inherent to non-Abelian symmetries. A key identity satisfied by the field strength is the Bianchi identity, which in components reads DλFμν+DμFνλ+DνFλμ=0D_\lambda F_{\mu\nu} + D_\mu F_{\nu\lambda} + D_\nu F_{\lambda\mu} = 0, where DμD_\mu is the covariant derivative DμVν=μVν+[Aμ,Vν]D_\mu V^\nu = \partial_\mu V^\nu + [A_\mu, V^\nu] for Lie algebra-valued fields. This identity follows from the Maurer-Cartan structure equation DF=0D F = 0 (i.e., dF+AFFA=0dF + A \wedge F - F \wedge A = 0) and holds identically without reference to equations of motion or sources. It implies that the gauge current is covariantly conserved, DνJaν=0D_\nu J^{a\nu} = 0, when combined with the field equations DμFaμν=JaνD_\mu F^{a\mu\nu} = J^{a\nu}, providing a geometric consistency condition that ensures the solvability of the dynamical equations. Gauge-invariant observables in the theory are constructed using path-dependent quantities that are insensitive to local gauge transformations. Prominent among these are Wilson loops, defined for a closed path CC in spacetime as the trace in a representation of the gauge group of the path-ordered exponential: W(C)=1dimRTrR[Pexp(igCAμdxμ)]W(C) = \frac{1}{\dim R} \operatorname{Tr}_R \left[ \mathcal{P} \exp\left( i g \oint_C A_\mu \, dx^\mu \right) \right], where gg is the coupling constant and P\mathcal{P} denotes path-ordering along CC. These loops serve as non-local probes of the gauge field configuration, with their expectation values encoding information about confinement and topological properties in quantum gauge theories. Particular configurations known as pure gauge fields correspond to flat connections, where the curvature vanishes identically, Fμν=0F_{\mu\nu} = 0. Such fields can be expressed locally as Aμ=ig1UμU1A_\mu = i g^{-1} U \partial_\mu U^{-1} for some group-valued function U(x)GU(x) \in G, which is the Maurer-Cartan form pulled back from the group manifold. While these configurations are gauge-equivalent to the trivial connection Aμ=0A_\mu = 0 locally, they may exhibit non-trivial global topology, such as when the map U:MGU: M \to G has a non-contractible homotopy class, leading to phenomena like Aharonov-Bohm phases or instanton sectors in compactified spacetimes.

Lagrangians and Field Equations

In gauge theories, the dynamics of the fields are governed by a Lagrangian density that is invariant under local gauge transformations. For a pure non-Abelian gauge theory based on a Lie group with structure constants fabcf^{abc}, the Yang-Mills Lagrangian is given by LYM=14FμνaFaμν,\mathcal{L}_{\text{YM}} = -\frac{1}{4} F^a_{\mu\nu} F^{a\mu\nu}, where Fμνa=μAνaνAμa+gfabcAμbAνcF^a_{\mu\nu} = \partial_\mu A^a_\nu - \partial_\nu A^a_\mu + g f^{abc} A^b_\mu A^c_\nu is the field strength tensor, AμaA^a_\mu are the gauge potentials, gg is the coupling constant, and repeated indices are summed over with the Minkowski metric ημν=diag(1,1,1,1)\eta^{\mu\nu} = \text{diag}(1, -1, -1, -1). This form ensures gauge invariance and generalizes the electromagnetic Lagrangian to non-Abelian groups. When matter fields are included, such as Dirac fermions ψ\psi in the fundamental representation, the Lagrangian extends to incorporate gauge-invariant interactions via the covariant derivative Dμψ=(μigAμaTa)ψD_\mu \psi = (\partial_\mu - i g A^a_\mu T^a) \psi, where TaT^a are the generators of the gauge group satisfying [Ta,Tb]=ifabcTc[T^a, T^b] = i f^{abc} T^c. The full Lagrangian becomes L=14FμνaFaμν+ψˉ(iγμDμm)ψ+interaction terms,\mathcal{L} = -\frac{1}{4} F^a_{\mu\nu} F^{a\mu\nu} + \bar{\psi} (i \gamma^\mu D_\mu - m) \psi + \text{interaction terms}, with the kinetic term ψˉiγμDμψ\bar{\psi} i \gamma^\mu D_\mu \psi replacing the ordinary derivative to maintain local gauge invariance. For scalar fields ϕ\phi transforming in a representation of the group, the is Dμϕ=μϕigAμaTaϕD_\mu \phi = \partial_\mu \phi - i g A^a_\mu T^a \phi, leading to terms like (Dμϕ)(Dμϕ)(D_\mu \phi)^\dagger (D^\mu \phi) in the Lagrangian. The field equations are derived from the principle of least action using the Euler-Lagrange equations for the fields. For the gauge fields, varying the action d4xL\int d^4x \mathcal{L} with respect to AνaA^a_\nu yields the Yang-Mills equations DμFaμν=Jaν,D_\mu F^{a\mu\nu} = J^{a\nu}, where the covariant derivative on the field strength is DμFaμν=μFaμν+gfabcAμbFcμνD_\mu F^{a\mu\nu} = \partial_\mu F^{a\mu\nu} + g f^{abc} A^b_\mu F^{c\mu\nu}, and JaνJ^{a\nu} is the gauge current from matter fields, such as Jaν=ψˉγνTaψJ^{a\nu} = \bar{\psi} \gamma^\nu T^a \psi for fermions. These equations describe the propagation and self-interaction of gauge bosons, with the matter current sourcing the fields. For pure gauge theories without matter, Jaν=0J^{a\nu} = 0, reducing to the homogeneous equations. In the Abelian limit, where the gauge group is U(1)U(1) (so fabc=0f^{abc} = 0), the Yang-Mills Lagrangian simplifies to the Maxwell Lagrangian 14FμνFμν-\frac{1}{4} F_{\mu\nu} F^{\mu\nu}, and the field equations become μFμν=Jν\partial_\mu F^{\mu\nu} = J^\nu. Identifying F0i=EiF^{0i} = E^i and Fij=ϵijkBkF^{ij} = -\epsilon^{ijk} B_k, these reduce to the inhomogeneous Maxwell equations E=ρ\nabla \cdot \mathbf{E} = \rho and ×BtE=J\nabla \times \mathbf{B} - \partial_t \mathbf{E} = \mathbf{J}, with ρ=J0\rho = J^0 and J=Ji\mathbf{J} = J^i. To incorporate massive gauge bosons in a gauge-invariant manner, a classical scalar sector can be added with a potential that breaks the spontaneously. The Higgs potential is V(ϕ)=μ2ϕ2+λϕ4V(\phi) = \mu^2 |\phi|^2 + \lambda |\phi|^4 with μ2<0\mu^2 < 0, leading to a non-zero vacuum expectation value ϕ=v/2\langle \phi \rangle = v / \sqrt{2}
Add your contribution
Related Hubs
User Avatar
No comments yet.