Search This Blog

Friday, July 4, 2025

Dirac equation

From Wikipedia, the free encyclopedia
https://en.wikipedia.org/wiki/Dirac_equation

In particle physics, the Dirac equation is a relativistic wave equation derived by British physicist Paul Dirac in 1928. In its free form, or including electromagnetic interactions, it describes all spin-1/2 massive particles, called "Dirac particles", such as electrons and quarks for which parity is a symmetry. It is consistent with both the principles of quantum mechanics and the theory of special relativity, and was the first theory to account fully for special relativity in the context of quantum mechanics. The equation is validated by its rigorous accounting of the observed fine structure of the hydrogen spectrum and has become vital in the building of the Standard Model.

The equation also implied the existence of a new form of matter, antimatter, previously unsuspected and unobserved and which was experimentally confirmed several years later. It also provided a theoretical justification for the introduction of several component wave functions in Pauli's phenomenological theory of spin. The wave functions in the Dirac theory are vectors of four complex numbers (known as bispinors), two of which resemble the Pauli wavefunction in the non-relativistic limit, in contrast to the Schrödinger equation, which described wave functions of only one complex value. Moreover, in the limit of zero mass, the Dirac equation reduces to the Weyl equation.

In the context of quantum field theory, the Dirac equation is reinterpreted to describe quantum fields corresponding to spin-1/2 particles.

Dirac did not fully appreciate the importance of his results; however, the entailed explanation of spin as a consequence of the union of quantum mechanics and relativity—and the eventual discovery of the positron—represents one of the great triumphs of theoretical physics. This accomplishment has been described as fully on par with the works of Newton, Maxwell, and Einstein before him. The equation has been deemed by some physicists to be the "real seed of modern physics". The equation has also been described as the "centerpiece of relativistic quantum mechanics", with it also stated that "the equation is perhaps the most important one in all of quantum mechanics".

The Dirac equation is inscribed upon a plaque on the floor of Westminster Abbey. Unveiled on 13 November 1995, the plaque commemorates Dirac's life.

The equation, in its natural units formulation, is also prominently displayed in the auditorium at the ‘Paul A.M. Dirac’ Lecture Hall at the Patrick M.S. Blackett Institute (formerly The San Domenico Monastery) of the Ettore Majorana Foundation and Centre for Scientific Culture in Erice, Sicily.

History

The Dirac equation in the form originally proposed by Dirac is: where ψ(x, t) is the wave function for an electron of rest mass m with spacetime coordinates x, t. p1, p2, p3 are the components of the momentum, understood to be the momentum operator in the Schrödinger equation. c is the speed of light, and ħ is the reduced Planck constant; these fundamental physical constants reflect special relativity and quantum mechanics, respectively. αn and β are 4 × 4 gamma matrices.

Dirac's purpose in casting this equation was to explain the behavior of the relativistically moving electron, thus allowing the atom to be treated in a manner consistent with relativity. He hoped that the corrections introduced this way might have a bearing on the problem of atomic spectra.

Up until that time, attempts to make the old quantum theory of the atom compatible with the theory of relativity—which were based on discretizing the angular momentum stored in the electron's possibly non-circular orbit of the atomic nucleus—had failed, and the new quantum mechanics of Heisenberg, Pauli, Jordan, Schrödinger, and Dirac himself had not developed sufficiently to treat this problem. Although Dirac's original intentions were satisfied, his equation had far deeper implications for the structure of matter and introduced new mathematical classes of objects that are now essential elements of fundamental physics.

The new elements in this equation are the four 4 × 4 matrices α1, α2, α3 and β, and the four-component wave function ψ. There are four components in ψ because the evaluation of it at any given point in configuration space is a bispinor. It is interpreted as a superposition of a spin-up electron, a spin-down electron, a spin-up positron, and a spin-down positron.

The 4 × 4 matrices αk and β are all Hermitian and are involutory: and they all mutually anti-commute:

These matrices and the form of the wave function have a deep mathematical significance. The algebraic structure represented by the gamma matrices had been created some 50 years earlier by the English mathematician W. K. Clifford. In turn, Clifford's ideas had emerged from the mid-19th-century work of German mathematician Hermann Grassmann in his Lineare Ausdehnungslehre (Theory of Linear Expansion).

Making the Schrödinger equation relativistic

The Dirac equation is superficially similar to the Schrödinger equation for a massive free particle:

The left side represents the square of the momentum operator divided by twice the mass, which is the non-relativistic kinetic energy. Because relativity treats space and time as a whole, a relativistic generalization of this equation requires that space and time derivatives must enter symmetrically as they do in the Maxwell equations that govern the behavior of light – the equations must be differentially of the same order in space and time. In relativity, the momentum and the energies are the space and time parts of a spacetime vector, the four-momentum, and they are related by the relativistically invariant relation which says that the length of this four-vector is proportional to the rest mass m. Substituting the operator equivalents of the energy and momentum from the Schrödinger theory produces the Klein–Gordon equation describing the propagation of waves, constructed from relativistically invariant objects, with the wave function being a relativistic scalar: a complex number that has the same numerical value in all frames of reference. Space and time derivatives both enter to second order. This has a telling consequence for the interpretation of the equation. Because the equation is second order in the time derivative, one must specify initial values both of the wave function itself and of its first time-derivative in order to solve definite problems. Since both may be specified more or less arbitrarily, the wave function cannot maintain its former role of determining the probability density of finding the electron in a given state of motion. In the Schrödinger theory, the probability density is given by the positive definite expression and this density is convected according to the probability current vector with the conservation of probability current and density following from the continuity equation:

The fact that the density is positive definite and convected according to this continuity equation implies that one may integrate the density over a certain domain and set the total to 1, and this condition will be maintained by the conservation law. A proper relativistic theory with a probability density current must also share this feature. To maintain the notion of a convected density, one must generalize the Schrödinger expression of the density and current so that space and time derivatives again enter symmetrically in relation to the scalar wave function. The Schrödinger expression can be kept for the current, but the probability density must be replaced by the symmetrically formed expression  which now becomes the 4th component of a spacetime vector, and the entire probability 4-current density has the relativistically covariant expression

The continuity equation is as before. Everything is compatible with relativity now, but the expression for the density is no longer positive definite; the initial values of both ψ and tψ may be freely chosen, and the density may thus become negative, something that is impossible for a legitimate probability density. Thus, one cannot get a simple generalization of the Schrödinger equation under the naive assumption that the wave function is a relativistic scalar, and the equation it satisfies, second order in time.

Although it is not a successful relativistic generalization of the Schrödinger equation, this equation is resurrected in the context of quantum field theory, where it is known as the Klein–Gordon equation, and describes a spinless particle field (e.g. pi meson or Higgs boson). Historically, Schrödinger himself arrived at this equation before the one that bears his name but soon discarded it. In the context of quantum field theory, the indefinite density is understood to correspond to the charge density, which can be positive or negative, and not the probability density.

Dirac's coup

Dirac thus thought to try an equation that was first order in both space and time. He postulated an equation of the form where the operators must be independent of for linearity and independent of for space-time homogeneity. These constraints implied additional dynamical variables that the operators will depend upon; from this requirement Dirac concluded that the operators would depend upon 4 × 4 matrices, related to the Pauli matrices.

One could, for example, formally (i.e. by abuse of notation, since it is not straightforward to take a functional square root of the sum of two differential operators) take the relativistic expression for the energy replace p by its operator equivalent, expand the square root in an infinite series of derivative operators, set up an eigenvalue problem, then solve the equation formally by iterations. Most physicists had little faith in such a process, even if it were technically possible.

As the story goes, Dirac was staring into the fireplace at Cambridge, pondering this problem, when he hit upon the idea of taking the square root of the wave operator (see also half derivative) thus:

On multiplying out the right side it is apparent that, in order to get all the cross-terms such as xy to vanish, one must assume with

Dirac, who had just then been intensely involved with working out the foundations of Heisenberg's matrix mechanics, immediately understood that these conditions could be met if A, B, C and D are matrices, with the implication that the wave function has multiple components. This immediately explained the appearance of two-component wave functions in Pauli's phenomenological theory of spin, something that up until then had been regarded as mysterious, even to Pauli himself. However, one needs at least 4 × 4 matrices to set up a system with the properties required – so the wave function had four components, not two, as in the Pauli theory, or one, as in the bare Schrödinger theory. The four-component wave function represents a new class of mathematical object in physical theories that makes its first appearance here.

Given the factorization in terms of these matrices, one can now write down immediately an equation with to be determined. Applying again the matrix operator on both sides yields

Taking shows that all the components of the wave function individually satisfy the relativistic energy–momentum relation. Thus the sought-for equation that is first-order in both space and time is

Setting and because , the Dirac equation is produced as written above.

Covariant form and relativistic invariance

To demonstrate the relativistic invariance of the equation, it is advantageous to cast it into a form in which the space and time derivatives appear on an equal footing. New matrices are introduced as follows: and the equation takes the form (remembering the definition of the covariant components of the 4-gradient and especially that 0 = 1/ct)

Dirac equation

where there is an implied summation over the values of the twice-repeated index μ = 0, 1, 2, 3, and μ is the 4-gradient. In practice one often writes the gamma matrices in terms of 2 × 2 sub-matrices taken from the Pauli matrices and the 2 × 2 identity matrix. Explicitly the standard representation is

The complete system is summarized using the Minkowski metric on spacetime in the form where the bracket expression denotes the anticommutator. These are the defining relations of a Clifford algebra over a pseudo-orthogonal 4-dimensional space with metric signature (+ − − −). The specific Clifford algebra employed in the Dirac equation is known today as the Dirac algebra. Although not recognized as such by Dirac at the time the equation was formulated, in hindsight the introduction of this geometric algebra represents an enormous stride forward in the development of quantum theory.

The Dirac equation may now be interpreted as an eigenvalue equation, where the rest mass is proportional to an eigenvalue of the 4-momentum operator, the proportionality constant being the speed of light:

Using ( is pronounced "d-slash"), according to Feynman slash notation, the Dirac equation becomes:

In practice, physicists often use units of measure such that ħ = c = 1, known as natural units. The equation then takes the simple form

Dirac equation (natural units)

A foundational theorem states that if two distinct sets of matrices are given that both satisfy the Clifford relations, then they are connected to each other by a similarity transform:

If in addition the matrices are all unitary, as are the Dirac set, then S itself is unitary;

The transformation U is unique up to a multiplicative factor of absolute value 1. Let us now imagine a Lorentz transformation to have been performed on the space and time coordinates, and on the derivative operators, which form a covariant vector. For the operator γμμ to remain invariant, the gammas must transform among themselves as a contravariant vector with respect to their spacetime index. These new gammas will themselves satisfy the Clifford relations, because of the orthogonality of the Lorentz transformation. By the previously mentioned foundational theorem, one may replace the new set by the old set subject to a unitary transformation. In the new frame, remembering that the rest mass is a relativistic scalar, the Dirac equation will then take the form

If the transformed spinor is defined as then the transformed Dirac equation is produced in a way that demonstrates manifest relativistic invariance:

Thus, settling on any unitary representation of the gammas is final, provided the spinor is transformed according to the unitary transformation that corresponds to the given Lorentz transformation.

The various representations of the Dirac matrices employed will bring into focus particular aspects of the physical content in the Dirac wave function. The representation shown here is known as the standard representation – in it, the wave function's upper two components go over into Pauli's 2 spinor wave function in the limit of low energies and small velocities in comparison to light.

The considerations above reveal the origin of the gammas in geometry, hearkening back to Grassmann's original motivation; they represent a fixed basis of unit vectors in spacetime. Similarly, products of the gammas such as γμγν represent oriented surface elements, and so on. With this in mind, one can find the form of the unit volume element on spacetime in terms of the gammas as follows. By definition, it is

For this to be an invariant, the epsilon symbol must be a tensor, and so must contain a factor of g, where g is the determinant of the metric tensor. Since this is negative, that factor is imaginary. Thus

This matrix is given the special symbol γ5, owing to its importance when one is considering improper transformations of space-time, that is, those that change the orientation of the basis vectors. In the standard representation, it is

This matrix will also be found to anticommute with the other four Dirac matrices:

It takes a leading role when questions of parity arise because the volume element as a directed magnitude changes sign under a space-time reflection. Taking the positive square root above thus amounts to choosing a handedness convention on spacetime.

Pauli theory

The necessity of introducing half-integer spin goes back experimentally to the results of the Stern–Gerlach experiment. A beam of atoms is run through a strong inhomogeneous magnetic field, which then splits into N parts depending on the intrinsic angular momentum of the atoms. It was found that for silver atoms, the beam was split in two; the ground state therefore could not be integer, because even if the intrinsic angular momentum of the atoms were as small as possible, 1, the beam would be split into three parts, corresponding to atoms with Lz = −1, 0, +1. The conclusion is that silver atoms have net intrinsic angular momentum of 1/2. Pauli set up a theory that explained this splitting by introducing a two-component wave function and a corresponding correction term in the Hamiltonian, representing a semi-classical coupling of this wave function to an applied magnetic field, as so in SI units: (Note that bold faced characters imply Euclidean vectors in 3 dimensions, whereas the Minkowski four-vector Aμ can be defined as .)

Here A and represent the components of the electromagnetic four-potential in their standard SI units, and the three sigmas are the Pauli matrices. On squaring out the first term, a residual interaction with the magnetic field is found, along with the usual classical Hamiltonian of a charged particle interacting with an applied field in SI units:

This Hamiltonian is now a 2 × 2 matrix, so the Schrödinger equation based on it must use a two-component wave function. On introducing the external electromagnetic 4-vector potential into the Dirac equation in a similar way, known as minimal coupling, it takes the form:

A second application of the Dirac operator will now reproduce the Pauli term exactly as before, because the spatial Dirac matrices multiplied by i, have the same squaring and commutation properties as the Pauli matrices. What is more, the value of the gyromagnetic ratio of the electron, standing in front of Pauli's new term, is explained from first principles. This was a major achievement of the Dirac equation and gave physicists great faith in its overall correctness. There is more however. The Pauli theory may be seen as the low energy limit of the Dirac theory in the following manner. First the equation is written in the form of coupled equations for 2-spinors with the SI units restored: so

Assuming the field is weak and the motion of the electron non-relativistic, the total energy of the electron is approximately equal to its rest energy, and the momentum going over to the classical value, and so the second equation may be written which is of order Thus, at typical energies and velocities, the bottom components of the Dirac spinor in the standard representation are much suppressed in comparison to the top components. Substituting this expression into the first equation gives after some rearrangement

The operator on the left represents the particle's total energy reduced by its rest energy, which is just its classical kinetic energy, so one can recover Pauli's theory upon identifying his 2-spinor with the top components of the Dirac spinor in the non-relativistic approximation. A further approximation gives the Schrödinger equation as the limit of the Pauli theory. Thus, the Schrödinger equation may be seen as the far non-relativistic approximation of the Dirac equation when one may neglect spin and work only at low energies and velocities. This also was a great triumph for the new equation, as it traced the mysterious i that appears in it, and the necessity of a complex wave function, back to the geometry of spacetime through the Dirac algebra. It also highlights why the Schrödinger equation, although ostensibly in the form of a diffusion equation, actually represents wave propagation.

It should be strongly emphasized that the entire Dirac spinor represents an irreducible whole. The separation, done here, of the Dirac spinor into large and small components depends on the low-energy approximation being valid. The components that were neglected above, to show that the Pauli theory can be recovered by a low-velocity approximation of Dirac's equation, are necessary to produce new phenomena observed in the relativistic regime – among them antimatter, and the creation and annihilation of particles.

Weyl theory

In the massless case , the Dirac equation reduces to the Weyl equation, which describes relativistic massless spin-1/2 particles.

The theory acquires a second symmetry: see below.

Physical interpretation

Identification of observables

The critical physical question in a quantum theory is this: what are the physically observable quantities defined by the theory? According to the postulates of quantum mechanics, such quantities are defined by self-adjoint operators that act on the Hilbert space of possible states of a system. The eigenvalues of these operators are then the possible results of measuring the corresponding physical quantity. In the Schrödinger theory, the simplest such object is the overall Hamiltonian, which represents the total energy of the system. To maintain this interpretation on passing to the Dirac theory, the Hamiltonian must be taken to be where, as always, there is an implied summation over the twice-repeated index k = 1, 2, 3. This looks promising, because one can see by inspection the rest energy of the particle and, in the case of A = 0, the energy of a charge placed in an electric potential cqA0. What about the term involving the vector potential? In classical electrodynamics, the energy of a charge moving in an applied potential is

Thus, the Dirac Hamiltonian is fundamentally distinguished from its classical counterpart, and one must take great care to correctly identify what is observable in this theory. Much of the apparently paradoxical behavior implied by the Dirac equation amounts to a misidentification of these observables.

Hole theory

The negative E solutions to the equation are problematic, for it was assumed that the particle has a positive energy. Mathematically speaking, however, there seems to be no reason for us to reject the negative-energy solutions. Since they exist, they cannot simply be ignored, for once the interaction between the electron and the electromagnetic field is included, any electron placed in a positive-energy eigenstate would decay into negative-energy eigenstates of successively lower energy. Real electrons obviously do not behave in this way, or they would disappear by emitting energy in the form of photons.

To cope with this problem, Dirac introduced the hypothesis, known as hole theory, that the vacuum is the many-body quantum state in which all the negative-energy electron eigenstates are occupied. This description of the vacuum as a "sea" of electrons is called the Dirac sea. Since the Pauli exclusion principle forbids electrons from occupying the same state, any additional electron would be forced to occupy a positive-energy eigenstate, and positive-energy electrons would be forbidden from decaying into negative-energy eigenstates.

Dirac further reasoned that if the negative-energy eigenstates are incompletely filled, each unoccupied eigenstate – called a hole – would behave like a positively charged particle. The hole possesses a positive energy because energy is required to create a particle–hole pair from the vacuum. As noted above, Dirac initially thought that the hole might be the proton, but Hermann Weyl pointed out that the hole should behave as if it had the same mass as an electron, whereas the proton is over 1800 times heavier. The hole was eventually identified as the positron, experimentally discovered by Carl Anderson in 1932.

It is not entirely satisfactory to describe the "vacuum" using an infinite sea of negative-energy electrons. The infinitely negative contributions from the sea of negative-energy electrons have to be canceled by an infinite positive "bare" energy and the contribution to the charge density and current coming from the sea of negative-energy electrons is exactly canceled by an infinite positive "jellium" background so that the net electric charge density of the vacuum is zero. In quantum field theory, a Bogoliubov transformation on the creation and annihilation operators (turning an occupied negative-energy electron state into an unoccupied positive energy positron state and an unoccupied negative-energy electron state into an occupied positive energy positron state) allows us to bypass the Dirac sea formalism even though, formally, it is equivalent to it.

In certain applications of condensed matter physics, however, the underlying concepts of "hole theory" are valid. The sea of conduction electrons in an electrical conductor, called a Fermi sea, contains electrons with energies up to the chemical potential of the system. An unfilled state in the Fermi sea behaves like a positively charged electron, and although it too is referred to as an "electron hole", it is distinct from a positron. The negative charge of the Fermi sea is balanced by the positively charged ionic lattice of the material.

In quantum field theory

In quantum field theories such as quantum electrodynamics, the Dirac field is subject to a process of second quantization, which resolves some of the paradoxical features of the equation.

Mathematical formulation

In its modern formulation for field theory, the Dirac equation is written in terms of a Dirac spinor field taking values in a complex vector space described concretely as , defined on flat spacetime (Minkowski space) . Its expression also contains gamma matrices and a parameter interpreted as the mass, as well as other physical constants. Dirac first obtained his equation through a factorization of Einstein's energy-momentum-mass equivalence relation assuming a scalar product of momentum vectors determined by the metric tensor and quantized the resulting relation by associating momenta to their respective operators.

In terms of a field , the Dirac equation is then

and in natural units, with Feynman slash notation,

The gamma matrices are a set of four complex matrices (elements of ) that satisfy the defining anti-commutation relations: where is the Minkowski metric element, and the indices run over 0,1,2 and 3. These matrices can be realized explicitly under a choice of representation. Two common choices are the Dirac representation and the chiral representation. The Dirac representation is where are the Pauli matrices.

For the chiral representation the are the same, but

The slash notation is a compact notation for where is a four-vector (often it is the four-vector differential operator ). The summation over the index is implied.

Alternatively the four coupled linear first-order partial differential equations for the four quantities that make up the wave function can be written as a vector. In Planck units this becomes: which makes it clearer that it is a set of four partial differential equations with four unknown functions. (Note that the term is not preceded by i because σy is imaginary.)

Dirac adjoint and the adjoint equation

The Dirac adjoint of the spinor field is defined as Using the property of gamma matrices (which follows straightforwardly from Hermicity properties of the ) that one can derive the adjoint Dirac equation by taking the Hermitian conjugate of the Dirac equation and multiplying on the right by : where the partial derivative acts from the right on : written in the usual way in terms of a left action of the derivative, we have

Klein–Gordon equation

Applying to the Dirac equation gives That is, each component of the Dirac spinor field satisfies the Klein–Gordon equation.

Conserved current

A conserved current of the theory is

Proof of conservation from Dirac equation

Adding the Dirac and adjoint Dirac equations gives so by Leibniz rule,

Another approach to derive this expression is by variational methods, applying Noether's theorem for the global symmetry to derive the conserved current

Proof of conservation from Noether's theorem

Recall the Lagrangian is Under a symmetry that sends we find the Lagrangian is invariant.

Now considering the variation parameter to be infinitesimal, we work at first order in and ignore terms. From the previous discussion we immediately see the explicit variation in the Lagrangian due to is vanishing, that is under the variation, where .

As part of Noether's theorem, we find the implicit variation in the Lagrangian due to variation of fields. If the equation of motion for are satisfied, then

This immediately simplifies as there are no partial derivatives of in the Lagrangian. is the infinitesimal variation We evaluate The equation (*) finally is

Solutions

Since the Dirac operator acts on 4-tuples of square-integrable functions, its solutions should be members of the same Hilbert space. The fact that the energies of the solutions do not have a lower bound is unexpected.

Plane-wave solutions

Plane-wave solutions are those arising from an ansatz which models a particle with definite 4-momentum where

For this ansatz, the Dirac equation becomes an equation for : After picking a representation for the gamma matrices , solving this is a matter of solving a system of linear equations. It is a representation-free property of gamma matrices that the solution space is two-dimensional (see here).

For example, in the chiral representation for , the solution space is parametrised by a vector , with where and is the Hermitian matrix square-root.

These plane-wave solutions provide a starting point for canonical quantization.

Lagrangian formulation

Both the Dirac equation and the Adjoint Dirac equation can be obtained from (varying) the action with a specific Lagrangian density that is given by:

If one varies this with respect to one gets the adjoint Dirac equation. Meanwhile, if one varies this with respect to one gets the Dirac equation.

In natural units and with the slash notation, the action is then

Dirac Action

For this action, the conserved current above arises as the conserved current corresponding to the global symmetry through Noether's theorem for field theory. Gauging this field theory by changing the symmetry to a local, spacetime point dependent one gives gauge symmetry (really, gauge redundancy). The resultant theory is quantum electrodynamics or QED. See below for a more detailed discussion.

Lorentz invariance

The Dirac equation is invariant under Lorentz transformations, that is, under the action of the Lorentz group or strictly , the component connected to the identity.

For a Dirac spinor viewed concretely as taking values in , the transformation under a Lorentz transformation is given by a complex matrix . There are some subtleties in defining the corresponding , as well as a standard abuse of notation.

Most treatments occur at the Lie algebra level. For a more detailed treatment see here. The Lorentz group of real matrices acting on is generated by a set of six matrices with components When both the indices are raised or lowered, these are simply the 'standard basis' of antisymmetric matrices.

These satisfy the Lorentz algebra commutation relations In the article on the Dirac algebra, it is also found that the spin generators satisfy the Lorentz algebra commutation relations.

A Lorentz transformation can be written as where the components are antisymmetric in .

The corresponding transformation on spin space is This is an abuse of notation, but a standard one. The reason is is not a well-defined function of , since there are two different sets of components (up to equivalence) that give the same but different . In practice we implicitly pick one of these and then is well defined in terms of

Under a Lorentz transformation, the Dirac equation becomes

Remainder of proof of Lorentz invariance

Multiplying both sides from the left by and returning the dummy variable to gives We'll have shown invariance if or equivalently This is most easily shown at the algebra level. Supposing the transformations are parametrised by infinitesimal components , then at first order in , on the left-hand side we get while on the right-hand side we get It's a standard exercise to evaluate the commutator on the left-hand side. Writing in terms of components completes the proof.

Associated to Lorentz invariance is a conserved Noether current, or rather a tensor of conserved Noether currents . Similarly, since the equation is invariant under translations, there is a tensor of conserved Noether currents , which can be identified as the stress-energy tensor of the theory. The Lorentz current can be written in terms of the stress-energy tensor in addition to a tensor representing internal angular momentum.

Further discussion of Lorentz covariance of the Dirac equation

The Dirac equation is Lorentz covariant. Articulating this helps illuminate not only the Dirac equation, but also the Majorana spinor and Elko spinor, which although closely related, have subtle and important differences.

Understanding Lorentz covariance is simplified by keeping in mind the geometric character of the process. Let be a single, fixed point in the spacetime manifold. Its location can be expressed in multiple coordinate systems. In the physics literature, these are written as and , with the understanding that both and describe the same point , but in different local frames of reference (a frame of reference over a small extended patch of spacetime). One can imagine as having a fiber of different coordinate frames above it. In geometric terms, one says that spacetime can be characterized as a fiber bundle, and specifically, the frame bundle. The difference between two points and in the same fiber is a combination of rotations and Lorentz boosts. A choice of coordinate frame is a (local) section through that bundle.

Coupled to the frame bundle is a second bundle, the spinor bundle. A section through the spinor bundle is just the particle field (the Dirac spinor, in the present case). Different points in the spinor fiber correspond to the same physical object (the fermion) but expressed in different Lorentz frames. Clearly, the frame bundle and the spinor bundle must be tied together in a consistent fashion to get consistent results; formally, one says that the spinor bundle is the associated bundle; it is associated to a principal bundle, which in the present case is the frame bundle. Differences between points on the fiber correspond to the symmetries of the system. The spinor bundle has two distinct generators of its symmetries: the total angular momentum and the intrinsic angular momentum. Both correspond to Lorentz transformations, but in different ways.

The presentation here follows that of Itzykson and Zuber. It is very nearly identical to that of Bjorken and Drell. A similar derivation in a general relativistic setting can be found in Weinberg. Here we fix our spacetime to be flat, that is, our spacetime is Minkowski space.

Under a Lorentz transformation the Dirac spinor to transform as It can be shown that an explicit expression for is given by where parameterizes the Lorentz transformation, and are the six 4×4 matrices satisfying:

This matrix can be interpreted as the intrinsic angular momentum of the Dirac field. That it deserves this interpretation arises by contrasting it to the generator of Lorentz transformations, having the form This can be interpreted as the total angular momentum. It acts on the spinor field as Note the above does not have a prime on it: the above is obtained by transforming obtaining the change to and then returning to the original coordinate system .

The geometrical interpretation of the above is that the frame field is affine, having no preferred origin. The generator generates the symmetries of this space: it provides a relabelling of a fixed point The generator generates a movement from one point in the fiber to another: a movement from with both and still corresponding to the same spacetime point These perhaps obtuse remarks can be elucidated with explicit algebra.

Let be a Lorentz transformation. The Dirac equation is If the Dirac equation is to be covariant, then it should have exactly the same form in all Lorentz frames: The two spinors and should both describe the same physical field, and so should be related by a transformation that does not change any physical observables (charge, current, mass, etc.) The transformation should encode only the change of coordinate frame. It can be shown that such a transformation is a 4×4 unitary matrix. Thus, one may presume that the relation between the two frames can be written as Inserting this into the transformed equation, the result is The coordinates related by Lorentz transformation satisfy: The original Dirac equation is then regained if An explicit expression for (equal to the expression given above) can be obtained by considering a Lorentz transformation of infinitesimal rotation near the identity transformation: where is the metric tensor : and is symmetric while is antisymmetric. After plugging and chugging, one obtains which is the (infinitesimal) form for above and yields the relation . To obtain the affine relabelling, write

After properly antisymmetrizing, one obtains the generator of symmetries given earlier. Thus, both and can be said to be the "generators of Lorentz transformations", but with a subtle distinction: the first corresponds to a relabelling of points on the affine frame bundle, which forces a translation along the fiber of the spinor on the spin bundle, while the second corresponds to translations along the fiber of the spin bundle (taken as a movement along the frame bundle, as well as a movement along the fiber of the spin bundle.) Weinberg provides additional arguments for the physical interpretation of these as total and intrinsic angular momentum.

Other formulations

The Dirac equation can be formulated in a number of other ways.

Curved spacetime

This article has developed the Dirac equation in flat spacetime according to special relativity. It is possible to formulate the Dirac equation in curved spacetime.

The algebra of physical space

This article developed the Dirac equation using four-vectors and Schrödinger operators. The Dirac equation in the algebra of physical space uses a Clifford algebra over the real numbers, a type of geometric algebra.

Coupled Weyl Spinors

As mentioned above, the massless Dirac equation immediately reduces to the homogeneous Weyl equation. By using the chiral representation of the gamma matrices, the nonzero-mass equation can also be decomposed into a pair of coupled inhomogeneous Weyl equations acting on the first and last pairs of indices of the original four-component spinor, i.e. , where and are each two-component Weyl spinors. This is because the skew block form of the chiral gamma matrices means that they swap the and and apply the two-by-two Pauli matrices to each:

So the Dirac equation becomes which in turn is equivalent to a pair of inhomogeneous Weyl equations for massless left- and right-helicity spinors, where the coupling strength is proportional to the mass: [clarification needed]

This has been proposed as an intuitive explanation of Zitterbewegung, as these massless components would propagate at the speed of light and move in opposite directions, since the helicity is the projection of the spin onto the direction of motion. Here the role of the "mass" is not to make the velocity less than the speed of light, but instead controls the average rate at which these reversals occur; specifically, the reversals can be modeled as a Poisson process.

U(1) symmetry

Natural units are used in this section. The coupling constant is labelled by convention with : this parameter can also be viewed as modelling the electron charge.

Vector symmetry

The Dirac equation and action admits a symmetry where the fields transform as This is a global symmetry, known as the vector symmetry (as opposed to the axial symmetry: see below). By Noether's theorem there is a corresponding conserved current: this has been mentioned previously as

Gauging the symmetry

If we 'promote' the global symmetry, parametrised by the constant , to a local symmetry, parametrised by a function , or equivalently the Dirac equation is no longer invariant: there is a residual derivative of .

The fix proceeds as in scalar electrodynamics: the partial derivative is promoted to a covariant derivative The covariant derivative depends on the field being acted on. The newly introduced is the 4-vector potential from electrodynamics, but also can be viewed as a gauge field (which, mathematically, is defined as a connection).

The transformation law under gauge transformations for is then the usual but can also be derived by asking that covariant derivatives transform under a gauge transformation as We then obtain a gauge-invariant Dirac action by promoting the partial derivative to a covariant one: The final step needed to write down a gauge-invariant Lagrangian is to add a Maxwell Lagrangian term, Putting these together gives

QED Action

Expanding out the covariant derivative allows the action to be written in a second useful form:

Axial symmetry

Massless Dirac fermions, that is, fields satisfying the Dirac equation with , admit a second, inequivalent symmetry.

This is seen most easily by writing the four-component Dirac fermion as a pair of two-component vector fields, and adopting the chiral representation for the gamma matrices, so that may be written where has components and has components .

The Dirac action then takes the form That is, it decouples into a theory of two Weyl spinors or Weyl fermions.

The earlier vector symmetry is still present, where and rotate identically. This form of the action makes the second inequivalent symmetry manifest: This can also be expressed at the level of the Dirac fermion as where is the exponential map for matrices.

This isn't the only symmetry possible, but it is conventional. Any 'linear combination' of the vector and axial symmetries is also a symmetry.

Classically, the axial symmetry admits a well-formulated gauge theory. But at the quantum level, there is an anomaly, that is, an obstruction to gauging.

Extension to color symmetry

We can extend this discussion from an abelian symmetry to a general non-abelian symmetry under a gauge group , the group of color symmetries for a theory.

For concreteness, we fix , the special unitary group of matrices acting on .

Before this section, could be viewed as a spinor field on Minkowski space, in other words a function , and its components in are labelled by spin indices, conventionally Greek indices taken from the start of the alphabet .

Promoting the theory to a gauge theory, informally acquires a part transforming like , and these are labelled by color indices, conventionally Latin indices . In total, has components, given in indices by . The 'spinor' labels only how the field transforms under spacetime transformations.

Formally, is valued in a tensor product, that is, it is a function

Gauging proceeds similarly to the abelian case, with a few differences. Under a gauge transformation the spinor fields transform as The matrix-valued gauge field or connection transforms as and the covariant derivatives defined transform as

Writing down a gauge-invariant action proceeds exactly as with the case, replacing the Maxwell Lagrangian with the Yang–Mills Lagrangian where the Yang–Mills field strength or curvature is defined here as and is the matrix commutator.

The action is then

QCD Action

Physical applications

For physical applications, the case describes the quark sector of the Standard Model, which models strong interactions. Quarks are modelled as Dirac spinors; the gauge field is the gluon field. The case describes part of the electroweak sector of the Standard Model. Leptons such as electrons and neutrinos are the Dirac spinors; the gauge field is the gauge boson.

Generalisations

This expression can be generalised to arbitrary Lie group with connection and a representation , where the colour part of is valued in . Formally, the Dirac field is a function

Then transforms under a gauge transformation as and the covariant derivative is defined where here we view as a Lie algebra representation of the Lie algebra associated to .

This theory can be generalised to curved spacetime, but there are subtleties that arise in gauge theory on a general spacetime (or more generally still, a manifold), which can be ignored on flat spacetime. This is ultimately due to the contractibility of flat spacetime that allows us to view a gauge field and gauge transformations as defined globally on ⁠.

Thursday, July 3, 2025

Emic and etic

From Wikipedia, the free encyclopedia
https://en.wikipedia.org/wiki/Emic_and_etic

In anthropology, folkloristics, linguistics, and the social and behavioral sciences, emic (/ˈmɪk/) and etic (/ˈɛtɪk/) refer to two kinds of field research done and viewpoints obtained.

The emic approach is an insider's perspective, which looks at the beliefs, values, and practices of a particular culture from the perspective of the people who live within that culture. This approach aims to understand the cultural meaning and significance of a particular behavior or practice, as it is understood by the people who engage in it.

The etic approach, on the other hand, is an outsider's perspective, which looks at a culture from the perspective of an outside observer or researcher. This approach tends to focus on the observable behaviors and practices of a culture, and aims to understand them in terms of their functional or evolutionary significance. The etic approach often involves the use of standardized measures and frameworks to compare different cultures and may involve the use of concepts and theories from other disciplines, such as psychology or sociology.

The emic and etic approaches each have their own strengths and limitations, and each can be useful in understanding different aspects of culture and behavior. Some anthropologists argue that a combination of both approaches is necessary for a complete understanding of a culture, while others argue that one approach may be more appropriate depending on the specific research question being addressed.

Definitions

"The emic approach investigates how local people think...". How they perceive and categorize the world, their rules for behavior, what has meaning for them, and how they imagine and explain things. "The etic (scientist-oriented) approach shifts the focus from local observations, categories, explanations, and interpretations to those of the anthropologist. The etic approach realizes that members of a culture often are too involved in what they are doing... to interpret their cultures impartially. When using the etic approach, the ethnographer emphasizes what he or she considers important."

Although emics and etics are sometimes regarded as inherently in conflict and one can be preferred to the exclusion of the other, the complementarity of emic and etic approaches to anthropological research has been widely recognized, especially in the areas of interest concerning the characteristics of human nature as well as the form and function of human social systems.

...Emic knowledge and interpretations are those existing within a culture, that are 'determined by local custom, meaning, and belief' (Ager and Loughry, 2004: n.p.) and best described by a 'native' of the culture. Etic knowledge refers to generalizations about human behavior that are considered universally true, and commonly links cultural practices to factors of interest to the researcher, such as economic or ecological conditions, that cultural insiders may not consider very relevant (Morris et al., 1999).

Emic and etic approaches of understanding behavior and personality fall under the study of cultural anthropology, which states that people are shaped by their cultures and their subcultures, and we must account for this in the study of personality. One way is looking at things through an emic approach. This approach "is culture specific because it focuses on a single culture and it is understood on its own terms." As explained below, the term "emic" originated from the specific linguistic term "phonemic", from phoneme, which is a language-specific way of abstracting speech sounds.

  • An emic account is a description of behavior or a belief in terms meaningful (consciously or unconsciously) to the actor; that is, an emic account comes from a person within the culture. Almost anything from within a culture can provide an emic account.
  • An etic account is a description of a behavior or belief by a social analyst or scientific observer (a student or scholar of anthropology or sociology, for example), in terms that can be applied across cultures; that is, an etic account attempts to be 'culturally neutral', limiting any ethnocentric, political or cultural bias or alienation by the observer.

When these two approaches are combined, the "richest" view of a culture or society can be understood. On its own, an emic approach would struggle with applying overarching values to a single culture. The etic approach is helpful in enabling researchers to see more than one aspect of one culture, and in applying observations to cultures around the world.

History

The terms were coined in 1954 by linguist Kenneth Pike, who argued that the tools developed for describing linguistic behaviors could be adapted to the description of any human social behavior. As Pike noted, social scientists have long debated whether their knowledge is objective or subjective. Pike's innovation was to turn away from an epistemological debate and turn instead to a methodological solution. Emic and etic are derived from the linguistic terms phonemic and phonetic, respectively, where a phone is a distinct speech sound or gesture (such distinction being referred to as phonetic), regardless of whether the exact sound is critical to the meanings of words, whereas a phoneme is a speech sound in a given language that, if swapped with another phoneme, could change one word to another. The possibility of a truly objective description was discounted by Pike himself in his original work; he proposed the emic–etic dichotomy in anthropology as a way around philosophic issues about the very nature of objectivity.

The terms were also championed by anthropologists Ward Goodenough and Marvin Harris with slightly different connotations from those used by Pike. Goodenough was primarily interested in understanding the culturally specific meaning of specific beliefs and practices; Harris was primarily interested in explaining human behavior.

Pike, Harris, and others have argued that cultural "insiders" and "outsiders" are equally capable of producing emic and etic accounts of their culture. Some researchers use "etic" to refer to outsider accounts, and "emic" to refer to insider accounts.

Margaret Mead was an anthropologist who studied the patterns of adolescence in Samoa. She discovered that the difficulties and the transitions that adolescents faced are culturally influenced. The hormones that are released during puberty can be defined using an etic framework, because adolescents globally have the same hormones being secreted. However, Mead concluded that how adolescents respond to these hormones is greatly influenced by their cultural norms. Through her studies, Mead found that simple classifications about behaviors and personality could not be used because peoples’ cultures influenced their behaviors in such a radical way. Her studies helped create an emic approach of understanding behaviors and personality. Her research deduced that culture has a significant impact in shaping an individual's personality.

Carl Jung, a Swiss psychoanalyst, is a researcher who took an emic approach in his studies. Jung studied mythology, religion, ancient rituals, and dreams, leading him to believe that there are archetypes that can be identified and used to categorize people's behaviors. Archetypes are universal structures of the collective unconscious that refer to the inherent way people are predisposed to perceive and process information. The main archetypes that Jung studied were the persona (how people choose to present themselves to the world), the anima and animus (part of people experiencing the world in viewing the opposite sex, that guides how they select their romantic partner), and the shadow (dark side of personalities because people have a concept of evil; well-adjusted people must integrate both good and bad parts of themselves). Jung looked at the role of the mother and deduced that all people have mothers and see their mothers in a similar way; they offer nurture and comfort. His studies also suggest that "infants have evolved to suck milk from the breast, it is also the case that all children have inborn tendencies to react in certain ways." This way of looking at the mother is an emic way of applying a concept cross-culturally and universally.

Importance as regards personality

Emic and etic approaches are important to understanding personality because problems can arise "when concepts, measures, and methods are carelessly transferred to other cultures in attempts to make cross-cultural generalizations about personality." It is hard to apply certain generalizations of behavior to people who are so diverse and culturally different. One example of this is the F-scale (Macleod). The F-scale, which was created by Theodor Adorno, is used to measure authoritarian personality, which can, in turn, be used to predict prejudiced behaviors. This test, when applied to Americans, accurately depicts prejudices towards black individuals. However, when a study was conducted in South Africa using the F-Scale (Pettigrew and Friedman), results did not predict any prejudices towards black individuals. This study used emic approaches of study by conducting interviews with the locals and etic approaches by giving participants generalized personality tests.

Schrödinger's cat

From Wikipedia, the free encyclopedia
Schrödinger's cat: a cat, a flask of poison, and a radioactive source connected to a Geiger counter are placed in a sealed box. As illustrated, the quantum description uses a superposition of an alive cat and one that has died.

In quantum mechanics, Schrödinger's cat is a thought experiment concerning quantum superposition. In the thought experiment, a hypothetical cat in a closed box may be considered to be simultaneously both alive and dead while it is unobserved, as a result of its fate being linked to a random subatomic event that may or may not occur. This experiment, viewed this way, is described as a paradox. This thought experiment was devised by physicist Erwin Schrödinger in 1935 in a discussion with Albert Einstein to illustrate what Schrödinger saw as the problems of the Copenhagen interpretation of quantum mechanics.

In Schrödinger's original formulation, a cat, a flask of poison, and a radioactive source are placed in a sealed box. If an internal radiation monitor such as a Geiger counter detects radioactivity (a single atom decaying), the flask is shattered, releasing the poison, which kills the cat. If no decaying atom triggers the monitor, the cat remains alive. The Copenhagen interpretation implies that the cat is therefore simultaneously alive and dead. Yet, when one looks in the box, one sees the cat either alive or dead, not both alive and dead. This poses the question of when exactly quantum superposition ends and reality resolves into one possibility or the other.

Although originally a critique on the Copenhagen interpretation, Schrödinger's seemingly paradoxical thought experiment became part of the foundation of quantum mechanics. It is often featured in theoretical discussions of the interpretations of quantum mechanics, particularly in situations involving the measurement problem. As a result, Schrödinger's cat has had enduring appeal in popular culture. The experiment is not intended to be actually performed on a cat, but rather as an easily understandable illustration of the behavior of atoms. Experiments at the atomic scale have been carried out, showing that very small objects may exist as superpositions, but superposing an object as large as a cat would pose considerable technical difficulties.

Fundamentally, the Schrödinger's cat experiment asks how long quantum superpositions last and when (or whether) they collapse. Different interpretations of the mathematics of quantum mechanics have been proposed that give different explanations for this process.

Origin and motivation

Unsolved problem in physics
 
How does the quantum description of reality, which includes elements such as the superposition of states and wavefunction collapse or quantum decoherence, give rise to the reality we perceive? Another way of stating this question regards the measurement problem: What constitutes a "measurement" that apparently causes the wave function to collapse into a definite state?
 

Schrödinger intended his thought experiment as a discussion of the EPR article—named after its authors Einstein, Podolsky, and Rosen—in 1935. The EPR article highlighted the counterintuitive nature of quantum superpositions, in which a quantum system for two particles does not separate even when the particles are detected far from their last point of contact. The EPR paper concludes with a claim that this lack of separability meant that quantum mechanics as a theory of reality was incomplete.

Schrödinger and Einstein exchanged letters about Einstein's EPR article, in the course of which Einstein pointed out that the state of an unstable keg of gunpowder will, after a while, contain a superposition of both exploded and unexploded states.

To further illustrate, Schrödinger described how one could, in principle, create a superposition in a large-scale system by making it dependent on a quantum particle that was in a superposition. He proposed a scenario with a cat in a closed steel chamber, wherein the cat's life or death depended on the state of a radioactive atom, whether it had decayed and emitted radiation or not. According to Schrödinger, the Copenhagen interpretation implies that the cat remains both alive and dead until the state has been observed. Schrödinger did not wish to promote the idea of dead-and-live cats as a serious possibility; on the contrary, he intended the example to illustrate the absurdity of the existing view of quantum mechanics, thus employing reductio ad absurdum.

Since Schrödinger's time, various interpretations of the mathematics of quantum mechanics have been advanced by physicists, some of which regard the "alive and dead" cat superposition as quite real, others do not. Intended as a critique of the Copenhagen interpretation (the prevailing orthodoxy in 1935), the Schrödinger's cat thought experiment remains a touchstone for modern interpretations of quantum mechanics and can be used to illustrate and compare their strengths and weaknesses.

Thought experiment

A life-size cat figure in the garden of Huttenstrasse 9, Zurich, where Erwin Schrödinger lived from 1921 to 1926. Depending on the light conditions, the figure appears to be either a live cat or a dead one.

Schrödinger wrote:

One can contrive even completely burlesque [farcical] cases. A cat is put in a steel chamber along with the following infernal device (which must be secured against direct interference by the cat): in a Geiger counter, there is a tiny amount of radioactive substance, so tiny that in the course of an hour one of the atoms will perhaps decay, but also, with equal probability, that none of them will; if it does happen, the counter tube will discharge and through a relay release a hammer that will shatter a small flask of hydrocyanic acid. If one has left this entire system to itself for an hour, one would tell oneself that the cat is still alive if no atom has decayed in the meantime. Even a single atomic decay would have poisoned it. The psi-function of the entire system would express this by having in it the living and dead cat (pardon the expression) mixed or spread out in equal parts.

It is typical of these cases that an indeterminacy originally restricted to the atomic domain turns into a sensually observable [macroscopic] indeterminacy, which can then be resolved by direct observation. This prevents us from so naïvely accepting a "blurred model" as representative of reality. Per se, it would not embody anything unclear or contradictory. There is a difference between a shaky or out-of-focus photograph and a snapshot of clouds and fog banks.

Schrödinger developed his famous thought experiment in correspondence with Einstein. He suggested this 'quite ridiculous case' to illustrate his conclusion that the wave function cannot represent reality. The wave function description of the complete cat system implies that the reality of the cat mixes the living and dead cat. Einstein was impressed by the ability of the thought experiment to highlight these issues. In a letter to Schrödinger dated 1950, he wrote:

You are the only contemporary physicist, besides Laue, who sees that one cannot get around the assumption of reality, if only one is honest. Most of them simply do not see what sort of risky game they are playing with reality—reality as something independent of what is experimentally established. Their interpretation is, however, refuted most elegantly by your system of radioactive atom + amplifier + charge of gun powder + cat in a box, in which the psi-function of the system contains both the cat alive and blown to bits. Nobody really doubts that the presence or absence of the cat is something independent of the act of observation.

Note that the charge of gunpowder is not mentioned in Schrödinger's setup, which uses a Geiger counter as an amplifier and hydrocyanic poison instead of gunpowder. The gunpowder had been mentioned in Einstein's original suggestion to Schrödinger 15 years before, and Einstein carried it forward to the present discussion.

Analysis

In modern terms Schrödinger's hypothetical cat experiment describes the measurement problem: quantum theory describes the cat system as a combination of two possible outcomes but only one outcome is ever observed. The experiment poses the question, "when does a quantum system stop existing as a superposition of states and become one or the other?" (More technically, when does the actual quantum state stop being a non-trivial linear combination of states, each of which resembles different classical states, and instead begin to have a unique classical description?) Standard microscopic quantum mechanics describes multiple possible outcomes of experiments but only one outcome is observed. The thought experiment illustrates this apparent paradox. Our intuition says that the cat cannot be in more than one state simultaneously—yet the quantum mechanical description of the thought experiment requires such a condition.

Interpretations

Since Schrödinger's time, other interpretations of quantum mechanics have been proposed that give different answers to the questions posed by Schrödinger's cat of how long superpositions last and when (or whether) they collapse.

Copenhagen interpretation

A commonly held interpretation of quantum mechanics is the Copenhagen interpretation. In the Copenhagen interpretation, a measurement results in only one state of a superposition. This thought experiment makes apparent the fact that this interpretation simply provides no explanation for the state of the cat while the box is closed. The wavefunction description of the system consists of a superposition of the states "decayed nucleus/dead cat" and "undecayed nucleus/living cat". Only when the box is opened and observed can we make a statement about the cat.

Role of consciousness

In 1932, John von Neumann described in his book Mathematical Foundations of Quantum Mechanics a pattern where the radioactive source is observed by a device, which itself is observed by another device and so on. It makes no difference in the predictions of quantum theory where along this chain of causal effects the superposition collapses. This potentially infinite chain could be broken if the last device is replaced by a conscious observer. This solved the problem because it was claimed that an individual's consciousness cannot be multiple. Eugene Wigner asserted that an observer is necessary for a collapse to one or the other (e.g., either a live cat or a dead cat) of the terms on the right-hand side of a wave function. Wigner discussed the interpretation in a thought experiment known as Wigner's friend.

Wigner supposed that a friend opened the box and observed the cat without telling anyone. From Wigner's conscious perspective, the friend is now part of the wave function and has seen a live cat and seen a dead cat. To a third person's conscious perspective, Wigner himself becomes part of the wave function once Wigner learns the outcome from the friend. This could be extended indefinitely.

A resolution of the paradox is that the triggering of the Geiger counter counts as a measurement of the state of the radioactive substance. Because a measurement has already occurred deciding the state of the cat, the subsequent observation by a human records only what has already occurred. Analysis of an actual experiment by Roger Carpenter and A. J. Anderson found that measurement alone (for example by a Geiger counter) is sufficient to collapse a quantum wave function before any human knows of the result. The apparatus indicates one of two colors depending on the outcome. The human observer sees which color is indicated, but they don't consciously know which outcome the color represents. A second human, the one who set up the apparatus, is told of the color and becomes conscious of the outcome, and the box is opened to check if the outcome matches. However, it is disputed whether merely observing the color counts as a conscious observation of the outcome.

Bohr's interpretation

Analysis of the work of Niels Bohr, one of the main scientists associated with the Copenhagen interpretation, suggests he viewed the state of the cat before the box is opened as indeterminate. The superposition itself had no physical meaning to Bohr: Schrödinger's cat would be either dead or alive long before the box is opened but the cat and box form a inseparable combination. Bohr saw no role for a human observer. Bohr emphasized the classical nature of measurement results. An "irreversible" or effectively irreversible process imparts the classical behavior of "observation" or "measurement".

Many-worlds interpretation

The quantum-mechanical "Schrödinger's cat" paradox according to the many-worlds interpretation. In this interpretation, every event is a branch point. The cat is both alive and dead—regardless of whether the box is opened—but the "alive" and "dead" cats are in different branches of the universe that are equally real but cannot interact with each other.

In 1957, Hugh Everett formulated the many-worlds interpretation of quantum mechanics, which does not single out observation as a special process. In the many-worlds interpretation, both alive and dead states of the cat persist after the box is opened, but are decoherent from each other. In other words, when the box is opened, the observer and the possibly-dead cat split into an observer looking at a box with a dead cat and an observer looking at a box with a live cat. But since the dead and alive states are decoherent, there is no communication or interaction between them.

When opening the box, the observer becomes entangled with the cat, so "observer states" corresponding to the cat's being alive and dead are formed; each observer state is entangled, or linked, with the cat so that the observation of the cat's state and the cat's state correspond with each other. Quantum decoherence ensures that the different outcomes have no interaction with each other. Decoherence is generally considered to prevent simultaneous observation of multiple states.

A variant of the Schrödinger's cat experiment, known as the quantum suicide machine, has been proposed by cosmologist Max Tegmark. It examines the Schrödinger's cat experiment from the point of view of the cat, and argues that by using this approach, one may be able to distinguish between the Copenhagen interpretation and many-worlds.

Ensemble interpretation

The ensemble interpretation states that superpositions are nothing but subensembles of a larger statistical ensemble. The state vector would not apply to individual cat experiments, but only to the statistics of many similarly prepared cat experiments. Proponents of this interpretation state that this makes the Schrödinger's cat paradox a trivial matter, or a non-issue.

This interpretation serves to discard the idea that a single physical system in quantum mechanics has a mathematical description that corresponds to it in any way.

Relational interpretation

The relational interpretation makes no fundamental distinction between the human experimenter, the cat, and the apparatus or between animate and inanimate systems; all are quantum systems governed by the same rules of wavefunction evolution, and all may be considered "observers". But the relational interpretation allows that different observers can give different accounts of the same series of events, depending on the information they have about the system. The cat can be considered an observer of the apparatus; meanwhile, the experimenter can be considered another observer of the system in the box (the cat plus the apparatus). Before the box is opened, the cat, by nature of its being alive or dead, has information about the state of the apparatus (the atom has either decayed or not decayed); but the experimenter does not have information about the state of the box contents. In this way, the two observers simultaneously have different accounts of the situation: To the cat, the wavefunction of the apparatus has appeared to "collapse"; to the experimenter, the contents of the box appear to be in superposition. Not until the box is opened, and both observers have the same information about what happened, do both system states appear to "collapse" into the same definite result, a cat that is either alive or dead.

Transactional interpretation

In the transactional interpretation the apparatus emits an advanced wave backward in time, which combined with the wave that the source emits forward in time, forms a standing wave. The waves are seen as physically real, and the apparatus is considered an "observer". In the transactional interpretation, the collapse of the wavefunction is "atemporal" and occurs along the whole transaction between the source and the apparatus. The cat is never in superposition. Rather the cat is only in one state at any particular time, regardless of when the human experimenter looks in the box. The transactional interpretation resolves this quantum paradox.

Objective collapse theories

According to objective collapse theories, superpositions are destroyed spontaneously (irrespective of external observation) when some objective physical threshold (of time, mass, temperature, irreversibility, etc.) is reached. Thus, the cat would be expected to have settled into a definite state long before the box is opened. This could loosely be phrased as "the cat observes itself" or "the environment observes the cat".

Objective collapse theories require a modification of standard quantum mechanics to allow superpositions to be destroyed by the process of time evolution. These theories could ideally be tested by creating mesoscopic superposition states in the experiment. For instance, energy cat states has been proposed as a precise detector of the quantum gravity related energy decoherence models.

Applications and tests

The experiment as described is a purely theoretical one, and the machine proposed is not known to have been constructed. However, successful experiments involving similar principles, e.g. superpositions of relatively large (by the standards of quantum physics) objects have been performed. These experiments do not show that a cat-sized object can be superposed, but the known upper limit on "cat states" has been pushed upwards by them. In many cases the state is short-lived, even when cooled to near absolute zero.

  • A "cat state" has been achieved with photons.
  • A beryllium ion has been trapped in a superposed state.
  • An experiment involving a superconducting quantum interference device ("SQUID") has been linked to the theme of the thought experiment: "The superposition state does not correspond to a billion electrons flowing one way and a billion others flowing the other way. Superconducting electrons move en masse. All the superconducting electrons in the SQUID flow both ways around the loop at once when they are in the Schrödinger's cat state."
  • A piezoelectric "tuning fork" has been constructed, which can be placed into a superposition of vibrating and non vibrating states. The resonator comprises about 10 trillion atoms.
  • An experiment involving a flu virus has been proposed.
  • An experiment involving a bacterium and an electromechanical oscillator has been proposed.

In quantum computing the phrase "cat state" sometimes refers to the GHZ state, wherein several qubits are in an equal superposition of all being 0 and all being 1; e.g.,

According to at least one proposal, it may be possible to determine the state of the cat before observing it.

Extensions

In August 2020, physicists presented studies involving interpretations of quantum mechanics that are related to the Schrödinger's cat and Wigner's friend paradoxes, resulting in conclusions that challenge seemingly established assumptions about reality.

Critical point (mathematics)

From Wikipedia, the free encyclopedia https://en.wikipedia.org/wiki/Critical_point_(mathematics) ...