Search This Blog

Friday, May 25, 2018

Wolfgang Pauli

    From Wikipedia, the free encyclopedia
    Wolfgang Ernst Pauli
    Pauli.jpg
    Born Wolfgang Ernst Pauli
    25 April 1900
    Vienna, Austria-Hungary
    Died 15 December 1958 (aged 58)
    Zurich, Switzerland
    Nationality Austria
    Citizenship Austria-Hungary
    Switzerland
    United States
    Alma mater Ludwig-Maximilians University
    Known for
    Awards
    Scientific career
    Fields Theoretical physics
    Institutions University of Göttingen
    University of Copenhagen
    University of Hamburg
    ETH Zurich
    Institute for Advanced Study
    Thesis About the Hydrogen Molecular Ion Model[3] (1921)
    Doctoral advisor Arnold Sommerfeld[3][2]
    Other academic advisors Max Born[citation needed]
    Doctoral students
    Other notable students
    Influences
    Influenced Ralph Kronig[citation needed]
    Notes
    His godfather was Ernst Mach. He is not to be confused with Wolfgang Paul, who called Pauli his "imaginary part",[4] a pun with the imaginary unit i.

    Wolfgang Ernst Pauli (/ˈpɔːli/;[5] German: [ˈpaʊli]; 25 April 1900 – 15 December 1958) was an Austrian-born Swiss and American theoretical physicist and one of the pioneers of quantum physics. In 1945, after having been nominated by Albert Einstein,[6] Pauli received the Nobel Prize in Physics for his "decisive contribution through his discovery of a new law of Nature, the exclusion principle or Pauli principle". The discovery involved spin theory, which is the basis of a theory of the structure of matter.

    Biography

    Early years

    Pauli was born in Vienna to a chemist Wolfgang Joseph Pauli ( Wolf Pascheles, 1869–1955) and his wife Bertha Camilla Schütz; his sister was Hertha Pauli, the writer and actress. Pauli's middle name was given in honor of his godfather, physicist Ernst Mach. Pauli's paternal grandparents were from prominent Jewish families of Prague; his great-grandfather was the Jewish publisher Wolf Pascheles.[7] Pauli's father converted from Judaism to Roman Catholicism shortly before his marriage in 1899. Pauli's mother, Bertha Schütz, was raised in her own mother's Roman Catholic religion; her father was Jewish writer Friedrich Schütz. Pauli was raised as a Roman Catholic, although eventually he and his parents left the Church.[8] He is considered to have been a deist and a mystic.[9][10]

    Pauli attended the Döblinger-Gymnasium in Vienna, graduating with distinction in 1918. Only two months after graduation, he published his first paper, on Albert Einstein's theory of general relativity. He attended the Ludwig-Maximilians University in Munich, working under Arnold Sommerfeld,[2] where he received his PhD in July 1921 for his thesis on the quantum theory of ionized diatomic hydrogen (H+
    2
    ).[3][11]

    Sommerfeld asked Pauli to review the theory of relativity for the Encyklopädie der mathematischen Wissenschaften (Encyclopedia of Mathematical Sciences). Two months after receiving his doctorate, Pauli completed the article, which came to 237 pages. It was praised by Einstein; published as a monograph, it remains a standard reference on the subject to this day.

    Wolfgang Pauli lecturing

    Pauli spent a year at the University of Göttingen as the assistant to Max Born, and the following year at the Institute for Theoretical Physics in Copenhagen, which later became the Niels Bohr Institute in 1965. From 1923 to 1928, he was a lecturer at the University of Hamburg. During this period, Pauli was instrumental in the development of the modern theory of quantum mechanics. In particular, he formulated the exclusion principle and the theory of nonrelativistic spin.

    In 1928, he was appointed Professor of Theoretical Physics at ETH Zurich in Switzerland where he made significant scientific progress. He held visiting professorships at the University of Michigan in 1931, and the Institute for Advanced Study in Princeton in 1935. He was awarded the Lorentz Medal in 1931.

    At the end of 1930, shortly after his postulation of the neutrino and immediately following his divorce and the suicide of his mother, Pauli experienced a personal crisis. He consulted psychiatrist and psychotherapist Carl Jung who, like Pauli, lived near Zurich. Jung immediately began interpreting Pauli's deeply archetypal dreams,[12] and Pauli became one of the depth psychologist's best students. He soon began to criticize the epistemology of Jung's theory scientifically, and this contributed to a certain clarification of the latter's thoughts, especially about the concept of synchronicity. A great many of these discussions are documented in the Pauli/Jung letters, today published as Atom and Archetype. Jung's elaborate analysis of more than 400 of Pauli's dreams is documented in Psychology and Alchemy.

    The German annexation of Austria in 1938 made him a German citizen, which became a problem for him in 1939 after the outbreak of World War II. In 1940, he tried in vain to obtain Swiss citizenship, which would have allowed him to remain at the ETH.[13]

    Pauli moved to the United States in 1940, where he was employed as a professor of theoretical physics at the Institute for Advanced Study. In 1946, after the war, he became a naturalized citizen of the United States and subsequently returned to Zurich, where he mostly remained for the rest of his life. In 1949, he was granted Swiss citizenship.

    In 1958, Pauli was awarded the Max Planck medal. In that same year, he fell ill with pancreatic cancer. When his last assistant, Charles Enz, visited him at the Rotkreuz hospital in Zurich, Pauli asked him: "Did you see the room number?" It was number 137. Throughout his life, Pauli had been preoccupied with the question of why the fine structure constant, a dimensionless fundamental constant, has a value nearly equal to 1/137. Pauli died in that room on 15 December 1958.[14]

    Scientific research

    Niels Bohr, Werner Heisenberg, and Wolfgang Pauli, ca. 1935

    Pauli made many important contributions as a physicist, primarily in the field of quantum mechanics. He seldom published papers, preferring lengthy correspondences with colleagues such as Niels Bohr and Werner Heisenberg, with whom he had close friendships. Many of his ideas and results were never published and appeared only in his letters, which were often copied and circulated by their recipients.

    Pauli proposed in 1924 a new quantum degree of freedom (or quantum number) with two possible values, in order to resolve inconsistencies between observed molecular spectra and the developing theory of quantum mechanics. He formulated the Pauli exclusion principle, perhaps his most important work, which stated that no two electrons could exist in the same quantum state, identified by four quantum numbers including his new two-valued degree of freedom. The idea of spin originated with Ralph Kronig. George Uhlenbeck and Samuel Goudsmit one year later identified Pauli's new degree of freedom as electron spin.

    In 1926, shortly after Heisenberg published the matrix theory of modern quantum mechanics, Pauli used it to derive the observed spectrum of the hydrogen atom. This result was important in securing credibility for Heisenberg's theory.

    Pauli introduced the 2 × 2 Pauli matrices as a basis of spin operators, thus solving the nonrelativistic theory of spin. This work is sometimes said to have influenced Paul Dirac in his creation of the Dirac equation for the relativistic electron, though Dirac stated that he invented these same matrices himself independently at the time, without Pauli's influence. Dirac invented similar but larger (4x4) spin matrices for use in his relativistic treatment of fermionic spin.

    In 1930, Pauli considered the problem of beta decay. In a letter of 4 December to Lise Meitner et al., beginning, "Dear radioactive ladies and gentlemen", he proposed the existence of a hitherto unobserved neutral particle with a small mass, no greater than 1% the mass of a proton, in order to explain the continuous spectrum of beta decay. In 1934, Enrico Fermi incorporated the particle, which he called a neutrino, into his theory of beta decay. The neutrino was first confirmed experimentally in 1956 by Frederick Reines and Clyde Cowan, two and a half years before Pauli's death. On receiving the news, he replied by telegram: "Thanks for message. Everything comes to him who knows how to wait. Pauli."[15]

    In 1940, he re-derived the spin-statistics theorem, a critical result of quantum field theory which states that particles with half-integer spin are fermions, while particles with integer spin are bosons.

    In 1949, he published a paper on Pauli–Villars regularization: regularization is the term for techniques which modify infinite mathematical integrals to make them finite during calculations, so that one can identify whether the intrinsically infinite quantities in the theory (mass, charge, wavefunction) form a finite and hence calculable set which can be redefined in terms of their experimental values, which criterion is termed renormalization, and which removes infinities from quantum field theories, but also importantly allows the calculation of higher order corrections in perturbation theory.

    Pauli made repeated criticisms of the modern synthesis of evolutionary biology,[16][17] and his contemporary admirers point to modes of epigenetic inheritance as supportive of his arguments.[18]

    Personality and reputation

    Wolfgang Pauli, ca. 1924

    The Pauli effect was named after the anecdotal bizarre ability of his to break experimental equipment simply by being in the vicinity. Pauli was aware of his reputation and was delighted whenever the Pauli effect manifested. These strange occurrences were in line with his investigations into the legitimacy of parapsychology, particularly his collaboration with C. G. Jung on the concept of synchronicity.

    Regarding physics, Pauli was famously a perfectionist. This extended not just to his own work, but also to the work of his colleagues. As a result, he became known in the physics community as the "conscience of physics," the critic to whom his colleagues were accountable. He could be scathing in his dismissal of any theory he found lacking, often labelling it ganz falsch, utterly wrong.

    However, this was not his most severe criticism, which he reserved for theories or theses so unclearly presented as to be untestable or unevaluatable and, thus, not properly belonging within the realm of science, even though posing as such. They were worse than wrong because they could not be proven wrong. Famously, he once said of such an unclear paper: It is not even wrong!"[2]

    His supposed remark when meeting another leading physicist, Paul Ehrenfest, illustrates this notion of an arrogant Pauli. The two met at a conference for the first time. Ehrenfest was familiar with Pauli's papers and was quite impressed with them. After a few minutes of conversation, Ehrenfest remarked, "I think I like your Encyclopedia article [on relativity theory] better than I like you," to which Pauli shot back, "That's strange. With me, regarding you, it is just the opposite."[19] The two became very good friends from then on.

    A somewhat warmer picture emerges from this story, which appears in the article on Dirac:
    "Werner Heisenberg [in Physics and Beyond, 1971] recollects a friendly conversation among young participants at the 1927 Solvay Conference, about Einstein and Planck's views on religion. Wolfgang Pauli, Heisenberg, and Dirac took part in it. Dirac's contribution was a poignant and clear criticism of the political manipulation of religion, that was much appreciated for its lucidity by Bohr, when Heisenberg reported it to him later. Among other things, Dirac said: "I cannot understand why we idle discussing religion. If we are honest – and as scientists honesty is our precise duty – we cannot help but admit that any religion is a pack of false statements, deprived of any real foundation. The very idea of God is a product of human imagination. [...] I do not recognize any religious myth, at least because they contradict one another. [...]" Heisenberg's view was tolerant. Pauli had kept silent, after some initial remarks. But when finally he was asked for his opinion, jokingly he said: "Well, I'd say that also our friend Dirac has got a religion and the first commandment of this religion is 'God does not exist and Paul Dirac is his prophet'". Everybody burst into laughter, including Dirac.
    Many of Pauli's ideas and results were never published and appeared only in his letters, which were often copied and circulated by their recipients. Pauli may have been unconcerned that much of his work thus went uncredited, but when it came to Heisenberg's world-renowned 1958 lecture at Göttingen on their joint work on a unified field theory, and the press release calling Pauli a mere "assistant to Professor Heisenberg", Pauli became offended, denouncing Heisenberg's physics prowess. The deterioration between them resulted in Heisenberg ignoring Pauli's funeral, and writing in his autobiography that Pauli's criticisms were overwrought.[20] Pauli was elected a Foreign Member of the Royal Society (ForMemRS) in 1953.[2] In 1958 he became foreign member of the Royal Netherlands Academy of Arts and Sciences.[21]

    Personal life

    Buste of Wolfgang Pauli (1962)

    In May 1929, Pauli left the Roman Catholic Church. In December of that year, he married Käthe Margarethe Deppner. The marriage was an unhappy one, ending in divorce in 1930 after less than a year. He married again in 1934 to Franziska Bertram (1901–1987). They had no children.

    Bibliography

  • Pauli, Wolfgang; Jung, C. G. (1955). The Interpretation of Nature and the Psyche. Ishi Press. ISBN 4-87187-713-2.
  • Pauli, Wolfgang (1981). Theory of Relativity. New York: Dover Publications. ISBN 0-486-64152-X.
  • Pauli, Wolfgang; Jung, C. G. (2001). ed. C. A. Meier, ed. Atom and Archetype, The Pauli/Jung Letters, 1932–1958. Princeton, New Jersey: Princeton University Press. ISBN 978-0-691012-07-0.

Dirac equation

From Wikipedia, the free encyclopedia

In particle physics, the Dirac equation is a relativistic wave equation derived by British physicist Paul Dirac in 1928. In its free form, or including electromagnetic interactions, it describes all spin-1/2 massive particles such as electrons and quarks for which parity is a symmetry. It is consistent with both the principles of quantum mechanics and the theory of special relativity,[1] and was the first theory to account fully for special relativity in the context of quantum mechanics. It was validated by accounting for the fine details of the hydrogen spectrum in a completely rigorous way.

The equation also implied the existence of a new form of matter, antimatter, previously unsuspected and unobserved and which was experimentally confirmed several years later. It also provided a theoretical justification for the introduction of several component wave functions in Pauli's phenomenological theory of spin; the wave functions in the Dirac theory are vectors of four complex numbers (known as bispinors), two of which resemble the Pauli wavefunction in the non-relativistic limit, in contrast to the Schrödinger equation which described wave functions of only one complex value. Moreover, in the limit of zero mass, the Dirac equation reduces to the Weyl equation.

Although Dirac did not at first fully appreciate the importance of his results, the entailed explanation of spin as a consequence of the union of quantum mechanics and relativity—and the eventual discovery of the positron—represents one of the great triumphs of theoretical physics. This accomplishment has been described as fully on a par with the works of Newton, Maxwell, and Einstein before him.[2] In the context of quantum field theory, the Dirac equation is reinterpreted to describe quantum fields corresponding to spin-1/2 particles.

Mathematical formulation

The Dirac equation in the form originally proposed by Dirac is:[3]
Dirac equation (original) \left(\beta mc^{2}+c\left(\sum _{n{\mathop {=}}1}^{3}\alpha _{n}p_{n}\right)\right)\psi (x,t)=i\hbar {\frac {\partial \psi (x,t)}{\partial t}}
where ψ = ψ(x, t) is the wave function for the electron of rest mass m with spacetime coordinates x, t. The p1, p2, p3 are the components of the momentum, understood to be the momentum operator in the Schrödinger equation. Also, c is the speed of light, and ħ is the Planck constant divided by . These fundamental physical constants reflect special relativity and quantum mechanics, respectively.
Dirac's purpose in casting this equation was to explain the behavior of the relativistically moving electron, and so to allow the atom to be treated in a manner consistent with relativity. His rather modest hope was that the corrections introduced this way might have a bearing on the problem of atomic spectra. Up until that time, attempts to make the old quantum theory of the atom compatible with the theory of relativity, attempts based on discretizing the angular momentum stored in the electron's possibly non-circular orbit of the atomic nucleus, had failed – and the new quantum mechanics of Heisenberg, Pauli, Jordan, Schrödinger, and Dirac himself had not developed sufficiently to treat this problem. Although Dirac's original intentions were satisfied, his equation had far deeper implications for the structure of matter and introduced new mathematical classes of objects that are now essential elements of fundamental physics.

The new elements in this equation are the 4 × 4 matrices αk and β, and the four-component wave function ψ. There are four components in ψ because the evaluation of it at any given point in configuration space is a bispinor. It is interpreted as a superposition of a spin-up electron, a spin-down electron, a spin-up positron, and a spin-down positron (see below for further discussion).

The 4 × 4 matrices αk and β are all Hermitian and have squares equal to the identity matrix:
\alpha _{i}^{2}=\beta ^{2}=I_{4}
and they all mutually anticommute (if i and j are distinct):
\alpha _{i}\alpha _{j}+\alpha _{j}\alpha _{i}=0
\alpha _{i}\beta +\beta \alpha _{i}=0
The single symbolic equation thus unravels into four coupled linear first-order partial differential equations for the four quantities that make up the wave function. These matrices and the form of the wave function have a deep mathematical significance. The algebraic structure represented by the gamma matrices had been created some 50 years earlier by the English mathematician W. K. Clifford. In turn, Clifford's ideas had emerged from the mid-19th-century work of the German mathematician Hermann Grassmann in his Lineale Ausdehnungslehre (Theory of Linear Extensions). The latter had been regarded as well-nigh incomprehensible by most of his contemporaries. The appearance of something so seemingly abstract, at such a late date, and in such a direct physical manner, is one of the most remarkable chapters in the history of physics.

Making the Schrödinger equation relativistic

The Dirac equation is superficially similar to the Schrödinger equation for a massive free particle:
-{\frac {\hbar ^{2}}{2m}}\nabla ^{2}\phi =i\hbar {\frac {\partial }{\partial t}}\phi .
The left side represents the square of the momentum operator divided by twice the mass, which is the non-relativistic kinetic energy. Because relativity treats space and time as a whole, a relativistic generalization of this equation requires that space and time derivatives must enter symmetrically as they do in the Maxwell equations that govern the behavior of light — the equations must be differentially of the same order in space and time. In relativity, the momentum and the energies are the space and time parts of a spacetime vector, the four-momentum, and they are related by the relativistically invariant relation
{\displaystyle E^{2}=m^{2}c^{4}+p^{2}c^{2}}
which says that the length of this four-vector is proportional to the rest mass m. Substituting the operator equivalents of the energy and momentum from the Schrödinger theory, we get the Klein-Gordon equation describing the propagation of waves, constructed from relativistically invariant objects,
\left(-{\frac {1}{c^{2}}}{\frac {\partial ^{2}}{\partial t^{2}}}+\nabla ^{2}\right)\phi ={\frac {m^{2}c^{2}}{\hbar ^{2}}}\phi
with the wave function ϕ being a relativistic scalar: a complex number which has the same numerical value in all frames of reference. Space and time derivatives both enter to second order. This has a telling consequence for the interpretation of the equation. Because the equation is second order in the time derivative, one must specify initial values both of the wave function itself and of its first time-derivative in order to solve definite problems. Since both may be specified more or less arbitrarily, the wave function cannot maintain its former role of determining the probability density of finding the electron in a given state of motion. In the Schrödinger theory, the probability density is given by the positive definite expression
\rho =\phi ^{*}\phi \,
and this density is convected according to the probability current vector
J=-{\frac {i\hbar }{2m}}(\phi ^{*}\nabla \phi -\phi \nabla \phi ^{*})
with the conservation of probability current and density following from the continuity equation:
\nabla \cdot J+{\frac {\partial \rho }{\partial t}}=0.
The fact that the density is positive definite and convected according to this continuity equation implies that we may integrate the density over a certain domain and set the total to 1, and this condition will be maintained by the conservation law. A proper relativistic theory with a probability density current must also share this feature. Now, if we wish to maintain the notion of a convected density, then we must generalize the Schrödinger expression of the density and current so that space and time derivatives again enter symmetrically in relation to the scalar wave function. We are allowed to keep the Schrödinger expression for the current, but must replace the probability density by the symmetrically formed expression
{\displaystyle \rho ={\frac {i\hbar }{2mc^{2}}}(\psi ^{*}\partial _{t}\psi -\psi \partial _{t}\psi ^{*}).}
which now becomes the 4th component of a spacetime vector, and the entire probability 4-current density has the relativistically covariant expression
J^{\mu }={\frac {i\hbar }{2m}}(\psi ^{*}\partial ^{\mu }\psi -\psi \partial ^{\mu }\psi ^{*})
The continuity equation is as before. Everything is compatible with relativity now, but we see immediately that the expression for the density is no longer positive definite – the initial values of both ψ and tψ may be freely chosen, and the density may thus become negative, something that is impossible for a legitimate probability density. Thus, we cannot get a simple generalization of the Schrödinger equation under the naive assumption that the wave function is a relativistic scalar, and the equation it satisfies, second order in time.

Although it is not a successful relativistic generalization of the Schrödinger equation, this equation is resurrected in the context of quantum field theory, where it is known as the Klein–Gordon equation, and describes a spinless particle field (e.g. pi meson). Historically, Schrödinger himself arrived at this equation before the one that bears his name but soon discarded it. In the context of quantum field theory, the indefinite density is understood to correspond to the charge density, which can be positive or negative, and not the probability density.

Dirac's coup

Dirac thus thought to try an equation that was first order in both space and time. One could, for example, formally (i.e. by abuse of notation) take the relativistic expression for the energy
E=c{\sqrt {p^{2}+m^{2}c^{2}}}\,,
replace p by its operator equivalent, expand the square root in an infinite series of derivative operators, set up an eigenvalue problem, then solve the equation formally by iterations. Most physicists had little faith in such a process, even if it were technically possible.

As the story goes, Dirac was staring into the fireplace at Cambridge, pondering this problem, when he hit upon the idea of taking the square root of the wave operator thus:
\nabla ^{2}-{\frac {1}{c^{2}}}{\frac {\partial ^{2}}{\partial t^{2}}}=\left(A\partial _{x}+B\partial _{y}+C\partial _{z}+{\frac {i}{c}}D\partial _{t}\right)\left(A\partial _{x}+B\partial _{y}+C\partial _{z}+{\frac {i}{c}}D\partial _{t}\right).



On multiplying out the right side we see that, in order to get all the cross-terms such as xy to vanish, we must assume
AB+BA=0,\;\ldots
with
A^{2}=B^{2}=\ldots =1.\,
Dirac, who had just then been intensely involved with working out the foundations of Heisenberg's matrix mechanics, immediately understood that these conditions could be met if A, B, C and D are matrices, with the implication that the wave function has multiple components. This immediately explained the appearance of two-component wave functions in Pauli's phenomenological theory of spin, something that up until then had been regarded as mysterious, even to Pauli himself. However, one needs at least 4 × 4 matrices to set up a system with the properties required — so the wave function had four components, not two, as in the Pauli theory, or one, as in the bare Schrödinger theory. The four-component wave function represents a new class of mathematical object in physical theories that makes its first appearance here.

Given the factorization in terms of these matrices, one can now write down immediately an equation
\left(A\partial _{x}+B\partial _{y}+C\partial _{z}+{\frac {i}{c}}D\partial _{t}\right)\psi =\kappa \psi
with κ to be determined. Applying again the matrix operator on both sides yields
\left(\nabla ^{2}-{\frac {1}{c^{2}}}\partial _{t}^{2}\right)\psi =\kappa ^{2}\psi .
On taking κ = mc/ħ we find that all the components of the wave function individually satisfy the relativistic energy–momentum relation. Thus the sought-for equation that is first-order in both space and time is
\left(A\partial _{x}+B\partial _{y}+C\partial _{z}+{\frac {i}{c}}D\partial _{t}-{\frac {mc}{\hbar }}\right)\psi =0.
Setting
{\displaystyle A=i\beta \alpha _{1}\,,\,B=i\beta \alpha _{2}\,,\,C=i\beta \alpha _{3}\,,\,D=\beta \,,}
and because {\displaystyle D^{2}=\beta ^{2}=I_{4}\,,}
we get the Dirac equation as written above.

Covariant form and relativistic invariance

To demonstrate the relativistic invariance of the equation, it is advantageous to cast it into a form in which the space and time derivatives appear on an equal footing. New matrices are introduced as follows:
{\displaystyle D=\gamma ^{0}\,,}
{\displaystyle A=i\gamma ^{1}\,,\,B=i\gamma ^{2}\,,\,C=i\gamma ^{3}\,,}
and the equation takes the form (remembering the definition of the covariant components of the 4-gradient and especially that 0 = 1/ct )
Dirac equation i\hbar \gamma ^{\mu }\partial _{\mu }\psi -mc\psi =0
where there is an implied summation over the values of the twice-repeated index μ = 0, 1, 2, 3, and μ is the 4-gradient. In practice one often writes the gamma matrices in terms of 2 × 2 sub-matrices taken from the Pauli matrices and the 2 × 2 identity matrix. Explicitly the standard representation is
{\displaystyle \gamma ^{0}={\begin{pmatrix}I_{2}&0\\0&-I_{2}\end{pmatrix}},\gamma ^{1}=\left({\begin{array}{cccc}0&\sigma _{x}\\-\sigma _{x}&0\end{array}}\right),\gamma ^{2}=\left({\begin{array}{cccc}0&\sigma _{y}\\-\sigma _{y}&0\end{array}}\right),\gamma ^{3}=\left({\begin{array}{cccc}0&\sigma _{z}\\-\sigma _{z}&0\end{array}}\right).\,}



The complete system is summarized using the Minkowski metric on spacetime in the form
{\displaystyle \{\gamma ^{\mu },\gamma ^{\nu }\}=2\eta ^{\mu \nu }I_{4}\,}
where the bracket expression
\{a,b\}=ab+ba
denotes the anticommutator. These are the defining relations of a Clifford algebra over a pseudo-orthogonal 4-dimensional space with metric signature (+ − − −). The specific Clifford algebra employed in the Dirac equation is known today as the Dirac algebra. Although not recognized as such by Dirac at the time the equation was formulated, in hindsight the introduction of this geometric algebra represents an enormous stride forward in the development of quantum theory.

The Dirac equation may now be interpreted as an eigenvalue equation, where the rest mass is proportional to an eigenvalue of the 4-momentum operator, the proportionality constant being the speed of light:
P_{\mathrm {op} }\psi =mc\psi .\,
Using {\displaystyle {\partial \!\!\!/}\ {\stackrel {\mathrm {def} }{=}}\ \gamma ^{\mu }\partial _{\mu }} ({\partial \!\!\!{\big /}} is pronounced "d-slash"[4]) , according to Feynman slash notation, the Dirac equation becomes:
{\displaystyle i\hbar {\partial \!\!\!{\big /}}\psi -mc\psi =0.}
In practice, physicists often use units of measure such that ħ = c = 1, known as natural units. The equation then takes the simple form
Dirac equation (natural units) (i{\partial \!\!\!{\big /}}-m)\psi =0\,
A fundamental theorem states that if two distinct sets of matrices are given that both satisfy the Clifford relations, then they are connected to each other by a similarity transformation:
\gamma ^{\mu \prime }=S^{-1}\gamma ^{\mu }S.
If in addition the matrices are all unitary, as are the Dirac set, then S itself is unitary;
\gamma ^{\mu \prime }=U^{\dagger }\gamma ^{\mu }U.
The transformation U is unique up to a multiplicative factor of absolute value 1. Let us now imagine a Lorentz transformation to have been performed on the space and time coordinates, and on the derivative operators, which form a covariant vector. For the operator γμμ to remain invariant, the gammas must transform among themselves as a contravariant vector with respect to their spacetime index. These new gammas will themselves satisfy the Clifford relations, because of the orthogonality of the Lorentz transformation. By the fundamental theorem, we may replace the new set by the old set subject to a unitary transformation. In the new frame, remembering that the rest mass is a relativistic scalar, the Dirac equation will then take the form
(iU^{\dagger }\gamma ^{\mu }U\partial _{\mu }^{\prime }-m)\psi (x^{\prime },t^{\prime })=0
U^{\dagger }(i\gamma ^{\mu }\partial _{\mu }^{\prime }-m)U\psi (x^{\prime },t^{\prime })=0.
If we now define the transformed spinor
\psi ^{\prime }=U\psi
then we have the transformed Dirac equation in a way that demonstrates manifest relativistic invariance:
(i\gamma ^{\mu }\partial _{\mu }^{\prime }-m)\psi ^{\prime }(x^{\prime },t^{\prime })=0.
Thus, once we settle on any unitary representation of the gammas, it is final provided we transform the spinor according to the unitary transformation that corresponds to the given Lorentz transformation. The various representations of the Dirac matrices employed will bring into focus particular aspects of the physical content in the Dirac wave function (see below). The representation shown here is known as the standard representation – in it, the wave function's upper two components go over into Pauli's 2-spinor wave function in the limit of low energies and small velocities in comparison to light.

The considerations above reveal the origin of the gammas in geometry, hearkening back to Grassmann's original motivation – they represent a fixed basis of unit vectors in spacetime. Similarly, products of the gammas such as γμγν represent oriented surface elements, and so on. With this in mind, we can find the form of the unit volume element on spacetime in terms of the gammas as follows. By definition, it is
V={\frac {1}{4!}}\epsilon _{\mu \nu \alpha \beta }\gamma ^{\mu }\gamma ^{\nu }\gamma ^{\alpha }\gamma ^{\beta }.
For this to be an invariant, the epsilon symbol must be a tensor, and so must contain a factor of g, where g is the determinant of the metric tensor. Since this is negative, that factor is imaginary. Thus
V=i\gamma ^{0}\gamma ^{1}\gamma ^{2}\gamma ^{3}.\
This matrix is given the special symbol γ5, owing to its importance when one is considering improper transformations of spacetime, that is, those that change the orientation of the basis vectors. In the standard representation, it is
\gamma _{5}={\begin{pmatrix}0&I_{2}\\I_{2}&0\end{pmatrix}}.
This matrix will also be found to anticommute with the other four Dirac matrices:
\gamma ^{5}\gamma ^{\mu }+\gamma ^{\mu }\gamma ^{5}=0
It takes a leading role when questions of parity arise because the volume element as a directed magnitude changes sign under a spacetime reflection. Taking the positive square root above thus amounts to choosing a handedness convention on spacetime .

Conservation of probability current

By defining the adjoint spinor
{\bar {\psi }}=\psi ^{\dagger }\gamma ^{0}
where ψ is the conjugate transpose of ψ, and noticing that
(\gamma ^{\mu })^{\dagger }\gamma ^{0}=\gamma ^{0}\gamma ^{\mu }\,,
we obtain, by taking the Hermitian conjugate of the Dirac equation and multiplying from the right by γ0, the adjoint equation:
{\bar {\psi }}(-i\gamma ^{\mu }\partial _{\mu }-m)=0\,
where μ is understood to act to the left. Multiplying the Dirac equation by ψ from the left, and the adjoint equation by ψ from the right, and subtracting, produces the law of conservation of the Dirac current:
\partial _{\mu }\left({\bar {\psi }}\gamma ^{\mu }\psi \right)=0.
Now we see the great advantage of the first-order equation over the one Schrödinger had tried – this is the conserved current density required by relativistic invariance, only now its 4th component is positive definite and thus suitable for the role of a probability density:
J^{0}={\bar {\psi }}\gamma ^{0}\psi =\psi ^{\dagger }\psi .
Because the probability density now appears as the fourth component of a relativistic vector and not a simple scalar as in the Schrödinger equation, it will be subject to the usual effects of the Lorentz transformations such as time dilation. Thus, for example, atomic processes that are observed as rates, will necessarily be adjusted in a way consistent with relativity, while those involving the measurement of energy and momentum, which themselves form a relativistic vector, will undergo parallel adjustment which preserves the relativistic covariance of the observed values.

Solutions

Note that since the Dirac operator acts on 4-tuples of square-integrable functions, its solutions should be members of the same Hilbert space. The fact that the energies of the solutions do not have a lower bound is unexpected – see the hole theory section below for more details.

Comparison with the Pauli theory

The necessity of introducing half-integer spin goes back experimentally to the results of the Stern–Gerlach experiment. A beam of atoms is run through a strong inhomogeneous magnetic field, which then splits into N parts depending on the intrinsic angular momentum of the atoms. It was found that for silver atoms, the beam was split in two—the ground state therefore could not be integer, because even if the intrinsic angular momentum of the atoms were as small as possible, 1, the beam would be split into three parts, corresponding to atoms with Lz = −1, 0, +1. The conclusion is that silver atoms have net intrinsic angular momentum of ​12. Pauli set up a theory which explained this splitting by introducing a two-component wave function and a corresponding correction term in the Hamiltonian, representing a semi-classical coupling of this wave function to an applied magnetic field, as so in SI units: (Note that bold faced characters imply Euclidean vectors in 3 dimensions, where as the Minkowski four-vector Aμ can be defined as Aμ = (Φ/c, -A).)
H={\frac {1}{2m}}\left({\boldsymbol {\sigma }}\cdot \left(\mathbf {p} -e\mathbf {A} \right)\right)^{2}+e\phi .
Here A and \phi represent the components of the electromagnetic four-potential in their standard SI units, and the three sigmas are the Pauli matrices. On squaring out the first term, a residual interaction with the magnetic field is found, along with the usual classical Hamiltonian of a charged particle interacting with an applied field in SI units:
H={\frac {1}{2m}}\left(\mathbf {p} -e\mathbf {A} \right)^{2}+e\phi -{\frac {e\hbar }{2m}}{\boldsymbol {\sigma }}\cdot \mathbf {B} .
This Hamiltonian is now a 2 × 2 matrix, so the Schrödinger equation based on it must use a two-component wave function. On introducing the external electromagnetic 4-vector potential into the Dirac equation in a similar way, known as minimal coupling, it takes the form :
{\displaystyle (\gamma ^{\mu }(i\hbar \partial _{\mu }-eA_{\mu })-mc)\psi =0\,.}
A second application of the Dirac operator will now reproduce the Pauli term exactly as before, because the spatial Dirac matrices multiplied by i, have the same squaring and commutation properties as the Pauli matrices. What is more, the value of the gyromagnetic ratio of the electron, standing in front of Pauli's new term, is explained from first principles. This was a major achievement of the Dirac equation and gave physicists great faith in its overall correctness. There is more however. The Pauli theory may be seen as the low energy limit of the Dirac theory in the following manner. First the equation is written in the form of coupled equations for 2-spinors with the SI units restored:
{\begin{pmatrix}(mc^{2}-E+e\phi )&c{\boldsymbol {\sigma }}\cdot \left(\mathbf {p} -e\mathbf {A} \right)\\-c{\boldsymbol {\sigma }}\cdot \left(\mathbf {p} -e\mathbf {A} \right)&\left(mc^{2}+E-e\phi \right)\end{pmatrix}}{\begin{pmatrix}\psi _{+}\\\psi _{-}\end{pmatrix}}={\begin{pmatrix}0\\0\end{pmatrix}}.
so
(E-e\phi )\psi _{+}-c{\boldsymbol {\sigma }}\cdot \left(\mathbf {p} -e\mathbf {A} \right)\psi _{-}=mc^{2}\psi _{+}
-(E-e\phi )\psi _{-}+c{\boldsymbol {\sigma }}\cdot \left(\mathbf {p} -e\mathbf {A} \right)\psi _{+}=mc^{2}\psi _{-}
Assuming the field is weak and the motion of the electron non-relativistic, we have the total energy of the electron approximately equal to its rest energy, and the momentum going over to the classical value,
E-e\phi \approx mc^{2}
\mathbf {p} \approx m\mathbf {v}
and so the second equation may be written
\psi _{-}\approx {\frac {1}{2mc}}{\boldsymbol {\sigma }}\cdot \left(\mathbf {p} -e\mathbf {A} \right)\psi _{+}
which is of order v/c – thus at typical energies and velocities, the bottom components of the Dirac spinor in the standard representation are much suppressed in comparison to the top components. Substituting this expression into the first equation gives after some rearrangement
(E-mc^{2})\psi _{+}={\frac {1}{2m}}\left[{\boldsymbol {\sigma }}\cdot \left(\mathbf {p} -e\mathbf {A} \right)\right]^{2}\psi _{+}+e\phi \psi _{+}
The operator on the left represents the particle energy reduced by its rest energy, which is just the classical energy, so we recover Pauli's theory if we identify his 2-spinor with the top components of the Dirac spinor in the non-relativistic approximation. A further approximation gives the Schrödinger equation as the limit of the Pauli theory. Thus, the Schrödinger equation may be seen as the far non-relativistic approximation of the Dirac equation when one may neglect spin and work only at low energies and velocities. This also was a great triumph for the new equation, as it traced the mysterious i that appears in it, and the necessity of a complex wave function, back to the geometry of spacetime through the Dirac algebra. It also highlights why the Schrödinger equation, although superficially in the form of a diffusion equation, actually represents the propagation of waves.

It should be strongly emphasized that this separation of the Dirac spinor into large and small components depends explicitly on a low-energy approximation. The entire Dirac spinor represents an irreducible whole, and the components we have just neglected to arrive at the Pauli theory will bring in new phenomena in the relativistic regime – antimatter and the idea of creation and annihilation of particles.

Comparison with the Weyl theory

In the limit m → 0, the Dirac equation reduces to the Weyl equation, which describes relativistic massless spin-​12 particles.[5]

Dirac Lagrangian

Both the Dirac equation and the Adjoint Dirac equation can be obtained from (varying) the action with a specific Lagrangian density that is given by:

{\mathcal {L}}=i\hbar c{\overline {\psi }}\gamma ^{\mu }\partial _{\mu }\psi -mc^{2}{\overline {\psi }}\psi

If one varies this with respect to ψ one gets the Adjoint Dirac equation. Meanwhile, if one varies this with respect to ψ one gets the Dirac equation.

Physical interpretation

Identification of observables

The critical physical question in a quantum theory is—what are the physically observable quantities defined by the theory? According to the postulates of quantum mechanics, such quantities are defined by Hermitian operators that act on the Hilbert space of possible states of a system. The eigenvalues of these operators are then the possible results of measuring the corresponding physical quantity. In the Schrödinger theory, the simplest such object is the overall Hamiltonian, which represents the total energy of the system. If we wish to maintain this interpretation on passing to the Dirac theory, we must take the Hamiltonian to be
{\displaystyle H=\gamma ^{0}\left[mc^{2}+c\gamma ^{k}\left(p_{k}-qA_{k}\right)\right]+qA^{0}.}
where, as always, there is an implied summation over the twice-repeated index k = 1, 2, 3. This looks promising, because we see by inspection the rest energy of the particle and, in case A = 0, the energy of a charge placed in an electric potential qA0. What about the term involving the vector potential? In classical electrodynamics, the energy of a charge moving in an applied potential is
{\displaystyle H=c{\sqrt {\left(p-qA\right)^{2}+m^{2}c^{2}}}+qA^{0}.}
Thus, the Dirac Hamiltonian is fundamentally distinguished from its classical counterpart, and we must take great care to correctly identify what is observable in this theory. Much of the apparently paradoxical behavior implied by the Dirac equation amounts to a misidentification of these observables.

Hole theory

The negative E solutions to the equation are problematic, for it was assumed that the particle has a positive energy. Mathematically speaking, however, there seems to be no reason for us to reject the negative-energy solutions. Since they exist, we cannot simply ignore them, for once we include the interaction between the electron and the electromagnetic field, any electron placed in a positive-energy eigenstate would decay into negative-energy eigenstates of successively lower energy. Real electrons obviously do not behave in this way, or they would disappear by emitting energy in the form of photons.

To cope with this problem, Dirac introduced the hypothesis, known as hole theory, that the vacuum is the many-body quantum state in which all the negative-energy electron eigenstates are occupied. This description of the vacuum as a "sea" of electrons is called the Dirac sea. Since the Pauli exclusion principle forbids electrons from occupying the same state, any additional electron would be forced to occupy a positive-energy eigenstate, and positive-energy electrons would be forbidden from decaying into negative-energy eigenstates.

If an electron is forbidden from simultaneously occupying positive-energy and negative-energy eigenstates, then the feature known as Zitterbewegung, which arises from the interference of positive-energy and negative-energy states, would have to be considered to be an unphysical prediction of time-dependent Dirac theory. This conclusion may be inferred from the explanation of hole theory given in the preceding paragraph. Recent results have been published in Nature [R. Gerritsma, G. Kirchmair, F. Zaehringer, E. Solano, R. Blatt, and C. Roos, Nature 463, 68-71 (2010)] in which the Zitterbewegung feature was simulated in a trapped-ion experiment. This experiment impacts the hole interpretation if one infers that the physics-laboratory experiment is not merely a check on the mathematical correctness of a Dirac-equation solution but the measurement of a real effect whose detectability in electron physics is still beyond reach.

Dirac further reasoned that if the negative-energy eigenstates are incompletely filled, each unoccupied eigenstate – called a hole – would behave like a positively charged particle. The hole possesses a positive energy since energy is required to create a particle–hole pair from the vacuum. As noted above, Dirac initially thought that the hole might be the proton, but Hermann Weyl pointed out that the hole should behave as if it had the same mass as an electron, whereas the proton is over 1800 times heavier. The hole was eventually identified as the positron, experimentally discovered by Carl Anderson in 1932.

It is not entirely satisfactory to describe the "vacuum" using an infinite sea of negative-energy electrons. The infinitely negative contributions from the sea of negative-energy electrons have to be canceled by an infinite positive "bare" energy and the contribution to the charge density and current coming from the sea of negative-energy electrons is exactly canceled by an infinite positive "jellium" background so that the net electric charge density of the vacuum is zero. In quantum field theory, a Bogoliubov transformation on the creation and annihilation operators (turning an occupied negative-energy electron state into an unoccupied positive energy positron state and an unoccupied negative-energy electron state into an occupied positive energy positron state) allows us to bypass the Dirac sea formalism even though, formally, it is equivalent to it.

In certain applications of condensed matter physics, however, the underlying concepts of "hole theory" are valid. The sea of conduction electrons in an electrical conductor, called a Fermi sea, contains electrons with energies up to the chemical potential of the system. An unfilled state in the Fermi sea behaves like a positively charged electron, though it is referred to as a "hole" rather than a "positron". The negative charge of the Fermi sea is balanced by the positively charged ionic lattice of the material.

In quantum field theory

In quantum field theories such as quantum electrodynamics, the Dirac field is subject to a process of second quantization, which resolves some of the paradoxical features of the equation.

Other formulations

The Dirac equation can be formulated in a number of other ways.

As a differential equation in one real component

Generically (if a certain linear function of electromagnetic field does not vanish identically), three out of four components of the spinor function in the Dirac equation can be algebraically eliminated, yielding an equivalent fourth-order partial differential equation for just one component. Furthermore, this remaining component can be made real by a gauge transform.[6]

Curved spacetime

This article has developed the Dirac equation in flat spacetime according to special relativity. It is possible to formulate the Dirac equation in curved spacetime.

The algebra of physical space

This article developed the Dirac equation using four vectors and Schrödinger operators. The Dirac equation in the algebra of physical space uses a Clifford algebra over the real numbers, a type of geometric algebra.

Delayed-choice quantum eraser

From Wikipedia, the free encyclopedia https://en.wikipedia.org/wiki/Delayed-choice_quantum_eraser A delayed-cho...