In physics, scattering is a wide range of physical processes where moving particles or radiation of some form, such as light or sound, are forced to deviate from a straight trajectory
by localized non-uniformities (including particles and radiation) in
the medium through which they pass. In conventional use, this also
includes deviation of reflected radiation from the angle predicted by
the law of reflection. Reflections of radiation that undergo scattering are often called diffuse reflections and unscattered reflections are called specular (mirror-like) reflections. Originally, the term was confined to light scattering (going back at least as far as Isaac Newton in the 17th century). As more "ray"-like phenomena were discovered, the idea of scattering was extended to them, so that William Herschel could refer to the scattering of "heat rays" (not then recognized as electromagnetic in nature) in 1800. John Tyndall, a pioneer in light scattering research, noted the connection between light scattering and acoustic scattering in the 1870s. Near the end of the 19th century, the scattering of cathode rays (electron beams) and X-rays was observed and discussed. With the discovery of subatomic particles (e.g. Ernest Rutherford in 1911)
and the development of quantum theory in the 20th century, the sense of
the term became broader as it was recognized that the same mathematical
frameworks used in light scattering could be applied to many other
phenomena.
The types of non-uniformities which can cause scattering, sometimes known as scatterers or scattering centers, are too numerous to list, but a small sample includes particles, bubbles, droplets, density fluctuations in fluids, crystallites in polycrystalline solids, defects in monocrystalline solids, surface roughness, cells in organisms, and textile fibers
in clothing. The effects of such features on the path of almost any
type of propagating wave or moving particle can be described in the
framework of scattering theory.
When radiation is only scattered by one localized scattering center, this is called single scattering.
It is more common that scattering centers are grouped together; in such
cases, radiation may scatter many times, in what is known as multiple scattering. The main difference between the effects of single and multiple
scattering is that single scattering can usually be treated as a random
phenomenon, whereas multiple scattering, somewhat counterintuitively,
can be modeled as a more deterministic process because the combined
results of a large number of scattering events tend to average out.
Multiple scattering can thus often be modeled well with diffusion theory.
Because the location of a single scattering center is not usually
well known relative to the path of the radiation, the outcome, which
tends to depend strongly on the exact incoming trajectory, appears
random to an observer. This type of scattering would be exemplified by
an electron being fired at an atomic nucleus. In this case, the atom's
exact position relative to the path of the electron is unknown and would
be unmeasurable, so the exact trajectory of the electron after the
collision cannot be predicted. Single scattering is therefore often
described by probability distributions.
With multiple scattering, the randomness of the interaction tends
to be averaged out by a large number of scattering events, so that the
final path of the radiation appears to be a deterministic distribution
of intensity. This is exemplified by a light beam passing through thick fog. Multiple scattering is highly analogous to diffusion, and the terms multiple scattering and diffusion are interchangeable in many contexts. Optical elements designed to produce multiple scattering are thus known as diffusers. Coherent backscattering, an enhancement of backscattering that occurs when coherent radiation is multiply scattered by a random medium, is usually attributed to weak localization.
Not all single scattering is random, however. A well-controlled
laser beam can be exactly positioned to scatter off a microscopic
particle with a deterministic outcome, for instance. Such situations are
encountered in radar scattering as well, where the targets tend to be macroscopic objects such as people or aircraft.
Similarly, multiple scattering can sometimes have somewhat random
outcomes, particularly with coherent radiation. The random fluctuations
in the multiply scattered intensity of coherent radiation are called speckles.
Speckle also occurs if multiple parts of a coherent wave scatter from
different centers. In certain rare circumstances, multiple scattering
may only involve a small number of interactions such that the randomness
is not completely averaged out. These systems are considered to be some
of the most difficult to model accurately.
The description of scattering and the distinction between single and multiple scattering are tightly related to wave–particle duality.
Theory
Scattering theory is a framework for studying and understanding the scattering of waves and particles.
Wave scattering corresponds to the collision and scattering of a wave
with some material object, for instance sunlight scattered by rain drops to form a rainbow. Scattering also includes the interaction of billiard balls on a table, the Rutherford scattering (or angle change) of alpha particles by goldnuclei, the Bragg scattering (or diffraction) of electrons and X-rays by a cluster of atoms, and the inelastic scattering of a fission fragment as it traverses a thin foil. More precisely, scattering consists of the study of how solutions of partial differential equations, propagating freely "in the distant past", come together and interact with one another or with a boundary condition, and then propagate away "to the distant future".
The direct scattering problem is the problem of
determining the distribution of scattered radiation/particle flux basing
on the characteristics of the scatterer. The inverse scattering problem
is the problem of determining the characteristics of an object (e.g.,
its shape, internal constitution) from measurement data of radiation or
particles scattered from the object.
Attenuation due to scattering
Equivalent quantities used in the theory of scattering from composite specimens, but with a variety of units
When the target is a set of many scattering centers whose relative
position varies unpredictably, it is customary to think of a range
equation whose arguments take different forms in different application
areas. In the simplest case consider an interaction that removes
particles from the "unscattered beam" at a uniform rate that is
proportional to the incident number of particles per unit area per unit
time (), i.e. that
where Q is an interaction coefficient and x is the distance traveled in the target.
where Io is the initial flux, path length Δx ≡ x − xo, the second equality defines an interaction mean free path λ, the third uses the number of targets per unit volume η to define an area cross-section
σ, and the last uses the target mass density ρ to define a density mean
free path τ. Hence one converts between these quantities via Q = 1/λ = ησ = ρ/τ, as shown in the figure at left.
In electromagnetic absorption spectroscopy, for example, interaction coefficient (e.g. Q in cm−1) is variously called opacity, absorption coefficient, and attenuation coefficient. In nuclear physics, area cross-sections (e.g. σ in barns or units of 10−24 cm2), density mean free path (e.g. τ in grams/cm2), and its reciprocal the mass attenuation coefficient (e.g. in cm2/gram) or area per nucleon are all popular, while in electron microscopy the inelastic mean free path (e.g. λ in nanometers) is often discussed instead.
Elastic and inelastic scattering
The term "elastic scattering" implies that the internal states of the
scattering particles do not change, and hence they emerge unchanged
from the scattering process. In inelastic scattering, by contrast, the
particles' internal state is changed, which may amount to exciting some
of the electrons of a scattering atom, or the complete annihilation of a
scattering particle and the creation of entirely new particles.
The example of scattering in quantum chemistry
is particularly instructive, as the theory is reasonably complex while
still having a good foundation on which to build an intuitive
understanding. When two atoms are scattered off one another, one can
understand them as being the bound state solutions of some differential equation. Thus, for example, the hydrogen atom corresponds to a solution to the Schrödinger equation with a negative inverse-power (i.e., attractive Coulombic) central potential. The scattering of two hydrogen atoms will disturb the state of each atom, resulting in one or both becoming excited, or even ionized, representing an inelastic scattering process.
The term "deep inelastic scattering" refers to a special kind of scattering experiment in particle physics.
Mathematical framework
In mathematics, scattering theory deals with a more abstract formulation of the same set of concepts. For example, if a differential equation
is known to have some simple, localized solutions, and the solutions
are a function of a single parameter, that parameter can take the
conceptual role of time.
One then asks what might happen if two such solutions are set up far
away from each other, in the "distant past", and are made to move
towards each other, interact (under the constraint of the differential
equation) and then move apart in the "future". The scattering matrix
then pairs solutions in the "distant past" to those in the "distant
future".
Solutions to differential equations are often posed on manifolds. Frequently, the means to the solution requires the study of the spectrum of an operator on the manifold. As a result, the solutions often have a spectrum that can be identified with a Hilbert space, and scattering is described by a certain map, the S matrix, on Hilbert spaces. Solutions with a discrete spectrum correspond to bound states in quantum mechanics, while a continuous spectrum
is associated with scattering states. The study of inelastic
scattering then asks how discrete and continuous spectra are mixed
together.
Top: the real part of a plane wave
travelling upwards. Bottom: The real part of the field after inserting
in the path of the plane wave a small transparent disk of index of refraction
higher than the index of the surrounding medium. This object scatters
part of the wave field, although at any individual point, the wave's
frequency and wavelength remain intact.
In regular quantum mechanics, which includes quantum chemistry, the relevant equation is the Schrödinger equation, although equivalent formulations, such as the Lippmann-Schwinger equation and the Faddeev equations,
are also largely used. The solutions of interest describe the long-term
motion of free atoms, molecules, photons, electrons, and protons. The
scenario is that several particles come together from an infinite
distance away. These reagents then collide, optionally reacting, getting
destroyed or creating new particles. The products and unused reagents
then fly away to infinity again. (The atoms and molecules are
effectively particles for our purposes. Also, under everyday
circumstances, only photons are being created and destroyed.) The
solutions reveal which directions the products are most likely to fly
off to and how quickly. They also reveal the probability of various
reactions, creations, and decays occurring. There are two predominant
techniques of finding solutions to scattering problems: partial wave analysis, and the Born approximation.
Electromagnetics
A Feynman diagram of scattering between two electrons by emission of a virtual photon
Electromagnetic waves are one of the best known and most commonly encountered forms of radiation that undergo scattering. Scattering of light and radio waves (especially in radar) is
particularly important. Several different aspects of electromagnetic
scattering are distinct enough to have conventional names. Major forms
of elastic light scattering (involving negligible energy transfer) are Rayleigh scattering and Mie scattering. Inelastic scattering includes Brillouin scattering, Raman scattering, inelastic X-ray scattering and Compton scattering.
Light scattering is one of the two major physical processes that
contribute to the visible appearance of most objects, the other being
absorption. Surfaces described as white owe their appearance to
multiple scattering of light by internal or surface inhomogeneities in
the object, for example by the boundaries of transparent microscopic
crystals that make up a stone or by the microscopic fibers in a sheet of
paper. More generally, the gloss (or lustre or sheen)
of the surface is determined by scattering. Highly scattering surfaces
are described as being dull or having a matte finish, while the absence
of surface scattering leads to a glossy appearance, as with polished
metal or stone.
Spectral absorption, the selective absorption of certain colors, determines the color of most objects with some modification by elastic scattering. The apparent blue color of veins
in skin is a common example where both spectral absorption and
scattering play important and complex roles in the coloration. Light
scattering can also create color without absorption, often shades of
blue, as with the sky (Rayleigh scattering), the human blue iris, and the feathers of some birds (Prum et al. 1998). However, resonant light scattering in nanoparticles can produce many different highly saturated and vibrant hues, especially when surface plasmon resonance is involved (Roqué et al. 2006).
Models of light scattering can be divided into three domains based on a dimensionless size parameter, α which is defined as:
where πDp is the circumference of a particle and λ is the wavelength of incident radiation in the medium. Based on the value of α, these domains are:
α ≪ 1: Rayleigh scattering (small particle compared to wavelength of light);
α ≈ 1: Mie scattering (particle about the same size as wavelength of light, valid only for spheres);
α ≫ 1: geometric scattering (particle much larger than wavelength of light).
Rayleigh scattering is a process in which electromagnetic radiation
(including light) is scattered by a small spherical volume of variant
refractive indexes, such as a particle, bubble, droplet, or even a
density fluctuation. This effect was first modeled successfully by Lord Rayleigh, from whom it gets its name. In order for Rayleigh's model to apply, the sphere must be much smaller in diameter than the wavelength (λ)
of the scattered wave; typically the upper limit is taken to be about
1/10 the wavelength. In this size regime, the exact shape of the
scattering center is usually not very significant and can often be
treated as a sphere of equivalent volume. The inherent scattering that
radiation undergoes passing through a pure gas is due to microscopic
density fluctuations as the gas molecules move around, which are
normally small enough in scale for Rayleigh's model to apply. This
scattering mechanism is the primary cause of the blue color of the
Earth's sky on a clear day, as the shorter blue wavelengths of sunlight
passing overhead are more strongly scattered than the longer red
wavelengths according to Rayleigh's famous 1/λ4 relation. Along with absorption, such scattering is a major cause of the attenuation of radiation by the atmosphere. The degree of scattering varies as a function of the ratio of the
particle diameter to the wavelength of the radiation, along with many
other factors including polarization, angle, and coherence.
For larger diameters, the problem of electromagnetic scattering by spheres was first solved by Gustav Mie,
and scattering by spheres larger than the Rayleigh range is therefore
usually known as Mie scattering. In the Mie regime, the shape of the
scattering center becomes much more significant and the theory only
applies well to spheres and, with some modification, spheroids and ellipsoids.
Closed-form solutions for scattering by certain other simple shapes
exist, but no general closed-form solution is known for arbitrary
shapes.
Both Mie and Rayleigh scattering are considered elastic
scattering processes, in which the energy (and thus wavelength and
frequency) of the light is not substantially changed. However,
electromagnetic radiation scattered by moving scattering centers does
undergo a Doppler shift, which can be detected and used to measure the velocity of the scattering center/s in forms of techniques such as lidar and radar. This shift involves a slight change in energy.
At values of the ratio of particle diameter to wavelength more than about 10, the laws of geometric optics
are mostly sufficient to describe the interaction of light with the
particle. Mie theory can still be used for these larger spheres, but the
solution often becomes numerically unwieldy.
For modeling of scattering in cases where the Rayleigh and Mie
models do not apply such as larger, irregularly shaped particles, there
are many numerical methods that can be used. The most common are finite-element methods which solve Maxwell's equations
to find the distribution of the scattered electromagnetic field.
Sophisticated software packages exist which allow the user to specify
the refractive index or indices of the scattering feature in space,
creating a 2- or sometimes 3-dimensional model of the structure. For
relatively large and complex structures, these models usually require
substantial execution times on a computer.
Electrophoresis involves the migration of macromolecules under the influence of an electric field. Electrophoretic light scattering
involves passing an electric field through a liquid which makes
particles move. The bigger the charge is on the particles, the faster
they are able to move.
As with other elementary particles, photons are best explained by quantum mechanics and exhibit wave–particle duality, their behavior featuring properties of both waves and particles. The modern photon concept originated during the first two decades of the 20th century with the work of Albert Einstein, who built upon the research of Max Planck. While Planck was trying to explain how matter and electromagnetic radiation could be in thermal equilibrium with one another, he proposed that the energy stored within a material object should be regarded as composed of an integer number of discrete, equal-sized parts. To explain the photoelectric effect, Einstein introduced the idea that light itself is made of discrete units of energy. In 1926, Gilbert N. Lewis popularized the term photon for these energy units. Subsequently, many other experiments validated Einstein's approach.
The photon has no electric charge, is generally considered to have zero rest mass, and is a stable particle. The experimental upper limit on the photon massis very small, on the order of 10−53 g; its lifetime would be more than 1018 years. For comparison, the age of the universe is about 1.38×1010 years.
Single photons have been shown to travel at the speed of light in vacuum. The measurement uses heralded single photon source.
The cone shows possible values of wave 4-vector of a photon. The "time" axis gives the angular frequency (rad⋅s−1) and the "space" axis represents the angular wavenumber (rad⋅m−1). Green and indigo represent left and right polarization.
In a quantum mechanical model, electromagnetic waves transfer energy in photons with energy proportional to frequency ()
The angular momentum of the photon has two possible values, either +ħ or −ħ. These two possible values correspond to the two possible pure states of circular polarization.
Collections of photons in a light beam may have mixtures of these two
values; a linearly polarized light beam will act as if it were composed
of equal numbers of the two possible angular momenta.
The spin angular momentum of light does not depend on its frequency, and was experimentally verified by C. V. Raman and Suri Bhagavantam in 1931.
The collision of a particle with its antiparticle can create photons. In free space at least two photons must be created since, in the center of momentum frame,
the colliding antiparticles have no net momentum, whereas a single
photon always has momentum (determined by the photon's frequency or
wavelength, which cannot be zero). Hence, conservation of momentum (or equivalently, translational invariance) requires that at least two photons are created, with zero net momentum. The energy of the two photons, or, equivalently, their frequency, may be determined from conservation of four-momentum.
Seen another way, the
photon can be considered as its own antiparticle (thus an "antiphoton"
is simply a normal photon with opposite momentum, equal polarization,
and 180° out of phase). The reverse process, pair production, is the dominant mechanism by which high-energy photons such as gamma rays lose energy while passing through matter. That process is the reverse of "annihilation to one photon" allowed in the electric field of an atomic nucleus.
The classical formulae for the energy and momentum of electromagnetic radiation can be re-expressed in terms of photon events. For example, the pressure of electromagnetic radiation
on an object derives from the transfer of photon momentum per unit time
and unit area to that object, since pressure is force per unit area and
force is the change in momentum per unit time.
Experimental checks on photon mass
Current commonly accepted physical theories imply or assume the photon to be strictly massless. If photons were not purely massless, their speeds would vary with
frequency, with lower-energy (redder) photons moving slightly slower
than higher-energy photons. Relativity would be unaffected by this; the
so-called speed of light, c, would then not be the actual speed at which light moves, but a constant of nature which is the upper bound on speed that any object could theoretically attain in spacetime. Thus, it would still be the speed of spacetime ripples (gravitational waves and gravitons), but it would not be the speed of photons.
If a photon did have non-zero mass, there would be other effects as well. Coulomb's law would be modified and the electromagnetic field would have an extra physical degree of freedom.
These effects yield more sensitive experimental probes of the photon
mass than the frequency dependence of the speed of light. If Coulomb's
law is not exactly valid, then that would allow the presence of an electric field to exist within a hollow conductor when it is subjected to an external electric field. This provides a means for precision tests of Coulomb's law. A null result of such an experiment has set a limit of m ≲ 10−14 eV/c2.
Sharper upper limits on the mass of light have been obtained in experiments designed to detect effects caused by the galactic vector potential. Although the galactic vector potential is large because the galactic magnetic field
exists on great length scales, only the magnetic field would be
observable if the photon is massless. In the case that the photon has
mass, the mass term 1/2m2AμAμ would affect the galactic plasma. The fact that no such effects are seen implies an upper bound on the photon mass of m < 3×10−27 eV/c2. The galactic vector potential can also be probed directly by measuring the torque exerted on a magnetized ring. Such methods were used to obtain the sharper upper limit of 1.07×10−27 eV/c2 (10−36Da) given by the Particle Data Group.
These sharp limits from the non-observation of the effects caused
by the galactic vector potential have been shown to be model-dependent. If the photon mass is generated via the Higgs mechanism then the upper limit of m ≲ 10−14 eV/c2 from the test of Coulomb's law is valid.
Thomas Young's sketch of interference based on observations of water waves. Young reasoned that the similar effects observed with light supported a wave model and not Newton's particle theory of light.
In most theories up to the eighteenth century, light was pictured as being made of particles. Since particle models cannot easily account for the refraction, diffraction and birefringence of light, wave theories of light were proposed by René Descartes (1637), Robert Hooke (1665), and Christiaan Huygens (1678); however, particle models remained dominant, chiefly due to the influence of Isaac Newton. In the early 19th century, Thomas Young and August Fresnel clearly demonstrated the interference and diffraction of light, and by 1850 wave models were generally accepted. James Clerk Maxwell's 1865 prediction that light was an electromagnetic wave – which was confirmed experimentally in 1888 by Heinrich Hertz's detection of radio waves – seemed to be the final blow to particle models of light.
In 1900, Maxwell'stheoretical model of light as oscillating electric and magnetic fields seemed complete. However, several observations could not be explained by any wave model of electromagnetic radiation, leading to the idea that light-energy was packaged into quanta described by E = hν. Later experiments showed that these light-quanta also carry momentum and, thus, can be considered particles: The photon concept was born, leading to a deeper understanding of the electric and magnetic fields themselves.
The Maxwell wave theory, however, does not account for all properties of light. The Maxwell theory predicts that the energy of a light wave depends only on its intensity, not on its frequency;
nevertheless, several independent types of experiments show that the
energy imparted by light to atoms depends only on the light's frequency,
not on its intensity. For example, some chemical reactions
are provoked only by light of frequency higher than a certain
threshold; light of frequency lower than the threshold, no matter how
intense, does not initiate the reaction. Similarly, electrons can be
ejected from a metal plate by shining light of sufficiently high
frequency on it (the photoelectric effect); the energy of the ejected electron is related only to the light's frequency, not to its intensity.
At the same time, investigations of black-body radiation carried out over four decades (1860–1900) by various researchers culminated in Max Planck's hypothesis that the energy of any system that absorbs or emits electromagnetic radiation of frequency ν is an integer multiple of an energy quantum E = hν . As shown by Albert Einstein, some form of energy quantization must be assumed to account for the thermal equilibrium observed between matter and electromagnetic radiation; for this explanation of the photoelectric effect, Einstein received the 1921 Nobel Prize in physics.
Since the Maxwell theory of light allows for all possible
energies of electromagnetic radiation, most physicists assumed initially
that the energy quantization resulted from some unknown constraint on
the matter that absorbs or emits the radiation. In 1905, Einstein was
the first to propose that energy quantization was a property of
electromagnetic radiation itself. Although he accepted the validity of Maxwell's theory, Einstein pointed
out that many anomalous experiments could be explained if the energy
of a Maxwellian light wave were localized into point-like quanta that
move independently of one another, even if the wave itself is spread
continuously over space. In 1909 and 1916, Einstein showed that, if Planck's law regarding black-body radiation is accepted, the energy quanta must also carry momentum p = h / λ , making them full-fledged particles.
Up
to 1923, most physicists were reluctant to accept that light itself was
quantized. Instead, they tried to explain photon behaviour by
quantizing only matter, as in the Bohr model of the hydrogen atom
(shown here). Even though these semiclassical models were only a first
approximation, they were accurate for simple systems and they led to quantum mechanics.
As recounted in Robert Millikan's
1923 Nobel lecture, Einstein's 1905 predicted energy relationship was
verified experimentally by 1916 but the local concept of the quanta
remained unsettled. Most physicists were reluctant to believe that electromagnetic radiation
itself might be particulate and thus an example of wave-particle
duality. Then in 1922 the Arthur Compton experiment showed that photons carried momentum proportional to their wave number (and therefore energy) in an effect now called Compton scattering that appeared to clearly support a localized quantum model. At least for Millikan, this settled the matter. Compton received the Nobel Prize in 1927 for his scattering work.
Even after Compton's experiment, Niels Bohr, Hendrik Kramers and John Slater made one last attempt to preserve the Maxwellian continuous electromagnetic field model of light, the so-called BKS theory. An important feature of the BKS theory is how it treated the conservation of energy and the conservation of momentum.
In the BKS theory, energy and momentum are only conserved on the
average across many interactions between matter and radiation. However,
refined Compton experiments showed that the conservation laws hold for
individual interactions. Accordingly, Bohr and his co-workers gave their model "as honorable a funeral as possible". Nevertheless, the failures of the BKS model inspired Werner Heisenberg in his development of matrix mechanics.
By the late 1920, the pivotal question was how to unify Maxwell's
wave theory of light with its experimentally observed particle nature.
The answer to this question occupied Albert Einstein for the rest of his
life, and was solved in quantum electrodynamics and its successor, the Standard Model. (See § Quantum field theory and § As a gauge boson, below.)
A few physicists persisted in developing semiclassical models in which electromagnetic radiation is not quantized, but matter appears to obey the laws of quantum mechanics.
Although the evidence from chemical and physical experiments for the
existence of photons was overwhelming by the 1970s, this evidence could
not be considered as absolutely definitive; since it relied on
the interaction of light with matter, and a sufficiently complete theory
of matter could in principle account for the evidence.
In the 1970s and 1980s photon-correlation experiments
definitively demonstrated quantum photon effects.
These experiments produce results that cannot be explained by any
classical theory of light, since they involve anticorrelations that
result from the quantum measurement process. In 1974, the first such experiment was carried out by Clauser, who reported a violation of a classical Cauchy–Schwarz inequality. In 1977, Kimble et al.
demonstrated an analogous anti-bunching effect of photons interacting
with a beam splitter; this approach was simplified and sources of error
eliminated in the photon-anticorrelation experiment of Grangier, Roger,
& Aspect (1986); This work is reviewed and simplified further in Thorn, Neel, et al. (2004).
Nomenclature
Photoelectric effect: the emission of electrons from a metal plate caused by light quanta – photons
The word quanta (singular quantum, Latin for how much) was used before 1900 to mean particles or amounts of different quantities, including electricity. In 1900, the German physicist Max Planck was studying black-body radiation, and he suggested that the experimental observations, specifically at shorter wavelengths,
would be explained if the energy was "made up of a completely
determinate number of finite equal parts", which he called "energy
elements". In 1905, Albert Einstein published a paper in which he proposed that many light-related phenomena—including black-body radiation and the photoelectric effect—would be better explained by modelling electromagnetic waves as consisting of spatially localized, discrete energy quanta. He called these a light quantum (German: ein Lichtquant).
The name photon derives from the Greek word for light, φῶς (transliterated phôs). The name was used 1916 by the American physicist and psychologist Leonard T. Troland for a unit of illumination of the retina and in several other contexts before being adopted for physics. The use of the term photon for the light quantum was popularized by Gilbert N. Lewis, who used the term in a letter to Nature on 18 December 1926. Arthur Compton, who had performed a key experiment demonstrating light quanta, cited Lewis in the 1927 Solvay conference proceedings for suggesting the name photon. Einstein never did use the term.
Photons obey the laws of quantum mechanics, and so their behavior has
both wave-like and particle-like aspects. When a photon is detected by a
measuring instrument, it is registered as a single, particulate unit.
However, the probability of detecting a photon is calculated by equations that describe waves. This combination of aspects is known as wave–particle duality. For example, the probability distribution for the location at which a photon might be detected displays clearly wave-like phenomena such as diffraction and interference. A single photon passing through a double slit
has its energy received at a point on the screen with a probability
distribution given by its interference pattern determined by Maxwell's wave equations. However, experiments confirm that the photon is not
a short pulse of electromagnetic radiation; a photon's Maxwell waves
will diffract, but photon energy does not spread out as it propagates,
nor does this energy divide when it encounters a beam splitter. Rather, the received photon acts like a point-like particle since it is absorbed or emitted as a whole by arbitrarily small systems, including systems much smaller than its wavelength, such as an atomic nucleus (≈10−15 m across) or even the point-like electron.
While many introductory texts treat photons using the
mathematical techniques of non-relativistic quantum mechanics, this is
in some ways an awkward oversimplification, as photons are by nature
intrinsically relativistic. Because photons have zero rest mass, no wave function defined for a photon can have all the properties familiar from wave functions in non-relativistic quantum mechanics. In order to avoid these difficulties, physicists employ the second-quantized theory of photons described below, quantum electrodynamics, in which photons are quantized excitations of electromagnetic modes.
Another difficulty is finding the proper analogue for the uncertainty principle, an idea frequently attributed to Heisenberg, who introduced the concept in analyzing a thought experiment involving an electron and a high-energy photon.
However, Heisenberg did not give precise mathematical definitions of
what the "uncertainty" in these measurements meant. The precise
mathematical statement of the position–momentum uncertainty principle is
due to Kennard, Pauli, and Weyl. The uncertainty principle applies to situations where an experimenter
has a choice of measuring either one of two "canonically conjugate"
quantities, like the position and the momentum of a particle. According
to the uncertainty principle, no matter how the particle is prepared, it
is not possible to make a precise prediction for both of the two
alternative measurements: if the outcome of the position measurement is
made more certain, the outcome of the momentum measurement becomes less
so, and vice versa. A coherent state minimizes the overall uncertainty as far as quantum mechanics allows. Quantum optics
makes use of coherent states for modes of the electromagnetic field.
There is a tradeoff, reminiscent of the position–momentum uncertainty
relation, between measurements of an electromagnetic wave's amplitude
and its phase. This is sometimes informally expressed in terms of the uncertainty in
the number of photons present in the electromagnetic wave, , and the uncertainty in the phase of the wave, . However, this cannot be an uncertainty relation of the Kennard–Pauli–Weyl type, since unlike position and momentum, the phase cannot be represented by a Hermitian operator.
In 1924, Satyendra Nath Bose derived Planck's law of black-body radiation without using any electromagnetism, but rather by using a modification of coarse-grained counting of phase space. Einstein showed that this modification is equivalent to assuming that
photons are rigorously identical and that it implied a "mysterious
non-local interaction", now understood as the requirement for a symmetric quantum mechanical state. This work led to the concept of coherent states
and the development of the laser. In the same papers, Einstein extended
Bose's formalism to material particles (bosons) and predicted that they
would condense into their lowest quantum state at low enough temperatures; this Bose–Einstein condensation was observed experimentally in 1995. It was later used by Lene Hau to slow, and then completely stop, light in 1999 and 2001.
The modern view on this is that photons are, by virtue of their integer spin, bosons (as opposed to fermions with half-integer spin). By the spin-statistics theorem, all bosons obey Bose–Einstein statistics (whereas all fermions obey Fermi–Dirac statistics).
Stimulated emission (in which photons "clone" themselves) was predicted by Einstein in his kinetic analysis, and led to the development of the laser.
Einstein's derivation inspired further developments in the quantum
treatment of light, which led to the statistical interpretation of
quantum mechanics.
In 1916, Albert Einstein showed that Planck's radiation law could be
derived from a semi-classical, statistical treatment of photons and
atoms, which implies a link between the rates at which atoms emit and
absorb photons. The condition follows from the assumption that functions
of the emission and absorption of radiation by the atoms are
independent of each other, and that thermal equilibrium is made by way
of the radiation's interaction with the atoms. Consider a cavity in thermal equilibrium with all parts of itself and filled with electromagnetic radiation and that the atoms can emit and absorb that radiation. Thermal equilibrium requires that the energy density of photons with frequency (which is proportional to their number density) is, on average, constant in time; hence, the rate at which photons of any particular frequency are emitted must equal the rate at which they are absorbed.
Einstein began by postulating simple proportionality relations
for the different reaction rates involved. In his model, the rate for a system to absorb a photon of frequency and transition from a lower energy to a higher energy is proportional to the number of atoms with energy and to the energy density of ambient photons of that frequency,
where is the rate constant
for absorption. For the reverse process, there are two possibilities:
spontaneous emission of a photon, or the emission of a photon initiated
by the interaction of the atom with a passing photon and the return of
the atom to the lower-energy state. Following Einstein's approach, the
corresponding rate for the emission of photons of frequency and transition from a higher energy to a lower energy is
where is the rate constant for emitting a photon spontaneously, and is the rate constant for emissions in response to ambient photons (induced or stimulated emission). In thermodynamic equilibrium, the number of atoms in state and those in state must, on average, be constant; hence, the rates and must be equal. Also, by arguments analogous to the derivation of Boltzmann statistics, the ratio of and is where and are the degeneracy of the state and that of , respectively, and their energies, the Boltzmann constant and the system's temperature. From this, it is readily derived that
and
The and are collectively known as the Einstein coefficients.
Einstein could not fully justify his rate equations, but claimed that it should be possible to calculate the coefficients , and once physicists had obtained "mechanics and electrodynamics modified to accommodate the quantum hypothesis". Not long thereafter, in 1926, Paul Dirac derived the rate constants by using a semiclassical approach, and, in 1927, succeeded in deriving all the rate constants from first principles within the framework of quantum theory. Dirac's work was the foundation of quantum electrodynamics, i.e., the
quantization of the electromagnetic field itself. Dirac's approach is
also called second quantization or quantum field theory; earlier quantum mechanical treatments only treat material particles as quantum mechanical, not the electromagnetic field.
Einstein was troubled by the fact that his theory seemed incomplete, since it did not determine the direction of a spontaneously emitted photon. A probabilistic nature of light-particle motion was first considered by Newton in his treatment of birefringence
and, more generally, of the splitting of light beams at interfaces into
a transmitted beam and a reflected beam. Newton hypothesized that
hidden variables in the light particle determined which of the two paths
a single photon would take. Similarly, Einstein hoped for a more complete theory that would leave nothing to chance, beginning his separation from quantum mechanics. Ironically, Max Born's probabilistic interpretation of the wave function was inspired by Einstein's later work searching for a more complete theory.
Different electromagnetic modes (such as those depicted here) can be treated as independent simple harmonic oscillators. A photon corresponds to a unit of energy E = hν in its electromagnetic mode.
In 1910, Peter Debye derived Planck's law of black-body radiation from a relatively simple assumption. He decomposed the electromagnetic field in a cavity into its Fourier modes, and assumed that the energy in any mode was an integer multiple of , where
is the frequency of the electromagnetic mode. Planck's law of
black-body radiation follows immediately as a geometric sum. However,
Debye's approach failed to give the correct formula for the energy
fluctuations of black-body radiation, which were derived by Einstein in
1909.
In 1925, Born, Heisenberg and Jordan reinterpreted Debye's concept in a key way. As may be shown classically, the Fourier modes of the electromagnetic field—a complete set of electromagnetic plane waves indexed by their wave vector k and polarization state—are equivalent to a set of uncoupled simple harmonic oscillators. Treated quantum mechanically, the energy levels of such oscillators are known to be , where is the oscillator frequency. The key new step was to identify an electromagnetic mode with energy as a state with photons, each of energy . This approach gives the correct energy fluctuation formula.
Feynman diagram of two electrons interacting by exchange of a virtual photon
Dirac took this one step further.He treated the interaction between a charge and an electromagnetic
field as a small perturbation that induces transitions in the photon
states, changing the numbers of photons in the modes, while conserving
energy and momentum overall. Dirac was able to derive Einstein's and
coefficients from first principles, and showed that the Bose–Einstein
statistics of photons is a natural consequence of quantizing the
electromagnetic field correctly (Bose's reasoning went in the opposite
direction; he derived Planck's law of black-body radiation by assuming B–E statistics). In Dirac's time, it was not yet known that all bosons, including photons, must obey Bose–Einstein statistics.
Dirac's second-order perturbation theory can involve virtual photons, transient intermediate states of the electromagnetic field; the static electric and magnetic interactions are mediated by such virtual photons. In such quantum field theories, the probability amplitude of observable events is calculated by summing over all possible intermediate steps, even ones that are unphysical; hence, virtual photons are not constrained to satisfy , and may have extra polarization states; depending on the gauge
used, virtual photons may have three or four polarization states,
instead of the two states of real photons. Although these transient
virtual photons can never be observed, they contribute measurably to the
probabilities of observable events.
Second-order and higher-order perturbation calculations can give infinite contributions to the sum. Such unphysical results are corrected for using the technique of renormalization.
Other virtual particles may contribute to the summation as well;
for example, two photons may interact indirectly through virtual electron–positronpairs. Such photon–photon scattering (see two-photon physics),
as well as electron–photon scattering, is meant to be one of the modes
of operations of the planned particle accelerator, the International Linear Collider.
where represents the state in which photons are in the mode . In this notation, the creation of a new photon in mode (e.g., emitted from an atomic transition) is written as . This notation merely expresses the concept of Born, Heisenberg and Jordan described above, and does not add any physics.
Measurements of the interaction between energetic photons and hadrons
show that the interaction is much more intense than expected by the
interaction of merely photons with the hadron's electric charge.
Furthermore, the interaction of energetic photons with protons is
similar to the interaction of photons with neutrons in spite of the fact that the electrical charge structures of protons
and neutrons are substantially different. A theory called vector meson dominance
(VMD) was developed to explain this effect. According to VMD, the
photon is a superposition of the pure electromagnetic photon, which
interacts only with electric charges, and vector mesons, which mediate
the residual nuclear force. However, if experimentally probed at very short distances, the
intrinsic structure of the photon appears to have as components a
charge-neutral flux of quarks and gluons, quasi-free according to
asymptotic freedom in QCD. That flux is described by the photon structure function.A review by Nisius (2000) presented a comprehensive comparison of data with theoretical predictions.
The energy of a system that emits a photon is decreased by the energy of the photon as measured in the rest frame of the emitting system, which may result in a reduction in mass in the amount . Similarly, the mass of a system that absorbs a photon is increased
by a corresponding amount. As an application, the energy balance of
nuclear reactions involving photons is commonly written in terms of the
masses of the nuclei involved, and terms of the form for the gamma photons (and for other relevant energies, such as the recoil energy of nuclei).
Light that travels through transparent matter does so at a lower speed than c, the speed of light in vacuum. The factor by which the speed is decreased is called the refractive index of the material. In a classical wave picture, the slowing can be explained by the light inducing electric polarization
in the matter, the polarized matter radiating new light, and that new
light interfering with the original light wave to form a delayed wave.
In a particle picture, the slowing can instead be described as a
blending of the photon with quantum excitations of the matter to produce
quasi-particles known as polaritons. Polaritons have a nonzero effective mass, which means that they cannot travel at c. Light of different frequencies may travel through matter at different speeds; this is called dispersion (not to be confused with scattering). In some cases, it can result in extremely slow speeds of light in matter. The effects of photon interactions with other quasi-particles may be observed directly in Raman scattering and Brillouin scattering.
Photons can be scattered by matter. For example, photons scatter so many times in the solar radiative zone after leaving the core of the Sun that radiant energy takes about a million years to reach the convection zone. However, photons emitted from the sun's photosphere take only 8.3 minutes to reach Earth.
Photons can also be absorbed by nuclei, atoms or molecules, provoking transitions between their energy levels. A classic example is the molecular transition of retinal (C20H28O), which is responsible for vision, as discovered in 1958 by Nobel laureate biochemistGeorge Wald and co-workers. The absorption provokes a cis–transisomerization
that, in combination with other such transitions, is transduced into
nerve impulses. The absorption of photons can even break chemical bonds,
as in the photodissociation of chlorine; this is the subject of photochemistry.
Technological applications
Photons have many applications in technology. These examples are chosen to illustrate applications of photons per se,
rather than general optical devices such as lenses, etc. that could
operate under a classical theory of light. The laser is an important
application and is discussed above under stimulated emission.
Individual photons can be detected by several methods. The classic photomultiplier tube exploits the photoelectric effect:
a photon of sufficient energy strikes a metal plate and knocks free an
electron, initiating an ever-amplifying avalanche of electrons. Semiconductorcharge-coupled device chips use a similar effect: an incident photon generates a charge on a microscopic capacitor that can be detected. Other detectors such as Geiger counters use the ability of photons to ionize gas molecules contained in the device, causing a detectable change of conductivity of the gas.
Planck's energy formula
is often used by engineers and chemists in design, both to compute the
change in energy resulting from a photon absorption and to determine the
frequency of the light emitted from a given photon emission. For
example, the emission spectrum of a gas-discharge lamp can be altered by filling it with (mixtures of) gases with different electronic energy level configurations.
Under some conditions, an energy transition can be excited by
"two" photons that individually would be insufficient. This allows for
higher resolution microscopy, because the sample absorbs energy only in
the spectrum where two beams of different colors overlap significantly,
which can be made much smaller than the excitation volume of a single
beam (see two-photon excitation microscopy). Moreover, these photons cause less damage to the sample, since they are of lower energy.
In some cases, two energy transitions can be coupled so that, as
one system absorbs a photon, another nearby system "steals" its energy
and re-emits a photon of a different frequency. This is the basis of fluorescence resonance energy transfer, a technique that is used in molecular biology to study the interaction of suitable proteins.
Several different kinds of hardware random number generators
involve the detection of single photons. In one example, for each bit
in the random sequence that is to be produced, a photon is sent to a beam-splitter.
In such a situation, there are two possible outcomes of equal
probability. The actual outcome is used to determine whether the next
bit in the sequence is 0 or 1.
Two-photon physics
studies interactions between photons, which are rare. In 2018,
Massachusetts Institute of Technology researchers announced the
discovery of bound photon triplets, which may involve polaritons.