Gamma-ray burst emission mechanisms are theories that explain how the energy from a gamma-ray burst progenitor
(regardless of the actual nature of the progenitor) is turned into
radiation. These mechanisms are a major topic of research as of 2007.
Neither the light curves nor the early-time spectra of GRBs show resemblance to the radiation emitted by any familiar physical process.
Compactness problem
It has been known for many years that ejection of matter at relativistic velocities (velocities very close to the speed of light)
is a necessary requirement for producing the emission in a gamma-ray
burst. GRBs vary on such short timescales (as short as milliseconds)
that the size of the emitting region must be very small, or else the
time delay due to the finite speed of light would "smear" the emission
out in time, wiping out any short-timescale behavior. At the energies
involved in a typical GRB, so much energy crammed into such a small
space would make the system opaque to photon-photon pair production,
making the burst far less luminous and also giving it a very different
spectrum from what is observed. However, if the emitting system is
moving towards Earth at relativistic velocities, the burst is compressed
in time (as seen by an Earth observer, due to the relativistic Doppler effect) and the emitting region inferred from the finite speed of light becomes much smaller than the true size of the GRB (see relativistic beaming).
GRBs and internal shocks
A related constraint is imposed by the relative
timescales seen in some bursts between the short-timescale variability
and the total length of the GRB. Often this variability timescale is far
shorter than the total burst length. For example, in bursts as long as
100 seconds, the majority of the energy can be released in short
episodes less than 1 second long. If the GRB were due to matter moving
towards Earth (as the relativistic motion argument enforces), it is hard
to understand why it would release its energy in such brief interludes.
The generally accepted explanation for this is that these bursts
involve the collision of multiple shells traveling at slightly different velocities; so-called "internal shocks".
The collision of two thin shells flash-heats the matter, converting
enormous amounts of kinetic energy into the
random motion of particles, greatly amplifying the energy release due to
all emission mechanisms. Which physical mechanisms are at play in
producing the observed photons is still an area of debate, but the most
likely candidates appear to be synchrotron radiation and inverse Compton scattering.
As of 2007 there is no theory that has successfully described the spectrum of all gamma-ray bursts (though some theories work for a subset). However, the so-called Band function (named after David Band) has been fairly successful at fitting, empirically, the spectra of most gamma-ray bursts:
A few gamma-ray bursts have shown evidence for an additional,
delayed emission component at very high energies (GeV and higher). One
theory for this emission invokes inverse Compton scattering. If a GRB progenitor, such as a Wolf-Rayet star, were to explode within a stellar cluster,
the resulting shock wave could generate gamma-rays by scattering
photons from neighboring stars. About 30% of known galactic Wolf-Rayet
stars, are located in dense clusters of O stars with intense ultraviolet radiation fields, and the collapsar model
suggests that WR stars are likely GRB progenitors. Therefore, a
substantial fraction of GRBs are expected to occur in such clusters. As
the relativistic matter
ejected from an explosion slows and interacts with
ultraviolet-wavelength photons, some photons gain energy, generating
gamma-rays.
Afterglows and external shocks
The GRB itself is very rapid, lasting from less than a second up to a
few minutes at most. Once it disappears, it leaves behind a counterpart
at longer wavelengths (X-ray, UV, optical, infrared, and radio) known
as the afterglow that generally remains detectable for days or longer.
In contrast to the GRB emission, the afterglow emission is not
believed to be dominated by internal shocks. In general, all the ejected
matter has by this time coalesced into a single shell traveling outward
into the interstellar medium (or possibly the stellar wind) around the star. At the front of this shell of matter is a shock wave referred to as the "external shock" as the still relativistically moving matter ploughs into the tenuous interstellar gas or the gas surrounding the star.
As the interstellar matter moves across the shock, it is
immediately heated to extreme temperatures. (How this happens is still
poorly understood as of 2007, since the particle density across the
shock wave is too low to create a shock wave comparable to those
familiar in dense terrestrial environments – the topic of "collisionless
shocks" is still largely hypothesis but seems to accurately describe a
number of astrophysical situations. Magnetic fields are probably
critically involved.) These particles, now relativistically moving,
encounter a strong local magnetic field and are accelerated
perpendicular to the
magnetic field, causing them to radiate their energy via synchrotron
radiation.
Synchrotron radiation is well understood, and the afterglow spectrum has been modeled fairly successfully using this template. It is generally dominated by electrons (which move and therefore radiate much faster than protons and other particles) so radiation from other particles is generally ignored.
In general, the GRB assumes the form of a power-law with three
break points (and therefore four different power-law segments.) The
lowest break point, ,
corresponds to the frequency below which the GRB is opaque to radiation
and so the spectrum attains the form Rayleigh-Jeans tail of blackbody radiation. The two other break points, and ,
are related to the minimum energy acquired by an electron after it
crosses the shock wave and the time it takes an electron to radiate most
of its energy, respectively. Depending on which of these two
frequencies is higher, two different regimes are possible:
Fast cooling ()
- Shortly after the GRB, the shock wave imparts immense energy to the
electrons and the minimum electron Lorentz factor is very high. In this
case, the spectrum looks like:
Slow cooling () – Later after the GRB, the shock wave has slowed down and the minimum electron Lorentz factor is much lower.:
The afterglow changes with time. It must fade, obviously, but the spectrum changes as well. For the simplest case of adiabatic expansion into a uniform-density medium, the critical parameters evolve as:
Here is the flux at the current peak frequency of the GRB spectrum. (During fast-cooling this is at ; during slow-cooling it is at .) Note that because drops faster than , the system eventually switches from fast-cooling to slow-cooling.
Different scalings are derived for radiative evolution and for a non-constant-density environment (such as a stellar wind), but share the general power-law behavior observed in this case.
Several other known effects can modify the evolution of the afterglow:
Reverse shocks and the optical flash
There can be "reverse shocks", which propagate back into the shocked matter once it begins to encounter the interstellar medium. The twice-shocked material can produce a bright optical/UV flash, which has been seen in a few GRBs, though it appears not to be a common phenomenon.
Refreshed shocks and late-time flares
There
can be "refreshed" shocks if the central engine continues to release
fast-moving matter in small amounts even out to late times, these new
shocks will catch up with the external shock to produce something like a
late-time internal shock. This explanation has been invoked to explain
the frequent flares seen in X-rays and at other wavelengths in many
bursts, though some theorists are uncomfortable with the apparent demand
that the progenitor (which one would think would be destroyed by the
GRB) remains active for very long.
Jet effects
Gamma-ray burst emission is believed to be released in jets, not spherical shells. Initially the two scenarios are equivalent: the center of the jet is not "aware" of the jet edge, and due to relativistic beaming
we only see a small fraction of the jet. However, as the jet slows
down, two things eventually occur (each at about the same time): First,
information from the edge of the jet that there is no pressure to the
side propagates to its center, and the jet matter can spread laterally.
Second, relativistic beaming effects subside, and once Earth observers
see the entire jet the widening of the relativistic beam is no longer
compensated by the fact that we see a larger emitting region. Once these
effects appear the jet fades very rapidly, an effect that is visible as
a power-law "break" in the afterglow light curve. This is the so-called
"jet break" that has been seen in some events and is often cited as
evidence for the consensus view of GRBs as jets. Many GRB afterglows do
not display jet breaks, especially in the X-ray, but they are more
common in the optical light curves. Though as jet breaks generally occur
at very late times (~1 day or more) when the afterglow is quite faint,
and often undetectable, this is not necessarily surprising.
Dust extinction and hydrogen absorption
There may be dust along the line of sight from the GRB to Earth, both in the host galaxy and in the Milky Way. If so, the light will be attenuated and reddened and an afterglow spectrum may look very different from that modeled.
At very high frequencies (far-ultraviolet and X-ray) interstellar
hydrogen gas becomes a significant absorber. In particular, a photon
with a wavelength of less than 91 nanometers is energetic enough to
completely ionize neutral hydrogen and is absorbed with almost 100%
probability even through relatively thin gas clouds. (At much shorter
wavelengths the probability of absorption begins to drop again, which is
why X-ray afterglows are still detectable.) As a result, observed
spectra of very high-redshift GRBs often drop to zero at wavelengths
less than that of where this hydrogen ionization threshold (known as the
Lyman break)
would be in the GRB host's reference frame. Other, less dramatic
hydrogen absorption features are also commonly seen in high-z GRBs, such
as the Lyman alpha forest.
In mathematics, exponentiation is an operation involving two numbers, the base and the exponent or power. Exponentiation is written as bn, where b is the base and n is the power; this is pronounced as "b (raised) to the (power of) n". When n is a positive integer, exponentiation corresponds to repeated multiplication of the base: that is, bn is the product of multiplying n bases:
The exponent is usually shown as a superscript to the right of the base. In that case, bn is called "b raised to the nth power", "b (raised) to the power of n", "the nth power of b", "b to the nth power", or most briefly as "b to the nth".
Starting from the basic fact stated above that, for any positive integer , is occurrences of all multiplied by each other, several other properties of exponentiation directly follow. In particular:
In other words, when multiplying a base raised to one exponent
by the same base raised to another exponent, the exponents add. From
this basic rule that exponents add, we can derive that must be equal to 1 for any , as follows. For any , . Dividing both sides by gives .
The fact that can similarly be derived from the same rule. For example, . Taking the cube root of both sides gives .
The rule that multiplying makes exponents add can also be used to
derive the properties of negative integer exponents. Consider the
question of what should mean. In order to respect the "exponents add" rule, it must be the case that . Dividing both sides by gives , which can be more simply written as , using the result from above that . By a similar argument, .
The properties of fractional exponents also follow from the same rule. For example, suppose we consider and ask if there is some suitable exponent, which we may call , such that . From the definition of the square root, we have that . Therefore, the exponent must be such that . Using the fact that multiplying makes exponents add gives . The on the right-hand side can also be written as , giving . Equating the exponents on both sides, we have . Therefore, , so .
The definition of exponentiation can be extended to allow any real or complex exponent. Exponentiation by integer exponents can also be defined for a wide variety of algebraic structures, including matrices.
The term power (Latin: potentia, potestas, dignitas) is a mistranslation of the ancient Greek δύναμις (dúnamis, here: "amplification") used by the Greek mathematician Euclid for the square of a line, following Hippocrates of Chios.
In The Sand Reckoner, Archimedes discovered and proved the law of exponents, 10a · 10b = 10a+b, necessary to manipulate powers of 10.
Māl and kaʿbah ("square" and "cube")
In the 9th century, the Persian mathematician Muhammad ibn Mūsā al-Khwārizmī used the terms مَال (māl, "possessions", "property") for a square—the
Muslims, "like most mathematicians of those and earlier times, thought
of a squared number as a depiction of an area, especially of land, hence
property"—and كَعْبَة (kaʿbah, "cube") for a cube, which later Islamic mathematicians represented in mathematical notation as the letters mīm (m) and kāf (k), respectively, by the 15th century, as seen in the work of Abū al-Hasan ibn Alī al-Qalasādī.
15th–18th century
Introducing exponents
Nicolas Chuquet used a form of exponential notation in the 15th century, for example 122 for 12x2. This was later used by Henricus Grammateus and Michael Stifel in the 16th century. In the late 16th century, Jost Bürgi would use Roman numerals for exponents in a way similar to that of Chuquet, for example iii4 for 4x3.
"Exponent"; "square" and "cube"
The word exponent was coined in 1544 by Michael Stifel. In the 16th century, Robert Recorde used the terms square, cube, zenzizenzic (fourth power), sursolid (fifth), zenzicube (sixth), second sursolid (seventh), and zenzizenzizenzic (eighth). Biquadrate has been used to refer to the fourth power as well.
Modern exponential notation
In 1636, James Hume used in essence modern notation, when in L'algèbre de Vietè he wrote Aiii for A3. Early in the 17th century, the first form of our modern exponential notation was introduced by René Descartes in his text titled La Géométrie; there, the notation is introduced in Book I.
I designate ... aa, or a2 in multiplying a by itself; and a3 in multiplying it once more again by a, and thus to infinity.
— René Descartes, La Géométrie
Some mathematicians (such as Descartes) used exponents only for
powers greater than two, preferring to represent squares as repeated
multiplication. Thus they would write polynomials, for example, as ax + bxx + cx3 + d.
"Indices"
Samuel Jeake introduced the term indices in 1696. A historical synonym, involution, is now rare and should not be confused with its more common meaning.
Variable exponents, non-integer exponents
In 1748, Leonhard Euler introduced variable exponents, and, implicitly, non-integer exponents by writing:
Consider exponentials or powers in which the exponent itself is a variable. It is clear that quantities of this kind are not algebraic functions, since in those the exponents must be constant.
Terminology
The expression b2 = b · b is called "the square of b" or "b squared", because the area of a square with side-length b is b2. (It is true that it could also be called "b to the second power", but "the square of b" and "b squared" are so ingrained by tradition and convenience that "b to the second power" tends to sound unusual or clumsy.)
Similarly, the expression b3 = b · b · b is called "the cube of b" or "b cubed", because the volume of a cube with side-length b is b3.
When an exponent is a positive integer, that exponent indicates how many copies of the base are multiplied together. For example, 35 = 3 · 3 · 3 · 3 · 3 = 243. The base 3 appears 5 times in the multiplication, because the exponent is 5. Here, 243 is the 5th power of 3, or 3 raised to the 5th power.
The word "raised" is usually omitted, and sometimes "power" as well, so 35 can be simply read "3 to the 5th", or "3 to the 5". Therefore, the exponentiation bn can be expressed as "b to the power of n", "b to the nth power", "b to the nth", or most briefly as "b to the n".
A formula with nested exponentiation, such as 357 (which means 3(57) and not (35)7), is called a tower of powers, or simply a tower. For example, writing is equivalent to writing . The same idea continues to apply when more levels are added; writing means , and so on. For example, can be computed as , which can be computed as , which is equal to , which is equal to 10.
Integer exponents
The exponentiation operation with integer exponents may be defined directly from elementary arithmetic operations.
Positive exponents
The definition of the exponentiation as an iterated multiplication can be formalized by using induction, and this definition can be used as soon one has an associative multiplication:
Intuitionally, may be interpreted as the empty product of copies of b. So, the equality is a special case of the general convention for the empty product.
The case of 00 is more complicated. In contexts where only integer powers are considered, the value 1 is generally assigned to 00 but, otherwise, the choice of whether to assign it a value and what value to assign may depend on context. For more details, see Zero to the power of zero.
Negative exponents
Exponentiation with negative exponents is defined by the following identity, which holds for any integer n and nonzero b:
.
Raising 0 to a negative exponent is undefined but, in some circumstances, it may be interpreted as infinity ().
This definition of exponentiation with negative exponents is the only one that allows extending the identity to negative exponents (consider the case ).
The following identities, often called exponent rules, hold for all integer exponents, provided that the base is non-zero:
Unlike addition and multiplication, exponentiation is not commutative. For example, 23 = 8 ≠ 32 = 9. Also unlike addition and multiplication, exponentiation is not associative. For example, (23)2 = 82 = 64, whereas 2(32) = 29 = 512. Without parentheses, the conventional order of operations for serial exponentiation in superscript notation is top-down (or right-associative), not bottom-up(or left-associative). That is,
which, in general, is different from
Powers of a sum
The powers of a sum can normally be computed from the powers of the summands by the binomial formula
However, this formula is true only if the summands commute (i.e. that ab = ba), which is implied if they belong to a structure that is commutative. Otherwise, if a and b are, say, square matrices of the same size, this formula cannot be used. It follows that in computer algebra, many algorithms involving integer exponents must be changed when the exponentiation bases do not commute. Some general purpose computer algebra systems use a different notation (sometimes ^^ instead of ^) for exponentiation with non-commuting bases, which is then called non-commutative exponentiation.
For nonnegative integers n and m, the value of nm is the number of functions from a set of m elements to a set of n elements (see cardinal exponentiation). Such functions can be represented as m-tuples from an n-element set (or as m-letter words from an n-letter alphabet). Some examples for particular values of m and n are given in the following table:
nm
The nm possible m-tuples of elements from the set {1, ..., n}
In the base ten (decimal) number system, integer powers of 10 are written as the digit 1 followed or preceded by a number of zeroes determined by the sign and magnitude of the exponent. For example, 103 = 1000 and 10−4 = 0.0001.
SI prefixes based on powers of 10 are also used to describe small or large quantities. For example, the prefix kilo means 103 = 1000, so a kilometre is 1000 m.
The first negative powers of 2 are commonly used, and have special names, e.g.: half and quarter.
Powers of 2 appear in set theory, since a set with n members has a power set, the set of all of its subsets, which has 2n members.
Integer powers of 2 are important in computer science. The positive integer powers 2n give the number of possible values for an n-bit integer binary number; for example, a byte may take 28 = 256 different values. The binary number system expresses any number as a sum of powers of 2, and denotes it as a sequence of 0 and 1, separated by a binary point, where 1 indicates a power of 2 that appears in the sum; the exponent is determined by the place of this 1: the nonnegative exponents are the rank of the 1 on the left of the point (starting from 0), and the negative exponents are determined by the rank on the right of the point.
Powers of one
Every power of one equals: 1n = 1. This is true even if n is negative.
The first power of a number is the number itself: n1 = n.
Powers of zero
If the exponent n is positive (n > 0), the nth power of zero is zero: 0n = 0.
If the exponent n is negative (n < 0), the nth power of zero 0n is undefined, because it must equal with −n > 0, and this would be according to above.
The expression 00 is either defined as 1, or it is left undefined.
Powers of negative one
If n is an even integer, then (−1)n = 1. This is because a negative number multiplied by another negative number cancels the sign, and thus gives a positive number.
If n is an odd integer, then (−1)n = −1. This is because there will be a remaining −1 after removing −1 pairs.
Because of this, powers of −1 are useful for expressing alternating sequences. For a similar discussion of powers of the complex number i, see § nth roots of a complex number.
Large exponents
The limit of a sequence of powers of a number greater than one diverges; in other words, the sequence grows without bound:
bn → ∞ as n → ∞ when b > 1
This can be read as "b to the power of n tends to +∞ as n tends to infinity when b is greater than one".
Powers of a number with absolute value less than one tend to zero:
bn → 0 as n → ∞ when |b| < 1
Any power of one is always one:
bn = 1 for all n if b = 1
Powers of –1 alternate between 1 and –1 as n alternates between even and odd, and thus do not tend to any limit as n grows.
If b < –1, bn alternates between larger and larger positive and negative numbers as n alternates between even and odd, and thus does not tend to any limit as n grows.
If the exponentiated number varies while tending to 1 as the exponent tends to infinity, then the limit is not necessarily one of those above. A particularly important case is
Real functions of the form , where , are sometimes called power functions. When is an integer and , two primary families exist: for even, and for odd. In general for , when is even will tend towards positive infinity with increasing , and also towards positive infinity with decreasing . All graphs from the family of even power functions have the general shape of , flattening more in the middle as increases. Functions with this kind of symmetry() are called even functions.
When is odd, 's asymptotic behavior reverses from positive to negative . For , will also tend towards positive infinity with increasing , but towards negative infinity with decreasing . All graphs from the family of odd power functions have the general shape of , flattening more in the middle as increases and losing all flatness there in the straight line for . Functions with this kind of symmetry () are called odd functions.
For , the opposite asymptotic behavior is true in each case.
Table of powers of decimal digits
n
n2
n3
n4
n5
n6
n7
n8
n9
n10
1
1
1
1
1
1
1
1
1
1
2
4
8
16
32
64
128
256
512
1024
3
9
27
81
243
729
2187
6561
19683
59049
4
16
64
256
1024
4096
16384
65536
262144
1048576
5
25
125
625
3125
15625
78125
390625
1953125
9765625
6
36
216
1296
7776
46656
279936
1679616
10077696
60466176
7
49
343
2401
16807
117649
823543
5764801
40353607
282475249
8
64
512
4096
32768
262144
2097152
16777216
134217728
1073741824
9
81
729
6561
59049
531441
4782969
43046721
387420489
3486784401
10
100
1000
10000
100000
1000000
10000000
100000000
1000000000
10000000000
Rational exponents
If x is a nonnegative real number, and n is a positive integer, or denotes the unique positive real nth root of x, that is, the unique positive real number y such that
If x is a positive real number, and is a rational number, with p and q > 0 integers, then is defined as
The equality on the right may be derived by setting and writing
If r is a positive rational number, 0r = 0, by definition.
All these definitions are required for extending the identity to rational exponents.
On the other hand, there are problems with the extension of these
definitions to bases that are not positive real numbers. For example, a
negative real number has a real nth root, which is negative, if n is odd, and no real root if n is even. In the latter case, whichever complex nth root one chooses for the identity cannot be satisfied. For example,
For
positive real numbers, exponentiation to real powers can be defined in
two equivalent ways, either by extending the rational powers to reals by
continuity (§ Limits of rational exponents, below), or in terms of the logarithm of the base and the exponential function (§ Powers via logarithms, below). The result is always a positive real number, and the identities and properties
shown above for integer exponents remain true with these definitions
for real exponents. The second definition is more commonly used, since
it generalizes straightforwardly to complex exponents.
On the other hand, exponentiation to a real power of a negative
real number is much more difficult to define consistently, as it may be
non-real and have several values (see § Real exponents with negative bases). One may choose one of these values, called the principal value, but there is no choice of the principal value for which the identity
Since any irrational number can be expressed as the limit of a sequence of rational numbers, exponentiation of a positive real number b with an arbitrary real exponent x can be defined by continuity with the rule
where the limit is taken over rational values of r only. This limit exists for every positive b and every real x.
For example, if x = π, the non-terminating decimal representation π = 3.14159... and the monotonicity
of the rational powers can be used to obtain intervals bounded by
rational powers that are as small as desired, and must contain
So, the upper bounds and the lower bounds of the intervals form two sequences that have the same limit, denoted
The exponential function is often defined as where is Euler's number. For avoiding circular reasoning, this definition cannot be used here. So, a definition of the exponential function, denoted
and of Euler's number are given, which rely only on exponentiation with
positive integer exponents. Then a proof is sketched that, if one uses
the definition of exponentiation given in preceding sections, one has
One has and the exponential identity holds as well, since
and the second-order term does not affect the limit, yielding .
Euler's number can be defined as . It follows from the preceding equations that when x is an integer (this results from the repeated-multiplication definition of the exponentiation). If x is real, results from the definitions given in preceding sections, by using the exponential identity if x is rational, and the continuity of the exponential function otherwise.
The limit that defines the exponential function converges for every complex value of x, and therefore it can be used to extend the definition of , and thus from the real numbers to any complex argument z.
This extended exponential function still satisfies the exponential
identity, and is commonly used for defining exponentiation for complex
base and exponent.
Powers via logarithms
The definition of ex as the exponential function allows defining bx for every positive real numbers b, in terms of exponential and logarithm function. Specifically, the fact that the natural logarithmln(x) is the inverse of the exponential function ex means that one has
for every b > 0. For preserving the identity one must have
So, can be used as an alternative definition of bx for any positive real b.
This agrees with the definition given above using rational exponents
and continuity, with the advantage to extend straightforwardly to any
complex exponent.
Complex exponents with a positive real base
If b is a positive real number, exponentiation with base b and complex exponent z is defined by means of the exponential function with complex argument (see the end of § Exponential function, above) as
In general,
is not defined, since bz is not a real number. If a meaning is given to the exponentiation of a complex number (see § Non-integer powers of complex numbers, below), one has, in general,
In
the preceding sections, exponentiation with non-integer exponents has
been defined for positive real bases only. For other bases, difficulties
appear already with the apparently simple case of nth roots, that is, of exponents where n is a positive integer. Although the general theory of exponentiation with non-integer exponents applies to nth roots, this case deserves to be considered first, since it does not need to use complex logarithms, and is therefore easier to understand.
nth roots of a complex number
Every nonzero complex number z may be written in polar form as
where is the absolute value of z, and is its argument. The argument is defined up to an integer multiple of 2π; this means that, if is the argument of a complex number, then is also an argument of the same complex number for every integer .
The polar form of the product of two complex numbers is obtained
by multiplying the absolute values and adding the arguments. It follows
that the polar form of an nth root of a complex number can be obtained by taking the nth root of the absolute value and dividing its argument by n:
If is added to , the complex number is not changed, but this adds to the argument of the nth root, and provides a new nth root. This can be done n times, and provides the nnth roots of the complex number.
It is usual to choose one of the nnth root as the principal root. The common choice is to choose the nth root for which that is, the nth root that has the largest real part, and, if there are two, the one with positive imaginary part. This makes the principal nth root a continuous function in the whole complex plane, except for negative real values of the radicand. This function equals the usual nth root for positive real radicands. For negative real radicands, and odd exponents, the principal nth root is not real, although the usual nth root is real. Analytic continuation shows that the principal nth root is the unique complex differentiable function that extends the usual nth root to the complex plane without the nonpositive real numbers.
If the complex number is moved around zero by increasing its argument, after an increment of the complex number comes back to its initial position, and its nth roots are permuted circularly (they are multiplied by ). This shows that it is not possible to define a nth root function that is continuous in the whole complex plane.
The nth roots of unity are the n complex numbers such that wn = 1, where n is a positive integer. They arise in various areas of mathematics, such as in discrete Fourier transform or algebraic solutions of algebraic equations (Lagrange resolvent).
The nnth roots of unity are the n first powers of , that is The nth roots of unity that have this generating property are called primitive nth roots of unity; they have the form with kcoprime with n. The unique primitive square root of unity is the primitive fourth roots of unity are and
The nth roots of unity allow expressing all nth roots of a complex number z as the n products of a given nth roots of z with a nth root of unity.
Geometrically, the nth roots of unity lie on the unit circle of the complex plane at the vertices of a regular n-gon with one vertex on the real number 1.
As the number is the primitive nth root of unity with the smallest positive argument, it is called the principal primitive nth root of unity, sometimes shortened as principal nth root of unity, although this terminology can be confused with the principal value of , which is 1.
Complex exponentiation
Defining
exponentiation with complex bases leads to difficulties that are
similar to those described in the preceding section, except that there
are, in general, infinitely many possible values for . So, either a principal value is defined, which is not continuous for the values of z that are real and nonpositive, or is defined as a multivalued function.
In all cases, the complex logarithm is used to define complex exponentiation as
where is the variant of the complex logarithm that is used, which is, a function or a multivalued function such that
The principal value of the complex logarithm is not defined for it is discontinuous at negative real values of z, and it is holomorphic (that is, complex differentiable) elsewhere. If z is real and positive, the principal value of the complex logarithm is the natural logarithm:
The principal value of is defined as
where is the principal value of the logarithm.
The function is holomorphic except in the neighbourhood of the points where z is real and nonpositive.
If z is real and positive, the principal value of equals its usual value defined above. If where n is an integer, this principal value is the same as the one defined above.
Multivalued function
In some contexts, there is a problem with the discontinuity of the principal values of and at the negative real values of z. In this case, it is useful to consider these functions as multivalued functions.
If denotes one of the values of the multivalued logarithm (typically its principal value), the other values are where k is any integer. Similarly, if is one value of the exponentiation, then the other values are given by
where k is any integer.
Different values of k give different values of unless w is a rational number, that is, there is an integer d such that dw is an integer. This results from the periodicity of the exponential function, more specifically, that if and only if is an integer multiple of
If is a rational number with m and ncoprime integers with then has exactly n values. In the case these values are the same as those described in § nth roots of a complex number. If w is an integer, there is only one value that agrees with that of § Integer exponents.
The multivalued exponentiation is holomorphic for in the sense that its graph consists of several sheets that define each a holomorphic function in the neighborhood of every point. If z varies continuously along a circle around 0, then, after a turn, the value of has changed of sheet.
Computation
The canonical form of can be computed from the canonical form of z and w. Although this can be described by a single formula, it is clearer to split the computation in several steps.
Polar form of z. If is the canonical form of z (a and b being real), then its polar form is
where and (see atan2 for the definition of this function).
Logarithm of z. The principal value of this logarithm is where denotes the natural logarithm. The other values of the logarithm are obtained by adding for any integer k.
Canonical form of If with c and d real, the values of are
the principal value corresponding to
Final result. Using the identities and one gets
with for the principal value.
Examples
The polar form of i is and the values of are thus
It follows that
So, all values of are real, the principal one being
Similarly, the polar form of −2 is So, the above described method gives the values
In this case, all the values have the same argument and different absolute values.
In both examples, all values of have the same argument. More generally, this is true if and only if the real part of w is an integer.
Failure of power and logarithm identities
Some
identities for powers and logarithms for positive real numbers will
fail for complex numbers, no matter how complex powers and complex
logarithms are defined as single-valued functions. For example:
The identity log(bx) = x ⋅ log b holds whenever b is a positive real number and x is a real number. But for the principal branch of the complex logarithm one has
Regardless of which branch of the logarithm is used, a similar
failure of the identity will exist. The best that can be said (if only
using this result) is that:
This identity does not hold even when considering log as a multivalued function. The possible values of log(wz) contain those of z ⋅ log w as a proper subset. Using Log(w) for the principal value of log(w) and m, n as any integers the possible values of both sides are:
The identities (bc)x = bxcx and (b/c)x = bx/cx are valid when b and c are positive real numbers and x is a real number. But, for the principal values, one has
and
On the other hand, when x is an integer, the identities are valid for all nonzero complex numbers.
If exponentiation is considered as a multivalued function then the possible values of (−1 ⋅ −1)1/2 are {1, −1}. The identity holds, but saying {1} = {(−1 ⋅ −1)1/2} is incorrect.
The identity (ex)y = exy holds for real numbers x and y, but assuming its truth for complex numbers leads to the following paradox, discovered in 1827 by Clausen:
For any integer n, we have:
(taking the -th power of both sides)
(using and expanding the exponent)
(using )
(dividing by e)
but this is false when the integer n is nonzero.
The error is the following: by definition, is a notation for a true function, and is a notation for which is a multi-valued function. Thus the notation is ambiguous when x = e. Here, before expanding the exponent, the second line should be
Therefore, when expanding the exponent, one has implicitly supposed that for complex values of z, which is wrong, as the complex logarithm is multivalued. In other words, the wrong identity (ex)y = exy must be replaced by the identity
which is a true identity between multivalued functions.
If b is a positive real algebraic number, and x is a rational number, then bx is an algebraic number. This results from the theory of algebraic extensions. This remains true if b is any algebraic number, in which case, all values of bx (as a multivalued function) are algebraic. If x is irrational (that is, not rational), and both b and x are algebraic, Gelfond–Schneider theorem asserts that all values of bx are transcendental (that is, not algebraic), except if b equals 0 or 1.
In other words, if x is irrational and then at least one of b, x and bx is transcendental.
Integer powers in algebra
The definition of exponentiation with positive integer exponents as repeated multiplication may apply to any associative operation denoted as a multiplication. The definition of x0 requires further the existence of a multiplicative identity.
An algebraic structure consisting of a set together with an associative operation denoted multiplicatively, and a multiplicative identity denoted by 1 is a monoid. In such a monoid, exponentiation of an element x is defined inductively by
for every nonnegative integer n.
If n is a negative integer, is defined only if x has a multiplicative inverse. In this case, the inverse of x is denoted x−1, and xn is defined as
Exponentiation with integer exponents obeys the following laws, for x and y in the algebraic structure, and m and n integers:
When there are several operations that may be repeated, it is
common to indicate the repeated operation by placing its symbol in the
superscript, before the exponent. For example, if f is a real function whose valued can be multiplied, denotes the exponentiation with respect of multiplication, and may denote exponentiation with respect of function composition. That is,
So, if G is a group, is defined for every and every integer n.
The set of all powers of an element of a group form a subgroup. A group (or subgroup) that consists of all powers of a specific element x is the cyclic group generated by x. If all the powers of x are distinct, the group is isomorphic to the additive group of the integers. Otherwise, the cyclic group is finite (it has a finite number of elements), and its number of elements is the order of x. If the order of x is n, then and the cyclic group generated by x consists of the n first powers of x (starting indifferently from the exponent 0 or 1).
Order of elements play a fundamental role in group theory. For example, the order of an element in a finite group is always a divisor of the number of elements of the group (the order of the group). The possible orders of group elements are important in the study of the structure of a group (see Sylow theorems), and in the classification of finite simple groups.
Superscript notation is also used for conjugation; that is, gh = h−1gh, where g and h
are elements of a group. This notation cannot be confused with
exponentiation, since the superscript is not an integer. The motivation
of this notation is that conjugation obeys some of the laws of
exponentiation, namely and
In a ring
In a ring, it may occur that some nonzero elements satisfy for some integer n. Such an element is said to be nilpotent. In a commutative ring, the nilpotent elements form an ideal, called the nilradical of the ring.
More generally, given an ideal I in a commutative ring R, the set of the elements of R that have a power in I is an ideal, called the radical of I. The nilradical is the radical of the zero ideal. A radical ideal is an ideal that equals its own radical. In a polynomial ring over a fieldk,
an ideal is radical if and only if it is the set of all polynomials
that are zero on an affine algebraic set (this is a consequence of Hilbert's Nullstellensatz).
Matrices and linear operators
If A is a square matrix, then the product of A with itself n times is called the matrix power. Also is defined to be the identity matrix, and if A is invertible, then .
Matrix powers appear often in the context of discrete dynamical systems, where the matrix A expresses a transition from a state vector x of some system to the next state Ax of the system. This is the standard interpretation of a Markov chain, for example. Then is the state of the system after two time steps, and so forth: is the state of the system after n time steps. The matrix power is the transition matrix between the state now and the state at a time n
steps in the future. So computing matrix powers is equivalent to
solving the evolution of the dynamical system. In many cases, matrix
powers can be expediently computed by using eigenvalues and eigenvectors.
Apart from matrices, more general linear operators can also be exponentiated. An example is the derivative operator of calculus, , which is a linear operator acting on functions to give a new function . The nth power of the differentiation operator is the nth derivative:
These examples are for discrete exponents of linear operators, but in
many circumstances it is also desirable to define powers of such
operators with continuous exponents. This is the starting point of the
mathematical theory of semigroups.
Just as computing matrix powers with discrete exponents solves
discrete dynamical systems, so does computing matrix powers with
continuous exponents solve systems with continuous dynamics. Examples
include approaches to solving the heat equation, Schrödinger equation, wave equation,
and other partial differential equations including a time evolution.
The special case of exponentiating the derivative operator to a
non-integer power is called the fractional derivative which, together with the fractional integral, is one of the basic operations of the fractional calculus.
A field
is an algebraic structure in which multiplication, addition,
subtraction, and division are defined and satisfy the properties that
multiplication is associative and every nonzero element has a multiplicative inverse. This implies that exponentiation with integer exponents is well-defined, except for nonpositive powers of 0. Common examples are the field of complex numbers, the real numbers and the rational numbers, considered earlier in this article, which are all infinite.
A finite field is a field with a finite number of elements. This number of elements is either a prime number or a prime power; that is, it has the form where p is a prime number, and k is a positive integer. For every such q, there are fields with q elements. The fields with q elements are all isomorphic, which allows, in general, working as if there were only one field with q elements, denoted
One has
for every
A primitive element in is an element g such that the set of the q − 1 first powers of g (that is, ) equals the set of the nonzero elements of There are primitive elements in where is Euler's totient function.
The Diffie–Hellman key exchange is an application of exponentiation in finite fields that is widely used for secure communications. It uses the fact that exponentiation is computationally inexpensive, whereas the inverse operation, the discrete logarithm, is computationally expensive. More precisely, if g is a primitive element in then can be efficiently computed with exponentiation by squaring for any e, even if q is large, while there is no known computationally practical algorithm that allows retrieving e from if q is sufficiently large.
This allows defining the nth power of a set S as the set of all n-tuples of elements of S.
When S is endowed with some structure, it is frequent that is naturally endowed with a similar structure. In this case, the term "direct product" is generally used instead of "Cartesian product", and exponentiation denotes product structure. For example (where denotes the real numbers) denotes the Cartesian product of n copies of as well as their direct product as vector space, topological spaces, rings, etc.
A n-tuple of elements of S can be considered as a function from This generalizes to the following notation.
Given two sets S and T, the set of all functions from T to S is denoted . This exponential notation is justified by the following canonical isomorphisms (for the first one, see Currying):
where denotes the Cartesian product, and the disjoint union.
One can use sets as exponents for other operations on sets, typically for direct sums of abelian groups, vector spaces, or modules. For distinguishing direct sums from direct products, the exponent of a direct sum is placed between parentheses. For example, denotes the vector space of the infinite sequences of real numbers, and the vector space of those sequences that have a finite number of nonzero elements. The latter has a basis consisting of the sequences with exactly one nonzero element that equals 1, while the Hamel bases of the former cannot be explicitly described (because their existence involves Zorn's lemma).
In this context, 2 can represents the set So, denotes the power set of S, that is the set of the functions from S to which can be identified with the set of the subsets of S, by mapping each function to the inverse image of 1.
In the category of sets, the morphisms between sets X and Y are the functions from X to Y. It results that the set of the functions from X to Y that is denoted in the preceding section can also be denoted The isomorphism can be rewritten
This means the functor "exponentiation to the power T" is a right adjoint to the functor "direct product with T".
This generalizes to the definition of exponentiation in a category in which finite direct products exist: in such a category, the functor is, if it exists, a right adjoint to the functor A category is called a Cartesian closed category, if direct products exist, and the functor has a right adjoint for every T.
Just as exponentiation of natural numbers is motivated by repeated
multiplication, it is possible to define an operation based on repeated
exponentiation; this operation is sometimes called hyper-4 or tetration. Iterating tetration leads to another operation, and so on, a concept named hyperoperation. This sequence of operations is expressed by the Ackermann function and Knuth's up-arrow notation.
Just as exponentiation grows faster than multiplication, which is
faster-growing than addition, tetration is faster-growing than
exponentiation. Evaluated at (3, 3), the functions addition, multiplication, exponentiation, and tetration yield 6, 9, 27, and 7625597484987 (= 327 = 333 = 33) respectively.
Limits of powers
Zero to the power of zero gives a number of examples of limits that are of the indeterminate form 00. The limits in these examples exist, but have different values, showing that the two-variable function xy has no limit at the point (0, 0). One may consider at what points this function does have a limit.
More precisely, consider the function defined on . Then D can be viewed as a subset of R2 (that is, the set of all pairs (x, y) with x, y belonging to the extended real number lineR = [−∞, +∞], endowed with the product topology), which will contain the points at which the function f has a limit.
In fact, f has a limit at all accumulation points of D, except for (0, 0), (+∞, 0), (1, +∞) and (1, −∞). Accordingly, this allows one to define the powers xy by continuity whenever 0 ≤ x ≤ +∞, −∞ ≤ y ≤ +∞, except for 00, (+∞)0, 1+∞ and 1−∞, which remain indeterminate forms.
Under this definition by continuity, we obtain:
x+∞ = +∞ and x−∞ = 0, when 1 < x ≤ +∞.
x+∞ = 0 and x−∞ = +∞, when 0 ≤ x < 1.
0y = 0 and (+∞)y = +∞, when 0 < y ≤ +∞.
0y = +∞ and (+∞)y = 0, when −∞ ≤ y < 0.
These powers are obtained by taking limits of xy for positive values of x. This method does not permit a definition of xy when x < 0, since pairs (x, y) with x < 0 are not accumulation points of D.
On the other hand, when n is an integer, the power xn is already meaningful for all values of x, including negative ones. This may make the definition 0n = +∞ obtained above for negative n problematic when n is odd, since in this case xn → +∞ as x tends to 0 through positive values, but not negative ones.
Efficient computation with integer exponents
Computing bn using iterated multiplication requires n − 1
multiplication operations, but it can be computed more efficiently than
that, as illustrated by the following example. To compute 2100, apply Horner's rule to the exponent 100 written in binary:
.
Then compute the following terms in order, reading Horner's rule from right to left.
22 = 4
2 (22) = 23 = 8
(23)2 = 26 = 64
(26)2 = 212 = 4096
(212)2 = 224 = 16777216
2 (224) = 225 = 33554432
(225)2 = 250 = 1125899906842624
(250)2 = 2100 = 1267650600228229401496703205376
This series of steps only requires 8 multiplications instead of 99.
In general, the number of multiplication operations required to compute bn can be reduced to by using exponentiation by squaring, where denotes the number of 1 in the binary representation of n.
For some exponents (100 is not among them), the number of
multiplications can be further reduced by computing and using the
minimal addition-chain exponentiation. Finding the minimal sequence of multiplications (the minimal-length addition chain for the exponent) for bn is a difficult problem, for which no efficient algorithms are currently known (see Subset sum problem), but many reasonably efficient heuristic algorithms are available. However, in practical computations, exponentiation by squaring is efficient enough, and much more easy to implement.
If the domain of a function f equals its codomain, one may compose the function with itself an arbitrary number of time, and this defines the nth power of the function under composition, commonly called the nth iterate of the function. Thus denotes generally the nth iterate of f; for example, means
When a multiplication is defined on the codomain of the function, this defines a multiplication on functions, the pointwise multiplication, which induces another exponentiation. When using functional notation, the two kinds of exponentiation are generally distinguished by placing the exponent of the functional iteration before the parentheses enclosing the arguments of the function, and placing the exponent of pointwise multiplication after the parentheses. Thus and
When functional notation is not used, disambiguation is often done by
placing the composition symbol before the exponent; for example and
For historical reasons, the exponent of a repeated multiplication is
placed before the argument for some specific functions, typically the trigonometric functions. So, and both mean and not which, in any case, is rarely considered. Historically, several variants of these notations were used by different authors.
Programming languages generally express exponentiation either as an infix operator or as a function application, as they do not support superscripts. The most common operator symbol for exponentiation is the caret (^). The original version of ASCII included an uparrow symbol (↑), intended for exponentiation, but this was replaced by the caret in 1967, so the caret became usual in programming languages.
The notations include:
In most programming languages with an infix exponentiation operator, it is right-associative, that is, a^b^c is interpreted as a^(b^c). This is because (a^b)^c is equal to a^(b*c) and thus not as useful. In some languages, it is left-associative, notably in Algol, Matlab and the Microsoft Excel formula language.
Other programming languages use functional notation: