Search This Blog

Saturday, September 16, 2023

Gamma-ray burst

From Wikipedia, the free encyclopedia
Artist's illustration showing the life of a massive star as nuclear fusion converts lighter elements into heavier ones. When fusion no longer generates enough pressure to counteract gravity, the star rapidly collapses to form a black hole. Theoretically, energy may be released during the collapse along the axis of rotation to form a GRB.

In gamma-ray astronomy, gamma-ray bursts (GRBs) are immensely energetic explosions that have been observed in distant galaxies. They are the most energetic and luminous electromagnetic events since the Big Bang. Bursts can last from ten milliseconds to several hours. After an initial flash of gamma rays, a longer-lived "afterglow" is usually emitted at longer wavelengths (X-ray, ultraviolet, optical, infrared, microwave and radio).

The intense radiation of most observed GRBs is thought to be released during a supernova or superluminous supernova as a high-mass star implodes to form a neutron star or a black hole. A subclass of GRBs appears to originate from the merger of binary neutron stars.

The sources of most GRBs are billions of light years away from Earth, implying that the explosions are both extremely energetic (a typical burst releases as much energy in a few seconds as the Sun will in its entire 10-billion-year lifetime) and extremely rare (a few per galaxy per million years). All observed GRBs have originated from outside the Milky Way galaxy, although a related class of phenomena, soft gamma repeaters, are associated with magnetars within the Milky Way. It has been hypothesized that a gamma-ray burst in the Milky Way, pointing directly towards the Earth, could cause a mass extinction event. The Late Ordovician mass extinction has been hypothesised by some researchers to have occurred as a result of such a gamma-ray burst.

GRBs were first detected in 1967 by the Vela satellites, which had been designed to detect covert nuclear weapons tests; after thorough analysis, this was published in 1973. Following their discovery, hundreds of theoretical models were proposed to explain these bursts, such as collisions between comets and neutron stars. Little information was available to verify these models until the 1997 detection of the first X-ray and optical afterglows and direct measurement of their redshifts using optical spectroscopy, and thus their distances and energy outputs. These discoveries, and subsequent studies of the galaxies and supernovae associated with the bursts, clarified the distance and luminosity of GRBs, definitively placing them in distant galaxies.

History

Positions on the sky of all gamma-ray bursts detected during the BATSE mission. The distribution is isotropic, with no concentration towards the plane of the Milky Way, which runs horizontally through the center of the image.

Gamma-ray bursts were first observed in the late 1960s by the U.S. Vela satellites, which were built to detect gamma radiation pulses emitted by nuclear weapons tested in space. The United States suspected that the Soviet Union might attempt to conduct secret nuclear tests after signing the Nuclear Test Ban Treaty in 1963. On July 2, 1967, at 14:19 UTC, the Vela 4 and Vela 3 satellites detected a flash of gamma radiation unlike any known nuclear weapons signature. Uncertain what had happened but not considering the matter particularly urgent, the team at the Los Alamos National Laboratory, led by Ray Klebesadel, filed the data away for investigation. As additional Vela satellites were launched with better instruments, the Los Alamos team continued to find inexplicable gamma-ray bursts in their data. By analyzing the different arrival times of the bursts as detected by different satellites, the team was able to determine rough estimates for the sky positions of 16 bursts and definitively rule out a terrestrial or solar origin. Contrary to popular belief, the data was never classified. After thorough analysis, the findings were published in 1973 as an Astrophysical Journal article entitled "Observations of Gamma-Ray Bursts of Cosmic Origin".

Most early theories of gamma-ray bursts posited nearby sources within the Milky Way Galaxy. From 1991, the Compton Gamma Ray Observatory (CGRO) and its Burst and Transient Source Explorer (BATSE) instrument, an extremely sensitive gamma-ray detector, provided data that showed the distribution of GRBs is isotropic – not biased towards any particular direction in space. If the sources were from within our own galaxy, they would be strongly concentrated in or near the galactic plane. The absence of any such pattern in the case of GRBs provided strong evidence that gamma-ray bursts must come from beyond the Milky Way. However, some Milky Way models are still consistent with an isotropic distribution.

Counterpart objects as candidate sources

For decades after the discovery of GRBs, astronomers searched for a counterpart at other wavelengths: i.e., any astronomical object in positional coincidence with a recently observed burst. Astronomers considered many distinct classes of objects, including white dwarfs, pulsars, supernovae, globular clusters, quasars, Seyfert galaxies, and BL Lac objects. All such searches were unsuccessful, and in a few cases particularly well-localized bursts (those whose positions were determined with what was then a high degree of accuracy) could be clearly shown to have no bright objects of any nature consistent with the position derived from the detecting satellites. This suggested an origin of either very faint stars or extremely distant galaxies. Even the most accurate positions contained numerous faint stars and galaxies, and it was widely agreed that final resolution of the origins of cosmic gamma-ray bursts would require both new satellites and faster communication.

Afterglow

The Italian–Dutch satellite BeppoSAX, launched in April 1996, provided the first accurate positions of gamma-ray bursts, allowing follow-up observations and identification of the sources.

Several models for the origin of gamma-ray bursts postulated that the initial burst of gamma rays should be followed by afterglow: slowly fading emission at longer wavelengths created by collisions between the burst ejecta and interstellar gas. Early searches for this afterglow were unsuccessful, largely because it is difficult to observe a burst's position at longer wavelengths immediately after the initial burst. The breakthrough came in February 1997 when the satellite BeppoSAX detected a gamma-ray burst (GRB 970228) and when the X-ray camera was pointed towards the direction from which the burst had originated, it detected fading X-ray emission. The William Herschel Telescope identified a fading optical counterpart 20 hours after the burst. Once the GRB faded, deep imaging was able to identify a faint, distant host galaxy at the location of the GRB as pinpointed by the optical afterglow.

Because of the very faint luminosity of this galaxy, its exact distance was not measured for several years. Well after then, another major breakthrough occurred with the next event registered by BeppoSAX, GRB 970508. This event was localized within four hours of its discovery, allowing research teams to begin making observations much sooner than any previous burst. The spectrum of the object revealed a redshift of z = 0.835, placing the burst at a distance of roughly 6 billion light years from Earth. This was the first accurate determination of the distance to a GRB, and together with the discovery of the host galaxy of 970228 proved that GRBs occur in extremely distant galaxies. Within a few months, the controversy about the distance scale ended: GRBs were extragalactic events originating within faint galaxies at enormous distances. The following year, GRB 980425 was followed within a day by a bright supernova (SN 1998bw), coincident in location, indicating a clear connection between GRBs and the deaths of very massive stars. This burst provided the first strong clue about the nature of the systems that produce GRBs.

More recent instruments

NASA's Swift Spacecraft launched in November 2004

BeppoSAX functioned until 2002 and CGRO (with BATSE) was deorbited in 2000. However, the revolution in the study of gamma-ray bursts motivated the development of a number of additional instruments designed specifically to explore the nature of GRBs, especially in the earliest moments following the explosion. The first such mission, HETE-2, was launched in 2000 and functioned until 2006, providing most of the major discoveries during this period. One of the most successful space missions to date, Swift, was launched in 2004 and as of January 2023 is still operational. Swift is equipped with a very sensitive gamma-ray detector as well as on-board X-ray and optical telescopes, which can be rapidly and automatically slewed to observe afterglow emission following a burst. More recently, the Fermi mission was launched carrying the Gamma-Ray Burst Monitor, which detects bursts at a rate of several hundred per year, some of which are bright enough to be observed at extremely high energies with Fermi's Large Area Telescope. Meanwhile, on the ground, numerous optical telescopes have been built or modified to incorporate robotic control software that responds immediately to signals sent through the Gamma-ray Burst Coordinates Network. This allows the telescopes to rapidly repoint towards a GRB, often within seconds of receiving the signal and while the gamma-ray emission itself is still ongoing.

New developments since the 2000s include the recognition of short gamma-ray bursts as a separate class (likely from merging neutron stars and not associated with supernovae), the discovery of extended, erratic flaring activity at X-ray wavelengths lasting for many minutes after most GRBs, and the discovery of the most luminous (GRB 080319B) and the former most distant (GRB 090423) objects in the universe. The most distant known GRB, GRB 090429B, is now the most distant known object in the universe.

In October 2018, astronomers reported that GRB 150101B (detected in 2015) and GW170817, a gravitational wave event detected in 2017 (which has been associated with GRB170817A, a burst detected 1.7 seconds later), may have been produced by the same mechanism – the merger of two neutron stars. The similarities between the two events, in terms of gamma ray, optical, and x-ray emissions, as well as to the nature of the associated host galaxies, are "striking", suggesting the two separate events may both be the result of the merger of neutron stars, and both may be a kilonova, which may be more common in the universe than previously understood, according to the researchers.

The highest energy light observed from a gamma-ray burst was one teraelectronvolt, from GRB 190114C in 2019. (Note, this is about a thousand times lower energy than the highest energy light observed from any source, which is 1.4 petaelectronvolts as of the year 2021.)

Classification

Gamma-ray burst light curves

The light curves of gamma-ray bursts are extremely diverse and complex. No two gamma-ray burst light curves are identical, with large variation observed in almost every property: the duration of observable emission can vary from milliseconds to tens of minutes, there can be a single peak or several individual subpulses, and individual peaks can be symmetric or with fast brightening and very slow fading. Some bursts are preceded by a "precursor" event, a weak burst that is then followed (after seconds to minutes of no emission at all) by the much more intense "true" bursting episode. The light curves of some events have extremely chaotic and complicated profiles with almost no discernible patterns.

Although some light curves can be roughly reproduced using certain simplified models, little progress has been made in understanding the full diversity observed. Many classification schemes have been proposed, but these are often based solely on differences in the appearance of light curves and may not always reflect a true physical difference in the progenitors of the explosions. However, plots of the distribution of the observed duration for a large number of gamma-ray bursts show a clear bimodality, suggesting the existence of two separate populations: a "short" population with an average duration of about 0.3 seconds and a "long" population with an average duration of about 30 seconds. Both distributions are very broad with a significant overlap region in which the identity of a given event is not clear from duration alone. Additional classes beyond this two-tiered system have been proposed on both observational and theoretical grounds.

Short gamma-ray bursts

Hubble Space Telescope captures infrared glow of a kilonova blast.
GRB 211106A , one of the most energetic short GRB registered, in the first-ever time-lapse movie of a short GRB in millimeter-wavelength light, as seen with the Atacama Large Millimeter/submillimeter Array (ALMA) and pinpointed to a distant host galaxy captured using the Hubble Space Telescope.

Events with a duration of less than about two seconds are classified as short gamma-ray bursts. These account for about 30% of gamma-ray bursts, but until 2005, no afterglow had been successfully detected from any short event and little was known about their origins. Since then, several dozen short gamma-ray burst afterglows have been detected and localized, several of which are associated with regions of little or no star formation, such as large elliptical galaxies. This rules out a link to massive stars, confirming that short events are physically distinct from long events. In addition, there has been no association with supernovae.

The true nature of these objects was initially unknown, and the leading hypothesis was that they originated from the mergers of binary neutron stars or a neutron star with a black hole. Such mergers were theorized to produce kilonovae, and evidence for a kilonova associated with GRB 130603B was seen. The mean duration of these events of 0.2 seconds suggests (because of causality) a source of very small physical diameter in stellar terms; less than 0.2 light-seconds (about 60,000 km or 37,000 miles – four times the Earth's diameter). The observation of minutes to hours of X-ray flashes after a short gamma-ray burst is consistent with small particles of a primary object like a neutron star initially swallowed by a black hole in less than two seconds, followed by some hours of lesser energy events, as remaining fragments of tidally disrupted neutron star material (no longer neutronium) remain in orbit to spiral into the black hole, over a longer period of time. A small fraction of short gamma-ray bursts are probably produced by giant flares from soft gamma repeaters in nearby galaxies.

The origin of short GRBs in kilonovae was confirmed when short GRB 170817A was detected only 1.7 s after the detection of gravitational wave GW170817, which was a signal from the merger of two neutron stars.

Long gamma-ray bursts

Swift captured the afterglow of GRB 221009A about an hour after it was first detected reaching Earth on October 9, 2022. The bright rings form as a result of X-rays scattered from otherwise unobservable dust layers within our galaxy that lie in the direction of the burst.

Most observed events (70%) have a duration of greater than two seconds and are classified as long gamma-ray bursts. Because these events constitute the majority of the population and because they tend to have the brightest afterglows, they have been observed in much greater detail than their short counterparts. Almost every well-studied long gamma-ray burst has been linked to a galaxy with rapid star formation, and in many cases to a core-collapse supernova as well, unambiguously associating long GRBs with the deaths of massive stars. Long GRB afterglow observations, at high redshift, are also consistent with the GRB having originated in star-forming regions. In December 2022, astronomers reported the first evidence of a long GRB produced by a neutron star merger.

Ultra-long gamma-ray bursts

These events are at the tail end of the long GRB duration distribution, lasting more than 10,000 seconds. They have been proposed to form a separate class, caused by the collapse of a blue supergiant star, a tidal disruption event or a new-born magnetar. Only a small number have been identified to date, their primary characteristic being their gamma ray emission duration. The most studied ultra-long events include GRB 101225A and GRB 111209A. The low detection rate may be a result of low sensitivity of current detectors to long-duration events, rather than a reflection of their true frequency. A 2013 study, on the other hand, shows that the existing evidence for a separate ultra-long GRB population with a new type of progenitor is inconclusive, and further multi-wavelength observations are needed to draw a firmer conclusion.

Energetics and beaming

Artist's illustration of a bright gamma-ray burst occurring in a star-forming region. Energy from the explosion is beamed into two narrow, oppositely directed jets.

Gamma-ray bursts are very bright as observed from Earth despite their typically immense distances. An average long GRB has a bolometric flux comparable to a bright star of our galaxy despite a distance of billions of light years (compared to a few tens of light years for most visible stars). Most of this energy is released in gamma rays, although some GRBs have extremely luminous optical counterparts as well. GRB 080319B, for example, was accompanied by an optical counterpart that peaked at a visible magnitude of 5.8, comparable to that of the dimmest naked-eye stars despite the burst's distance of 7.5 billion light years. This combination of brightness and distance implies an extremely energetic source. Assuming the gamma-ray explosion to be spherical, the energy output of GRB 080319B would be within a factor of two of the rest-mass energy of the Sun (the energy which would be released were the Sun to be converted entirely into radiation).

Gamma-ray bursts are thought to be highly focused explosions, with most of the explosion energy collimated into a narrow jet. The approximate angular width of the jet (that is, the degree of spread of the beam) can be estimated directly by observing the achromatic "jet breaks" in afterglow light curves: a time after which the slowly decaying afterglow begins to fade rapidly as the jet slows and can no longer beam its radiation as effectively. Observations suggest significant variation in the jet angle from between 2 and 20 degrees.

Because their energy is strongly focused, the gamma rays emitted by most bursts are expected to miss the Earth and never be detected. When a gamma-ray burst is pointed towards Earth, the focusing of its energy along a relatively narrow beam causes the burst to appear much brighter than it would have been were its energy emitted spherically. When this effect is taken into account, typical gamma-ray bursts are observed to have a true energy release of about 1044 J, or about 1/2000 of a Solar mass (M) energy equivalent – which is still many times the mass-energy equivalent of the Earth (about 5.5 × 1041 J). This is comparable to the energy released in a bright type Ib/c supernova and within the range of theoretical models. Very bright supernovae have been observed to accompany several of the nearest GRBs. Additional support for focusing of the output of GRBs has come from observations of strong asymmetries in the spectra of nearby type Ic supernovae and from radio observations taken long after bursts when their jets are no longer relativistic.

Short (time duration) GRBs appear to come from a lower-redshift (i.e. less distant) population and are less luminous than long GRBs. The degree of beaming in short bursts has not been accurately measured, but as a population they are likely less collimated than long GRBs or possibly not collimated at all in some cases.

Progenitors

Hubble Space Telescope image of Wolf–Rayet star WR 124 and its surrounding nebula. Wolf–Rayet stars are candidates for being progenitors of long-duration GRBs.

Because of the immense distances of most gamma-ray burst sources from Earth, identification of the progenitors, the systems that produce these explosions, is challenging. The association of some long GRBs with supernovae and the fact that their host galaxies are rapidly star-forming offer very strong evidence that long gamma-ray bursts are associated with massive stars. The most widely accepted mechanism for the origin of long-duration GRBs is the collapsar model, in which the core of an extremely massive, low-metallicity, rapidly rotating star collapses into a black hole in the final stages of its evolution. Matter near the star's core rains down towards the center and swirls into a high-density accretion disk. The infall of this material into a black hole drives a pair of relativistic jets out along the rotational axis, which pummel through the stellar envelope and eventually break through the stellar surface and radiate as gamma rays. Some alternative models replace the black hole with a newly formed magnetar, although most other aspects of the model (the collapse of the core of a massive star and the formation of relativistic jets) are the same.

The closest analogs within the Milky Way galaxy of the stars producing long gamma-ray bursts are likely the Wolf–Rayet stars, extremely hot and massive stars, which have shed most or all of their hydrogen envelope. Eta Carinae, Apep, and WR 104 have been cited as possible future gamma-ray burst progenitors. It is unclear if any star in the Milky Way has the appropriate characteristics to produce a gamma-ray burst.

The massive-star model probably does not explain all types of gamma-ray burst. There is strong evidence that some short-duration gamma-ray bursts occur in systems with no star formation and no massive stars, such as elliptical galaxies and galaxy halos. The favored theory for the origin of most short gamma-ray bursts is the merger of a binary system consisting of two neutron stars. According to this model, the two stars in a binary slowly spiral towards each other because gravitational radiation releases energy until tidal forces suddenly rip the neutron stars apart and they collapse into a single black hole. The infall of matter into the new black hole produces an accretion disk and releases a burst of energy, analogous to the collapsar model. Numerous other models have also been proposed to explain short gamma-ray bursts, including the merger of a neutron star and a black hole, the accretion-induced collapse of a neutron star, or the evaporation of primordial black holes.

An alternative explanation proposed by Friedwardt Winterberg is that in the course of a gravitational collapse and in reaching the event horizon of a black hole, all matter disintegrates into a burst of gamma radiation.

Tidal disruption events

This new class of GRB-like events was first discovered through the detection of GRB 110328A by the Swift Gamma-Ray Burst Mission on 28 March 2011. This event had a gamma-ray duration of about 2 days, much longer than even ultra-long GRBs, and was detected in X-rays for many months. It occurred at the center of a small elliptical galaxy at redshift z = 0.3534. There is an ongoing debate as to whether the explosion was the result of stellar collapse or a tidal disruption event accompanied by a relativistic jet, although the latter explanation has become widely favoured.

A tidal disruption event of this sort is when a star interacts with a supermassive black hole, shredding the star, and in some cases creating a relativistic jet which produces bright emission of gamma ray radiation. The event GRB 110328A (also denoted Swift J1644+57) was initially argued to be produced by the disruption of a main sequence star by a black hole of several million times the mass of the Sun, although it has subsequently been argued that the disruption of a white dwarf by a black hole of mass about 10 thousand times the Sun may be more likely.

Emission mechanisms

Gamma-ray burst mechanism

The means by which gamma-ray bursts convert energy into radiation remains poorly understood, and as of 2010 there was still no generally accepted model for how this process occurs. Any successful model of GRB emission must explain the physical process for generating gamma-ray emission that matches the observed diversity of light curves, spectra, and other characteristics. Particularly challenging is the need to explain the very high efficiencies that are inferred from some explosions: some gamma-ray bursts may convert as much as half (or more) of the explosion energy into gamma-rays. Early observations of the bright optical counterparts to GRB 990123 and to GRB 080319B, whose optical light curves were extrapolations of the gamma-ray light spectra, have suggested that inverse Compton scattering may be the dominant process in some events. In this model, pre-existing low-energy photons are scattered by relativistic electrons within the explosion, augmenting their energy by a large factor and transforming them into gamma-rays.

The nature of the longer-wavelength afterglow emission (ranging from X-ray through radio) that follows gamma-ray bursts is better understood. Any energy released by the explosion not radiated away in the burst itself takes the form of matter or energy moving outward at nearly the speed of light. As this matter collides with the surrounding interstellar gas, it creates a relativistic shock wave that then propagates forward into interstellar space. A second shock wave, the reverse shock, may propagate back into the ejected matter. Extremely energetic electrons within the shock wave are accelerated by strong local magnetic fields and radiate as synchrotron emission across most of the electromagnetic spectrum. This model has generally been successful in modeling the behavior of many observed afterglows at late times (generally, hours to days after the explosion), although there are difficulties explaining all features of the afterglow very shortly after the gamma-ray burst has occurred.

Rate of occurrence and potential effects on life

On 27 October 2015, at 22:40 GMT, the NASA/ASI/UKSA Swift satellite discovered its 1000th gamma-ray burst (GRB).

Gamma ray bursts can have harmful or destructive effects on life. Considering the universe as a whole, the safest environments for life similar to that on Earth are the lowest density regions in the outskirts of large galaxies. Our knowledge of galaxy types and their distribution suggests that life as we know it can only exist in about 10% of all galaxies. Furthermore, galaxies with a redshift, z, higher than 0.5 are unsuitable for life as we know it, because of their higher rate of GRBs and their stellar compactness.

All GRBs observed to date have occurred well outside the Milky Way galaxy and have been harmless to Earth. However, if a GRB were to occur within the Milky Way within 5,000 to 8,000 light-years and its emission were beamed straight towards Earth, the effects could be harmful and potentially devastating for its ecosystems. Currently, orbiting satellites detect on average approximately one GRB per day. The closest observed GRB as of March 2014 was GRB 980425, located 40 megaparsecs (130,000,000 ly) away (z=0.0085) in an SBc-type dwarf galaxy.[118] GRB 980425 was far less energetic than the average GRB and was associated with the Type Ib supernova SN 1998bw.

Estimating the exact rate at which GRBs occur is difficult; for a galaxy of approximately the same size as the Milky Way, estimates of the expected rate (for long-duration GRBs) can range from one burst every 10,000 years, to one burst every 1,000,000 years. Only a small percentage of these would be beamed towards Earth. Estimates of rate of occurrence of short-duration GRBs are even more uncertain because of the unknown degree of collimation, but are probably comparable.

Since GRBs are thought to involve beamed emission along two jets in opposing directions, only planets in the path of these jets would be subjected to the high energy gamma radiation.

Although nearby GRBs hitting Earth with a destructive shower of gamma rays are only hypothetical events, high energy processes across the galaxy have been observed to affect the Earth's atmosphere.

Effects on Earth

Earth's atmosphere is very effective at absorbing high energy electromagnetic radiation such as x-rays and gamma rays, so these types of radiation would not reach any dangerous levels at the surface during the burst event itself. The immediate effect on life on Earth from a GRB within a few kiloparsecs would only be a short increase in ultraviolet radiation at ground level, lasting from less than a second to tens of seconds. This ultraviolet radiation could potentially reach dangerous levels depending on the exact nature and distance of the burst, but it seems unlikely to be able to cause a global catastrophe for life on Earth.

The long-term effects from a nearby burst are more dangerous. Gamma rays cause chemical reactions in the atmosphere involving oxygen and nitrogen molecules, creating first nitrogen oxide then nitrogen dioxide gas. The nitrogen oxides cause dangerous effects on three levels. First, they deplete ozone, with models showing a possible global reduction of 25–35%, with as much as 75% in certain locations, an effect that would last for years. This reduction is enough to cause a dangerously elevated UV index at the surface. Secondly, the nitrogen oxides cause photochemical smog, which darkens the sky and blocks out parts of the sunlight spectrum. This would affect photosynthesis, but models show only about a 1% reduction of the total sunlight spectrum, lasting a few years. However, the smog could potentially cause a cooling effect on Earth's climate, producing a "cosmic winter" (similar to an impact winter, but without an impact), but only if it occurs simultaneously with a global climate instability. Thirdly, the elevated nitrogen dioxide levels in the atmosphere would wash out and produce acid rain. Nitric acid is toxic to a variety of organisms, including amphibian life, but models predict that it would not reach levels that would cause a serious global effect. The nitrates might in fact be of benefit to some plants.

All in all, a GRB within a few kiloparsecs, with its energy directed towards Earth, will mostly damage life by raising the UV levels during the burst itself and for a few years thereafter. Models show that the destructive effects of this increase can cause up to 16 times the normal levels of DNA damage. It has proved difficult to assess a reliable evaluation of the consequences of this on the terrestrial ecosystem, because of the uncertainty in biological field and laboratory data.

Hypothetical effects on Earth in the past

There is a very good chance (but no certainty) that at least one lethal GRB took place during the past 5 billion years close enough to Earth as to significantly damage life. There is a 50% chance that such a lethal GRB took place within two kiloparsecs of Earth during the last 500 million years, causing one of the major mass extinction events.

The major Ordovician–Silurian extinction event 450 million years ago may have been caused by a GRB. Estimates suggest that approximately 20–60% of the total phytoplankton biomass in the Ordovician oceans would have perished in a GRB, because the oceans were mostly oligotrophic and clear. The late Ordovician species of trilobites that spent portions of their lives in the plankton layer near the ocean surface were much harder hit than deep-water dwellers, which tended to remain within quite restricted areas. This is in contrast to the usual pattern of extinction events, wherein species with more widely spread populations typically fare better. A possible explanation is that trilobites remaining in deep water would be more shielded from the increased UV radiation associated with a GRB. Also supportive of this hypothesis is the fact that during the late Ordovician, burrowing bivalve species were less likely to go extinct than bivalves that lived on the surface.

A case has been made that the 774–775 carbon-14 spike was the result of a short GRB, though a very strong solar flare is another possibility.

GRB candidates in the Milky Way

Illustration of a short gamma-ray burst caused by a collapsing star.

No gamma-ray bursts from within our own galaxy, the Milky Way, have been observed, and the question of whether one has ever occurred remains unresolved. In light of evolving understanding of gamma-ray bursts and their progenitors, the scientific literature records a growing number of local, past, and future GRB candidates. Long duration GRBs are related to superluminous supernovae, or hypernovae, and most luminous blue variables (LBVs) and rapidly spinning Wolf–Rayet stars are thought to end their life cycles in core-collapse supernovae with an associated long-duration GRB. Knowledge of GRBs, however, is from metal-poor galaxies of former epochs of the universe's evolution, and it is impossible to directly extrapolate to encompass more evolved galaxies and stellar environments with a higher metallicity, such as the Milky Way.

Complex number

From Wikipedia, the free encyclopedia
A complex number can be visually represented as a pair of numbers (a, b) forming a vector on a diagram called an Argand diagram, representing the complex plane. Re is the real axis, Im is the imaginary axis, and i is the "imaginary unit", that satisfies i2 = −1.

In mathematics, a complex number is an element of a number system that extends the real numbers with a specific element denoted i, called the imaginary unit and satisfying the equation ; every complex number can be expressed in the form , where a and b are real numbers. Because no real number satisfies the above equation, i was called an imaginary number by René Descartes. For the complex number , a is called the real part, and b is called the imaginary part. The set of complex numbers is denoted by either of the symbols or C. Despite the historical nomenclature "imaginary", complex numbers are regarded in the mathematical sciences as just as "real" as the real numbers and are fundamental in many aspects of the scientific description of the natural world.

Complex numbers allow solutions to all polynomial equations, even those that have no solutions in real numbers. More precisely, the fundamental theorem of algebra asserts that every non-constant polynomial equation with real or complex coefficients has a solution which is a complex number. For example, the equation has no real solution, since the square of a real number cannot be negative, but has the two nonreal complex solutions and .

Addition, subtraction and multiplication of complex numbers can be naturally defined by using the rule combined with the associative, commutative, and distributive laws. Every nonzero complex number has a multiplicative inverse. This makes the complex numbers a field that has the real numbers as a subfield. The complex numbers also form a real vector space of dimension two, with {1, i} as a standard basis.

This standard basis makes the complex numbers a Cartesian plane, called the complex plane. This allows a geometric interpretation of the complex numbers and their operations, and conversely expressing in terms of complex numbers some geometric properties and constructions. For example, the real numbers form the real line which is identified to the horizontal axis of the complex plane. The complex numbers of absolute value one form the unit circle. The addition of a complex number is a translation in the complex plane, and the multiplication by a complex number is a similarity centered at the origin. The complex conjugation is the reflection symmetry with respect to the real axis. The complex absolute value is a Euclidean norm.

In summary, the complex numbers form a rich structure that is simultaneously an algebraically closed field, a commutative algebra over the reals, and a Euclidean vector space of dimension two.

Definition

An illustration of the complex number z = x + iy on the complex plane. The real part is x, and its imaginary part is y.

A complex number is a number of the form a + bi, where a and b are real numbers, and i is an indeterminate satisfying i2 = −1. For example, 2 + 3i is a complex number.

This way, a complex number is defined as a polynomial with real coefficients in the single indeterminate i, for which the relation i2 + 1 = 0 is imposed. Based on this definition, complex numbers can be added and multiplied, using the addition and multiplication for polynomials. The relation i2 + 1 = 0 induces the equalities i4k = 1, i4k+1 = i, i4k+2 = −1, and i4k+3 = −i, which hold for all integers k; these allow the reduction of any polynomial that results from the addition and multiplication of complex numbers to a linear polynomial in i, again of the form a + bi with real coefficients a, b.

The real number a is called the real part of the complex number a + bi; the real number b is called its imaginary part. To emphasize, the imaginary part does not include a factor i; that is, the imaginary part is b, not bi.

Formally, the complex numbers are defined as the quotient ring of the polynomial ring in the indeterminate i, by the ideal generated by the polynomial i2 + 1 (see below).

Notation

A real number a can be regarded as a complex number a + 0i, whose imaginary part is 0. A purely imaginary number bi is a complex number 0 + bi, whose real part is zero. As with polynomials, it is common to write a for a + 0i and bi for 0 + bi. Moreover, when the imaginary part is negative, that is, b = −|b| < 0, it is common to write a|b|i instead of a + (−|b|)i; for example, for b = −4, 3 − 4i can be written instead of 3 + (−4)i.

Since the multiplication of the indeterminate i and a real is commutative in polynomials with real coefficients, the polynomial a + bi may be written as a + ib. This is often expedient for imaginary parts denoted by expressions, for example, when b is a radical.

The real part of a complex number z is denoted by Re(z), , or ; the imaginary part of a complex number z is denoted by Im(z), , or For example,

The set of all complex numbers is denoted by (blackboard bold) or C (upright bold).

In some disciplines, particularly in electromagnetism and electrical engineering, j is used instead of i as i is frequently used to represent electric current. In these cases, complex numbers are written as a + bj, or a + jb.

Visualization

A complex number z, as a point (black) and its position vector (blue)

A complex number z can thus be identified with an ordered pair of real numbers, which in turn may be interpreted as coordinates of a point in a two-dimensional space. The most immediate space is the Euclidean plane with suitable coordinates, which is then called complex plane or Argand diagram, named after Jean-Robert Argand. Another prominent space on which the coordinates may be projected is the two-dimensional surface of a sphere, which is then called Riemann sphere.

Cartesian complex plane

The definition of the complex numbers involving two arbitrary real values immediately suggests the use of Cartesian coordinates in the complex plane. The horizontal (real) axis is generally used to display the real part, with increasing values to the right, and the imaginary part marks the vertical (imaginary) axis, with increasing values upwards.

A charted number may be viewed either as the coordinatized point or as a position vector from the origin to this point. The coordinate values of a complex number z can hence be expressed in its Cartesian, rectangular, or algebraic form.

Notably, the operations of addition and multiplication take on a very natural geometric character, when complex numbers are viewed as position vectors: addition corresponds to vector addition, while multiplication (see below) corresponds to multiplying their magnitudes and adding the angles they make with the real axis. Viewed in this way, the multiplication of a complex number by i corresponds to rotating the position vector counterclockwise by a quarter turn (90°) about the origin—a fact which can be expressed algebraically as

Polar complex plane

Argument φ and modulus r locate a point in the complex plane.

Modulus and argument

An alternative option for coordinates in the complex plane is the polar coordinate system that uses the distance of the point z from the origin (O), and the angle subtended between the positive real axis and the line segment Oz in a counterclockwise sense. This leads to the polar form

of a complex number, where r is the absolute value of z, and is the argument of z.

The absolute value (or modulus or magnitude) of a complex number z = x + yi is

If z is a real number (that is, if y = 0), then r = |x|. That is, the absolute value of a real number equals its absolute value as a complex number.

By Pythagoras' theorem, the absolute value of a complex number is the distance to the origin of the point representing the complex number in the complex plane.

The argument of z (in many applications referred to as the "phase" φ) is the angle of the radius Oz with the positive real axis, and is written as arg z. As with the modulus, the argument can be found from the rectangular form x + yi—by applying the inverse tangent to the quotient of imaginary-by-real parts. By using a half-angle identity, a single branch of the arctan suffices to cover the range (−π, π] of the arg-function, and avoids a more subtle case-by-case analysis

Normally, as given above, the principal value in the interval (−π, π] is chosen. If the arg value is negative, values in the range (−π, π] or [0, 2π) can be obtained by adding 2π. The value of φ is expressed in radians in this article. It can increase by any integer multiple of 2π and still give the same angle, viewed as subtended by the rays of the positive real axis and from the origin through z. Hence, the arg function is sometimes considered as multivalued. The polar angle for the complex number 0 is indeterminate, but arbitrary choice of the polar angle 0 is common.

The value of φ equals the result of atan2:

Together, r and φ give another way of representing complex numbers, the polar form, as the combination of modulus and argument fully specify the position of a point on the plane. Recovering the original rectangular co-ordinates from the polar form is done by the formula called trigonometric form

Using Euler's formula this can be written as

Using the cis function, this is sometimes abbreviated to

In angle notation, often used in electronics to represent a phasor with amplitude r and phase φ, it is written as

Complex graphs

A color wheel graph of the expression (z2 − 1)(z − 2 − i)2/z2 + 2 + 2i

When visualizing complex functions, both a complex input and output are needed. Because each complex number is represented in two dimensions, visually graphing a complex function would require the perception of a four dimensional space, which is possible only in projections. Because of this, other ways of visualizing complex functions have been designed.

In domain coloring the output dimensions are represented by color and brightness, respectively. Each point in the complex plane as domain is ornated, typically with color representing the argument of the complex number, and brightness representing the magnitude. Dark spots mark moduli near zero, brighter spots are farther away from the origin, the gradation may be discontinuous, but is assumed as monotonous. The colors often vary in steps of π/3 for 0 to 2π from red, yellow, green, cyan, blue, to magenta. These plots are called color wheel graphs. This provides a simple way to visualize the functions without losing information. The picture shows zeros for ±1, (2 + i) and poles at

History

The solution in radicals (without trigonometric functions) of a general cubic equation, when all three of its roots are real numbers, contains the square roots of negative numbers, a situation that cannot be rectified by factoring aided by the rational root test, if the cubic is irreducible; this is the so-called casus irreducibilis ("irreducible case"). This conundrum led Italian mathematician Gerolamo Cardano to conceive of complex numbers in around 1545 in his Ars Magna, though his understanding was rudimentary; moreover he later described complex numbers as "as subtle as they are useless". Cardano did use imaginary numbers, but described using them as "mental torture." This was prior to the use of the graphical complex plane. Cardano and other Italian mathematicians, notably Scipione del Ferro, in the 1500s created an algorithm for solving cubic equations which generally had one real solution and two solutions containing an imaginary number. Since they ignored the answers with the imaginary numbers, Cardano found them useless.

Work on the problem of general polynomials ultimately led to the fundamental theorem of algebra, which shows that with complex numbers, a solution exists to every polynomial equation of degree one or higher. Complex numbers thus form an algebraically closed field, where any polynomial equation has a root.

Many mathematicians contributed to the development of complex numbers. The rules for addition, subtraction, multiplication, and root extraction of complex numbers were developed by the Italian mathematician Rafael Bombelli. A more abstract formalism for the complex numbers was further developed by the Irish mathematician William Rowan Hamilton, who extended this abstraction to the theory of quaternions.

The earliest fleeting reference to square roots of negative numbers can perhaps be said to occur in the work of the Greek mathematician Hero of Alexandria in the 1st century AD, where in his Stereometrica he considered, apparently in error, the volume of an impossible frustum of a pyramid to arrive at the term in his calculations, which today would simplify to . Negative quantities were not conceived of in Hellenistic mathematics and Hero merely replaced it by its positive

The impetus to study complex numbers as a topic in itself first arose in the 16th century when algebraic solutions for the roots of cubic and quartic polynomials were discovered by Italian mathematicians (Niccolò Fontana Tartaglia and Gerolamo Cardano). It was soon realized (but proved much later) that these formulas, even if one were interested only in real solutions, sometimes required the manipulation of square roots of negative numbers. In fact, it was proved later that the use of complex numbers is unavoidable when all three roots are real and distinct. However, the general formula can still be used in this case, with some care to deal with the ambiguity resulting from the existence of three cubic roots for nonzero complex numbers. Rafael Bombelli was the first to address explicitly these seemingly paradoxical solutions of cubic equations and developed the rules for complex arithmetic, trying to resolve these issues.

The term "imaginary" for these quantities was coined by René Descartes in 1637, who was at pains to stress their unreal nature:

... sometimes only imaginary, that is one can imagine as many as I said in each equation, but sometimes there exists no quantity that matches that which we imagine.
[... quelquefois seulement imaginaires c'est-à-dire que l'on peut toujours en imaginer autant que j'ai dit en chaque équation, mais qu'il n'y a quelquefois aucune quantité qui corresponde à celle qu'on imagine.]

A further source of confusion was that the equation seemed to be capriciously inconsistent with the algebraic identity , which is valid for non-negative real numbers a and b, and which was also used in complex number calculations with one of a, b positive and the other negative. The incorrect use of this identity in the case when both a and b are negative, and the related identity , even bedeviled Leonhard Euler. This difficulty eventually led to the convention of using the special symbol i in place of to guard against this mistake. Even so, Euler considered it natural to introduce students to complex numbers much earlier than we do today. In his elementary algebra text book, Elements of Algebra, he introduces these numbers almost at once and then uses them in a natural way throughout.

In the 18th century complex numbers gained wider use, as it was noticed that formal manipulation of complex expressions could be used to simplify calculations involving trigonometric functions. For instance, in 1730 Abraham de Moivre noted that the identities relating trigonometric functions of an integer multiple of an angle to powers of trigonometric functions of that angle could be re-expressed by the following de Moivre's formula:

In 1748, Euler went further and obtained Euler's formula of complex analysis:

by formally manipulating complex power series and observed that this formula could be used to reduce any trigonometric identity to much simpler exponential identities.

The idea of a complex number as a point in the complex plane (above) was first described by DanishNorwegian mathematician Caspar Wessel in 1799, although it had been anticipated as early as 1685 in Wallis's A Treatise of Algebra.

Wessel's memoir appeared in the Proceedings of the Copenhagen Academy but went largely unnoticed. In 1806 Jean-Robert Argand independently issued a pamphlet on complex numbers and provided a rigorous proof of the fundamental theorem of algebra. Carl Friedrich Gauss had earlier published an essentially topological proof of the theorem in 1797 but expressed his doubts at the time about "the true metaphysics of the square root of −1". It was not until 1831 that he overcame these doubts and published his treatise on complex numbers as points in the plane, largely establishing modern notation and terminology:

If one formerly contemplated this subject from a false point of view and therefore found a mysterious darkness, this is in large part attributable to clumsy terminology. Had one not called +1, −1, positive, negative, or imaginary (or even impossible) units, but instead, say, direct, inverse, or lateral units, then there could scarcely have been talk of such darkness.

In the beginning of the 19th century, other mathematicians discovered independently the geometrical representation of the complex numbers: Buée, Mourey, Warren, Français and his brother, Bellavitis.

The English mathematician G.H. Hardy remarked that Gauss was the first mathematician to use complex numbers in "a really confident and scientific way" although mathematicians such as Norwegian Niels Henrik Abel and Carl Gustav Jacob Jacobi were necessarily using them routinely before Gauss published his 1831 treatise.

Augustin-Louis Cauchy and Bernhard Riemann together brought the fundamental ideas of complex analysis to a high state of completion, commencing around 1825 in Cauchy's case.

The common terms used in the theory are chiefly due to the founders. Argand called cos φ + i sin φ the direction factor, and the modulus; Cauchy (1821) called cos φ + i sin φ the reduced form (l'expression réduite) and apparently introduced the term argument; Gauss used i for , introduced the term complex number for a + bi, and called a2 + b2 the norm. The expression direction coefficient, often used for cos φ + i sin φ, is due to Hankel (1867), and absolute value, for modulus, is due to Weierstrass.

Later classical writers on the general theory include Richard Dedekind, Otto Hölder, Felix Klein, Henri Poincaré, Hermann Schwarz, Karl Weierstrass and many others. Important work (including a systematization) in complex multivariate calculus has been started at beginning of the 20th century. Important results have been achieved by Wilhelm Wirtinger in 1927.

Relations and operations

Equality

Complex numbers have a similar definition of equality to real numbers; two complex numbers a1 + b1i and a2 + b2i are equal if and only if both their real and imaginary parts are equal, that is, if a1 = a2 and b1 = b2. Nonzero complex numbers written in polar form are equal if and only if they have the same magnitude and their arguments differ by an integer multiple of 2π.

Ordering

Unlike the real numbers, there is no natural ordering of the complex numbers. In particular, there is no linear ordering on the complex numbers that is compatible with addition and multiplication. Hence, the complex numbers do not have the structure of an ordered field. One explanation for this is that every non-trivial sum of squares in an ordered field is nonzero, and i2 + 12 = 0 is a non-trivial sum of squares. Thus, complex numbers are naturally thought of as existing on a two-dimensional plane.

Conjugate

Geometric representation of z and its conjugate z in the complex plane

The complex conjugate of the complex number z = x + yi is given by xyi. It is denoted by either z or z*.[46] This unary operation on complex numbers cannot be expressed by applying only their basic operations addition, subtraction, multiplication and division.

Geometrically, z is the "reflection" of z about the real axis. Conjugating twice gives the original complex number

which makes this operation an involution. The reflection leaves both the real part and the magnitude of z unchanged, that is

and

The imaginary part and the argument of a complex number z change their sign under conjugation

For details on argument and magnitude, see the section on Polar form.

The product of a complex number z = x + yi and its conjugate is known as the absolute square. It is always a non-negative real number and equals the square of the magnitude of each:

This property can be used to convert a fraction with a complex denominator to an equivalent fraction with a real denominator by expanding both numerator and denominator of the fraction by the conjugate of the given denominator. This process is sometimes called "rationalization" of the denominator (although the denominator in the final expression might be an irrational real number), because it resembles the method to remove roots from simple expressions in a denominator.

The real and imaginary parts of a complex number z can be extracted using the conjugation:

Moreover, a complex number is real if and only if it equals its own conjugate.

Conjugation distributes over the basic complex arithmetic operations:

Conjugation is also employed in inversive geometry, a branch of geometry studying reflections more general than ones about a line. In the network analysis of electrical circuits, the complex conjugate is used in finding the equivalent impedance when the maximum power transfer theorem is looked for.

Addition and subtraction

Addition of two complex numbers can be done geometrically by constructing a parallelogram.

Two complex numbers and are most easily added by separately adding their real and imaginary parts. That is to say:

Similarly, subtraction can be performed as

Multiplication of a complex number and a real number r can be done similarly by multiplying separately r and the real and imaginary parts of a:

In particular, subtraction can be done by negating the subtrahend (that is multiplying it with −1) and adding the result to the minuend:

Using the visualization of complex numbers in the complex plane, addition has the following geometric interpretation: the sum of two complex numbers a and b, interpreted as points in the complex plane, is the point obtained by building a parallelogram from the three vertices O, and the points of the arrows labeled a and b (provided that they are not on a line). Equivalently, calling these points A, B, respectively and the fourth point of the parallelogram X the triangles OAB and XBA are congruent.

Multiplication and square

The rules of the distributive property, the commutative properties (of addition and multiplication), and the defining property i2 = −1 apply to complex numbers. It follows that

In particular,

Reciprocal and division

Using the conjugate, the reciprocal of a nonzero complex number can be broken into real and imaginary components

This can be used to express a division of an arbitrary complex number by a non-zero complex number as

Multiplication and division in polar form

Multiplication of 2 + i (blue triangle) and 3 + i (red triangle). The red triangle is rotated to match the vertex of the blue one (the adding of both angles in the terms φ1+φ2 in the equation) and stretched by the length of the hypotenuse of the blue triangle (the multiplication of both radiuses, as per term r1r2 in the equation).

Formulas for multiplication, division and exponentiation are simpler in polar form than the corresponding formulas in Cartesian coordinates. Given two complex numbers z1 = r1(cos φ1 + i sin φ1) and z2 = r2(cos φ2 + i sin φ2), because of the trigonometric identities

we may derive

In other words, the absolute values are multiplied and the arguments are added to yield the polar form of the product. For example, multiplying by i corresponds to a quarter-turn counter-clockwise, which gives back i2 = −1. The picture at the right illustrates the multiplication of
Since the real and imaginary part of 5 + 5i are equal, the argument of that number is 45 degrees, or π/4 (in radian). On the other hand, it is also the sum of the angles at the origin of the red and blue triangles are arctan(1/3) and arctan(1/2), respectively. Thus, the formula
holds. As the arctan function can be approximated highly efficiently, formulas like this – known as Machin-like formulas – are used for high-precision approximations of π.

Similarly, division is given by

Square root

The square roots of a + bi (with b ≠ 0) are , where

and

where sgn is the signum function. This can be seen by squaring to obtain a + bi. Here is called the modulus of a + bi, and the square root sign indicates the square root with non-negative real part, called the principal square root; also where z = a + bi.

Exponential function

The exponential function can be defined for every complex number z by the power series

which has an infinite radius of convergence.

The value at 1 of the exponential function is Euler's number

If z is real, one has Analytic continuation allows extending this equality for every complex value of z, and thus to define the complex exponentiation with base e as

Functional equation

The exponential function satisfies the functional equation This can be proved either by comparing the power series expansion of both members or by applying analytic continuation from the restriction of the equation to real arguments.

Euler's formula

Euler's formula states that, for any real number y,

The functional equation implies thus that, if x and y are real, one has

which is the decomposition of the exponential function into its real and imaginary parts.

Complex logarithm

In the real case, the natural logarithm can be defined as the inverse of the exponential function. For extending this to the complex domain, one can start from Euler's formula. It implies that, if a complex number is written in polar form

with then with
as complex logarithm one has a proper inverse:

However, because cosine and sine are periodic functions, the addition of an integer multiple of 2π to φ does not change z. For example, e = e3 = −1 , so both and 3 are possible values for the natural logarithm of −1.

Therefore, if the complex logarithm is not to be defined as a multivalued function

one has to use a branch cut and to restrict the codomain, resulting in the bijective function

If is not a non-positive real number (a positive or a non-real number), the resulting principal value of the complex logarithm is obtained with π < φ < π. It is an analytic function outside the negative real numbers, but it cannot be prolongated to a function that is continuous at any negative real number , where the principal value is ln z = ln(−z) + .

Exponentiation

If x > 0 is real and z complex, the exponentiation is defined as

where ln denotes the natural logarithm.

It seems natural to extend this formula to complex values of x, but there are some difficulties resulting from the fact that the complex logarithm is not really a function, but a multivalued function.

It follows that if z is as above, and if t is another complex number, then the exponentiation is the multivalued function

Integer and fractional exponents

Geometric representation of the 2nd to 6th roots of a complex number z, in polar form re where r = |z | and φ = arg z. If z is real, φ = 0 or π. Principal roots are shown in black.

If, in the preceding formula, t is an integer, then the sine and the cosine are independent of k. Thus, if the exponent n is an integer, then zn is well defined, and the exponentiation formula simplifies to de Moivre's formula:

The n nth roots of a complex number z are given by

for 0 ≤ kn − 1. (Here is the usual (positive) nth root of the positive real number r.) Because sine and cosine are periodic, other integer values of k do not give other values.

While the nth root of a positive real number r is chosen to be the positive real number c satisfying cn = r, there is no natural way of distinguishing one particular complex nth root of a complex number. Therefore, the nth root is a n-valued function of z. This implies that, contrary to the case of positive real numbers, one has

since the left-hand side consists of n values, and the right-hand side is a single value.

Properties

Field structure

The set of complex numbers is a field. Briefly, this means that the following facts hold: first, any two complex numbers can be added and multiplied to yield another complex number. Second, for any complex number z, its additive inverse z is also a complex number; and third, every nonzero complex number has a reciprocal complex number. Moreover, these operations satisfy a number of laws, for example the law of commutativity of addition and multiplication for any two complex numbers z1 and z2:

These two laws and the other requirements on a field can be proven by the formulas given above, using the fact that the real numbers themselves form a field.

Unlike the reals, is not an ordered field, that is to say, it is not possible to define a relation z1 < z2 that is compatible with the addition and multiplication. In fact, in any ordered field, the square of any element is necessarily positive, so i2 = −1 precludes the existence of an ordering on

When the underlying field for a mathematical topic or construct is the field of complex numbers, the topic's name is usually modified to reflect that fact. For example: complex analysis, complex matrix, complex polynomial, and complex Lie algebra.

Solutions of polynomial equations

Given any complex numbers (called coefficients) a0, ..., an, the equation

has at least one complex solution z, provided that at least one of the higher coefficients a1, ..., an is nonzero. This is the statement of the fundamental theorem of algebra, of Carl Friedrich Gauss and Jean le Rond d'Alembert. Because of this fact, is called an algebraically closed field. This property does not hold for the field of rational numbers (the polynomial x2 − 2 does not have a rational root, since √2 is not a rational number) nor the real numbers (the polynomial x2 + a does not have a real root for a > 0, since the square of x is positive for any real number x).

There are various proofs of this theorem, by either analytic methods such as Liouville's theorem, or topological ones such as the winding number, or a proof combining Galois theory and the fact that any real polynomial of odd degree has at least one real root.

Because of this fact, theorems that hold for any algebraically closed field apply to For example, any non-empty complex square matrix has at least one (complex) eigenvalue.

Algebraic characterization

The field has the following three properties:

It can be shown that any field having these properties is isomorphic (as a field) to For example, the algebraic closure of the field of the p-adic number also satisfies these three properties, so these two fields are isomorphic (as fields, but not as topological fields). Also, is isomorphic to the field of complex Puiseux series. However, specifying an isomorphism requires the axiom of choice. Another consequence of this algebraic characterization is that contains many proper subfields that are isomorphic to .

Characterization as a topological field

The preceding characterization of describes only the algebraic aspects of That is to say, the properties of nearness and continuity, which matter in areas such as analysis and topology, are not dealt with. The following description of as a topological field (that is, a field that is equipped with a topology, which allows the notion of convergence) does take into account the topological properties. contains a subset P (namely the set of positive real numbers) of nonzero elements satisfying the following three conditions:

  • P is closed under addition, multiplication and taking inverses.
  • If x and y are distinct elements of P, then either xy or yx is in P.
  • If S is any nonempty subset of P, then S + P = x + P for some x in

Moreover, has a nontrivial involutive automorphism xx* (namely the complex conjugation), such that x x* is in P for any nonzero x in

Any field F with these properties can be endowed with a topology by taking the sets B(x, p) = { y | p − (yx)(yx)* ∈ P }  as a base, where x ranges over the field and p ranges over P. With this topology F is isomorphic as a topological field to

The only connected locally compact topological fields are and This gives another characterization of as a topological field, since can be distinguished from because the nonzero complex numbers are connected, while the nonzero real numbers are not.

Formal construction

Construction as ordered pairs

William Rowan Hamilton introduced the approach to define the set of complex numbers as the set of ordered pairs (a, b) of real numbers, in which the following rules for addition and multiplication are imposed:

It is then just a matter of notation to express (a, b) as a + bi.

Construction as a quotient field

Though this low-level construction does accurately describe the structure of the complex numbers, the following equivalent definition reveals the algebraic nature of more immediately. This characterization relies on the notion of fields and polynomials. A field is a set endowed with addition, subtraction, multiplication and division operations that behave as is familiar from, say, rational numbers. For example, the distributive law

must hold for any three elements x, y and z of a field. The set of real numbers does form a field. A polynomial p(X) with real coefficients is an expression of the form
where the a0, ..., an are real numbers. The usual addition and multiplication of polynomials endows the set of all such polynomials with a ring structure. This ring is called the polynomial ring over the real numbers.

The set of complex numbers is defined as the quotient ring This extension field contains two square roots of −1, namely (the cosets of) X and X, respectively. (The cosets of) 1 and X form a basis of as a real vector space, which means that each element of the extension field can be uniquely written as a linear combination in these two elements. Equivalently, elements of the extension field can be written as ordered pairs (a, b) of real numbers. The quotient ring is a field, because X2 + 1 is irreducible over so the ideal it generates is maximal.

The formulas for addition and multiplication in the ring modulo the relation X2 = −1, correspond to the formulas for addition and multiplication of complex numbers defined as ordered pairs. So the two definitions of the field are isomorphic (as fields).

Accepting that is algebraically closed, since it is an algebraic extension of in this approach, is therefore the algebraic closure of

Matrix representation of complex numbers

Complex numbers a + bi can also be represented by 2 × 2 matrices that have the form:

Here the entries a and b are real numbers. As the sum and product of two such matrices is again of this form, these matrices form a subring of the ring 2 × 2 matrices.

A simple computation shows that the map:

is a ring isomorphism from the field of complex numbers to the ring of these matrices. This isomorphism associates the square of the absolute value of a complex number with the determinant of the corresponding matrix, and the conjugate of a complex number with the transpose of the matrix.

The geometric description of the multiplication of complex numbers can also be expressed in terms of rotation matrices by using this correspondence between complex numbers and such matrices. The action of the matrix on a vector (x, y) corresponds to the multiplication of x + iy by a + ib. In particular, if the determinant is 1, there is a real number t such that the matrix has the form:

In this case, the action of the matrix on vectors and the multiplication by the complex number are both the rotation of the angle t.

Complex analysis

Color wheel graph of sin(1/z). White parts inside refer to numbers having large absolute values.

The study of functions of a complex variable is known as complex analysis and has enormous practical use in applied mathematics as well as in other branches of mathematics. Often, the most natural proofs for statements in real analysis or even number theory employ techniques from complex analysis (see prime number theorem for an example). Unlike real functions, which are commonly represented as two-dimensional graphs, complex functions have four-dimensional graphs and may usefully be illustrated by color-coding a three-dimensional graph to suggest four dimensions, or by animating the complex function's dynamic transformation of the complex plane.

Complex exponential and related functions

The notions of convergent series and continuous functions in (real) analysis have natural analogs in complex analysis. A sequence of complex numbers is said to converge if and only if its real and imaginary parts do. This is equivalent to the (ε, δ)-definition of limits, where the absolute value of real numbers is replaced by the one of complex numbers. From a more abstract point of view, , endowed with the metric

is a complete metric space, which notably includes the triangle inequality
for any two complex numbers z1 and z2.

Like in real analysis, this notion of convergence is used to construct a number of elementary functions: the exponential function exp z, also written ez, is defined as the infinite series

The series defining the real trigonometric functions sine and cosine, as well as the hyperbolic functions sinh and cosh, also carry over to complex arguments without change. For the other trigonometric and hyperbolic functions, such as tangent, things are slightly more complicated, as the defining series do not converge for all complex values. Therefore, one must define them either in terms of sine, cosine and exponential, or, equivalently, by using the method of analytic continuation.

Euler's formula states:

for any real number φ, in particular
, which is Euler's identity. Unlike in the situation of real numbers, there is an infinitude of complex solutions z of the equation
for any complex number w ≠ 0. It can be shown that any such solution z – called complex logarithm of w – satisfies
where arg is the argument defined above, and ln the (real) natural logarithm. As arg is a multivalued function, unique only up to a multiple of 2π, log is also multivalued. The principal value of log is often taken by restricting the imaginary part to the interval (−π, π].

Complex exponentiation zω is defined as

and is multi-valued, except when ω is an integer. For ω = 1 / n, for some natural number n, this recovers the non-uniqueness of nth roots mentioned above.

Complex numbers, unlike real numbers, do not in general satisfy the unmodified power and logarithm identities, particularly when naïvely treated as single-valued functions; see failure of power and logarithm identities. For example, they do not satisfy

Both sides of the equation are multivalued by the definition of complex exponentiation given here, and the values on the left are a subset of those on the right.

Holomorphic functions

A function f: is called holomorphic if it satisfies the Cauchy–Riemann equations. For example, any -linear map can be written in the form

with complex coefficients a and b. This map is holomorphic if and only if b = 0. The second summand is real-differentiable, but does not satisfy the Cauchy–Riemann equations.

Complex analysis shows some features not apparent in real analysis. For example, any two holomorphic functions f and g that agree on an arbitrarily small open subset of necessarily agree everywhere. Meromorphic functions, functions that can locally be written as f(z)/(zz0)n with a holomorphic function f, still share some of the features of holomorphic functions. Other functions have essential singularities, such as sin(1/z) at z = 0.

Applications

Complex numbers have applications in many scientific areas, including signal processing, control theory, electromagnetism, fluid dynamics, quantum mechanics, cartography, and vibration analysis. Some of these applications are described below.

Geometry

Shapes

Three non-collinear points in the plane determine the shape of the triangle . Locating the points in the complex plane, this shape of a triangle may be expressed by complex arithmetic as

The shape of a triangle will remain the same, when the complex plane is transformed by translation or dilation (by an affine transformation), corresponding to the intuitive notion of shape, and describing similarity. Thus each triangle is in a similarity class of triangles with the same shape.

Fractal geometry

The Mandelbrot set with the real and imaginary axes labeled.

The Mandelbrot set is a popular example of a fractal formed on the complex plane. It is defined by plotting every location where iterating the sequence does not diverge when iterated infinitely. Similarly, Julia sets have the same rules, except where remains constant.

Triangles

Every triangle has a unique Steiner inellipse – an ellipse inside the triangle and tangent to the midpoints of the three sides of the triangle. The foci of a triangle's Steiner inellipse can be found as follows, according to Marden's theorem: Denote the triangle's vertices in the complex plane as a = xA + yAi, b = xB + yBi, and c = xC + yCi. Write the cubic equation , take its derivative, and equate the (quadratic) derivative to zero. Marden's theorem says that the solutions of this equation are the complex numbers denoting the locations of the two foci of the Steiner inellipse.

Algebraic number theory

Construction of a regular pentagon using straightedge and compass.

As mentioned above, any nonconstant polynomial equation (in complex coefficients) has a solution in . A fortiori, the same is true if the equation has rational coefficients. The roots of such equations are called algebraic numbers – they are a principal object of study in algebraic number theory. Compared to , the algebraic closure of , which also contains all algebraic numbers, has the advantage of being easily understandable in geometric terms. In this way, algebraic methods can be used to study geometric questions and vice versa. With algebraic methods, more specifically applying the machinery of field theory to the number field containing roots of unity, it can be shown that it is not possible to construct a regular nonagon using only compass and straightedge – a purely geometric problem.

Another example is the Gaussian integers; that is, numbers of the form x + iy, where x and y are integers, which can be used to classify sums of squares.

Analytic number theory

Analytic number theory studies numbers, often integers or rationals, by taking advantage of the fact that they can be regarded as complex numbers, in which analytic methods can be used. This is done by encoding number-theoretic information in complex-valued functions. For example, the Riemann zeta function ζ(s) is related to the distribution of prime numbers.

Improper integrals

In applied fields, complex numbers are often used to compute certain real-valued improper integrals, by means of complex-valued functions. Several methods exist to do this; see methods of contour integration.

Dynamic equations

In differential equations, it is common to first find all complex roots r of the characteristic equation of a linear differential equation or equation system and then attempt to solve the system in terms of base functions of the form f(t) = ert. Likewise, in difference equations, the complex roots r of the characteristic equation of the difference equation system are used, to attempt to solve the system in terms of base functions of the form f(t) = rt.

Linear algebra

Eigendecomposition is a useful tool for computing matrix powers and matrix exponentials. However, it often requires the use of complex numbers, even if the matrix is real (for example, a rotation matrix).

Complex numbers often generalize concepts originally conceived in the real numbers. For example, the conjugate transpose generalizes the transpose, hermitian matrices generalize symmetric matrices, and unitary matrices generalize orthogonal matrices.

In applied mathematics

Control theory

In control theory, systems are often transformed from the time domain to the complex frequency domain using the Laplace transform. The system's zeros and poles are then analyzed in the complex plane. The root locus, Nyquist plot, and Nichols plot techniques all make use of the complex plane.

In the root locus method, it is important whether zeros and poles are in the left or right half planes, that is, have real part greater than or less than zero. If a linear, time-invariant (LTI) system has poles that are

If a system has zeros in the right half plane, it is a nonminimum phase system.

Signal analysis

Complex numbers are used in signal analysis and other fields for a convenient description for periodically varying signals. For given real functions representing actual physical quantities, often in terms of sines and cosines, corresponding complex functions are considered of which the real parts are the original quantities. For a sine wave of a given frequency, the absolute value |z| of the corresponding z is the amplitude and the argument arg z is the phase.

If Fourier analysis is employed to write a given real-valued signal as a sum of periodic functions, these periodic functions are often written as complex-valued functions of the form

and

where ω represents the angular frequency and the complex number A encodes the phase and amplitude as explained above.

This use is also extended into digital signal processing and digital image processing, which use digital versions of Fourier analysis (and wavelet analysis) to transmit, compress, restore, and otherwise process digital audio signals, still images, and video signals.

Another example, relevant to the two side bands of amplitude modulation of AM radio, is:

In physics

Electromagnetism and electrical engineering

In electrical engineering, the Fourier transform is used to analyze varying voltages and currents. The treatment of resistors, capacitors, and inductors can then be unified by introducing imaginary, frequency-dependent resistances for the latter two and combining all three in a single complex number called the impedance. This approach is called phasor calculus.

In electrical engineering, the imaginary unit is denoted by j, to avoid confusion with I, which is generally in use to denote electric current, or, more particularly, i, which is generally in use to denote instantaneous electric current.

Since the voltage in an AC circuit is oscillating, it can be represented as

To obtain the measurable quantity, the real part is taken:

The complex-valued signal V(t) is called the analytic representation of the real-valued, measurable signal v(t)

Fluid dynamics

In fluid dynamics, complex functions are used to describe potential flow in two dimensions.

Quantum mechanics

The complex number field is intrinsic to the mathematical formulations of quantum mechanics, where complex Hilbert spaces provide the context for one such formulation that is convenient and perhaps most standard. The original foundation formulas of quantum mechanics – the Schrödinger equation and Heisenberg's matrix mechanics – make use of complex numbers.

Relativity

In special and general relativity, some formulas for the metric on spacetime become simpler if one takes the time component of the spacetime continuum to be imaginary. (This approach is no longer standard in classical relativity, but is used in an essential way in quantum field theory.) Complex numbers are essential to spinors, which are a generalization of the tensors used in relativity.

Generalizations and related notions

Cayley Q8 quaternion graph showing cycles of multiplication by i, j and k

The process of extending the field of reals to is known as the Cayley–Dickson construction. It can be carried further to higher dimensions, yielding the quaternions and octonions which (as a real vector space) are of dimension 4 and 8, respectively. In this context the complex numbers have been called the binarions.

Just as by applying the construction to reals the property of ordering is lost, properties familiar from real and complex numbers vanish with each extension. The quaternions lose commutativity, that is, x·yy·x for some quaternions x, y, and the multiplication of octonions, additionally to not being commutative, fails to be associative: (x·yzx·(y·z) for some octonions x, y, z.

Reals, complex numbers, quaternions and octonions are all normed division algebras over . By Hurwitz's theorem they are the only ones; the sedenions, the next step in the Cayley–Dickson construction, fail to have this structure.

The Cayley–Dickson construction is closely related to the regular representation of thought of as an -algebra (an -vector space with a multiplication), with respect to the basis (1, i). This means the following: the -linear map

for some fixed complex number w can be represented by a 2 × 2 matrix (once a basis has been chosen). With respect to the basis (1, i), this matrix is
that is, the one mentioned in the section on matrix representation of complex numbers above. While this is a linear representation of in the 2 × 2 real matrices, it is not the only one. Any matrix
has the property that its square is the negative of the identity matrix: J2 = −I. Then
is also isomorphic to the field and gives an alternative complex structure on This is generalized by the notion of a linear complex structure.

Hypercomplex numbers also generalize and For example, this notion contains the split-complex numbers, which are elements of the ring (as opposed to for complex numbers). In this ring, the equation a2 = 1 has four solutions.

The field is the completion of the field of rational numbers, with respect to the usual absolute value metric. Other choices of metrics on lead to the fields of p-adic numbers (for any prime number p), which are thereby analogous to . There are no other nontrivial ways of completing than and by Ostrowski's theorem. The algebraic closures of still carry a norm, but (unlike ) are not complete with respect to it. The completion of turns out to be algebraically closed. By analogy, the field is called p-adic complex numbers.

The fields and their finite field extensions, including are called local fields.

Authorship of the Bible

From Wikipedia, the free encyclopedia ...