i in the complex or cartesian plane. Real numbers lie on the horizontal axis, and imaginary numbers lie on the vertical axis
The imaginary unit or unit imaginary number (i) is a solution to the quadratic equationx2 + 1 = 0. Although there is no real number with this property, i can be used to extend the real numbers to what are called complex numbers, using addition and multiplication. A simple example of the use of i in a complex number is 2 + 3i.
Imaginary numbers are an important mathematical concept, which extend the real number system ℝ to the complex number system ℂ, which in turn provides at least one root for every nonconstant polynomialP(x). The term "imaginary" is used because there is no real number having a negative square.
There are two complex square roots of −1, namely i and −i, just as there are two complex square roots of every real number other than zero, which has one double square root.
In contexts where i is ambiguous or problematic, j or the Greek ι is sometimes used. In the disciplines of electrical engineering and control systems engineering, the imaginary unit is normally denoted by j instead of i, because i is commonly used to denote electric current.
Definition
The powers of i
return cyclic values:
... (repeats the pattern
from blue area)
i−3 = i
i−2 = −1
i−1 = −i
i0 = 1
i1 = i
i2 = −1
i3 = −i
i4 = 1
i5 = i
i6 = −1
... (repeats the pattern
from the blue area)
The imaginary number i is defined solely by the property that its square is −1:
With i defined this way, it follows directly from algebra that i and −i are both square roots of −1.
Although the construction is called "imaginary", and although the
concept of an imaginary number may be intuitively more difficult to
grasp than that of a real number, the construction is perfectly valid
from a mathematical standpoint. Real number operations can be extended
to imaginary and complex numbers by treating i as an unknown quantity while manipulating an expression, and then using the definition to replace any occurrence of i2 with −1. Higher integral powers of i can also be replaced with −i, 1, i, or −1:
Similarly, as with any non-zero real number:
As a complex number, i is represented in rectangular form as 0 + i, having a unit imaginary component and no real component (i.e., the real component is zero). In polar form, i is represented as 1eiπ/2 (or just eiπ/2), having an absolute value (or magnitude) of 1 and an argument (or angle) of π/2. In the complex plane (also known as the Cartesian plane), i is the point located one unit from the origin along the imaginary axis (which is at a right angle to the real axis).
i and −i
Being a quadratic polynomial with no multiple root, the defining equation x2 = −1 has two distinct solutions, which are equally valid and which happen to be additive and multiplicative inverses of each other. More precisely, once a solution i of the equation has been fixed, the value −i, which is distinct from i, is also a solution. Since the equation is the only definition of i, it appears that the definition is ambiguous (more precisely, not well-defined). However, no ambiguity results as long as one or other of the solutions is chosen and labelled as "i", with the other one then being labelled as −i.
This is because, although −i and i are not quantitatively equivalent (they are negatives of each other), there is no algebraic difference between i and −i.
Both imaginary numbers have equal claim to being the number whose
square is −1. If all mathematical textbooks and published literature
referring to imaginary or complex numbers were rewritten with −i replacing every occurrence of +i (and therefore every occurrence of −i replaced by −(−i) = +i), all facts and theorems would continue to be equivalently valid. The distinction between the two roots x of x2 + 1 = 0
with one of them labelled with a minus sign is purely a notational
relic; neither root can be said to be more primary or fundamental than
the other, and neither of them is "positive" or "negative".
The issue can be a subtle one. The most precise explanation is to say that although the complex field, defined as ℝ[x]/(x2 + 1) (see complex number), is uniqueup toisomorphism, it is not unique up to a unique isomorphism — there are exactly two field automorphisms of ℝ[x]/(x2 + 1) which keep each real number fixed: the identity and the automorphism sending x to −x.
Matrices
A similar issue arises if the complex numbers are interpreted as 2 × 2 real matrices, because then both
and
are solutions to the matrix equation
In this case, the ambiguity results from the geometric choice of which "direction" around the unit circle is "positive" rotation. A more precise explanation is to say that the automorphism group of the special orthogonal groupSO(2, ℝ)
has exactly two elements—the identity and the automorphism which
exchanges "CW" (clockwise) and "CCW" (counter-clockwise) rotations.
All these ambiguities can be solved by adopting a more rigorous definition of complex number, and explicitly choosing
one of the solutions to the equation to be the imaginary unit. For
example, the ordered pair (0, 1), in the usual construction of the
complex numbers with two-dimensional vectors.
When the set of 2 × 2 real matricesM (2, ℝ) is used for a source, and the number one (1) is identified with the identity matrix, and minus one (−1) with the negative of the identity matrix, then there are many solutions to X2 = −1. In fact, there are many solutions to X2 = +1 and X2 = 0 also. Any such X can be taken as a basis vector, along with 1, to form a planar subalgebra
Proper use
The imaginary unit is sometimes written √−1
in advanced mathematics contexts (as well as in less advanced popular
texts). However, great care needs to be taken when manipulating formulas
involving radicals. The radical sign notation is reserved either for the principal square root function, which is only defined for real x ≥ 0,
or for the principal branch of the complex square root function.
Attempting to apply the calculation rules of the principal (real) square
root function to manipulate the principal branch of the complex square
root function can produce false results:
(incorrect).
Similarly:
(incorrect).
The calculation rules
and
are only valid for real, non-negative values of a and b.
These problems are avoided by writing and manipulating expressions like i√7, rather than √−7.
Properties
Square roots
The two square roots of i in the complex plane
The three cube roots of i in the complex plane
i has
two square roots, just like all complex numbers (except zero, which has a
double root). These two roots can be expressed as the complex numbers:
(This is equivalent to a 90° counter-clockwise rotation of a vector about the origin in the complex plane.)
Dividing by i is equivalent to multiplying by the reciprocal of i:
Using this identity to generalize division by i to all complex numbers gives:
(This is equivalent to a 90° clockwise rotation of a vector about the origin in the complex plane.)
Powers
The powers of i repeat in a cycle expressible with the following pattern, where n is any integer:
Many mathematical operations that can be carried out with real numbers can also be carried out with i, such as exponentiation, roots, logarithms, and trigonometric functions. All of the following functions are complexmulti-valued functions, and it should be clearly stated which branch of the Riemann surface the function is defined on in practice. Listed below are results for the most commonly chosen branch.
In electrical engineering and related fields, the imaginary unit is normally denoted by j to avoid confusion with electric current as a function of time, traditionally denoted by i(t) or just i. The Python programming language also uses j to mark the imaginary part of a complex number. MATLAB associates both i and j with the imaginary unit, although 1i or 1j is preferable, for speed and improved robustness.
Some texts use the Greek letter iota (ι) for the imaginary unit, to avoid confusion, especially with index and subscripts.
The conjugate of a Gaussian integer a + bi is the Gaussian integer a – bi.
The norm of a Gaussian integer is its product with its conjugate.
The norm of a Gaussian integer is thus the square of its absolute value as a complex number. The norm of a Gaussian integer is a nonnegative integer, which is a sum of two squares. Thus a norm cannot be of the form 4k + 3, with k integer.
for every pair of Gaussian integers z,w. This can be shown either by a direct check, or by using the multiplicative property of the absolute value of complex numbers.
The units of the ring of Gaussian integers (that is the Gaussian integers whose multiplicative inverse is also a Gaussian integer) are precisely the Gaussian integers with norm 1, that is, 1, –1, i and –i.
Euclidean division
Visualization of maximal distance to some Gaussian integer
A Euclidean division algorithm takes, in the ring of Gaussian integers, a dividend a and divisor b ≠ 0, and produces a quotient q and remainder r such that
In fact, one may make the remainder smaller:
Even with this better inequality,
the quotient and the remainder are not necessarily unique, but one may
refine the choice to ensure uniqueness.
To prove this, one may consider the complex number quotient x + iy = a/b. There are unique integers m and n such that –1/2 < x – m ≤ 1/2 and –1/2 < y – n ≤ 1/2, and thus N(x – m + i(y – n)) ≤ 1/2. Taking q = m + in, one has
with
and
The choice of x – m and y – n in a semi-open interval
is required for uniqueness.
This definition of Euclidean division may be interpreted geometrically
in the complex plane (see the figure), by remarking that the distance
from a complex number ξ to the closest Gaussian integer is at most √2/2.
Principal ideals
Since the ring G of Gaussian integers is a Euclidean domain, G is a principal ideal domain, which means that every ideal of G is principal. Explicitly, an idealI is a subset of a ring R such that every sum of elements of I and every product of an element of I by an element of R belong to I. An ideal is principal, if it consists of all multiples of a single element g, that is, it has the form
In this case, one says that the ideal is generated by g or that g is a generator of the ideal.
Every ideal I in the ring of the Gaussian integers is principal, because, if one chooses in I an nonzero element g of minimal norm, for every element x of I, the remainder of Euclidean division of x by g belongs also to I and has a norm that is smaller than that of g; because of the choice of g, this norm is zero, and thus the remainder is also zero. That is, one has x = qg, where q is the quotient.
For any g, the ideal generated by g is also generated by any associate of g, that is, g, gi, –g, –gi; no other element generates the same ideal. As all the generators of an ideal have the same norm, the norm of an ideal is the norm of any of its generators.
In some circumstances, it
is useful to choose, once for all, a generator for each ideal. There are
two classical ways for doing that, both considering first the ideals of
odd norm. If the g = a + bi has an odd norm a2 + b2, then one of a and b is odd, and the other is even. Thus g has exactly one associate with a real part a that is odd and positive. In its original paper, Gauss did another choice, by choosing the unique associate such that the remainder of its division by 2 + 2i is one. In fact, as N(2 + 2i) = 8,
the norm of the remainder is not greater than 4. As this norm is odd,
and 3 is not the norm of a Gaussian integer, the norm of the remainder
is one, that is, the remainder is a unit. Multiplying g by the inverse of this unit, one finds an associate that has one as a remainder, when divided by 2 + 2i.
If the norm of g is even, then either g = 2kh or g = 2kh(1 + i), where k is a positive integer, and N(h) is odd. Thus, one chooses the associate of g for getting a h which fits the choice of the associates for elements of odd norm.
The prime elements of Z[i] are also known as Gaussian primes.
An associate of a Gaussian prime is also a Gaussian prime. The
conjugate of a Gaussian prime is also a Gaussian prime (this implies
that Gaussian primes are symmetric about the real and imaginary axes).
A positive integer is a Gaussian prime if and only if it is a prime number that is congruent to 3 modulo 4 (that is, it may be written 4n + 3, with n a nonnegative integer) (sequence A002145 in the OEIS). The other prime numbers are not Gaussian primes, but each is the product of two conjugate Gaussian primes.
A Gaussian integer a + bi is a Gaussian prime if and only if either:
one of a, b is zero and absolute value of the other is a prime number of the form 4n + 3 (with n a nonnegative integer), or
both are nonzero and a2 + b2 is a prime number (which will not be of the form 4n + 3).
In other words, a Gaussian integer is a Gaussian prime if and only if
either its norm is a prime number, or it is the product of a unit (±1, ±i) and a prime number of the form 4n + 3.
It follows that there are three cases for the factorization of a prime number p in the Gaussian integers:
If p is congruent to 3 modulo 4, then it is a Gaussian prime; in the language of algebraic number theory, p is said to be inert in the Gaussian integers.
If p is congruent to 1 modulo 4,
then it is the product of a Gaussian prime by its conjugate, both of
which are non-associated Gaussian primes (neither is the product of the
other by a unit); p is said to be a decomposed prime in the Gaussian integers. For example, 5 = (2 + i)(2 − i) and 13 = (3 + 2i)(3 − 2i).
If p = 2, we have 2 = (1 + i)(1 − i) = i(1 − i)2; that is, 2 is the product of the square of a Gaussian prime by a unit; it is the unique ramified prime in the Gaussian integers.
Unique factorization
As for every unique factorization domain, every Gaussian integer may be factored as a product of a unit
and Gaussian primes, and this factorization is unique up to the order
of the factors, and the replacement of any prime by any of its
associates (together with a corresponding change of the unit factor).
If one chooses, once for all, a fixed Gaussian prime for each equivalence class
of associated primes, and if one takes only these selected primes in
the factorization, then one obtains a prime factorization which is
unique up to the order of the factors. With the choices described above, the resulting unique factorization has the form
where u is a unit (that is, u ∈ {1, –1, i, –i}), e0 and k are nonnegative integers, e1, …, ek are positive integers, and p1, …, pk are distinct Gaussian primes such that, depending on the choice of selected associates,
either pk = ak + ibk with a odd and positive, and b even,
or the remainder of the Euclidean division of pk by 2 + 2i equals 1 (this is Gauss's original choice).
An advantage of the second choice is that the selected associates
behave well under products for Gaussian integers of odd norm. On the
other hand, the selected associate for the real Gaussian primes are
negative integers. For example, the factorization of 231 in the
integers, and with the first choice of associates is 3 × 7 × 11, while it is –1 × –3 × –7 × –11 with the second choice.
The ring of Gaussian integers is the integral closure of the integers in the Gaussian rationals.
This implies that Gaussian integers are quadratic integers and that a Gaussian rational is a Gaussian integer, if and only if it is a solution of an equation
with c and d integers. In fact a + bi is solution of the equation
and this equation has integer coefficients if and only if a and b are both integers.
Greatest common divisor
As for any unique factorization domain, a greatest common divisor (gcd) of two Gaussian integers a, b is a Gaussian integer d that is a common divisor of a and b, which has all common divisors of a and b as divisor. That is (where | denotes the divisibility relation),
d | a and d | b, and
c | a and c | b implies c | d.
Thus, greatest is meant relatively to the divisibility relation, and not for an ordering of the ring (for integers, both meanings of greatest coincide).
More technically, a greatest common divisor of a and b is a generator of the ideal generated by a and b (this characterization is valid for principal ideal domains, but not, in general, for unique factorization domains).
The greatest common divisor of two Gaussian integers is not unique, but is defined up to the multiplication by a unit. That is, given a greatest common divisor d of a and b, the greatest common divisors of a and b are d, –d, id, and –id.
There are several ways for computing a greatest common divisor of two Gaussian integers a and b. When one know prime factorizations of a and b,
where the primes pm are pairwise non associated, and the exponents μm non-associated, a greatest common divisor is
with
Unfortunately, except in simple cases, the prime factorization is difficult to compute, and Euclidean algorithm leads to a much easier (and faster) computation. This algorithm consists of replacing of the input (a, b) by (b, r), where r is the remainder of the Euclidean division of a by b, and repeating this operation until getting a zero remainder, that is a pair (d, 0). This process terminates, because, at each step, the norm of the second Gaussian integer decreases. The resulting d is a greatest common divisor, because (at each step) b and r = a – bq have the same divisors as a and b, and thus the same greatest common divisor.
This method of computation works always, but is not as simple as
for integers because Euclidean division is more complicate. Therefore, a
third method is often preferred for hand-written computations. It
consists in remarking that the norm N(d) of the greatest common divisor of a and b is a common divisor of N(a), N(b), and N(a + b). When the greatest common divisor D
of these three integers has few factors, then it is easy to test, for
common divisor, all Gaussian integers with a norm dividing D.
For example, if a = 5 + 3i, and b = 2 – 8i, one has N(a) = 34, N(b) = 68, and N(a + b) = 74. As the greatest common divisor of the three norms is 2, the greatest common divisor of a and b has 1 or 2 as a norm. As a gaussian integer of norm 2 is necessary associated to 1 + i, and as 1 + i divides a and b, then the greatest common divisor is 1 + i.
If b is replaced by its conjugate b = 2 + 8i, then the greatest common divisor of the three norms is 34, the norm of a, thus one may guess that the greatest common divisor is a, that is, that a | b. In fact, one has 2 + 8i = (5 + 3i)(1 + i).
Congruences and residue classes
Given a Gaussian integer z0, called a modulus, two Gaussian integers z1,z2 are congruent moduloz0, if their difference is a multiple of z0, that is if there exists a Gaussian integer q such that z1 − z2 = qz0. In other words, two Gaussian integers are congruent modulo z0, if their difference belongs to the ideal generated by z0. This is denoted as z1 ≡ z2 (mod z0).
The residue class of a Gaussian integer a is the set
of all Gaussian integers that are congruent to a. It follows that a = bif and only ifa ≡ b (mod z0).
Addition and multiplication are compatible with congruences. This means that a1 ≡ b1 (mod z0) and a2 ≡ b2 (mod z0) imply a1 + a2 ≡ b1 + b2 (mod z0) and a1a2 ≡ b1b2 (mod z0).
This defines well-defined operations (that is independent of the choice of representatives) on the residue classes:
With these operations, the residue classes form a commutative ring, the quotient ring of the Gaussian integers by the ideal generated by z0, which is also traditionally called the residue class ring moduloz0 (for more details, see Quotient ring).
Examples
There are exactly two residue classes for the modulus 1 + i, namely 0 = {0, ±2, ±4,…,±1 ± i, ±3 ± i,…} (all multiples of 1 + i), and 1 = {±1, ±3, ±5,…, ±i, ±2 ± i,…},
which form a checkerboard pattern in the complex plane. These two
classes form thus a ring with two elements, which is, in fact, a field, the unique (up to an isomorphism) field with two elements, and may thus be identified with the integers modulo 2.
These two classes may be considered as a generalization of the
partition of integers into even and odd integers. Thus one may speak of even and odd Gaussian integers (Gauss divided further even Gaussian integers into even, that is divisible by 2, and half-even).
For the modulus 2 there are four residue classes, namely 0, 1, i, 1 + i. These form a ring with four elements, in which x = –x for every x. Thus this ring is not isomorphic with the ring of integers modulo 4, another ring with four elements. One has 1 + i2 = 0, and thus this ring is not the finite field with four elements, nor the direct product of two copies of the ring of integers modulo 2.
For the modulus 2 + 2i = (i − 1)3 there are eight residue classes, namely 0, ±1, ±i, 1 ± i, 2, whereof four contain only even Gaussian integers and four contain only odd Gaussian integers.
Describing residue classes
All 13 residue classes with their minimal residues (blue dots) in the square Q00 (light green background) for the modulus z0 = 3 + 2i. One residue class with z = 2 − 4i ≡ −i (mod z0) is highlighted with yellow/orange dots.
Given a modulus z0, all elements of a residue class have the same remainder for the Euclidean division by z0, provided one uses the division with unique quotient and remainder, which is described above.
Thus enumerating the residue classes is equivalent with enumerating the
possible remainders. This can be done geometrically in the following
way.
In the complex plane, one may consider a square grid, whose squares are delimited by the two lines
with s and t integers (blue lines in the figure). These divide the plane in semi-open squares (where m and n are integers)
The semi-open intervals that occur in the definition of Qmn have been chosen in order that every complex number belong to exactly one square; that is, the squares Qmn form a partition of the complex plane. One has
This implies that every Gaussian integer is congruent modulo z0 to a unique Gaussian integer in Q00 (the green square in the figure), which its remainder for the division by z0. In other words, every residue class contains exactly one element in Q00.
The Gaussian integers in Q00 (or in its boundary) are sometimes called minimal residues because their norm are not greater than the norm of any other Gaussian integer in the same residue class (Gauss called them absolutely smallest residues).
From this one can deduce by geometrical considerations, that the number of residue classes modulo a Gaussian integer z0 = a + bi equals his norm N(z0) = a2 + b2.
Residue class fields
The residue class ring modulo a Gaussian integer z0 is a field if and only if is a Gaussian prime.
If z0 is a decomposed prime or the ramified prime 1 + i (that is, if its norm N(z0)
is a prime number, which is either 2 or a prime congruent to 1 modulo
4), then the residue class field has a prime number of elements (that
is, N(z0)). It is thus isomorphic to the field of the integers modulo N(z0).
If, on the other hand, z0 is an inert prime (that is, N(z0) = p2 is the square of a prime number, which is congruent to 3 modulo 4), then the residue class field has p2 elements, and it is an extension of degree 2 (unique, up to an isomorphism) of the prime field with p elements (the integers modulo p).
Primitive residue class group and Euler's totient function
Many
theorems (and their proofs) for moduli of integers can be directly
transferred to moduli of Gaussian integers, if one replaces the absolute
value of the modulus by the norm. This holds especially for the primitive residue class group (also called multiplicative group of integers modulo n) and Euler's totient function. The primitive residue class group of a modulus z is defined as the subset of its residue classes, which contains all residue classes a that are coprime to z, i.e. (a,z) = 1. Obviously, this system builds a multiplicative group. The number of its elements shall be denoted by ϕ(z) (analogously to Euler's totient function φ(n) for integers n).
For Gaussian primes it immediately follows that ϕ(p) = |p|2 − 1 and for arbitrary composite Gaussian integers
where the product is to build over all prime divisors pm of z (with νm > 0). Also the important theorem of Euler can be directly transferred:
For all a with (a,z) = 1, it holds that aϕ(z) ≡ 1 (mod z).
Historical background
The ring of Gaussian integers was introduced by Carl Friedrich Gauss in his second monograph on quartic reciprocity (1832). The theorem of quadratic reciprocity (which he had first succeeded in proving in 1796) relates the solvability of the congruence x2 ≡ q (mod p) to that of x2 ≡ p (mod q). Similarly, cubic reciprocity relates the solvability of x3 ≡ q (mod p) to that of x3 ≡ p (mod q), and biquadratic (or quartic) reciprocity is a relation between x4 ≡ q (mod p) and x4 ≡ p (mod q).
Gauss discovered that the law of biquadratic reciprocity and its
supplements were more easily stated and proved as statements about
"whole complex numbers" (i.e. the Gaussian integers) than they are as
statements about ordinary whole numbers (i.e. the integers).
In a footnote he notes that the Eisenstein integers are the natural domain for stating and proving results on cubic reciprocity and indicates that similar extensions of the integers are the appropriate domains for studying higher reciprocity laws.
This paper not only introduced the Gaussian integers and proved
they are a unique factorization domain, it also introduced the terms
norm, unit, primary, and associate, which are now standard in algebraic
number theory.
Unsolved problems
The distribution of the small Gaussian primes in the complex plane
Most of the unsolved problems are related to distribution of Gaussian primes in the plane.
Gauss's circle problem does not deal with the Gaussian integers per se, but instead asks for the number of lattice points
inside a circle of a given radius centered at the origin. This is
equivalent to determining the number of Gaussian integers with norm less
than a given value.
There are also conjectures and unsolved problems about the Gaussian primes. Two of them are:
The real and imaginary axes have the infinite set of Gaussian
primes 3, 7, 11, 19, ... and their associates. Are there any other lines
that have infinitely many Gaussian primes on them? In particular, are
there infinitely many Gaussian primes of the form 1 + ki?
Is it possible to walk to infinity using the Gaussian primes as
stepping stones and taking steps of a uniformly bounded length? This is
known as the Gaussian moat problem; it was posed in 1962 by Basil Gordon and remains unsolved.