A Medley of Potpourri: Greatest common divisor

Saturday, December 16, 2023

Greatest common divisor

From Wikipedia, the free encyclopedia

https://en.wikipedia.org/wiki/Greatest_common_divisor

In mathematics, the greatest common divisor (GCD) of two or more integers, which are not all zero, is the largest positive integer that divides each of the integers. For two integers $x$ , $y$ , the greatest common divisor of $x$ and $y$ is denoted $gcd (x, y)$ . For example, the GCD of 8 and 12 is 4, that is, $gcd (8, 12) = 4$ .

In the name "greatest common divisor", the adjective "greatest" may be replaced by "highest", and the word "divisor" may be replaced by "factor", so that other names include highest common factor (hcf), etc. Historically, other names for the same concept have included greatest common measure.

This notion can be extended to polynomials (see Polynomial greatest common divisor) and other commutative rings (see § In commutative rings below).

Overview

Definition

The greatest common divisor (GCD) of integers $a$ and $b$ , at least one of which is nonzero, is the greatest positive integer $d$ such that $d$ is a divisor of both $a$ and $b$ ; that is, there are integers $e$ and $f$ such that $a = de$ and $b = df$ , and $d$ is the largest such integer. The GCD of $a$ and $b$ is generally denoted $gcd(a, b)$ .

When one of $a$ and $b$ is zero, the GCD is the absolute value of the nonzero integer: $gcd(a, 0) = gcd(0, a) = | a |$ . This case is important as the terminating step of the Euclidean algorithm.

The above definition is unsuitable for defining $gcd(0, 0)$, since there is no greatest integer $n$ such that $0 \times n = 0$. However, zero is its own greatest divisor if greatest is understood in the context of the divisibility relation, so $gcd(0, 0)$ is commonly defined as $0$. This preserves the usual identities for GCD, and in particular Bézout's identity, namely that $gcd(a, b)$ generates the same ideal as $\{a, b\}$. This convention is followed by many computer algebra systems.^[12] Nonetheless, some authors leave $gcd(0, 0)$ undefined.
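
As one concrete illustration of these conventions (not part of the original article), Python's standard `math.gcd` returns the absolute value when one argument is zero and returns 0 for two zero arguments. The variable names below are chosen only for this sketch.

```python
import math

# gcd(a, 0) = |a| for a nonzero integer a, including negative a.
gcd_with_zero = math.gcd(-8, 0)

# gcd(0, 0) follows the common convention and is defined as 0.
gcd_zero_zero = math.gcd(0, 0)

print(gcd_with_zero, gcd_zero_zero)
```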

The GCD of $a$ and $b$ is their greatest positive common divisor in the preorder relation of divisibility. This means that the common divisors of $a$ and $b$ are exactly the divisors of their GCD. This is commonly proved by using either Euclid's lemma, the fundamental theorem of arithmetic, or the Euclidean algorithm. This is the meaning of "greatest" that is used for the generalizations of the concept of GCD.

Example

The number 54 can be expressed as a product of two integers in several different ways:

54 \times 1 = 27 \times 2 = 18 \times 3 = 9 \times 6.

Thus the complete list of divisors of 54 is $1, 2, 3, 6, 9, 18, 27, 54$ . Similarly, the divisors of 24 are $1, 2, 3, 4, 6, 8, 12, 24$ . The numbers that these two lists have in common are the common divisors of 54 and 24, that is,

1, 2, 3, 6.

Of these, the greatest is 6, so it is the greatest common divisor:

gcd (54, 24) = 6.

Computing all divisors of the two numbers in this way is usually not efficient, especially for large numbers that have many divisors. Much more efficient methods are described in § Calculation.
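
The listing approach used in this example can be sketched in a few lines of Python. This is a minimal illustration for small positive integers only; the helper names `divisors` and `gcd_by_listing` are hypothetical and were chosen for this sketch.

```python
def divisors(n):
    """Return the sorted list of positive divisors of a positive integer n."""
    return [d for d in range(1, n + 1) if n % d == 0]


def gcd_by_listing(a, b):
    """GCD of two positive integers, found as the largest common divisor."""
    common = set(divisors(a)) & set(divisors(b))
    return max(common)


print(divisors(54))                                   # divisors of 54
print(sorted(set(divisors(54)) & set(divisors(24))))  # common divisors
print(gcd_by_listing(54, 24))                         # greatest of them
```

Each call to `divisors` performs n trial divisions, which is why this approach does not scale to large inputs.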

Coprime numbers

Two numbers are called relatively prime, or coprime, if their greatest common divisor equals 1. For example, 9 and 28 are coprime.

A geometric view

Figure: A 24-by-60 rectangle is covered with ten 12-by-12 square tiles, where 12 is the GCD of 24 and 60. More generally, an a-by-b rectangle can be covered with square tiles of side length c only if c is a common divisor of a and b.

For example, a 24-by-60 rectangular area can be divided into a grid of: 1-by-1 squares, 2-by-2 squares, 3-by-3 squares, 4-by-4 squares, 6-by-6 squares or 12-by-12 squares. Therefore, 12 is the greatest common divisor of 24 and 60. A 24-by-60 rectangular area can thus be divided into a grid of 12-by-12 squares, with two squares along one edge (24/12 = 2) and five squares along the other (60/12 = 5).

Applications

Reducing fractions

The greatest common divisor is useful for reducing fractions to the lowest terms. For example, gcd(42, 56) = 14, therefore,

\frac{42}{56} = \frac{3 \cdot 14}{4 \cdot 14} = \frac{3}{4} .
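
A minimal Python sketch of this reduction, assuming integer inputs and a nonzero denominator; the function name `reduce_fraction` is hypothetical.

```python
import math


def reduce_fraction(numerator, denominator):
    """Divide numerator and denominator by their GCD."""
    if denominator == 0:
        raise ValueError("denominator must be nonzero")
    g = math.gcd(numerator, denominator)
    return numerator // g, denominator // g


print(reduce_fraction(42, 56))
```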

Least common multiple

The least common multiple of two integers that are not both zero can be computed from their greatest common divisor, by using the relation

lcm (a, b) = \frac{| a \cdot b |}{\gcd (a, b)} .
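
The relation translates directly into code. The sketch below assumes the inputs are integers that are not both zero, as in the statement above; the function name `lcm` is chosen for illustration.

```python
import math


def lcm(a, b):
    """Least common multiple via lcm(a, b) = |a * b| / gcd(a, b)."""
    if a == 0 and b == 0:
        raise ValueError("the relation requires a and b not both zero")
    # The division is exact because gcd(a, b) divides a * b.
    return abs(a * b) // math.gcd(a, b)


print(lcm(48, 180))
```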

Calculation

Using prime factorizations

Greatest common divisors can be computed by determining the prime factorizations of the two numbers and comparing factors. For example, to compute gcd(48, 180), we find the prime factorizations 48 = 2⁴ · 3¹ and 180 = 2² · 3² · 5¹; the GCD is then 2^min(4,2) · 3^min(1,2) · 5^min(0,1) = 2² · 3¹ · 5⁰ = 12. The corresponding LCM is then 2^max(4,2) · 3^max(1,2) · 5^max(0,1) = 2⁴ · 3² · 5¹ = 720.

In practice, this method is only feasible for small numbers, as computing prime factorizations takes too long.
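
For small inputs the factorization method can be sketched as follows. Trial division is used here purely as an assumption for illustration (the text does not prescribe a factoring method), and the helper names are hypothetical.

```python
from collections import Counter


def prime_factorization(n):
    """Factor a positive integer by trial division; maps prime -> exponent."""
    factors = Counter()
    p = 2
    while p * p <= n:
        while n % p == 0:
            factors[p] += 1
            n //= p
        p += 1
    if n > 1:
        factors[n] += 1  # whatever remains is a single prime factor
    return factors


def gcd_from_factorizations(a, b):
    """Multiply each shared prime raised to the smaller of its two exponents."""
    fa, fb = prime_factorization(a), prime_factorization(b)
    result = 1
    for p in fa.keys() & fb.keys():
        result *= p ** min(fa[p], fb[p])
    return result


print(gcd_from_factorizations(48, 180))
```

Primes that appear in only one factorization contribute an exponent of min(e, 0) = 0, so the loop can skip them.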

Euclid's algorithm

The method introduced by Euclid for computing greatest common divisors is based on the fact that, given two positive integers $a$ and $b$ such that $a > b$ , the common divisors of $a$ and $b$ are the same as the common divisors of $a - b$ and $b$ .

So, Euclid's method for computing the greatest common divisor of two positive integers consists of replacing the larger number by the difference of the numbers, and repeating this until the two numbers are equal: that is their greatest common divisor.

For example, to compute $gcd(48,18)$ , one proceeds as follows:

\begin{aligned} \gcd(48, 18) & \to \gcd(48 - 18, 18) = \gcd(30, 18) \\ & \to \gcd(30 - 18, 18) = \gcd(12, 18) \\ & \to \gcd(12, 18 - 12) = \gcd(12, 6) \\ & \to \gcd(12 - 6, 6) = \gcd(6, 6). \end{aligned}

So $gcd(48, 18) = 6$ .
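
The subtraction method described above can be sketched in Python as follows, assuming both inputs are positive integers; the function name `gcd_subtraction` is hypothetical.

```python
def gcd_subtraction(a, b):
    """Euclid's original method: subtract the smaller number from the larger."""
    if a <= 0 or b <= 0:
        raise ValueError("both arguments must be positive integers")
    while a != b:
        if a > b:
            a -= b
        else:
            b -= a
    return a  # a == b here, and this common value is the GCD


print(gcd_subtraction(48, 18))
```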

This method can be very slow if one number is much larger than the other. So, the variant that follows is generally preferred.

Euclidean algorithm

Animation showing an application of the Euclidean algorithm to find the greatest common divisor of 62 and 36, which is 2.

A more efficient method is the Euclidean algorithm, a variant in which the difference of the two numbers $a$ and $b$ is replaced by the remainder of the Euclidean division (also called division with remainder) of $a$ by $b$ .

Denoting this remainder as $a \bmod b$, the algorithm replaces $(a, b)$ by $(b, a \bmod b)$ repeatedly until the pair is $(d, 0)$, where $d$ is the greatest common divisor.

For example, to compute gcd(48,18), the computation is as follows:

\begin{aligned} \gcd(48, 18) & \to \gcd(18, 48 \bmod 18) = \gcd(18, 12) \\ & \to \gcd(12, 18 \bmod 12) = \gcd(12, 6) \\ & \to \gcd(6, 12 \bmod 6) = \gcd(6, 0). \end{aligned}

This again gives $gcd(48, 18) = 6$ .
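
A minimal Python sketch of the remainder-based loop follows. Taking absolute values first is an assumption added here so that negative inputs are handled; the function name `gcd_euclid` is hypothetical.

```python
def gcd_euclid(a, b):
    """Euclidean algorithm: replace (a, b) by (b, a mod b) until b is 0."""
    a, b = abs(a), abs(b)
    while b != 0:
        a, b = b, a % b
    return a


print(gcd_euclid(48, 18))
print(gcd_euclid(62, 36))
```

With this loop, `gcd_euclid(a, 0)` returns |a| immediately, which matches the terminating case described in the definition.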

Lehmer's GCD algorithm

Lehmer's algorithm is based on the observation that the initial quotients produced by Euclid's algorithm can be determined based on only the first few digits; this is useful for numbers that are larger than a computer word. In essence, one extracts initial digits, typically forming one or two computer words, and runs Euclid's algorithm on these smaller numbers, as long as it is guaranteed that the quotients are the same as those that would be obtained with the original numbers. The quotients are collected into a small 2-by-2 transformation matrix (a matrix of single-word integers) to reduce the original numbers. This process is repeated until the numbers are small enough that the binary algorithm (see below) is more efficient.

This algorithm improves speed, because it reduces the number of operations on very large numbers, and can use hardware arithmetic for most operations. In fact, most of the quotients are very small, so a fair number of steps of the Euclidean algorithm can be collected in a 2-by-2 matrix of single-word integers. When Lehmer's algorithm encounters a quotient that is too large, it must fall back to one iteration of the Euclidean algorithm, with a Euclidean division of large numbers.

Binary GCD algorithm

The binary GCD algorithm uses only subtraction and division by 2. The method is as follows: Let a and b be the two non-negative integers. Let the integer d be 0. There are five possibilities:

a = b.

As gcd(a, a) = a, the desired GCD is a × 2^d (as a and b are changed in the other cases, and d records the number of times that a and b have been both divided by 2 in the next step, the GCD of the initial pair is the product of a and 2^d).

Both a and b are even.

Then 2 is a common divisor. Divide both a and b by 2, increment d by 1 to record the number of times 2 is a common divisor and continue.

a is even and b is odd.

Then 2 is not a common divisor. Divide a by 2 and continue.

a is odd and b is even.

Then 2 is not a common divisor. Divide b by 2 and continue.

Both a and b are odd.

As gcd(a,b) = gcd(b,a), if a < b then exchange a and b. The number c = a − b is positive and smaller than a. Any number that divides a and b must also divide c, so every common divisor of a and b is also a common divisor of b and c. Similarly, a = b + c, and every common divisor of b and c is also a common divisor of a and b. So the two pairs (a, b) and (b, c) have the same common divisors, and thus gcd(a,b) = gcd(b,c). Moreover, as a and b are both odd, c is even, so the process can be continued with the pair (a, b) replaced by the smaller numbers (c/2, b) without changing the GCD.

Each of the above steps reduces at least one of a and b while leaving them non-negative and so can only be repeated a finite number of times. Thus eventually the process results in a = b, the stopping case. Then the GCD is a × 2^d.

Example: (a, b, d) = (48, 18, 0) → (24, 9, 1) → (12, 9, 1) → (6, 9, 1) → (3, 9, 1) → (3, 3, 1) ; the original GCD is thus the product 6 of 2^d = 2¹ and a = b = 3.
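
The five cases above can be sketched in Python as follows. One assumption is added that the case list does not spell out: an input of zero is handled up front, because repeatedly halving 0 would never reach the stopping case a = b. The function name `gcd_binary` is hypothetical.

```python
def gcd_binary(a, b):
    """Binary GCD for non-negative integers, following the five cases."""
    if a < 0 or b < 0:
        raise ValueError("arguments must be non-negative")
    # Assumption of this sketch: treat zero separately, using gcd(a, 0) = a.
    if a == 0:
        return b
    if b == 0:
        return a
    d = 0  # number of times both a and b were divided by 2
    while a != b:
        a_even = a % 2 == 0
        b_even = b % 2 == 0
        if a_even and b_even:
            a //= 2
            b //= 2
            d += 1
        elif a_even:
            a //= 2
        elif b_even:
            b //= 2
        else:
            # Both odd: make a the larger one, then replace a by (a - b) / 2.
            if a < b:
                a, b = b, a
            a = (a - b) // 2
    return a * 2 ** d


print(gcd_binary(48, 18))
```

In practice the divisions by 2 and parity tests would typically be written as bit shifts and bit masks; plain arithmetic is used here to stay close to the description.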

The binary GCD algorithm is particularly easy to implement on binary computers. Its computational complexity is

O ((\log a + \log b)^{2})

The computational complexity is usually given in terms of the length $n$ of the input. Here, this length is $n = \log a + \log b,$ and the complexity is thus

O (n^{2})

Other methods

If a and b are both nonzero, the greatest common divisor of a and b can be computed by using least common multiple (LCM) of a and b:

gcd (a, b) = \frac{| a \cdot b |}{lcm (a, b)}

but more commonly the LCM is computed from the GCD.

Using Thomae's function f,

gcd (a, b) = a f (\frac{b}{a}),

which generalizes to a and b rational numbers or commensurable real numbers.
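
As an illustration only, the identity with Thomae's function can be checked in Python for integer arguments. The sketch assumes a is a positive integer and relies on `fractions.Fraction`, which itself reduces to lowest terms using a GCD internally, so this is a check of the formula rather than an independent way to compute the GCD. The function names are hypothetical.

```python
from fractions import Fraction


def thomae(x):
    """Thomae's function at a rational x = p/q in lowest terms: f(x) = 1/q."""
    return Fraction(1, Fraction(x).denominator)


def gcd_via_thomae(a, b):
    """gcd(a, b) = a * f(b / a), for a positive integer a and an integer b."""
    if a <= 0:
        raise ValueError("a must be a positive integer")
    return int(a * thomae(Fraction(b, a)))


print(gcd_via_thomae(18, 48))
```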

Keith Slavin has shown that for odd a ≥ 1:

gcd (a, b) = \log_{2} \prod_{k = 0}^{a - 1} (1 + e^{- 2 i π k b / a})

which is a function that can be evaluated for complex b. Wolfgang Schramm has shown that

gcd (a, b) = \sum_{k = 1}^{a} \exp (2 π i k b / a) \cdot \sum_{d | a} \frac{c_{d} (k)}{d}

is an entire function in the variable b for all positive integers a where c_d(k) is Ramanujan's sum.

Complexity

The computational complexity of the computation of greatest common divisors has been widely studied. If one uses the Euclidean algorithm and the elementary algorithms for multiplication and division, the computation of the greatest common divisor of two integers of at most $n$ bits is $O (n^{2}) .$ This means that the computation of greatest common divisor has, up to a constant factor, the same complexity as the multiplication.

However, if a fast multiplication algorithm is used, one may modify the Euclidean algorithm for improving the complexity, but the computation of a greatest common divisor becomes slower than the multiplication. More precisely, if the multiplication of two integers of $n$ bits takes a time of $T (n)$ , then the fastest known algorithm for greatest common divisor has a complexity $O (T (n) \log n) .$ This implies that the fastest known algorithm has a complexity of $O (n (\log n)^{2}) .$

Previous complexities are valid for the usual models of computation, specifically multitape Turing machines and random-access machines.

The computation of the greatest common divisors belongs thus to the class of problems solvable in quasilinear time. A fortiori, the corresponding decision problem belongs to the class P of problems solvable in polynomial time. The GCD problem is not known to be in NC, and so there is no known way to parallelize it efficiently; nor is it known to be P-complete, which would imply that it is unlikely to be possible to efficiently parallelize GCD computation. Shallcross et al. showed that a related problem (EUGCD, determining the remainder sequence arising during the Euclidean algorithm) is NC-equivalent to the problem of integer linear programming with two variables; if either problem is in NC or is P-complete, the other is as well.^[19] Since NC contains NL, it is also unknown whether a space-efficient algorithm for computing the GCD exists, even for nondeterministic Turing machines.

Although the problem is not known to be in NC, parallel algorithms asymptotically faster than the Euclidean algorithm exist; the fastest known deterministic algorithm is by Chor and Goldreich, which (in the CRCW-PRAM model) can solve the problem in $O(n / \log n)$ time with $n^{1 + ε}$ processors. Randomized algorithms can solve the problem in $O((\log n)^{2})$ time on $\exp (O (\sqrt{n \log n}))$ processors (this is superpolynomial).

Properties

For positive integers a, $gcd(a, a) = a$ .
Every common divisor of a and b is a divisor of $gcd(a, b)$ .
$gcd(a, b)$ , where a and b are not both zero, may be defined alternatively and equivalently as the smallest positive integer d which can be written in the form $d = a \cdot p + b \cdot q$ , where p and q are integers. This expression is called Bézout's identity. Numbers p and q like this can be computed with the extended Euclidean algorithm.
$gcd(a, 0) = | a |$ , for $a \neq 0$ , since any number is a divisor of 0, and the greatest divisor of a is $| a |$ .^[2]^[5] This is usually used as the base case in the Euclidean algorithm.
If a divides the product b⋅c, and $gcd(a, b) = d$ , then a/d divides c.
If m is a positive integer, then $\gcd(m \cdot a, m \cdot b) = m \cdot \gcd(a, b)$.
If m is any integer, then $\gcd(a + m \cdot b, b) = \gcd(a, b)$. Equivalently, $\gcd(a \bmod b, b) = \gcd(a, b)$.
If m is a positive common divisor of a and b, then $gcd(a / m, b / m) = gcd(a, b)/ m$ .
The GCD is a commutative function: $gcd(a, b) = gcd(b, a)$ .
The GCD is an associative function: $gcd(a, gcd(b, c)) = gcd(gcd(a, b), c)$ . Thus $gcd(a, b, c, ...)$ can be used to denote the GCD of multiple arguments.
The GCD is a multiplicative function in the following sense: if a₁ and a₂ are relatively prime, then $\gcd(a_1 \cdot a_2, b) = \gcd(a_1, b) \cdot \gcd(a_2, b)$.
$\gcd(a, b)$ is closely related to the least common multiple $\operatorname{lcm}(a, b)$: we have
$\gcd(a, b) \cdot \operatorname{lcm}(a, b) = | a \cdot b |$.

This formula is often used to compute least common multiples: one first computes the GCD with Euclid's algorithm and then divides the product of the given numbers by their GCD.

The following versions of distributivity hold true:
$gcd(a, lcm(b, c)) = lcm(gcd(a, b), gcd(a, c))$
$lcm(a, gcd(b, c)) = gcd(lcm(a, b), lcm(a, c))$ .
If we have the unique prime factorizations of $a = p_1^{e_1} p_2^{e_2} \cdots p_m^{e_m}$ and $b = p_1^{f_1} p_2^{f_2} \cdots p_m^{f_m}$ where $e_i \geq 0$ and $f_i \geq 0$, then the GCD of a and b is
$\gcd(a, b) = p_1^{\min(e_1, f_1)} p_2^{\min(e_2, f_2)} \cdots p_m^{\min(e_m, f_m)}$.
It is sometimes useful to define $gcd(0, 0) = 0$ and $lcm(0, 0) = 0$ because then the natural numbers become a complete distributive lattice with GCD as meet and LCM as join operation. This extension of the definition is also compatible with the generalization for commutative rings given below.
In a Cartesian coordinate system, $gcd(a, b)$ can be interpreted as the number of segments between points with integral coordinates on the straight line segment joining the points $(0, 0)$ and $(a, b)$ .
For non-negative integers a and b, where a and b are not both zero, provable by considering the Euclidean algorithm in base n:
$\gcd(n^{a} - 1, n^{b} - 1) = n^{\gcd(a, b)} - 1$.
An identity involving Euler's totient function:
$\gcd(a, b) = \sum_{k \mid a \text{ and } k \mid b} φ(k).$
$\sum_{k = 1}^{n} gcd (k, n) = n \prod_{p | n} (1 + ν_{p} (n) (1 - \frac{1}{p}))$ where $ν_{p} (n)$ is the p-adic valuation.
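
The property involving Bézout's identity mentions that suitable p and q can be computed with the extended Euclidean algorithm. A minimal sketch of one common formulation is given below, assuming non-negative integer inputs; the function name `extended_gcd` and the variable names are hypothetical.

```python
def extended_gcd(a, b):
    """Return (d, p, q) such that d = gcd(a, b) and d == a * p + b * q."""
    old_r, r = a, b  # remainders
    old_p, p = 1, 0  # coefficients of a
    old_q, q = 0, 1  # coefficients of b
    while r != 0:
        quotient = old_r // r
        # Each update keeps the invariant old_r == a * old_p + b * old_q.
        old_r, r = r, old_r - quotient * r
        old_p, p = p, old_p - quotient * p
        old_q, q = q, old_q - quotient * q
    return old_r, old_p, old_q


d, p, q = extended_gcd(48, 18)
print(d, p, q, 48 * p + 18 * q)
```

The coefficients are not unique; this formulation returns one valid pair among infinitely many.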

Probabilities and expected value

In 1972, James E. Nymann showed that k integers, chosen independently and uniformly from {1, ..., n}, are coprime with probability 1/ζ(k) as n goes to infinity, where ζ refers to the Riemann zeta function. (See coprime for a derivation.) This result was extended in 1987 to show that the probability that k random integers have greatest common divisor d is d^−k/ζ(k).

Using this information, the expected value of the greatest common divisor function can be seen (informally) to not exist when k = 2. In this case the probability that the GCD equals d is d⁻²/ζ(2), and since ζ(2) = π²/6 we have

E (2) = \sum_{d = 1}^{\infty} d \frac{6}{π^{2} d^{2}} = \frac{6}{π^{2}} \sum_{d = 1}^{\infty} \frac{1}{d} .

This last summation is the harmonic series, which diverges. However, when k ≥ 3, the expected value is well-defined, and by the above argument, it is

E (k) = \sum_{d = 1}^{\infty} d^{1 - k} ζ (k)^{- 1} = \frac{ζ (k - 1)}{ζ (k)} .

For k = 3, this is approximately equal to 1.3684. For k = 4, it is approximately 1.1106.
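
The limiting probability for k = 2, namely 1/ζ(2) = 6/π², can be checked numerically by counting coprime pairs exactly for a finite n. The sketch below is a sanity check under that finite-n assumption, not a proof; the function name `coprime_pair_fraction` and the choice n = 300 are illustrative.

```python
import math


def coprime_pair_fraction(n):
    """Fraction of ordered pairs (a, b) in {1, ..., n}^2 with gcd(a, b) = 1."""
    coprime = sum(
        1
        for a in range(1, n + 1)
        for b in range(1, n + 1)
        if math.gcd(a, b) == 1
    )
    return coprime / (n * n)


finite_value = coprime_pair_fraction(300)
limit_value = 6 / math.pi ** 2
print(finite_value, limit_value)
```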

In commutative rings

The notion of greatest common divisor can more generally be defined for elements of an arbitrary commutative ring, although in general there need not exist one for every pair of elements.

If $R$ is a commutative ring, and $a$ and $b$ are in $R$ , then an element $d$ of $R$ is called a common divisor of $a$ and $b$ if it divides both $a$ and $b$ (that is, if there are elements $x$ and $y$ in $R$ such that d·x = a and d·y = b). If $d$ is a common divisor of $a$ and $b$ , and every common divisor of $a$ and $b$ divides $d$ , then $d$ is called a greatest common divisor of $a$ and b.

With this definition, two elements $a$ and $b$ may very well have several greatest common divisors, or none at all. If $R$ is an integral domain then any two GCDs of $a$ and $b$ must be associate elements, since by definition either one must divide the other; indeed if a GCD exists, any one of its associates is a GCD as well. Existence of a GCD is not assured in arbitrary integral domains. However, if $R$ is a unique factorization domain, then any two elements have a GCD, and more generally this is true in GCD domains. If $R$ is a Euclidean domain in which Euclidean division is given algorithmically (as is the case for instance when R = F[X] where $F$ is a field, or when $R$ is the ring of Gaussian integers), then greatest common divisors can be computed using a form of the Euclidean algorithm based on the division procedure.
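
As an illustration of the Euclidean algorithm in a Euclidean domain other than the integers, the sketch below works in the Gaussian integers. It assumes a Gaussian integer x + yi is represented as a tuple (x, y), and it takes the quotient to be the exact quotient with each coordinate rounded to the nearest integer, which makes the norm of the remainder at most half the norm of the divisor. All function names are hypothetical, and the result is only determined up to multiplication by a unit (1, −1, i, −i).

```python
def round_div(x, n):
    """Nearest integer to x / n for a positive integer n."""
    return (2 * x + n) // (2 * n)


def gauss_mul(u, v):
    """Product of two Gaussian integers given as (real, imag) tuples."""
    return (u[0] * v[0] - u[1] * v[1], u[0] * v[1] + u[1] * v[0])


def gauss_norm(u):
    """Norm N(x + yi) = x^2 + y^2."""
    return u[0] ** 2 + u[1] ** 2


def gauss_divmod(a, b):
    """Return (q, r) with a = q * b + r and N(r) < N(b), for b != 0."""
    n = gauss_norm(b)
    # a / b = a * conj(b) / N(b); round each coordinate to the nearest integer.
    num = gauss_mul(a, (b[0], -b[1]))
    q = (round_div(num[0], n), round_div(num[1], n))
    qb = gauss_mul(q, b)
    return q, (a[0] - qb[0], a[1] - qb[1])


def gauss_gcd(a, b):
    """A greatest common divisor of a and b, determined up to a unit."""
    while b != (0, 0):
        _, r = gauss_divmod(a, b)
        a, b = b, r
    return a


g = gauss_gcd((11, 3), (1, 8))
print(g, gauss_norm(g))
```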

The following is an example of an integral domain with two elements that do not have a GCD:

R = Z [\sqrt{- 3}], a = 4 = 2 \cdot 2 = (1 + \sqrt{- 3}) (1 - \sqrt{- 3}), b = (1 + \sqrt{- 3}) \cdot 2.

The elements 2 and 1 + √−3 are two maximal common divisors (that is, any common divisor which is a multiple of 2 is associated to 2, and the same holds for 1 + √−3), but they are not associated, so there is no greatest common divisor of $a$ and $b$.

Corresponding to the Bézout property we may, in any commutative ring, consider the collection of elements of the form pa + qb, where $p$ and $q$ range over the ring. This is the ideal generated by $a$ and $b$ , and is denoted simply (a, b). In a ring all of whose ideals are principal (a principal ideal domain or PID), this ideal will be identical with the set of multiples of some ring element d; then this $d$ is a greatest common divisor of $a$ and b. But the ideal (a, b) can be useful even when there is no greatest common divisor of $a$ and b. (Indeed, Ernst Kummer used this ideal as a replacement for a GCD in his treatment of Fermat's Last Theorem, although he envisioned it as the set of multiples of some hypothetical, or ideal, ring element $d$ , whence the ring-theoretic term.)
