
Integral


Figure: a definite integral of a function can be represented as the signed area of the region bounded by its graph.

In mathematics, an integral assigns numbers to functions in a way that can describe displacement, area, volume, and other concepts that arise by combining infinitesimal data. Integration is one of the two main operations of calculus, with its inverse operation, differentiation, being the other. Given a function f of a real variable x and an interval [a, b] of the real line, the definite integral
\int_a^b f(x)\,dx
is defined informally as the signed area of the region in the xy-plane that is bounded by the graph of f, the x-axis and the vertical lines x = a and x = b. The area above the x-axis adds to the total and that below the x-axis subtracts from the total.

The operation of integration, up to an additive constant, is the inverse of the operation of differentiation. For this reason, the term integral may also refer to the related notion of the antiderivative, a function F whose derivative is the given function f. In this case, it is called an indefinite integral and is written:
F(x)=\int f(x)\,dx.
The integrals discussed in this article are those termed definite integrals. It is the fundamental theorem of calculus that connects differentiation with the definite integral: if f is a continuous real-valued function defined on a closed interval [a, b], then, once an antiderivative F of f is known, the definite integral of f over that interval is given by
\int_a^b f(x)\,dx = \bigl[F(x)\bigr]_a^b = F(b) - F(a).
The principles of integration were formulated independently by Isaac Newton and Gottfried Wilhelm Leibniz in the late 17th century; they thought of the integral as an infinite sum of rectangles of infinitesimal width. Bernhard Riemann later gave a rigorous mathematical definition of integrals, based on a limiting procedure that approximates the area of a curvilinear region by breaking the region into thin vertical slabs. Beginning in the nineteenth century, more sophisticated notions of integrals began to appear, in which both the type of the function and the domain over which the integration is performed are generalized. A line integral is defined for functions of two or more variables, and the interval of integration [a, b] is replaced by a curve connecting the two endpoints. In a surface integral, the curve is replaced by a piece of a surface in three-dimensional space.

History

Pre-calculus integration

The first documented systematic technique capable of determining integrals is the method of exhaustion of the ancient Greek astronomer Eudoxus (ca. 370 BC), which sought to find areas and volumes by breaking them up into an infinite number of divisions for which the area or volume was known. This method was further developed and employed by Archimedes in the 3rd century BC and used to calculate areas for parabolas and an approximation to the area of a circle.

A similar method was independently developed in China around the 3rd century AD by Liu Hui, who used it to find the area of the circle. This method was later used in the 5th century by Chinese father-and-son mathematicians Zu Chongzhi and Zu Geng to find the volume of a sphere (Shea 2007; Katz 2004, pp. 125–126).

The next significant advances in integral calculus did not begin to appear until the 17th century. At this time, the work of Cavalieri with his method of indivisibles, and work by Fermat, began to lay the foundations of modern calculus, with Cavalieri computing the integrals of xⁿ up to degree n = 9 in Cavalieri's quadrature formula. Further steps were made in the early 17th century by Barrow and Torricelli, who provided the first hints of a connection between integration and differentiation. Barrow provided the first proof of the fundamental theorem of calculus. Wallis generalized Cavalieri's method, computing integrals of x to a general power, including negative powers and fractional powers.

Newton and Leibniz

The major advance in integration came in the 17th century with the independent discovery of the fundamental theorem of calculus by Newton and Leibniz. The theorem demonstrates a connection between integration and differentiation. This connection, combined with the comparative ease of differentiation, can be exploited to calculate integrals. In particular, the fundamental theorem of calculus allows one to solve a much broader class of problems. Equal in importance is the comprehensive mathematical framework that both Newton and Leibniz developed. Given the name infinitesimal calculus, it allowed for precise analysis of functions within continuous domains. This framework eventually became modern calculus, whose notation for integrals is drawn directly from the work of Leibniz.

Formalization

While Newton and Leibniz provided a systematic approach to integration, their work lacked a degree of rigour. Bishop Berkeley memorably attacked the vanishing increments used by Newton, calling them "ghosts of departed quantities". Calculus acquired a firmer footing with the development of limits. Integration was first rigorously formalized, using limits, by Riemann. Although all bounded piecewise continuous functions are Riemann-integrable on a bounded interval, subsequently more general functions were considered—particularly in the context of Fourier analysis—to which Riemann's definition does not apply, and Lebesgue formulated a different definition of integral, founded in measure theory (a subfield of real analysis). Other definitions of integral, extending Riemann's and Lebesgue's approaches, were proposed. These approaches based on the real number system are the ones most common today, but alternative approaches exist, such as a definition of integral as the standard part of an infinite Riemann sum, based on the hyperreal number system.

Historical notation

Isaac Newton used a small vertical bar above a variable to indicate integration, or placed the variable inside a box. The vertical bar was easily confused with ẋ or x′, which are used to indicate differentiation, and the box notation was difficult for printers to reproduce, so these notations were not widely adopted.

The modern notation for the indefinite integral was introduced by Gottfried Wilhelm Leibniz in 1675 (Burton 1988, p. 359; Leibniz 1899, p. 154). He adapted the integral symbol, ∫, from the letter ſ (long s), standing for summa (written as ſumma; Latin for "sum" or "total"). The modern notation for the definite integral, with limits above and below the integral sign, was first used by Joseph Fourier in Mémoires of the French Academy around 1819–20, reprinted in his book of 1822 (Cajori 1929, pp. 249–250; Fourier 1822, §231).

Applications

Integrals are used extensively in many areas of mathematics as well as in many other areas that rely on mathematics.

For example, in probability theory, integrals are used to determine the probability of some random variable falling within a certain range. Moreover, the integral of a probability density function over its entire domain must equal 1, which provides a test of whether a function with no negative values could be a density function.
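As an illustrative sketch of this normalization test (Python with scipy is an assumption, as the article itself contains no code; the exponential density is an arbitrary example):

    # Check that a candidate density integrates to 1 over its support.
    # Example: the exponential density f(x) = lam * exp(-lam * x) on [0, inf).
    from scipy.integrate import quad
    import numpy as np

    lam = 2.0  # illustrative rate parameter
    total, err = quad(lambda x: lam * np.exp(-lam * x), 0, np.inf)
    print(total)  # ~1.0, so f passes the normalization test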

Integrals can be used for computing the area of a two-dimensional region that has a curved boundary, as well as computing the volume of a three-dimensional object that has a curved boundary.

Integrals are also used in physics, in areas like kinematics to find quantities like displacement, time, and velocity. For example, in rectilinear motion, the displacement of an object over the time interval [a,b] is given by:
x(b)-x(a)=\int_a^b v(t)\,dt,
where v(t) is the velocity expressed as a function of time. The work done by a force F(x) (given as a function of position) from an initial position A to a final position B is:
W_{A\to B}=\int_A^B F(x)\,dx.
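As a minimal sketch of these two formulas (Python with sympy assumed; the velocity and force functions are illustrative choices, not from the original):

    # Displacement as the integral of velocity, and work as the integral
    # of a position-dependent force.
    import sympy as sp

    t, x = sp.symbols('t x')

    v = 3 * t**2                               # an illustrative velocity v(t)
    displacement = sp.integrate(v, (t, 0, 2))  # x(2) - x(0) = 8

    F = 5 * x                                  # an illustrative force F(x)
    work = sp.integrate(F, (x, 0, 1))          # W = 5/2
    print(displacement, work)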
Integrals are also used in thermodynamics, where thermodynamic integration is used to calculate the difference in free energy between two given states.

Terminology and notation

Standard

The integral with respect to x of a real-valued function f(x) of a real variable x on the interval [a, b] is written as
\displaystyle \int _{a}^{b}f(x)\,dx.
The integral sign represents integration. The symbol dx, called the differential of the variable x, indicates that the variable of integration is x. The function f(x) to be integrated is called the integrand. The symbol dx is separated from the integrand by a space (as shown). If a function has an integral, it is said to be integrable. The points a and b are called the limits of the integral. An integral where the limits are specified is called a definite integral. The integral is said to be over the interval [a, b].

If the integral goes from a finite value a to the upper limit infinity, the integral expresses the limit of the integral from a to a value b as b goes to infinity. If the value of the integral gets closer and closer to a finite value, the integral is said to converge to that value. If not, the integral is said to diverge.

When the limits are omitted, as in
\int f(x)\,dx,
the integral is called an indefinite integral, which represents a class of functions (the antiderivative) whose derivative is the integrand. The fundamental theorem of calculus relates the evaluation of definite integrals to indefinite integrals. Occasionally, limits of integration are omitted for definite integrals when the same limits occur repeatedly in a particular context. Usually, the author will make this convention clear at the beginning of the relevant text.

There are several extensions of the notation for integrals to encompass integration on unbounded domains and/or in multiple dimensions (see later sections of this article).

Meaning of the symbol dx

Historically, the symbol dx was taken to represent an infinitesimally "small piece" of the independent variable x to be multiplied by the integrand and summed up in an infinite sense. While this notion is still heuristically useful, later mathematicians have deemed infinitesimal quantities to be untenable from the standpoint of the real number system. In introductory calculus, the expression dx is therefore not assigned an independent meaning; instead, it is viewed as part of the symbol for integration and serves as its delimiter on the right side of the expression being integrated.

In more sophisticated contexts, dx can have its own significance, which depends on the particular area of mathematics being discussed. When used in one of these ways, the original Leibniz notation is co-opted to apply to a generalization of the original definition of the integral. Some common interpretations of dx include: an integrator function in Riemann–Stieltjes integration (indicated by dα(x) in general), a measure in Lebesgue theory (indicated by dμ in general), or a differential form in exterior calculus (indicated by dx^{i_1}\wedge\cdots\wedge dx^{i_k} in general). In the last case, even the letter d has an independent meaning, as the exterior derivative operator on differential forms.

Conversely, in advanced settings, it is not uncommon to leave out dx when only the simple Riemann integral is being used, or the exact type of integral is immaterial. For instance, one might write \int_a^b(c_1 f+c_2 g)=c_1\int_a^b f+c_2\int_a^b g to express the linearity of the integral, a property shared by the Riemann integral and all generalizations thereof.

Variants

In modern Arabic mathematical notation, a reflected integral symbol is used instead of the symbol ∫, since Arabic script and mathematical expressions go right to left. Some authors, particularly of European origin, use an upright "d" to indicate the variable of integration (i.e., writing dx with an upright d rather than an italic one), since properly speaking, "d" is not a variable. Also, the symbol dx is not always placed after f(x), as for instance in
\int_0^1 \frac{3\,dx}{x^2+1} \qquad\text{or}\qquad \int_0^1 dx\int_0^1 dy\; e^{-(x^2+y^2)}.
In the first expression, the differential is treated as an infinitesimal "multiplicative" factor, formally following a "commutative property" when "multiplied" by the expression 3/(x² + 1). In the second expression, showing the differentials first highlights and clarifies the variables that are being integrated with respect to, a practice particularly popular with physicists.

Interpretations of the integral

Integrals appear in many practical situations. If a swimming pool is rectangular with a flat bottom, then from its length, width, and depth we can easily determine the volume of water it can contain (to fill it), the area of its surface (to cover it), and the length of its edge (to rope it). But if it is oval with a rounded bottom, all of these quantities call for integrals. Practical approximations may suffice for such trivial examples, but precision engineering (of any discipline) requires exact and rigorous values for these elements.
Figure: approximations to the integral of √x from 0 to 1, with 5 right-endpoint partitions (yellow) and 12 left-endpoint partitions (green).

To start off, consider the curve y = f(x) between x = 0 and x = 1, with f(x) = √x (see figure). We ask: what is the area under the function f, in the interval from 0 to 1?

We call this (yet unknown) area the (definite) integral of f. The notation for this integral will be
\int_0^1 \sqrt{x}\,dx.
As a first approximation, look at the unit square given by the sides x = 0 to x = 1 and y = f(0) = 0 and y = f(1) = 1. Its area is exactly 1. As it is, the true value of the integral must be somewhat less than 1. Decreasing the width of the approximation rectangles and increasing the number of rectangles gives a better result; so cross the interval in five steps, using the approximation points 0, 1/5, 2/5, and so on to 1. Fit a box for each step using the right end height of each curve piece, thus √(1/5), √(2/5), and so on to √1 = 1. Summing the areas of these rectangles, we get a better approximation for the sought integral, namely
\sqrt{\tfrac{1}{5}}\left(\tfrac{1}{5}-0\right)+\sqrt{\tfrac{2}{5}}\left(\tfrac{2}{5}-\tfrac{1}{5}\right)+\cdots+\sqrt{\tfrac{5}{5}}\left(\tfrac{5}{5}-\tfrac{4}{5}\right)\approx 0.7497.
We are taking a sum of finitely many function values of f, multiplied by the differences of two subsequent approximation points. We can easily see that the approximation is still too large. Using more steps produces a closer approximation, but it will always be too high and will never be exact. Alternatively, replacing these subintervals with ones using the left end height of each piece, we get an approximation that is too low: for example, with twelve such subintervals the approximate value for the area is 0.6203.

The key idea is the transition from adding finitely many differences of approximation points multiplied by their respective function values to using infinitely many fine, or infinitesimal steps. When this transition is completed in the above example, it turns out that the area under the curve within the stated bounds is 2/3.
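The two endpoint sums above are easy to reproduce; a minimal sketch (Python is an assumption, as the article contains no code):

    # Right- and left-endpoint Riemann sums for f(x) = sqrt(x) on [0, 1].
    from math import sqrt

    def right_sum(n):
        return sum(sqrt(i / n) / n for i in range(1, n + 1))

    def left_sum(n):
        return sum(sqrt(i / n) / n for i in range(n))

    print(right_sum(5))      # ~0.7497, the overestimate quoted above
    print(left_sum(12))      # ~0.6203, the underestimate quoted above
    print(right_sum(10**6))  # ~0.66667, approaching the exact value 2/3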

The notation
\int f(x)\,dx
conceives the integral as a weighted sum, denoted by the elongated s, of function values, f(x), multiplied by infinitesimal step widths, the so-called differentials, denoted by dx.

Historically, after the failure of early efforts to rigorously interpret infinitesimals, Riemann formally defined integrals as a limit of weighted sums, so that the dx suggested the limit of a difference (namely, the interval width). Shortcomings of Riemann's dependence on intervals and continuity motivated newer definitions, especially the Lebesgue integral, which is founded on an ability to extend the idea of "measure" in much more flexible ways. Thus the notation
\int_A f(x)\,d\mu
refers to a weighted sum in which the function values are partitioned, with μ measuring the weight to be assigned to each value. Here A denotes the region of integration.

Darboux sums
Figures: Darboux upper sums and Darboux lower sums of the function y = x².

Formal definitions

Figures: a Riemann sum with an irregular partition (largest sub-interval marked in red), and Riemann sums converging.

There are many ways of formally defining an integral, not all of which are equivalent. The differences exist mostly to deal with differing special cases which may not be integrable under other definitions, but also occasionally for pedagogical reasons. The most commonly used definitions of integral are Riemann integrals and Lebesgue integrals.

Riemann integral

The Riemann integral is defined in terms of Riemann sums of functions with respect to tagged partitions of an interval. Let [a, b] be a closed interval of the real line; then a tagged partition of [a, b] is a finite sequence
a=x_{0}\leq t_{1}\leq x_{1}\leq t_{2}\leq x_{2}\leq \cdots \leq x_{n-1}\leq t_{n}\leq x_{n}=b.
This partitions the interval [a, b] into n sub-intervals [xi−1, xi] indexed by i, each of which is "tagged" with a distinguished point ti ∈ [xi−1, xi]. A Riemann sum of a function f with respect to such a tagged partition is defined as
\sum _{i=1}^{n}f(t_{i})\Delta _{i};
thus each term of the sum is the area of a rectangle with height equal to the function value at the distinguished point of the given sub-interval, and width the same as the sub-interval width. Let Δi = xi − xi−1 be the width of sub-interval i; then the mesh of such a tagged partition is the width of the largest sub-interval formed by the partition, max_{i=1...n} Δi. The Riemann integral of a function f over the interval [a, b] is equal to S if:

For all ε > 0 there exists δ > 0 such that, for any tagged partition of [a, b] with mesh less than δ, we have


\left|S-\sum _{i=1}^{n}f(t_{i})\Delta _{i}\right|<\varepsilon .
When the chosen tags give the maximum (respectively, minimum) value of each interval, the Riemann sum becomes an upper (respectively, lower) Darboux sum, suggesting the close connection between the Riemann integral and the Darboux integral.
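A small sketch of this definition (Python is an assumption; the partition and tags are arbitrary illustrative choices):

    # A Riemann sum over a tagged partition a = x_0 <= ... <= x_n = b,
    # with tags t_i in [x_{i-1}, x_i]; the mesh is the widest sub-interval.
    def riemann_sum(f, points, tags):
        return sum(f(t) * (b - a) for a, b, t in zip(points, points[1:], tags))

    def mesh(points):
        return max(b - a for a, b in zip(points, points[1:]))

    pts = [0.0, 0.5, 1.2, 1.5, 2.0]                    # irregular partition of [0, 2]
    tgs = [(a + b) / 2 for a, b in zip(pts, pts[1:])]  # midpoint tags
    print(riemann_sum(lambda x: x * x, pts, tgs))      # ~2.615, near the exact 8/3
    print(mesh(pts))                                   # 0.7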

Lebesgue integral

Figure: Riemann–Darboux integration (top) and Lebesgue integration (bottom).

It is often of interest, both in theory and applications, to be able to pass to the limit under the integral. For instance, a sequence of functions can frequently be constructed that approximate, in a suitable sense, the solution to a problem. Then the integral of the solution function should be the limit of the integrals of the approximations. However, many functions that can be obtained as limits are not Riemann-integrable, and so such limit theorems do not hold with the Riemann integral. Therefore, it is of great importance to have a definition of the integral that allows a wider class of functions to be integrated.

Such an integral is the Lebesgue integral, which exploits the following fact to enlarge the class of integrable functions: if the values of a function are rearranged over the domain, the integral of the function should remain the same. Thus Henri Lebesgue introduced the integral bearing his name, explaining it thus in a letter to Paul Montel:
I have to pay a certain sum, which I have collected in my pocket. I take the bills and coins out of my pocket and give them to the creditor in the order I find them until I have reached the total sum. This is the Riemann integral. But I can proceed differently. After I have taken all the money out of my pocket I order the bills and coins according to identical values and then I pay the several heaps one after the other to the creditor. This is my integral.
As Folland (1984, p. 56) puts it, "To compute the Riemann integral of f, one partitions the domain [a, b] into subintervals", while in the Lebesgue integral, "one is in effect partitioning the range of f ". The definition of the Lebesgue integral thus begins with a measure, μ. In the simplest case, the Lebesgue measure μ(A) of an interval A = [a, b] is its width, b − a, so that the Lebesgue integral agrees with the (proper) Riemann integral when both exist. In more complicated cases, the sets being measured can be highly fragmented, with no continuity and no resemblance to intervals.

Using the "partitioning the range of f " philosophy, the integral of a non-negative function f : RR should be the sum over t of the areas between a thin horizontal strip between y = t and y = t + dt. This area is just μ{ x : f(x) > t} dt. Let f(t) = μ{ x : f(x) > t}. The Lebesgue integral of f is then defined by
\int f=\int _{0}^{\infty }f^{*}(t)\,dt
where the integral on the right is an ordinary improper Riemann integral (f is a strictly decreasing positive function, and therefore has a well-defined improper Riemann integral). For a suitable class of functions (the measurable functions) this defines the Lebesgue integral.
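A rough numeric sketch of this "layer cake" construction (Python with numpy assumed; the grids are coarse and purely illustrative):

    # Approximate the Lebesgue integral of f(x) = sqrt(x) on [0, 1] by
    # integrating t -> mu{x : f(x) > t}, with the measure of each
    # superlevel set estimated by counting points on a fine grid.
    import numpy as np

    xs = np.linspace(0, 1, 20001)
    dx = xs[1] - xs[0]
    fx = np.sqrt(xs)

    ts = np.linspace(0, 1, 1001)
    dt = ts[1] - ts[0]
    measure = np.array([(fx > t).sum() * dx for t in ts])  # ~mu{f > t} = 1 - t**2
    print(measure.sum() * dt)  # ~2/3, the same value as the Riemann computation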

A general measurable function f is Lebesgue-integrable if the sum of the absolute values of the areas of the regions between the graph of f and the x-axis is finite:
\int _{E}|f|\,d\mu <+\infty .
In that case, the integral is, as in the Riemannian case, the difference between the area above the x-axis and the area below the x-axis:
\int _{E}f\,d\mu =\int _{E}f^{+}\,d\mu -\int _{E}f^{-}\,d\mu
where
f^{+}(x)=\max\{f(x),0\}=\begin{cases}f(x),&\text{if }f(x)>0,\\0,&\text{otherwise,}\end{cases}
f^{-}(x)=\max\{-f(x),0\}=\begin{cases}-f(x),&\text{if }f(x)<0,\\0,&\text{otherwise.}\end{cases}
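A quick numeric check of this decomposition (Python with scipy assumed; the sample function is an arbitrary sign-changing choice):

    # Verify integral(f) = integral(f+) - integral(f-) for f = sin on [0, 4.7].
    import numpy as np
    from scipy.integrate import quad

    f = np.sin
    f_plus = lambda x: max(f(x), 0.0)    # f+ from the display above
    f_minus = lambda x: max(-f(x), 0.0)  # f- from the display above

    a, b = 0.0, 4.7
    lhs = quad(f, a, b)[0]
    rhs = quad(f_plus, a, b)[0] - quad(f_minus, a, b)[0]
    print(lhs, rhs)  # the two values agree to quadrature accuracy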

Other integrals

Although the Riemann and Lebesgue integrals are the most widely used definitions of the integral, a number of others exist, including the Darboux integral, the Riemann–Stieltjes integral, the Lebesgue–Stieltjes integral, the Daniell integral, the Henstock–Kurzweil integral, and the Itô integral.

Properties

Linearity

The collection of Riemann-integrable functions on a closed interval [a, b] forms a vector space under the operations of pointwise addition and multiplication by a scalar, and the operation of integration
f\mapsto \int _{a}^{b}f(x)\;dx
is a linear functional on this vector space. Thus, firstly, the collection of integrable functions is closed under taking linear combinations; and, secondly, the integral of a linear combination is the linear combination of the integrals,
\int_a^b(\alpha f+\beta g)(x)\,dx=\alpha\int_a^b f(x)\,dx+\beta\int_a^b g(x)\,dx.
Similarly, the set of real-valued Lebesgue-integrable functions on a given measure space E with measure μ is closed under taking linear combinations and hence forms a vector space, and the Lebesgue integral
f\mapsto \int _{E}f\,d\mu
is a linear functional on this vector space, so that
\int _{E}(\alpha f+\beta g)\,d\mu =\alpha \int _{E}f\,d\mu +\beta \int _{E}g\,d\mu .
More generally, consider the vector space of all measurable functions on a measure space (E, μ), taking values in a locally compact complete topological vector space V over a locally compact topological field K, f : E → V. Then one may define an abstract integration map assigning to each function f an element of V or the symbol ∞,
f\mapsto \int _{E}f\,d\mu ,\,
that is compatible with linear combinations. In this situation, the linearity holds for the subspace of functions whose integral is an element of V (i.e. "finite"). The most important special cases arise when K is R, C, or a finite extension of the field Qp of p-adic numbers, and V is a finite-dimensional vector space over K, and when K = C and V is a complex Hilbert space.

Linearity, together with some natural continuity properties and normalisation for a certain class of "simple" functions, may be used to give an alternative definition of the integral. This is the approach of Daniell for the case of real-valued functions on a set X, generalized by Nicolas Bourbaki to functions with values in a locally compact topological vector space. See (Hildebrandt 1953) for an axiomatic characterisation of the integral.

Inequalities

A number of general inequalities hold for Riemann-integrable functions defined on a closed and bounded interval [a, b] and can be generalized to other notions of integral (Lebesgue and Daniell).
  • Upper and lower bounds. An integrable function f on [a, b] is necessarily bounded on that interval. Thus there are real numbers m and M so that m ≤ f(x) ≤ M for all x in [a, b]. Since the lower and upper sums of f over [a, b] are therefore bounded by, respectively, m(b − a) and M(b − a), it follows that
m(b-a)\leq \int _{a}^{b}f(x)\,dx\leq M(b-a).
  • Inequalities between functions. If f(x) ≤ g(x) for each x in [a, b] then each of the upper and lower sums of f is bounded above by the upper and lower sums, respectively, of g. Thus
\int _{a}^{b}f(x)\,dx\leq \int _{a}^{b}g(x)\,dx.

This is a generalization of the above inequalities, as M(b − a) is the integral of the constant function with value M over [a, b].
In addition, if the inequality between functions is strict, then the inequality between integrals is also strict. That is, if f(x) < g(x) for each x in [a, b], then

\int _{a}^{b}f(x)\,dx<\int _{a}^{b}g(x)\,dx.
  • Subintervals. If [c, d] is a subinterval of [a, b] and f(x) is non-negative for all x, then
\int _{c}^{d}f(x)\,dx\leq \int _{a}^{b}f(x)\,dx.
  • Products and absolute values of functions. If f and g are two functions, then we may consider their pointwise products and powers, and absolute values:
(fg)(x)=f(x)g(x),\;f^{2}(x)=(f(x))^{2},\;|f|(x)=|f(x)|.

If f is Riemann-integrable on [a, b] then the same is true for |f|, and

\left|\int _{a}^{b}f(x)\,dx\right|\leq \int _{a}^{b}|f(x)|\,dx.

Moreover, if f and g are both Riemann-integrable then fg is also Riemann-integrable, and

\left(\int _{a}^{b}(fg)(x)\,dx\right)^{2}\leq \left(\int _{a}^{b}f(x)^{2}\,dx\right)\left(\int _{a}^{b}g(x)^{2}\,dx\right).

This inequality, known as the Cauchy–Schwarz inequality, plays a prominent role in Hilbert space theory, where the left hand side is interpreted as the inner product of two square-integrable functions f and g on the interval [a, b].
  • Hölder's inequality. Suppose that p and q are two real numbers, 1 ≤ p, q ≤ ∞ with 1/p + 1/q = 1, and f and g are two Riemann-integrable functions. Then the functions |f|p and |g|q are also integrable and the following Hölder's inequality holds:
\left|\int f(x)g(x)\,dx\right|\leq \left(\int \left|f(x)\right|^{p}\,dx\right)^{1/p}\left(\int \left|g(x)\right|^{q}\,dx\right)^{1/q}.

For p = q = 2, Hölder's inequality becomes the Cauchy–Schwarz inequality.
  • Minkowski inequality. Suppose that p ≥ 1 is a real number and f and g are Riemann-integrable functions. Then | f |p, | g |p and | f + g |p are also Riemann-integrable and the following Minkowski inequality holds:
\left(\int \left|f(x)+g(x)\right|^{p}\,dx\right)^{1/p}\leq \left(\int \left|f(x)\right|^{p}\,dx\right)^{1/p}+\left(\int \left|g(x)\right|^{p}\,dx\right)^{1/p}.
An analogue of this inequality for the Lebesgue integral is used in the construction of Lp spaces.
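A numeric spot-check of the Cauchy–Schwarz (p = q = 2) and Minkowski inequalities, as a sketch (Python with scipy assumed; f and g are arbitrary sample functions):

    import numpy as np
    from scipy.integrate import quad

    f = lambda x: np.sqrt(x)
    g = lambda x: 1.0 - x

    # Cauchy-Schwarz on [0, 1]
    lhs = abs(quad(lambda x: f(x) * g(x), 0, 1)[0])
    rhs = np.sqrt(quad(lambda x: f(x)**2, 0, 1)[0] * quad(lambda x: g(x)**2, 0, 1)[0])
    print(lhs <= rhs)  # True

    # Minkowski with p = 3
    p = 3.0
    norm = lambda h: quad(lambda x: abs(h(x))**p, 0, 1)[0] ** (1 / p)
    print(norm(lambda x: f(x) + g(x)) <= norm(f) + norm(g))  # True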

Conventions

In this section, f is a real-valued Riemann-integrable function. The integral
\int _{a}^{b}f(x)\,dx
over an interval [a, b] is defined if a < b. This means that the upper and lower sums of the function f are evaluated on a partition a = x0 ≤ x1 ≤ . . . ≤ xn = b whose values xi are increasing. Geometrically, this signifies that integration takes place "left to right", evaluating f within intervals [xi, xi+1] where an interval with a higher index lies to the right of one with a lower index. The values a and b, the end-points of the interval, are called the limits of integration of f. Integrals can also be defined if a > b:
  • Reversing limits of integration. If a > b then define
\int _{a}^{b}f(x)\,dx=-\int _{b}^{a}f(x)\,dx.
This, with a = b, implies:
  • Integrals over intervals of length zero. If a is a real number then
\int _{a}^{a}f(x)\,dx=0.
The first convention is necessary in consideration of taking integrals over subintervals of [a, b]; the second says that an integral taken over a degenerate interval, or a point, should be zero. One reason for the first convention is that the integrability of f on an interval [a, b] implies that f is integrable on any subinterval [c, d], but in particular integrals have the property that:
  • Additivity of integration on intervals. If c is any element of [a, b], then
\int _{a}^{b}f(x)\,dx=\int _{a}^{c}f(x)\,dx+\int _{c}^{b}f(x)\,dx.
With the first convention, the resulting relation
\begin{aligned}\int_a^c f(x)\,dx&=\int_a^b f(x)\,dx-\int_c^b f(x)\,dx\\&=\int_a^b f(x)\,dx+\int_b^c f(x)\,dx\end{aligned}
is then well-defined for any cyclic permutation of a, b, and c.

Fundamental theorem of calculus

The fundamental theorem of calculus is the statement that differentiation and integration are inverse operations: if a continuous function is first integrated and then differentiated, the original function is retrieved. An important consequence, sometimes called the second fundamental theorem of calculus, allows one to compute integrals by using an antiderivative of the function to be integrated.

Statements of theorems

Fundamental theorem of calculus

Let f be a continuous real-valued function defined on a closed interval [a, b]. Let F be the function defined, for all x in [a, b], by
F(x)=\int _{a}^{x}f(t)\,dt.
Then, F is continuous on [a, b], differentiable on the open interval (a, b), and
F'(x)=f(x)
for all x in (a, b).

Second fundamental theorem of calculus

Let f be a real-valued function defined on a closed interval [a, b] that admits an antiderivative F on [a, b]. That is, f and F are functions such that for all x in [a, b],
f(x)=F'(x).
If f is integrable on [a, b] then
\int _{a}^{b}f(x)\,dx=F(b)-F(a).

Calculating integrals

The second fundamental theorem allows many integrals to be calculated explicitly. For example, to calculate the integral
\int _{0}^{1}x^{1/2}\,dx,
of the square root function f(x) = x1/2 between 0 and 1, it is sufficient to find an antiderivative, that is, a function F(x) whose derivative equals f(x):
F'(x)=f(x).
One such function is F(x) = (2/3)x^{3/2}. Then the value of the integral in question is
\int _{0}^{1}x^{1/2}\,dx=F(1)-F(0)={\frac {2}{3}}(1)^{3/2}-{\frac {2}{3}}(0)^{3/2}={\frac {2}{3}}.
This is a case of a general rule: for f(x) = x^q, with q ≠ −1, an antiderivative is F(x) = x^{q+1}/(q + 1). Tables of this and similar antiderivatives can be used to calculate integrals explicitly, in much the same way that tables of derivatives are used.
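A symbolic check of this example and of the general power rule, as a sketch (Python with sympy assumed):

    import sympy as sp

    x = sp.symbols('x', positive=True)
    print(sp.integrate(sp.sqrt(x), x))          # 2*x**(3/2)/3, the antiderivative above
    print(sp.integrate(sp.sqrt(x), (x, 0, 1)))  # 2/3

    q = sp.Rational(5, 2)                       # any q != -1 works the same way
    F = x**(q + 1) / (q + 1)
    print(sp.simplify(sp.diff(F, x) - x**q))    # 0, confirming F' = x**q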

Extensions

Improper integrals

The improper integral
\int _{0}^{\infty }{\frac {dx}{(x+1){\sqrt {x}}}}=\pi
has unbounded intervals for both domain and range.

A "proper" Riemann integral assumes the integrand is defined and finite on a closed and bounded interval, bracketed by the limits of integration. An improper integral occurs when one or more of these conditions is not satisfied. In some cases such integrals may be defined by considering the limit of a sequence of proper Riemann integrals on progressively larger intervals.

If the interval is unbounded, for instance at its upper end, then the improper integral is the limit as that endpoint goes to infinity.
\int _{a}^{\infty }f(x)\,dx=\lim _{b\to \infty }\int _{a}^{b}f(x)\,dx
If the integrand is only defined or finite on a half-open interval, for instance (a, b], then again a limit may provide a finite result.
\int _{a}^{b}f(x)\,dx=\lim _{\epsilon \to 0}\int _{a+\epsilon }^{b}f(x)\,dx
That is, the improper integral is the limit of proper integrals as one endpoint of the interval of integration approaches either a specified real number, or ∞, or −∞. In more complicated cases, limits are required at both endpoints, or at interior points.
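A sketch of this limiting process for the integral displayed at the start of this subsection (Python with scipy assumed; the singular piece near x = 0 is integrated separately for robustness):

    import numpy as np
    from scipy.integrate import quad

    f = lambda x: 1.0 / ((x + 1.0) * np.sqrt(x))

    head = quad(f, 0, 1)[0]  # integrable singularity at x = 0
    for b in [10, 1000, 100000]:
        print(b, head + quad(f, 1, b)[0])  # increases toward pi
    print(np.pi)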

Multiple integration

Double integral as volume under a surface

Just as the definite integral of a positive function of one variable represents the area of the region between the graph of the function and the x-axis, the double integral of a positive function of two variables represents the volume of the region between the surface defined by the function and the plane that contains its domain. For example, a function in two dimensions depends on two real variables, x and y, and the integral of a function f over the rectangle R given as the Cartesian product of two intervals R=[a,b]\times [c,d] can be written
\int _{R}f(x,y)\,dA
where the differential dA indicates that integration is taken with respect to area. This double integral can be defined using Riemann sums, and represents the (signed) volume under the graph of z = f(x,y) over the domain R. Under suitable conditions (e.g., if f is continuous), then Fubini's theorem guarantees that this integral can be expressed as an equivalent iterated integral
\int _{a}^{b}\left[\int _{c}^{d}f(x,y)\,dy\right]\,dx.
This reduces the problem of computing a double integral to computing one-dimensional integrals. Because of this, another notation for the integral over R uses a double integral sign:
\iint _{R}f(x,y)dA.
Integration over more general domains is possible. The integral of a function f, with respect to volume, over a subset D of ℝⁿ is denoted by notation such as
\int _{D}f(\mathbf {x} )d^{n}\mathbf {x} ,\quad \int _{D}f\,dV
or similar.
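A minimal sketch of Fubini's reduction to iterated one-dimensional integrals (Python with scipy assumed; the integrand and rectangle are illustrative):

    # Double integral of f(x, y) = x * y**2 over R = [0, 1] x [0, 2],
    # computed as an iterated integral: inner in y, outer in x.
    from scipy.integrate import quad

    f = lambda x, y: x * y**2
    inner = lambda x: quad(lambda y: f(x, y), 0, 2)[0]
    outer = quad(inner, 0, 1)[0]
    print(outer)  # 4/3, since (1/2) * (8/3) = 4/3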

Line integrals

A line integral sums together elements along a curve.

The concept of an integral can be extended to more general domains of integration, such as curved lines and surfaces. Such integrals are known as line integrals and surface integrals respectively. These have important applications in physics, as when dealing with vector fields.

A line integral (sometimes called a path integral) is an integral where the function to be integrated is evaluated along a curve. Various different line integrals are in use. In the case of a closed curve it is also called a contour integral.

The function to be integrated may be a scalar field or a vector field. The value of the line integral is the sum of values of the field at all points on the curve, weighted by some scalar function on the curve (commonly arc length or, for a vector field, the scalar product of the vector field with a differential vector in the curve). This weighting distinguishes the line integral from simpler integrals defined on intervals. Many simple formulas in physics have natural continuous analogs in terms of line integrals; for example, the fact that work is equal to force, F, multiplied by displacement, s, may be expressed (in terms of vector quantities) as:
W=\mathbf {F} \cdot \mathbf {s} .
For an object moving along a path C in a vector field F such as an electric field or gravitational field, the total work done by the field on the object is obtained by summing up the differential work done in moving from s to s + ds. This gives the line integral
W=\int _{C}\mathbf {F} \cdot d\mathbf {s} .
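A sketch of this line integral for a concrete field and path (Python with numpy and scipy assumed; both choices are illustrative):

    # W = integral over C of F . ds, with C parametrized by r(t), t in [0, 1]:
    # W = integral of F(r(t)) . r'(t) dt.
    import numpy as np
    from scipy.integrate import quad

    F = lambda p: np.array([p[1], p[0]])     # F(x, y) = (y, x), a gradient field
    r = lambda t: np.array([t, t**2])        # path along y = x**2
    dr = lambda t: np.array([1.0, 2.0 * t])  # r'(t)

    W = quad(lambda t: F(r(t)) @ dr(t), 0, 1)[0]
    print(W)  # 1.0: F = grad(x*y), so W depends only on the endpoints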

Surface integrals

The definition of surface integral relies on splitting the surface into small surface elements.

A surface integral is a definite integral taken over a surface (which may be a curved set in space); it can be thought of as the double integral analog of the line integral. The function to be integrated may be a scalar field or a vector field. The value of the surface integral is the sum of the field at all points on the surface. This can be achieved by splitting the surface into surface elements, which provide the partitioning for Riemann sums.

For an example of applications of surface integrals, consider a vector field v on a surface S; that is, for each point x in S, v(x) is a vector. Imagine that we have a fluid flowing through S, such that v(x) determines the velocity of the fluid at x. The flux is defined as the quantity of fluid flowing through S in unit amount of time. To find the flux, we need to take the dot product of v with the unit surface normal to S at each point, which will give us a scalar field, which we integrate over the surface:
\int _{S}{\mathbf {v} }\cdot \,d{\mathbf {S} }.
The fluid flux in this example may be from a physical fluid such as water or air, or from electrical or magnetic flux. Thus surface integrals have applications in physics, particularly with the classical theory of electromagnetism.
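A sketch of such a flux integral (Python with numpy and scipy assumed): the surface is the upper unit hemisphere, parametrized by spherical angles, and v = (0, 0, 1), so the flux should equal the area of the unit disk, π.

    import numpy as np
    from scipy.integrate import dblquad

    def integrand(w, u):  # u = polar angle, w = azimuth; dblquad passes (inner, outer)
        r_u = np.array([np.cos(u) * np.cos(w), np.cos(u) * np.sin(w), -np.sin(u)])
        r_w = np.array([-np.sin(u) * np.sin(w), np.sin(u) * np.cos(w), 0.0])
        normal = np.cross(r_u, r_w)    # outward normal element r_u x r_w
        v = np.array([0.0, 0.0, 1.0])  # constant vertical field
        return v @ normal

    flux, _ = dblquad(integrand, 0, np.pi / 2, lambda u: 0, lambda u: 2 * np.pi)
    print(flux, np.pi)  # ~3.14159 in both cases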

Contour integrals

In complex analysis, the integrand is a complex-valued function of a complex variable z instead of a real function of a real variable x. When a complex function is integrated along a curve \gamma in the complex plane, the integral is denoted as follows
\int _{\gamma }f(z)\,dz.
This is known as a contour integral.

Integrals of differential forms

A differential form is a mathematical concept in the fields of multivariable calculus, differential topology, and tensors. Differential forms are organized by degree. For example, a one-form is a weighted sum of the differentials of the coordinates, such as:
E(x,y,z)\,dx+F(x,y,z)\,dy+G(x,y,z)\,dz
where E, F, G are functions in three dimensions. A differential one-form can be integrated over an oriented path, and the resulting integral is just another way of writing a line integral. Here the basic differentials dx, dy, dz measure infinitesimal oriented lengths parallel to the three coordinate axes.
A differential two-form is a sum of the form
G(x,y,z)dx\wedge dy+E(x,y,z)dy\wedge dz+F(x,y,z)dz\wedge dx.
Here the basic two-forms dx\wedge dy,dz\wedge dx,dy\wedge dz measure oriented areas parallel to the coordinate two-planes. The symbol \wedge denotes the wedge product, which is similar to the cross product in the sense that the wedge product of two forms representing oriented lengths represents an oriented area. A two-form can be integrated over an oriented surface, and the resulting integral is equivalent to the surface integral giving the flux of E\mathbf {i} +F\mathbf {j} +G\mathbf {k} .

Unlike the cross product and three-dimensional vector calculus, the wedge product and the calculus of differential forms make sense in arbitrary dimension and on more general manifolds (curves, surfaces, and their higher-dimensional analogs). The exterior derivative plays the role of the gradient and curl of vector calculus, and Stokes' theorem simultaneously generalizes the three theorems of vector calculus: the divergence theorem, Green's theorem, and the Kelvin–Stokes theorem.

Summations

The discrete equivalent of integration is summation. Summations and integrals can be put on the same foundations using the theory of Lebesgue integrals or time scale calculus.

Computation

Analytical

The most basic technique for computing definite integrals of one real variable is based on the fundamental theorem of calculus. Let f(x) be the function of x to be integrated over a given interval [a, b]. Then, find an antiderivative of f; that is, a function F such that F′ = f on the interval. Provided the integrand and integral have no singularities on the path of integration, by the fundamental theorem of calculus,
\int_a^b f(x)\,dx=F(b)-F(a).
The integral is not actually the antiderivative, but the fundamental theorem provides a way to use antiderivatives to evaluate definite integrals.

The most difficult step is usually to find the antiderivative of f. It is rarely possible to glance at a function and write down its antiderivative. More often, it is necessary to use one of the many techniques that have been developed to evaluate integrals. Most of these techniques rewrite one integral as a different one which is hopefully more tractable. Techniques include integration by substitution, integration by parts, trigonometric substitution, and integration by partial fractions.
Alternative methods exist to compute more complex integrals. Many nonelementary integrals can be expanded in a Taylor series and integrated term by term. Occasionally, the resulting infinite series can be summed analytically. The method of convolution using Meijer G-functions can also be used, assuming that the integrand can be written as a product of Meijer G-functions. There are also many less common ways of calculating definite integrals; for instance, Parseval's identity can be used to transform an integral over a rectangular region into an infinite sum. Occasionally, an integral can be evaluated by a trick; for an example of this, see Gaussian integral.

Computations of volumes of solids of revolution can usually be done with disk integration or shell integration.

Specific results which have been worked out by various techniques are collected in the list of integrals.

Symbolic

Many problems in mathematics, physics, and engineering involve integration where an explicit formula for the integral is desired. Extensive tables of integrals have been compiled and published over the years for this purpose. With the spread of computers, many professionals, educators, and students have turned to computer algebra systems that are specifically designed to perform difficult or tedious tasks, including integration. Symbolic integration has been one of the motivations for the development of the first such systems, like Macsyma.

A major mathematical difficulty in symbolic integration is that in many cases, a closed formula for the antiderivative of a rather simple-looking function does not exist. For instance, it is known that the antiderivatives of the functions exp(x²), x^x and (sin x)/x cannot be expressed in closed form involving only rational and exponential functions, logarithm, trigonometric functions and inverse trigonometric functions, and the operations of multiplication and composition; in other words, none of the three given functions is integrable in elementary functions, which are the functions that may be built from rational functions, roots of a polynomial, logarithm, and exponential functions. The Risch algorithm provides a general criterion to determine whether the antiderivative of an elementary function is elementary, and, if it is, to compute it. Unfortunately, it turns out that functions with closed expressions for their antiderivatives are the exception rather than the rule. Consequently, computer algebra systems have no hope of being able to find an antiderivative for a randomly constructed elementary function. On the positive side, if the 'building blocks' for antiderivatives are fixed in advance, it may still be possible to decide whether the antiderivative of a given function can be expressed using these blocks and the operations of multiplication and composition, and to find the symbolic answer whenever it exists. The Risch algorithm, implemented in Mathematica and other computer algebra systems, does just that for functions and antiderivatives built from rational functions, radicals, logarithm, and exponential functions.

Some special integrands occur often enough to warrant special study. In particular, it may be useful to have, in the set of antiderivatives, the special functions (like the Legendre functions, the hypergeometric function, the gamma function, the incomplete gamma function and so on; see Symbolic integration for more details). Extending the Risch algorithm to include such functions is possible but challenging and has been an active research subject.

More recently a new approach has emerged, using D-finite functions, which are the solutions of linear differential equations with polynomial coefficients. Most of the elementary and special functions are D-finite, and the integral of a D-finite function is also a D-finite function. This provides an algorithm to express the antiderivative of a D-finite function as the solution of a differential equation.

This theory also allows one to compute the definite integral of a D-finite function as the sum of a series given by the first coefficients, and provides an algorithm to compute any coefficient.

Numerical

Some integrals found in real applications can be computed by closed-form antiderivatives. Others are not so accommodating. Some antiderivatives do not have closed forms, some closed forms require special functions that themselves are a challenge to compute, and others are so complex that finding the exact answer is too slow. This motivates the study and application of numerical approximations of integrals. This subject, called numerical integration or numerical quadrature, arose early in the study of integration for the purpose of making hand calculations. The development of general-purpose computers made numerical integration more practical and drove a desire for improvements. The goals of numerical integration are accuracy, reliability, efficiency, and generality, and sophisticated modern methods can vastly outperform a naive method by all four measures.

Consider, for example, the integral
\int_{-2}^{2}\frac{1}{5}\left(\frac{1}{100}\bigl(322+3x(98+x(37+x))\bigr)-\frac{24x}{1+x^{2}}\right)dx
which has the exact answer 94/25 = 3.76. (In ordinary practice, the answer is not known in advance, so an important task — not explored here — is to decide when an approximation is good enough.) A “calculus book” approach divides the integration range into, say, 16 equal pieces, and computes function values.

Spaced function values

x      −2.00     −1.50     −1.00     −0.50      0.00      0.50      1.00      1.50      2.00
f(x)    2.22800   2.45663   2.67200   2.32475   0.64400  −0.92575  −0.94000  −0.16963   0.83600

x      −1.75     −1.25     −0.75     −0.25      0.25      0.75      1.25      1.75
f(x)    2.33041   2.58562   2.62934   1.64019  −0.32444  −1.09159  −0.60387   0.31734

Figure: numerical quadrature methods compared: rectangle, trapezoid, Romberg, Gauss.

Using the left end of each piece, the rectangle method sums 16 function values and multiplies by the step width, h, here 0.25, to get an approximate value of 3.94325 for the integral. The accuracy is not impressive, but calculus formally uses pieces of infinitesimal width, so initially this may seem little cause for concern. Indeed, repeatedly doubling the number of steps eventually produces an approximation of 3.76001. However, 2^18 pieces are required, a great computational expense for such little accuracy; and a reach for greater accuracy can force steps so small that arithmetic precision becomes an obstacle.

A better approach replaces the rectangles used in a Riemann sum with trapezoids. The trapezoid rule is almost as easy to calculate; it sums all 17 function values, but weights the first and last by one half, and again multiplies by the step width. This immediately improves the approximation to 3.76925, which is noticeably more accurate. Furthermore, only 2^10 pieces are needed to achieve 3.76000, substantially less computation than the rectangle method requires for comparable accuracy. The idea behind the trapezoid rule, that more accurate approximations to the function yield better approximations to the integral, can be carried further. Simpson's rule approximates the integrand by a piecewise quadratic function. Riemann sums, the trapezoid rule, and Simpson's rule are examples of a family of quadrature rules called Newton–Cotes formulas. The degree n Newton–Cotes quadrature rule approximates the function on each subinterval by a degree n polynomial, chosen to interpolate the values of the function on the interval. Higher degree Newton–Cotes approximations can be more accurate, but they require more function evaluations (Simpson's rule already requires twice the function evaluations of the trapezoid rule), and they can suffer from numerical inaccuracy due to Runge's phenomenon. One solution to this problem is Clenshaw–Curtis quadrature, in which the integrand is approximated by expanding it in terms of Chebyshev polynomials. This produces an approximation whose values never deviate far from those of the original function.
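A sketch reproducing the rectangle and trapezoid values quoted above (Python with numpy assumed):

    import numpy as np

    f = lambda x: ((322 + 3*x*(98 + x*(37 + x))) / 100 - 24*x / (1 + x*x)) / 5

    a, b, n = -2.0, 2.0, 16
    xs = np.linspace(a, b, n + 1)
    h = (b - a) / n

    rect = h * f(xs[:-1]).sum()                            # left-endpoint rule
    trap = h * (f(xs).sum() - (f(xs[0]) + f(xs[-1])) / 2)  # trapezoid rule
    print(rect)  # 3.94325
    print(trap)  # 3.76925 (exact answer: 94/25 = 3.76)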

Romberg's method builds on the trapezoid method to great effect. First, the step lengths are halved incrementally, giving trapezoid approximations denoted by T(h0), T(h1), and so on, where hk+1 is half of hk. For each new step size, only half the new function values need to be computed; the others carry over from the previous size (as shown in the table above). But the really powerful idea is to interpolate a polynomial through the approximations and extrapolate to T(0). With this method a numerically exact answer here requires only four pieces (five function values). The Lagrange polynomial interpolating {(hk, T(hk))} for k = 0...2, namely {(4.00, 6.128), (2.00, 4.352), (1.00, 3.908)}, is 3.76 + 0.148h², producing the extrapolated value 3.76 at h = 0.
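The extrapolation step can be sketched with the three trapezoid values quoted above (Python with numpy assumed); since T(h) here is exactly of the form a + bh², a line fit in h² recovers T(0):

    import numpy as np

    hs = np.array([4.0, 2.0, 1.0])        # step lengths
    Ts = np.array([6.128, 4.352, 3.908])  # trapezoid values from the text

    b_coef, a_coef = np.polyfit(hs**2, Ts, 1)  # fit T = a + b * h**2
    print(a_coef, b_coef)  # ~3.76 and ~0.148, so the extrapolated T(0) is 3.76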

Gaussian quadrature often requires noticeably less work for superior accuracy. In this example, it can compute the function values at just two x positions, ±2/√3, then double each value and sum to get the numerically exact answer. The explanation for this dramatic success lies in the choice of points. Unlike Newton–Cotes rules, which interpolate the integrand at evenly spaced points, Gaussian quadrature evaluates the function at the roots of a set of orthogonal polynomials. An n-point Gaussian method is exact for polynomials of degree up to 2n − 1. The function in this example is a degree 3 polynomial, plus a term that cancels because the chosen endpoints are symmetric around zero. (Cancellation also benefits the Romberg method.)
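A sketch of the two-point rule on [−2, 2] (plain Python; nodes ±2/√3 with weight 2 each, as described above):

    from math import sqrt

    f = lambda x: ((322 + 3*x*(98 + x*(37 + x))) / 100 - 24*x / (1 + x*x)) / 5

    node = 2 / sqrt(3)                 # two-point Gauss nodes scaled to [-2, 2]
    print(2 * f(node) + 2 * f(-node))  # 3.76, numerically exact here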

In practice, each method must use extra evaluations to ensure an error bound on an unknown function; this tends to offset some of the advantage of the pure Gaussian method, and motivates the popular Gauss–Kronrod quadrature formulae. More broadly, adaptive quadrature partitions a range into pieces based on function properties, so that data points are concentrated where they are needed most.

The computation of higher-dimensional integrals (for example, volume calculations) makes important use of such alternatives as Monte Carlo integration.

A calculus text is no substitute for numerical analysis, but the reverse is also true. Even the best adaptive numerical code sometimes requires a user to help with the more demanding integrals. For example, improper integrals may require a change of variable or methods that can avoid infinite function values, and known properties like symmetry and periodicity may provide critical leverage. For example, the integral \int_0^1 x^{-1/2}e^{-x}\,dx is difficult to evaluate numerically because the integrand is infinite at x = 0. However, the substitution u = √x transforms the integral into 2\int_0^1 e^{-u^2}\,du, which has no singularities at all.
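A sketch of why the substitution helps numerically (plain Python; the midpoint rule is used so the singular endpoint is never evaluated):

    from math import exp, sqrt

    def midpoint(f, a, b, n):
        h = (b - a) / n
        return h * sum(f(a + (i + 0.5) * h) for i in range(n))

    orig = lambda x: exp(-x) / sqrt(x)  # singular at x = 0
    trans = lambda u: 2 * exp(-u * u)   # smooth after u = sqrt(x)

    for n in [16, 256]:
        print(n, midpoint(orig, 0, 1, n), midpoint(trans, 0, 1, n))
    # The transformed sums settle almost immediately; the original converges slowly.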

Mechanical

The area of an arbitrary two-dimensional shape can be determined using a measuring instrument called a planimeter. The volume of irregular objects can be measured with precision by the fluid displaced as the object is submerged.

Geometrical

Area can sometimes be found via geometrical compass-and-straightedge constructions of an equivalent square.

Differential calculus

The graph of a function, drawn in black, and a tangent line to that function, drawn in red. The slope of the tangent line equals the derivative of the function at the marked point.

In mathematics, differential calculus is a subfield of calculus concerned with the study of the rates at which quantities change. It is one of the two traditional divisions of calculus, the other being integral calculus, the study of the area beneath a curve.

The primary objects of study in differential calculus are the derivative of a function, related notions such as the differential, and their applications. The derivative of a function at a chosen input value describes the rate of change of the function near that input value. The process of finding a derivative is called differentiation. Geometrically, the derivative at a point is the slope of the tangent line to the graph of the function at that point, provided that the derivative exists and is defined at that point. For a real-valued function of a single real variable, the derivative of a function at a point generally determines the best linear approximation to the function at that point.

Differential calculus and integral calculus are connected by the fundamental theorem of calculus, which states that differentiation is the reverse process to integration.

Differentiation has applications to nearly all quantitative disciplines. For example, in physics, the derivative of the displacement of a moving body with respect to time is the velocity of the body, and the derivative of velocity with respect to time is acceleration. The derivative of the momentum of a body equals the force applied to the body; rearranging this derivative statement leads to the famous F = ma equation associated with Newton's second law of motion. The reaction rate of a chemical reaction is a derivative. In operations research, derivatives determine the most efficient ways to transport materials and design factories.

Derivatives are frequently used to find the maxima and minima of a function. Equations involving derivatives are called differential equations and are fundamental in describing natural phenomena. Derivatives and their generalizations appear in many fields of mathematics, such as complex analysis, functional analysis, differential geometry, measure theory, and abstract algebra.

Derivative

Figures: the tangent line at (x, f(x)), and the derivative at different points of a differentiable function.

Suppose that x and y are real numbers and that y is a function of x, that is, for every value of x, there is a corresponding value of y. This relationship can be written as y = f(x). If f(x) is the equation for a straight line (called a linear equation), then there are two real numbers m and b such that y = mx + b. In this "slope-intercept form", the term m is called the slope and can be determined from the formula:
m={\frac {{\text{change in }}y}{{\text{change in }}x}}={\frac {\Delta y}{\Delta x}},
where the symbol Δ (the uppercase form of the Greek letter Delta) is an abbreviation for "change in". It follows that Δy = m Δx.

A general function is not a line, so it does not have a slope. Geometrically, the derivative of f at the point x = a is the slope of the tangent line to the function f at the point a (see figure). This is often denoted f ′(a) in Lagrange's notation or dy/dx|x = a in Leibniz's notation. Since the derivative is the slope of the linear approximation to f at the point a, the derivative (together with the value of f at a) determines the best linear approximation, or linearization, of f near the point a.

If every point a in the domain of f has a derivative, there is a function that sends every point a to the derivative of f at a. For example, if f(x) = x2, then the derivative function f ′(x) = dy/dx = 2x.
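A minimal numeric sketch of this example (plain Python): difference quotients of f(x) = x² at a = 3 approach f′(3) = 6 as the step shrinks.

    f = lambda x: x * x
    a = 3.0
    for h in [0.1, 0.01, 0.001]:
        print(h, (f(a + h) - f(a)) / h)  # 6.1, 6.01, 6.001 -> 6 as h -> 0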

A closely related notion is the differential of a function. When x and y are real variables, the derivative of f at x is the slope of the tangent line to the graph of f at x. Because the source and target of f are one-dimensional, the derivative of f is a real number. If x and y are vectors, then the best linear approximation to the graph of f depends on how f changes in several directions at once. Taking the best linear approximation in a single direction determines a partial derivative, which is usually denoted ∂y/∂x. The linearization of f in all directions at once is called the total derivative.

History of differentiation

The concept of a derivative in the sense of a tangent line is a very old one, familiar to Greek geometers such as Euclid (c. 300 BC), Archimedes (c. 287–212 BC) and Apollonius of Perga (c. 262–190 BC). Archimedes also introduced the use of infinitesimals, although these were primarily used to study areas and volumes rather than derivatives and tangents; see Archimedes' use of infinitesimals.

The use of infinitesimals to study rates of change can be found in Indian mathematics, perhaps as early as 500 AD, when the astronomer and mathematician Aryabhata (476–550) used infinitesimals to study the motion of the moon. The use of infinitesimals to compute rates of change was developed significantly by Bhāskara II (1114–1185); indeed, it has been argued that many of the key notions of differential calculus can be found in his work, such as "Rolle's theorem".

The Persian mathematician Sharaf al-Dīn al-Tūsī (1135–1213), in his Treatise on Equations, established conditions for some cubic equations to have solutions, by finding the maxima of appropriate cubic polynomials. He proved, for example, that the maximum of the cubic ax² − x³ occurs when x = 2a/3, and concluded therefrom that the equation ax² − x³ = c has exactly one positive solution when c = 4a³/27, and two positive solutions whenever 0 < c < 4a³/27. The historian of science Roshdi Rashed has argued that al-Tūsī must have used the derivative of the cubic to obtain this result. Rashed's conclusion has been contested by other scholars, however, who argue that he could have obtained the result by other methods which do not require the derivative of the function to be known.

The modern development of calculus is usually credited to Isaac Newton (1643–1727) and Gottfried Leibniz (1646–1716), who provided independent and unified approaches to differentiation and derivatives. The key insight, however, that earned them this credit, was the fundamental theorem of calculus relating differentiation and integration: this rendered obsolete most previous methods for computing areas and volumes, which had not been significantly extended since the time of Ibn al-Haytham (Alhazen). For their ideas on derivatives, both Newton and Leibniz built on significant earlier work by mathematicians such as Pierre de Fermat (1607-1665), Isaac Barrow (1630–1677), René Descartes (1596–1650), Christiaan Huygens (1629–1695), Blaise Pascal (1623–1662) and John Wallis (1616–1703). Regarding Fermat's influence, Newton once wrote in a letter that "I had the hint of this method [of fluxions] from Fermat's way of drawing tangents, and by applying it to abstract equations, directly and invertedly, I made it general." Isaac Barrow is generally given credit for the early development of the derivative. Nevertheless, Newton and Leibniz remain key figures in the history of differentiation, not least because Newton was the first to apply differentiation to theoretical physics, while Leibniz systematically developed much of the notation still used today.

Since the 17th century many mathematicians have contributed to the theory of differentiation. In the 19th century, calculus was put on a much more rigorous footing by mathematicians such as Augustin Louis Cauchy (1789–1857), Bernhard Riemann (1826–1866), and Karl Weierstrass (1815–1897). It was also during this period that differentiation was generalized to Euclidean space and the complex plane.

Applications of derivatives

Optimization

If f is a differentiable function on ℝ (or an open interval) and x is a local maximum or a local minimum of f, then the derivative of f at x is zero. Points where f'(x) = 0 are called critical points or stationary points (and the value of f at x is called a critical value). If f is not assumed to be everywhere differentiable, then points at which it fails to be differentiable are also designated critical points.

If f is twice differentiable, then conversely, a critical point x of f can be analysed by considering the second derivative of f at x:
  • if it is positive, x is a local minimum;
  • if it is negative, x is a local maximum;
  • if it is zero, then x could be a local minimum, a local maximum, or neither. (For example, f(x) = x³ has a critical point at x = 0, but it has neither a maximum nor a minimum there, whereas f(x) = ±x⁴ has a critical point at x = 0 and a minimum and a maximum, respectively, there.)
This is called the second derivative test. An alternative approach, called the first derivative test, involves considering the sign of f' on each side of the critical point.
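As a minimal sketch of the second derivative test in practice, the following Python code (using sympy; the function x³ − 3x is chosen only for illustration) locates the critical points and classifies each one by the sign of the second derivative:

    import sympy as sp

    x = sp.symbols('x')
    f = x**3 - 3*x                            # illustrative function; critical points at x = -1 and x = 1
    critical_points = sp.solve(sp.diff(f, x), x)
    for c in critical_points:
        second = sp.diff(f, x, 2).subs(x, c)  # second derivative at the critical point
        if second > 0:
            print(c, 'is a local minimum')
        elif second < 0:
            print(c, 'is a local maximum')
        else:
            print(c, 'is inconclusive under this test')

Running it reports a local maximum at x = −1 and a local minimum at x = 1.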

Taking derivatives and solving for critical points is therefore often a simple way to find local minima or maxima, which can be useful in optimization. By the extreme value theorem, a continuous function on a closed interval must attain its minimum and maximum values at least once. If the function is differentiable, the minima and maxima can only occur at critical points or endpoints.

This also has applications in graph sketching: once the local minima and maxima of a differentiable function have been found, a rough plot of the graph can be obtained from the observation that it will be either increasing or decreasing between critical points.

In higher dimensions, a critical point of a scalar valued function is a point at which the gradient is zero. The second derivative test can still be used to analyse critical points by considering the eigenvalues of the Hessian matrix of second partial derivatives of the function at the critical point. If all of the eigenvalues are positive, then the point is a local minimum; if all are negative, it is a local maximum. If there are some positive and some negative eigenvalues, then the critical point is called a "saddle point", and if none of these cases hold (i.e., some of the eigenvalues are zero) then the test is considered to be inconclusive.
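A small numerical sketch of the higher-dimensional test, assuming the Hessian has already been computed by hand for the illustrative function f(x, y) = x² − y² at its critical point, the origin:

    import numpy as np

    H = np.array([[2.0,  0.0],
                  [0.0, -2.0]])            # Hessian of f(x, y) = x**2 - y**2 at (0, 0)
    eigenvalues = np.linalg.eigvalsh(H)    # eigenvalues of a symmetric matrix are real
    if np.all(eigenvalues > 0):
        print('local minimum')
    elif np.all(eigenvalues < 0):
        print('local maximum')
    elif np.any(eigenvalues > 0) and np.any(eigenvalues < 0):
        print('saddle point')
    else:
        print('test inconclusive')

Here the eigenvalues are 2 and −2, one of each sign, so the origin is correctly reported as a saddle point.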

Calculus of variations

One example of an optimization problem is: Find the shortest curve between two points on a surface, assuming that the curve must also lie on the surface. If the surface is a plane, then the shortest curve is a line. But if the surface is, for example, egg-shaped, then the shortest path is not immediately clear. These paths are called geodesics, and one of the simplest problems in the calculus of variations is finding geodesics. Another example is: Find the smallest area surface filling in a closed curve in space. This surface is called a minimal surface and it, too, can be found using the calculus of variations.

Physics

Calculus is of vital importance in physics: many physical processes are described by equations involving derivatives, called differential equations. Physics is particularly concerned with the way quantities change and develop over time, and the concept of the "time derivative" — the rate of change over time — is essential for the precise definition of several important concepts. In particular, the time derivatives of an object's position are significant in Newtonian physics:
  • velocity is the derivative (with respect to time) of an object's displacement (distance from the original position)
  • acceleration is the derivative (with respect to time) of an object's velocity, that is, the second derivative (with respect to time) of an object's position.
For example, if an object's position on a line is given by
x(t)=-16t^{2}+16t+32,\,\!
then the object's velocity is
{\dot {x}}(t)=x'(t)=-32t+16,\,\!
and the object's acceleration is
{\ddot {x}}(t)=x''(t)=-32,\,\!
which is constant.
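The two derivatives above can be checked symbolically; a brief Python sketch using sympy:

    import sympy as sp

    t = sp.symbols('t')
    x = -16*t**2 + 16*t + 32   # the position function from the example
    v = sp.diff(x, t)          # velocity: -32*t + 16
    a = sp.diff(v, t)          # acceleration: -32, a constant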

Differential equations

A differential equation is a relation between a collection of functions and their derivatives. An ordinary differential equation is a differential equation that relates functions of one variable to their derivatives with respect to that variable. A partial differential equation is a differential equation that relates functions of more than one variable to their partial derivatives. Differential equations arise naturally in the physical sciences, in mathematical modelling, and within mathematics itself. For example, Newton's second law, which describes the relationship between acceleration and force, can be stated as the ordinary differential equation
F(t)=m{\frac {d^{2}x}{dt^{2}}}.
The heat equation in one space variable, which describes how heat diffuses through a straight rod, is the partial differential equation
{\frac {\partial u}{\partial t}}=\alpha {\frac {\partial ^{2}u}{\partial x^{2}}}.
Here u(x,t) is the temperature of the rod at position x and time t and α is a constant that depends on how fast heat diffuses through the rod.
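Differential equations like this are often solved numerically. The following Python sketch applies a standard explicit finite-difference scheme to the heat equation; the grid, time step, and initial profile are arbitrary choices for illustration, with the step sizes picked so that α·Δt/Δx² ≤ 1/2, the usual stability condition for this scheme:

    import numpy as np

    alpha, dx, dt = 0.01, 0.1, 0.1                  # alpha*dt/dx**2 = 0.1, within the stability bound
    u = np.sin(np.pi * np.linspace(0.0, 1.0, 11))   # initial temperature profile along the rod
    for _ in range(100):                            # march forward in time
        # discrete version of du/dt = alpha * d2u/dx2 at the interior grid points
        u[1:-1] += alpha * dt / dx**2 * (u[2:] - 2*u[1:-1] + u[:-2])
    # the endpoints are never updated: the ends of the rod are held at temperature 0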

Mean value theorem

The mean value theorem: For each differentiable function {\displaystyle f:[a,b]\to \mathbb {R} } with a<b there is a c\in (a,b) with {\displaystyle f'(c)={\tfrac {f(b)-f(a)}{b-a}}}.

The mean value theorem gives a relationship between values of the derivative and values of the original function. If f(x) is a real-valued function and a and b are numbers with a < b, then the mean value theorem says that under mild hypotheses, the slope between the two points (a, f(a)) and (b, f(b)) is equal to the slope of the tangent line to f at some point c between a and b. In other words,
f'(c)={\frac {f(b)-f(a)}{b-a}}.
In practice, what the mean value theorem does is control a function in terms of its derivative. For instance, suppose that f has derivative equal to zero at each point. This means that its tangent line is horizontal at every point, so the function should also be horizontal. The mean value theorem proves that this must be true: The slope between any two points on the graph of f must equal the slope of one of the tangent lines of f. All of those slopes are zero, so any line from one point on the graph to another point will also have slope zero. But that says that the function does not move up or down, so it must be a horizontal line. More complicated conditions on the derivative lead to less precise but still highly useful information about the original function.
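The theorem is easy to illustrate numerically. A sketch in Python with sympy, using the arbitrary example f(x) = x² on [0, 2]:

    import sympy as sp

    x = sp.symbols('x')
    f = x**2                                          # illustrative function
    a, b = 0, 2
    slope = (f.subs(x, b) - f.subs(x, a)) / (b - a)   # secant slope: (4 - 0)/2 = 2
    c = sp.solve(sp.Eq(sp.diff(f, x), slope), x)      # [1]; indeed 1 lies in (0, 2)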

Taylor polynomials and Taylor series

The derivative gives the best possible linear approximation of a function at a given point, but this can be very different from the original function. One way of improving the approximation is to take a quadratic approximation. That is to say, the linearization of a real-valued function f(x) at the point x0 is a linear polynomial a + b(x − x0), and it may be possible to get a better approximation by considering a quadratic polynomial a + b(x − x0) + c(x − x0)². Still better might be a cubic polynomial a + b(x − x0) + c(x − x0)² + d(x − x0)³, and this idea can be extended to arbitrarily high degree polynomials. For each one of these polynomials, there should be a best possible choice of coefficients a, b, c, and d that makes the approximation as good as possible.

In the neighbourhood of x0, the best possible choice for a is always f(x0), and the best possible choice for b is always f'(x0). The coefficients c, d, and those of higher degree are determined by higher derivatives of f: c should always be f''(x0)/2, and d should always be f'''(x0)/3!. Using these coefficients gives the Taylor polynomial of f. The Taylor polynomial of degree d is the polynomial of degree d which best approximates f, and its coefficients can be found by a generalization of the above formulas. Taylor's theorem gives a precise bound on how good the approximation is. If f is a polynomial of degree less than or equal to d, then the Taylor polynomial of degree d equals f.
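These coefficients can be generated mechanically. A short Python sketch using sympy's series expansion, with sin(x) about x0 = 0 as an arbitrary example:

    import sympy as sp

    x = sp.symbols('x')
    # degree-3 Taylor polynomial of sin(x) about x0 = 0
    p3 = sp.series(sp.sin(x), x, 0, 4).removeO()   # x - x**3/6
    # coefficients agree with f(0) = 0, f'(0) = 1, f''(0)/2 = 0, f'''(0)/3! = -1/6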

The limit of the Taylor polynomials is an infinite series called the Taylor series. The Taylor series is frequently a very good approximation to the original function. Functions which are equal to their Taylor series are called analytic functions. It is impossible for functions with discontinuities or sharp corners to be analytic, but there are smooth functions which are not analytic.

Implicit function theorem

Some natural geometric shapes, such as circles, cannot be drawn as the graph of a function. For instance, if f(x, y) = x² + y² − 1, then the circle is the set of all pairs (x, y) such that f(x, y) = 0. This set is called the zero set of f. It is not the same as the graph of f, which is a paraboloid. The implicit function theorem converts relations such as f(x, y) = 0 into functions. It states that if f is continuously differentiable, then around most points, the zero set of f looks like graphs of functions pasted together. The points where this is not true are determined by a condition on the derivative of f. The circle, for instance, can be pasted together from the graphs of the two functions ±√(1 − x²). In a neighborhood of every point on the circle except (−1, 0) and (1, 0), one of these two functions has a graph that looks like the circle. (These two functions also happen to meet (−1, 0) and (1, 0), but this is not guaranteed by the implicit function theorem.)
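For the circle, the two branches and the derivative condition can be computed symbolically; a minimal Python sketch with sympy:

    import sympy as sp

    x, y = sp.symbols('x y')
    f = x**2 + y**2 - 1
    branches = sp.solve(sp.Eq(f, 0), y)   # [-sqrt(1 - x**2), sqrt(1 - x**2)]
    df_dy = sp.diff(f, y)                 # 2*y, which vanishes on the circle exactly at (-1, 0) and (1, 0)

The condition ∂f/∂y ≠ 0 is precisely what fails at (−1, 0) and (1, 0), the two points the theorem excludes.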

The implicit function theorem is closely related to the inverse function theorem, which states when a function looks like graphs of invertible functions pasted together.
