Search This Blog

Saturday, July 2, 2022

Poisson distribution

From Wikipedia, the free encyclopedia
 
Poisson Distribution
Probability mass function
Poisson pmf.svg
The horizontal axis is the index k, the number of occurrences. λ is the expected rate of occurrences. The vertical axis is the probability of k occurrences given λ. The function is defined only at integer values of k; the connecting lines are only guides for the eye.
 
Cumulative distribution function
Poisson cdf.svg
The horizontal axis is the index k, the number of occurrences. The CDF is discontinuous at the integers of k and flat everywhere else because a variable that is Poisson distributed takes on only integer values.
Notation
Parameters (rate)
Support (Natural numbers starting from 0)
PMF
CDF

, or , or

(for , where is the upper incomplete gamma function, is the floor function, and Q is the regularized gamma function)
Mean
Median
Mode
Variance
Skewness
Ex. kurtosis
Entropy

(for large )

MGF
CF
PGF
Fisher information

In probability theory and statistics, the Poisson distribution is a discrete probability distribution that expresses the probability of a given number of events occurring in a fixed interval of time or space if these events occur with a known constant mean rate and independently of the time since the last event. It is named after French mathematician Siméon Denis Poisson (/ˈpwɑːsɒn/; French pronunciation: ​[pwasɔ̃]). The Poisson distribution can also be used for the number of events in other specified interval types such as distance, area or volume.

For instance, a call center receives an average of 180 calls per hour, 24 hours a day. The calls are independent; receiving one does not change the probability of when the next one will arrive. The number of calls received during any minute has a Poisson probability distribution: the most likely numbers are 2 and 3 but 1 and 4 are also likely and there is a small probability of it being as low as zero and a very small probability it could be 10. Another example is the number of decay events that occur from a radioactive source during a defined observation period.

History

The distribution was first introduced by Siméon Denis Poisson (1781–1840) and published together with his probability theory in his work Recherches sur la probabilité des jugements en matière criminelle et en matière civile (1837). The work theorized about the number of wrongful convictions in a given country by focusing on certain random variables N that count, among other things, the number of discrete occurrences (sometimes called "events" or "arrivals") that take place during a time-interval of given length. The result had already been given in 1711 by Abraham de Moivre in De Mensura Sortis seu; de Probabilitate Eventuum in Ludis a Casu Fortuito Pendentibus. This makes it an example of Stigler's law and it has prompted some authors to argue that the Poisson distribution should bear the name of de Moivre.

In 1860, Simon Newcomb fitted the Poisson distribution to the number of stars found in a unit of space. A further practical application of this distribution was made by Ladislaus Bortkiewicz in 1898 when he was given the task of investigating the number of soldiers in the Prussian army killed accidentally by horse kicks; this experiment introduced the Poisson distribution to the field of reliability engineering.

Definitions

Probability mass function

A discrete random variable X is said to have a Poisson distribution, with parameter , if it has a probability mass function given by:

where

  • k is the number of occurrences ()
  • e is Euler's number ()
  • ! is the factorial function.

The positive real number λ is equal to the expected value of X and also to its variance.

The Poisson distribution can be applied to systems with a large number of possible events, each of which is rare. The number of such events that occur during a fixed time interval is, under the right circumstances, a random number with a Poisson distribution.

The equation can be adapted if, instead of the average number of events , we are given the average rate at which events occur. Then , and

Example

Chewing gum on a sidewalk in Reykjavík.
Chewing gum on a sidewalk. The number of chewing gums on a single tile is approximately Poisson distributed.

The Poisson distribution may be useful to model events such as:

  • the number of meteorites greater than 1 meter diameter that strike Earth in a year;
  • the number of patients arriving in an emergency room between 10 and 11 pm; and
  • the number of laser photons hitting a detector in a particular time interval.

Assumptions and validity

The Poisson distribution is an appropriate model if the following assumptions are true:

  • k is the number of times an event occurs in an interval and k can take values 0, 1, 2, ….
  • The occurrence of one event does not affect the probability that a second event will occur. That is, events occur independently.
  • The average rate at which events occur is independent of any occurrences. For simplicity, this is usually assumed to be constant, but may in practice vary with time.
  • Two events cannot occur at exactly the same instant; instead, at each very small sub-interval, either exactly one event occurs, or no event occurs.

If these conditions are true, then k is a Poisson random variable, and the distribution of k is a Poisson distribution.

The Poisson distribution is also the limit of a binomial distribution, for which the probability of success for each trial equals λ divided by the number of trials, as the number of trials approaches infinity (see Related distributions).

Examples of probability for Poisson distributions

Once in an interval events: The special case of λ = 1 and k = 0

Suppose that astronomers estimate that large meteorites (above a certain size) hit the earth on average once every 100 years (λ = 1 event per 100 years), and that the number of meteorite hits follows a Poisson distribution. What is the probability of k = 0 meteorite hits in the next 100 years?

Under these assumptions, the probability that no large meteorites hit the earth in the next 100 years is roughly 0.37. The remaining 1 − 0.37 = 0.63 is the probability of 1, 2, 3, or more large meteorite hits in the next 100 years. In an example above, an overflow flood occurred once every 100 years (λ = 1). The probability of no overflow floods in 100 years was roughly 0.37, by the same calculation.

In general, if an event occurs on average once per interval (λ = 1), and the events follow a Poisson distribution, then P(0 events in next interval) = 0.37. In addition, P(exactly one event in next interval) = 0.37, as shown in the table for overflow floods.

Examples that violate the Poisson assumptions

The number of students who arrive at the student union per minute will likely not follow a Poisson distribution, because the rate is not constant (low rate during class time, high rate between class times) and the arrivals of individual students are not independent (students tend to come in groups).

The number of magnitude 5 earthquakes per year in a country may not follow a Poisson distribution if one large earthquake increases the probability of aftershocks of similar magnitude.

Examples in which at least one event is guaranteed are not Poisson distributed; but may be modeled using a zero-truncated Poisson distribution.

Count distributions in which the number of intervals with zero events is higher than predicted by a Poisson model may be modeled using a zero-inflated model.

Properties

Descriptive statistics

  • The expected value and variance of a Poisson-distributed random variable are both equal to λ.
  • The coefficient of variation is , while the index of dispersion is 1.
  • The mean absolute deviation about the mean is
  • The mode of a Poisson-distributed random variable with non-integer λ is equal to , which is the largest integer less than or equal to λ. This is also written as floor(λ). When λ is a positive integer, the modes are λ and λ − 1.
  • All of the cumulants of the Poisson distribution are equal to the expected value λ. The nth factorial moment of the Poisson distribution is λn.
  • The expected value of a Poisson process is sometimes decomposed into the product of intensity and exposure (or more generally expressed as the integral of an "intensity function" over time or space, sometimes described as “exposure”).

Median

Bounds for the median () of the distribution are known and are sharp:

Higher moments

The higher non-centered moments, mk of the Poisson distribution, are Touchard polynomials in λ:

where the {braces} denote Stirling numbers of the second kind. The coefficients of the polynomials have a combinatorial meaning. In fact, when the expected value of the Poisson distribution is 1, then Dobinski's formula says that the nth moment equals the number of partitions of a set of size n.

A simple bound is

Sums of Poisson-distributed random variables

If for are independent, then A converse is Raikov's theorem, which says that if the sum of two independent random variables is Poisson-distributed, then so are each of those two independent random variables.

Other properties

  • The Poisson distributions are infinitely divisible probability distributions.
  • The directed Kullback–Leibler divergence of from is given by
  • Bounds for the tail probabilities of a Poisson random variable can be derived using a Chernoff bound argument.
  • The upper tail probability can be tightened (by a factor of at least two) as follows:
    where is the directed Kullback–Leibler divergence, as described above.
  • Inequalities that relate the distribution function of a Poisson random variable to the Standard normal distribution function are as follows:
    where is again the directed Kullback–Leibler divergence.

Poisson races

Let and be independent random variables, with , then we have that

The upper bound is proved using a standard Chernoff bound.

The lower bound can be proved by noting that is the probability that , where , which is bounded below by , where is relative entropy (See the entry on bounds on tails of binomial distributions for details). Further noting that , and computing a lower bound on the unconditional probability gives the result. More details can be found in the appendix of Kamath et al..

Related distributions

General

  • If and are independent, then the difference follows a Skellam distribution.
  • If and are independent, then the distribution of conditional on is a binomial distribution.
    Specifically, if , then .
    More generally, if X1, X2, …, Xn are independent Poisson random variables with parameters λ1, λ2, …, λn then
    given it follows that . In fact, .
  • If and the distribution of , conditional on X = k, is a binomial distribution, , then the distribution of Y follows a Poisson distribution . In fact, if , conditional on X = k, follows a multinomial distribution, , then each follows an independent Poisson distribution .
  • The Poisson distribution can be derived as a limiting case to the binomial distribution as the number of trials goes to infinity and the expected number of successes remains fixed — see law of rare events below. Therefore, it can be used as an approximation of the binomial distribution if n is sufficiently large and p is sufficiently small. There is a rule of thumb stating that the Poisson distribution is a good approximation of the binomial distribution if n is at least 20 and p is smaller than or equal to 0.05, and an excellent approximation if n ≥ 100 and np ≤ 10.
  • The Poisson distribution is a special case of the discrete compound Poisson distribution (or stuttering Poisson distribution) with only a parameter. The discrete compound Poisson distribution can be deduced from the limiting distribution of univariate multinomial distribution. It is also a special case of a compound Poisson distribution.
  • For sufficiently large values of λ, (say λ>1000), the normal distribution with mean λ and variance λ (standard deviation ) is an excellent approximation to the Poisson distribution. If λ is greater than about 10, then the normal distribution is a good approximation if an appropriate continuity correction is performed, i.e., if P(Xx), where x is a non-negative integer, is replaced by P(Xx + 0.5).
  • Variance-stabilizing transformation: If , then
    and
    Under this transformation, the convergence to normality (as increases) is far faster than the untransformed variable. Other, slightly more complicated, variance stabilizing transformations are available, one of which is Anscombe transform. See Data transformation (statistics) for more general uses of transformations.
  • If for every t > 0 the number of arrivals in the time interval [0, t] follows the Poisson distribution with mean λt, then the sequence of inter-arrival times are independent and identically distributed exponential random variables having mean 1/λ.
  • The cumulative distribution functions of the Poisson and chi-squared distributions are related in the following ways:
    and

Poisson approximation

Assume where , then is multinomially distributed conditioned on .

This means, among other things, that for any nonnegative function , if is multinomially distributed, then

where .

The factor of can be replaced by 2 if is further assumed to be monotonically increasing or decreasing.

Bivariate Poisson distribution

This distribution has been extended to the bivariate case. The generating function for this distribution is

with

The marginal distributions are Poisson(θ1) and Poisson(θ2) and the correlation coefficient is limited to the range

A simple way to generate a bivariate Poisson distribution is to take three independent Poisson distributions with means and then set . The probability function of the bivariate Poisson distribution is

Free Poisson distribution

The free Poisson distribution with jump size and rate arises in free probability theory as the limit of repeated free convolution

as N → ∞.

In other words, let be random variables so that has value with probability and value 0 with the remaining probability. Assume also that the family are freely independent. Then the limit as of the law of is given by the Free Poisson law with parameters .

This definition is analogous to one of the ways in which the classical Poisson distribution is obtained from a (classical) Poisson process.

The measure associated to the free Poisson law is given by

where
and has support .

This law also arises in random matrix theory as the Marchenko–Pastur law. Its free cumulants are equal to .

Some transforms of this law

We give values of some important transforms of the free Poisson law; the computation can be found in e.g. in the book Lectures on the Combinatorics of Free Probability by A. Nica and R. Speicher.

The R-transform of the free Poisson law is given by

The Cauchy transform (which is the negative of the Stieltjes transformation) is given by

The S-transform is given by

in the case that .

Weibull and Stable count

Poisson's probability mass function can be expressed in a form similar to the product distribution of a Weibull distribution and a variant form of the stable count distribution. The variable can be regarded as inverse of Lévy's stability parameter in the stable count distribution:

where is a standard stable count distribution of shape , and is a standard Weibull distribution of shape .

Statistical inference

Parameter estimation

Given a sample of n measured values , for i = 1, …, n, we wish to estimate the value of the parameter λ of the Poisson population from which the sample was drawn. The maximum likelihood estimate is 

Since each observation has expectation λ so does the sample mean. Therefore, the maximum likelihood estimate is an unbiased estimator of λ. It is also an efficient estimator since its variance achieves the Cramér–Rao lower bound (CRLB). Hence it is minimum-variance unbiased. Also it can be proven that the sum (and hence the sample mean as it is a one-to-one function of the sum) is a complete and sufficient statistic for λ.

To prove sufficiency we may use the factorization theorem. Consider partitioning the probability mass function of the joint Poisson distribution for the sample into two parts: one that depends solely on the sample (called ) and one that depends on the parameter and the sample only through the function . Then is a sufficient statistic for .

The first term, , depends only on . The second term, , depends on the sample only through . Thus, is sufficient.

To find the parameter λ that maximizes the probability function for the Poisson population, we can use the logarithm of the likelihood function:

We take the derivative of with respect to λ and compare it to zero:

Solving for λ gives a stationary point.

So λ is the average of the ki values. Obtaining the sign of the second derivative of L at the stationary point will determine what kind of extreme value λ is.

Evaluating the second derivative at the stationary point gives:

which is the negative of n times the reciprocal of the average of the ki. This expression is negative when the average is positive. If this is satisfied, then the stationary point maximizes the probability function.

For completeness, a family of distributions is said to be complete if and only if implies that for all . If the individual are iid , then . Knowing the distribution we want to investigate, it is easy to see that the statistic is complete.

For this equality to hold, must be 0. This follows from the fact that none of the other terms will be 0 for all in the sum and for all possible values of . Hence, for all implies that , and the statistic has been shown to be complete.

Confidence interval

The confidence interval for the mean of a Poisson distribution can be expressed using the relationship between the cumulative distribution functions of the Poisson and chi-squared distributions. The chi-squared distribution is itself closely related to the gamma distribution, and this leads to an alternative expression. Given an observation k from a Poisson distribution with mean μ, a confidence interval for μ with confidence level 1 – α is

or equivalently,

where is the quantile function (corresponding to a lower tail area p) of the chi-squared distribution with n degrees of freedom and is the quantile function of a gamma distribution with shape parameter n and scale parameter 1. This interval is 'exact' in the sense that its coverage probability is never less than the nominal 1 – α.

When quantiles of the gamma distribution are not available, an accurate approximation to this exact interval has been proposed (based on the Wilson–Hilferty transformation):

where denotes the standard normal deviate with upper tail area α / 2.

For application of these formulae in the same context as above (given a sample of n measured values ki each drawn from a Poisson distribution with mean λ), one would set

calculate an interval for μ = , and then derive the interval for λ.

Bayesian inference

In Bayesian inference, the conjugate prior for the rate parameter λ of the Poisson distribution is the gamma distribution. Let

denote that λ is distributed according to the gamma density g parameterized in terms of a shape parameter α and an inverse scale parameter β:

Then, given the same sample of n measured values ki as before, and a prior of Gamma(α, β), the posterior distribution is

Note that the posterior mean is linear and is given by

It can be show that gamma distribution is the only prior that induces linearity of the conditional mean. Moreover, a converse results exists which states that if the conditional mean is close to a linear function in the distance than the prior distribution of λ must be close to gamma distribution in Levy distance.

The posterior mean E[λ] approaches the maximum likelihood estimate in the limit as , which follows immediately from the general expression of the mean of the gamma distribution.

The posterior predictive distribution for a single additional observation is a negative binomial distribution, sometimes called a gamma–Poisson distribution.

Simultaneous estimation of multiple Poisson means

Suppose is a set of independent random variables from a set of Poisson distributions, each with a parameter , , and we would like to estimate these parameters. Then, Clevenson and Zidek show that under the normalized squared error loss , when , then, similar as in Stein's example for the Normal means, the MLE estimator is inadmissible

In this case, a family of minimax estimators is given for any and as

Occurrence and applications

Applications of the Poisson distribution can be found in many fields including:

The Poisson distribution arises in connection with Poisson processes. It applies to various phenomena of discrete properties (that is, those that may happen 0, 1, 2, 3, … times during a given period of time or in a given area) whenever the probability of the phenomenon happening is constant in time or space. Examples of events that may be modelled as a Poisson distribution include:

  • The number of soldiers killed by horse-kicks each year in each corps in the Prussian cavalry. This example was used in a book by Ladislaus Bortkiewicz (1868–1931).
  • The number of yeast cells used when brewing Guinness beer. This example was used by William Sealy Gosset (1876–1937).
  • The number of phone calls arriving at a call centre within a minute. This example was described by A.K. Erlang (1878–1929).
  • Internet traffic.
  • The number of goals in sports involving two competing teams.
  • The number of deaths per year in a given age group.
  • The number of jumps in a stock price in a given time interval.
  • Under an assumption of homogeneity, the number of times a web server is accessed per minute.
  • The number of mutations in a given stretch of DNA after a certain amount of radiation.
  • The proportion of cells that will be infected at a given multiplicity of infection.
  • The number of bacteria in a certain amount of liquid.
  • The arrival of photons on a pixel circuit at a given illumination and over a given time period.
  • The targeting of V-1 flying bombs on London during World War II investigated by R. D. Clarke in 1946.

Gallagher showed in 1976 that the counts of prime numbers in short intervals obey a Poisson distribution provided a certain version of the unproved prime r-tuple conjecture of Hardy-Littlewood is true.

Law of rare events

Comparison of the Poisson distribution (black lines) and the binomial distribution with n = 10 (red circles), n = 20 (blue circles), n = 1000 (green circles). All distributions have a mean of 5. The horizontal axis shows the number of events k. As n gets larger, the Poisson distribution becomes an increasingly better approximation for the binomial distribution with the same mean.

The rate of an event is related to the probability of an event occurring in some small subinterval (of time, space or otherwise). In the case of the Poisson distribution, one assumes that there exists a small enough subinterval for which the probability of an event occurring twice is "negligible". With this assumption one can derive the Poisson distribution from the Binomial one, given only the information of expected number of total events in the whole interval.

Let the total number of events in the whole interval be denoted by . Divide the whole interval into subintervals of equal size, such that (since we are interested in only very small portions of the interval this assumption is meaningful). This means that the expected number of events in each of the n subintervals is equal to .

Now we assume that the occurrence of an event in the whole interval can be seen as a sequence of n Bernoulli trials, where the -th Bernoulli trial corresponds to looking whether an event happens at the subinterval with probability . The expected number of total events in such trials would be , the expected number of total events in the whole interval. Hence for each subdivision of the interval we have approximated the occurrence of the event as a Bernoulli process of the form . As we have noted before we want to consider only very small subintervals. Therefore, we take the limit as goes to infinity.

In this case the binomial distribution converges to what is known as the Poisson distribution by the Poisson limit theorem.

In several of the above examples — such as, the number of mutations in a given sequence of DNA—the events being counted are actually the outcomes of discrete trials, and would more precisely be modelled using the binomial distribution, that is

In such cases n is very large and p is very small (and so the expectation np is of intermediate magnitude). Then the distribution may be approximated by the less cumbersome Poisson distribution

This approximation is sometimes known as the law of rare events, since each of the n individual Bernoulli events rarely occurs.

The name "law of rare events" may be misleading because the total count of success events in a Poisson process need not be rare if the parameter np is not small. For example, the number of telephone calls to a busy switchboard in one hour follows a Poisson distribution with the events appearing frequent to the operator, but they are rare from the point of view of the average member of the population who is very unlikely to make a call to that switchboard in that hour.

The variance of the binomial distribution is 1 − p times that of the Poisson distribution, so almost equal when p is very small.

The word law is sometimes used as a synonym of probability distribution, and convergence in law means convergence in distribution. Accordingly, the Poisson distribution is sometimes called the "law of small numbers" because it is the probability distribution of the number of occurrences of an event that happens rarely but has very many opportunities to happen. The Law of Small Numbers is a book by Ladislaus Bortkiewicz about the Poisson distribution, published in 1898.

Poisson point process

The Poisson distribution arises as the number of points of a Poisson point process located in some finite region. More specifically, if D is some region space, for example Euclidean space Rd, for which |D|, the area, volume or, more generally, the Lebesgue measure of the region is finite, and if N(D) denotes the number of points in D, then

Poisson regression and negative binomial regression

Poisson regression and negative binomial regression are useful for analyses where the dependent (response) variable is the count (0, 1, 2, …) of the number of events or occurrences in an interval.

Other applications in science

In a Poisson process, the number of observed occurrences fluctuates about its mean λ with a standard deviation . These fluctuations are denoted as Poisson noise or (particularly in electronics) as shot noise.

The correlation of the mean and standard deviation in counting independent discrete occurrences is useful scientifically. By monitoring how the fluctuations vary with the mean signal, one can estimate the contribution of a single occurrence, even if that contribution is too small to be detected directly. For example, the charge e on an electron can be estimated by correlating the magnitude of an electric current with its shot noise. If N electrons pass a point in a given time t on the average, the mean current is ; since the current fluctuations should be of the order (i.e., the standard deviation of the Poisson process), the charge can be estimated from the ratio .

An everyday example is the graininess that appears as photographs are enlarged; the graininess is due to Poisson fluctuations in the number of reduced silver grains, not to the individual grains themselves. By correlating the graininess with the degree of enlargement, one can estimate the contribution of an individual grain (which is otherwise too small to be seen unaided). Many other molecular applications of Poisson noise have been developed, e.g., estimating the number density of receptor molecules in a cell membrane.

In causal set theory the discrete elements of spacetime follow a Poisson distribution in the volume.

Computational methods

The Poisson distribution poses two different tasks for dedicated software libraries: Evaluating the distribution , and drawing random numbers according to that distribution.

Evaluating the Poisson distribution

Computing for given and is a trivial task that can be accomplished by using the standard definition of in terms of exponential, power, and factorial functions. However, the conventional definition of the Poisson distribution contains two terms that can easily overflow on computers: λk and k!. The fraction of λk to k! can also produce a rounding error that is very large compared to e−λ, and therefore give an erroneous result. For numerical stability the Poisson probability mass function should therefore be evaluated as

which is mathematically equivalent but numerically stable. The natural logarithm of the Gamma function can be obtained using the lgamma function in the C standard library (C99 version) or R, the gammaln function in MATLAB or SciPy, or the log_gamma function in Fortran 2008 and later.

Some computing languages provide built-in functions to evaluate the Poisson distribution, namely

  • R: function dpois(x, lambda);
  • Excel: function POISSON( x, mean, cumulative), with a flag to specify the cumulative distribution;
  • Mathematica: univariate Poisson distribution as PoissonDistribution[], bivariate Poisson distribution as MultivariatePoissonDistribution[,{ , }].

Random drawing from the Poisson distribution

The less trivial task is to draw random integers from the Poisson distribution with given .

Solutions are provided by:

Generating Poisson-distributed random variables

A simple algorithm to generate random Poisson-distributed numbers (pseudo-random number sampling) has been given by Knuth:

algorithm poisson random number (Knuth):
    init:
        Let L ← e−λ, k ← 0 and p ← 1.
    do:
        k ← k + 1.
        Generate uniform random number u in [0,1] and let p ← p × u.
    while p > L.
    return k − 1.

The complexity is linear in the returned value k, which is λ on average. There are many other algorithms to improve this. Some are given in Ahrens & Dieter.

For large values of λ, the value of L = e−λ may be so small that it is hard to represent. This can be solved by a change to the algorithm which uses an additional parameter STEP such that e−STEP does not underflow:

algorithm poisson random number (Junhao, based on Knuth):
    init:
        Let λLeft ← λ, k ← 0 and p ← 1.
    do:
        k ← k + 1.
        Generate uniform random number u in (0,1) and let p ← p × u.
        while p < 1 and λLeft > 0:
            if λLeft > STEP:
                p ← p × eSTEP
                λLeft ← λLeft − STEP
            else:
                p ← p × eλLeft
                λLeft ← 0
    while p > 1.
    return k − 1.

The choice of STEP depends on the threshold of overflow. For double precision floating point format the threshold is near e700, so 500 should be a safe STEP.

Other solutions for large values of λ include rejection sampling and using Gaussian approximation.

Inverse transform sampling is simple and efficient for small values of λ, and requires only one uniform random number u per sample. Cumulative probabilities are examined in turn until one exceeds u.

algorithm Poisson generator based upon the inversion by sequential search:
    init:
        Let x ← 0, p ← e−λ, s ← p.
        Generate uniform random number u in [0,1].
    while u > s do:
        x ← x + 1.
        p ← p × λ / x.
        s ← s + p.
    return x.

Vegetable oil

From Wikipedia, the free encyclopedia
 

Vegetable oils, or vegetable fats, are oils extracted from seeds or from other parts of fruits. Like animal fats, vegetable fats are mixtures of triglycerides. Soybean oil, grape seed oil, and cocoa butter are examples of seed oils, or fats from seeds. Olive oil, palm oil, and rice bran oil are examples of fats from other parts of fruits. In common usage, vegetable oil may refer exclusively to vegetable fats which are liquid at room temperature. Vegetable oils are usually edible.

Uses

In antiquity

Oils extracted from plants have been used since ancient times and in many cultures. Archaeological evidence shows that olives were turned into olive oil by 6000 BCE and 4500 BCE in present-day Israel and Palestine.

In addition to use as food, fats and oils (both vegetable and mineral) have long been used as fuel, typically in lamps which were a principal source of illumination in ancient times. Oils may have been used for lubrication, but there is no evidence for this. Vegetable oils were probably more valuable as food and lamp-oil; Babylonian mineral oil was known to be used as fuel, but there are no references to lubrication. Pliny the Elder reported that animal-derived fats such as lard were used to lubricate the axles of carts.

Culinary

Many vegetable oils are consumed directly, or indirectly as ingredients in food – a role that they share with some animal fats, including butter, ghee, lard, and schmaltz. The oils serve a number of purposes in this role:

  • Shortening – as in giving pastries a crumbly texture.
  • Enriching – adding calories and satisfaction in consumption
  • Texture – altering how ingredients combine, especially fats and starches
  • Flavoring – examples include olive, sesame, or almond oil
  • Flavor base – oils can also "carry" flavors of other ingredients, such as peppers, since many flavors are due to chemicals that are soluble in oil.

Oils can be heated to temperatures significantly higher than the boiling point of water, 100 °C (212 °F), and used to fry foods. Oils for this purpose must have a high flash point. Such oils include both the major cooking oils – soybean, rapeseed, canola, sunflower, safflower, peanut, cottonseed, etc. – and tropical oils, such as coconut, palm, and rice bran. The latter are particularly valued in Asian cultures for high-temperature cooking, because of their unusually high flash points.

Hydrogenated oils

Unsaturated vegetable oils can be transformed through partial or complete "hydrogenation" into oils of higher melting point, some of which, such as vegetable shortening, will remain solid at room temperature.

The hydrogenation process involves "sparging" the oil at high temperature and pressure with hydrogen in the presence of a catalyst, typically a powdered nickel compound, such as Raney nickel. Chemically, hydrogenation is the reduction of a carbon-carbon double bond to a single bond, by addition of hydrogen atoms. Since the surface of the metal catalyst is covered with hydrogen atoms, when the double bonds of the unsaturated oil come into contact with the catalyst, it reacts with the hydrogen atoms, forming new bonds with the two carbon atoms; each carbon atom becomes single-bonded to an individual hydrogen atom, and the double bond between carbons can no longer exist. In organic chemistry, unsaturation is considered as a pair of hydrogen atoms missing from the (hypothetical) fully-saturated carbon chain. The level to which an organic molecule is deficient in hydrogen, is called the degree of unsaturation (DoU); as the degree of unsaturation decreases, the oil progresses toward being fully hydrogenated (when DoU = 0). A fully hydrogenated oil, also called a saturated fat, has had all of its double bonds converted into single bonds. If a polyunsaturated oil is left incompletely-hydrogenated (not all of the double bonds are reduced to single bonds), then it is a "partially hydrogenated oil" (PHO). PHOs are the primary dietary source of artificial trans fat in processed foods. An oil may be hydrogenated to increase resistance to rancidity (oxidation) or to change its physical characteristics. As the degree of saturation is raised by full or partial hydrogenation, the oil's viscosity and melting point increase.

The use of hydrogenated oils in foods has never been completely satisfactory. Because the center arm of the triglyceride is shielded somewhat by the end fatty acids, most of the hydrogenation occurs on the end fatty acids, thus making the resulting fat more brittle. A margarine made from naturally more saturated oils will be more plastic (more "spreadable") than a margarine made from hydrogenated soy oil. While full hydrogenation produces largely saturated fatty acids, partial hydrogenation results in the transformation of unsaturated cis fatty acids to unsaturated trans fatty acids in the oil mixture due to the heat used in hydrogenation. Partially hydrogenated oils and their trans fats have been linked to an increased risk of mortality from coronary heart disease, among other increased health risks.

In the US, the Standard of Identity for a product labeled as "vegetable oil margarine" specifies only canola, safflower, sunflower, corn, soybean, or peanut oil may be used. Products not labeled "vegetable oil margarine" do not have that restriction.

Industrial

Vegetable oils are used as an ingredient or component in many manufactured products.

Many vegetable oils are used to make soaps, skin products, candles, perfumes and other personal care and cosmetic products. Some oils are particularly suitable as drying oils, and are used in making paints and other wood treatment products. They are used in alkyd resin production. Dammar oil (a mixture of linseed oil and dammar resin), for example, is used almost exclusively in treating the hulls of wooden boats. Vegetable oils are increasingly being used in the electrical industry as insulators as vegetable oils are not toxic to the environment, biodegradable if spilled and have high flash and fire points. However, vegetable oils are less stable chemically, so they are generally used in systems where they are not exposed to oxygen, and they are more expensive than crude oil distillate. Synthetic tetraesters, which are similar to vegetable oils but with four fatty acid chains compared to the normal three found in a natural ester, are manufactured by Fischer esterification. Tetraesters generally have high stability to oxidation and have found use as engine lubricants. Vegetable oil is being used to produce biodegradable hydraulic fluid and lubricant.

One limiting factor in industrial uses of vegetable oils is that all such oils are susceptible to becoming rancid. Oils that are more stable, such as ben oil or mineral oil, are thus preferred for industrial uses. Castor oil has numerous industrial uses, owing to the presence of hydroxyl group on the fatty acid. Castor oil is a precursor to Nylon 11.Castor oil may also be reacted with epichlorohydrin to make a glycidyl ether which is used as a diluent and flexibilizer with epoxy resins.

Pet food additive

Vegetable oil is used in the production of some pet foods. AAFCO defines vegetable oil in this context as the product of vegetable origin obtained by extracting the oil from seeds or fruits which are processed for edible purposes.

Fuel

Vegetable oils are also used to make biodiesel, which can be used like conventional diesel. Some vegetable oil blends are used in unmodified vehicles but straight vegetable oil, also known as pure plant oil, needs specially prepared vehicles which have a method of heating the oil to reduce its viscosity. The use of vegetable oils as alternative energy is growing and the availability of biodiesel around the world is increasing.

The NNFCC estimates that the total net greenhouse gas savings when using vegetable oils in place of fossil fuel-based alternatives for fuel production, range from 18 to 100%.

Production

The production process of vegetable oil involves the removal of oil from plant components, typically seeds. This can be done via mechanical extraction using an oil mill or chemical extraction using a solvent. The extracted oil can then be purified and, if required, refined or chemically altered.

Mechanical extraction

Oils can be removed via mechanical extraction, termed "crushing" or "pressing." This method is typically used to produce the more traditional oils (e.g., olive, coconut etc.), and it is preferred by most "health-food" customers in the United States and in Europe. There are several different types of mechanical extraction. Expeller-pressing extraction is common, though the screw press, ram press, and ghani (powered mortar and pestle) are also used. Oilseed presses are commonly used in developing countries, among people for whom other extraction methods would be prohibitively expensive; the ghani is primarily used in India. The amount of oil extracted using these methods varies widely, as shown in the following table for extracting mowrah butter in India:

Method Percentage extracted
Ghani 20–30%
Expellers 34–37%
Solvent 40–43%

Solvent extraction

The processing of vegetable oil in commercial applications is commonly done by chemical extraction, using solvent extracts, which produces higher yields and is quicker and less expensive. The most common solvent is petroleum-derived hexane. This technique is used for most of the "newer" industrial oils such as soybean and corn oils. After extraction, the solvent is evaporated out by heating the mixture to about 300 °F (149 °C).

Supercritical carbon dioxide can be used as a non-toxic alternative to other solvents.

Hydrogenation

Oils may be partially hydrogenated to produce various ingredient oils. Lightly hydrogenated oils have very similar physical characteristics to regular soy oil, but are more resistant to becoming rancid. Margarine oils need to be mostly solid at 32 °C (90 °F) so that the margarine does not melt in warm rooms, yet it needs to be completely liquid at 37 °C (98 °F), so that it doesn't leave a "lardy" taste in the mouth.

Hardening vegetable oil is done by raising a blend of vegetable oil and a catalyst in near-vacuum to very high temperatures, and introducing hydrogen. This causes the carbon atoms of the oil to break double-bonds with other carbons, each carbon forming a new single-bond with a hydrogen atom. Adding these hydrogen atoms to the oil makes it more solid, raises the smoke point, and makes the oil more stable.

Hydrogenated vegetable oils differ in two major ways from other oils which are equally saturated. During hydrogenation, it is easier for hydrogen to come into contact with the fatty acids on the end of the triglyceride, and less easy for them to come into contact with the center fatty acid. This makes the resulting fat more brittle than a tropical oil; soy margarines are less "spreadable". The other difference is that trans fatty acids (often called trans fat) are formed in the hydrogenation reactor, and may amount to as much as 40 percent by weight of a partially hydrogenated oil. Hydrogenated oils, especially partially hydrogenated oils with their higher amounts of trans fatty acids, are increasingly thought to be unhealthy.

Deodorization

In the processing of edible oils, the oil is heated under vacuum to near the smoke point or to about 450 °F (232 °C), and water is introduced at the bottom of the oil. The water immediately is converted to steam, which bubbles through the oil, carrying with it any chemicals which are water-soluble. The steam sparging removes impurities that can impart unwanted flavors and odors to the oil. Deodorization is key to the manufacture of vegetable oils. Nearly all soybean, corn, and canola oils found on supermarket shelves go through a deodorization stage that removes trace amounts of odors and flavors, and lightens the color of the oil. However, the process commonly results in higher levels of trans fatty acids and distillation of the oil's natural compounds.

Occupational exposure

People can breathe in vegetable oil mist in the workplace. The U.S. Occupational Safety and Health Administration (OSHA) has set the legal limit (permissible exposure limit) for vegetable oil mist exposure in the workplace as 15 mg/m3 total exposure and 5 mg/m3 respiratory exposure over an 8-hour workday. The U.S. National Institute for Occupational Safety and Health (NIOSH) has set a recommended exposure limit (REL) of 10 mg/m3 total exposure and 5 mg/m3 respiratory exposure over an 8-hour workday.

Yield

Typical productivity of some oil crops, measured in tons (t) of oil produced per hectare (ha) of land per year (yr). Oil palm is by far the highest yielding crop, capable of producing about 4 tons of palm oil per hectare per year.

Crop Yield
(t/ha/yr)
Palm oil  4.0
Coconut oil  1.4
Canola oil  1.4
Soybean oil  0.6
Sunflower oil  0.6

Particular oils

The following triglyceride vegetable oils account for almost all worldwide production, by volume. All are used as both cooking oils and as SVO or to make biodiesel. According to the USDA, the total world consumption of major vegetable oils in 2007/08 was:

Oil source World consumption
(million metric tons)
Notes
Palm 41.31 The most widely produced tropical oil, also used to make biofuel
Soybean 41.28 One of the most widely consumed cooking oils
Rapeseed 18.24 One of the most widely used cooking oils, also used as fuel. Canola is a variety (cultivar) of rapeseed.
Sunflower seed 9.91 A common cooking oil, also used to make biodiesel
Peanut 4.82 Mild-flavored cooking oil
Cottonseed 4.99 A major food oil, often used in industrial food processing
Palm kernel 4.85 From the seed of the African palm tree
Coconut 3.48 Used in cooking, cosmetics and soaps
Olive 2.84 Used in cooking, cosmetics, soaps and as a fuel for traditional oil lamps

Note that these figures include industrial and animal feed use. The majority of European rapeseed oil production is used to produce biodiesel, or used directly as fuel in diesel cars which may require modification to heat the oil to reduce its higher viscosity.

Other significant oils include:

History

Such oils have been part of human culture for millennia. Oils such as poppy seed, rapeseed, linseed, almond oil, sesame seed, safflower, and cottonseed were variously used since at least the Bronze Age in the Middle East, Africa and Central Asia. Vegetable oils have been used for lighting fuel, cooking, medicine and lubrication. The Chinese started to use vegetable oil for stir-frying instead of animal fats during the Song dynasty (960–1279) Palm oil has long been recognized in West and Central African countries, and European merchants trading with West Africa occasionally purchased palm oil for use as a cooking oil in Europe and it became highly sought-after commodity by British traders for use as an industrial lubricant for machinery during Britain's Industrial Revolution. Palm oil formed the basis of soap products, such as Lever Brothers' (now Unilever) "Sunlight", and B. J. Johnson Company's (now Colgate-Palmolive) "Palmolive", and by around 1870, palm oil constituted the primary export of some West African countries.

In 1780 Carl Wilhelm Scheele demonstrated that fats were derived from glycerol. Thirty years later Michel Eugène Chevreul deduced that these fats were esters of fatty acids and glycerol. Wilhelm Normann, a German chemist introduced the hydrogenation of liquid fats in 1901, creating what later became known as trans fats, leading to the development of the global production of margarine and vegetable shortening.

In the USA cottonseed oil was developed, and marketed by Procter & Gamble as a creamed shortening – Crisco – as early as 1911. Ginning mills were happy to have someone haul away the cotton seeds. The extracted oil was refined and partially hydrogenated to give a solid at room temperature and thus mimic natural lard, and canned under nitrogen gas. Compared to the rendered lard Procter & Gamble was already selling to consumers, Crisco was cheaper, easier to stir into a recipe, and could be stored at room temperature for two years without turning rancid.

Soybean oil has been used in China since before historical records. It arrived in the US in the 1930s. Soybeans are protein-rich, and the medium viscosity oil rendered from them was high in polyunsaturates. Henry Ford established a soybean research laboratory, developed soybean plastics and a soy-based synthetic wool, and built a car "almost entirely" out of soybeans. Roger Drackett had a successful new product with Windex, but he invested heavily in soybean research, seeing it as a smart investment. By the 1950s and 1960s, soybean oil had become the most popular vegetable oil in the US; today it is second only to palm oil. In 2018–2019, world production was at 57.4 MT with the leading producers including China (16.6 MT), US (10.9 MT), Argentina (8.4 MT), Brazil (8.2 MT), and EU (3.2 MT).

The early 20th century also saw the start of the use of vegetable oil as a fuel in diesel engines and in heating oil burners. Rudolf Diesel designed his engine to run on vegetable oil. The idea, he hoped, would make his engines more attractive to farmers having a source of fuel readily available. Diesel's first engine ran on its own power for the first time in Augsburg, Germany, on 10 August 1893 on nothing but peanut oil. In remembrance of this event, 10 August has been declared "International Biodiesel Day". The first patent on Biodiesel was granted in 1937. Periodic petroleum shortages spurred research into vegetable oil as a diesel substitute during the 1930s and 1940s, and again in the 1970s and early 1980s when straight vegetable oil enjoyed its highest level of scientific interest. The 1970s also saw the formation of the first commercial enterprise to allow consumers to run straight vegetable oil in their vehicles. However, biodiesel, produced from oils or fats using transesterification is more widely used. It is Led by Brazil, many countries built biodiesel plants during the 1990s, and it is now widely available for use in motor vehicles, and is the most common biofuel in Europe today. In France, biodiesel is incorporated at a rate of 8% in the fuel used by all French diesel vehicles.

In the mid-1970s, Canadian researchers developed a low-erucic-acid rapeseed cultivar. Because the word "rape" was not considered optimal for marketing, they coined the name "canola" (from "Canada Oil low acid"). The U.S. Food and Drug Administration approved use of the canola name in January 1985, and U.S. farmers started planting large areas that spring. Canola oil is lower in saturated fats, and higher in monounsaturates. Canola is very thin (unlike corn oil) and flavorless (unlike olive oil), so it largely succeeds by displacing soy oil, just as soy oil largely succeeded by displacing cottonseed oil.

Used oil

A large quantity of used vegetable oil is produced and recycled, mainly from industrial deep fryers in potato processing plants, snack food factories and fast food restaurants.

Recycled oil has numerous uses, including use as a direct fuel, as well as in the production of biodiesel, livestock feed, pet food, soap, detergent, cosmetics, and industrial chemicals.

Since 2002, an increasing number of European Union countries have prohibited the inclusion of recycled vegetable oil from catering in animal feed. Used cooking oils from food manufacturing, however, as well as fresh or unused cooking oil, continue to be used in their animal feed.

Shelf life

Due to their susceptibility to oxidation from the exposure to oxygen, heat and light, resulting in the formation of oxidation products, such as peroxides and hydroperoxides, plant oils rich in polyunsaturated fatty acids have a limited shelf-life.

Product labeling

In Canada, palm oil is one of five vegetable oils, along with palm kernel oil, coconut oil, peanut oil, and cocoa butter, which must be specifically named in the list of ingredients for a food product. Also, oils in Canadian food products which have been modified or hydrogenated must contain the word "modified" or "hydrogenated" when listed as an ingredient. A mix of oils other than the aforementioned exceptions may simply be listed as "vegetable oil" in Canada; however, if the food product is a cooking oil, salad oil or table oil, the type of oil must be specified and listing "vegetable oil" as an ingredient is not acceptable.

From December 2014, all food products produced in the European Union were legally required to indicate the specific vegetable oil used in their manufacture, following the introduction of the Food Information to Consumers Regulation.

Inequality (mathematics)

From Wikipedia, the free encyclopedia https://en.wikipedia.org/wiki/Inequality...