The arithmetic mean is a function of a set of numbers that gives the sum of all the numbers divided the count of numbers. Calling the numbers, the arithmetic mean is
It is a measure of central tendency.
In statistics#
The arithmetic mean commonly appears in statistics as a descriptive statistic of a sample, both in the sense random variables and data points. Despite its simplicity, it benefits from remarkable properties that make it a commonly employed statistic.
Consider a sample of independent random variables . The sample mean is
This quantity is a function of random variables and hence follows its own probability distribution with its own true mean and true variance . Estimating these quantities through the sample mean is rather straightforward. The expected value is simple:
In other words, the sample (arithmetic) mean is an unbiased estimator of the true mean. The variance of this estimator is:
by using properties of the variance for independent variables. If the are iid, then and so
If they are not, we can define the average population variance
for which
This form goes back to the previous one for iid variables. These variances tell us how precise the estimate is.
These formulas are true regardless of the probability distributions of and they become particularly useful after remembering that the arithmetic mean obeys the law of large numbers:
In other words, as the number of random variables (in practice, data points) grows large, the sample mean becomes a progressively better (more precise) estimator of the true mean.1 Moreover, when rescaled to , then sample mean's distribution approaches a Gaussian distribution , as proven by the central limit theorem.
Footnotes#
-
You can also see this due to how when . ↩