Bayes' theorem is the central theorem of Bayesian statistics and relates joint probability distributions with conditional and marginal distributions. For two random variables and , Bayes' theorem states
The notable consequence of this theorem is that it allows one to invert a conditional probability into a joint probability, which makes it possible to determine the probability of the cause of an event from its effect.
For parameter estimation#
Bayes' theorem can be adapted to be used specifically for Parameter estimation. In this form, it states that the posterior distribution, or simply the Posterior, is given by
where is the measured data, represents the model in use (a statistical distribution) and are the parameters of that model (e.g., mean and variance for the distribution). Each term in the equation has its own interpretation:
- is the likelihood function, simply called Likelihood.
- is the prior knowledge, or simply Prior, which quantifies what we know about the model before measuring the data.
- is the evidence (marginalized likelihood), computed by integrating the likelihood over all possible parameter values the posterior can take.
- The posterior distribution , or Posterior, represents the probability that the model parameters take a certain value. In other words, the posterior combines our prior knowledge with the insights gained from measurements (the likelihood).