Some useful probability distributions

In practice in statistics we often use a set of well-known distributions. These have specific mathematical forms that come with parameters to make them flexibly useful.

Here are some of the commonly-used ones:

| Name | Domain | Expression and R function | Parameters | Explanation |
|---|---|---|---|---|
| Binomial | $x \in \{0, \cdots, n\}$ | ${n \choose x} p^x (1-p)^{n-x}$ (dbinom() in R) | Number of 'trials' $n$; 'success probability' $p$ | How many 'successes' from $n$ trials? |
| Normal or Gaussian | $x \in \text{Real numbers}$ | $\frac{1}{\sqrt{2\pi v}} e^{-\frac{1}{2}\frac{(x-\mu)^2}{v}}$ (dnorm() in R) | Mean $\mu$; variance $v$ | Ubiquitously useful |
| Beta | $x \in [0,1]$ | $\frac{1}{B(\alpha,\beta)} x^{\alpha-1} (1-x)^{\beta-1}$ (dbeta() in R) | 'Shape' parameters $\alpha$ and $\beta$ | E.g. allele frequency estimates |

If you don't understand the maths above, don't worry. You can understand these distributions by plotting what they look like as we'll do below.

Normalising constants

Many of these mathematical expressions have a complicated-looking bit at the front that doesn't depend on $x$. For example, the normal distribution has this bit:

$$\frac{1}{\sqrt{2\pi v}}$$

while the beta distribution has this bit:

$$\frac{1}{B(\alpha,\beta)}$$

(here $B()$ is the 'beta function').

These bits can look complicated, but they don't depend on $x$. In fact, they are just normalising constants: their purpose is to ensure the distribution sums (or, for a continuous distribution like these two, integrates) to $1$ over all the possible values of $x$.
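As a rough numerical check of this idea, the sketch below uses base R's integrate() with arbitrarily chosen parameter values (the variable names are made up for illustration). It integrates the normal density over the whole real line, and integrates the beta expression without its leading constant to show what that constant has to cancel out:

```r
# Arbitrary example parameters (assumptions for illustration only)
mu <- 1
v <- 2
alpha <- 2
beta_param <- 5

# Total area under the normal density, over the whole real line
normal_total <- integrate(
  function(x) dnorm(x, mean = mu, sd = sqrt(v)),
  lower = -Inf, upper = Inf
)$value

# The beta expression *without* the constant at the front
unnormalised_beta <- function(x) x^(alpha - 1) * (1 - x)^(beta_param - 1)
kernel_total <- integrate(unnormalised_beta, lower = 0, upper = 1)$value

# Dividing by B(alpha, beta) rescales the total area
beta_total <- kernel_total / beta(alpha, beta_param)

print(normal_total)
print(kernel_total)
print(beta_total)
```

Both totals should come out numerically very close to 1, while kernel_total on its own generally does not.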

Question. However, the expression ${n \choose x}$ in front of the binomial isn't a normalising constant in the same way - why not?

Binomial distribution

Challenge

Pick a number of trials $n$ (start between 5 and 20) and a probability $p$ (start between 0.1 and 0.9). Then plot the binomial distribution over the range of integers $x = 0, 1, 2, \cdots, n$.

The binomial distribution is given by dbinom() in R, and can be used like this:

dbinom( x, size = n, prob = p )

How does the shape of the binomial differ as you vary $n$ and $p$?
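One possible way to get started is sketched below, using base R graphics. The values of n and p are arbitrary picks from the suggested ranges, and the variable names are only illustrative:

```r
# Example values (arbitrary choices within the suggested ranges)
n <- 10
p <- 0.3

# The binomial is defined on the integers 0, 1, ..., n
x <- 0:n
probabilities <- dbinom(x, size = n, prob = p)

# type = "h" draws vertical bars, which suits a discrete distribution
plot(
  x, probabilities,
  type = "h", lwd = 3,
  xlab = "Number of successes (x)",
  ylab = "Probability",
  main = sprintf("Binomial distribution, n = %d, p = %.1f", n, p)
)
```

Re-running this with different values of n and p is a quick way to explore how the shape changes.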

Note. The expression for the binomial distribution is:

$$x|n,p \sim {n \choose x} p^x (1-p)^{n-x}$$

Here ${n \choose x}$ means 'n choose x' - the number of ways of choosing x things from n things - which can be computed using choose(n,x) in R. For extra kudos, plot this using your own function binomial(x, n, p) implementing the above formula.
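If you get stuck, here is a minimal sketch of what such a function might look like, checked against dbinom(). It is only one way to write it. Note that defining a function called binomial() masks R's built-in binomial() family function (used with glm()) for the rest of the session:

```r
# One possible implementation of the formula above.
# Caution: this masks stats::binomial() in the current session.
binomial <- function(x, n, p) {
  choose(n, x) * p^x * (1 - p)^(n - x)
}

# Example values (arbitrary choices)
n <- 10
p <- 0.3
x <- 0:n

# Compare with R's built-in version; should print TRUE if they agree
agrees <- isTRUE(all.equal(binomial(x, n, p), dbinom(x, size = n, prob = p)))
print(agrees)
```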

Normal distribution

Challenge

Pick a mean value $\mu$ (start somewhere between $-10$ and $10$) and a variance $v$ (which must be positive - for example, $2$ is a good starting choice). Then plot the density of the normal distribution over the continuous range $x = -20 \cdots 20$.

Note. The normal distribution density is given by dnorm() in R, but you have to specify the standard deviation (i.e. the square root of the variance) instead of the variance:

dnorm( x, mean = mu, sd = sqrt(v) )

How does the distribution differ as you vary $\mu$ and $v$?
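A possible starting point is sketched below. Because $x$ is continuous, the sketch evaluates the density on a fine grid of points and joins them with a line. The grid spacing and the parameter values are arbitrary choices:

```r
# Example values (arbitrary choices within the suggested ranges)
mu <- 0
v <- 2

# A fine grid of points standing in for the continuous range -20 to 20
x <- seq(from = -20, to = 20, by = 0.01)

# dnorm() takes the standard deviation, so convert from the variance
density_values <- dnorm(x, mean = mu, sd = sqrt(v))

# type = "l" joins the points with a line
plot(
  x, density_values,
  type = "l",
  xlab = "x",
  ylab = "Density",
  main = sprintf("Normal distribution, mean = %g, variance = %g", mu, v)
)
```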

For extra kudos, ignore dnorm() and write your own function normal() to compute this based on the normal distribution density formula:

$$x|\mu,v \sim \frac{1}{\sqrt{2\pi v}} \cdot e^{-\frac{1}{2}\frac{(x-\mu)^2}{v}}$$
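If you get stuck, a minimal sketch of such a function is given below, checked against dnorm(). Note the minus sign in the exponent, as in the expression in the table at the top of this page. The argument names are just one reasonable choice:

```r
# One possible implementation of the normal density,
# parameterised by the mean (mu) and the variance (v)
normal <- function(x, mu, v) {
  1 / sqrt(2 * pi * v) * exp(-0.5 * (x - mu)^2 / v)
}

# Example values (arbitrary choices)
mu <- 0
v <- 2
x <- seq(from = -20, to = 20, by = 0.01)

# Compare with R's built-in version; should print TRUE if they agree
agrees <- isTRUE(all.equal(normal(x, mu, v), dnorm(x, mean = mu, sd = sqrt(v))))
print(agrees)
```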

Beta distribution

Challenge

Pick 'shape' parameters $\alpha$ and $\beta$ (make them between 1 and 10 to start) and plot the beta distribution:

$$x | \alpha,\beta \sim \frac{1}{B(\alpha,\beta)} x^{\alpha-1} (1-x)^{\beta-1}$$

over the (continuous) range $x = 0 \cdots 1$.

Note. The beta distribution is implemented as dbeta() in R, but you have to use "shape1" and "shape2" instead of $\alpha$ and $\beta$:

dbeta( x, shape1 = alpha, shape2 = beta )

How does the shape vary as you change the parameters? What happens if they are less than 1?
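A possible starting point is sketched below, again evaluating the density on a fine grid. The parameter values are arbitrary, and the second shape parameter is stored in a variable called beta_param (a made-up name) simply to avoid confusion with R's beta() function:

```r
# Example values (arbitrary choices within the suggested ranges)
alpha <- 2
beta_param <- 5

# A fine grid of points standing in for the continuous range 0 to 1
x <- seq(from = 0, to = 1, by = 0.001)
density_values <- dbeta(x, shape1 = alpha, shape2 = beta_param)

plot(
  x, density_values,
  type = "l",
  xlab = "x",
  ylab = "Density",
  main = sprintf("Beta distribution, alpha = %g, beta = %g", alpha, beta_param)
)
```

When trying parameters below 1, bear in mind that the density can be infinite at $x = 0$ or $x = 1$, so it may help to start the grid slightly inside the interval.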

For extra kudos, ignore dbeta() and write your own function beta_distribution() to compute this based on the beta distribution density formula:

$$x | \alpha, \beta \sim \frac{1}{B(\alpha,\beta)} x^{\alpha-1} \cdot (1-x)^{\beta-1}$$

(You can use the beta() function to compute the value $B(\alpha,\beta)$ in the denominator.)
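If you get stuck, a minimal sketch of such a function is given below, checked against dbeta(). The second argument is called beta_param (an illustrative name) so that it is not confused with the beta() function used for the normalising constant:

```r
# One possible implementation of the beta density
beta_distribution <- function(x, alpha, beta_param) {
  x^(alpha - 1) * (1 - x)^(beta_param - 1) / beta(alpha, beta_param)
}

# Example values (arbitrary choices)
alpha <- 2
beta_param <- 5
x <- seq(from = 0.01, to = 0.99, by = 0.01)

# Compare with R's built-in version; should print TRUE if they agree
agrees <- isTRUE(all.equal(
  beta_distribution(x, alpha, beta_param),
  dbeta(x, shape1 = alpha, shape2 = beta_param)
))
print(agrees)
```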