Some population-genetic statistics
If you followed the basic simulation section, you'll know that random inheritance patterns in the population cause some haplotypes to be lost over time and others to increase in frequency. As a result, genetic diversity is lost.
Key popgen metrics
Let's use $h_i$ to mean the haplotype of individual $i$, and $h_{il}$ to mean the allele carried by haplotype $i$ at SNP $l$. And $\mathbb{1}(x)$ will denote the indicator function, which is $1$ or $0$ according to whether the condition $x$ is true or false.
Two key measures of diversity are:
- The heterozygosity $H$. This is the probability that two individuals drawn at random carry different haplotypes.
- The nucleotide diversity. This is often denoted $\pi$, and is the average number of genotype differences between two samples, where the average is taken over all pairs of samples in the data.
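
To make the two definitions concrete, here is a minimal Python sketch that computes both from a small set of haplotypes. It makes some assumptions that are not stated above: haplotypes are stored as tuples of 0/1 alleles, one entry per SNP, and both averages are taken over distinct pairs of haplotypes (that is, sampling without replacement). The function names and the toy data are hypothetical and only for illustration.

```python
from itertools import combinations

def heterozygosity(haplotypes):
    """Proportion of distinct pairs whose haplotypes differ at one or more SNPs."""
    pairs = list(combinations(haplotypes, 2))
    # a != b plays the role of the indicator function: True (1) if the pair differs
    return sum(a != b for a, b in pairs) / len(pairs)

def nucleotide_diversity(haplotypes):
    """Mean number of SNPs at which two haplotypes differ, over all distinct pairs."""
    pairs = list(combinations(haplotypes, 2))
    # For each pair, count the SNPs where the two alleles differ
    differences = [sum(x != y for x, y in zip(a, b)) for a, b in pairs]
    return sum(differences) / len(pairs)

# Toy data (made up): 4 haplotypes typed at 4 SNPs
haplotypes = [
    (0, 0, 1, 0),
    (0, 0, 1, 0),
    (1, 0, 1, 0),
    (1, 1, 0, 0),
]

print("heterozygosity:", heterozygosity(haplotypes))
print("nucleotide diversity:", nucleotide_diversity(haplotypes))
```

In this toy example five of the six pairs carry different haplotypes, so the heterozygosity is 5/6, and the pairs differ at 10 SNPs in total, so the nucleotide diversity is 10/6. If pairs were instead drawn with replacement, the denominators would change and both values would be slightly smaller.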