Skip to main content

Genome-wide association with quantitative antibody levels

Author: Amanda Chong

In this tutorial, you will be exploring a dataset simulated from data in the 1000 Genomes Project. You will use this data to test for an association between genotype and antibody response to norovirus in an admixed population derived from the Americas.

Noroviruses are a leading cause of viral gastroenteritis, and are thought to be responsible for over 200,000 deaths/year, with the majority in children under 5 years of age.

The antibody response is a quantitative phenotype, so you will have to consider things like the best scale, or normalisation, of the phenotype to run the test.

The 1000 Genomes Project

The 1000 Genomes Project ran between 2008 and 2015, and generated genotype and sequence data from ~2500 individuals from around the world. At the time it was the largest single publicly available catalogue of human genomic data. The final dataset contains data from populations spanning Europe, East Asia, South Asia, Africa, and the Americas:

img

Importantly, the 1000 Genomes Project data is open-access, so that you can freely use it. For more on the 1000 Genomes Project, read the 1000 Genomes Phase 3 paper or see the [https://www.internationalgenome.org].

To get started, go and get the data.