
Analysing gene annotations in R or Python

Welcome! This tutorial has two aims. The first is about programming, specifically the type of programming we often have to do in bioinformatics: handling large amounts of data, making sense of that data, computing statistics, and plotting. The second is to make you familiar with an important type of genomic data: gene annotation files, what they contain, and how to interpret them. The ultimate aim is for you to put together an analysis of genes across a range of species of interest.

Note

You can also try the command-line genes tutorial first.

To begin, go to the introduction.

Table of contents

  1. Introduction
  2. What gene annotation data looks like
  3. Getting started writing some code
  4. Extracting attributes
  5. Testing it out
  6. A quick recap (how did we do it?)
  7. Packaging up our code
  8. Writing a useful command-line utility
  9. Challenges