Event Scientific training

Genome Annotation Workshop 2026

The Genome Annotation Workshop is designed to provide scientists with a comprehensive overview of eukaryotic genome annotation approaches.

Start date: 28 April 2026
End date: 30 April 2026
Time: 09.30 - 16.00
Venue: Online (via Zoom)
Organiser: David Swarbreck
Enquiries:

training@earlham.ac.uk

 

Cost: TBA

About the event

The value of a genome assembly hinges on its annotation quality. However, structural annotation remains challenging due to variability in genome size, complexity, and the diversity and quality of supporting data.

Accurate annotation is essential for downstream analyses — variation detection, comparative genomics, functional genomics, systems biology, genome editing, and synthetic biology all rely on accurate gene models.

The Earlham Institute has developed a number of tools and pipelines to support high-quality annotation, including:

  • Mikado (PMID: 30052957): Scores and selects transcript models from multiple assembly sources to create reliable gene annotations.
  • Portcullis (PMID: 30418570): Improves splice-junction accuracy for high-confidence gene structures.

Both were foundational in the wheat genome annotation by the IWGSC (PMID: 30115783).

  • REAT (Robust and Extendable Eukaryotic Annotation Toolkit) and minos:

These pipelines integrate Mikado and Portcullis along with evidence-guided prediction tools to produce high-quality annotations. 

We have used them to support projects across plants, fungi, insects, protists, and fish within consortia such as the European Reference Genome Atlas (ERGA) and Darwin Tree of Life.

In this 3-day virtual course, you will:

  • Get an overview of Next Generation Sequencing technologies relevant to genome annotation, gaining a deeper understanding of the benefits of each platform.
  • Understand what to look for in a sample that will pass quality control and that will likely succeed in producing viable sequencing data, including how to assess the quality of RNA-Seq data.
  • Learn about reference guided transcriptome assembly and steps for processing short and long read data.
  • Learn about alternative approaches for annotating protein coding genes in eukaryotic species utilizing transcriptome and homology data, via projection and evidence guided gene prediction. Discuss the challenges of annotation in different contexts.
  • Hands on experience of annotation tools including tools and pipelines developed at the Earlham Institute. You will build gene models, explore the use of combiners/choosers for integrating alternative gene predictions and assess the accuracy of different annotation tools.

Who is this training for?

Advanced PhD students and post-doctoral researchers who are undertaking projects involving annotating a genome assembly and looking to improve your awareness of different approaches and pipelines.

Prerequisites

You are expected to have experience with using the command line, and should be comfortable using the functions covered in the Software Carpentry lesson, The Unix Shell. We suggest you refresh your memory of these lessons if needed.

Register today.

Participation: Open to all

Register your interest in attending Genome Annotation Workshop 2026 today.

Research

Fred Sangol Uche

Research Software Engineer
Image
Profile of Fred Uche

Biography

Contact details:

fred.uche@earlham.ac.uk

Fred Sangol Uche


 

Fred is a Research Software Engineer working on the Grassroots project within the Technology Algorithms Group (Leggett Group).

Prior to joining the Earlham Institute, Fred worked at Oxford Brookes University as an Application Developer, where he was responsible for maintaining the Student Record System and developing new functionalities to support and enhance academic operations.

Previously, Fred worked as a Developer at the Medical Research Council Unit The Gambia, part of the London School of Hygiene & Tropical Medicine. One of the key projects he contributed to was the development of a Demographic Surveillance System, which facilitated the collection and analysis of health demographic data—enabling researchers to explore and address critical public health questions.

He also lectured in the Computer Science department at the University of The Gambia, delivering computer programming modules and supporting students in their project work.

Fred holds a BSc in Computer Science from the University of The Gambia and an MSc in Information Systems Development from Gaston Berger University in Senegal.

BioFAIR Hub

Nishadi De Silva

BioFAIR Senior Technical Project Manager
Image
Profile of Nishadi de Silva

As the Senior Technical Project Manager for BioFAIR, Dr Nishadi De Silva coordinates cross-disciplinary projects to advance FAIR practices and build a unified digital research infrastructure for the life sciences in the UK. 

This involves working closely with researchers, software engineers, delivery partners and stakeholders to align workflow and data management strategies to meet the demands within the UK.

Prior to this, Nishadi was at EMBL-EBI where she lead the delivery of three bioinformatics resources (Ensembl Bacteria, Ensembl Protists, Ensembl Fungi) and coordinated data management for cross-instituitional projects such as TREC. 

Before then, Nishadi worked in the Pathogen Informatics team at the Wellcome Sanger Institute and completed her postdoctoral research at the University of Southampton, UK.

Nishadi holds a BSc and PhD in Computer Science from the University of Southampton, UK.