Research group

Haerty Group

Exploring how genetic diversity influences health and resilience

Group activities.

Functional genetic elements, whether encoding proteins or not, represent about 10% of the human genome - at least as we have thus far identified them by their evolutionary footprints.  We aim to tackle the evolution of functional elements with a specific focus on those that are non-protein-coding, as there is mounting evidence that these elements are enriched for disease and trait-associated variants. 

Our work is primarily computational, focusing on comparative genomics and population genetics as well as data integration approaches to identify putative functional elements, characterise their evolution and assess the potential impact of their disruption.

One part of our work aims at identifying long (>200 nucleotides) non-coding RNAs (lncRNAs) and characterising their potential biological relevance and function, and the impact of mutations within these loci. Tens of thousands of long non-coding RNAs have now been annotated and are in general associated with gene expression regulation. So far the molecular function of a small fraction of these loci have been characterised, and mutations within several of these loci lead to lethal or developmental phenotypic changes at the cellular but also at the organismal level. However the function and the biological relevance of the vast majority of these loci remain to be discovered. As a consequence of the rapid evolution of these non-coding loci, our work primarily focuses on the patterns of sequence and expression variation among populations and closely related species. We are also investigating the impact of induced mutations within selected lncRNAs, aiming to quantify the in-vivo fitness impact of knocking down these loci.

The comparative analysis of Vertebrate genomes is another important focus of our work; improving human genome annotation and allowing the discovery of functional elements within it. However, little has been done towards discovering how these functional elements evolved. We are using comparative genomics to identify not only pan-mammalian conserved functional elements but also lineage and species-specific elements and how they relate to a lineage or species trait. 

Operations

Sarah Cossey

Chief Operating Officer
Image
Sarah Cossey, Chief Operating Officer

Biography

Personal Pronouns: She/Her

Contact details:

+44 (0) 1603 450 787

sarah.cossey@earlham.ac.uk

sjc0808

Sarah Cossey

 

Sarah is a professional accountant and project manager with extensive experience across the public and private sector. Sarah spent nine years in finance and audit within local government and the NHS. She then moved to the Cabinet Office as part of the Senior Management Team of the Office of Public Sector Information (OPSI) in 1998 to lead the finance and business support team.

Prior to joining the Earlham Institute, Sarah spent three years managing a large team within the private sector, in a regional media company, leading a shared finance service through significant change. Sarah is a qualified PRINCE II project manager and has managed projects including the merger of two government departments, the implementation of new IT systems and major organisational change. Sarah has successfully delivered considerable efficiency savings, both within the public and private sectors, and improved service delivery.

Sarah has been at EI since 2011, taking up her current position in 2014. She is a member of the EI Executive Team, and has responsibility for the operational support and oversight of the capabilities at EI. This includes the advanced training, communications and public engagement, knowledge exchange and industrial engagement and corporate services delivery.

 

Operations Division.

Sarah holds the responsibility for the Operations Division and has led EI through many development projects to improve its infrastructure and facilities. The Operations Division comprises the Communications, Business Development & Impact, Advanced Training, and Business Support teams.

 

NBRI

Chris Watkins

Project Management Team Leader
Image
Chris Watkins, Profile Picture

Biography

Contact details:

  • +44 (0) 1603 450 076 

chris.watkins@earlham.ac.uk

@CJwatkins83

ORCiD

I lead the Project Management Team within the Technical Genomics Group which is primarily responsible for capturing new projects into the high-throughput NGS pipeline including customer prospecting and nurturing, project strategising and pricing, as well as ensuring efficient project delivery. I am a PRINCE2 accredited Project Manager and sit on the EI Quality Assurance (QA) team which works towards maintaining EI's ISO9001 QA accreditation status. 

Before joining EI, I completed my PhD in Speech Recognition and Language Modelling at UEA in 2010, before joining the Centre for Visual Computing at the University of Bradford to help grow its research and commercial activity through project management, interactions with local industry, and grant writing.

This led to two successful JISC funded projects worth £850k to digitise human skeletal remains using 3D laser scanning technology in collaboration with the Archaeological Department.

Research

Felix Shaw

Research Software Engineer
Image

Biography

Contact details:

  • +44 (0) 1603 450 090 

felix.shaw@earlham.ac.uk

 

I'm a researcher working on the COPO (Collaborative Open Plant Omics) platform; a tool for creating, depositing and searching for bundles of research objects. 

I hold a First Class Honours degree in Computer Science and gained my PhD from the University of East Anglia in 2015, studying facial animation, computer graphics, signal processing and machine learning. 

My current interests are in the fields of data science, open source and how to apply machine learning techniques in these areas to improve the tools available to researchers. I have industry experience working as a web developer in several start-ups and three years experience teaching to undergraduate level.

Research group

ELIXIR-UK

Coordinating data, tools, training and standards across the UK

Group activities.

ELIXIR is a Europe wide data infrastructure for the life sciences. It aims to facilitate sharing of life science data by integrating data sources and tools under a unified set of standards and provide computing infrastructure and the training needed to make these resources accessible to bioinformaticians and life scientists.

EI is the Lead Institute for ELIXIR in the UK. As such it forms the bridge between the UK’s data infrastructure falling under ELIXIR’s platforms (Data, Tools, Compute, Standards and Training) and ELIXIR Hub. 

ELIXIR operates a hub and nodes model with the Hub being located at Hinxton. EI hosts the office coordinating the UK Node, bringing UK based resources into the wider project and providing a nucleating centre for UK data, tools, standards and training provision. 

EI’s ELIXIR activities are currently headed by Professor Neil Hall, who is supported by Ariadna Miquel Clopés (Research Administrator), Xenia Perez Sitja (Communications and Community Manager) and Nicola Soranzo (Technical Co-ordinator).

ELIXIR-UK is a distributed network co-led by Prof Neil Hall and Prof. Carole Goble at the University of Manchester. ELIXIR-UK is supported by three Research Councils: BBSRC, MRC and NERC. It was funded in 2014 to provide training coordination both across ELIXIR and in the UK.

As well as playing an active leading role in the UK’s ELIXIR contributions, the ELIXIR group at EI also contributes to the development of UK data infrastructure through participation in strategic initiatives and by making contributions in infrastructure development, for example by hosting major activity in BioJS development.

Research

Jon Wright

Bioinformatician
Image
Jon Wright, Profile Picture

Biography

Personal Pronouns: He/Him

Contact details:

  • +44 (0) 1603 450 986 

 jon.wright@earlham.ac.uk

@jonwright99

ORCiD

As a Bioinformatician in Anthony Hall’s group, I’m particularly interested in genome assembly and am currently contributing to the Delivering Sustainable Wheat and Decoding Biodiversity Institute Strategic Programmes. 

As part of these programmes, I am assembling the genomes of important UK wheat varieties and historical lines from the Watkins bread wheat landrace collection utilising the latest sequencing technologies available at the Earlham Institute such as PacBio, Nanopore and HiC. These high-quality genomes will allow us to identify structural rearrangements and introgressions that underlie important agronomic traits in these lines, and will also contribute to the wheat pangenome and pangenome-related tools for wheat.

I also collaborate with researchers at Rothamsted Research and the European Reference Genome Atlas (ERGA) to generate a genome assembly of Alopecurus aequalis, a problematic agricultural weed, to understand the genetic basis of weediness.

I joined the Earlham Institute in 2010 to generate assemblies of the 42 chromosome arms of hexaploid wheat for the chromosome-based draft sequence within the International Wheat Genome Sequencing Consortium. Since then, I have been involved in several collaborations to generate genome assemblies for important wheat lines from the UK and around the world using our w2rap pipeline.

My first degree was in Chemistry, after which I completed a MSc in Information Technology before working in the commercial software industry for several years. 

I received my PhD in Computational Biology from the John Innes Centre in 2010 where I worked on the model grass Brachypodium distachyon and comparative genomics in the grass family.

Research

Tilapia Genomic Resources

Developing genomic resources in wild and farmed tilapia for the benefit of the aquaculture industry globally and in Africa.

Project Summary.

Previous Funders:

  • UKRI Natural Environment Research Council (NERC)
  • UKRI Biotechnology and Biological Sciences Research Council (BBSRC) GCRF Data and Resources

 

Current Funders:

  • UKRI Biotechnology and Biological Sciences Research Council (BBSRC) 
  • EU Development Fund

 

Most of the world's fisheries are overexploited and it seems inevitable that fish-farming will largely replace fishing, just as livestock farming has replaced hunting as the main source of food from land animals. Unfortunately, many farmed fish, such as salmon, are themselves predators and need to be fed on fish meal. 

Global tilapia production hit 6 million tonnes for the first time in 2020 and is now a  $7.6 billion dollar industry,  highlighting the huge growth in the freshwater aquaculture industry. However, that growth is threatened by a number of factors - including disease and arising pathogens, unwanted hybridization with wild fish, and climate change.

Most tilapia production is based on a handful of strains, but there are more than 50 wild species throughout Africa which could harbour valuable genes for growth, disease resistance, temperature & salt tolerance etc. Many tilapia will hybridize readily, so that the natural genetic traits could easily be bred into farmed strains without the need for GM technology. However, this feature also renders them vulnerable to genetic swamping by stocking with alien farmed strains into natural water bodies, a practice now widespread in Africa..

In this project we are establishing the needed genomic resources to enable the study of genomic diversity of wild tilapia populations, including past and ongoing gene flow between populations and across species. These species present different degrees of environmental adaptation for salinity, temperature, oxygen content. Using comparative genomics and system biology approaches we aim to investigate, characterise, and understand the genomic bases associated with those traits of interest. In close collaboration with WorldFish, we are generating similar high quality genomic resources for elite farmed strains of Nile tilapia with the aim of enabling future selective breeding for traits such as pathogen and environmental resilience.

 

Image above courtesy of WorldFish.

Impact statement.

Tilapia are now the third most important freshwater species in aquaculture. Our work in close collaboration with National Fisheries organisations in Uganda, Kenya, and Tanzania aims at identifying the genetics status of existing stocks in water bodies of East Africa countries.

We will disseminate our research and deliver relevant bioinformatics training for local staff and research students through our partners in Uganda, Kenya, and Tanzania. Our work in close collaboration with WorldFish will lead to enhanced genomic resources for elite strains enabling further selective breeding.

Our activities will lead to improved research and monitoring and ultimately to enhancement of the aquaculture industry. We believe that this research will be of great interest to students and the general public in the UK and internationally and we will carry out a variety of dissemination activities through a range of media. Furthermore, the genome sequence information will be publicly available to future researchers, benefiting the wider academic community interested in research themes as diverse as selective breeding, fish health, and evolutionary biology.