Open Data graphic showing lines of binary code and an open padlock

Open and FAIR Data at EI

We are proponents of open science and actively contribute to the development of tools and standards that ensure scientific data is FAIR.

Open Science at EI

Open science describes a set of principles held and practices followed with the aim of making all scientific research - including publications, data, and tools - freely accessible to everyone.

The Earlham Institute is a strong advocate of open science, both as a publicly-funded organisation and in recognition of the huge benefits that come from unimpeded access. These benefits include increased efficiency, greater quality and integrity of science, improved transfer of knowledge, innovation across sectors, and the opportunity for everyone to be engaged with science.

Wherever possible, we publish in open-access journals, make our software and tools available as open source on platforms such as GitHub, and use standardised metadata to improve both reproducibility and clarity for other researchers wanting to use our datasets.

The Institute is a leader in data-intensive bioscience. An ethos of open science provides transparency to people about the data we generate or collect, the methodology used, and the analysis of the results. 

Four icons representing the practices of FAIR and open data practices

Making data FAIR

Modern science involves the generation of vast amounts of data. The ability to collect, store, and interrogate more and more data is empowering us to ask new research questions and make discoveries that would have previously been unimaginable.

There are inherent challenges, however, with scientists around the world generating, labelling, storing, and sharing all of this data in different ways - not to mention the sheer quantity of data itself. 

Without a universal approach, data can’t be reliably accessed, understood, or integrated by others who might benefit from it.

FAIR approaches provide a way to manage this by making data Findable, Accessible, Interoperable, and Reusable.

The Earlham Institute develops FAIR compliant metadata standards and tools for the community to adopt as a way of maximising the potential use and impact of the data we generate.

Our experts also play an active role in the community, contributing to discussion and debate.

For example, the Earlham Institute is the coordinating hub of ELIXIR-UK, the UK node of ELIXIR - an international organisation helping researchers to manage and analyse increasing volumes of data. 

The pan-European project provides the infrastructure for integrating life sciences data across the continent with the aim of facilitating the linking of data worldwide. 

Member organisations collectively provide platforms and guidance for research data management, reproducible data analysis, FAIR data and software management, and related services and standards. 

The UK Node leads many ELIXIR communities and focus groups, and participates in European consortia and international standards organisations.

Supported by UKRI’s Infrastructure Fund and Digital Research Infrastructure Programme, BioFAIR aims to establish a cohesive, UK-wide digital research infrastructure that bridges current gaps between researchers, digital research technical professionals, existing institutional digital research infrastructures, and the funder-community partnership.

BioFAIR will be a catalyst for innovation and discovery and over its five-year life span will:

  • accelerate the adoption of findable, accessible, interoperable and reusable (FAIR) data principles across the UK life sciences, making it more useful and valuable to researchers than ever before
  • unify the UK’s currently fragmented digital research landscape, fostering unprecedented opportunities for collaboration and coordination among the national life sciences community
  • break down barriers to democratise data accessibility, giving UK researchers the resources and autonomy needed for innovation and discovery to flouris
  • coordinate and deliver extensive training and support for practitioners at all levels, building critical workforce capacity and securing the UK's position as a global leader in life sciences.

Fundamental to the BioFAIR concept are its four capabilities - data commons, method comms, community centre, and knowledge centre - with each one driving a key component of the project.

Unleashing the power of data

The Earlham Institute has developed a range of freely-accessible services and open source software tools for the bioscience community, including Collaborative Open Omics (COPO) and Grassroots, as well as contributing code and tools to the Galaxy project.