The European Nucleotide Archive in 2023

Author:

Yuan David1ORCID,Ahamed Alisha1,Burgin Josephine1ORCID,Cummins Carla1ORCID,Devraj Rajkumar1,Gueye Khadim1,Gupta Dipayan1,Gupta Vikas1ORCID,Haseeb Muhammad1,Ihsan Maira1,Ivanov Eugene1,Jayathilaka Suran1,Kadhirvelu Vishnukumar Balavenkataraman1,Kumar Manish1,Lathi Ankur1ORCID,Leinonen Rasko1ORCID,McKinnon Jasmine1ORCID,Meszaros Lili1,O’Cathail Colman1ORCID,Ouma Dennis1,Paupério Joana1ORCID,Pesant Stephane1,Rahman Nadim1,Rinck Gabriele1ORCID,Selvakumar Sandeep1,Suman Swati1,Sunthornyotin Yanisa1,Ventouratou Marianna1,Vijayaraja Senthilnathan1,Waheed Zahra1,Woollard Peter1,Zyoud Ahmad1,Burdett Tony1ORCID,Cochrane Guy1ORCID

Affiliation:

1. European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus , Hinxton, Cambridge CB10 1SD, UK

Abstract

Abstract The European Nucleotide Archive (ENA; https://www.ebi.ac.uk/ena) is maintained by the European Molecular Biology Laboratory's European Bioinformatics Institute (EMBL-EBI). The ENA is one of the three members of the International Nucleotide Sequence Database Collaboration (INSDC). It serves the bioinformatics community worldwide via the submission, processing, archiving and dissemination of sequence data. The ENA supports data types ranging from raw reads, through alignments and assemblies to functional annotation. The data is enriched with contextual information relating to samples and experimental configurations. In this article, we describe recent progress and improvements to ENA services. In particular, we focus upon three areas of work in 2023: FAIRness of ENA data, pandemic preparedness and foundational technology. For FAIRness, we have introduced minimal requirements for spatiotemporal annotation, created a metadata-based classification system, incorporated third party metadata curations with archived records, and developed a new rapid visualisation platform, the ENA Notebooks. For foundational enhancements, we have improved the INSDC data exchange and synchronisation pipelines, and invested in site reliability engineering for ENA infrastructure. In order to support genomic surveillance efforts, we have continued to provide ENA services in support of SARS-CoV-2 data mobilisation and have adapted these for broader pathogen surveillance efforts.

Funder

European Molecular Biology Laboratory

Gordon and Betty Moore Foundation

Aquatic Symbiosis

UniEuk

European Union's Horizon 2020 and Horizon Europe research and innovation programmes

Aqa-FAANG

AtlantECO

BiCIKL

BioOcean5D

BlueCloud

Blue-Cloud 2026

BovReg

BGE

BY-COVID

EarlyCause

EASI-Genomics

eDNAqua-Plan

ELIXIR-CONVERGE

EOSC-Life

GENE-SwitCh

RECODID

VEO

Biotechnology and Biological Sciences Research Council

Wellcome Trust

SP3

Publisher

Oxford University Press (OUP)

Subject

Genetics

Reference8 articles.

1. The FAIR Guiding Principles for scientific data management and stewardship;Wilkinson;Sci. Data,2016

2. The international nucleotide sequence database collaboration;Arita;Nucleic Acids Res.,2021

3. GenBank;Sayers;Nucleic Acids Res.,2021

4. DDBJ database updates and computational infrastructure enhancement;Ogasawara;Nucleic Acids Res.,2019

5. The ELIXIR Core Data Resources: fundamental infrastructure for the life sciences;Drysdale;Bioinformatics,2020

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3