proGenomes2: an improved database for accurate and consistent habitat, taxonomic and functional annotations of prokaryotic genomes


Mende Daniel R (1), Letunic Ivica (2), Maistrenko Oleksandr M (3), Schmidt Thomas S B (3), Milanese Alessio (3), Paoli Lucas (4), Hernández-Plaza Ana (5), Orakov Askarbek N (3), Forslund Sofia K (6), Sunagawa Shinichi (4), Zeller Georg (3), Huerta-Cepas Jaime (5), Coelho Luis Pedro (7, 8), Bork Peer (3, 6, 9, 10)


1. Department of Medical Microbiology, Academic Medical Centre, University of Amsterdam, Amsterdam, The Netherlands

2. Biobyte solutions GmbH, Bothestr. 142, 69117 Heidelberg, Germany

3. Structural and Computational Biology Unit, European Molecular Biology Laboratory, 69117 Heidelberg, Germany

4. Institute of Microbiology, Department of Biology, ETH Zurich, Vladimir-Prelog-Weg 4, 8093 Zurich, Switzerland

5. Centro de Biotecnología y Genómica de Plantas, Universidad Politécnica de Madrid (UPM) - Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria (INIA), Campus de Montegancedo-UPM, 28223, Pozuelo de Alarcón, Madrid, Spain

6. Max Delbrück Centre for Molecular Medicine, 13125 Berlin, Germany

7. Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai, China

8. Key Laboratory of Computational Neuroscience and Brain-Inspired Intelligence (Fudan University), Ministry of Education, China

9. Molecular Medicine Partnership Unit, University of Heidelberg and European Molecular Biology Laboratory, 69120 Heidelberg, Germany

10. Department of Bioinformatics, Biocenter, University of Würzburg, 97074 Würzburg, Germany


Abstract

Microbiology depends on the availability of annotated microbial genomes for many applications. Comparative genomics approaches have been a major advance, but consistent and accurate annotations of genomes can be hard to obtain. In addition, newer concepts such as the pan-genome concept are still being implemented to help answer biological questions. Hence, we present proGenomes2, which provides 87 920 high-quality genomes in a user-friendly and interactive manner. Genome sequences and annotations can be retrieved individually or by taxonomic clade. Every genome in the database has been assigned to a species cluster and most genomes could be accurately assigned to one or multiple habitats. In addition, general functional annotations and specific annotations of antibiotic resistance genes and single nucleotide variants are provided. In short, proGenomes2 provides threefold more genomes, enhanced habitat annotations, updated taxonomic and functional annotation and improved linkage to the NCBI BioSample database. The database is available at


European Molecular Biology Laboratory

European Research Council

Heidelberg Center for Human Bioinformatics

ETH Zürich

Helmut Horten Foundation

Fudan University

Shanghai Municipal Science and Technology


Consejería de Educación, Juventud y Deporte de la Comunidad de Madrid

Fondo Social Europeo

Ministerio de Ciencia, Innovación y Universidades

Horizon 2020


Oxford University Press (OUP)








