UniProt: the Universal Protein Knowledgebase in 2023
Author:
, Bateman Alex, Martin Maria-Jesus, Orchard SandraORCID, Magrane Michele, Ahmad Shadab, Alpi Emanuele, Bowler-Barnett Emily H, Britto Ramona, Bye-A-Jee Hema, Cukura Austra, Denny Paul, Dogan Tunca, Ebenezer ThankGod, Fan Jun, Garmiri Penelope, da Costa Gonzales Leonardo Jose, Hatton-Ellis Emma, Hussein Abdulrahman, Ignatchenko Alexandr, Insana Giuseppe, Ishtiaq Rizwan, Joshi Vishal, Jyothi Dushyanth, Kandasaamy Swaathi, Lock Antonia, Luciani Aurelien, Lugaric Marija, Luo Jie, Lussi Yvonne, MacDougall Alistair, Madeira Fabio, Mahmoudy Mahdi, Mishra Alok, Moulang Katie, Nightingale Andrew, Pundir Sangya, Qi Guoying, Raj Shriya, Raposo Pedro, Rice Daniel L, Saidi Rabie, Santos Rafael, Speretta Elena, Stephenson James, Totoo Prabhat, Turner Edward, Tyagi Nidhi, Vasudev Preethi, Warner Kate, Watkins Xavier, Zaru Rossana, Zellner Hermann, Bridge Alan J, Aimo Lucila, Argoud-Puy Ghislaine, Auchincloss Andrea H, Axelsen Kristian B, Bansal Parit, Baratin Delphine, Batista Neto Teresa M, Blatter Marie-Claude, Bolleman Jerven T, Boutet Emmanuel, Breuza Lionel, Gil Blanca Cabrera, Casals-Casas Cristina, Echioukh Kamal Chikh, Coudert Elisabeth, Cuche Beatrice, de Castro Edouard, Estreicher Anne, Famiglietti Maria L, Feuermann Marc, Gasteiger Elisabeth, Gaudet Pascale, Gehant Sebastien, Gerritsen Vivienne, Gos Arnaud, Gruaz Nadine, Hulo Chantal, Hyka-Nouspikel Nevila, Jungo Florence, Kerhornou Arnaud, Le Mercier Philippe, Lieberherr Damien, Masson Patrick, Morgat Anne, Muthukrishnan Venkatesh, Paesano Salvo, Pedruzzi Ivo, Pilbout Sandrine, Pourcel Lucille, Poux Sylvain, Pozzato Monica, Pruess Manuela, Redaschi Nicole, Rivoire Catherine, Sigrist Christian J A, Sonesson Karin, Sundaram Shyamala, Wu Cathy H, Arighi Cecilia N, Arminski Leslie, Chen Chuming, Chen Yongxing, Huang Hongzhan, Laiho Kati, McGarvey Peter, Natale Darren A, Ross Karen, Vinayaka C R, Wang Qinghua, Wang Yuqi, Zhang Jian
Abstract
Abstract
The aim of the UniProt Knowledgebase is to provide users with a comprehensive, high-quality and freely accessible set of protein sequences annotated with functional information. In this publication we describe enhancements made to our data processing pipeline and to our website to adapt to an ever-increasing information content. The number of sequences in UniProtKB has risen to over 227 million and we are working towards including a reference proteome for each taxonomic group. We continue to extract detailed annotations from the literature to update or create reviewed entries, while unreviewed entries are supplemented with annotations provided by automated systems using a variety of machine-learning techniques. In addition, the scientific community continues their contributions of publications and annotations to UniProt entries of their interest. Finally, we describe our new website (https://www.uniprot.org/), designed to enhance our users’ experience and make our data easily accessible to the research community. This interface includes access to AlphaFold structures for more than 85% of all entries as well as improved visualisations for subcellular localisation of proteins.
Funder
National Human Genome Research Institute National Institute of Allergy and Infectious Diseases National Institute on Aging National Institute of General Medical Sciences National Institute of Diabetes and Digestive and Kidney Diseases National Eye Institute National Cancer Institute National Heart, Lung, and Blood Institute National Institutes of Health NHGRI NIH Biotechnology and Biological Sciences Research Council Open Targets SERI European Molecular Biology Laboratory
Publisher
Oxford University Press (OUP)
Cited by
2488 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|