FAS: Assessing the similarity between proteins using multi-layered feature architectures-Reference-Cited by-同舟云学术

FAS: Assessing the similarity between proteins using multi-layered feature architectures

Published:2022-09-03 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Dosch Julian,Bergmann Holger,Tran Vinh,Ebersberger Ingo^ORCID

Abstract

AbstractMotivationExpert curation to differentiate between functionally diverged homologs and those that may still share a similar function routinely relies on the visual interpretation of domain architecture changes. However, the size of contemporary data sets integrating homologs from hundreds to thousands of species calls for alternate solutions. Scoring schemes to evaluate domain architecture similarities can help to automatize this procedure, in principle. But existing schemes are often too simplistic in the similarity assessment, many require an a-priori resolution of overlapping domain annotations, and those that allow overlaps to extend the set of annotations sources cannot account for redundant annotations. As a consequence, the gap between the automated similarity scoring and the similarity assessment based on visual architecture comparison is still too wide to make the integration of both approaches meaningful.ResultsHere, we present FAS, a scoring system for the comparison of multi-layered feature architectures integrating information from a broad spectrum of annotation sources. Feature architectures are represented as directed acyclic graphs, and redundancies are resolved in the course of comparison using a score maximization algorithm. A benchmark using more than 10,000 human-yeast ortholog pairs reveals that FAS consistently outperforms existing scoring schemes. Using three examples, we show how automated architecture similarity assessments can be routinely applied in the benchmarking of orthology assignment software, in the identification of functionally diverged orthologs, and in the identification of entries in protein collections that most likely stem from a faulty gene prediction.Availability and implementationFAS is available as python package: https://pypi.org/project/greedyFAS/

Publisher

Cold Spring Harbor Laboratory

Reference63 articles.

1. The Quest for Orthologs benchmark service and consensus calls in 2020

2. Altenhoff, A. M. , Levy, J. , Zarowiecki, M. , Tomiczek, B. , Vesztrocy, A. W. , Dalquen, D. A. , Müller, S. , Telford, M. J. , Glover, N. M. , Dylus, D. , & Dessimoz, C. (2019). OMA standalone: Orthology inference among public and custom genomes and transcriptomes. Genome Research, 29(7). https://doi.org/10.1101/gr.243212.118

3. Altschul, S. F. , Madden, T. L. , Schäffer, A. A. , Zhang, J. , Zhang, Z. , Miller, W. , & Lipman, D. J. (1997). Gapped BLAST and PSI-BLAST: A new generation of protein database search programs. In Nucleic Acids Research (Vol. 25, Issue 17). https://doi.org/10.1093/nar/25.17.3389

4. Aramaki, T. , Blanc-Mathieu, R. , Endo, H. , Ohkubo, K. , Kanehisa, M. , Goto, S. , & Ogata, H. (2020). KofamKOALA: KEGG Ortholog assignment based on profile HMM and adaptive score threshold. Bioinformatics, 36(7). https://doi.org/10.1093/bioinformatics/btz859

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Feature architecture aware phylogenetic profiling indicates a functional diversification of type IVa pili in the nosocomial pathogen Acinetobacter baumannii;PLOS Genetics;2023-07-27

2. Domain-architecture aware phylogenetic profiling indicates a functional diversification of type IVa pili in the nosocomial pathogenAcinetobacter baumannii;2023-02-01