PhenClust, a standalone tool for identifying trends within sets of biological phenotypes using semantic similarity and the Unified Medical Language System metathesaurus

Author:

Wilson Jennifer L1ORCID,Wong Mike2,Stepanov Nicholas3,Petkovic Dragutin23,Altman Russ45

Affiliation:

1. Department of Chemical and Systems Biology, Stanford University, Stanford, California, USA

2. CoSE Computing for Life Science, San Francisco State University, San Francisco, California, USA

3. Department of Computer Science, San Francisco State University, San Francisco, California, USA

4. Department of Bioengineering, Stanford University, Stanford, California, USA

5. Department of Genetics, Stanford University, Stanford, California, USA

Abstract

Abstract Objectives We sought to cluster biological phenotypes using semantic similarity and create an easy-to-install, stable, and reproducible tool. Materials and Methods We generated Phenotype Clustering (PhenClust)—a novel application of semantic similarity for interpreting biological phenotype associations—using the Unified Medical Language System (UMLS) metathesaurus, demonstrated the tool’s application, and developed Docker containers with stable installations of two UMLS versions. Results PhenClust identified disease clusters for drug network-associated phenotypes and a meta-analysis of drug target candidates. The Dockerized containers eliminated the requirement that the user install the UMLS metathesaurus. Discussion Clustering phenotypes summarized all phenotypes associated with a drug network and two drug candidates. Docker containers can support dissemination and reproducibility of tools that are otherwise limited due to insufficient software support. Conclusion PhenClust can improve interpretation of high-throughput biological analyses where many phenotypes are associated with a query and the Dockerized PhenClust achieved our objective of decreasing installation complexity.

Funder

US Food and Drug Administration

SPARK at Stanford, and by a Sanofi iDEA Award

Publisher

Oxford University Press (OUP)

Subject

Health Informatics

Reference10 articles.

1. PathFX provides mechanistic insights into drug efficacy and safety for regulatory review and therapeutic development;Wilson;PLoS Comput Biol,2018

2. An overview of MetaMap: historical perspective and recent advances;Aronson;J Am Med Inform Assoc,2010

3. UMLS-Interface and UMLS-Similarity: open source software for measuring paths and semantic similarity;McInnes;AMIA Annu Symp Proc,2009

4. PheWAS: demonstrating the feasibility of a phenome-wide scan to discover gene-disease associations;Denny;Bioinformatics,2010

5. An introduction to Docker for reproducible research;Boettiger;Sigops Oper Syst Rev,2015

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3