Genes2WordCloud: a quick way to identify biological themes from gene lists and free text

Author:

Baroukh Caroline,Jenkins Sherry L,Dannenfelser Ruth,Ma'ayan Avi

Abstract

Abstract Background Word-clouds recently emerged on the web as a solution for quickly summarizing text by maximizing the display of most relevant terms about a specific topic in the minimum amount of space. As biologists are faced with the daunting amount of new research data commonly presented in textual formats, word-clouds can be used to summarize and represent biological and/or biomedical content for various applications. Results Genes2WordCloud is a web application that enables users to quickly identify biological themes from gene lists and research relevant text by constructing and displaying word-clouds. It provides users with several different options and ideas for the sources that can be used to generate a word-cloud. Different options for rendering and coloring the word-clouds give users the flexibility to quickly generate customized word-clouds of their choice. Methods Genes2WordCloud is a word-cloud generator and a word-cloud viewer that is based on WordCram implemented using Java, Processing, AJAX, mySQL, and PHP. Text is fetched from several sources and then processed to extract the most relevant terms with their computed weights based on word frequencies. Genes2WordCloud is freely available for use online; it is open source software and is available for installation on any web-site along with supporting documentation at http://www.maayanlab.net/G2W. Conclusions Genes2WordCloud provides a useful way to summarize and visualize large amounts of textual biological data or to find biological themes from several different sources. The open source availability of the software enables users to implement customized word-clouds on their own web-sites and desktop applications.

Publisher

Springer Science and Business Media LLC

Subject

Information Systems and Management,Health Informatics,Computer Science Applications,Information Systems

Reference6 articles.

1. Kuo BYL, Hentrich T, Good BM, Wilkinson MD: Tag clouds for summarizing web search results. Proceedings of the 16th international conference on World Wide Web:. 2007, ACM, 1203-1204. ; New York, New York, USA

2. Desai J, Flatow JM, Song J, Zhu LJ, Du P, Huang C-C, Lu H, Lin SM, Kibbe WA: Visual Presentation as a Welcome Alternative to Textual Presentation of Gene Annotation Information. Advances in Experimental Medicine and Biology. 2011, 680 (7): 709-715.

3. Oesper L, Merico D, Isserlin R, Bader G: WordCloud: a Cytoscape plugin to create a visual semantic summary of networks. Source Code for Biology and Medicine. 2011, 6 (1): 7-10.1186/1751-0473-6-7.

4. Sarkar IN, Schenk R, Miller H, Norton CN: LigerCat: Using "MeSH Clouds" from Journal, Article, or Gene Citations to Facilitate the Identification of Relevant Biomedical Literature. AMIA Annu Symp Proc. 2009, 2009: 563-567.

5. Consortium GO: The Gene Ontology in 2010: extensions and refinements. Nucleic Acids Res. 2010, D331-335. 38 Database

Cited by 23 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3