Hierarchical Bayesian classification methods to identify topics by journal quartile with an application in biological sciences

Authors

Restrepo Silvia, ter Horst Enrique, Zambrano Juan Diego, Gunn Laura H., Molina German, Salazar Carlos Andres

Abstract

This manuscript builds on a novel, automatic, freely available Bayesian approach that extracts information from abstracts and titles to classify research topics by journal quartile. The approach is demonstrated on all N = 149,129 ISI-indexed publications in biological sciences journals during 2017. A Bayesian multinomial inverse regression approach is used to extract rankings of topics without the need for a pre-defined dictionary. Bigrams are used to extract research topics across manuscripts, and rankings of research topics are constructed by quartile. Worldwide and local results (e.g., a comparison between two peer/aspirational research institutions in Colombia) are provided, and differences are explored at both the global and local levels. Some topics persist across quartiles, while the relevance of others is quartile-specific. Challenges in sustainable development appear more prevalent in top-quartile journals across institutions, while the two Colombian institutions favour plant and microorganism research. This approach can reduce information inequities by allowing young/incipient researchers in biological sciences, especially those in lower-income countries or universities with limited resources, to freely assess the state of the literature and the relative likelihood of publication in higher-impact journals by research topic. It can also help institutions of higher education identify missing research topics and areas of competitive advantage.
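The bigram-and-ranking step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' Bayesian multinomial inverse regression: as a much simpler stand-in, it scores each bigram by a smoothed log-ratio of its frequency within a quartile to its frequency overall. The function names, the `alpha` smoothing parameter, and the toy documents are all hypothetical and are not taken from the manuscript.

```python
from collections import Counter
import math
import re


def bigrams(text):
    """Split text into lowercase alphabetic tokens and return adjacent pairs."""
    tokens = re.findall(r"[a-z]+", text.lower())
    return list(zip(tokens, tokens[1:]))


def rank_bigrams_by_quartile(docs, top_n=3, alpha=1.0):
    """Rank bigrams within each quartile (illustrative stand-in, not MNIR).

    docs  : iterable of (text, quartile) pairs, e.g. ("some abstract", "Q1")
    alpha : additive smoothing constant (an assumption of this sketch)
    """
    per_quartile = {}
    overall = Counter()
    for text, quartile in docs:
        counts = Counter(bigrams(text))
        per_quartile.setdefault(quartile, Counter()).update(counts)
        overall.update(counts)

    vocab_size = len(overall)
    total_all = sum(overall.values())
    rankings = {}
    for quartile, counts in per_quartile.items():
        total_q = sum(counts.values())
        scores = {}
        for bg, n_all in overall.items():
            # Smoothed relative frequency in this quartile vs. the whole corpus.
            p_q = (counts[bg] + alpha) / (total_q + alpha * vocab_size)
            p_all = (n_all + alpha) / (total_all + alpha * vocab_size)
            scores[bg] = math.log(p_q / p_all)
        # Highest score first; ties broken alphabetically for determinism.
        ranked = sorted(scores, key=lambda bg: (-scores[bg], bg))
        rankings[quartile] = ranked[:top_n]
    return rankings


# Toy input: three invented "abstracts" labelled with a journal quartile.
toy_docs = [
    ("plant pathogen resistance in crops", "Q1"),
    ("plant pathogen genomics", "Q1"),
    ("soil microbe diversity", "Q4"),
]
toy_rankings = rank_bigrams_by_quartile(toy_docs, top_n=2)
```

In this toy input, the bigrams that occur only in the Q4 text rank highest for Q4. A model-based approach such as the one the abstract describes would additionally account for uncertainty in these rankings, which this sketch does not attempt.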

Publisher

IOS Press

Subject

Library and Information Sciences, Education, Information Systems

