Centrality-based nearest-neighbor projected-distance regression (C-NPDR) feature selection for correlation predictors with application to resting-state fMRI of major depressive disorder

Author:

Kresock Elizabeth (1), Luttbeg Henry (1), Li Jamie (1), Kuplicki Rayus (2), McKinney B. A. (1), McKinney Brett (1), Dawkins Bryan (3)

Affiliation:

1. The University of Tulsa

2. Laureate Institute for Brain Research

3. SomaLogic, Inc

Abstract

Background. Nearest-neighbor projected-distance regression (NPDR) is a metric-based machine learning feature selection algorithm that uses distances between samples and projected differences between variables to identify variables or features that may interact to affect the prediction of complex outcomes. Typical bioinformatics data consist of separate variables of interest, such as genes or proteins. In contrast, resting-state functional MRI (rs-fMRI) data consist of a time series for each brain region of interest (ROI) in each subject, and these within-brain time series are typically transformed into correlations between pairs of ROIs. These ROI-pair correlations can then be used as input for feature selection or other machine learning. Straightforward feature selection would return the most significant pairs of ROIs; however, it would also be beneficial to know the importance of individual ROIs.

Results. We extend NPDR to compute the importance of individual ROIs from correlation-based features. We present correlation-difference and centrality-based versions of NPDR. The centrality-based NPDR can be coupled with any centrality method and with importance scores other than NPDR, such as random forest importance. We develop a new simulation method using random network theory to generate artificial correlation data predictors with variation in correlation that affects class prediction.

Conclusions. We compare feature selection methods based on their ability to detect functional simulated ROIs, and we apply the new centrality NPDR approach to a resting-state fMRI study of major depressive disorder (MDD) and healthy controls. We determine that the areas of the brain that are the most interactive in MDD patients include the middle temporal gyrus, the inferior temporal gyrus, and the dorsal entorhinal cortex. The resulting feature selection and simulation approaches can be applied to other domains that use correlation-based features.
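The two data transformations the abstract describes (ROI time series to pairwise correlations, and pair-level importance to ROI-level importance) can be illustrated with a minimal sketch. This is not the authors' C-NPDR implementation: the function names are hypothetical, the pair importance scores are made-up numbers standing in for NPDR or random forest output, and strength (weighted degree) centrality is only one possible choice, since the abstract states that any centrality method may be used.

```python
import numpy as np

def roi_correlation_features(ts):
    """ts: (timepoints, n_rois) array for one subject (hypothetical layout).
    Returns the upper-triangle ROI-pair correlations as a flat vector."""
    corr = np.corrcoef(ts, rowvar=False)        # n_rois x n_rois correlation matrix
    iu = np.triu_indices(corr.shape[0], k=1)    # each unordered ROI pair once
    return corr[iu]

def roi_strength_centrality(pair_scores, n_rois):
    """Place pair-level importance scores in a symmetric ROI-by-ROI matrix
    and sum each row. Strength centrality is an assumed stand-in here."""
    w = np.zeros((n_rois, n_rois))
    iu = np.triu_indices(n_rois, k=1)
    w[iu] = pair_scores
    w = w + w.T                                 # make the score network symmetric
    return w.sum(axis=1)

# Synthetic time series: 100 timepoints, 4 ROIs
rng = np.random.default_rng(0)
ts = rng.standard_normal((100, 4))
features = roi_correlation_features(ts)         # 6 pair features for 4 ROIs

# Made-up pair importance scores, ordered (0,1),(0,2),(0,3),(1,2),(1,3),(2,3)
pair_scores = np.array([0.9, 0.8, 0.1, 0.2, 0.0, 0.1])
centrality = roi_strength_centrality(pair_scores, 4)
top_roi = int(np.argmax(centrality))            # ROI appearing in the strongest pairs
```

In this toy example ROI 0 ranks highest because it participates in the two highest-scoring pairs, which is the intuition behind moving from pair-level to ROI-level importance.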

Publisher

Research Square Platform LLC
