Identification of the associations between genes and quantitative traits using entropy-based kernel density estimation

Published:2022-06-30 Issue:2 Volume:20 Page:e17
ISSN:2234-0742
Container-title:Genomics & Informatics
language:en
Short-container-title:Genomics Inform

Author:

Yee Jaeyong, Park Taesung, Park Mira

Abstract

Genetic associations have been quantified using a number of statistical measures. Entropy-based mutual information may be one of the more direct ways of estimating the association, in the sense that it does not depend on the parametrization. For this purpose, both the entropy and conditional entropy of the phenotype distribution should be obtained. Quantitative traits, however, do not usually allow an exact evaluation of entropy. The estimation of entropy needs a probability density function, which can be approximated by kernel density estimation. We have investigated the proper sequence of procedures for combining the kernel density estimation and entropy estimation with a probability density function in order to calculate mutual information. Genotypes and their interactions were constructed to set the conditions for conditional entropy. Extensive simulation data created using three types of generating functions were analyzed using two different kernels as well as two types of multifactor dimensionality reduction and another probability density approximation method called m-spacing. The statistical power in terms of correct detection rates was compared. Using kernels was found to be most useful when the trait distributions were more complex than simple normal or gamma distributions. A full-scale genomic dataset was explored to identify associations using the 2-h oral glucose tolerance test results and γ-glutamyl transpeptidase levels as phenotypes. Clearly distinguishable single-nucleotide polymorphisms (SNPs) and interacting SNP pairs associated with these phenotypes were found and listed with empirical p-values.
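The quantity described in the abstract, mutual information as the difference between the entropy of the trait and its entropy conditional on genotype, can be illustrated with a small sketch. This is not the authors' implementation. It assumes a Gaussian kernel, a normal-reference rule-of-thumb bandwidth chosen separately for each genotype group, and a resubstitution entropy estimate (the negative mean log of the estimated density at the sample points). The function names and the simulated data are hypothetical and serve only as illustration.

```python
import math
import random
from collections import defaultdict


def rule_of_thumb_bandwidth(values):
    """Normal-reference bandwidth 1.06 * sd * n^(-1/5) (an assumed choice)."""
    n = len(values)
    if n < 2:
        raise ValueError("need at least two observations")
    mean = sum(values) / n
    sd = math.sqrt(sum((v - mean) ** 2 for v in values) / (n - 1))
    if sd == 0:
        raise ValueError("observations must not all be identical")
    return 1.06 * sd * n ** (-0.2)


def kde_entropy(values):
    """Resubstitution estimate of differential entropy in nats.

    The density at each sample point is a Gaussian kernel density
    estimate built from all points, including the point itself.
    """
    n = len(values)
    h = rule_of_thumb_bandwidth(values)
    norm = 1.0 / (n * h * math.sqrt(2.0 * math.pi))
    log_density_sum = 0.0
    for yi in values:
        density = norm * sum(
            math.exp(-0.5 * ((yi - yj) / h) ** 2) for yj in values
        )
        log_density_sum += math.log(density)
    return -log_density_sum / n


def mutual_information(trait, genotype):
    """Estimate I(Y; G) = H(Y) - sum_g p(g) * H(Y | G = g)."""
    groups = defaultdict(list)
    for y, g in zip(trait, genotype):
        groups[g].append(y)
    n = len(trait)
    conditional = sum(
        len(ys) / n * kde_entropy(ys) for ys in groups.values()
    )
    return kde_entropy(trait) - conditional


# Hypothetical simulated data: an additive genotype effect on the trait.
random.seed(0)
n_samples = 300
genotype = [random.randint(0, 1) + random.randint(0, 1) for _ in range(n_samples)]
trait = [2.0 * g + random.gauss(0.0, 1.0) for g in genotype]

# Shuffling the genotype labels breaks the association (a null comparison).
shuffled = genotype[:]
random.shuffle(shuffled)

mi_assoc = mutual_information(trait, genotype)
mi_null = mutual_information(trait, shuffled)
print("MI with true genotypes:", round(mi_assoc, 3))
print("MI with shuffled genotypes:", round(mi_null, 3))
```

Repeating the shuffle many times and counting how often the shuffled estimate reaches the observed one is one common way to obtain an empirical p-value. Whether the paper's p-values were computed this way is not stated in the abstract.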

Funder

National Research Foundation of Korea

Publisher

Korea Genome Organization

Subject

Health Informatics; Genetics; Ecology, Evolution, Behavior and Systematics

Link

http://genominfo.org/upload/pdf/gi-22033.pdf

Cited by 1 article.

1. Artificial Intelligence Analysis and Reverse Engineering of Molecular Subtypes of Diffuse Large B-Cell Lymphoma Using Gene Expression Data;BioMedInformatics;2024-01-26