Abstract
As early as in 2002, the need was declared for a public repository of experimental results for gene expression profiling. Since that time, several storage hubs for gene expression
profiling data have been created, to enable profile analysis and comparison. This gene expression profiling may usually be performed using either mRNA microarray hybridization
ornext-generation sequencing. However, all these big data may be heterogeneous, even if they were obtained for the same type of normal or pathologically altered organs and tissues,
and have been investigated using the same experimental platform. In the current work, we have proposed a new method for analyzing the homogeneity of expression data based on the Student test.
Using computational experiments, we have shown the advantage of our method in terms of computational speed for large datasets, and developed an approach to interpreting the results
for the Student test application. Using a new method of data analysis, we have suggested a scheme for visualization of the overall picture of gene expression and comparison of expression
profiles at different diseases and/or different stages of the same disease.
Publisher
Institute of Mathematical Problems of Biology of RAS (IMPB RAS)
Subject
Applied Mathematics,Biomedical Engineering