Triclustering Discovery Using the δ-Trimax Method on Microarray Gene Expression Data

Published: 2021-03-08 Issue: 3 Volume: 13 Page: 437
ISSN: 2073-8994
Container-title: Symmetry
Language: en
Short-container-title: Symmetry

Author:

Siswantining Titin, Saputra Noval, Sarwinda Devvi, Al-Ash Herley Shaori

Abstract

Clustering is a mathematical approach that finds groups of data with similar attributes. It is also often used in computer science to group large amounts of data. Triclustering is an analysis technique for 3D data (observation-attribute-context) that groups observations across several attributes and contexts simultaneously, and it has frequently been applied to microarray gene expression data. We propose the δ-Trimax method to perform triclustering analysis on microarray gene expression data. The δ-Trimax method aims to find a tricluster that has a mean square residual smaller than δ and a maximum volume. Triclusters are obtained by deleting nodes from the 3D data using the multiple node deletion and single node deletion algorithms. The resulting tricluster candidates are then checked again by adding back some previously deleted nodes using the node addition algorithm. In this research, we improved the program that implements the δ-Trimax method and computed evaluation measures for the resulting triclusters. The δ-Trimax method was applied to two microarray gene expression data sets. The first application used gene expression data from the differentiation process of human-induced pluripotent stem cells (HiPSCs) from patients with heart disease; the best simulation was obtained with δ = 0.0068 and λ = 1.2 and yielded five triclusters, which are considered characteristic of heart disease. The second application used HIV-1 data; the best simulation was obtained with δ = 0.0046 and λ = 1.25 and produced three genes as biomarkers: AGFG1, EGR1 and HLA-C. This gene group can be used by medical experts in providing further treatment.
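The deletion step described in the abstract can be illustrated with a minimal Python sketch. The abstract does not state the residual formula, so the sketch assumes the definition commonly used for δ-Trimax, r_ijk = a_ijk - a_iJK - a_IjK - a_IJk + 2·a_IJK, where capital indices denote means over the tricluster. The function names `squared_residues`, `mean_square_residue` and `single_node_deletion` are hypothetical and are not taken from the authors' program. Multiple node deletion (which uses the threshold λ) and node addition are omitted.

```python
import numpy as np


def squared_residues(block):
    """Squared residue of every cell in a 3D block
    (observations x attributes x contexts), under the assumed formula
    r_ijk = a_ijk - a_iJK - a_IjK - a_IJk + 2 * a_IJK."""
    total = block.mean()
    obs_mean = block.mean(axis=(1, 2), keepdims=True)  # a_iJK
    att_mean = block.mean(axis=(0, 2), keepdims=True)  # a_IjK
    ctx_mean = block.mean(axis=(0, 1), keepdims=True)  # a_IJk
    residue = block - obs_mean - att_mean - ctx_mean + 2.0 * total
    return residue ** 2


def mean_square_residue(block):
    """Mean square residual (MSR) of a 3D block."""
    return float(squared_residues(block).mean())


def single_node_deletion(data, delta):
    """Greedy sketch of single node deletion (an assumption, not the
    authors' code): while the MSR exceeds delta, drop the one
    observation, attribute or context with the largest mean squared
    residue. Returns the kept indices along each of the three axes."""
    idx = [np.arange(n) for n in data.shape]
    while True:
        sq = squared_residues(data[np.ix_(*idx)])
        if sq.mean() <= delta:
            break
        # Per-node scores along each axis; axes of length 1 are protected.
        scores = [sq.mean(axis=(1, 2)), sq.mean(axis=(0, 2)), sq.mean(axis=(0, 1))]
        worst = [s.max() if len(s) > 1 else -np.inf for s in scores]
        if max(worst) == -np.inf:
            break  # nothing left that can be deleted
        axis = int(np.argmax(worst))
        idx[axis] = np.delete(idx[axis], int(np.argmax(scores[axis])))
    return idx
```

Under this definition a purely additive block (a_ijk = α_i + β_j + γ_k) has an MSR of zero, so the loop stops once the nodes that break the additive pattern are gone. A smaller δ therefore gives more coherent but smaller triclusters, which is why the method also seeks maximum volume.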

Funder

Universitas Indonesia

Publisher

MDPI AG

Subject

Physics and Astronomy (miscellaneous), General Mathematics, Chemistry (miscellaneous), Computer Science (miscellaneous)

Link

https://www.mdpi.com/2073-8994/13/3/437/pdf
