De-Noising, Clustering, Classification, and Representation of Microarray Data for Disease Diagnostics

Author:

Baharadwaj Nitin1,Wadhwa Sheena1,Goel Pragya1,Sethi Isha1,Arora Chanpreet Singh1,Goel Aviral1,Bhatnagar Sonika1,Parthasarathy Harish1

Affiliation:

1. Netaji Subhas Institute of Technology, India

Abstract

A microarray works by exploiting the ability of a given mRNA molecule to bind specifically to the DNA template from which it originated under specific high stringency conditions. After this, the amount of mRNA bound to each DNA site on the array is determined, which represents the expression level of each gene. Qualification of the mRNA (probe) bound to each DNA spot (target) can help us to determine which genes are active or responsible for the current state of the cell. The probe target hybridization is usually detected and quantified using dyes/flurophore/chemiluminescence labels. The microarray data gives a single snapshot of the gene activity profile of a cell at any given time. Microarray data helps to elucidate the various genes involved in the disease and may also be used for diagnosis /prognosis. In spite of its huge potential, microarray data interpretation and use is limited by its error prone nature, the sheer size of the data and the subjectivity of the analysis. Initially, we describe the use of several techniques to develop a pre-processing methodology for denoising microarray data using signal process techniques. The noise free data thus obtained is more suitable for classification of the data as well as for mining useful information from the data. Discrete Fourier Transform (DFT) and Autocorrelation were explored for denoising the data. We also used microarray data to develop the use of microarray data as diagnostic tool in cancer using One Dimensional Fourier Transform followed by simple Euclidean Distance Calculations and Two Dimensional MUltiple SIgnal Classification (MUSIC). To improve the accuracy of the diagnostic tool, Volterra series were used to model the nonlinear behavior of the data. Thus, our efforts at denoising, representation, and classification of microarray data with signal processing techniques show that appreciable results could be attained even with the most basic techniques. To develop a method to search for a gene signature, we used a combination of PCA and density based clustering for inferring the gene signature of Parkinson’s disease. Using this technique in conjunction with gene ontology data, it was possible to obtain a signature comprising of 21 genes, which were then validated by their involvement in known Parkinson’s disease pathways. The methodology described can be further developed to yield future biomarkers for early Parkinson’s disease diagnosis, as well as for drug development.

Publisher

IGI Global

Reference69 articles.

1. Technology Insight: tuning into the genetic orchestra using microarrays—limitations of DNA microarrays in clinical practice

2. Agrawal, R., & Srikant, R. (1994). Fast algorithms for mining association rules in large databases. In Proceedings of 20th International Conference on Very Large Data Bases, Santiago de Chile, Chile: Morgan Kaufmann.

3. Predicting the response to preoperative radiation or chemoradiation by a microarray analysis of the gene expression profiles in rectal cancer

4. Ankerst, M., Breunig, M. M., Kriegel, H., & Sander, J. (1999). OPTICS: ordering points to identify the clustering structure. In Proceedings of ACM SIGMOD International Conference on Management of Data (49-60). New York: ACM Press.

5. Genomic and proteomic strategies to identify novel targets potentially involved in learning and memory

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3