Affiliation:
1. Department of CS and Engineering, Institute for CS and Engineering, CONICET – UNS, Bahía Blanca, Argentina
Abstract
Abstract:
This paper presents a sequence of steps oriented to gain biological knowledge from microarray gene expression data. The pipeline's core is a canonical multi-objective Genetic Algorithm (GA),
which takes a gene expression matrix and a factor as input. The factor groups samples according to different criteria, e.g. healthy tissue and diseased tissue samples. The result of one run of the GA is a gene
set with good properties both at the individual level, in terms of differential expression, and at the aggregate level, in terms of correlation between expression profiles. Microarray experiment data are obtained from GEO (Gene Expression Omnibus dataset). As for the pipeline structure, independent runs of
the GA are analyzed, genes in common between all the runs are collected, and over-representation analysis is performed. At the end of the process, a small number of genes of interest arise. The methodology
is exemplified with a leukemia benchmark dataset, and a group of genes of interest is obtained for the
illustrative example.
Publisher
Bentham Science Publishers Ltd.
Subject
Computational Mathematics,Genetics,Molecular Biology,Biochemistry
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献