Author:
Pathak Ashish K,Singla Ridhima,Juneja Mamta,Tuli Rakesh
Abstract
AbstractTranscriptome data are widely used for functional analysis of genes. De-novo assembly of transcriptome gives a large number of unigenes. A large proportion of them remain unannotated. Efficient computational methods are required for identifying genes and modeling those for regulatory and functional roles. Principal component analysis (PCA) was used in a novel approach to shortlist genes, independently of annotation in genome expression data, taking seed development in Arabidopsis thaliana as a representative case. PCA was applied to published genome expression data from four lines of Arabidopsis, mutated in seed development. The PC separating all the developmental stages between a mutant and its respective wild type was selected for shortlisting genes as functionally more important. The shortlisted genes identified by PCA belong to a number of biological functions. The genes reported to give sensitivity to desiccation were identified in PCA analysis also in desiccation intolerant lines only. With respect to the network of 98 genes targeted by ABI3, a higher number of genes was identified as important in the mutants abi 3-5, fus 3-3 andlec 1-1 in comparison to abi 3-1. Ontological analysis and comparison with earlier studies suggest that PCA of genome expression data is useful for shortlisting functionally important genes.
Publisher
Cold Spring Harbor Laboratory
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献