Channel: Browse By Latest Additions - RMIT Research Repository

Unsupervised variance based preprocessing of microarray data

Data preprocessing is an important step in preparing DNA microarray data for further analysis. A significant number of genes do not influence the final classification. One reason to eliminate such genes is the increasing computational complexity of supervised machine learning methods, especially in modern microarray experiments with hundreds of samples. This empirical study measures differences in classification performance when different numbers of gene expression measurements are removed in a preprocessing phase. A simple unsupervised gene selection method, based on the variance of each gene across all samples, was used to remove genes with extremely low variance. The study shows the importance of combining unsupervised and supervised feature selection techniques with a classification algorithm. The results show that the gene expression values removed by the simple unsupervised gene selection method are not of significant importance to the final results of supervised gene selection followed by classification.
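
The variance-based filtering step described above can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `variance_filter`, the `drop_fraction` parameter, and the synthetic toy data are all assumptions made for illustration, and the abstract does not state which variance cut-off or how many genes were removed in the study. The sketch computes each gene's variance across all samples without using class labels, then discards the lowest-variance fraction.

```python
import random
from statistics import pvariance


def variance_filter(samples, drop_fraction):
    """Drop the lowest-variance genes (hypothetical helper, not from the paper).

    samples: list of rows, one row per sample, one column per gene.
    drop_fraction: share of genes to remove, in [0, 1) (assumed parameter).
    Returns the filtered rows and the indices of the genes that were kept.
    """
    if not 0.0 <= drop_fraction < 1.0:
        raise ValueError("drop_fraction must be in [0, 1)")
    n_genes = len(samples[0])
    # Variance of each gene across all samples; class labels are never used,
    # which is what makes the selection unsupervised.
    variances = [pvariance([row[g] for row in samples]) for g in range(n_genes)]
    n_drop = round(drop_fraction * n_genes)
    # Rank genes from lowest to highest variance and discard the first n_drop.
    ranked = sorted(range(n_genes), key=lambda g: variances[g])
    kept = sorted(ranked[n_drop:])
    filtered = [[row[g] for g in kept] for row in samples]
    return filtered, kept


# Synthetic toy data (an assumption, not real microarray measurements):
# the first 30 genes are nearly constant, the remaining 70 vary across samples.
random.seed(0)
N_SAMPLES, N_FLAT, N_VARYING = 20, 30, 70
toy = [
    [random.gauss(5.0, 0.01) for _ in range(N_FLAT)]
    + [random.gauss(5.0, 1.0) for _ in range(N_VARYING)]
    for _ in range(N_SAMPLES)
]

filtered, kept = variance_filter(toy, drop_fraction=0.3)
print(len(filtered), "samples,", len(filtered[0]), "genes kept")
```

In a pipeline like the one the abstract outlines, the filtered matrix would then be passed to a supervised gene selection method and a classifier, so the more expensive supervised steps run on fewer genes.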
