Get 20M+ Full-Text Papers For Less Than $1.50/day. Start a 7-Day Trial for You or Your Team.

Learn More →

Supervised, Unsupervised, and Semi-Supervised Feature Selection: A Review on Gene Selection.

Supervised, Unsupervised, and Semi-Supervised Feature Selection: A Review on Gene Selection. Recently, feature selection and dimensionality reduction have become fundamental tools for many data mining tasks, especially for processing high-dimensional data such as gene expression microarray data. Gene expression microarray data comprises up to hundreds of thousands of features with relatively small sample size. Because learning algorithms usually do not work well with this kind of data, a challenge to reduce the data dimensionality arises. A huge number of gene selection are applied to select a subset of relevant features for model construction and to seek for better cancer classification performance. This paper presents the basic taxonomy of feature selection, and also reviews the state-of-the-art gene selection methods by grouping the literatures into three categories: supervised, unsupervised, and semi-supervised. The comparison of experimental results on top 5 representative gene expression datasets indicates that the classification accuracy of unsupervised and semi-supervised feature selection is competitive with supervised feature selection. http://www.deepdyve.com/assets/images/DeepDyve-Logo-lg.png IEEE/ACM transactions on computational biology and bioinformatics Pubmed

Supervised, Unsupervised, and Semi-Supervised Feature Selection: A Review on Gene Selection.

IEEE/ACM transactions on computational biology and bioinformatics , Volume 13 (5): 19 – Oct 2, 2017

Supervised, Unsupervised, and Semi-Supervised Feature Selection: A Review on Gene Selection.


Abstract

Recently, feature selection and dimensionality reduction have become fundamental tools for many data mining tasks, especially for processing high-dimensional data such as gene expression microarray data. Gene expression microarray data comprises up to hundreds of thousands of features with relatively small sample size. Because learning algorithms usually do not work well with this kind of data, a challenge to reduce the data dimensionality arises. A huge number of gene selection are applied to select a subset of relevant features for model construction and to seek for better cancer classification performance. This paper presents the basic taxonomy of feature selection, and also reviews the state-of-the-art gene selection methods by grouping the literatures into three categories: supervised, unsupervised, and semi-supervised. The comparison of experimental results on top 5 representative gene expression datasets indicates that the classification accuracy of unsupervised and semi-supervised feature selection is competitive with supervised feature selection.

Loading next page...
 
/lp/pubmed/supervised-unsupervised-and-semi-supervised-feature-selection-a-review-cw1teXW9pU

References

References for this paper are not available at this time. We will be adding them shortly, thank you for your patience.

ISSN
1545-5963
eISSN
1557-9964
DOI
10.1109/TCBB.2015.2478454
pmid
26390495

Abstract

Recently, feature selection and dimensionality reduction have become fundamental tools for many data mining tasks, especially for processing high-dimensional data such as gene expression microarray data. Gene expression microarray data comprises up to hundreds of thousands of features with relatively small sample size. Because learning algorithms usually do not work well with this kind of data, a challenge to reduce the data dimensionality arises. A huge number of gene selection are applied to select a subset of relevant features for model construction and to seek for better cancer classification performance. This paper presents the basic taxonomy of feature selection, and also reviews the state-of-the-art gene selection methods by grouping the literatures into three categories: supervised, unsupervised, and semi-supervised. The comparison of experimental results on top 5 representative gene expression datasets indicates that the classification accuracy of unsupervised and semi-supervised feature selection is competitive with supervised feature selection.

Journal

IEEE/ACM transactions on computational biology and bioinformaticsPubmed

Published: Oct 2, 2017

There are no references for this article.