Selecting Relevant Genes with a Spectral Approach

Please use this identifier to cite or link to this item: http://dspace.mediu.edu.my:8181/xmlui/handle/1721.1/7282

Full metadata record

DC Field	Value	Language
dc.creator	Wolf, Lior	-
dc.creator	Shashua, Amnon	-
dc.creator	Mukherjee, Sayan	-
dc.date	2004-10-20T21:05:21Z	-
dc.date	2004-01-27	-
dc.date.accessioned	2013-10-09T02:48:56Z	-
dc.date.available	2013-10-09T02:48:56Z	-
dc.date.issued	2013-10-09	-
dc.identifier	AIM-2004-002	-
dc.identifier	CBCL-234	-
dc.identifier	http://hdl.handle.net/1721.1/7282	-
dc.identifier.uri	http://koha.mediu.edu.my:8181/xmlui/handle/1721	-
dc.description	Array technologies have made it possible to record simultaneously the expression patterns of thousands of genes. A fundamental problem in the analysis of gene expression data is the identification of highly relevant genes that either discriminate between phenotypic labels or are important with respect to the cellular process studied in the experiment: for example, cell cycle or heat shock in yeast experiments, chemical or genetic perturbations of mammalian cell lines, and genes involved in class discovery for human tumors. In this paper we focus on the task of unsupervised gene selection. The problem of selecting a small subset of genes is particularly challenging because the datasets involved are typically characterized by a very small sample size (on the order of a few tens of tissue samples) and by a very large feature space, as the number of genes tends to be in the high thousands. We propose a model-independent approach which scores candidate gene selections using spectral properties of the candidate affinity matrix. The algorithm is very straightforward to implement yet has a number of remarkable properties which guarantee consistent sparse selections. To illustrate the value of our approach we applied our algorithm to five different datasets. The first consists of time course data from four well-studied hematopoietic cell lines (HL-60, Jurkat, NB4, and U937). The other four datasets include three well-studied treatment outcomes (large cell lymphoma, childhood medulloblastomas, breast tumors) and one unpublished dataset (lymph status). We compared our approach both with other unsupervised methods (SOM, PCA, GS) and with supervised methods (SNR, RMB, RFE). The results clearly show that our approach considerably outperforms all the other unsupervised approaches in our study, is competitive with supervised methods, and in some cases even outperforms supervised approaches.	-
dc.format	2062939 bytes	-
dc.format	836436 bytes	-
dc.format	application/postscript	-
dc.format	application/pdf	-
dc.language	en_US	-
dc.relation	AIM-2004-002	-
dc.relation	CBCL-234	-
dc.subject	AI	-
dc.title	Selecting Relevant Genes with a Spectral Approach	-
Appears in Collections:	MIT Items