Restructuring Sparse High Dimensional Data for Effective Retrieval

Please use this identifier to cite or link to this item: http://dspace.mediu.edu.my:8181/xmlui/handle/1721.1/6674

Full metadata record

DC Field	Value	Language
dc.creator	Isbell, Charles	-
dc.creator	Viola, Paul	-
dc.date	2004-10-08T20:37:10Z	-
dc.date	2004-10-08T20:37:10Z	-
dc.date	1998-05-01	-
dc.date.accessioned	2013-10-09T02:46:25Z	-
dc.date.available	2013-10-09T02:46:25Z	-
dc.date.issued	2013-10-09	-
dc.identifier	AIM-1636	-
dc.identifier	http://hdl.handle.net/1721.1/6674	-
dc.identifier.uri	http://koha.mediu.edu.my:8181/xmlui/handle/1721	-
dc.description	The task in text retrieval is to find the subset of a collection of documents relevant to a user's information request, usually expressed as a set of words. Classically, documents and queries are represented as vectors of word counts. In its simplest form, relevance is defined to be the dot product between a document and a query vector--a measure of the number of common terms. A central difficulty in text retrieval is that the presence or absence of a word is not sufficient to determine relevance to a query. Linear dimensionality reduction has been proposed as a technique for extracting underlying structure from the document collection. In some domains (such as vision) dimensionality reduction reduces computational complexity. In text retrieval it is more often used to improve retrieval performance. We propose an alternative and novel technique that produces sparse representations constructed from sets of highly-related words. Documents and queries are represented by their distance to these sets. and relevance is measured by the number of common clusters. This technique significantly improves retrieval performance, is efficient to compute and shares properties with the optimal linear projection operator and the independent components of documents.	-
dc.format	5435006 bytes	-
dc.format	502542 bytes	-
dc.format	application/postscript	-
dc.format	application/pdf	-
dc.language	en_US	-
dc.relation	AIM-1636	-
dc.title	Restructuring Sparse High Dimensional Data for Effective Retrieval	-
Appears in Collections:	MIT Items

Files in This Item:

There are no files associated with this item.

Show simple item record

DSpace JSPUI

DSpace preserves and enables easy and open access to all types of digital content including text, images, moving images, mpegs and data sets