DSpace Repository

Restructuring Sparse High Dimensional Data for Effective Retrieval

Show simple item record

dc.creator Isbell, Charles
dc.creator Viola, Paul
dc.date 2004-10-08T20:37:10Z
dc.date 2004-10-08T20:37:10Z
dc.date 1998-05-01
dc.date.accessioned 2013-10-09T02:46:25Z
dc.date.available 2013-10-09T02:46:25Z
dc.date.issued 2013-10-09
dc.identifier AIM-1636
dc.identifier http://hdl.handle.net/1721.1/6674
dc.identifier.uri http://koha.mediu.edu.my:8181/xmlui/handle/1721
dc.description The task in text retrieval is to find the subset of a collection of documents relevant to a user's information request, usually expressed as a set of words. Classically, documents and queries are represented as vectors of word counts. In its simplest form, relevance is defined to be the dot product between a document and a query vector--a measure of the number of common terms. A central difficulty in text retrieval is that the presence or absence of a word is not sufficient to determine relevance to a query. Linear dimensionality reduction has been proposed as a technique for extracting underlying structure from the document collection. In some domains (such as vision) dimensionality reduction reduces computational complexity. In text retrieval it is more often used to improve retrieval performance. We propose an alternative and novel technique that produces sparse representations constructed from sets of highly-related words. Documents and queries are represented by their distance to these sets. and relevance is measured by the number of common clusters. This technique significantly improves retrieval performance, is efficient to compute and shares properties with the optimal linear projection operator and the independent components of documents.
dc.format 5435006 bytes
dc.format 502542 bytes
dc.format application/postscript
dc.format application/pdf
dc.language en_US
dc.relation AIM-1636
dc.title Restructuring Sparse High Dimensional Data for Effective Retrieval


Files in this item

Files Size Format View

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Advanced Search

Browse

My Account