A part-of-query tagger for search queries
by Chheng, Tommy, M.S., UNIVERSITY OF CALIFORNIA, IRVINE, 2010, 69 pages; 1476704

Abstract:

In this thesis, we explore expanding beyond keyword searches for structured data when using a text field search interface. By associating meaning with each term in a query, a search system can provide more relevant results. This thesis will cover the implementation of a part-of-query tagging system which tags a search query's terms with its associated metadata field. Specifically, we create the system using a unified segmentation and classification algorithm based on a generalized Bayes classifier. We perform evaluations based on a generated test set and achieve an 85% accuracy in tagging the terms in a search query to the correct metadata fields. We incorporate the system into ResearchWatch, an unofficial search engine for NSF research grants and demonstrate the applicability of the system by providing a qualitative comparison to Research.gov, the official NSF research grant search engine.

 
AdvisersRamesh Jain; Bill Tomlinson
SchoolUNIVERSITY OF CALIFORNIA, IRVINE
SourceMAI/ 48-05, p. , Jun 2010
Source TypeThesis
SubjectsInformation science; Computer science
Publication Number1476704
Adobe PDF Access the complete dissertation:
 

» Find an electronic copy at your library.
  Use the link below to access a full citation record of this graduate work:
  http://gateway.proquest.com/openurl%3furl_ver=Z39.88-2004%26res_dat=xri:pqdiss%26rft_val_fmt=info:ofi/fmt:kev:mtx:dissertation%26rft_dat=xri:pqdiss:1476704
  If your library subscribes to the ProQuest Dissertations & Theses (PQDT) database, you may be entitled to a free electronic version of this graduate work. If not, you will have the option to purchase one, and access a 24 page preview for free (if available).

About ProQuest Dissertations & Theses
With over 2.3 million records, the ProQuest Dissertations & Theses (PQDT) database is the most comprehensive collection of dissertations and theses in the world. It is the database of record for graduate research.

The database includes citations of graduate works ranging from the first U.S. dissertation, accepted in 1861, to those accepted as recently as last semester. Of the 2.3 million graduate works included in the database, ProQuest offers more than 1.9 million in full text formats. Of those, over 860,000 are available in PDF format. More than 60,000 dissertations and theses are added to the database each year.

If you have questions, please feel free to visit the ProQuest Web site - http://www.proquest.com - or call ProQuest Hotline Customer Support at 1-800-521-3042.