Opinion summarization: Automatically creating useful representations of the opinions expressed in text
by Stoyanov, Veselin Stoyanov, Ph.D., CORNELL UNIVERSITY, 2009, 176 pages; 3376679

Abstract:

Opinion analysis is concerned with extracting information about attitudes, beliefs, emotions, opinions, evaluations and sentiment expressed in texts. To date, research in the area of opinion analysis has focused on developing methods for the automatic extraction of opinions and their attributes. While this opinion information is useful, its true potential can be realized only after it is consolidated (summarized) in a meaningful way: the raw information contained in individual opinions is often incomplete and their number is overwhelming.

Until now, the task of domain-independent opinion summarization has received little research attention. We address this void by proposing methods for opinion summarization. Toward that end, we formulate new approaches for the problems of determining what opinions should be attributed to the same source (source coreference resolution) and whether opinions are on the same topic (topic identification/coreference resolution). Additionally, we introduce novel evaluation metrics for the quantitative evaluation of the quality of complete opinion summaries. Finally, we describe and evaluate OASIS, the first opinion summarization system known to us that produces domain-independent, non-extract-based summaries. Results for the individual components are encouraging, and the overall summaries produced by OASIS outperform a competitive baseline by a large margin when we put more emphasis on computing an aggregate summary during evaluation.

Adviser: Claire T. Cardie
School: CORNELL UNIVERSITY
Source: DAI/B 70-10, p. , Nov 2009
Source Type: Dissertation
Subjects: Computer science
Publication Number: 3376679
Access the complete dissertation:

» Find an electronic copy at your library.
  Use the link below to access a full citation record of this graduate work:
  http://gateway.proquest.com/openurl%3furl_ver=Z39.88-2004%26res_dat=xri:pqdiss%26rft_val_fmt=info:ofi/fmt:kev:mtx:dissertation%26rft_dat=xri:pqdiss:3376679
  If your library subscribes to the ProQuest Dissertations & Theses (PQDT) database, you may be entitled to a free electronic version of this graduate work. If not, you will have the option to purchase one and to access a 24-page preview for free (if available).
