Image segmentation and contextual modeling for object recognition
by Rabinovich, Andrew, Ph.D., UNIVERSITY OF CALIFORNIA, SAN DIEGO, 2008, 107 pages; 3331411

Abstract:

Recognizing objects is an essential part of navigating through the visual world. Identifying objects and finding boundaries between them provides us with some of the richest sensory information. Similarly, image segmentation and object recognition are among the most fundamental problems in computer vision and machine intelligence. The potential interaction between these processes has been discussed for many years. The usefulness of recognition for segmentation was demonstrated with various top-down segmentation algorithms; however, the impact of bottom-up image segmentation for object recognition is not well understood. One impeding factor is the unsatisfactory quality of image segmentation algorithms. In this work, we take advantage of a recently proposed method for computing multiple stable segmentations and illustrate the application of bottom-up image segmentation as a preprocessing step for object recognition.

In parallel to segmentation, the task of visual object recognition is often greatly facilitated by the objects' surroundings. Contextual information can play the very important role of reducing ambiguity in objects' visual appearance. In this dissertation, we propose a new model for object recognition that incorporates two types of context—co-occurrence and relative location—with local appearance-based features, thus named CoLA (for Co-occurrence, Location and Appearance).

Since a number of contextual models for recognition have been proposed in the recent history, it is necessary to compare the newly proposed model to the existing ones. Over the years, two general kinds of such models have emerged: those with contextual inference based on the statistical summary of the scene, and models representing the context in terms of relationships among objects in the image. Understanding the theoretical and practical properties of such approaches is essential in designing object recognition systems. We provide an analytical analysis of these models and evaluate them empirically.

 
AdviserSerge J. Belongie
SchoolUNIVERSITY OF CALIFORNIA, SAN DIEGO
SourceDAI/B 69-11, p. , Dec 2008
Source TypeDissertation
SubjectsArtificial intelligence; Computer science
Publication Number3331411
Adobe PDF Access the complete dissertation:
 

» Find an electronic copy at your library.
  Use the link below to access a full citation record of this graduate work:
  http://gateway.proquest.com/openurl%3furl_ver=Z39.88-2004%26res_dat=xri:pqdiss%26rft_val_fmt=info:ofi/fmt:kev:mtx:dissertation%26rft_dat=xri:pqdiss:3331411
  If your library subscribes to the ProQuest Dissertations & Theses (PQDT) database, you may be entitled to a free electronic version of this graduate work. If not, you will have the option to purchase one, and access a 24 page preview for free (if available).

About ProQuest Dissertations & Theses
With over 2.3 million records, the ProQuest Dissertations & Theses (PQDT) database is the most comprehensive collection of dissertations and theses in the world. It is the database of record for graduate research.

The database includes citations of graduate works ranging from the first U.S. dissertation, accepted in 1861, to those accepted as recently as last semester. Of the 2.3 million graduate works included in the database, ProQuest offers more than 1.9 million in full text formats. Of those, over 860,000 are available in PDF format. More than 60,000 dissertations and theses are added to the database each year.

If you have questions, please feel free to visit the ProQuest Web site - http://www.proquest.com - or call ProQuest Hotline Customer Support at 1-800-521-3042.