Using analogy to model spatial language use and multimodal knowledge capture
by Lockwood, Kate, Ph.D., NORTHWESTERN UNIVERSITY, 2009, 189 pages; 3386521

Abstract:

Language and knowledge capture are two skills that allow us to create structured representations of the physical world for reasoning and communication. Spatial prepositions are a form of specialized language that is used to relate two objects in space. In addition to communicating the static location of objects, spatial prepositions contain layers of information about the interactions and potential interactions between objects, agents, and their environments. While a large amount of information is encoded in spatial prepositions, the components of scene that contribute to their use are only a fraction of the possible features available. In order to learn the correct spatial prepositions categories in a language, a learner must figure out how to abstract the important concepts without being distracted by surface features.

When communicating about complex spatial relationships, a diagram can often, as the saying goes, "be worth a thousand words". Diagrams can communicate complex spatial concepts concisely, which is why they are often used in educational materials. Information from text accompanying diagrams must be integrated with the spatial information from the diagram to create a cohesive understanding of the concepts being communicated. This process is called multimodal knowledge capture. It is multimodal because the information being captured is presented in two different modalities: text and diagrams. Often spatial language, in particular spatial prepositions, in the text provides clues about how to best integrate the different representations.

This dissertation addresses computational modeling of both learning spatial prepositions and multimodal knowledge capture. In particular, it examines the role that structure-mapping plays in both tasks. In spatial preposition use, sequential generalization over multiple scenes results in the abstraction of core category concepts. In multimodal knowledge capture, structure mapping provides a framework for integrating structured representations from different modalities.

 
AdviserKen Forbus
SchoolNORTHWESTERN UNIVERSITY
SourceDAI/B 70-12, p. , Feb 2010
Source TypeDissertation
SubjectsArtificial intelligence; Computer science
Publication Number3386521
Adobe PDF Access the complete dissertation:
 

» This is an open access dissertation.
  Use the link below to access the full text PDF of this graduate work:
  http://gradworks.umi.com/3386521.pdf
  Use the link below to search and retrieve all open access dissertations:
  http://pqdtopen.proquest.com

About ProQuest Dissertations & Theses
With over 2.3 million records, the ProQuest Dissertations & Theses (PQDT) database is the most comprehensive collection of dissertations and theses in the world. It is the database of record for graduate research.

The database includes citations of graduate works ranging from the first U.S. dissertation, accepted in 1861, to those accepted as recently as last semester. Of the 2.3 million graduate works included in the database, ProQuest offers more than 1.9 million in full text formats. Of those, over 860,000 are available in PDF format. More than 60,000 dissertations and theses are added to the database each year.

If you have questions, please feel free to visit the ProQuest Web site - http://www.proquest.com - or call ProQuest Hotline Customer Support at 1-800-521-3042.