Domains as Community Structure within Ontologies: Towards a Macro-Structure Model of Knowledge
by Lane, Kevin Keith, Ph.D., UNIVERSITY OF CALIFORNIA, LOS ANGELES, 2011, 185 pages; 3501918

Abstract:

Knowledge is commonly modeled as a network of inter-related concepts. The Semantic Web and Linked Data arcs of research have generated networks of 25 billion nodes built on this linked-concept foundation. Substantial research efforts fuel the effort to expand the size and breadth of these data sets. Other research efforts have focused on building systems that utilize the data sets. Still other research uses the data sets as an object of study in and of themselves. This study explores the topology of a large-scale knowledge network to identify clusters of connected concepts known, in network-theoretic terms, as community structure.

Specifically, this study extends current research by investigating domains as community structures within ontologies. First, I frame ontologies as operationalizations of knowledge. This orientation reflects the extensive scholarship that shows that modeling knowledge as a network of unweighted relations between two concepts provides a useful, if imperfect, framework.

I hypothesize that domains emerge as clusters of tightly connected concepts within large-scale ontologies, and I adopt the community structure paradigm for identifying those clusters within the YAGO ontology and several variants. I demonstrate that standard network-theoretic community structure methods can identify domains within an ontology.

Quantitatively, this study provides evidence that domains can be iteratively extracted from large-scale ontologies, suggesting a hierarchy of domains that correspond to domains of varying granularity. Further, the study provides evidence for structural features such as domains, facets and upper ontologies.

Qualitatively, the study captures the bias in the YAGO ontology toward logico-deductive assertions in contrast to associative relations between concepts. The YAGO dataset is comprised of factual assertions and extracted hyperlinks which omit some actual relations between concepts and oversimplify the strength of other relations.

Methodologically, community structure algorithms are shown to produce varying results in the identification of clusters of concepts, even when using modularity-optimizing techniques. This phenomenon may indicate a need for a more robust model of ontology structure and/or employment of alternate community structure identification approaches that are emerging from network theorists.

 
AdviserGregory H. Leazer
SchoolUNIVERSITY OF CALIFORNIA, LOS ANGELES
SourceDAI/A 73-07(E), p. , Mar 2012
Source TypeDissertation
SubjectsCognitive psychology; Information science; Computer science
Publication Number3501918
Adobe PDF Access the complete dissertation:
 

» Find an electronic copy at your library.
  Use the link below to access a full citation record of this graduate work:
  http://gateway.proquest.com/openurl%3furl_ver=Z39.88-2004%26res_dat=xri:pqdiss%26rft_val_fmt=info:ofi/fmt:kev:mtx:dissertation%26rft_dat=xri:pqdiss:3501918
  If your library subscribes to the ProQuest Dissertations & Theses (PQDT) database, you may be entitled to a free electronic version of this graduate work. If not, you will have the option to purchase one, and access a 24 page preview for free (if available).

About ProQuest Dissertations & Theses
With over 2.3 million records, the ProQuest Dissertations & Theses (PQDT) database is the most comprehensive collection of dissertations and theses in the world. It is the database of record for graduate research.

The database includes citations of graduate works ranging from the first U.S. dissertation, accepted in 1861, to those accepted as recently as last semester. Of the 2.3 million graduate works included in the database, ProQuest offers more than 1.9 million in full text formats. Of those, over 860,000 are available in PDF format. More than 60,000 dissertations and theses are added to the database each year.

If you have questions, please feel free to visit the ProQuest Web site - http://www.proquest.com - or call ProQuest Hotline Customer Support at 1-800-521-3042.