UMI  
ProQuest® Dissertations & Theses
The world's most comprehensive collection of dissertations and theses. Learn more...
ProQuest  
 
 
Managing large scale distributed data with peer-to-peer search trees
by Zhang, Chi, PhD, PRINCETON UNIVERSITY, 2007, 0 pages; 3250045
 

Abstract: Massive data sets have been increasingly shared by distributed users following the success of broad-band Internet services. Indexing and complex queries are essential for such data collections in applications like multimedia retrieval, data mining, and spatial databases. Traditional service infrastructures rely on centralized management and query processing. This approach does not scale with the rapidly increasing amount of application data available on massively distributed systems like the Internet. In recent years, peer-to-peer computing has emerged as a promising model for building large, scalable systems. A number of peer-to-peer systems have been proposed by the research community, typically taking the form of Distributed Hash Tables (DHT). Such systems provide excellent scalability and resilience for storage applications. However, they do not provide complex query capabilities beyond exact matching. This thesis proposes Brushwood, a peer-to-peer infrastructure for building distributed search trees to support complex queries. Users define how their tree nodes handle index queries and insertion requests. The execution of query/insertion operations and maintenace of the search tree are implemented by Brushwood infrastructure, which guarantees logarithmic cost with decentralized tree navigation and maintenance, even in the face of skewed data sets and unbalanced tree structures. The platform is flexible to support diversified applications. I present two examples in this thesis: an index service for high dimensional data, and a content-based publish/subscribe system. The high-dimensional index supports range queries and nearest neighbor search based on Brushwood tree navigation. The publish/subscribe system supports complex sementics for matching dynamic events to user subscriptions. They use significantly different index structures built upon the same Brushwood platform. Experimental results demonstrated the versatility and scalability of the systems.

 
Advisor: Singh, Jaswinder Pal
School: PRINCETON UNIVERSITY
Source: DAI-B 68/01, p. 414, Jul 2007
Source Type: PhD
Subjects: Computer science
Publication Number: 3250045
     
Adobe PDF Access the complete dissertation:
 

» Find an electronic copy at your library.
  Use the link below to access a full citation record of this graduate work:
  http://gateway.proquest.com/openurl%3furl_ver=Z39.88-2004%26res_dat=xri:pqdiss%26rft_val_fmt=info:ofi/fmt:kev:mtx:dissertation%26rft_dat=xri:pqdiss:3250045
  If your library subscribes to the ProQuest Dissertations & Theses (PQDT) database, you may be entitled to a free electronic version of this graduate work. If not, you will have the option to purchase one, and access a 24 page preview for free (if available).

 
 
 

About ProQuest Dissertations & Theses
With over 2.3 million records, the ProQuest Dissertations & Theses (PQDT) database is the most comprehensive collection of dissertations and theses in the world. It is the database of record for graduate research.

The database includes citations of graduate works ranging from the first U.S. dissertation, accepted in 1861, to those accepted as recently as last semester. Of the 2.3 million graduate works included in the database, ProQuest offers more than 1.9 million in full text formats. Of those, over 860,000 are available in PDF format. More than 60,000 dissertations and theses are added to the database each year.

If you have questions, please feel free to visit the ProQuest Web site - http://www.il.proquest.com - or call ProQuest Hotline Customer Support at 1-800-521-3042.



Copyright © 2007 ProQuest. All rights reserved. Terms and Conditions

ProQuest