UMI  
ProQuest® Dissertations & Theses
The world's most comprehensive collection of dissertations and theses. Learn more...
ProQuest  
 
 
Statistical methods for evaluating linkage disequilibrium and its patterns using length of haplotype sharing
by Wang, Yan, PhD, UNIVERSITY OF CALIFORNIA, BERKELEY, 2006, 0 pages; 3228519
 

Abstract: High-throughput genotyping technologies for single nucleotide polymorphisms (SNP) have stimulated much interest in studying genomewide linkage disequilibrium (LD) patterns. Conventional LD measures, such as D' and r2, are two-point measurements and their relationship with physical distance is highly noisy. We propose a new LD measure, Δ, defined in terms of the correlation coefficient for shared haplotype lengths around two loci, thereby borrowing information from multiple loci. A U-statistic-based estimator of Δ, which takes into consideration the dependence structure of the observed data, is developed and compared to an estimator based on the usual empirical correlation coefficient. Furthermore, we propose methods for inferring LD decay rates and recombination hotspots based on the new LD measure Δ. The results from coalescent simulation studies and analysis of HapMap SNP data demonstrate that the proposed estimators of Δ are superior to the two most popular conventional LD measures, in terms of their close relationship with physical distance and recombination rate, their small variability, and their strong robustness to marker allele frequencies. The haplotype block models, supported by both human genetic theory and empirical studies, have recently received much attention to describe LD patterns in genomic regions. However, the models proposed to date tend to be overly simple for the complex LD patterns presented by human genomic data. In addition, the identified block boundaries are usually very sensitive to the operational definition of a haplotype block and to marker density and allele frequency. We propose a hierarchical haplotype block model, which accommodates nested block structures, and a recursive partitioning algorithm, DHPBlocker , to identify the corresponding hierarchical block structure. The proposed algorithm is applied to two sets of human SNP data and our results reconcile the seemingly inconsistent results from existing methods, suggesting that the hierarchical block model is a more precise and flexible model, with the capacity of dealing with the complexity of LD patterns throughout the human genome. Moreover, analyses performed on different subsets of markers indicate that the DHPBlocker generates consistent global block structures which are robust to marker allele frequency and density.

 
Advisor: Dudoit, Sandrine
School: UNIVERSITY OF CALIFORNIA, BERKELEY
Source: DAI-B 67/08, p. 4191, Feb 2007
Source Type: PhD
Subjects: Biostatistics; Genetics
Publication Number: 3228519
     
Adobe PDF Access the complete dissertation:
 

» Find an electronic copy at your library.
  Use the link below to access a full citation record of this graduate work:
  http://gateway.proquest.com/openurl%3furl_ver=Z39.88-2004%26res_dat=xri:pqdiss%26rft_val_fmt=info:ofi/fmt:kev:mtx:dissertation%26rft_dat=xri:pqdiss:3228519
  If your library subscribes to the ProQuest Dissertations & Theses (PQDT) database, you may be entitled to a free electronic version of this graduate work. If not, you will have the option to purchase one, and access a 24 page preview for free (if available).

 
 
 

About ProQuest Dissertations & Theses
With over 2.3 million records, the ProQuest Dissertations & Theses (PQDT) database is the most comprehensive collection of dissertations and theses in the world. It is the database of record for graduate research.

The database includes citations of graduate works ranging from the first U.S. dissertation, accepted in 1861, to those accepted as recently as last semester. Of the 2.3 million graduate works included in the database, ProQuest offers more than 1.9 million in full text formats. Of those, over 860,000 are available in PDF format. More than 60,000 dissertations and theses are added to the database each year.

If you have questions, please feel free to visit the ProQuest Web site - http://www.il.proquest.com - or call ProQuest Hotline Customer Support at 1-800-521-3042.



Copyright © 2007 ProQuest. All rights reserved. Terms and Conditions

ProQuest