UMI  
ProQuest® Dissertations & Theses
The world's most comprehensive collection of dissertations and theses. Learn more...
ProQuest  
 
 
Exploring evolutionary heterogeneity with change-point models, Gaussian Markov random fields, and Markov chain induced counting processes
by Minin, Volodymyr, PhD, UNIVERSITY OF CALIFORNIA, LOS ANGELES, 2007, 0 pages; 3257221
 

Abstract: Signatures of spatial variation, left by evolutionary processes in genomic sequences, provide important information about the function and structure of genomic regions. I discuss statistical methods for detection of such signatures in a Bayesian framework. I start with phylogenetic analysis of recombination. I present a recombination detection method that simultaneously incorporates discrepancies in phylogenies, caused by recombination, and spatial variation in evolutionary pressure across the alignment using a dual multiple change-point (DMCP) model. Next, I turn to mapping recombination hot-spots. Based on the DMCP model, I build a hierarchical framework for simultaneous inference of break-point locations and spatial variation in recombination frequency from multiple putative recombinant sequences. To overcome the sparseness of break-point data, dictated by the modest number of available recombinant sequences, I a priori impose a biologically relevant correlation structure on recombination location log-odds via a Gaussian Markov random field. Applied to HIV sequences, this approach reveals a previously unknown recombination hot-spot. I show that GMRF smoothing can also be successfully combined with Kingman's coalescent to estimate temporal variation of the population demographic: history. GMRF temporal smoothing does not require strong prior decisions and is robust to prior perturbations. I apply GMRF smoothing to hepatitis C sequences, contemporaneously sampled in Egypt, and human influenza A hemagglutinin sequences, serially sampled throughout three flu seasons. I conclude with posterior predictive model diagnostics for locating spatial patterns of variation in genomic sequences. The evolutionary counting processes that keep track of a priori labeled mutations provide very useful discrepancy measures for detecting model inadequacies. I take an algorithmic probability approach that allows for an exact and efficient computation of certain properties of evolutionary counting processes. I demonstrate that these properties allow detection of periodic patterns of mutation rate variation in nucleotide sequence alignments.

 
Advisor: Suchard, Marc A.
School: UNIVERSITY OF CALIFORNIA, LOS ANGELES
Source: DAI-B 68/04, p. 2013, Oct 2007
Source Type: PhD
Subjects: Biostatistics; Genetics
Publication Number: 3257221
     
Adobe PDF Access the complete dissertation:
 

» Find an electronic copy at your library.
  Use the link below to access a full citation record of this graduate work:
  http://gateway.proquest.com/openurl%3furl_ver=Z39.88-2004%26res_dat=xri:pqdiss%26rft_val_fmt=info:ofi/fmt:kev:mtx:dissertation%26rft_dat=xri:pqdiss:3257221
  If your library subscribes to the ProQuest Dissertations & Theses (PQDT) database, you may be entitled to a free electronic version of this graduate work. If not, you will have the option to purchase one, and access a 24 page preview for free (if available).

 
 
 

About ProQuest Dissertations & Theses
With over 2.3 million records, the ProQuest Dissertations & Theses (PQDT) database is the most comprehensive collection of dissertations and theses in the world. It is the database of record for graduate research.

The database includes citations of graduate works ranging from the first U.S. dissertation, accepted in 1861, to those accepted as recently as last semester. Of the 2.3 million graduate works included in the database, ProQuest offers more than 1.9 million in full text formats. Of those, over 860,000 are available in PDF format. More than 60,000 dissertations and theses are added to the database each year.

If you have questions, please feel free to visit the ProQuest Web site - http://www.il.proquest.com - or call ProQuest Hotline Customer Support at 1-800-521-3042.



Copyright © 2007 ProQuest. All rights reserved. Terms and Conditions

ProQuest