Computational approaches to problems in protein structure and function
by Kingsford, Carleton L., Ph.D., PRINCETON UNIVERSITY, 2005, 168 pages; 3188669
 

Abstract:

We present computational approaches to solve several problems arising in protein structure and function.

In the first part of this thesis, we develop a new method for finding the lowest-energy positions of side chains given the backbone of a protein, a widely studied problem with applications in homology modeling and protein design. We present an integer linear programming formulation of side-chain positioning and relax it to give a polynomial-time linear programming heuristic that allows us to tackle large problems. We test the integer and linear programming approaches in two settings. On native and homologous backbones, we show that optimal solutions can usually be found using linear programming. In protein redesign, we find that instances often cannot be solved using linear programming directly, but that optimal solutions for large instances can be found using the more expensive integer programming procedure. We also present an alternative formulation of the side-chain positioning problem as a semidefinite program, which provides a tighter relaxation than the linear program. We introduce two novel rounding schemes to convert fractional solutions of the semidefinite program into choices of rotamers and provide some theoretical justification for their effectiveness. We extensively test the semidefinite programming formulation and rounding schemes on simulated data and on the redesign of two naturally occurring protein cores and show that the approach finds good solutions.
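
To make the optimization problem concrete, the following is a minimal sketch of the side-chain positioning objective: pick one rotamer per residue position so that the sum of self energies and pairwise interaction energies is minimized. It uses exhaustive enumeration, which is not the integer, linear, or semidefinite programming method of the thesis and is only feasible for toy instances. The function names, the data layout, and all energy values are assumptions made for illustration.

```python
from itertools import product


def total_energy(choice, self_energy, pair_energy):
    """Energy of one assignment: self terms plus pairwise interaction terms.

    choice[i] is the index of the rotamer picked at position i.
    pair_energy maps (i, rotamer_i, j, rotamer_j) with i < j to an energy;
    pairs that are not listed are assumed not to interact.
    """
    energy = sum(self_energy[i][r] for i, r in enumerate(choice))
    for i in range(len(choice)):
        for j in range(i + 1, len(choice)):
            energy += pair_energy.get((i, choice[i], j, choice[j]), 0.0)
    return energy


def best_assignment(self_energy, pair_energy):
    """Enumerate every combination of rotamers and return the cheapest one."""
    candidates = product(*(range(len(rotamers)) for rotamers in self_energy))
    return min(candidates, key=lambda c: total_energy(c, self_energy, pair_energy))


# Hypothetical toy instance: 3 positions with 2 candidate rotamers each.
self_energy = [[1.0, 0.5], [0.2, 0.9], [0.0, 0.3]]
pair_energy = {
    (0, 1, 1, 0): 2.0,   # made-up clash between two rotamers
    (1, 1, 2, 0): -0.5,  # made-up favorable contact
}

best = best_assignment(self_energy, pair_energy)
print(best, total_energy(best, self_energy, pair_energy))
```

In this toy instance, picking the rotamer with the lowest self energy at each position independently gives a worse total than the joint optimum, because of the pairwise clash term. The number of assignments grows exponentially with the number of positions, which is why mathematical programming formulations and their relaxations are of interest for large instances.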

The second part of this thesis considers the problem of finding transcription factor binding sites by locating a collection of mutually similar subsequences within the upstream DNA sequences of genes. Our approach to side-chain positioning can be recast to solve this problem, and it has previously been shown that this is a promising direction to pursue. We improve the mathematical programming formulation to find binding sites up to 45 times faster.
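
The recasting mentioned above can be illustrated with a small sketch: each upstream sequence plays the role of a residue position, each fixed-width window in it plays the role of a candidate rotamer, and the pairwise "energy" is a dissimilarity between two windows. The sketch below uses Hamming distance and exhaustive search; the scoring function, the window width, and the example sequences are assumptions for illustration and are not taken from the thesis.

```python
from itertools import combinations, product


def hamming(a, b):
    """Number of mismatching characters between two equal-length strings."""
    return sum(x != y for x, y in zip(a, b))


def windows(seq, width):
    """All contiguous subsequences of the given width."""
    return [seq[i:i + width] for i in range(len(seq) - width + 1)]


def best_sites(sequences, width):
    """Pick one window per sequence minimizing the sum of pairwise distances."""
    per_sequence = [windows(s, width) for s in sequences]

    def cost(choice):
        return sum(hamming(a, b) for a, b in combinations(choice, 2))

    return min(product(*per_sequence), key=cost)


# Hypothetical upstream sequences sharing one planted 6-mer.
upstream = ["ACGTTGACA", "TTGACAGGC", "CCATTGACA"]
sites = best_sites(upstream, 6)
print(sites)
```

As with side-chain positioning, exhaustive search is only practical for very small inputs; the point of the sketch is the shape of the problem, one choice per sequence with a pairwise objective.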

Finally, in the last part of the thesis, we investigate protein function more broadly and give extensions to the popular phylogenetic profile method for predicting shared function from cross-genomic evolutionary history. For many biological functions, our methods are better able to identify functionally linked proteins than previously introduced methods.
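
For readers unfamiliar with phylogenetic profiles, here is a minimal sketch of the basic idea that the thesis extends: each protein is represented by a presence/absence vector across a set of genomes, and proteins with similar vectors are predicted to be functionally linked. The sketch ranks protein pairs by Hamming distance between profiles. It does not implement the extensions described in the thesis, and the protein names and profile values are made up.

```python
from itertools import combinations


def profile_distance(p, q):
    """Number of genomes in which exactly one of the two proteins is present."""
    return sum(a != b for a, b in zip(p, q))


def ranked_pairs(profiles):
    """Protein pairs sorted from most similar to least similar profile."""
    pairs = combinations(sorted(profiles), 2)
    return sorted(
        pairs,
        key=lambda pair: profile_distance(profiles[pair[0]], profiles[pair[1]]),
    )


# Hypothetical profiles over 6 genomes (1 = homolog present, 0 = absent).
profiles = {
    "protA": (1, 0, 1, 1, 0, 1),
    "protB": (1, 0, 1, 1, 0, 1),
    "protC": (0, 1, 0, 1, 1, 0),
    "protD": (1, 0, 1, 0, 0, 1),
}

ranking = ranked_pairs(profiles)
for a, b in ranking:
    print(a, b, profile_distance(profiles[a], profiles[b]))
```

Pairs at the top of the ranking share the same pattern of presence and absence across genomes and would be the first candidates for a shared function under this simple scheme.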

 
Advisor: Singh, Mona
School: PRINCETON UNIVERSITY
Source: DAI-B 66/09, p. , Mar 2006
Source Type: Ph.D.
Subjects: Computer science; Molecular biology
Publication Number: 3188669
     
  Use the link below to access a full citation record of this graduate work:
  http://gateway.proquest.com/openurl%3furl_ver=Z39.88-2004%26res_dat=xri:pqdiss%26rft_val_fmt=info:ofi/fmt:kev:mtx:dissertation%26rft_dat=xri:pqdiss:3188669