UMI  
ProQuest® Dissertations & Theses
The world's most comprehensive collection of dissertations and theses. Learn more...
ProQuest  
 
 
High resolution force fields and residue contact prediction models for protein structure prediction
by Rajgaria, Rohit, Ph.D., PRINCETON UNIVERSITY, 2009, 309 pages; 3356744
 

Abstract:

The process by which a protein acquires its stable and functional three dimensional structure is referred as the protein folding process. The understanding of protein folding process is one of the most important and challenging problems in computational biology. Various approaches to determine the three dimensional native structure of a protein from its amino acid sequence have been proposed. Some of these methods use existing experimentally-determined structures whereas, other ab initio methods do not rely on existing structures for their predictions.

Once the predictions have been made, it is very important to identify the best structure (most similar to the native structure) from an ensemble of predicted structures. Anfinsen's hypothesis states that the native structure corresponds to the global Gibbs free energy minima. A linear programming based model, that uses this hypothesis as the main criterion, has been developed to generate a C? -C? distance dependent force field. A diverse protein set and an improved decoy generation technique was employed to generate a challenging set of high quality training decoys. The C? -C? distance dependent force field generated using this model was found to be very successful in selecting native structures from an ensemble of high resolution conformers. Another linear programming based model has been developed to generate a side chain centroid distance dependent force field that includes the presence of side chain atoms of a residue. This force field was found to be more successful in discriminating between the native and non-native structures of a protein. These force fields can also be used for fold recognition and de novo protein design.

Protein structure prediction using first principles methods is very difficult and challenging because of the enormity of the conformational search space that needs to be searched. Any information that can reduce the conformational search space can potentially make the structure prediction method more efficient. In this thesis, two models have been developed to predict contacts between non-local residues of a protein. The first model uses an integer linear optimization formulation to predict non-local hydrophobic contacts of an ?-helical protein. The second model also uses an integer linear optimization formulation to predict contacts between non-local residues of ?, ? + ?, and ?/? proteins. The predicted contacts can be used to generate distance bounds between contacting residues. These bounds can prove very useful for first principles methods like ASTRO-FOLD to reduce the protein conformational search space. The problem of improving the accuracy of predicted contacts has been addressed by generating an optimal set of filters. The selection of optimal filters has been formulated as an integer linear optimization problem. The usefulness and effectiveness of proposed models for tertiary structure prediction has been tested and validated using a set of test proteins including test cases from blind protein structure prediction experiments.

 
Advisor: Floudas, Christodoulos A.
School: PRINCETON UNIVERSITY
Source: DAI-B 70/05, p. , Nov 2009
Source Type: Ph.D.
Subjects: Chemical engineering
Publication Number: 3356744
     
Adobe PDF Access the complete dissertation:
 

» Find an electronic copy at your library.
  Use the link below to access a full citation record of this graduate work:
  http://gateway.proquest.com/openurl%3furl_ver=Z39.88-2004%26res_dat=xri:pqdiss%26rft_val_fmt=info:ofi/fmt:kev:mtx:dissertation%26rft_dat=xri:pqdiss:3356744
  If your library subscribes to the ProQuest Dissertations & Theses (PQDT) database, you may be entitled to a free electronic version of this graduate work. If not, you will have the option to purchase one, and access a 24 page preview for free (if available).

 
 
 

About ProQuest Dissertations & Theses
With over 2.3 million records, the ProQuest Dissertations & Theses (PQDT) database is the most comprehensive collection of dissertations and theses in the world. It is the database of record for graduate research.

The database includes citations of graduate works ranging from the first U.S. dissertation, accepted in 1861, to those accepted as recently as last semester. Of the 2.3 million graduate works included in the database, ProQuest offers more than 1.9 million in full text formats. Of those, over 860,000 are available in PDF format. More than 60,000 dissertations and theses are added to the database each year.

If you have questions, please feel free to visit the ProQuest Web site - http://www.il.proquest.com - or call ProQuest Hotline Customer Support at 1-800-521-3042.



Copyright © 2007 ProQuest. All rights reserved. Terms and Conditions

ProQuest