Decoding the protein-DNA recognition rules
by Temiz, Nuri Alpay, Ph.D., UNIVERSITY OF PITTSBURGH, 2009, 110 pages; 3375347

Abstract:

Transcription factors (TF) are key proteins involved in gene regulation by binding to specific DNA sites. The C2H2 zinc finger (ZF) TFs form the largest family of DNA binding proteins in eukaryotes and are a key participant in the regulation of most genes. A major obstacle towards understanding the molecular basis of transcriptional regulation is the lack of a general recognition code for protein-DNA interactions. In this thesis, we aim to understand molecular mechanisms of DNA binding/recognition by TFs and to quantitatively estimate recognition rules for TF-DNA interactions. To this aim, we first identified key residues that play an important role in ZF-DNA binding and studied their dynamics prior to binding using molecular dynamics (MD) simulations. We found that key residues that are buried upon complexation are prealigned to conformations close to their bound state prior to binding. The bound-like behavior of some of these residues is found to be dependent on the ion concentration of the system, consistent with experimental observations of increased binding affinity with increased ionic strength in protein-DNA interactions. We identified a binding site for Cl- ions located in the same pocket where DNA phosphates are found most buried in the complex structure of ZFs. Bound Cl- ions constrain key side chains in conformations similar to those observed when interacting with the phosphates. These results suggest that ZFs are able to maintain bound like conformations of key residues upon encountering the DNA hinting at a general mechanism to rapidly form encounter complexes amenable for a fast readout of the DNA. Next, we develop a novel experimentally-based approach using high quality crystal structures and binding data on the promiscuous family of C2H2 zinc fingers (ZF) and decode ten fundamental specific interactions responsible for protein-DNA recognition. The interactions include five hydrogen bond types, three atomic desolvation penalties, a favorable non-polar energy, and a novel water accessibility factor. We apply this code to three large data sets containing a total of 89 C2H2 TF mutants on the three ZFs of EGR. Guided by MD simulations of individual ZFs, we map the interactions into homology models that embody all feasible intra- and inter-molecular bonds, selecting for each sequence the structure with the lowest free energy. The interactions reproduce the change in affinity of 35 mutants of finger I (R2 = 0.99), as well as two independent validation sets of 23 mutants of finger II (R 2 = 0.97) and 31 human ZFs on finger IIII (R2 = 0.95). More importantly, the method predicts bound ZF-DNA complexes for all 89 mutants, decoding molecular basis of EGR-DNA specificity. Our findings reveal recognition rules that depend on DNA sequence/structure, molecular water at the interface and induced fit of the C2H2 TFs. Collectively, our method provides the first robust framework to decode the molecular basis of TFs binding to DNA.

 
AdviserCarlos J. Camacho
SchoolUNIVERSITY OF PITTSBURGH
SourceDAI/B 70-10, p. , Nov 2009
Source TypeDissertation
SubjectsMolecular biology; Biochemistry
Publication Number3375347
Adobe PDF Access the complete dissertation:
 

» Find an electronic copy at your library.
  Use the link below to access a full citation record of this graduate work:
  http://gateway.proquest.com/openurl%3furl_ver=Z39.88-2004%26res_dat=xri:pqdiss%26rft_val_fmt=info:ofi/fmt:kev:mtx:dissertation%26rft_dat=xri:pqdiss:3375347
  If your library subscribes to the ProQuest Dissertations & Theses (PQDT) database, you may be entitled to a free electronic version of this graduate work. If not, you will have the option to purchase one, and access a 24 page preview for free (if available).

About ProQuest Dissertations & Theses
With over 2.3 million records, the ProQuest Dissertations & Theses (PQDT) database is the most comprehensive collection of dissertations and theses in the world. It is the database of record for graduate research.

The database includes citations of graduate works ranging from the first U.S. dissertation, accepted in 1861, to those accepted as recently as last semester. Of the 2.3 million graduate works included in the database, ProQuest offers more than 1.9 million in full text formats. Of those, over 860,000 are available in PDF format. More than 60,000 dissertations and theses are added to the database each year.

If you have questions, please feel free to visit the ProQuest Web site - http://www.proquest.com - or call ProQuest Hotline Customer Support at 1-800-521-3042.