Effects of DNA Shape and Sequence of Protein-DNA Recognition
by West, Sean Matthew, Ph.D., COLUMBIA UNIVERSITY, 2011, 154 pages; 3451526

Abstract:

Protein-DNA recognition is crucial for a variety of biological processes such as transcription and regulation of genome accessibility. Proteins may recognize DNA sequence through specific hydrogen bonds to the bases. However, proteins often recognize DNA shape by interacting with DNA which is deformed. This interplay between the recognition of DNA sequence and shape necessitates examination of both sequence and structure as they relate to protein-DNA recognition.

Analysis of all X-ray crystallographic structures of proteins which contact the DNA revealed a widespread readout mechanism involving shape readout of the DNA minor groove. The width of the DNA backbone along the minor groove is tightly linked to the electrostatic potential in and around the minor groove, even more than to the base identity. Proteins are able to use basic amino acids, most commonly arginine, to recognize narrow minor grooves, which have increased negative electrostatic potential. Often, AT-rich sequences, which are intrinsically narrow, are recognized in this manner. Furthermore, arginines are used to stabilize the highly deformed DNA of the nucleosome by inserting into narrow minor grooves of the DNA as it wraps around the histone octamer.

In our analysis of structures of nucleosome core particles, two drastically different DNA sequences gave rise to similar minor groove widths, despite variance of the helical parameters. This indicates that the shape of DNA is being induced to conform to the histone-DNA interface. However, histone octamers do not bind in a completely non-specific manner; there are regions of the genome which have no nucleosome occupancy as well as regions which have strongly positioned nucleosomes. Our research shows the in vivo positioning of nucleosomes along the genome is related to the frequency and length of A-tracts.

Our results demonstrate the importance of using both sequence and structural information in studying protein-DNA recognition. By analyzing all available protein-DNA structures, as well as the sequences bound by proteins, we have contributed to the community's understanding of protein-DNA recognition.

 
Advisor
SchoolCOLUMBIA UNIVERSITY
SourceDAI/B 72-06, p. , May 2011
Source TypeDissertation
SubjectsMolecular biology; Genetics; Biophysics; Computer science
Publication Number3451526
Adobe PDF Access the complete dissertation:
 

» Find an electronic copy at your library.
  Use the link below to access a full citation record of this graduate work:
  http://gateway.proquest.com/openurl%3furl_ver=Z39.88-2004%26res_dat=xri:pqdiss%26rft_val_fmt=info:ofi/fmt:kev:mtx:dissertation%26rft_dat=xri:pqdiss:3451526
  If your library subscribes to the ProQuest Dissertations & Theses (PQDT) database, you may be entitled to a free electronic version of this graduate work. If not, you will have the option to purchase one, and access a 24 page preview for free (if available).

About ProQuest Dissertations & Theses
With over 2.3 million records, the ProQuest Dissertations & Theses (PQDT) database is the most comprehensive collection of dissertations and theses in the world. It is the database of record for graduate research.

The database includes citations of graduate works ranging from the first U.S. dissertation, accepted in 1861, to those accepted as recently as last semester. Of the 2.3 million graduate works included in the database, ProQuest offers more than 1.9 million in full text formats. Of those, over 860,000 are available in PDF format. More than 60,000 dissertations and theses are added to the database each year.

If you have questions, please feel free to visit the ProQuest Web site - http://www.proquest.com - or call ProQuest Hotline Customer Support at 1-800-521-3042.