Information retrieval and knowledge-based methods for drug discovery
by Wale, Nikil, Ph.D., UNIVERSITY OF MINNESOTA, 2008, 103 pages; 3332460

Abstract:

Drug discovery is an expensive process. It has been estimated that a new drug compound that is introduced in the market after FDA approval carries a cost of approximately $800 million from the conception of target implicated for a disease to successful identification of chemical entity or drug that is successful in human trials. And this number does not even include expenses incurred for compounds that fail in the later stages of drug discovery due to problems such an toxicity, lack of efficacy in humans (although it shows potency in in-vitro experiments), poor physical properties that make it unsuitable for absorption, distribution at the required site, metabolism or excretion in human body. There is an urgent need to cut the cost of developing new drugs (to bring overall cost lower for the producers and consumers alike) by identifying promising candidate drug compounds in the early stages of drug discovery. In order to achieve this objective, in recent years the development of computational techniques that build models to correctly assign chemical compounds to various classes, to retrieve potential drug-like compounds (hits), or to identify promising new targets for different diseases has been an active area of research.

In this thesis we study and develop algorithms for various problems in the field of computational drug discovery that show state-of-the-art performance. Specifically, we develop methods in three key areas of drug discovery - representation of chemical compounds for classification and retrieval (termed as Effective Descriptor-Space Representation Problem), identification of diverse set bioactives for a given query (termed as Scaffold-Hopping Problem) and identification of likely targets for given (set) of chemical compounds (termed as Target Identification Problem). All of these methods utilize experimental data on chemical compounds binding to various assays and is derived from in-vitro as well as in-vivo experiments. The methods we propose are inspired by research in the areas of information retrieval and machine learning. Our extensive experimental evaluation shows that most of the methods developed in this work are either competitive or substantially outperform previously developed approaches to solve the above problems in drug discovery.

 
AdviserGeorge Karypis
SchoolUNIVERSITY OF MINNESOTA
SourceDAI/B 69-10, p. , Dec 2008
Source TypeDissertation
SubjectsBioinformatics; Computer science
Publication Number3332460
Adobe PDF Access the complete dissertation:
 

» Find an electronic copy at your library.
  Use the link below to access a full citation record of this graduate work:
  http://gateway.proquest.com/openurl%3furl_ver=Z39.88-2004%26res_dat=xri:pqdiss%26rft_val_fmt=info:ofi/fmt:kev:mtx:dissertation%26rft_dat=xri:pqdiss:3332460
  If your library subscribes to the ProQuest Dissertations & Theses (PQDT) database, you may be entitled to a free electronic version of this graduate work. If not, you will have the option to purchase one, and access a 24 page preview for free (if available).

About ProQuest Dissertations & Theses
With over 2.3 million records, the ProQuest Dissertations & Theses (PQDT) database is the most comprehensive collection of dissertations and theses in the world. It is the database of record for graduate research.

The database includes citations of graduate works ranging from the first U.S. dissertation, accepted in 1861, to those accepted as recently as last semester. Of the 2.3 million graduate works included in the database, ProQuest offers more than 1.9 million in full text formats. Of those, over 860,000 are available in PDF format. More than 60,000 dissertations and theses are added to the database each year.

If you have questions, please feel free to visit the ProQuest Web site - http://www.proquest.com - or call ProQuest Hotline Customer Support at 1-800-521-3042.