Volatility matrix estimation and high dimensional classification
by Fan, Yingying, Ph.D., PRINCETON UNIVERSITY, 2007, 147 pages; 3256597

Abstract:

This dissertation contributes to two areas of statistics: volatility matrix estimation and high dimensional classification, in methodology and theory. Time- and state-domain methods are two common approaches for nonparametric estimation. Combining information in these two domains is an interesting challenge in statistics. This problem is surmounted in this thesis by dynamic integration of these two pieces of information from time and state domains. A data driven weighting strategy is proposed to optimally combine the time- and state-domain estimators, for both scalar and multivariate volatility estimation. In the multivariate case, a factor modeling strategy is proposed to attenuate the curse of dimensionality. By comparing their effciencies, it is demonstrated both theoretically and numerically that the newly proposed integrated estimator uniformly dominates the time- and state-domain estimators alone. For the second topic, determined efforts are devoted to understanding the impacts of dimensionality on classifications. In the presence of high dimensionality, classical methods such as Fisher discriminant do not perform well or even break down, and the independence rule has been proposed in the literature to overcome the difficulty. Studies in this thesis demonstrate that even for the independence rule, classification using all features can be as bad as random guessing due to noise accumulation in estimating parameters, and moreover that almost all linear discriminants can perform as bad as random guessing. It is therefore paramountly important to select a subset of important features, resulting in a new procedure, the Feature Annealed Independence Rules. The conditions under which all the important features can be selected by the two-sample t-statistic with probability one are established. The choice of the optimal number of features is given.

 
Advisor
SchoolPRINCETON UNIVERSITY
SourceDAI/B 68-03, p. , Jul 2007
Source TypeDissertation
SubjectsStatistics
Publication Number3256597
Adobe PDF Access the complete dissertation:
 

» Find an electronic copy at your library.
  Use the link below to access a full citation record of this graduate work:
  http://gateway.proquest.com/openurl%3furl_ver=Z39.88-2004%26res_dat=xri:pqdiss%26rft_val_fmt=info:ofi/fmt:kev:mtx:dissertation%26rft_dat=xri:pqdiss:3256597
  If your library subscribes to the ProQuest Dissertations & Theses (PQDT) database, you may be entitled to a free electronic version of this graduate work. If not, you will have the option to purchase one, and access a 24 page preview for free (if available).

About ProQuest Dissertations & Theses
With over 2.3 million records, the ProQuest Dissertations & Theses (PQDT) database is the most comprehensive collection of dissertations and theses in the world. It is the database of record for graduate research.

The database includes citations of graduate works ranging from the first U.S. dissertation, accepted in 1861, to those accepted as recently as last semester. Of the 2.3 million graduate works included in the database, ProQuest offers more than 1.9 million in full text formats. Of those, over 860,000 are available in PDF format. More than 60,000 dissertations and theses are added to the database each year.

If you have questions, please feel free to visit the ProQuest Web site - http://www.proquest.com - or call ProQuest Hotline Customer Support at 1-800-521-3042.