Robust nonparametric and semiparametric modeling
by Kai, Bo, Ph.D., THE PENNSYLVANIA STATE UNIVERSITY, 2009, 207 pages; 3480767

Abstract:

In this dissertation, several new statistical procedures in nonparametric and semiparametric models are proposed. The concerns of the research are efficiency, robustness and sparsity.

In Chapter 3, we propose complete composite quantile regression (CQR) procedures for estimating both the regression function and its derivatives in fully nonparametric regression models by using local smoothing techniques. The CQR estimator was recently proposed by Zou and Yuan (2008) for estimating the regression coefficients in the classical linear regression model. The asymptotic theory of the proposed estimator was established. We show that, compared with the classical local linear least squares estimator, the new method can significantly improve the estimation efficiency of the local linear least squares estimator for commonly used non-normal error distributions, and at the same time, the loss in efficiency is at most 8.01% in the worst case scenario.

In Chapter 4, we further consider semiparametric models. The complexity of semiparametric models poses new challenges to parametric inferences and model selection that frequently arise from real applications. We propose new robust inference procedures for the semiparametric varying-coefficient partially linear model. We first study a quantile regression estimate for the nonparametric varying-coefficient functions and the parametric regression coefficients. To improve efficiency, we further develop a composite quantile regression procedure for both parametric and nonparametric components. To achieve sparsity, we develop a variable selection procedure for this model to select significant variables. We study the sampling properties of the resulting quantile regression estimate and composite quantile regression estimate. With proper choices of penalty functions and regularization parameters, we show the proposed variable selection procedure possesses the oracle property in the terminology of Fan and Li (2001).

In Chapter 5, we propose a novel estimation procedure for varying coefficient models based on local ranks. By allowing the regression coefficients to change with certain covariates, the class of varying coefficient models offers a flexible semiparametric approach to modeling nonlinearity and interactions between covariates. Varying coefficient models are useful nonparametric regression models and have been well studied in the literature. However, the performance of existing procedures can be adversely influenced by outliers. The new procedure provides a highly efficient and robust alternative to the local linear least squares method and can be conveniently implemented using existing R software packages. We study the sample properties of the proposed procedure and establish the asymptotic normality of the resulting estimate. We also derive the asymptotic relative efficiency of the proposed local rank estimate to the local linear estimate for the varying coefficient model. The gain of the local rank regression estimate over the local linear regression estimate can be substantial. We further develop nonparametric inferences for the rank-based method. Monte Carlo simulations are conducted to access the finite sample performance of the proposed estimation procedure. The simulation results are promising and consistent with our theoretical findings.

All the proposed procedures are supported by intensive finite sample simulation studies and most are illustrated with real data examples.

 
AdvisersRunze Li; David R. Hunter
SchoolTHE PENNSYLVANIA STATE UNIVERSITY
SourceDAI/B 73-01, p. , Nov 2011
Source TypeDissertation
SubjectsStatistics
Publication Number3480767
Adobe PDF Access the complete dissertation:
 

» Find an electronic copy at your library.
  Use the link below to access a full citation record of this graduate work:
  http://gateway.proquest.com/openurl%3furl_ver=Z39.88-2004%26res_dat=xri:pqdiss%26rft_val_fmt=info:ofi/fmt:kev:mtx:dissertation%26rft_dat=xri:pqdiss:3480767
  If your library subscribes to the ProQuest Dissertations & Theses (PQDT) database, you may be entitled to a free electronic version of this graduate work. If not, you will have the option to purchase one, and access a 24 page preview for free (if available).

About ProQuest Dissertations & Theses
With over 2.3 million records, the ProQuest Dissertations & Theses (PQDT) database is the most comprehensive collection of dissertations and theses in the world. It is the database of record for graduate research.

The database includes citations of graduate works ranging from the first U.S. dissertation, accepted in 1861, to those accepted as recently as last semester. Of the 2.3 million graduate works included in the database, ProQuest offers more than 1.9 million in full text formats. Of those, over 860,000 are available in PDF format. More than 60,000 dissertations and theses are added to the database each year.

If you have questions, please feel free to visit the ProQuest Web site - http://www.proquest.com - or call ProQuest Hotline Customer Support at 1-800-521-3042.