Model Selection in Kernel Methods
by You, Di, Ph.D., THE OHIO STATE UNIVERSITY, 2011, 187 pages; 3496003

Abstract:

Kernel methods have been extensively studied in pattern recognition and machine learning over the last decade, and they have been successfully used in a variety of applications. A main advantage of kernel methods is that nonlinear problems such as classification and regression can be efficiently solved using classical linear approaches. The performance of kernel methods greatly depends on the selected kernel model. The model is defined by the kernel mapping and its parameters. Different models result in different generalization performance. Hence, model selection in kernel methods is an important problem and remains a challenge in the literature. In this dissertation, we propose several approaches to address this problem. Our approaches can determine good learning models by optimizing both the kernels and all other parameters in the kernel-based algorithms.

In classification, we develop an algorithm yielding class distributions that are linearly separable in the kernel space. The idea is to enforce the homoscedasticity and separability of the pairwise class distributions simultaneously in the kernel space. We show how this approach can be employed to optimize kernels in discriminant analysis. We then derive a criterion to search for a good kernel representation by directly minimizing the Bayes classification error over different kernel mappings.

In regression, we derive a model selection approach to directly balance the model fit and model complexity using the framework of multiobjective optimization. We develop an algorithm to obtain the Pareto-optimal solutions which balance the trade-off between the model fit and model complexity. We show how the proposed method is related to minimizing the predicted generalization error of the learning function.

In our final algorithm, the kernel matrix is recursively learned with genetic algorithms until the classification/prediction error falls below a threshold. We derive a family of adaptive kernels to better fit the data with various densities and show their superiority over the commonly used fixed-shape kernels.

Extensive experimental results demonstrate that the proposed approaches are superior to the state of the art.

 
AdviserAleix M. Martinez
SchoolTHE OHIO STATE UNIVERSITY
SourceDAI/B 73-05, p. , Mar 2012
Source TypeDissertation
SubjectsElectrical engineering
Publication Number3496003
Adobe PDF Access the complete dissertation:
 

» Find an electronic copy at your library.
  Use the link below to access a full citation record of this graduate work:
  http://gateway.proquest.com/openurl%3furl_ver=Z39.88-2004%26res_dat=xri:pqdiss%26rft_val_fmt=info:ofi/fmt:kev:mtx:dissertation%26rft_dat=xri:pqdiss:3496003
  If your library subscribes to the ProQuest Dissertations & Theses (PQDT) database, you may be entitled to a free electronic version of this graduate work. If not, you will have the option to purchase one, and access a 24 page preview for free (if available).

About ProQuest Dissertations & Theses
With over 2.3 million records, the ProQuest Dissertations & Theses (PQDT) database is the most comprehensive collection of dissertations and theses in the world. It is the database of record for graduate research.

The database includes citations of graduate works ranging from the first U.S. dissertation, accepted in 1861, to those accepted as recently as last semester. Of the 2.3 million graduate works included in the database, ProQuest offers more than 1.9 million in full text formats. Of those, over 860,000 are available in PDF format. More than 60,000 dissertations and theses are added to the database each year.

If you have questions, please feel free to visit the ProQuest Web site - http://www.proquest.com - or call ProQuest Hotline Customer Support at 1-800-521-3042.