Logistic regression with misclassified covariates using auxiliary data
by Dong, Nathan N., Ph.D., THE UNIVERSITY OF TEXAS AT ARLINGTON, 2009, 104 pages; 3369324

Abstract:

When standard regression methods are used, measurement errors cause a bias in parameters estimation. In dealing with discrete covariates, such measurement errors are known as misclassification and the corresponding discrete covariate is said to be misclassified. Even though the collected data may not be fully reliable, it may be possible to collect some sub-data with full precision called auxiliary data and the remaining data called primary data may contain misclassification.

In this paper, in order to improve predictions from the primary data with misclassification, we propose a method based on the maximum likelihood approach; by using the prediction with auxiliary data we are able to improve the prediction from the primary data and to correct the bias in parameters estimation.

For the simplified model, we consider a primary data sample with binary response Y and discrete covariate W which is a misclassified version of true latent variable X. In addition, an auxiliary data in which both W and X can be observed used to adjust for the bias due to misclassification in W. First the model parameters are shown to be non-identifiable. To resolve around the problem, we replace the estimated misclassification probability between X and W in the model. The estimator is shown to be consistent and asymptotic normality under some regularity conditions. Testing of our method by using Monte Carlo simulations indicate that the method works well with finite samples.

 
AdviserDoyle L. Hawkins
SchoolTHE UNIVERSITY OF TEXAS AT ARLINGTON
SourceDAI/B 70-08, p. , Sep 2009
Source TypeDissertation
SubjectsStatistics
Publication Number3369324
Adobe PDF Access the complete dissertation:
 

» Find an electronic copy at your library.
  Use the link below to access a full citation record of this graduate work:
  http://gateway.proquest.com/openurl%3furl_ver=Z39.88-2004%26res_dat=xri:pqdiss%26rft_val_fmt=info:ofi/fmt:kev:mtx:dissertation%26rft_dat=xri:pqdiss:3369324
  If your library subscribes to the ProQuest Dissertations & Theses (PQDT) database, you may be entitled to a free electronic version of this graduate work. If not, you will have the option to purchase one, and access a 24 page preview for free (if available).

About ProQuest Dissertations & Theses
With over 2.3 million records, the ProQuest Dissertations & Theses (PQDT) database is the most comprehensive collection of dissertations and theses in the world. It is the database of record for graduate research.

The database includes citations of graduate works ranging from the first U.S. dissertation, accepted in 1861, to those accepted as recently as last semester. Of the 2.3 million graduate works included in the database, ProQuest offers more than 1.9 million in full text formats. Of those, over 860,000 are available in PDF format. More than 60,000 dissertations and theses are added to the database each year.

If you have questions, please feel free to visit the ProQuest Web site - http://www.proquest.com - or call ProQuest Hotline Customer Support at 1-800-521-3042.