Statistical methods for failure time data with biased sampling and measurement errors
by Cheng, Yu-Jen, Ph.D., THE JOHNS HOPKINS UNIVERSITY, 2009, 145 pages; 3357043

Abstract:

In many epidemiological and cancer survey studies, units are sampled with proportional probability to some function of their values, leading to what are called biased samples and have been addressed in last two decades. In such studies, we focus on estimating marginal causal treatment effect. However, due to systematic differences between treated and untreated groups with respect to various covariates, direct comparisons of observed outcomes from the two groups are not appropriate. We make inference on the marginal causal survival function and the propensity score with biased sampling. The problem is especially complex because outcome (“potential outcomes”), as well as covariates, are partially observed. The missingness comes from two different sources: One is due to the hypothetic potential outcome framework; the other is because of prevalent sampling scheme. Making causal inference without adjusting for the both sources of the missingness will lead to a bias result. We propose an inverse weighting approach to estimate marginal causal survival function and develop a method to correct the propensity score. Furthermore, we provide a double robust estimator which is asymptotically unbiased if either the underlying propensity score model or the underlying regression function is correctly specified. Our methodology was motivated by and applied to Surveillance, Epidemiology, and End Results (SEER)-Medicare data for women diagnosed with breast cancer.

Substantial methodological and applied research has been dedicated in recent years to survival analysis with covariates subject to measurement error. Current statistical approaches for Cox regression with covariates measured with error only focus on linear log-hazard function. We propose, develop and implement a fully Bayesian inferential approach for the Cox model when the log hazard function contains unknown smooth functions of the variables measured with error. Our approach is to model nonparametrically both the log-baseline hazard and the smooth components of the log-hazard functions using low-rank penalized splines. The likelihood of the Cox model is coupled with the likelihood of the measurement error process. Careful implementation of the Bayesian inferential machinery is shown to produce remarkably better results than the naive approach. Our methodology was motivated by and applied to the study of progression time to chronic kidney disease (CKD) as a function of baseline kidney function and applied to the Atherosclerosis Risk in Communities (ARIC) study, a large epidemiological cohort study.

 
AdvisersMei-Cheng Wang; Ciprian M. Crainiceanu
SchoolTHE JOHNS HOPKINS UNIVERSITY
SourceDAI/B 70-04, p. , Jul 2009
Source TypeDissertation
SubjectsBiostatistics; Statistics
Publication Number3357043
Adobe PDF Access the complete dissertation:
 

» Find an electronic copy at your library.
  Use the link below to access a full citation record of this graduate work:
  http://gateway.proquest.com/openurl%3furl_ver=Z39.88-2004%26res_dat=xri:pqdiss%26rft_val_fmt=info:ofi/fmt:kev:mtx:dissertation%26rft_dat=xri:pqdiss:3357043
  If your library subscribes to the ProQuest Dissertations & Theses (PQDT) database, you may be entitled to a free electronic version of this graduate work. If not, you will have the option to purchase one, and access a 24 page preview for free (if available).

About ProQuest Dissertations & Theses
With over 2.3 million records, the ProQuest Dissertations & Theses (PQDT) database is the most comprehensive collection of dissertations and theses in the world. It is the database of record for graduate research.

The database includes citations of graduate works ranging from the first U.S. dissertation, accepted in 1861, to those accepted as recently as last semester. Of the 2.3 million graduate works included in the database, ProQuest offers more than 1.9 million in full text formats. Of those, over 860,000 are available in PDF format. More than 60,000 dissertations and theses are added to the database each year.

If you have questions, please feel free to visit the ProQuest Web site - http://www.proquest.com - or call ProQuest Hotline Customer Support at 1-800-521-3042.