Components of overdispersion in hierarchical generalized linear models
by Lalonde, Trent Lewis, Ph.D., ARIZONA STATE UNIVERSITY, 2009, 118 pages; 3371215

Abstract:

A hierarchical generalized linear model is developed in which the data are allowed to determine the distributional assumptions of the dispersion sub-models while considering fixed and random effects. In addition the model allows for covariates to be included in both the mean and dispersion sub-models.

In recent times the pseudo-likelihood and double extended quasi-likelihood models were developed for modeling data from a hierarchical generalized linear structure. The pseudo-likelihood model is appropriate for any response distribution from the exponential family, but its systematic component is approximated by restricting to a normal random variable. In addition, the pseudo-likelihood model only allows for a constant dispersion correction, and cannot include dispersion covariates. The double extended quasi-likelihood model is appropriate for any response and random effect distributions from the exponential family, and does allow dispersion sub-models with dispersion covariates. However, the double extended quasi-likelihood model imposes a specific mean-variance relationship for both dispersion sub-models.

In hierarchical generalized linear models it is natural to account for components of overdispersion separately at each level of clustering. The dispersion sub-models developed can be used to model components of overdispersion. In this research the double generalized extended quasi-likelihood model consists of power functions which are defined by the mean-variance relationships in the dispersion sub-models as determined by the data, and thus are expected to more accurately model the dispersion in the data.

The models developed are compared to the pseudo-likelihood and double extended quasi-likelihood models through a simulation study. The performances of these models are assessed through data simulated with various distributional assumptions, and also in data simulated with overdispersion present at different levels. The double generalized extended quasi-likelihood is shown to be reliable and performs consistently better than existing models. These models are used to analyze a commonly known respiratory data set.

 
AdviserJeffrey Wilson
SchoolARIZONA STATE UNIVERSITY
SourceDAI/B 70-08, p. , Oct 2009
Source TypeDissertation
SubjectsStatistics
Publication Number3371215
Adobe PDF Access the complete dissertation:
 

» Find an electronic copy at your library.
  Use the link below to access a full citation record of this graduate work:
  http://gateway.proquest.com/openurl%3furl_ver=Z39.88-2004%26res_dat=xri:pqdiss%26rft_val_fmt=info:ofi/fmt:kev:mtx:dissertation%26rft_dat=xri:pqdiss:3371215
  If your library subscribes to the ProQuest Dissertations & Theses (PQDT) database, you may be entitled to a free electronic version of this graduate work. If not, you will have the option to purchase one, and access a 24 page preview for free (if available).

About ProQuest Dissertations & Theses
With over 2.3 million records, the ProQuest Dissertations & Theses (PQDT) database is the most comprehensive collection of dissertations and theses in the world. It is the database of record for graduate research.

The database includes citations of graduate works ranging from the first U.S. dissertation, accepted in 1861, to those accepted as recently as last semester. Of the 2.3 million graduate works included in the database, ProQuest offers more than 1.9 million in full text formats. Of those, over 860,000 are available in PDF format. More than 60,000 dissertations and theses are added to the database each year.

If you have questions, please feel free to visit the ProQuest Web site - http://www.proquest.com - or call ProQuest Hotline Customer Support at 1-800-521-3042.