An investigation of the effects of varying the domain definition of science and method of scaling on a vertical scale
by Proctor, Thomas Patrick, Ph.D., THE UNIVERSITY OF IOWA, 2008, 173 pages; 3323461

Abstract:

There is long-standing interest in assessing students' progress throughout their education, which has increased since the implementation of the No Child Left Behind Act of 2001. One popular metric to assess progress is a vertical scale, which places tests at different educational levels but of similar content onto a single score scale.

Of interest in this dissertation was the extent to which the domain definition influences the vertical scale. This is important for several reasons. First, the content on which a vertical scale is based may become out-dated, or views of its importance may change. This may cause the vertical scale not to accurately reflect what educators believe students need to learn. Another reason educational stakeholders' interest in comparing results of state tests with those from other state and national tests. If the content on which the vertical scales are based varies between tests, meaningful comparisons of student achievement may be unattainable.

The purpose of this study is to compare the effect of varying the domain definition of science with the effect of using different psychometric models to create a vertical scale. Three tests of science were assembled that varied in the degree to which they assessed scientific inquiry skills, an area believed to be important for students in all grades. The tests were administered to students in grades 3 through 8, and vertical scales were constructed and compared for each test using the Hieronymus and 3PL IRT scaling methods.

Results suggest that domain definition had a larger effect on vertical scaling than did scaling method. This effect seems to be most pronounced for a test consisting solely of items that assess scientific inquiry rather than a mix of other sub-content areas (e.g., life, physical, and earth and space sciences).

Although the effect of domain definition was not pronounced, these results suggest that content can have an important influence on patterns of growth described by vertical scales. Greater attention to this influence has the potential to increase understanding of all aspects of vertical scaling methodology.

 
AdvisersStephen B. Dunbar; Alicia C. Alonzo
SchoolTHE UNIVERSITY OF IOWA
SourceDAI/A 69-07, p. , Nov 2008
Source TypeDissertation
SubjectsEducational tests & measurements; Quantitative psychology and psychometrics
Publication Number3323461
Adobe PDF Access the complete dissertation:
 

» Find an electronic copy at your library.
  Use the link below to access a full citation record of this graduate work:
  http://gateway.proquest.com/openurl%3furl_ver=Z39.88-2004%26res_dat=xri:pqdiss%26rft_val_fmt=info:ofi/fmt:kev:mtx:dissertation%26rft_dat=xri:pqdiss:3323461
  If your library subscribes to the ProQuest Dissertations & Theses (PQDT) database, you may be entitled to a free electronic version of this graduate work. If not, you will have the option to purchase one, and access a 24 page preview for free (if available).

About ProQuest Dissertations & Theses
With over 2.3 million records, the ProQuest Dissertations & Theses (PQDT) database is the most comprehensive collection of dissertations and theses in the world. It is the database of record for graduate research.

The database includes citations of graduate works ranging from the first U.S. dissertation, accepted in 1861, to those accepted as recently as last semester. Of the 2.3 million graduate works included in the database, ProQuest offers more than 1.9 million in full text formats. Of those, over 860,000 are available in PDF format. More than 60,000 dissertations and theses are added to the database each year.

If you have questions, please feel free to visit the ProQuest Web site - http://www.proquest.com - or call ProQuest Hotline Customer Support at 1-800-521-3042.