A Comparison of Linear, Fixed Common Item, and Concurrent Parameter Estimation Equating Procedures in Capturing Academic Growth. Jodoin, M. G., Keller, L. A., & Swaminathan, H. The Journal of Experimental Education, 71(3):229–250, January, 2003. Publisher: Routledge _eprint: https://doi.org/10.1080/00220970309602064
A Comparison of Linear, Fixed Common Item, and Concurrent Parameter Estimation Equating Procedures in Capturing Academic Growth [link]Paper  doi  abstract   bibtex   
A natural method to assess academic growth, and in turn facilitate inferences regarding the effectiveness of educational policy, administration, and instruction, is to compare the performance of students on standardized tests over time. A key methodological step in this process is equating test scores from separate administrations so that the scores are comparable. Three common item response theory equating approaches are investigated: linear transformation of separate calibrations, fixed common item parameter calibration, and concurrent calibration. Using data from a high-stakes statewide testing program, differences in mean growth were found between methods depending upon both the type of ability estimate used and the type of equating procedure implemented. Furthermore, depending upon the choice of equating method, large differences in classification may occur.
@article{jodoin_comparison_2003,
	title = {A {Comparison} of {Linear}, {Fixed} {Common} {Item}, and {Concurrent} {Parameter} {Estimation} {Equating} {Procedures} in {Capturing} {Academic} {Growth}},
	volume = {71},
	issn = {0022-0973},
	url = {https://doi.org/10.1080/00220970309602064},
	doi = {10.1080/00220970309602064},
	abstract = {A natural method to assess academic growth, and in turn facilitate inferences regarding the effectiveness of educational policy, administration, and instruction, is to compare the performance of students on standardized tests over time. A key methodological step in this process is equating test scores from separate administrations so that the scores are comparable. Three common item response theory equating approaches are investigated: linear transformation of separate calibrations, fixed common item parameter calibration, and concurrent calibration. Using data from a high-stakes statewide testing program, differences in mean growth were found between methods depending upon both the type of ability estimate used and the type of equating procedure implemented. Furthermore, depending upon the choice of equating method, large differences in classification may occur.},
	number = {3},
	urldate = {2020-06-02},
	journal = {The Journal of Experimental Education},
	author = {Jodoin, Michael G. and Keller, Lisa A. and Swaminathan, H.},
	month = jan,
	year = {2003},
	note = {Publisher: Routledge
\_eprint: https://doi.org/10.1080/00220970309602064},
	keywords = {IRT, accountability, equating, measuring growth},
	pages = {229--250}
}

Downloads: 0