Which of the following conclusions about item response theory distinguishes it from classical test theory? Using classical test theory, item response theory, and Rasch. The chapter discusses the procedures for estimating the standard error and reliability of the scores under the different test theories. However, analytical estimates of the variance of the IRT reliability coefficients are not available in the literature. The first issue is estimating the size of standard errors when equating older ...
Reliability in IRT is not as straightforward as Cronbach's alpha because IRT is a completely different approach to conceptualizing the way that individuals respond to items on scales and tests. Item response theory (IRT) has its roots in Thurstone's work to scale tests of mental development in the 1920s. This is a modern test theory, as opposed to classical test theory. Item response theory, reliability and standard error, Brent Culligan. Item response theory is used to describe the application of mathematical models to data from questionnaires and tests as a basis for measuring abilities, attitudes, or other variables. It is not the only modern test theory, but it is the most popular one and is currently an area of active research. An introduction to item response theory and Rasch analysis.
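For comparison with the IRT approach, the classical statistic mentioned above, Cronbach's alpha, can be computed directly from a persons-by-items score matrix. The following is a minimal sketch, assuming complete data with at least two items; the function name `cronbach_alpha` and the sample scores are hypothetical and are not taken from any of the sources quoted here.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of rows (one row per person, one column per item)."""
    k = len(scores[0])
    if k < 2:
        raise ValueError("alpha needs at least two items")
    # Sample variance of each item across persons.
    item_vars = [variance([row[i] for row in scores]) for i in range(k)]
    # Sample variance of the total (sum) score across persons.
    total_var = variance([sum(row) for row in scores])
    return (k / (k - 1)) * (1 - sum(item_vars) / total_var)


# Hypothetical 0/1 responses: 5 persons by 4 items.
example_scores = [
    [1, 1, 1, 0],
    [1, 0, 1, 0],
    [0, 0, 0, 0],
    [1, 1, 1, 1],
    [0, 1, 0, 0],
]
print(round(cronbach_alpha(example_scores), 3))
```

Alpha yields a single number for the whole test and sample, which is the contrast the IRT literature draws with precision that varies along the ability scale.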
In the place of reliability, IRT offers the test information function, which shows the degree of precision at different levels of ability. Krabbe, in The Measurement of Health and Health Status, 2017. As discussed by Bock, Thurstone envisioned a measurement model in which the probability of success on a given intelligence test item was a function of the chronological age of the respondent. The advantages of IRT have been described in a number of papers and book chapters. Despite the name, item response theory (IRT) is not really a theory but rather a collection of measurement models.
On the relationship between classical test theory and item response theory. Standard deviations and effect size estimates are critical because they ... Classical test theory (CTT), Rasch measurement theory (RMT), and IRT evaluations were conducted, and results were assessed in a head-to-head comparison. Each is an attempt to explain the process by which individuals respond to items. Reliability and error in measurement instruments developed. All IRT models are built to measure subjective phenomena, and the basic one is the Rasch model.
In its simplest form, item response theory posits that the probability of a random person j with ability θ_j answering an item correctly depends on that ability and on the difficulty of the item. Measurement precision varies across ranges of item difficulty and person ability. Item response theory (IRT) is also sometimes called latent trait theory. Item response theory (IRT) modeling views responses to test items as ... The test information function and standard error for the original 25 items are presented in the left side of Figure 3. Reliability is seen as a characteristic of the test and of the variance of the trait it measures. In psychometrics, item response theory (IRT) is a paradigm for the design, analysis, and scoring of tests, questionnaires, and similar instruments. Reliability issues in high-stakes educational tests. Classical test theory is concerned with the reliability of a test and assumes that the items within the test are sampled at random from a domain of relevant items. Overview of classical test theory and item response theory. Traditionally, reliability refers to the precision of measurement.
The estimation of the IRT reliability coefficient and its lower and upper bounds, with comparisons to CTT reliability statistics. Chapter 8, the new psychometrics: item response theory. Lord's book, Applications of Item Response Theory to Practical Testing Problems, presented much of the current IRT theory in language easily understood by many practitioners. Large sample confidence intervals for item response theory. Scoring and estimating score precision using IRT. Classical test theory and item response theory. One kind of support for the validity of the interpretation is that the test measures the psychological trait consistently.
IRT is used for statistical analysis and development of assessments, often for high-stakes tests such as the Graduate Record Examination. One of the major contributions of item response theory is the extension of the concept of reliability. What is the reliability measure equivalent to Cronbach's alpha? Lord's book covered basic concepts, comparison to CTT methods, relative efficiency, optimal number of choices per item, flexilevel tests, multistage tests, and tailored testing. A note on the reliability coefficients for item response model-based ability estimates.
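One way to obtain a single reliability-like summary from IRT output is to compare the average error variance of the ability estimates with their observed variance. The sketch below assumes one common formulation, often called empirical reliability, computed as (observed variance minus mean squared standard error) divided by observed variance. Other formulations exist in the literature, and the function name and sample values here are hypothetical.

```python
from statistics import mean, pvariance


def empirical_reliability(theta_hats, standard_errors):
    """(var(theta_hat) - mean(SE^2)) / var(theta_hat): one assumed formulation among several."""
    observed_var = pvariance(theta_hats)
    if observed_var == 0:
        raise ValueError("ability estimates have zero variance")
    # Average error variance across persons.
    error_var = mean([se ** 2 for se in standard_errors])
    return (observed_var - error_var) / observed_var


# Hypothetical ability estimates and their standard errors.
theta_hats = [-1.2, -0.4, 0.0, 0.5, 1.1]
standard_errors = [0.45, 0.38, 0.36, 0.38, 0.44]
print(round(empirical_reliability(theta_hats, standard_errors), 3))
```

Unlike the test information function, this collapses person-specific precision into one number, so it depends on the sample of persons as well as on the items.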