This paper proposes another criterion for empirical item-selection, namely, item test-retest stability (ITRS). Four tests were used in this research. Two of them were ability power tests and the other two were objective personality scales. These tests were administered twice to 100 students with a time interval of eight months in between. For each item in each test a phi correlation between the first and second administrations was calculated and then used as an ITRS index. For each test an abbreviated version was created, by selecting the items with the highest ITRS scores. The retest stability coefficients of the abbreviated and original tests were assessed with a new sample, consisting of another 100 students, and the abbreviated tests were found to be not lower than the respective coefficients of the original, longer tests. The ITRS and the more popular IIC (Item internal consistency) criterion were compared to each other in terms of their effects upon retest stability and internal consistency, and conclusions were drawn.
ASJC Scopus subject areas
- Developmental and Educational Psychology
- Applied Psychology
- Applied Mathematics