In psychometrics, statistical methods for assessing the fit of an examinee’s item responses to a postulated psychometric model are often called person-fit statistic. The person-fit analysis can help to verify the individual diagnostic results, and is mainly used to distinguish the abnormal examinees from the normal ones. The abnormal response patterns include “sleeping” behavior, fatigue, cheating, creative responding, random guessing responses and cheating with randomness, and all of these abnormal response patterns can affect the deviation of examinee’s ability estimation. The person-fit analysis can help researchers identify the abnormal response patterns more accurately, so as to delete the abnormal responding examinees and improve the validity of the test. In the past, most of the person fit researches were mainly carried out under the Item Response Theory (IRT) framework, while only few papers have been published dealing with person-fit under the CDM framework. This study attempts to fill a gap in the literature by introducing new methods. In this study, a new person fit index (R) was proposed.
In order to verify the validity of the newly developed person fit index, this study explores the type I error and statistical test power of R index under different item length, item discrimination and different misfit types of respondent, and compares it with existing methods RCI and lz . Type I error rate was defined as the proportion of flagged abnormal response patterns by a person fit statistic out of 1,000 generated normal response patterns from the DINA model. The control variables of this study include: the number of subjects is controlled to 1000, the cognitive diagnosis model is chosen as DINA model, the attributes are 6, and the Q matrix is fixed. Finally, in order to reflect the value of person fit index in practical application, the R index is applied to the empirical data of fractional subtraction.
The results show that the type I error of R index is reasonable and stable at 0.05. In the aspect of statistical test power, with the improvement of item differentiation, the statistical test power of each index in different abnormal examinees is improved. With the increase in the number of items, most of the statistical power show an upward trend. For different types of abnormal subjects, R index perform best in the cases of random guessing responses and cheating with randomness. In the case of fatigue, sleep, and creative responding, the lz index perform better. In the empirical data study, the detection rate of abnormal examinees is 4.29%.
With the increase of the discrimination of items and the increase of the number of items, the power of R index has improved, and the performance of R index is the most robust when the discrimination of item is low. The R index has a high power for the types of abnormal behavior such as creative responding behavior, random guessing responses and cheating with randomness.
"
[V2] | 2022-05-12 16:42:54 | ChinaXiv:202204.00026V2 | Download |
[V1] | 2022-04-06 08:13:50 | ChinaXiv:202204.00026v1 View This Version | Download |
1. 父母温暖与儿童晚期亲社会行为的关系: 集体取向的作用 | 2024-05-29 |
2. 书写认知老化发生机制及神经机理 | 2024-05-29 |
3. 中学生社交焦虑与攻击性的关系:有调节的中介模型 | 2024-05-29 |
4. 当父母有助于成为好领导?基于“父母-领导增益”视角的领导力发展探究 | 2024-05-28 |
5. 不同手机使用时长下青少年抑郁症状的特点及演化:基于大样本网络分析 | 2024-05-27 |
6. 情绪和吸引力线索整合促进学习判断准确性:来自行为和脑电的证据 | 2024-05-27 |
7. 有序还是无序:陈列秩序与产品属性的匹配效应 | 2024-05-26 |
8. 死亡意识与亲社会行为:死亡反省和死亡焦虑的不同效应及机制 | 2024-05-24 |
9. 原生艺术绘画对心理健康促进的实践研究 | 2024-05-24 |
10. 密集追踪成对数据分析的模型建构探索 | 2024-05-23 |