Statistical and practical significance of item misfit in educational testing

教育测试中项目失配的统计意义和实际意义

基本信息

项目摘要

Testing model fit is considered an important step in item response theory (IRT) modeling, since model fit is a necessary prerequisite for drawing valid inferences from estimated parameters (Wainer & Thissen, 1987). Hambleton and Hahn (2005) suggest several steps to evaluate model fit, including (a) the calculation of item fit statistics and (b) investigating the consequences of misfit with regard to test outcomes. In educational large-scale assessments, various item fit indices are employed. Many of these statistics show severe limitations such as lack of theoretical proof about the distribution of the test statistic (Liang, Wells, & Hambleton, 2014). This lack of methodological basis for determining accurate cut-off values makes decisions regarding item fit rather arbitrary. Unsurprisingly, studies comparing the statistics have demonstrated that the statistics result in contradictory conclusions regarding which items show statistical misfit and should hence be excluded from the test (see, e.g., Chon, Lee, & Ansley, 2013). A second issue regarding decisions about item fit concerns the fact that practical significance of item misfit (i.e., effects on relevant test outcomes) is not taken into account when determining item exclusions. This is mostly due the fact that, thus far, no readily usable method for evaluating practical significance exists. The proposed project aims to (1) establish guidelines for practitioners about the performance of relatively common as well as more recent item fit statistics in educational large-scale assessments and (2) develop criteria for evaluating practical significance of item misfit. Simulation studies will be conducted to pursue the first research objective, investigating possible influencing factors on the fit statistics` performance (i.e., their Type I error rates and power). These factors include (i.) sample size, (ii.) interaction between misfit and item parameters, (iii.) type of model violation (iv.) size of misfit, (v.) amount of misfitting items in the data, and (vi.) amount and type of missing values in the data. Regarding the size of misfit, we plan to propose different effect size measures that allow distinguishing between small, medium, and large item misfit. The second major objective is to develop methods to determine practical significance of misfit for outcomes that are relevant in low-stakes educational testing, including (i.) analyses on relationships between competence and covariates, and (ii.) competence comparisons over time. We will use empirical data (e.g., from PISA or NEPS) to validate our findings and to illustrate our methods.
测试模型拟合被认为是项目反应理论(IRT)建模的重要步骤,因为模型拟合是从估计参数中得出有效推论的必要前提(Wainer & Thissen, 1987)。Hambleton和Hahn(2005)提出了评估模型拟合的几个步骤,包括(a)计算项目拟合统计数据和(b)调查不拟合对测试结果的影响。在教育大规模评估中,采用了多种项目契合度指标。其中许多统计数据显示出严重的局限性,例如缺乏关于检验统计量分布的理论证明(Liang, Wells, & Hambleton, 2014)。由于缺乏确定准确临界值的方法学基础,使得有关项目拟合的决定相当武断。不出所料,比较统计数据的研究表明,统计数据导致了关于哪些项目显示统计不匹配的矛盾结论,因此应该从测试中排除(参见,例如,Chon, Lee, & Ansley, 2013)。关于项目拟合决策的第二个问题是,在确定项目排除时,没有考虑到项目不拟合的实际意义(即对相关测试结果的影响)。这主要是由于到目前为止,还没有现成可用的方法来评估实际意义。拟议的项目旨在(1)为从业者建立关于在教育大规模评估中相对常见的以及最近的项目拟合统计性能的指导方针;(2)制定评估项目不拟合的实际意义的标准。为了实现第一个研究目标,将进行仿真研究,调查可能影响拟合统计性能的因素(即它们的I型错误率和功率)。这些因素包括(i.)样本量,(ii.)错拟和项目参数之间的相互作用,(iii.)模型违反的类型(iv.)错拟的大小,(v.)数据中错拟项目的数量,以及(vi.)数据中缺失值的数量和类型。关于错配的大小,我们计划提出不同的效应大小测量,以区分小、中、大项目错配。第二个主要目标是开发方法,以确定与低风险教育测试相关的结果的不匹配的实际意义,包括(i)对能力和协变量之间关系的分析,以及(ii)随时间的能力比较。我们将使用经验数据(例如,来自PISA或NEPS)来验证我们的发现并说明我们的方法。

项目成果

期刊论文数量(2)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
A semiparametric approach for item response function estimation to detect item misfit.
用于检测项目不匹配的项目响应函数估计的半参数方法
A Bias-Corrected RMSD Item Fit Statistic: An Evaluation and Comparison to Alternatives
{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Dr. Carmen Köhler其他文献

Dr. Carmen Köhler的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Dr. Carmen Köhler', 18)}}的其他基金

Explaining Inconsistent Effects of Teaching Quality on Educational Outcomes
解释教学质量对教育成果的不一致影响
  • 批准号:
    518236946
  • 财政年份:
  • 资助金额:
    --
  • 项目类别:
    Research Grants

相似国自然基金

Lagrange网络实用同步的不连续控制研究
  • 批准号:
    61603174
  • 批准年份:
    2016
  • 资助金额:
    20.0 万元
  • 项目类别:
    青年科学基金项目

相似海外基金

A complex and comprehensive micro-sociological study on the significance and practical methodology of employment support by DARC
关于DARC就业支持的意义和实践方法的复杂而全面的微观社会学研究
  • 批准号:
    23H00884
  • 财政年份:
    2023
  • 资助金额:
    --
  • 项目类别:
    Grant-in-Aid for Scientific Research (B)
Practical Significance of Effects from Growth Modeling of Alcohol Use Data
酒精使用数据增长模型影响的实际意义
  • 批准号:
    9311360
  • 财政年份:
    2017
  • 资助金额:
    --
  • 项目类别:
Understanding, Significance and Practical Concepts
理解、意义和实用概念
  • 批准号:
    249625529
  • 财政年份:
    2014
  • 资助金额:
    --
  • 项目类别:
    Research Grants
practical and theoretical resarch on the modern significance of the trade union and reconstraction of the labor union principles of law
工会现代意义的实践与理论研究及工会法原则的重构
  • 批准号:
    25380072
  • 财政年份:
    2013
  • 资助金额:
    --
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
The theoretical and practical significance of tree species and functional diversity in designing novel forest ecosystems
树种和功能多样性在设计新型森林生态系统中的理论和实践意义
  • 批准号:
    411895-2010
  • 财政年份:
    2012
  • 资助金额:
    --
  • 项目类别:
    Collaborative Research and Development Grants
The theoretical and practical significance of tree species and functional diversity in designing novel forest ecosystems
树种和功能多样性在设计新型森林生态系统中的理论和实践意义
  • 批准号:
    411895-2010
  • 财政年份:
    2011
  • 资助金额:
    --
  • 项目类别:
    Collaborative Research and Development Grants
The theoretical and practical significance of tree species and functional diversity in designing novel forest ecosystems
树种和功能多样性在设计新型森林生态系统中的理论和实践意义
  • 批准号:
    411895-2010
  • 财政年份:
    2010
  • 资助金额:
    --
  • 项目类别:
    Collaborative Research and Development Grants
Experimental study on the significance of developmental growth through interactions with nature -proposal of a practical model for child care
通过与自然互动促进发育成长的意义的实验研究——提出实用的儿童保育模型
  • 批准号:
    21500888
  • 财政年份:
    2009
  • 资助金额:
    --
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
The Influence of Staff Turnover on Quality
员工流动对质量的影响
  • 批准号:
    7234351
  • 财政年份:
    2006
  • 资助金额:
    --
  • 项目类别:
Study on practical significance of knowledge of Fact, Act and Norm-Philosophical review of contrast between 'nature' and 'artificiality'
事实、行为、规范知识的现实意义研究——“自然”与“人为”对比的哲学审视
  • 批准号:
    14310002
  • 财政年份:
    2002
  • 资助金额:
    --
  • 项目类别:
    Grant-in-Aid for Scientific Research (B)
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了