RESEARCH LIBRARY

View the latest publications from members of the NBME research team

Showing 1 - 3 of 3 Research Library Publications

Outlier Detection Using t-test in Rasch IRT Equating under NEAT Design

Posted: September 6, 2022 | Chunyan Liu, Dan Jurich

Applied Psychological Measurement: Volume 47, issue 1, page(s) 34-47

This study used simulation to investigate the performance of the t-test method in detecting outliers and compared its performance with other outlier detection methods, including the logit difference method with 0.5 and 0.3 as the cutoff values and the robust z statistic with 2.7 as the cutoff value.

Category:Assessment-Oriented Research, Scoring

Timing Considerations for Performance Assessments

Posted: June 25, 2020 | M. J. Margolis, M. von Davier, B. E. Clauser

Integrating Timing Considerations to Improve Testing Practices

This chapter addresses timing considerations in the context of other types of performance assessments and reports on a previously unpublished experiment examining timing with respect to performance on computer-based case simulations that are used in physician licensure.

Category:Assessment-Oriented Research, General Measurement

The Use of Multivariate Generalizability Theory to Evaluate the Quality of Subscores

Posted: April 3, 2018 | Z. Jiang, M.R. Raymond

Applied Psychological Measurement: Volume: 42 issue: 8, page(s): 595-612

Conventional methods for evaluating the utility of subscores rely on reliability and correlation coefficients. However, correlations can overlook a notable source of variability: variation in subtest means/difficulties. Brennan introduced a reliability index for score profiles based on multivariate generalizability theory, designated as G, which is sensitive to variation in subtest difficulty. However, there has been little, if any, research evaluating the properties of this index. A series of simulation experiments, as well as analyses of real data, were conducted to investigate G under various conditions of subtest reliability, subtest correlations, and variability in subtest means.

Category:Assessment-Oriented Research, Reliability/Validity, Scoring

Stay Up to Date

USMLE® Fee Assistance

Communication Learning Assessment

Introduction to Measurement Concepts: Validity and Reliability

NBME Academy

Latin America Grants

USMLE® Fee Assistance

RESEARCH LIBRARY

Filter:

Outlier Detection Using t-test in Rasch IRT Equating under NEAT Design

Timing Considerations for Performance Assessments

The Use of Multivariate Generalizability Theory to Evaluate the Quality of Subscores