RESEARCH LIBRARY

View the latest publications from members of the NBME research team

Showing 21 - 30 of 43 Research Library Publications

How Examinees Use Time

Posted: June 25, 2020 | P. Harik, R.A. Feinberg RA, B.E. Clauser

Integrating Timing Considerations to Improve Testing Practices

This chapter addresses a different aspect of the use of timing data: it provides a framework for understanding how an examinee's use of time interfaces with time limits to impact both test performance and the validity of inferences made based on test scores. It focuses primarily on examinations that are administered as part of the physician licensure process.

Category:Assessment-Oriented Research, General Measurement, Reliability/Validity

Integrating Timing Considerations to Improve Testing Practices

Posted: June 25, 2020 | M.J. Margolis, R.A. Feinberg (eds)

Integrating Timing Considerations to Improve Testing Practices

This book synthesizes a wealth of theory and research on time issues in assessment into actionable advice for test development, administration, and scoring.

Category:Assessment-Oriented Research, General Measurement

Timing Considerations for Performance Assessments

Posted: June 25, 2020 | M. J. Margolis, M. von Davier, B. E. Clauser

Integrating Timing Considerations to Improve Testing Practices

This chapter addresses timing considerations in the context of other types of performance assessments and reports on a previously unpublished experiment examining timing with respect to performance on computer-based case simulations that are used in physician licensure.

Category:Assessment-Oriented Research, General Measurement

A History of Test Speededness: Tracing the Evolution of Theory and Practice

Posted: June 25, 2020 | D. Jurich

Integrating Timing Considerations to Improve Testing Practices

This chapter presents a historical overview of the testing literature that exemplifies the theoretical and operational evolution of test speededness.

Category:Assessment-Oriented Research, General Measurement, Reliability/Validity

Detecting High-Functioning Autism In Adults Using Eye Tracking And Machine Learning

Posted: April 30, 2020 | V. Yaneva, L. A. Ha, S. Eraslan, Y. Yesilada, R. Mitkov

IEEE Transactions on Neural Systems and Rehabilitation Engineering

The purpose of this study is to test whether visual processing differences between adults with and without high-functioning autism captured through eye tracking can be used to detect autism.

Category:Assessment-Oriented Research, Applications of Technology

Using multisite instrumental variables to estimate treatment effects and treatment effect heterogeneity

Posted: April 29, 2020 | C.R. Runyon

UT Electronic Theses and Dissertations

Using Monte Carlo simulation, the current study examines the performance of three IV estimators and two conventional estimators in recovering the CATE and CATE heterogeneity under simulation conditions that resemble multisite trials of well-known educational programs.

Category:Assessment-Oriented Research, General Measurement

Adding Objectivity to Standard Setting: Evaluating Consequence Using the Conscious and Subconscious Weight Methods

Posted: February 26, 2020 | B.C. Leventhal, I. Grabovsky

Educational Measurement: Issues and Practice, 39: 30-36

This article proposes the conscious weight method and subconscious weight method to bring more objectivity to the standard setting process. To do this, these methods quantify the relative harm of the negative consequences of false positive and false negative misclassification.

Category:Assessment-Oriented Research, General Measurement

Handbook of Diagnostic Classification Models

Posted: August 31, 2019 | M. von Davier, YS. Lee

Springer International Publishing; 2019

This handbook provides an overview of major developments around diagnostic classification models (DCMs) with regard to modeling, estimation, model checking, scoring, and applications. It brings together not only the current state of the art, but also the theoretical background and models developed for diagnostic classification.

Category:Assessment-Oriented Research, General Measurement, Scoring

Leveraging Natural Language Processing: Toward Computer-Assisted Scoring of Patient Notes in the USMLE Step 2 Clinical Skills Exam

Posted: March 1, 2019 | J. Salt, P. Harik, M. A. Barone

Academic Medicine: March 2019 - Volume 94 - Issue 3 - p 314-316

The United States Medical Licensing Examination Step 2 Clinical Skills (CS) exam uses physician raters to evaluate patient notes written by examinees. In this Invited Commentary, the authors describe the ways in which the Step 2 CS exam could benefit from adopting a computer-assisted scoring approach that combines physician raters’ judgments with computer-generated scores based on natural language processing (NLP).

Category:Assessment-Oriented Research, Scoring, Applications of Technology, Product-Oriented Research, USMLE

The Choice of Response Probability in Bookmark Standard Setting: An Experimental Study

Posted: January 16, 2019 | P. Baldwin, M.J. Margolis, B.E. Clauser, J. Mee, M. Winward

Educational Measurement: Issues and Practice, 39: 37-44

This article presents the results of an experiment in which content experts were randomly assigned to one of two response probability conditions: .67 and .80. If the standard-setting judgments collected with the bookmark procedure are internally consistent, both conditions should produce highly similar cut scores.

Category:Assessment-Oriented Research, General Measurement

Stay Up to Date

USMLE® Fee Assistance

Communication Learning Assessment

Introduction to Measurement Concepts: Validity and Reliability

NBME Academy

Latin America Grants

USMLE® Fee Assistance

RESEARCH LIBRARY

Filter:

How Examinees Use Time

Integrating Timing Considerations to Improve Testing Practices

Timing Considerations for Performance Assessments

A History of Test Speededness: Tracing the Evolution of Theory and Practice

Detecting High-Functioning Autism In Adults Using Eye Tracking And Machine Learning

Using multisite instrumental variables to estimate treatment effects and treatment effect heterogeneity

Adding Objectivity to Standard Setting: Evaluating Consequence Using the Conscious and Subconscious Weight Methods

Handbook of Diagnostic Classification Models

Leveraging Natural Language Processing: Toward Computer-Assisted Scoring of Patient Notes in the USMLE Step 2 Clinical Skills Exam

The Choice of Response Probability in Bookmark Standard Setting: An Experimental Study