RESEARCH LIBRARY

View the latest publications from members of the NBME research team

Showing 11 - 20 of 41 Research Library Publications

The Fundamentals of Artificial Intelligence in Medical Education Research: AMEE Guide No. 156

Posted: March 2, 2023 | Martin G. Tolsgaard, Martin V. Pusic, Stefanie S. Sebok-Syer, Brian Gin, Morten Bo Svendsen, Mark D. Syer, Ryan Brydges, Monica M. Cuddy, Christy K. Boscardin

Medical Teacher: Volume 45 - Issue 6, Pages 565-573

This guide aims aim to describe practical considerations involved in reading and conducting studies in medical education using Artificial Intelligence (AI), define basic terminology and identify which medical education problems and data are ideally-suited for using AI.

Category:Assessment-Oriented Research, Applications of Technology

Reading Differences in Eye-Tracking Data as a Marker of High-Functioning Autism in Adults and Comparison to Results from Web-Related Tasks

Posted: January 27, 2023 | Victoria Yaneva, Le An Ha, Sukru Eraslan, Yeliz Yesilada, Ruslan Mitkov

Neural Engineering Techniques for Autism Spectrum Disorder: Volume 2, Pages 63-79

Automated detection of high-functioning autism in adults is a highly challenging and understudied problem. In search of a way to automatically detect the condition, this chapter explores how eye-tracking data from reading tasks can be used.

Category:Health Professions, Assessment-Oriented Research, Applications of Technology

“Cephalgia” or “Migraine”? Solving the Headache of Assessing Clinical Reasoning Using Natural Language Processing

Posted: November 21, 2022 | Christopher Runyon, Polina Harik, Michael Barone

Diagnosis: Volume 10, Issue 1, Pages 54-60

This op-ed discusses the advantages of leveraging natural language processing (NLP) in the assessment of clinical reasoning. It also provides an overview of INCITE, the Intelligent Clinical Text Evaluator, a scalable NLP-based computer-assisted scoring system that was developed to measure clinical reasoning ability as assessed in the written documentation portion of the now-discontinued USMLE Step 2 Clinical Skills examination.

Category:Product-Oriented Research, USMLE, Assessment-Oriented Research, Applications of Technology

Effects of Data Preprocessing on Detecting Autism in Adults Using Web-Based Eye-Tracking Data

Posted: September 14, 2022 | Erfan Khalaji, Sukru Eraslan, Yeliz Yesilada, Victoria Yaneva

Behavior & Information Technology

This study builds upon prior work in this area that focused on developing a machine-learning classifier trained on gaze data from web-related tasks to detect ASD in adults. Using the same data, we show that a new data pre-processing approach, combined with an exploration of the performance of different classification algorithms, leads to an increased classification accuracy compared to prior work.

Category:Assessment-Oriented Research, Applications of Technology

Outlier Detection Using t-test in Rasch IRT Equating under NEAT Design

Posted: September 6, 2022 | Chunyan Liu, Dan Jurich

Applied Psychological Measurement: Volume 47, issue 1, page(s) 34-47

This study used simulation to investigate the performance of the t-test method in detecting outliers and compared its performance with other outlier detection methods, including the logit difference method with 0.5 and 0.3 as the cutoff values and the robust z statistic with 2.7 as the cutoff value.

Category:Assessment-Oriented Research, Scoring

Historical Perspectives on Score Comparability Issues Raised by Innovations in Testing

Posted: May 11, 2022 | Peter Baldwin, Brian E. Clauser

Journal of Educational Measurement: Volume 59, Issue 2, Pages 140-160

A conceptual framework for thinking about the problem of score comparability is given followed by a description of three classes of connectives. Examples from the history of innovations in testing are given for each class.

Category:Assessment-Oriented Research, Scoring

Video-Based Communication Assessment of Physician Error Disclosure Skills by Crowdsourced Laypeople and Patient Advocates Who Experienced Medical Harm: Reliability Assessment With Generalizability Theory

Posted: April 29, 2022 | Andrew A. White, Ann M. King, Angelo E. D’Addario, Karen Berg Brigham, Suzanne Dintzis, Emily E. Fay, Thomas H. Gallagher, Kathleen M. Mazor

JMIR Medical Education: Volume 8 - Issue 2 - e30988

This article aims to compare the reliability of two assessment groups (crowdsourced laypeople and patient advocates) in rating physician error disclosure communication skills using the Video-Based Communication Assessment app.

Category:Assessment-Oriented Research, Applications of Technology

Leveraging Machine Learning Technology to Improve Accuracy and Efficiency of Identification of Enemy Item Pairs

Posted: January 1, 2022 | Ian Micir, Kimberly Swygert, Jean D'Angelo

Journal of Applied Technology: Volume 23 - Special Issue 1 - Pages 30-40

The interpretations of test scores in secure, high-stakes environments are dependent on several assumptions, one of which is that examinee responses to items are independent and no enemy items are included on the same forms. This paper documents the development and implementation of a C#-based application that uses Natural Language Processing (NLP) and Machine Learning (ML) techniques to produce prioritized predictions of item enemy statuses within a large item bank.

Category:Assessment-Oriented Research, Scoring, Applications of Technology

Using Eye-Tracking Data as Part of the Validity Argument for Multiple-Choice Questions

Posted: December 4, 2021 | Victoria Yaneva, Brian E. Clauser, Amy Morales, Miguel Paniagua

Journal of Educational Measurement: Volume 58, Issue 4, Pages 515-537

In this paper, the NBME team reports the results an eye-tracking study designed to evaluate how the presence of the options in multiple-choice questions impacts the way medical students responded to questions designed to evaluate clinical reasoning. Examples of the types of data that can be extracted are presented. We then discuss the implications of these results for evaluating the validity of inferences made based on the type of items used in this study.

Category:Assessment-Oriented Research, Applications of Technology

Automated Prediction Of Examinee Proficiency From Short-Answer Questions

Posted: December 10, 2020 | Le An Ha, Victoria Yaneva, Polina Harik, Ravi Pandian, Amy Morales, Brian Clauser

Proceedings of the 28th International Conference on Computational Linguistics

This paper brings together approaches from the fields of NLP and psychometric measurement to address the problem of predicting examinee proficiency from responses to short-answer questions (SAQs).

Category:Assessment-Oriented Research, Scoring

Stay Up to Date

USMLE® Fee Assistance

Communication Learning Assessment

Introduction to Measurement Concepts: Validity and Reliability

NBME Academy

Latin America Grants

USMLE® Fee Assistance

RESEARCH LIBRARY

Filter:

The Fundamentals of Artificial Intelligence in Medical Education Research: AMEE Guide No. 156

Reading Differences in Eye-Tracking Data as a Marker of High-Functioning Autism in Adults and Comparison to Results from Web-Related Tasks

“Cephalgia” or “Migraine”? Solving the Headache of Assessing Clinical Reasoning Using Natural Language Processing

Effects of Data Preprocessing on Detecting Autism in Adults Using Web-Based Eye-Tracking Data

Outlier Detection Using t-test in Rasch IRT Equating under NEAT Design

Historical Perspectives on Score Comparability Issues Raised by Innovations in Testing

Video-Based Communication Assessment of Physician Error Disclosure Skills by Crowdsourced Laypeople and Patient Advocates Who Experienced Medical Harm: Reliability Assessment With Generalizability Theory

Leveraging Machine Learning Technology to Improve Accuracy and Efficiency of Identification of Enemy Item Pairs

Using Eye-Tracking Data as Part of the Validity Argument for Multiple-Choice Questions

Automated Prediction Of Examinee Proficiency From Short-Answer Questions