RESEARCH LIBRARY

View the latest publications from members of the NBME research team

Showing 1 - 10 of 10 Research Library Publications

Advancing Natural Language Processing in Educational Assessment

Posted: June 5, 2023 | Victoria Yaneva (editor), Matthias von Davier (editor)

Advancing Natural Language Processing in Educational Assessment

This book examines the use of natural language technology in educational testing, measurement, and assessment. Recent developments in natural language processing (NLP) have enabled large-scale educational applications, though scholars and professionals may lack a shared understanding of the strengths and limitations of NLP in assessment as well as the challenges that testing organizations face in implementation. This first-of-its-kind book provides evidence-based practices for the use of NLP-based approaches to automated text and speech scoring, language proficiency assessment, technology-assisted item generation, gamification, learner feedback, and beyond.

Category:Assessment-Oriented Research, Applications of Technology, General Measurement

Extracting Linguistic Signal From Item Text and Its Application to Modeling Item Characteristics

Posted: June 5, 2023 | Victoria Yaneva, Peter Baldwin, Le An Ha, Christopher Runyon

Advancing Natural Language Processing in Educational Assessment: Pages 167-182

This chapter discusses the evolution of natural language processing (NLP) approaches to text representation and how different ways of representing text can be utilized for a relatively understudied task in educational assessment – that of predicting item characteristics from item text.

Category:Assessment-Oriented Research, Applications of Technology, Scoring

Assessment of Clinical Skills: A Case Study in Constructing an NLP-based Scoring System for Patient Notes

Posted: June 5, 2023 | Polina Harik, Janet Mee, Christopher Runyon, Brian E. Clauser

Advancing Natural Language Processing in Educational Assessment: Pages 58-73

This chapter describes INCITE, an NLP-based system for scoring free-text responses. It emphasizes the importance of context and the system’s intended use and explains how each component of the system contributed to its accuracy.

Category:Assessment-Oriented Research, Applications of Technology, Scoring

Similarities Between Clinically Matched and Unmatched Analogue Patient Raters: A Mixed Methods Study

Posted: April 1, 2023 | Ann King, Kathleen Mazor, Andrew Houriet, Thea Musselman, Ruth Hoppe, Angelo D’Addario

Patient Education and Counseling: Volume 109, Supplement, April 2023, Page 2

Physicians' responses to patient communication were assessed by both clinically matched and unmatched analogue patients (APs). Significant correlations between their ratings indicated consistency in evaluating physician communication skills. Thematic analysis identified twenty-one common themes in both clinically matched and unmatched AP responses, suggesting similar assessments of important behaviors. These findings imply that clinically unmatched APs can effectively substitute for clinically matched ones in evaluating physician communication and offering feedback when the latter are unavailable.

Category:Assessment-Oriented Research, Applications of Technology, Links to Outcomes

The Fundamentals of Artificial Intelligence in Medical Education Research: AMEE Guide No. 156

Posted: March 2, 2023 | Martin G. Tolsgaard, Martin V. Pusic, Stefanie S. Sebok-Syer, Brian Gin, Morten Bo Svendsen, Mark D. Syer, Ryan Brydges, Monica M. Cuddy, Christy K. Boscardin

Medical Teacher: Volume 45 - Issue 6, Pages 565-573

This guide aims aim to describe practical considerations involved in reading and conducting studies in medical education using Artificial Intelligence (AI), define basic terminology and identify which medical education problems and data are ideally-suited for using AI.

Category:Assessment-Oriented Research, Applications of Technology

Reading Differences in Eye-Tracking Data as a Marker of High-Functioning Autism in Adults and Comparison to Results from Web-Related Tasks

Posted: January 27, 2023 | Victoria Yaneva, Le An Ha, Sukru Eraslan, Yeliz Yesilada, Ruslan Mitkov

Neural Engineering Techniques for Autism Spectrum Disorder: Volume 2, Pages 63-79

Automated detection of high-functioning autism in adults is a highly challenging and understudied problem. In search of a way to automatically detect the condition, this chapter explores how eye-tracking data from reading tasks can be used.

Category:Health Professions, Assessment-Oriented Research, Applications of Technology

Evaluation of a New Method for Providing Full Review Opportunities in Computerized Adaptive Testing — Computerized Adaptive Testing with Salt

Posted: October 1, 2018 | Z. Cui, C. Liu, Y. He, H. Chen

Journal of Educational Measurement: Volume 55, Issue 4, Pages 582-594

This article proposes and evaluates a new method that implements computerized adaptive testing (CAT) without any restriction on item review. In particular, it evaluates the new method in terms of the accuracy on ability estimates and the robustness against test‐manipulation strategies. This study shows that the newly proposed method is promising in a win‐win situation: examinees have full freedom to review and change answers, and the impacts of test‐manipulation strategies are undermined.

Category:Assessment-Oriented Research, General Measurement, Applications of Technology

Crowdsourcing for Assessment Items to Support Adaptive Learning

Posted: August 10, 2018 | S. Tackett, M. Raymond, R. Desai, S. A. Haist, A. Morales, S. Gaglani, S. G. Clyman

Medical Teacher: Volume 40 - Issue 8 - p 838-841

Adaptive learning requires frequent and valid assessments for learners to track progress against their goals. This study determined if multiple-choice questions (MCQs) “crowdsourced” from medical learners could meet the standards of many large-scale testing programs.

Category:Assessment-Oriented Research, Applications of Technology

Guest Editorial

Posted: April 3, 2018 | I. Kirsch, W. Thorn, M. von Davier

Quality Assurance in Education, Vol. 26 No. 2, pp. 150-152

An introduction to a special issue of Quality Assurance in Education featuring papers based on presentations at a two-day international seminar on managing the quality of data collection in large-scale assessments.

Category:Assessment-Oriented Research, General Measurement, Applications of Technology

Automated Item Generation with Recurrent Neural Networks

Posted: March 12, 2018 | M. von Davier

Psychometrika 83, 847–857 (2018)

Utilizing algorithms to generate items in educational and psychological testing is an active area of research for obvious reasons: Test items are predominantly written by humans, in most cases by content experts who represent a limited and potentially costly resource. Using algorithms instead has the appeal to provide an unlimited resource for this crucial part of assessment development.

Category:Assessment-Oriented Research, Applications of Technology

Stay Up to Date

USMLE® Fee Assistance

Communication Learning Assessment

New Psychometric Workshops

NBME Academy

Latin America Grants

USMLE® Fee Assistance

RESEARCH LIBRARY

Filter:

Advancing Natural Language Processing in Educational Assessment

Extracting Linguistic Signal From Item Text and Its Application to Modeling Item Characteristics

Assessment of Clinical Skills: A Case Study in Constructing an NLP-based Scoring System for Patient Notes

Similarities Between Clinically Matched and Unmatched Analogue Patient Raters: A Mixed Methods Study

The Fundamentals of Artificial Intelligence in Medical Education Research: AMEE Guide No. 156

Reading Differences in Eye-Tracking Data as a Marker of High-Functioning Autism in Adults and Comparison to Results from Web-Related Tasks

Evaluation of a New Method for Providing Full Review Opportunities in Computerized Adaptive Testing — Computerized Adaptive Testing with Salt

Crowdsourcing for Assessment Items to Support Adaptive Learning

Guest Editorial

Automated Item Generation with Recurrent Neural Networks