Four Aspirations for Evaluating AI-Infused Technologies: A Narrative Review

Dane A. Morey¹, Mohammadreza Jalaeian¹, Morgan E. Reynolds¹, Nicolette M. McGeorge², Hunter K. Oldham³, and Michael F. Rayo¹

Journal of Cognitive Engineering and Decision Making, Published: April 16, 2026, Vol. 0(0), 1-20

Abstract

As artificial intelligence (AI), machine learning (ML), and other forms of advanced automation are increasingly considered for deployment in safety-critical industries, there is an urgent need for evaluation methods which reliably identify risks of deployment prior to people being harmed. In this narrative review, we discuss the benefits and drawbacks of 11 major methodological decisions underpinning evaluations of AI-infused technologies from the perspective of cognitive systems engineering (CSE) and naturalistic decision making (NDM). These methodological decisions are organized around four aspirations central to the perspective of CSE and NDM: evaluations of AI-infused technologies should be (1) integrated, (2) naturalistic, (3) grounded, and (4) pattern-centered. We use these aspirations to interpret common human-AI evaluation methods and discuss new evaluation challenges for emerging AI-infused technologies. This narrative review is meant to guide both current methods and future research toward safe and effective strategies for evaluating AI-infused technologies, especially in safety-critical settings.

¹ The Ohio State University
² Charles River Analytics
³ Air Force Research Laboratory

For More Information

To learn more, contact Nicolette McGeorge.

(Please include your name, address, organization, and the paper reference. Requests without this information will not be honored.)
