Validate·Usability Testing·Automation·Emerging·VAL-038

Simulated Usability Testing

Value hypothesis

Agents simulate user interactions with a prototype, generating plausible behavioural data and task completion patterns without recruiting participants or scheduling sessions.

Velocity · Innovation

Following a typical task elicitation protocol, agents interact with a prototype or live interface, simulating user task attempts across a defined scenario set. Options range from tools modelling individual user journeys to systems generating thousands of simulated sessions in parallel, producing completion rates, error patterns, and drop-off points. Researchers review the findings, decide which behavioural patterns are plausible proxy outcomes for real users, and determines what warrants follow-up with live participants.

Risks in application

Empathy Gap

While agents can simulate task completion they cannot replicate the emotional state, cognitive load, confusion, or lived context that shape how real users experience a design. They can offer no response regarding brand credibility or product fit within a broader context of use or market options. Simulated findings may be structurally plausible but humanly wrong.

Shallow Solutions

High-volume simulation output can create false confidence in results; large numbers of simulated sessions do not compensate for the gap between agent behaviour and human behaviour.

Expertise that differentiates

Research and Insight

Determing which simulated behaviours are credible approximations for real user responses and which reflect model assumptions about how users approach tasks, as opposed to how they actually do.

Behavioral Reasoning

Interpreting task failure patterns in terms of underlying user mental models, rather than treating agent errors as equivalent to human errors.

AI Fluency that assures

Platform Awareness

AI usability simulation has unvalidated predictive accuracy against real user behaviour. Knowing whether a study type and decision stakes warrant simulation over live testing must be assessed before planning simulations, and requires knowledge of platform and methodology limits.

Process Description

Task scenario configuration matters: underspecified scenarios let agents navigate paths of least resistance and miss the usability failures the study was designed to find.

Depends on

Prompt-to-PrototypeRequired for agent testers to interact with.Test Plan DraftingTest plan defines task scenarios and criteria for simulation.

↓

Simulated Usability Testing

↓

Enables

Test Plan DraftingSupplies hypotheses for the live test plan.

Possible Indicators

Session generation speed

Time from prototype to behavioural findings relative to recruited usability testing baseline

Issue detection overlap

Proportion of AI-identified issues confirmed in subsequent live testing

Sources

Rosala, M., & Moran, K. (2024). Synthetic Users: If, When, and How to Use AI-Generated 'Research'. Nielsen Norman Group.

Nielsen Norman Group (2025). Evaluating AI-Simulated Behavior: Insights from Three Studies on Digital Twins and Synthetic Users. NN/g.

Kelemen, F. Z., Sima, L., & Huppert, K. (2024). Can AI take over usability testing? We put it to the test. UX studio.

Dugan, L. (2026). Synthetic users won't tell you they're lying. Bootcamp (Medium).

Kuric, E. (2025). Human-Centered Design Through AI-Assisted Usability Testing: Reality Or Fiction? Smashing Magazine.

Pendharkar, S. (2026). Conducting Pre-Research with AI Agent Personas: Pressure-testing concepts for expert workflows [conference talk]. Designing with AI 2026, Rosenfeld.

Suggest an edit

Simulated Usability Testing

Risks in application

Empathy Gap

Shallow Solutions

Expertise that differentiates

Research and Insight

Behavioral Reasoning

AI Fluency that assures

Platform Awareness

Process Description

Related

Possible Indicators

Sources