Review Reports - AI Versus Human-Delivered Online Cognitive Behavioral Therapy for Anxiety Symptoms in Young Adults: A Randomized Controlled Trial

Round 1

Reviewer 1 Report

Comments and Suggestions for Authors

Brief Summary (one short paragraph)

This manuscript reports a single-blind randomized controlled trial comparing AI-delivered online cognitive behavioral therapy (AI‑CBT), human-delivered online CBT (participants told it was AI), and a no-intervention control in young adults with anxiety. Using a mixed-methods approach, the authors assess changes in anxiety, sleep quality, exercise self-efficacy, perceived psychotherapy benefit, and qualitative user experiences over a four-week intervention. The study’s main strengths lie in its innovative deception-based comparison design, adherence to CONSORT-AI reporting, and the integration of quantitative and qualitative findings. The work contributes valuable empirical evidence on the current limitations of AI-delivered psychotherapy, particularly regarding emotional responsiveness and perceived benefit.

General Comments on Scientific Content

Overall assessment

The manuscript is clear, relevant, and well structured, following a logical IMRAD format.
The topic is highly relevant to digital mental health and AI-assisted psychotherapy.
The mixed-methods design is appropriate and strengthens interpretability.

Scientific soundness and hypothesis testing

The hypotheses are clearly stated and testable.
The randomized controlled design is appropriate; however, the exploratory nature of the study is not always sufficiently emphasized in the interpretation of outcomes, especially in the Results and Discussion.

References

The majority of references are recent (last 5 years) and relevant.
Self-citation does not appear excessive.
Some foundational references (e.g., CBT mechanisms) are older but acceptable given their canonical status.

Reproducibility

The Methods section is detailed overall.
Reproducibility is limited by:

Lack of a fully predefined intervention protocol (non-manualized CBT).
Absence of intention-to-treat analysis.
Limited transparency regarding chatbot prompt evolution during sessions.

Figures and tables

Tables and figures are appropriate and generally well explained.
Statistical methods are mostly appropriate and clearly reported.
Some effect sizes are reported, but clinical relevance could be discussed more explicitly.

Ethics and data availability

Ethical approval, informed consent, and debriefing are clearly described.
Data availability statement is adequate and justified given sensitivity.

Consolidated Scientific Comments and Chapter-Level Evaluation

Introduction

Strengths

The rationale for comparing AI-delivered CBT with both human-delivered CBT and a no-intervention control is strong and well justified.
The manuscript is well positioned within the current AI and CBT literature, with clearly articulated aims and hypotheses.
Epidemiological framing and clinical relevance are clearly established.