Reinforcement-Based Person-Specific Training for Children with Autism Using a Humanoid Robot NAO

Karim, Masud; Mia, Md. Solaiman; Tareeq, Saifuddin Md.; Hasanuzzaman, Md.

doi:10.3390/robotics15040066

Open AccessArticle

Reinforcement-Based Person-Specific Training for Children with Autism Using a Humanoid Robot NAO

¹

Department of Computer Science and Engineering, University of Dhaka, Dhaka 1000, Bangladesh

²

Department of Computer Science and Engineering, Green University of Bangladesh, Narayanganj 1461, Bangladesh

³

Department of Math and Computer Science, Southern Arkansas University, Magnolia, AR 71753, USA

^*

Author to whom correspondence should be addressed.

Robotics 2026, 15(4), 66; https://doi.org/10.3390/robotics15040066

Submission received: 17 January 2026 / Revised: 4 March 2026 / Accepted: 10 March 2026 / Published: 25 March 2026

(This article belongs to the Section AI in Robotics)

Download

Browse Figures

Versions Notes

Abstract

Autism Spectrum Disorder (ASD) is defined by ongoing difficulties in social communication, flexibility in behavior, and adaptive learning skills. Interventions that utilize robots have demonstrated potential in providing organized training for children with ASD; however, there is a lack of controlled studies that specifically examine the effects of reinforcement strategies. This research introduces a systematic interaction policy based on reinforcement, founded on the principles of Applied Behavior Analysis (ABA), and assesses its effectiveness through a randomized controlled experimental design with observation. The humanoid robot NAO was used in two different interaction scenarios, one involving a reinforcement condition (RC) and the other a non-reinforcement condition (RC), ensuring that the instructional material and environment were maintained, while only the availability of contingent positive feedback was altered. A total of 50 participants diagnosed with ASD Level 2 engaged in structured word-learning sessions. Learning outcomes were assessed using institutional performance criteria, average response time, and emotion analysis derived from a CNN-based facial expression model. Independent samples t-tests revealed statistically significant improvements in both performance scores (t(48) = 3.779, p < 0.05) and response times (t(48) = 3.758, p < 0.05) in the reinforcement condition compared to the non-reinforcement condition. The findings demonstrate that structured ABA-based reinforcement within robotic interaction significantly enhances learning efficiency and task engagement, contributing methodologically rigorous evidence to robot-assisted ASD intervention research.

Keywords:

human–robot interaction; autism; ASD; NAO; humanoid robot; reinforcement

1. Introduction

Autism Spectrum Disorder (ASD) is a developmental condition that affects communication, behavior, and social interactions. It is referred to as a “spectrum” because symptoms and characteristics can range from mild to severe, varying significantly from person to person.

Autism Spectrum Disorder (ASD) is clinically defined as a neurodevelopmental disorder characterized by persistent deficits in social communication and social interaction across multiple contexts, along with restricted, repetitive patterns of behavior, interests, or activities, as formally outlined in the Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition (DSM-5) published by the American Psychiatric Association. According to DSM-5 diagnostic criteria, symptoms must be present in the early developmental period and cause clinically significant impairment in social, occupational, or other important areas of functioning. The spectrum nature of ASD reflects variability in symptom severity, cognitive ability, language development, and adaptive functioning. Complementing the DSM-5, the World Health Organization classifies Autism Spectrum Disorder (ASD) under disorders in the International Classification of Diseases 11th Revision. This classification further emphasizes that ASD has a developmental basis.

Studies show that more and more people are being diagnosed with ASD worldwide. This increase highlights the need for effective and proven strategies to help people with ASD. Early and structured behavioral interventions, such as those based on Applied Behavior Analysis (ABA), are among the most supported approaches to improve communication and daily living skills. ABA works by using reinforcement to increase the likelihood of desired behaviors through feedback.

However, children with Autism Spectrum Disorder often react differently to rewards. They really want to connect with others. They struggle with focus and feelings. These issues make it difficult to successfully implement certain teaching methods, so special learning spaces are needed.

Recent advances in robotics suggest that robots like the NAO robot can help children with ASD have structured and predictable interactions. These robots can give rewards, control inputs and track how well a child performs. Some studies have found that using robots for therapy can help children with ASD engage more and improve their imitation skills. The NAO robot and similar robots can also help children with ASD learn things. However, not many studies have combined rewards, automated tracking of performance, and analysis of emotions. This is an area that must be researched to help children with ASD.

This study aims to fill this gap by using established criteria and behavioral theory to investigate reinforcement-based robotic learning interventions for children with ASD. By using a controlled framework, this study provides a strong foundation for exploring the effectiveness of robotic interventions for children with ASD. By aligning the proposed system with DSM-5 diagnostic constructs and evidence-based behavioral mechanisms, this research contributes to bridging the gap between clinical theory and technology-assisted therapeutic practice.

The exact causes of autism are not fully understood, though research suggests it involves a combination of genetic and environmental factors. Children with autism often exhibit a variety of characteristics that can impact their social, cognitive, and sensory development. Common traits include: social communication difficulties, repetitive behaviors, sensory sensitivities, motor skill challenges, etc.

While ASD has no cure, various therapies and training can help individuals manage symptoms, build skills, and lead more fulfilling lives. The goal of these interventions is not to eliminate autism, but to reduce associated challenges and enhance quality of life.

There exist robots specifically designed for autism-related interventions, such as socially assistive robots developed with therapeutic goals in mind (e.g., platforms designed to promote joint attention, emotional recognition, or social reciprocity in children with ASD). Some general-purpose humanoid robots, such as NAO, were originally developed for broader educational and research applications but have been adapted for autism intervention due to their programmability, predictable behavior, and interactive capabilities. In recent years, robotic assistance has emerged as a promising tool in autism therapy. Robots, such as the NAO Kaspar, Kebbi, etc., can support children with autism by facilitating social interactions and assisting with daily tasks and basic learning activities.

Although the NAO robot was originally developed as a general-purpose humanoid platform for education and research, it has been widely adopted in autism-related interventions due to several characteristics that align well with the learning needs of children with Autism Spectrum Disorder (ASD). NAO is a robot that helps people interact with it in a way. This is really helpful for children who become anxious or feel overwhelmed when they are around people. NAO has a face, and it moves in a way that is easy to understand. This helps children know what to expect from the robot. NAO looks like a person. It is not too complicated. This means children can learn things from NAO and then use those skills when they are with people. We can program NAO to do things like talk in a certain way or give rewards when a child does something good. This is really helpful for therapists who use something called Applied Behavior Analysis to help children with autism. Some studies have shown that robots like NAO can really help children with autism. These children are more likely to pay attention, imitate the robot, and interact with it. This is why NAO is a tool to use when helping children with autism, even though it was not originally made just for them.

One widely used therapeutic approach is Applied Behavior Analysis (ABA), which emphasizes the use of reinforcement, both positive and negative, to encourage desired behaviors. Reinforcement plays a critical role in helping children with ASD learn and adapt through consistent feedback. In this paper, we define the term “reinforcement” specifically as positive reinforcement is used to mean a type of therapy that creates motivation. This type of therapy comes from something called Applied Behavior Analysis (ABA). It does not have anything to do with computer learning. When we discuss reinforcement, we are referring to giving feedback to someone when they do something right. This feedback is given right after the person does something, like answering a question. For example, when a child answers a question correctly, the robot will say something to them like “good job”. At the end of the time they spend with the robot, they might even get a treat like a piece of candy, a mango bar, or a banana. The reason we do this is to help the child learn how to communicate. We want them to keep doing the things that help them talk and understand others. This is based on the idea that when children get rewarding feedback, they are more likely to keep doing what they are doing. It is also important to note that the robot is not trying to figure out a way to teach the child. It does not change how it teaches based on what the child does. The robot is simply following a set of rules that were programmed into it before it started working with the child.

Reinforcement is really good for children in the long run. It helps them use the skills they learn in different situations [1,2,3]. They can adjust their behavior to various social contexts. We can use reinforcement at home, in the classroom, or in therapy. Just give them praise or a small reward when they do something in a group.

When we tailor reinforcement to each child, it really works. It makes them want to do things. They get excited. This leads to behavior that lasts. Reinforcement is a part of Applied Behavior Analysis therapy. It helps children with autism perform socially adaptive behaviors. When we give rewards that children like, it encourages them to adopt behaviors that will help them in social interactions. They learn more. Reinforcement is a way to help children with autism [1,4]. It makes their learning experience better. This approach emphasizes increasing motivation, building self-esteem, and promoting sustained behavioral change. These elements are essential to the success of autism therapy and training, as they empower individuals with autism to develop their skills and lead more fulfilling lives.

Researchers are increasingly leveraging modern technology to assess the status of children with autism and to enhance their learning and developmental skills [4,5]. This study employed a randomized controlled, longitudinal repeated-measures design to investigate the effect of reinforcement-based robotic training on vocabulary learning in children with ASD. Participants were randomly assigned to either a reinforcement-based robotic intervention group or a non-reinforcement robotic control group. It will also provide practical recommendations for integrating AI as an effective reinforcement tool in therapy. Reinforcement is a key component of any educational program, especially for children with Autism Spectrum Disorder (ASD). However, their natural environment often lacks the specific types of reinforcement these children need. For example:

They may struggle to engage with toys or not know how to use them.
Typical social situations may not be motivating or rewarding for them.
They may respond more to sensory stimuli or negative attention rather than conventional reinforcement with rewards.

Because of these challenges, traditional reinforcement methods may fall short. Therefore, introducing new technologies, including AI and interactive tools, can provide customized, engaging reinforcement to support learning and behavior development in children with autism.

This paper presents and evaluates the interaction between children with ASD and an NAO robot using reinforcement techniques. Results show that the NAO robot can help children with autism learn and improve their conduct, especially by increasing their participation in RC interactions. This study introduces a structured reinforcement interaction policy in ABA therapeutic principles and evaluates it under a controlled experimental framework. The novelty lies not in proposing a reinforcement learning algorithm, but in experimentally isolating reinforcement as the independent interaction variable while maintaining identical robotic hardware, instructional content, and evaluation procedures. Furthermore, we integrate institutional performance metrics, response latency analysis, and emotion-based engagement measurements within a randomized controlled and longitudinal design. This provides a multi-dimensional and statistically validated assessment framework. This methodological rigor advances the empirical foundation of reinforcement-based robotic intervention beyond descriptive and feasibility-focused studies.

2. Literature Review

2.1. Reinforcement and Applied Behavior Analysis in ASD

Reinforcement-based learning for Autism Spectrum Disorder is basically about using rewards to encourage behavior. This is based on something called Applied Behavior Analysis, which is a well-known way to help people with Autism Spectrum Disorder. The idea is that when an individual performs an encouraged behavior, they receive something they like. This makes them more likely to repeat that behavior. Trainers and therapists have used this approach to help children with Autism Spectrum Disorder improve their speech, interact with others, copy what they see, and learn things in school. Lots of studies have shown that using rewards helps children with Autism Spectrum Disorder pay attention, learn new skills, and use what they learn in different situations. Applied Behavior Analysis for ASD is used to help these children in various ways.

Despite its effectiveness, ABA has some limitations due to individual variability. Children with autism are different. They respond to rewards in different ways. Some children might be habituated to receiving rewards outside of therapy sessions. This reduces the effectiveness of rewards. It is also hard to maintain a constant environment in places where people are learning. These problems show that we need a system that is organized and consistent, and maybe we can use technology to help with that. We need to be able to give rewards in a way that is controlled and personalized for each learner, such as through reinforcement. Reinforcement is important. We need to find a way to apply it more effectively.

2.2. Reinforcement-Based and Technology-Assisted Interventions

Several prior studies have explored reinforcement-based training in children with ASD. Jabeen et al. [1] divided 48 children into reinforcement and non-reinforcement groups and reported improved outcomes in the reinforcement condition; however, evaluation relied primarily on manual questionnaires. Similarly, Malaco et al. [6] examined reinforcement through tangible rewards and assessed behavioral improvements using Likert-scale ratings from educators and parents. While these studies confirm the benefits of reinforcement, their reliance on subjective evaluation methods limits real-time adaptability and objective measurement.

Language and vocabulary acquisition represent another critical domain. Hashim et al. [7] emphasized the challenges faced by children with ASD in learning word meanings and the importance of visual support tools. Kamran et al. [8] developed a vocabulary learning prototype and reported improvements in correct responses, attempts, and response time. Although promising, these systems primarily focus on educational outcomes without integrating automated behavioral reinforcement monitoring or emotional engagement analysis.

More broadly, reinforcement preference assessment has been identified as essential in maximizing therapeutic effectiveness, as preferred stimuli function as stronger reinforces. However, most technology-assisted systems incorporate reinforcement descriptively rather than through experimentally isolating its effect within controlled comparative designs.

2.3. Identified Research Gap and Study Positioning

We know that rewards can really help people learn and that tools like computers can make things more interesting. There are still some problems with conventional reinforcement protocols. For one thing, a lot of the time, rewards are given out by hand. Human evaluators have to decide who is doing well and who is not. This can be unfair, as their criteria are not always the same. We have used robots to help people with skills or behavior but not as much to teach subjects like vocabulary in a very controlled way. Also, when we try to make things personal for each person, it is often through a set of pre-planned interactions. What we really need is a way for the computer to know who each person is and how they are doing, and to be able to give them the lessons at the right time. We need to make reinforcement learning, like vocabulary learning, better with the help of assistive technologies to improve learning outcomes.

We have not really looked at how people feel when they learn. While some people have tried to understand this, not many have used special tools to examine how faces show emotions while also looking at how well people are learning. This means we do not have a full picture of how well people learn in conjunction with how they feel about it. We need to look at both how well people learn and how they feel to understand the mechanisms of learning and emotions.

The current study aims to fill these voids through the design of a structured reinforcement-based robotic interaction system fitted with facial recognition for identity-based lesson retrieval, automated performance evaluation, real-time reinforcement provision, and emotion analysis in the context of a randomized controlled and longitudinal experimental paradigm. In contrast to previous descriptive implementations, this work experimentally isolates reinforcement as the considered independent interaction variable while holding contextually congruous robotic hardware and instructional content constant. Statistical validation of performance and response time outcomes in this study represents a methodological advancement and provides an empirical basis for the use of reinforcement-based robotic interventions for children with ASD.

3. Proposed System Description

This part is about the design and parts of a robot system that helps children learn. This system uses a NAO robot, which can recognize faces and give lessons tailored to each child. It does this by using a child-specific learning plan called an Individualized Education Program (IEP). The robot can also recognize what the child says and evaluate how they are doing, then give them feedback specific to them. The robot is like a teacher. It helps the child learn by giving them lessons and adjusting what it says based on what the child needs. The Figure 1 shows overall concept and the steps of the proposed system. The next parts of this paper will describe about how the lessons are made, how the robot is trained, how the robot knows who each child is, how the robot decides what to teach, how we know if the child is learning, and how the robot knows how the child is feeling.

3.1. Lesson Creation

For this experiment we used a learning plan that was based on each child’s Individualized Education Program [9,10]. This Individualized Education Program (IEP) is like a syllabus that made for teaching children with special needs. The IEP is made to ensure that children with autism get an education that is tailored to their needs. It is like a roadmap that shows the things the child needs to learn, the help they need to get, and the things that need to be done differently to help the child do well in school and develop as a person. The IEP is used to help children with autism. It is an important part of their education. We used the children’s IEPs from school [10] to make a learning plan for each child (Table 1).

In this study, learning activities that specifically focused on word meanings were selected directly from each child’s IEP. Children with autism often engage with social interactions and communication in unique ways. In some cases, they may pay less attention to conversation compared to their non-autistic peers, which can affect their ability to learn and use language effectively. As a result, they require additional conversational training and targeted support to develop these essential skills. Learning word meanings is especially important for children with autism, as it forms the foundation for effective communication, emotional expression, understanding and following instructions, navigating social situations, and language comprehension.

By focusing on word learning tailored to each child’s IEP, this approach aims to enhance their ability to engage more meaningfully with their environment and improve both their academic and social outcomes.

3.2. Robot Interaction Configuration

We employed a supervised learning approach to train the NAO robot for interaction with children. The training process consisted of two main components:

Facial Recognition—enabling the robot to recognize and identify individual children for specific lesson.
Lesson Setup—programming the robot to deliver learning content.

To teach word meanings, we recorded audio clips and uploaded them to the robot in WAV format for supporting in NAO. These recordings were used as part of the instructional content delivered during interactions. Before this, a learning database was created using MySQL, which was derived from the school’s existing manual database. For facial recognition, we took and printed photographs of the children. These images were used to configure the robot’s ability to identify individual faces. The NAO robot was further customized using Aldebaran’s software (version 6) suite [11], allowing for precise control over the robot’s movements and actions during learning sessions. The step-by-step process involved in training the robot is illustrated in Figure 2.

3.3. Child Identification Through Face Recognition

In this study, we implemented a face recognition technique to identify individual children during interactions with the NAO robot. We used the software suite tool developed by SoftBank Robotics [13] to customize the robot’s face recognition capabilities. Specifically, we employed the ALFaceDetection vision module, which allows the NAO robot to detect and recognize faces in its field of view. To help the robot recognize each child correctly, we used pictures of their faces to train it. We did this by using the “Learn Face” tool of the software suite. We inputted each child’s picture and assigned them an ID number. Then we set it up so that when the robot saw a face, it was able to say who it was. This way, the robot could reliably identify children in a timely manner when it talked to them. Table 2 shows the basic information of the participated children.

3.4. Children’s Specific Lesson Selection

The children’s special IDs were identified using face recognition, as is explained in Section 3.2 and Section 3.3. This special ID was then used to perform a search in the lesson database to find the lesson ID that was assigned to that child. We can see this in Figure 3. The children’s IDs and the lesson IDs were connected in the database. When the right lesson was found, the NAO robot started the lesson automatically. We used Python version 2 to make it easy for the robot and the server to talk to each other. This way the NAO robot could receive information from the database server easily.

3.5. Children’s Training by the Robot

This section of the research is important, as we aim to explore the potential of the NAO robot as a substitute for teaching children with autism. The robot is programmed to execute the necessary steps to deliver specific educational content to children. To teach the meaning of a word, the robot follows a simple three-step process. During the instructional session, the robot also generates reinforcement activities to support learning and maintain engagement. For this experiment, we proposed a simple interaction protocol for reinforcement-based training, which guided the robot’s teaching and feedback process. The interaction protocol is illustrated in Figure 4.

The proposed interaction design follows ABA-based behavioral principles, operationalizing reinforcement as a contingent positive consequence delivered immediately after a target response, consistent with the antecedent behavior consequence framework.

3.6. Children Learning Evaluated by the Robot

Evaluating learning in children with Autism Spectrum Disorder (ASD) requires a focus on individualized outcomes based on each child’s unique learning style. This process involves observing behavior and collecting developmental data throughout learning sessions. In this experiment, the NAO robot performs ongoing evaluations during each training session. Through a focus group discussion with school [10] educators, the training and evaluation criteria, like the number of demonstrations, threshold value, and time that the robot will wait to get a response from children, are defined. The instructional sequence is as follows.

The robot demonstrates a word and its meaning three times.
The robot asks the child about the meaning of the word and waits 10 s for a reply.
When the child responds, their response is evaluated by the ALSpeechRecognition API.

The ALSpeechRecognition API module enables the robot to identify predetermined words or phrases. The list of words that need to be recognized are inserted into ALSpeechRecognition during the programming of the robot. When ALSpeechRecognition is launched, it inserts the key SpeechDetected, a Boolean that indicates whether or not a speaker is being heard. The item in the list that most closely resembles what the robot hears is added to the API, WordRecognizedAndGrammar key, if a speaker is heard [13]. If the child’s answer meets a predefined accuracy threshold [14], the robot offers reinforcement conditioning (RC) by expressing gratitude and proceeds to the next word. If the child fails to respond or their answer falls below the threshold, the robot repeats the explanation of the previous word. It repeats the interaction protocol described in Figure 4. At the end of the session, the robot provides a mango bar/candy/banana to the child as a gift, which creates another instance of reinforcement. This method of continuous, adaptive assessment during the learning process has proven highly effective. Notably, children show increased enthusiasm and engagement when learning with the robot.

On the other hand, in the sessions under the NRC, the robot follows the interaction protocol of the NRC in Figure 4. If the child’s answer meets a predefined accuracy threshold, then the robot proceeds to the next word. It does not engage in any reinforcement activities. At the end of the session, the robot does not provide any gift.

It is mentioned that, to define the threshold value and wait time of the robot, we demonstrated the speech recognition and wait time activities of NAO to the educators at the school [10]. The technical issues were described with a test of different threshold values and wait times in seconds. As per their recommendation, we defined a value of 15 as the threshold value and 10 s as the wait time.

3.7. Children’s Learning Assessed by Trainer

As part of the robot’s performance evaluation, a vocabulary test was administered by a school trainer following each training session. The trainer assessed:

The response time taken by each child to answer with a learned word meaning.
A performance score derived from the accuracy and consistency of the child’s responses.

The performance score illustrated in Figure 5 and the associated response time measures were derived from the standardized assessment framework routinely used by the participating institution [10] to evaluate children’s learning progress. This rubric has scoring rules that match the instructional [10] goals and is used for all learners (children with autism) at the center. In this study, the speech recognition of the robot also helped validate responses by a confidence score (threshold). The same rules and automated validation were used for both groups so the process was consistent, which supports the accuracy and reliability of the results. The institutional criteria and validation thresholds were applied uniformly to both conditions, maintaining procedural consistency. This consistency supports the reliability and internal validity of the reported outcome measures.

Figure 5 shows the evaluation matrix that measured how well children learn after each training session. The scoring system was created with the help of trainers from the Prottasha Centre for Autism Care [10]. The matrix gives a score of 2 when a child gets both target word meanings right. It gives a 1 when a child gets one word right. A score of 0.5 is given when a child tries to answer or shows interest but does not get it right. A score of 0 means the child did not respond at all. The evaluation matrix is used to track children’s learning performance. Children’s learning performance is evaluated after each training session.

This system performance rubric enables an objective measurement of vocabulary acquisition while accounting for partial learning and mental engagement, which are critical considerations in educational assessment for children with autism. The use of incremental scoring reflects established reinforcement and behavior-based evaluation principles reported in prior ASD intervention studies [1,6]. A performance score below 50% is considered unsatisfactory.

3.8. Emotion Detection from Facial Video

In parallel, video recordings were captured during the training sessions, with a focus on the children’s facial expressions. A separate camera was set up in the training room. These recordings were used to detect emotional responses to later observe:

The effectiveness of the training;
The child’s level of engagement and acceptance of the robot-assisted learning process.

This assessment approach helps in evaluating both cognitive progress and emotional receptiveness, ensuring a comprehensive understanding of the training’s impact.

These videos were analyzed to extract facial emotional expressions, categorized into seven primary emotions: anger, disgust, fear, happiness, neutrality, sadness, and surprise [15,16]. We used an open source Python library that leverages a convolutional neural network (CNN) for emotion detection [17,18]. The CNN-based emotion recognition component used in this research was adopted from a publicly available, validated open-source implementation trained on widely used facial expression datasets. In this study, the model was not retrained or modified. It was employed as a standardized analytical tool to ensure consistent evaluation across both experimental conditions. Because the same model, camera setup, and environmental conditions were applied uniformly to both groups (RC and NRC), any potential systematic bias or misclassification would affect both conditions equally, preserving the validity of relative comparisons. From the detected emotional data, we assessed the children’s emotional responses by grouping them into positive and negative categories:

Negative emotions: anger, disgust, fear, and sadness;
Positive emotions: happiness, neutrality, and surprise.

Although children with autism often show elevated levels of sadness, we hypothesized that interaction with the robot would reduce this negative emotion. Therefore, during analysis, sadness was treated as a key indicator of negative emotional response. Interestingly, the neutral emotion was interpreted as a form of non-engagement or avoidance of the robot. However, for the purpose of this study, neutral responses were still categorized as positive, as they indicate absence of distress rather than an active negative emotion.

4. Experiment

4.1. Study Design

This study employed a quantitative experimental research design using a randomized controlled longitudinal framework with repeated measures. Participants were randomly assigned to one of two interaction conditions (reinforcement condition and non-reinforcement condition), and objective performance metrics, including response accuracy, response time, and facial emotion-based engagement scores, were collected across five sessions. All outcome variables were numerically quantified and statistically compared between groups. No qualitative interviews, observational coding narratives, or thematic analyses were conducted. Therefore, the methodological approach of this study is strictly quantitative in nature.

4.2. Participants and Procedure

The experiment was conducted at a specialized institution, Prottasha Centre for Autism Care [10], which works exclusively with children with special needs. A total of 50 children, all exhibiting characteristics of autism, were randomly selected from the institute to participate in the study. According to the institutional [10] record, all participating children were diagnosed with Autism Spectrum Disorder (ASD) at Severity Level 2 (Requiring Substantial Support) in accordance with the criteria outlined in the Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition (DSM-5) published by the American Psychiatric Association. This classification indicates marked deficits in social communication and noticeable behavioral rigidity requiring substantial support, while still allowing participation in structured, guided learning activities. Prior to the experiment, formal approval was obtained from both the school authorities and the children’s guardians to ensure ethical compliance. Throughout this study, the terms reinforcement condition (RC) and non-reinforcement condition (NRC) refer to two alternative interaction policies implemented using the same NAO robot.

We used stratified block randomization to assign the children to reinforcement condition (RC) and non-reinforcement condition (NRC) interaction sessions due to the total number of participating children being 50. We used children’s age, gender, and autism severity as stratification variables, which are shown in Table 3. Children were randomly assigned to either human educator-based interventions or robot-assisted interventions using stratified block randomization based on stratification variables, ensuring equal distribution across groups.

It is important to clarify that the same NAO humanoid robot was used in both experimental conditions. The two groups did not differ in hardware configuration, instructional content, speech recognition module, or evaluation procedure. Rather, the experimental manipulation involved two alternative interaction policies implemented within the same robotic system and environment. In the reinforcement condition (RC), the robot delivered explicit positive reinforcement with verbal praise and a gift following correct responses. In the non-reinforcement condition (NRC), the robot followed an identical instructional and assessment protocol but omitted all forms of reinforcement. Therefore, the comparison reflects differences in interaction strategy rather than differences in robotic platform or learning model.

The experiment was conducted in a classroom, measuring 12 ft by 14 ft, for one child at a time at the school. The child was seated directly in front of the NAO humanoid robot (version 6) [13] with a container holding a mango bar/candy/banana positioned to the robot’s right. A camera was placed behind the robot to record the session, focusing on the child’s facial expressions. A volunteer remained beside the child to ensure safety and address any unforeseen issues.

The NAO robot’s speech was generated using pre-recorded audio files. Before the experiment, the robot was trained to recognize the faces of all the participants. During the session, NAO was positioned in a seated posture, with mango bar/candy/banana placed on its right side. Some sample figures of the sessions are shown in Figure 6 and Supplementary Materials.

When a child entered the room, the robot first greeted the child, then recognized the child’s face and initiated the appropriate learning session.

The speed of the robot’s speech and physical movements was adjusted based on recommendations from the school’s educators. After each teaching segment, NAO prompted the child to recall the meaning of the taught word.

If the child responded correctly, the robot rewarded them with either a candy, a mango bar, or a banana.

4.3. Experimental Procedure

Each child participated in structured vocabulary-learning sessions facilitated by the humanoid robot. The robot delivered identical instructional content to both groups, including word presentation, visual cues, and response evaluation.

In the reinforcement condition, correct responses were followed by structured positive reinforcement, including verbal praise and reward cues consistent with ABA therapeutic principles. In the non-reinforcement condition, the robot provided neutral acknowledgment without praise or reward, while maintaining identical task flow and timing.

Each session followed a standardized sequence: child identification through facial recognition, lesson retrieval from the database, delivery of the lesson, capturing of video of the session, evaluation, and feedback generation.

4.4. Measurements

In this study, three primary outcome measures were used:

Average Response Time

Response time was recorded automatically by the trainer as the interval between stimulus presentation and child response. This objective measure provided an indicator of task processing efficiency.

Performance Score

Performance was evaluated using the institutional assessment criteria [10], which are routinely applied by the trainer to measure learning performance. Scores were standardized across participants to ensure consistency.

Emotional Engagement

Facial expressions were analyzed using the open source Python package TensorFlow and a CNN-based facial emotion recognition model applied to video frames captured during interactions. Emotional states were categorized and aggregated to estimate engagement levels.

4.5. Statistical Analysis

To determine whether differences between the two independent groups were statistically significant, independent samples t-tests were conducted for both average response times and average performance scores. The independent samples t-test was selected because the study involved two separate groups with no overlapping participants and continuous outcome variables. Statistical significance was evaluated at α = 0.05.

5. Result Analysis

The children participated in five separate training sessions, each held on different days. The core training segment of each session lasted 6 to 7 min, resulting in a total training time of approximately 30 to 35 min per child. After each session, a school trainer escorted the children to a separate room to conduct an assessment [10]. During this evaluation, the trainer:

Asked the children about the meanings of the words learned that day.
Measured their response time using a stopwatch.
Assigned a performance score to quantify their performance (as illustrated in Figure 5).

The results, including each child’s response time, performance score, and mean scores per lesson, are presented in Figure 7 and Figure 8.

From the sessions under the reinforcement condition (RC), we found:

A total of 20 children demonstrated good learning outcomes from the robot-assisted reinforcement training.
A total of 5 children did not achieve satisfactory performance.
Across all lessons, the overall performance was satisfactory.

From the session NRC, we found:

A total of 12 children demonstrated good learning outcomes from the robot-assisted reinforcement training.
A total of 13 children did not achieve satisfactory performance.
Across all lessons, the overall performance was not satisfactory compared to RC.

These findings indicate that the NAO robot, when integrated RC-based training, is effective in delivering educational content to children with autism.

An independent-samples t-test was conducted to examine differences between the reinforcement condition and the non-reinforcement condition on average response time and mean performance score. For response time, the reinforcement condition (M = 28.61, SD = 18.10) demonstrated significantly lower mean response times compared to the non-reinforcement condition (M = 44.44, SD = 10.76), t(48) = 3.758, p < 0.05, indicating that contingent reinforcement was associated with faster task responses.

Similarly, for performance scores, children in the reinforcement condition (M = 64.67, SD = 19.44) achieved significantly higher scores than those in the non-reinforcement condition (M = 46.00, SD = 15.24), t(48) = 3.779, p < 0.05. These results indicate that the reinforcement-based interaction policy produced statistically significant improvements in both learning performance and response efficiency compared to the non-reinforcement condition under the predefined significance level (α = 0.05)

Figure 9 and Figure 10 presents the comparison of positive and negative emotional expressions of RC and NRC. The differences in positive and negative emotional expression results for each child are shown in Figure 11 (RC) and Figure 12 (NRC). According to Section 3.8, the Difference value = Value of Positive Expression − Value of Negative Expression.

Analysis of Figure 11 (RC) reveals that:

A total of 3 children did not engage in the training session (C1, C17, and C23).
The remaining 22 children displayed clear interest in participating (C2, C3, C4, C5, C6, C7, C8, C9, C10, C11, C12, C13, C14, C15, C16, C18, C19, C20, C21, C22, C24, and C25).

Analysis of Figure 12 (NRC) reveals that:

A total of 15 children did not engage in the training session (C27, C29, C30, C31, C32, C34, C35, C38, C42, C43, C44, C45, C47, C48, and C49).
The remaining 10 children displayed clear interest in participating (C26, C28, C33, C36, C37, C39, C40, C41, C46, and C50).

These findings indicate that the majority of the children with autism responded positively to the reinforcement-based training delivered by the NAO robot, demonstrating both engagement and acceptance of this learning approach. Summary result of children’s performance are shown in Table 4.

6. Discussion

This study examined whether a reinforcement-based robotic interaction policy improves vocabulary learning and emotional engagement in children with ASD compared to a non-reinforcement condition. The findings demonstrate a clear quantitative advantage for the reinforcement condition, with 80% of children achieving satisfactory learning outcomes compared to 48% in the non-reinforcement condition. Additionally, emotional analysis revealed substantially higher positive engagement (88%) in the reinforcement condition relative to the non-reinforcement condition (40%). These findings suggest that structured positive reinforcement significantly enhances both cognitive and affective dimensions of robot-assisted learning, which proves the ability of robots to teach children with autism.

The improvement in learning performance matches what we know about Applied Behavior Analysis (ABA). In ABA, when a reward is provided after a behavior, it makes that behavior happen more often. Unlike prior studies that rely primarily on manual scoring or questionnaire-based evaluations [1,6], this study integrates automated speech-recognition-based evaluation and facial emotion analytics. It gives us an indication of how well an intervention works, using data to back it up. The results agree with studies that show rewards help children with Autism Spectrum Disorder (ASD) learn new skills. This study adds to what we know by using a robot to give rewards in a consistent and automatic way.

Findings about how people feel are really important. Other studies about robots and autism have shown that robots that look like people can help children pay attention and copy things. A lot of these studies are about how children react to other people, as opposed to their academic performance. What we found out is that when children work with robots, they do better on tasks and they do not get bored or upset as easily. This means that working with robots can make children want to learn and also help them feel better when they are doing their school work. The robots seem to help children behave and feel happy.

The thing that is important to note is that the way the robot helps children in this study is based on the principles of ABA therapy. The robot does not try to figure out how to do things or change how it thinks about things. Instead it just gives feedback that is based on how the learner is doing. This is different because it means that the learner gets better because of the way the robot is helping them, not because the robot is getting smarter. The results of this study show that it is very important to think about how a robot interacts with children and how it helps them learn, as opposed to just making the robot really advanced. The robot uses behavioral reinforcement mechanisms to help children. This is what makes the difference.

The results of this study are important to discuss in relation to making robot-assisted interventions personal. We used face recognition to give each child their own lessons but the way we rewarded them was the same for everyone. What is really interesting is that the children in the groups performed very differently, which means that having a plan for rewarding them might be more important than just making the lesson personal for each child. Robot-assisted interventions need to be further examined to see if changing how much we reward children or what they like as a reward makes a difference in how they do.

Finally, since all participants were classified as having ASD at Severity Level 2 (Requiring Substantial Support), the findings demonstrate that reinforcement-based robotic instruction can be effective for children requiring considerable social communication support. However, generalization to Level 1 or Level 3 populations requires further investigation.

This study helps the field of robotics by showing that using a kind of reward system in a robot that looks like a human makes a big difference in how well children learn and how much they care about what they are doing. This is especially good for children with Autism Spectrum Disorder. The study shows that using this kind of reward system in robots that are meant to help children can really work. It makes the case for using this kind of system in robots that are designed to help children with Autism Spectrum Disorder. The use of ABA-grounded reinforcement delivered a humanoid system is what makes this study so important for Autism Spectrum Disorder-related interventions.

7. Conclusions

Autism Spectrum Disorder is a condition that affects how children develop. Children with Autism Spectrum Disorder often have a difficult time talking to others and they do not make eye contact. They also do not show their feelings on their face much. So when we want to teach children with Autism Spectrum Disorder, we need to have a plan and use methods that we know really work. We have to make sure they know when they are doing something. This is called reinforcement. It is based on how children behave. Reinforcement helps children with Autism Spectrum Disorder acquire encouraged behaviors. Autism Spectrum Disorder is something that we have to understand so we can help these children learn.

This study experimentally evaluated a structured reinforcement-based interaction policy implemented through a humanoid robotic system. Using a randomized controlled design, reinforcement was isolated as the primary interaction variable while maintaining identical instructional content across conditions. The results demonstrated statistically significant improvements in both mean performance scores (t(48) = 3.779, p < 0.05) and average response time (t(48) = 3.758, p < 0.05) in the reinforcement condition compared to the non-reinforcement condition. These findings provide quantitative evidence that structured ABA-based reinforcement within robotic interaction enhances learning efficiency in vocabulary training tasks. Engagement analysis indicated generally positive interaction patterns; however, these findings should be interpreted cautiously given the exploratory nature of emotion classification.

Previous studies have explored reinforcement and robotic applications in ASD intervention but often relied primarily on manual or questionnaire-based evaluations [5,19,20,21,22,23], particularly in earlier developmental stages of robotic therapy systems [24,25,26,27,28,29]. In contrast, the present study integrates automated performance tracking, response-time analysis, and statistical validation to strengthen methodological rigor.

There are some limitations we need to think about. Our study looked at learning new words, and we only had a few sessions. This might mean that this method does not work for everything. Our system is also not completely automatic. If we had a Learning Management System (LMS), it would be easier to track information for a long time and make expand the robot’s capabilities. We should try to add content to learn and make the program longer, like more than one month. We should also make the system work better so it is not just talking through a connection. We need to make these changes to the Learning Management System, the system architecture, and the way it talks to things like socket-based communication, which is used by the system.

The findings show that robots can be really helpful in education when they are used in a controlled and structured way. This can be especially true when we are trying to help children learn and grow in a guided environment. The robots can be a tool for therapy when we use them in a very careful and planned way. We need to make sure we are watching how well the robots are working and using ways to measure how well they are doing. This way we can see if the robots are really helping children learn and grow. The use of robots, in education and therapy can be very good when we do it in a controlled and careful way.

Supplementary Materials

The following supporting information can be downloaded at https://drive.google.com/drive/folders/1vcZhnIbPJ-CpDf0QE-alXeyoD55dGa8Z?usp=sharing (accessed on 9 March 2026).

Author Contributions

Conceptualization: M.K., M.S.M., S.M.T. and M.H.; methodology: M.K., M.S.M. and S.M.T.; formal analysis and investigation: M.K. and M.S.M.; writing—original draft preparation: M.K. and M.S.M.; writing—review and editing: M.S.M., S.M.T. and M.H.; funding acquisition: M.K.; resources: M.K. and M.S.M.; supervision: S.M.T. and MH. All authors have read and agreed to the published version of the manuscript.

Funding

This work was funded in part by the University of Dhaka, Bangladesh, and ICT Division, Ministry of Posts, Telecommunications & Information Technology, The People’s Republic of Bangladesh. Funding Number: 56.00.0000.052.33.002.23-28, Date 6 February 2024.

Data Availability Statement

Related data are available at this URL: https://drive.google.com/drive/folders/1vcZhnIbPJ-CpDf0QE-alXeyoD55dGa8Z?usp=sharing (accessed on 9 March 2026).

Acknowledgments

This work was supported by University of Dhaka, Dhaka, Bangladesh. We acknowledge the special school, Prottasha Centre for Autism Care, for allowing us to conduct the experiments. We also acknowledge the use of ChatGPT (OpenAI version 5) for language refinement and structural suggestions, and Grammarly (version 1.2.241.1851) for grammar and proofreading support during manuscript preparation.

Conflicts of Interest

The authors declare that they have no conflict of interest.

References

Jabeen, S.; Kalsoom, T.; Nader, M.; Moazzam, M. Effect of Positive Reinforcement on Social Skills of Students with Autism Spectrum Disorder at Primary Level. Linguist. Antverp. 2021, 2430–2441. [Google Scholar]
Schuetze, M.; Rohr, C.S.; Dewey, D.; McCrimmon, A.; Bray, S. Reinforcement Learning in Autism Spectrum Disorder. Front. Psychol. 2017, 8, 2035. [Google Scholar] [CrossRef] [PubMed]
Chase, J.; Li, J.J.; Lin, W.C.; Tai, L.H.; Castro, F.; Collins, A.G.; Wilbrecht, L. Genetic changes linked to two different syndromic forms of autism enhance reinforcement learning in adolescent male but not female mice. bioRxiv 2025. [Google Scholar] [CrossRef] [PubMed]
Shi, J. The Application of AI as Reinforcement in the Intervention for Children with Autism Spectrum Disorders (ASD). J. Educ. Dev. Psychol. 2024, 9, 17. [Google Scholar] [CrossRef]
Karim, M.; Mia, M.S.; Tareeq, S.M.; Hasanuzzaman, M. Evaluate Effectiveness of NAO Robot to Train Children with Autism Spectrum Disorder (ASD). In Proceedings of the 2023 IEEE 5th International Conference on Cognitive Machine Intelligence (CogMI), Atlanta, GA, USA, 1–3 November 2023; pp. 165–174. [Google Scholar] [CrossRef]
ISO 9001:2015; Role of Positive Reinforcement to the Social Skills of Children with Autism Spectrum Disorder. Sultan Kudarat State University: Tacurong, Philippines, 2020. [CrossRef]
Hashim, H.U.; Yunus, M.M.; Norman, H. Autism Children and English Vocabulary Learning: A Qualitative Inquiry of the Challenges They Face in Their English Vocabulary Learning Journey. Children 2022, 9, 628. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
Khowaja, K.; Al-Thani, D.; Salim, S.S. Vocabulary Learning of Children with Autism Spectrum Disorder (ASD): From the Development to an Evaluation of Serious Game Prototype. In Proceedings of the Ecgbl 2018 12th European Conference on Game-Based Learning, Sophia Antipolis, France, 4–5 October 2018. [Google Scholar]
Kurth, J.; Mastergeorge, A.M. Individual Education Plan Goals and Services for Adolescents with Autism: Impact of Age and Educational Setting. J. Spec. Educ. 2010, 44, 146–160. [Google Scholar] [CrossRef]
Available online: https://www.special.pcacschool.com (accessed on 1 December 2025).
Available online: https://aldebaran.com/support/kb/nao6/downloads/nao6-software-downloads/ (accessed on 10 November 2025).
Scassellati, B.; Boccanfuso, L.; Huang, C.M.; Mademtzi, M.; Qin, M.; Salomons, N.; Ventola, P.; Shic, F. Improving social skills in children with ASD using a long-term, in-home social robot. Sci. Robot. 2018, 3, eaat7544. [Google Scholar] [CrossRef] [PubMed]
Available online: https://www.softbankrobotics.com (accessed on 1 December 2025).
Askari, F.; Feng, H.; Sweeny, T.D.; Mahoor, M.H. A Pilot Study on Facial Expression Recognition Ability of Autistic Children Using Ryan, A Rear-Projected Humanoid Robot. In Proceedings of the 2018 27th IEEE International Symposium on Robot and Human Interactive Communication (RO-MAN), Nanjing, China, 27–31 August 2018; pp. 790–795. [Google Scholar] [CrossRef]
Jeon, M.; Zhang, R.; Lehman, W.; Fakhrhosseini, S.; Barnes, J.; Park, C.H. Development and Evaluation of Emotional Robots for Children with Autism Spectrum Disorders. In HCI International 2015—Posters’ Extended Abstracts; Springer: Cham, Switzerland, 2015; p. 528. [Google Scholar] [CrossRef]
Pour, A.G.; Taheri, A.; Alemi, M.; Meghdari, A. Human–Robot Facial Expression Reciprocal Interaction Platform: Case Studies on Children with Autism. Int. J. Soc. Robot. 2018, 10, 179–198. [Google Scholar] [CrossRef]
Qiu, L.; Zhai, J. A hybrid CNN-SVM model for enhanced autism diagnosis. PLoS ONE 2024, 19, e0302236. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
Alsaidi, M.; Obeid, N.; Al-Madi, N.; Hiary, H.; Aljarah, I. A Convolutional Deep Neural Network Approach to Predict Autism Spectrum Disorder Based on Eye-Tracking Scan Paths. Information 2024, 15, 133. [Google Scholar] [CrossRef]
Available online: https://ml.cms.waikato.ac.nz/weka (accessed on 9 January 2025).
Burns, R.B.; Seifi, H.; Lee, H.; Kuchenbecker, K.J. A Haptic Empathetic Robot Animal for Children with Autism. In Proceedings of the Companion of the 2021 ACM/IEEE International Conference on Human-Robot Interaction, Boulder, CO, USA, 8–11 March 2021; pp. 583–585. [Google Scholar] [CrossRef]
Rudovic, O.; Lee, J.; Mascarell-Maricic, L.; Schuller, B.W.; Picard, R.W. Measuring Engagement in Robot-Assisted Autism Therapy: A Cross-Cultural Study. Front. Robot. AI 2017, 4, 36. [Google Scholar] [CrossRef]
Alnajjar, F.; Cappuccio, M.; Renawi, A.; Mubin, O.; Loo, C.K. Personalized Robot Interventions for Autistic Children: An Automated Methodology for Attention Assessment. Int. J. Soc. Robot. 2021, 13, 67–82. [Google Scholar] [CrossRef]
Taheri, A.; Meghdari, A.; Mahoor, M.H. A Close Look at the Imitation Performance of Children with Autism and Typically Developing Children Using a Robotic System. Int. J. Soc. Robot. 2020, 13, 1125–1147. [Google Scholar] [CrossRef]
Kumazaki, H.; Warren, Z.; Swanson, A.; Yoshikawa, Y.; Matsumoto, Y.; Takahashi, H.; Sarkar, N.; Ishiguro, H.; Mimura, M.; Minabe, Y.; et al. Can Robotic Systems Promote Self-Disclosure in Adolescents with Autism Spectrum Disorder? A Pilot Study. Front. Psychiatry 2018, 9, 36. [Google Scholar] [CrossRef] [PubMed]
Kim, E.S.; Berkovits, L.D.; Bernier, E.P.; Leyzberg, D.; Shic, F.; Paul, R.; Scassellati, B. Social Robots as Embedded Reinforcers of Social Behavior in Children with Autism. J. Autism Dev. Disord. 2013, 43, 1038–1049. [Google Scholar] [CrossRef] [PubMed]
Kumazaki, H.; Yoshikawa, Y.; Yoshimura, Y.; Ikeda, T.; Hasegawa, C.; Saito, D.N.; Tomiyama, S.; An, K.-M.; Shimaya, J.; Ishiguro, H.; et al. The impact of robotic intervention on joint attention in Children with Autism spectrum disorders. Mol. Autism 2018, 9, 46. [Google Scholar] [CrossRef] [PubMed]
Talaei-Khoei, A.; Lewis, L.; Kaul, M.; Daniel, J.; Sharma, R. Use of Lean Robotic Communication to Improve Social Response of Children with Autism. In Proceedings of the Twenty-Third Americas Conference on Information Systems, Boston, MA, USA, 10–12 August 2017. [Google Scholar]
Suzuki, R.; Lee, J.; Rudovic, O. NAO-Dance Therapy for Children with ASD. In Proceedings of the Companion of the 2017 ACM/IEEE International Conference on Human-Robot Interaction, Vienna, Austria, 6–9 March 2017. [Google Scholar] [CrossRef]
Simut, R.E.; Vanderfaeillie, J.; Peca, A.; Van de Perre, G.; Vanderborght, B. Children with Autism Spectrum Disorders Make a Fruit Salad with Probo, the Social Robot: An Interaction Study. J. Autism Dev. Disord. 2015, 46, 113–126. [Google Scholar] [CrossRef] [PubMed]

Figure 1. Block diagram of proposed system.

Figure 2. Steps for configuring robot interaction. ALFaceDetection module is a face detection/recognition solution provided by OKI [12].

Figure 3. Process of selecting child-specific lesson.

Figure 4. Proposed Interaction Protocol for the Training Session RC and NRC.

Figure 5. Scale of performance score.

Figure 6. Interaction session between child and NAO robot.

Figure 7. Children’s performance based on reply time.

Figure 8. Children’s performance based on performance score.

Figure 9. Comparison of positive and negative emotional expressions in the sessions under the RC.

Figure 10. Comparison of positive and negative emotional expressions in the sessions under the NRC.

Figure 11. Difference between value of positive and negative expressions of children during the sessions under the RC.

Figure 12. Difference between value of positive and negative facial expression of children during the sessions under the NRC.

Table 1. Learning items for the experiment.

Training ID	Training (Learning Word Meaning)	Outcome
101	Cow, Goat	Children will learn the words for two domestic animals.
102	Sky, Moon	Children will learn the words for two natural objects.
103	Head, Ear	Children will learn the words for two parts of the human body.
104	Hen, Duck	Children will learn the words for two domestic birds.
105	Train, Bus	Children will learn the words for two vehicles.

Table 2. Children’s information.

Session Type	Number of Children	Male–Female Ratio	Mean Age	Age Standard Deviation
Robotic session RC	25	18:7	7.6	2.4
Robotic session NRC	25	20:5	8	2.5

Table 3. Participating children’s data for randomization.

Block	Age Range	Male:Female	Autism Severity	Total	RC	NRC
B1	7–12	18:5	Level 2	27	14	13
B2	7–13	20:7	Level 2	23	11	12
Total	50	50	50	50	25	25

Table 4. Summary result of two groups.

Criteria	Reinforcement Condition (RC)	Non-Reinforcement Condition (NRC)
Reply Time (Seconds)	29	44
Average Performance Score	65	46
Average Value of Positive Emotion	0.608	0.433
Average Value of Negative Emotion	0.380	0.544
Number of Children Displaying Engagement	22	10
Number of Child Displaying Non-Engagement	3	15

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Karim, M.; Mia, M.S.; Tareeq, S.M.; Hasanuzzaman, M. Reinforcement-Based Person-Specific Training for Children with Autism Using a Humanoid Robot NAO. Robotics 2026, 15, 66. https://doi.org/10.3390/robotics15040066

AMA Style

Karim M, Mia MS, Tareeq SM, Hasanuzzaman M. Reinforcement-Based Person-Specific Training for Children with Autism Using a Humanoid Robot NAO. Robotics. 2026; 15(4):66. https://doi.org/10.3390/robotics15040066

Chicago/Turabian Style

Karim, Masud, Md. Solaiman Mia, Saifuddin Md. Tareeq, and Md. Hasanuzzaman. 2026. "Reinforcement-Based Person-Specific Training for Children with Autism Using a Humanoid Robot NAO" Robotics 15, no. 4: 66. https://doi.org/10.3390/robotics15040066

APA Style

Karim, M., Mia, M. S., Tareeq, S. M., & Hasanuzzaman, M. (2026). Reinforcement-Based Person-Specific Training for Children with Autism Using a Humanoid Robot NAO. Robotics, 15(4), 66. https://doi.org/10.3390/robotics15040066

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Reinforcement-Based Person-Specific Training for Children with Autism Using a Humanoid Robot NAO

Abstract

1. Introduction

2. Literature Review

2.1. Reinforcement and Applied Behavior Analysis in ASD

2.2. Reinforcement-Based and Technology-Assisted Interventions

2.3. Identified Research Gap and Study Positioning

3. Proposed System Description

3.1. Lesson Creation

3.2. Robot Interaction Configuration

3.3. Child Identification Through Face Recognition

3.4. Children’s Specific Lesson Selection

3.5. Children’s Training by the Robot

3.6. Children Learning Evaluated by the Robot

3.7. Children’s Learning Assessed by Trainer

3.8. Emotion Detection from Facial Video

4. Experiment

4.1. Study Design

4.2. Participants and Procedure

4.3. Experimental Procedure

4.4. Measurements

4.5. Statistical Analysis

5. Result Analysis

6. Discussion

7. Conclusions

Supplementary Materials

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI