1. Introduction
English is considered to be the international language of communication and is termed English as a foreign language (EFL) for its learners. Therefore, proficiency in speaking and listening to English has become necessary in communicating about and understanding worldwide matters [
1]. Moreover, there is a growing need for EFL learners’ proficiency levels to be reliably assessed for a variety of reasons ranging from enrolling in higher education to migrating to an English speaking country for professional reasons. This is determined by certain international organizations [
2] which categorize EFL learners into different groups based on their English proficiency levels. The age of technology has resulted in technology becoming popular in each aspect of current modern life, with teaching and learning being in the forefront of this transition. Many people are studying various topics using Internet sources such as YouTube videos, webpages, etc., [
3].
Similarly, many people are currently learning English from YouTube videos. Moreover, traveling and enrolling in educational institutes require good command of English, and with the advances of and availability of different ICTs, this process has now become more flexible [
3]. As a result of using ICT, learning has become more flexible, creative, and enriched in quality [
4]. Learning in any domain using ICT results in benefits in various areas; according to a report published by the National Institute of Multimedia Education in Japan, ICT-technology-based learning is more effective compared with traditional study in classrooms. Further, it is reported that ICT-based learners have higher presentation skills, creativity, and understanding [
5] of topics.
The use of electronic devices allows the provision of more visually appealing illustrations of topics via audio and video description, graphical representation, animated description, etc. Although these ICT technologies are also being utilized in classroom-based studies to increase student interactivity, remote access to YouTube-like platforms makes learning more accessible for anyone [
6]. Smart choices for a more flexible learning environment are being abundantly adopted, including in the form of mobile learning, social interaction, personal grooming and growth, global networking and engagements, etc. [
7,
8,
9]. Personal growth is one of the soft critical skills that any individual needs for development and involves autonomous learning of an individual at their own pace and environment to cater to those specific needs they desire.
Further, linking and structuring elements of any specific topic is also more beneficial to the learner to gain broader and stronger conceptualization [
10]. Interactivity and socialization are other vital factors in learning environments. Individuals interacting with others in academic life helps others to learn from them and explore more ideas [
11]. Inclusiveness is another factor in learning environments, and it highlights the barriers that cause issues when teaching students. In this way, the students, regardless of social, national and religious status, can receive equal opportunities in their studies [
12].
All these above mentioned qualities of a learning environment are available on a single ICT platform, YouTube, for learning and teaching the English language. Further, a greater variety of authentic teachers [
13] from different native languages are available, providing a more specialized experience for EFL learners. Pun et al. [
14] state that the content delivered on online multimedia is more functional and has more up-to-date data than textbooks. Further, learning English from YouTube can lead to more up-to-date learning compared with regular institutional studies. At present, many professors in universities and colleges have started their own YouTube channels in which they upload their lectures as a complete course, which helps many students in revising the lectures with flexibility with reference to time, and absent students can also benefit from this through remote learning.
However, many recent studies on YouTube-based English language learning have focused on conducting interviews and questionnaires of EFL learners that seek English from YouTube videos. This highlights the significance of YouTube-based learning and raises specific research questions. The use of manual interpretation of YouTube-based EFL learning can be misguided and biased, as discussed in reports on previous studies of YouTube-based English learning. Therefore, analysis of the real-time collected comments, likes, and responses to those comments by other users can give practical information regarding YouTube-based English language learning. Further, examination of the YouTube comments provided by citizens of different nations can provide greater insight into YouTube-based English language learning. Our current study solves the following challenges regarding YouTube-based EFL learning:
An automated solution was established to collect multinational data about YouTube-based EFL learning and analyze the most discussed topics and the sentiments of learners regarding these topics.
How and why does English learning from YouTube videos influence students compared with conventional EFL learning? Is YouTube helping or misleading online consumers?
What are the challenges and difficulties of learning from internet sources?
Suggestions and hypotheses were derived based on LSA-based topic modeling of user feedback in the form of YouTube video comments.
To answer these questions, in our study, the data of comments and other information from several popular YouTube channels were collected. These include the benefits of learning from YouTube videos and the gaps that some viewers lack. The remaining sections of this work are
Section 2: Related Work;
Section 3: Material and Methods;
Section 4: Findings and Discussions;
Section 5: Hypothetical Findings and Suggestions; and
Section 6: Conclusion, which includes future suggestions.
2. Related Work
Many of the studies that investigated YouTube-based English language learning highlight the benefits of learning from YouTube. They mostly relied on manual collection of datasets via conducting interviews and the use of questionnaires. This targeted specific users, while it also did not involve real-time comments or data. However, we highlight a few studies which were experimental in nature here.
A study conducted by a Taiwanese university reported certain facts about YouTube-based English learning and included 20 students in the hypothetical experimentation, and answers to certain predefined questions were obtained from these students. The three major areas of concern were highlighted in terms of three questions asked by the interviewers and classified into multiple options based on the EFL learners. Essentially, classroom- and YouTube-based EFL learning were differentiated. Further, the study explored whether learning is self-regulatory or not using the responses of EFL learners. The study was based on SRL models and focused on six goals. The inter-rater agreements were calculated based on EFL responses and the SRL models, and the responses were correspondingly classified. Lastly, the differences and similarities between EFL and regular classroom learners were highlighted [
15]. In another example, YouTube-based language learning research was used as the basis for a conceptual study that marked the advancements and limitations of YouTube-video-based language learning environments. It reported that classroom-based language learning could be more effective if YouTube-based assistance is added to guide the students. Further, due to the availability of authentic content, the learners can be more involved in the selection of the instructor in the YouTube videos [
16]. The study was conducted in a school where English was studied as a subject, and for data collection, 14 randomly selected students were enrolled. The students had both online and offline experiences, and the conducted study based its conclusions on student reviews.
In another study, three main questions were analyzed based on the responses of students to certain questions on (1) the differences in offline and online studies for EFL learners, (2) user interaction regarding study using ICT tools or YouTube as a learning environment, interaction, and engagement to multimedia, and (3) the learner’s personal experience of learning and speaking English using ICT tools. For the quantitative exploration of the results, the Pearson correlation test and regression analysis were used. The time range of YouTube channel videos was analyzed, wherein the exposure between these watching times was analyzed. For qualitative analysis, the improvement in writing skills and the affordance of YouTube videos were reported by asking all learners simple questions. The study also concluded that negative aspects were related to personal growth and the accuracy in the fluency of learning and writing English [
17].
Wahyuni et al. conducted a study on YouTube-based English language learning in an Indonesian setting, and a class of 40 students was observed. It was reported that the learning environment of any learning is highly significant. Therefore, the English language learning-based study was conducted by applying a qualitative approach in the experimental setup. Mainly English speaking was analyzed, where four steps were repeatedly applied to answer the questions asked in the questionnaire reported in that study. The statistical measures of mean, standard deviation, maximum, and minimum were applied to the questions and their answers. It was concluded, based on the statistical analysis, that YouTube provides a good environment for learning English speaking [
18].
The author of another study claimed that there are two modes of learning: audio and video. The study recommends video illustration over audio for English speaking. The cognitive reasoning regarding the use of video and audio media is described in the methodology section, where the multiple styles of learning and effective reasons are further described. It was reported that when using video-based EFL learning, there is less cognitive load on the learners. It was further stated that YouTube is not only useful for entertainment but can also be beneficial for language learning [
19].
Putri et al. [
20] reported in their study that YouTube is a more flexible, entertaining, and fun environment to learn the English language. The study included experimental data based on 12 students. Qualitative analysis was performed by asking questions on different cultural, social, meta-cognitive, resource, effectiveness, and goal commitment aspects. The analysis was based on the different factors, concerned with their effect on how swiftly the English language can be learned.
Another study based on making and uploading English-speaking videos on YouTube was conducted based on 10 participants and qualitative analysis of their answers to questions was performed. The questionnaire in use comprised five questions with an 80% response rate, which showed that before making English-speaking assignment videos and uploading them on YouTube, extensive practice was conducted on correcting their English pronunciation. Therefore, to ensure correct English pronunciation, English speaking could be improved by uploading English-speaking videos. The different suggestions to improve English pronunciation made by all learners were also provided [
21].
To analyze YouTube-based EFL learners, the authors of [
22] stated, in their study, that this platform contributes to developing individual participation as well as critical thinking and social engagement. Authentic English teaching material is available on YouTube. Additionally, a few channels also provide supplementary materials for English language learning. Including all these aspects results in YouTube being an effective source of learning for EFL learners. Another study on YouTube-based EFL learning included 32 students [
23]. The pros and cons of the learning environment were discussed by presenting students with open-ended questions. The report concluded that YouTube is a very familiar, cheap, and flexible platform for EFL learning. However, participants encountered difficulties that could be resolved by using different YouTube-video-watching features and by discussing problems with teachers and friends.
A study in an Indonesian university included students participating in EFL learning via YouTube [
24]. This involved the use of the Web 2.0 program of the YouTube platform, which provides many other benefits over traditional EFL learning. This research uncovered the benefits of YouTube-based learning over classroom learning. Certain practices were also suggested for improving teaching in the future to ensure students felt more at ease in that environment.
There are many areas of EFL learning in which different people are weak and wish only to improve in that specific aspect. To investigate this, a study was conducted on YouTube-based EFL learning for improving pronunciation [
25]. The benefits of using YouTube were revealed through such remarks as videos are free to watch, different speakers from different nations even with good British and American English could be found to improve pronunciation, in particular, and the possibility to watch videos anywhere and also on different devices. The recommendations given were to learn pronunciation while watching YouTube videos, where important points were also given before and after watching videos.
In another study, student reviews were collected using Google Forms, and based on the questionnaire answers, it was concluded that YouTube-based EFL learning occurs most specifically according to the content given in the course with the addition of attractive and flexible features [
26].
The different recent studies on EFL learning discussed above highlight the use of YouTube in English language learning for improving writing, speaking, grammar, and pronunciation.
However, the data that such studies utilized were only manually collected, and qualitative and quantitative analysis was mostly performed on only certain and specific questions. The research questions in those studies were built upon personal impressions and analyzed with some statistical approaches. Therefore, to bridge a clear research gap related to identifying wider challenges of YouTube-based English language learning, an automated solution to collect real-time responses from channel comments is needed. Moreover, qualitative and quantitative discussion of EFL learner responses is also required. In this way, the problems and benefits can be highlighted. Further, inter-comparison of the different YouTube-based channels could allow pedagogical implications to be formulated and shared with content makers to facilitate more positive responses from their viewers.
3. Materials and Methods
In the current study, user comments on three popular YouTube English teaching channels were collected via an API provided by Google developer services. The data were cleaned after scraping from the API, and playlist-wise topic modeling was performed, where each channel playlist topic was compared to check the user response to the topic. The primary purpose of this was to increase the audience and enhance their interest by proposing suggestions for new and already existing and popular YouTube channels. The results obtained form the current study may also carry important implications which can pour into the improvement of traditional EFL classroom environment. All primary steps of the current study are shown in
Figure 1.
Data acquired from YouTube were then subjected to cleaning. The user response on each topic of each channel is important. Therefore, at first, the sentiment of the user was checked using the TextBlob method of sentiment analysis, and, secondly, their topics of discussion in the comments box were discussed following topic modeling via the LSA method. The most popular topics and their associated used words were discussed, and suggestions were proposed by determining people’s sentiments on these topics.
3.1. Data Collection and Cleaning
In previous studies, authors mostly selected interested students from different semesters and assigned them to watch videos and share narratives/notes describing their impressions. After that, interviews and questionnaires were conducted and completed to analyze the topics and behavior. The predefined questions asked mostly answer the following major and basic questions: Is YouTube a good source of learning for EFL learners or not? Similarly, are the sources on YouTube trustworthy and authentic or not?
However, in all previously used approaches, automated and real-time collection from a multinational audience was lacking. In this way, the known issues and benefits of using YouTube-based EFL learning would also be lacking. In our study, we manually selected the three of the most viewed channels on EFL learning on YouTube and scraped people’s comments on them together with their likes and responses on each playlist. A YouTube developer account was created and the API key was used, and the collection of user responses according to channel and playlist was performed. The selected channel names are (1) Learn English with Englishclass101.com (
) [
27]; (2) Learn English with Jessica (
) [
28]; (3) Speak English with Vanessa (
) [
29].
The textual data of comments on these channels contained stop words, URLs, and other redundant information that was removed using the regular expression library of Python. After collecting and cleaning data on all playlists, a topic wise analysis was performed. We highlight the main findings from this step below.
3.2. Dataset Topics
Different topics of discussion were found on the three different channels. The user response and interest in all topics were analyzed and included negative, positive, and neutral responses. First, the video topics are discussed here, and the findings related to these are discussed in
Section 4.
Table 1 describes the datasets and their discussed topics.
We can see in
Table 1 that there are several different topics discussed in the videos. The first dataset contains the topics of English vocabulary, conversational phrases, listening, and speaking at beginner and advanced levels in both British and American contexts. The second dataset involves creating animated videos containing anonymous objects. Most of the videos in this dataset contain the regular use of English regarding speaking.
The third dataset contains interesting topics compared with both of the above-discussed datasets. It contains the regular content for EFL learners as well as live interviews and the fearless discussions of its users/subscribers through Skype and other live interactions. It also contains different levels of the syllabus for EFL learners according to their usages, such as business and travel English. The user response on all these datasets and their topics is more interesting and is discussed in the following section of this manuscript.
3.3. User Sentiment in Dataset Topics via Sentiment Analysis
In previous studies, statistical analysis was performed on answers to predefined questions in questionnaires asking about YouTube videos. The analysis concerned user sentiment and suggestions to watch YouTube videos for EFL learning. Similarly, there was an automated collection of user sentiment based on open-ended comments, sentiment analysis of all comments on different topics in the videos was performed for each dataset, and the sentiments on each topic are shown in
Section 4. TextBlob was used to perform sentiment analysis.
3.4. User-Response-Based Topic Modeling via Latent Semantic Analysis (LSA)
In previous studies, most of the questions and topics were user-defined and collected and provided by authors. In our study, we examined user comments on different YouTube channels in which there were different topics of discussion. To analyze the topics of discussion in comments, topic modeling based on LSA [
30] was performed.
The cleaned textual data were first provided to the LDA model [
31], and the most frequently occurring unique topics from each playlist of each channel were analyzed. The mathematical representation of applied steps on documents or input data are shown in Equations (
1)–(
3).
In Equation (
1),
represents the three datasets, where
n varies from 1 to 3, and
i represents a certain comment or text document that is passed from the preprocessing step to clean the document from stop words, URLs, etc. The cleaned data are stored in
. These cleaned data are then used to create the count vector document matrix. The mathematical representation is shown in Equation (
2).
represents the document count against the cleaned data matrix
. The count is the term count in each document or text comment in this case study. To obtain multi-matrix decomposition as acquired by the LSA model, the singular value decomposition (SVD) method is used, where topics on each channel’s comments data are manually specified here. SVD is a matrix term that is used to reduce the matrix dimensionality [
32]. The mathematical representation is shown in Equation (
3).
In Equation (
3), two input parameters are specified while making the
matrix that will generate the unique topics of discussion. Topics are represented by
, where
m is the number of topics, and the output is calculated by using specific topics (
m) and the cleaned count vectorizer of each document. After obtaining user topics of discussion via each dataset, sentiment analysis is performed to determine user sentiment as being either positive, negative, or neutral.
4. Findings and Discussion
There are two types of topics that need to be discussed in this section about the EFL learning topics and the user responses to them. First, let us explore the collected datasets and the sentiments of people about them.
4.1. Sentiment Analysis of Collected Datasets
There are three datasets collected where the basis of the collection was their different playlist topics. To identify people’s sentiments on each topic of channels, the three basic sentiments of people were collected based on the polarity of their comments. The sentiments for each topic are shown in
Table 2.
There are many topics, and the different sentiments regarding these can be seen in
Table 2. Regarding our first dataset topics and their sentiments, we can see that there are six unique topics for which the user sentiment analysis was performed. The sentiments regarding basic vocabulary remain less negative compared with those on the discussion of conversational phrases.
Fun and ease, which is the third topic of sentiment analysis in the table, is also associated with 9% negative sentiment, and this could be due to the critical thinking about this topic of discussion. The fourth topic is associated with far less positive sentiment and is American English intermediate, and the next topic is more negative and is advanced-level American English reading practice.
The British English reading-based comments received more positive feedback compared with American readings, as can be seen in . This could be attributed to higher quality exposure of and practice in the British variety of English compared with the American context, though this needs to be formally established in a future study utilizing validation and triangulation techniques.
Regarding the second dataset, which specifically utilized animated videos to teach spoken English, the self-made objects were made and used to practice spoken English. Both playlists were associated with positive sentiments and some (5%) negative sentiments. Therefore, it can be said that increasing excitement by employing non-traditional and innovative techniques such as making animated videos for object-based English practice and learning can lead to positive impressions amongst the target audience.
4.2. LSA-Based Topic Modeling of User Comments
Topic modeling based on user comments was performed, where the most popular topics are also identified with the use of the most significant words in them. The count-vector-based feature vector is provided to the SVD matrix maker and the LSA model is then applied.
4.2.1. Dataset 1 () Topics Discussed by Users
The topics of user comments from each playlist of datasets were collected, and those for dataset
are shown in
Table 3.
The table of topics in the dataset () shows the major discussed topics based on videos of each topic. The first topic () shows the type of topics that are more likely to be associated with positive sentiments and even show that the user Jammu Kashmir belongs to and is inspired by the English teaching of this channel.
Secondly, we can see in that people from Saudi Arabia and Bangladesh follow this channel and strongly recommend the language learning style used here for conversational phrases videos. In the third topic of this channel, the user mentions that it is a fun and easy way to learn English. It includes user-interaction-based activity to comment in English for a given phrase. Therefore, the user comments are almost the same, talking about bills, electricity, education, etc., again and again.
Only three topics of discussion are listed in the table where other similar topic related comments are discarded. The fourth and fifth topics, which are on American English learning, already showed slightly negative sentiments of people compared with British English on this channel. In the user comments, the user is saying that the video is boring but simple and has given suggestions for where improvement is needed.
The next topic on British English learning received positive comments, and it also came from a multinational audience learning British English from this channel that positively recommended taking British English lessons through this YouTube channel.
4.2.2. Dataset 2 () Topics Discussed by Users
The users of the second channel gave mostly positive responses on data, where their discussed topics are discussed in
Table 4.
In the second dataset, channel comments where animated videos were given received many positive comments and best wishes, whereas comments about the video maker and story maker being nice were given. Now, it can be noted that by making objects and animated videos, plays, and by making small stories in an innovative fashion, we can increase the efficacy of EFL teaching utilizing ICT. Again, this is an area of potential future research focusing on innovative content creation in ICT and how it can increase its utility and impact in the learning process. Certain areas of language learning such as expressions, pronunciation, and vocabulary can be easier to deliver using innovative content as compared to other areas such speaking.
4.2.3. Dataset 3 () Topics Discussed by Users
The third channel includes interesting topics, compared with the two other channels, that essentially use the same strategy to teach English, which also works in regular classrooms and courses for EFL learners. The users on this channel and its topics also showed a higher proportion of comments and sharing compared with the other two channels. To examine the differences and reasons for more interaction of users of this channel, the most discussed topics are shown in
Table 5.
As previously discussed, there are certain unique topics of discussion that are distinguished and described by the LSA model. For example, the users of dataset () showed more interest compared with those of datasets 1 and 2, and there are many aspects that seem to engage the audience. At first, the very basic start of this channel poses the question “How to learn English?", a very basic question in itself to which there are many different answers, and even people of different ages and language backgrounds could all learn English. People’s sentiments and comments all remained positive and they appreciated the way to start EFL learning.
Secondly, there is a global issue of whether to learn either British or American English. The differences are highlighted when people discuss this topic, and excellent differences are highlighted with acquired expressions to speak the English of both varieties. The third topic , to take English lessons again, engages many audiences, the teacher’s accent and expressions are, again, much appreciated, and the lesson is also much appreciated by the audience and viewers.
Acquiring and retaining Vocabulary can be a challenge for many EFL learners. Users who visited the YouTube channels under investigation appear to agree with this. On this topic, , many people from different countries, such as from Australia, say that Indonesian people watching a video on this topic feel more relaxed and inspired after watching these videos. The next topics of the video are related to English on business and travel demonstrating a narrower focus on a less targeted EFL learning area especially in a free online resource.
However, business English with topic ID showed all positive responses with, again, expressions-based acknowledgment. The English grammar topics and the phrasal verbs are much appreciated, and people immensely enjoyed these difficult topics. Some users found the channel to be unique and rare with interesting topics and expressions based on EFL learning. In , the people appreciated the advancement of this course of EFL learning and showed immense appreciation to this teacher. For this topic, it was also mentioned that a practical and easy way of learning was introduced, and teachers are increasingly struggling to teach lessons in an interesting way.
The people commenting on topic also mention that they accidentally found an amazing and fluent English-speaking channel. The last topics are all practice- and interaction-based and lead to different, but fully positive, sentiments. The live sessions with students, users, and regular viewers are a unique and fruitful activity to teach English. The people themselves give positive remarks such as that they can never forget this fluent way of speaking English full of expressions and that they are more likely to understand it based on the positive expressions of the teacher.
4.3. User Difficulties Found in the Collected Datasets
In the collected datasets, although the sentiments mostly remained positive, there are a few concerns highlighted in all topic-based data from the scraping of datasets. The remarks of interest are represented as a word cloud in
Figure 2, which summarizes the negative topics and remarks of dataset 1.
We can see from the word cloud that the topics of discussion are narrow in words such as that the user “wants” something; for example, “speak” loudly, “boring”, “exams”, “slowly”, etc., are the top negative words used in dataset
channel comments. The negative comments can be seen in detail in
Table 6, which shows user negative remarks more deeply forthe difficulty level.
Users’ negative comments mainly revolved around a few areas which we discuss here. First, word repetition made videos according to users boring to watch. Speaking loudly is also a topic causing negativity in users’ discussion, and speaking slowly is also another issue. In addition, exam difficulty, where applicable, is another source of negative sentiments that can be exhibited as a user’s personal problem with the site.
To determine the difficult topics in dataset
, the top words of interest are also shown as a word cloud in
Figure 3. The top words show, again, the positive remarks, and why a comment is marked as negative should be discussed. The negative topics are listed in
Table 7.
There is a mix of positive and negative words in negative marked comments in dataset 3. Words such as “thanks”, “British”, and “phrasal” need to be examined in more depth when looking at topics that are listed in
Table 7.
The topics with the highest keywords found in comments categorized as negative from TextBlob methods are shown in
Figure 3, and the filtered topics are shown in
Table 7. Most of the comments were expressing thanks to the content creator Vanessa, whereas the negative or questioning comments are showing that some users are asking whether British and American phrasal verbs are the same or different, and what the differences are. The other topic is the same as in dataset (
): “Slow down the speed of speaking where you speak very fast”. The other topic is that the “user feels strange that teacher always keep smiling”.
The topics of discussion with negative sentiment are quite the same and show that the user feels difficulty when they are learning. However, slowing the speed, which was found to be a common difficulty in users of different datasets, could be solved using the YouTube feature to slow down the video playback. The other remarks regarding always smiling is a single person’s opinion, where most of the users that commented on dataset 3 videos highlighted the importance and happiness toward the expression of its teachers. Word repetition in video creation could be an irritating point while learning English. The created video does not need to be long, and if long, it should not be boring, ensuring that interesting facts and activities are added to avoid boredom.
4.4. Key Features of YouTube-Based EFL Learning
The most prominent key feature of YouTube-based EFL learning is its remote access and control. Restrictions such as those regarding time and speed are not relevant. YouTube also provides subtitles and options for speed increase and decrease when playing videos that could be more beneficial for listening and understanding video topics according to user interest. Punctuation and pronunciation problems can be addressed in either British or American Englishes, as both can be found on YouTube. The live streaming feature is now also of interest to engage students globally to ask questions and to give suggestions for acquiring a different set of skills within the EFL learning environment. We saw reviews from people that are mostly positive regarding online or YouTube-based EFL learning. This positivity relies on many key factors. The key points or features of YouTube-based EFL learning are video sharing among friends or classmates by teachers, which reduces the reproduction of the same lecture by the instructor. It also assists absent students to catch up on certain topics. The other feature could be its larger and more extraordinary syllabus or content compared with regular classroom-based textbook learning. Global viewership is another important feature which can be considered a strength of YouTube-based EFL learning. People from different nationalities watch these YouTube-based EFL learning channels and it would be interesting to gather more information of this global viewership (e.g. where the user posting the comment is actually from). The people that respond in comments are visually illustrated according to the country in
Figure 4.
We can see that people from mostly the Asia Pacific region engaged in collected datasets viewership, whereas a few of them also belong to Saudi Arabia, Australia, and Indonesia. Few users are found in the United Kingdom, United States, and other European countries. Therefore, we could emphasize here that regions that people are mostly from may need more English learning, and those from other countries may also need EFL learning but not as much as people from Asia.
5. Hypothetical Findings and Suggestions
The topics of three different YouTube channels and discussions of their users were investigated in this study. Many new findings were uncovered, compared with previous studies, and the summarized hypothetical findings are discussed here and shown in
Table 8. In
, classroom-based textbook learning has a traditional old syllabus, whereas it is found that YouTube channels include a larger syllabus and new content compared with textbooks. An issue for consideration, as stated in
, is determining how to learn English vocabulary and pronunciation, which is a basic topic discussed by one of the investigated YouTube channels. Therefore, it is found to be an important factor to be included in learning. It is also found that expression-based EFL learning is important as it includes the necessary accent, expression, etc., and is described in
. Fearless discussions between teachers and students are also an important factor in EFL learning that increases the confidence of learners and is shown in
. This will boost EFL learning. EFL learning varies according to people’s interests, such as regarding British, American, and business English. Therefore, an acquired syllabus and content are needed in EFL learning, as mentioned in hypothesis
. A general and specific age-based understanding of EFL learning is required when designing an EFL course.
shows this general requirement that needs to be added in courses.
In , user-based suggestions are specifically given to learn English according to your interests. In , trainer authentication needs to be cross-checked to determine whether the user is learning correctly or not. In , each type of question asked in comments or live interaction should be answered to engage an increasingly larger audience. In , ask the EFL learner what new things, improvements, and suggestions for things need to be added in future videos or courses. In , one of the channels included animated videos and stories in which topic-wise discussion was added for listening and viewing by users. There are certain problems, challenges upon exploration, and discussion of datasets that are found listed in tabular format. There are two types of categories of findings from the YouTube channel datasets, and the user discussions on them were explored and used as a basis for deriving certain hypotheses and suggestions. From H1 to H5, how things are happening on different YouTube channels based on EFL learning is discussed.
The positive and dominant factors compared with regular classroom-based textual studies are discussed. These hypotheses play major roles in YouTube-based EFL learning compared with regular classroom EFL learning. However, a few suggestions made on user responses in comments were collected that could improve both classroom- and YouTube-based EFL learning.
6. Conclusions
The current study scraped the comment data of three different YouTube channels, using the YouTube developer-provided API service. The scraped data were collected and cleaned over a textual preprocessing phase and then assessed to determine user sentiments on topics discussed in them. Then, the user comments based on top and unique topics were collected to obtain user reviews of these. To obtain topics of user discussion, the LSA method was used, which works on the sentiments of a given document. In previous studies, manual inspection of YouTube-based EFL learning was conducted on limited datasets of mostly less than 100 people. Thus, they do not highlight the benefits and challenges of global users. However, this study used an automated method for data collection as well as to find topics in user comments. This resulted in making hypotheses regarding the importance and uniqueness of YouTube-based EFL learning. The benefits of this learning are also discussed, such as having a larger syllabus on YouTube as compared to textbook learning, expressions and punctuation following words, course design that covers a multinational audience and is suitable for learning by a student of any age, and allowing separate learning of each variety of English, such as British and American. Further, classroom-based EFL learning could also be improved by implementing the following suggestions: find the course of interest first based on one’s needs which can be discussed with a tutor or any English teacher to optimize the learning experience, participate in fearless educator–learner interaction and engagement, and ask other EFL learners for their previous experiences with learning online in order for the learner to maximize benefit.