4.1. Key Topics of AI Privacy in Higher Education
To answer RQ1—what are the key topics of AI privacy in higher education reflected in Western and Chinese news?—we performed non-negative matrix factorization (NMF) topic modeling on the final dataset of 315 English and 347 Chinese news articles.
4.1.1. Overview of the NMF Topic Modeling Process
Before applying NMF, we preprocessed the English and Chinese corpora. For English texts, stopwords were removed and lemmatization was applied. For Chinese texts, we used Jieba for segmentation, removed stopwords, and merged key synonyms (e.g., “高校” and “大学”). We then constructed a TF-IDF matrix. To determine the number of topics, we tested values of k ranging from three to six and selected k = four based on semantic interpretability, minimal overlap between topic terms, and relevance to the research questions. Although we did not apply formal coherence scoring, the selected model provided thematically distinct and meaningful clusters based on manual inspection.
4.1.2. Topics in Western Media
After applying NMF to the 315 English articles, four main topics were obtained.
Table 1 summarizes the model output by presenting the top keywords for each topic, providing a structured and interpretable view of the topic modeling results.
Topic 1 frequently discusses major technology companies (e.g., Facebook, Google) and ongoing debates about privacy legislation such as European regulations and data-protection laws. While higher education is not the focus of the Topic, universities sometimes feature in discussions about collaborations with big technology companies or disputes involving the sharing of student data, including with social platforms or third-party technology vendors. Legislative frameworks such as the GDPR are frequently cited, as are policy debates about data processing and student consent. Topic 2 is the most education-centric topic. This topic focuses primarily on direct applications of AI in educational settings, covering everything from automated exam proctoring, personalized learning platforms, faculty training, and institutional policies. News articles in this category often highlight the impact of AI tools on student privacy, exam integrity, and the overall learning environment. It explores the integration of AI-driven platforms in the classroom, with a particular focus on data collection, online exams, and potential violations of student privacy or academic standards. Topic 3 focuses on AI-driven surveillance technologies, particularly facial recognition. While much of the discussion involves police use and surveillance of public spaces, universities also engage in discussions when campus surveillance cameras, dormitory access systems, or event security measures are combined with facial recognition tools. Concerns about data misuse and the ethics of monitoring student populations frequently appear in these articles. Topic 4 reflects on how global geopolitical tensions are affecting AI research in universities. It often touches on issues such as national security, research funding, and the potential for cross-border technology transfer. The main theme of these articles is the China–US focal point around advanced technologies, national security, and data governance. While not all articles explicitly mention higher education, universities often appear in the context of research collaborations, intellectual property disputes, or concerns about the security risks posed by technology companies.
4.1.3. Topics in Chinese Media
Similar NMF analysis of 347 Chinese articles also revealed four main topics.
Table 2 summarizes the topic modeling output by listing the most representative keywords for each topic, supporting interpretation and comparison. These topics highlight how Chinese media have linked AI privacy issues to relevant national policies, digital transformation, and global governance trends in a broad discussion.
Topic 1 focuses on AI industrialization and data security, involving university policy orientations in AI research, computing power construction, and cybersecurity. It emphasizes the role of artificial intelligence in the broader digital economy and industrial transformation. News reports often focus on cooperation between universities and technology companies, focusing on research capabilities and cybersecurity initiatives, but also acknowledging gaps in privacy protection. Although “privacy” does not appear prominently in the keywords, university research cooperation and industry are often mentioned, especially in news reports on campus cybersecurity and the integration of industry, academia, and research. Topic 2 is most closely related to the field of higher education. It focuses on the application and privacy challenges of AI in the field of education, covering the application of AI in teaching, learning analysis, student evaluation, such as student data collection, privacy risks of intelligent learning systems, etc. At the same time, it also raises questions about student privacy protection, accompanied by concerns about the abuse of personal data and the tracking of learning behavior. Topic 3 emphasizes the institutional and value issues behind AI applications, such as laws and regulations, social ethics, human–machine relations, and new risks brought about by generative AI. It reflects the Chinese media’s attention to international AI regulatory trends and China’s interest in promoting or shaping global governance structures. Overseas cases such as the United States often appear in such reports, showing the Chinese media’s continued attention to international regulation and global governance. Topic 4 involves enterprise practices of AI models, technological development, and AI research in universities. The focus is on the application of artificial intelligence in various industries and the role of universities in large-scale model development. Although privacy is not the main concern here, it comes up when discussing whether model training and actual deployment scenarios comply with data regulations. Compared with Topic 1, this topic is more inclined to technical details and enterprise applications, rather than macro-digital development or cybersecurity.
4.1.4. Comparative Insights Across Western and Chinese Topics
Although AI-related news covers a variety of areas, from regulatory debates to industrial applications to ethical governance, it is particularly illuminating to compare Western and Chinese media coverage of these issues in the context of higher education. Topic analysis of the data shows that both sides are concerned about student privacy and the integration of AI education tools. However, in terms of specific reporting content and discussion focus, Western and Chinese media show different narrative frameworks. Categories were manually constructed by clustering semantically related NMF topics into broader comparative themes. This is shown in
Table 3.
In Western media, coverage frequently centers on AI regulation and student rights, assessing whether systems violate GDPR or infringe on privacy in contexts such as AI-powered proctoring. There is also intensified scrutiny on technology companies and an emphasis on the legal responsibilities of universities to protect data. By contrast, Chinese media places greater weight on policy regulation, AI-driven educational strategies, and government-led initiatives for integrating AI into higher education. AI development is framed as part of the country’s broader digital economy strategy, which involves both opportunities and challenges, while calling for stricter privacy protection guidelines.
Despite these distinctions, student privacy emerges as a pivotal concern across both Western and Chinese media reports. Discussions address how universities manage data via proctoring tools, personalized learning platforms, or campus surveillance, highlighting significant ethical and practical risks. However, the regulatory narrative diverges: Western outlets typically reference GDPR, FERPA, and the supervision of tech companies, whereas Chinese sources tend to situate AI privacy within the nation’s overarching policy goals, focusing on governance frameworks and long-term developmental directions. Overall, while both sides recognize the privacy risks posed by AI in higher education, Western media often stresses legal accountability and individual rights, whereas Chinese media underscores balancing privacy protection with digital innovation and government-led governance.
4.2. Time Trend Analysis of AI Privacy
4.2.1. Time Trend Analysis of AI Privacy in English Media
Figure 1 shows the trend of Western media coverage of AI privacy from 2019 to 2024. Overall, the attention paid to AI privacy issues has not increased linearly, but has been highly dependent on specific events, showing an intermittent fluctuation pattern. In the early phase (2019–2021), media coverage of university AI privacy remained scattered, without forming a sustained upward trajectory. Although AI proctoring (Topic 2) and facial recognition monitoring (Topic 3) experienced brief surges during 2020–2021 due to the COVID-19 pandemic’s rapid shift toward remote learning, discussions were largely event-driven. Articles typically highlighted ethical concerns surrounding biometric data collection, online examination integrity, and the initial implications of GDPR. Technology company responsibilities (Topic 1) also attracted some attention amid these debates, but the focus remained on responding to specific incidents rather than establishing a comprehensive, ongoing discourse.
Following this, reporting expanded considerably in 2022, yet continued to be driven by notable events and controversies. Topics 1 and 2 emerged as dominant themes, reflecting increasing scrutiny of university–industry data sharing practices under GDPR. During this period, high-profile student protests against AI proctoring prompted repeated spikes in coverage, centering on concerns over privacy violations and technological transparency. Simultaneously, greater emphasis was placed on how large technology firms adhered to GDPR mandates, as well as on universities’ attempts to reconcile innovation with data compliance in a rapidly changing regulatory environment.
In the peak period (2023–2024), Western media reports reached new heights, with Topics 1 and 2 commanding the greatest share of attention. Articles focused heavily on the transparency of university data governance, particularly in relation to facial recognition, personalized learning tools, and online data management under stronger GDPR enforcement. Topic 3 surged in relevance whenever legislative measures arose to restrict or ban facial recognition on campuses, prompting in-depth debates on the implications for student rights, privacy protection, and university security. Although coverage of Topic 3 was overall less frequent than Topics 1 and 2, it nonetheless drew considerable interest during periods of legislative or institutional policy change.
In contrast, Topic 4 remained comparatively peripheral in 2023–2024, with only sporadic peaks tied to restrictions on U.S.–China academic cooperation or national security legislation affecting AI research funding. While these episodes occasionally generated media attention, the broader discourse continued to focus on AI monitoring, student privacy, and technology company compliance. Overall,
Figure 1 underscores that Western media reporting on AI privacy within universities has remained event-driven rather than reflecting a stable, policy-focused conversation. As GDPR enforcement intensifies, AI regulatory policies evolve, and technology companies adapt their data governance practices, it is likely that Western media coverage will continue to revolve around emergent controversies, placing legal obligations, corporate responsibilities, and the ethical challenges of AI implementation at the forefront.
4.2.2. Time Trend Analysis of AI Privacy in Chinese Media
An analysis of Chinese media coverage of AI privacy from 2019 to 2024 shows that the focus of attention on this topic has undergone significant changes, reflecting broader contextual factors such as legislation, technological development, and digital reforms in higher education. As shown in
Figure 2, before 2022, AI privacy was rarely mentioned in Chinese media and mostly appeared in the context of broad discussions of digital transformation or industrial policy. However, from the end of 2023 to 2024, the number of reports on AI privacy increased significantly, showing a more intensive discussion. This change was largely driven by the evolving regulatory environment, especially the policy implementation after the promulgation of the Personal Information Protection Law (PIPL) in 2021, and the further strengthening of generative AI governance and data security regulations in 2024. As the application of AI technology in higher education institutions continues to expand, the privacy issues of AI in education have gradually become the focus of policy and public opinion.
In this evolving media narrative, Topic 1 will see a significant increase in attention in 2024. This trend reflects the Chinese government’s efforts in recent years to strengthen cybersecurity and data governance and promote cooperation between universities and technology companies in computing infrastructure, data security, and AI industry applications. Although these initiatives have promoted technological innovation, they have also sparked discussions about the balance between university data security and national security. In particular, as universities become key hubs for the development and deployment of AI technology, how to ensure the security of sensitive data and strengthen data supervision have become important topics of media concern. In 2024, the coverage of this topic reached a peak, in line with the government’s trend of accelerating legislation and policy implementation in the fields of AI industry and digital security.
At the same time, Topic 3 began to increase significantly in early 2023, indicating that the media’s attention to AI ethical challenges and international regulatory trends has increased.
Figure 2 shows that the coverage of this topic is still low in 2022, but it grows rapidly in 2023, indicating that the improvement of the AI regulatory framework is driving the media to discuss issues such as AI ethics, data governance, and international academic cooperation more frequently. In the context of AI ethics and governance issues becoming a global policy focus, Chinese media have begun to pay more attention to the role of universities in AI research and international cooperation, and how to find a balance between technological innovation, ethical responsibility, and data security. In early 2024, as discussions on AI regulation intensified worldwide, the topic’s discussion in Chinese media also increased.
Topic 4 is also one of the fastest growing topics in 2024. This topic mainly involves China’s technological innovation in generative AI, intelligent teaching systems, and adaptive learning platforms. As can be seen from
Figure 2, the coverage of this topic is still relatively scattered in 2022, but it has increased since 2023. The media has widely reported on the application of AI in education and emphasized the privacy challenges it brings. For example, how to ensure the transparency and data security of AI technology in personalized learning has become a focus of media discussion. In addition, many reports pointed out the ethical and legal risks of using student and teacher data for AI training, prompting the media to call on universities to formulate clearer data protection guidelines when deploying AI applications.
In contrast, Topic 2 has maintained a low level of attention throughout the time frame and has not experienced significant growth like other topics. As can be seen from
Figure 2, the amount of coverage of this topic has always been secondary between 2019 and 2024. Even if the media’s overall attention to AI privacy increases in 2024, the increase in this topic is still limited.
4.2.3. Comparative Perspectives: Western vs. Chinese Media
Figure 1 and
Figure 2 show the trends in Western and Chinese media coverage of AI privacy from 2019 to 2024. Although both media have shown an increase in their attention to AI, privacy, and higher education, there are significant differences in reporting patterns, drivers, and core concerns. These differences not only reflect the differences in regulatory frameworks and social values, but also reveal differences in institutional priorities.
In 2019–2021, both types of media reported less on AI privacy, but after 2022, there was a clear increase. However, there are fundamental differences in the drivers and reporting patterns in this growth. Western media coverage shows intermittent fluctuations, and the increase in attention is often driven by specific controversial events or policy interventions. For example, events such as the expansion of GDPR enforcement, student protests against biometric exams, and the review of China–US academic cooperation have all triggered short-term media booms. However, these discussions usually revolve around individual lawsuits, political debates, or technical disputes, lack systematic agenda setting, and media coverage rises and falls with sudden events rather than by long-term accumulation.
In contrast, Chinese media coverage of AI privacy has shown a policy-driven systematic growth since the end of 2022, especially after the implementation of the PIPL and the improvement of generative AI regulatory policies in 2024, and the intensity of discussion has increased significantly. Chinese media pay more attention to the role of AI in the country’s digital transformation and regard universities as key hubs for data governance and AI technology applications. Topic 1 dominates Chinese media coverage, focusing on cooperation between universities and technology companies, data security supervision, and AI industry development, while paying less attention to student privacy and AI proctoring. This trend shows that China’s AI privacy discussion is highly consistent with national strategy and revolves around the government policy framework.
The reporting pattern of Western media is mainly driven by emergencies and public opinion. Western media pay more attention to individual rights, institutional responsibilities and legal accountability, and policy changes are usually a response to public opinion rather than a dominant force in agenda setting. In addition, geopolitics is also a key driver, especially on issues such as China–US academic cooperation, data security, and technology company governance, where media attention often involves the game between national security and academic freedom. In contrast, the reporting pattern of Chinese media is dominated by national policies and regulatory frameworks, and its attention has increased significantly in 2024. However, this growth is not driven by individual events, but revolves around long-term development issues such as national policies, industrial layout, and digital reform of universities. For example, after the implementation of PIPL, reports on data governance and privacy protection have increased significantly, while the improvement of AI regulatory policies in 2024 has further promoted discussions on AI ethics, cybersecurity, and industry implementation. Compared with Western media, Chinese media tend to regard AI privacy as part of national technological development, economic growth, and global competitiveness, rather than simply focusing on individual rights or university governance.
On specific topics, Western media tend to focus on individual rights, legal responsibilities, and institutional autonomy. Topic 2 and Topic 3 occupy an important position in Western media reports, highlighting ethical issues such as privacy issues of AI proctoring, student data protection, and algorithm transparency. For example, topics such as student protests against AI proctoring, university data compliance, and the appropriateness of government regulatory measures have been widely reported in Western media. These discussions emphasize institutional accountability, legal constraints, and focus on how AI regulation affects higher education governance. In comparison, Chinese media discussions on AI privacy are more inclined to industrial policies and technological development. Topic 1 and Topic 4 have high coverage, while Topic 2 receives a lower level of attention. This trend shows that the core issues of Chinese media focus on how universities can help the development of the AI industry and how to strengthen data security supervision, while paying less attention to micro issues such as student privacy and AI proctoring disputes. In addition, Topic 3 became a hot topic in 2024, reflecting the strengthening of global AI ethical supervision and prompting Chinese media to explore the role of universities in AI ethical governance.
Despite the different reporting patterns, both types of media recognize the core role of universities in AI ethics and privacy governance. Whether in Western media or Chinese media, universities are seen as experimental fields for artificial intelligence technology, and their AI research, educational applications, and data governance practices will have a profound impact on future policy making and social norms. In both contexts, the media focused on issues such as facial recognition, algorithmic bias, and student data protection, indicating that the risks of AI in higher education have become a common concern worldwide. In addition, geopolitical factors also affect media reporting in both countries. Western media are more concerned about academic freedom, intellectual property rights, and national security, while Chinese media are more concerned about the impact of international AI regulatory trends on domestic policies and how to maintain technological advantages in global competition.
4.3. Results of the Sentiment Analysis
This section presents the results of a sentiment analysis of AI privacy discourse in Chinese and Western media from 2019 to 2024. The results show that people have very different sentiment patterns and topic concerns in different media environments, which reflects the diverse manifestations of AI privacy issues in different sociopolitical environments.
Figure 3 displays the sentiment distribution of AI privacy discussions in Chinese and Western media, respectively. The results indicate that Chinese media show overwhelming positive sentiment on AI privacy-related issues, with more than 80% of news with positive sentiment. Neutral and negative sentiments are relatively rare. The sentiment distribution of Western media is more diversified, with a much larger proportion of negative and neutral articles than in Chinese media. Although positive sentiment still dominates, negative articles also occupy a considerable proportion, indicating concerns about AI surveillance, data privacy risks, and regulation issues. Chinese media cover the development of AI in a very positive and government-supportive way, emphasizing technological innovation and industry growth. Western media are more critical in their coverage, often highlighting ethical issues, data protection challenges, and governance problems.
Next, we explore the sentiment distribution of AI privacy topics in Chinese and Western media from 2019 to 2024 to analyze how the media shapes the public discourse on AI privacy in different social environments.
Figure 4 shows the sentiment share of the four main topics. The results show that Chinese media as a whole presents a highly positive sentiment tendency, while Western media show a more critical and diverse sentiment distribution.
In Chinese media reports, AI privacy issues are mainly placed in the framework of technological progress and industrial development, showing an overall positive emotional pattern. In particular, the proportions of positive emotions in reports related to digital industry and cybersecurity (Topic 1) and governance and ethical supervision (Topic 3) are as high as 93% and 83%, respectively, indicating that AI is more often shaped as a core tool to promote national scientific and technological development, promote educational modernization, and enhance governance capabilities in Chinese media discourse. In the field of higher education, AI is widely used in intelligent teaching, educational governance optimization and campus safety management, while critical discussions around privacy risks are relatively rare. However, the topic of AI and student privacy in higher education (Topic 2) shows a certain degree of emotional differentiation, with negative emotions accounting for 26%, higher than other technology-related topics. This trend may reflect that AI proctoring, data governance of personalized learning platforms, and student privacy protection have begun to receive a certain degree of attention in Chinese media, but compared with the positive narrative of industrial policy and technological innovation, they have not yet become mainstream topics.
Western media reports on AI privacy issues showed a higher proportion of negative sentiment, especially when it came to technology company governance (Topic 1) and AI monitoring and facial recognition (Topic 3), with negative reports accounting for 69% and 61%, respectively. This trend reflects the Western media’s highly critical attitude towards the power of large technology companies in AI data governance, the effectiveness of government regulation, and the threat that AI technology may pose to privacy rights. In the field of higher education in particular, the use of technologies such as AI proctoring, student data tracking, and biometric recognition has become the focus of negative reports. For example, Topic 2 (AI applications in higher education) has a negative sentiment ratio of 35% in Western media, indicating that the media has shown a more critical stance on how the application of AI in education affects students’ privacy rights and data security. This reporting pattern not only highlights Western society’s concern about data rights and academic freedom, but also reflects concerns about the commercialization of educational technology, algorithmic bias, and the abuse of student data.
This difference in emotional patterns is mainly influenced by policy orientation, media agenda and higher education governance system. In China, artificial intelligence has become an important part of the national science and technology development strategy and education digital reform. The application of artificial intelligence technology in educational scenarios is more often described to improve teaching efficiency, promote educational equity and optimize school management. With the advancement of intelligent campus construction, AI proctoring, learning data analysis and intelligent decision support systems have been widely adopted, while discussions on privacy protection and ethical governance are usually incorporated into the framework of data security supervision and technology optimization in policy discourse, rather than as independent controversial issues. In addition, the management model of Chinese universities makes the application of artificial intelligence mainly led by the government, emphasizing the combination of education digital reform and national policies, which further shapes the media reporting model dominated by positive narratives.
In contrast, the higher education system in Western countries is more decentralized, and data governance relies on the autonomous decision-making of each university and is subject to multiple laws and regulations. Therefore, Western universities are more controversial on the issue of AI privacy, especially on issues such as student monitoring, biometric data storage, and the fairness of personalized learning algorithms. The public’s concerns about technology abuse and rights infringement are more prominent. In addition, due to the deep cooperation between Western universities and technology companies in AI research and data sharing, the media has raised more questions about how academic institutions can maintain data transparency and protect student privacy. This discussion mode not only leads to a high proportion of negative emotions, but also further promotes the legal and ethical game between higher education institutions, government regulators, and civil society.
Figure 5 shows the trend of the average sentiment score, with significant differences in the sentiment expressions of Chinese media (blue line) and Western media (orange line). The results show that Chinese media as a whole show a consistently high level of positive sentiment, while Western media show a more stable and negative sentiment tone.
In the Chinese media, the sentiment score generally remained in a relatively high positive range, and since 2020, sentiment fluctuations have increased significantly. In particular, during 2021 and 2022, the sentiment score showed multiple significant peaks, indicating that the frequency of reporting on AI privacy issues increased during this period, and the overall reporting content tended to emphasize the positive impact of technological development. This trend is highly consistent with a series of AI and data governance-related policies issued by the Chinese government during this period. For example, the promulgation and implementation of the Personal Information Protection Law (PIPL) in 2021 has made the media discourse on AI privacy protection more systematic, but the focus of discussion is still on how to balance data security and industrial innovation through technological governance, rather than simply criticizing privacy risks. In addition, in 2022, the Chinese government further strengthened its support for AI infrastructure, digital economic development, and smart education applications, and the positive narratives of AI-enabled education governance, smart learning platforms, and automated management systems in media reports increased significantly, driving the rise in sentiment scores.
The sentiment trend in Western media shows a more stable but overall low sentiment score, with a much smaller sentiment fluctuation than that in Chinese media, and most of the time it remains close to zero or negative. This trend shows that Western media are more critical in discussing AI privacy issues and more cautious about the application of technology. In particular, the sentiment score of Western media has dropped significantly during the 2020–2021 period, a phenomenon that may be related to the large-scale application of AI technology in education during the COVID-19 epidemic. For example, the popularity of AI proctoring systems, the widespread use of facial recognition technology, and algorithmic decision-making in personalized learning platforms have made data privacy and surveillance controversies the focus of media attention. Due to the long-term high sensitivity of Western society to data protection, academic freedom, and technological ethics, the media has a more critical attitude towards the use of AI in higher education, emphasizing student privacy rights, algorithmic transparency, and the potential ethical risks of university cooperation with technology companies, which has led to a decline in the overall sentiment score.
Entering 2023–2024, the sentiment trends in the two types of media continued the existing pattern. Although the sentiment scores of Chinese media fluctuated during this period, they remained at a high level overall and positive reports still dominate. The sentiment trend in Western media showed a relatively stable pattern with a high proportion of negative emotions, indicating that this issue is still mainly regarded as a regulatory and ethical challenge in Western society, rather than a simple technological opportunity. In early 2024, the sentiment scores of Western media fell to the negative range at multiple time points, which may be closely related to the strengthening of AI regulatory policies in Europe and the United States, the strengthening of scrutiny of large technology companies, and the adjustment of university data governance policies. For example, Europe and the United States introduced a series of data protection bills on AI applications between 2023 and 2024, which put forward stricter regulatory requirements for data sharing, algorithm transparency and the use of monitoring technology between universities and technology companies, which further intensified the media’s critical discussion on AI privacy issues.
This trend has important implications for AI policy and data governance in higher education. Judging from discussions in the Chinese media, the application of AI in higher education is still centered on the country’s digital reform, emphasizing that technology can empower educational innovation, improve teaching quality, and optimize management efficiency. Therefore, when reporting on AI proctoring, student data analysis, and personalized learning platforms, the media usually describe them as positive manifestations of technological progress rather than potential threats or privacy risks. In contrast, Western universities face stricter data privacy laws and ethical constraints, which require the application of AI in education to be adjusted within the regulatory framework. When reporting on technologies such as AI proctoring, automatic grading systems, and student behavior data analysis, the media generally emphasizes data ethics, privacy rights, and academic autonomy and regards them as technical applications that require stricter supervision and transparency review.