1. Introduction
Transportation has long contributed to fossil energy consumption and greenhouse gas emissions, and it has become one of the fastest-growing sources of greenhouse gas emissions in China in recent years [
1]. Globally, the transportation sector faces significant carbon emission challenges. In regions such as the United States and Europe, this sector is a major contributor to CO
2 and overall greenhouse gas emissions. In the United States, transportation accounts for a substantial share of carbon emissions, and its reliance on petroleum makes reducing emissions from this sector a critical challenge in addressing climate change. Similarly, Europe faces pressure to reduce emissions but has distinct needs and strategies for decarbonizing transportation due to differences in geography and economic structure [
2,
3]. Against the backdrop of promoting global energy conservation and emission reduction, the popularization of electric vehicles (EVs) integrates elements such as green energy consumption, intelligent travel, and innovative mobile internet platforms, while catalyzing industrial convergence through cutting-edge technologies, advanced materials, and renewable energy. Among EVs, fuel cell vehicles (FCVs) have emerged as a promising alternative, offering advantages such as zero emissions, long driving ranges, and rapid refueling times compared to traditional battery electric vehicles [
4,
5]. Thus, EVs have become a pivotal instrument for carbon decoupling efforts worldwide [
6]. A transition from combustion to EVs not only has the potential to significantly reduce climate change impacts but has also been recognized as an inevitable choice for realizing structural upgrading and sustainable development of China’s auto industry.
Therefore, the Chinese government has introduced various guiding policies to promote the development of EVs. Since 2010, the Chinese government has launched a series of policies aimed at accelerating EV adoption, particularly emphasizing preferential road access rights and financial subsidies [
7,
8]. Internationally, many countries have implemented policies to promote electric vehicles (EVs). The United States has enforced strict emission standards and offered tax incentives to encourage EV adoption, with some states setting clear sales targets. European countries have supported EVs through strict CO
2 regulations, consumer subsidies, and investments in charging infrastructure [
3]. Among these measures, tax exemptions on vehicle purchases for EVs have attracted increased academic attention, with empirical studies widely verifying their positive impact on consumer purchases [
9,
10]. Consequently, China has achieved remarkable growth in EV production, sales, and stock. According to official data released by the MIIT of China, EV production and sales volumes reached 12.89 million and 12.87 million, respectively, in 2024, with EV sales accounting for 40.9% of total automotive sales and maintaining China’s position as the global EV leader for ten consecutive years. Up to now, China’s EV industry has achieved ‘one breakthrough and three upgrades’, being the first to break through the 10 million EV sales milestone globally. Besides that, product performance, industrial system maturity, and user accessibility were all upgraded with measurable improvements.
To avoid low-quality, indiscriminate industry expansion and to transition EV development toward market-driven mechanisms, China has gradually phased out national and local subsidy policies since 2016. The immediate impact of subsidy reductions on EV market dynamics was evident, leading to intensified market competition, declining profit margins for battery manufacturers and automakers, and the accelerated elimination of lower-tier enterprises. As the development of China’s EV industry has transitioned from policy-led to market-oriented, it is necessary to understand consumer decision-making patterns regarding low-carbon consumption behaviors.
Recent academic attention has increasingly focused on low-carbon consumption behaviors in China, and governmental agencies have also carried out extensive communication and education campaigns to promote sustainable consumption practices. Despite these efforts, empirical evidence demonstrates suboptimal effects on actual carbon-reduction behaviors [
11]. As sentiment is fundamental to human communication, affective experiences generated alongside cognition fundamentally affect subsequent behavior.
Existing research predominantly examines correlations between environmental awareness and low-carbon consumption behaviors by employing cross-sectional data at the micro-individual level. Gu and You applied the theory of planned behavior, using questionnaires to analyze consumers’ psychological attributions towards the purchase of new energy hybrid vehicles, emphasizing how individual-level factors affect purchase intention and behavior. Jesus et al. collected data from European electric vehicle drivers, exploring the factors influencing drivers’ satisfaction and continuance intention from the perspectives of task-technology fit and green self-identity [
10,
12].
In contrast, this study innovatively investigates whether environmental affect (EA) influences sales of EVs at the macro level, controlling for policy interventions and major market strategies, aiming to unravel the underlying unique mechanisms driving the world’s largest EV market.
The remainder of the paper is structured as follows. We first review the relevant literature on environmental affect, as well as its mechanisms, in the context of sustainable behavior. Following that, we describe our data collection and the methodological approach used to investigate the underlying research question. We then innovatively employ a CNN deep learning model to extract the topics and associated sentiments in Weibo text from 2014 to 2023. Using the monthly market share of EVs as the dependent variable, the three types of EA indices (positive, negative, and neutral) as independent variables, and adding government subsidy reductions, price, and other marketing strategy indices as control variables, we apply the time-series cointegration model to evaluate the long-term effects of environmental emotions. The results show that the direct association of environmental affect with EV sales is found for only one of the three sentiment variables: the Positive Environmental Affect Index. Followed by a general discussion of the key findings, the last part also includes theoretical contributions, managerial implications, and future research directions.
4. Sentiment Analysis of Public Environment Affect
We trained multiple models using a well-established public dataset along with an additional manually annotated dataset and selected the optimal model. This model enabled us to identify environmental affect from a total of 723,864 valid entries. Additionally, we filtered out 216,707 irrelevant entries to prevent any impact on subsequent time-series cointegration regression analysis. The detailed process information is provided next.
4.1. Data Preprocessing
The text preprocessing stage involved preparing the raw Weibo text data by cleaning irrelevant entries, removing duplicates, and conducting initial normalization. This step enhanced data quality, ensuring the accuracy and effectiveness of subsequent model training and application. The text preprocessing in this study was conducted through the following steps:
- (1)
Data Cleaning: The Python regular expression module (re) was used to initially clean the text by removing irrelevant emojis (e.g., “\u200b”), special characters (e.g., “@” and “#”), URL links, and redundant punctuation. This initial cleaning step effectively reduced noise within the dataset, enhancing the efficiency of subsequent data processing.
- (2)
Tokenization: The jieba segmentation tool was applied to segment the cleaned Weibo text. Known for its high accuracy and efficiency in Chinese text segmentation, jieba effectively handles the complexities of Chinese text.
- (3)
Stopword Removal: A stopword list was constructed based on the Modern Chinese Word List by the Institute of Computational Linguistics at Peking University. This list includes common function words such as prepositions, conjunctions, and articles. Removing these frequently appearing but sentiment-neutral words reduces noise during model training and improves the precision of subsequent analysis.
- (4)
Word Vector Conversion: Pre-trained Chinese Word Vectors were used to convert tokenized words into numerical vectors. Developed by the Natural Language Processing and Computational Social Science Lab at Tsinghua University, this model was trained on a large-scale Chinese corpus and captures semantic relationships between words effectively.
4.2. Corpus Construction
The quality of the corpus directly affects the performance of the model.
- (1)
Base Corpus Selection: We selected a publicly available and academically validated Weibo sentiment corpus as the foundation, primarily used for training the model to identify positive and negative environmental affect. The initial corpus used was the Chinese Weibo Sentiment Analysis dataset (FudanNLP Weibo Sentiment Corpus) released by the Natural Language Processing Lab at Fudan University.
- (2)
Supplementary Corpus Annotation: In response to the research needs, we specifically supplemented the corpus with neutral and irrelevant samples. A cross-validation annotation process was adopted, where each entry was independently annotated by two trained graduate students. In cases of inconsistent annotation results, a panel of three experts was consulted to ensure the consistency and reliability of the annotations.
- (3)
Corpus Integration: The corpus extracted from the established dataset was combined with the manual coding corpus to form the final training dataset. The integrated dataset includes four categories: positive, negative, neutral, and irrelevant.
- (4)
Data Balancing: To address the issue of class imbalance in the dataset and prevent the influence of label distribution disparities on model training, we applied data augmentation techniques, such as synonym replacement, back-translation, and random insertion, to balance the dataset’s label distribution.
4.3. Model Training and Application
During the model training phase, 70% of the corpus was used as the training set, while the remaining data were split into validation and test sets to ensure sufficient samples for training and adequate data for validation and final testing. The initial model was trained, and its performance was optimized by adjusting the model architecture and hyperparameters (e.g., learning rate, batch size, and dropout rate). To prevent overfitting, we also employed Early Stopping for the deep learning models. The results were as follows: the accuracy of Logistic Regression (LR) was 69.92%, Support Vector Machine (SVM) achieved 70.23%, Convolutional Neural Network (CNN) reached 75.31%, and Recurrent Neural Network (RNN) attained 72.07%. The model with the highest accuracy, CNN, was ultimately chosen for this study. As shown in
Figure 1, the CNN model was selected for this study not only because it achieved the highest accuracy but also due to its inherent advantages in processing short-text sentiment analysis. CNNs excel at capturing local patterns and hierarchical features through convolutional layers, which is particularly effective for identifying sentiment-bearing phrases or keywords in short texts (e.g., Weibo posts). Additionally, CNNs are less prone to the vanishing gradient problem compared to RNNs and can efficiently handle spatial invariance in text data through weight sharing and pooling operations. These characteristics make CNNs a robust choice for sentiment analysis tasks where contextual nuances and localized sentiment cues are critical.
In conclusion, this study overcame the challenges posed by the diversity of internet short-text contexts in traditional sentiment analysis by employing supervised deep learning neural networks. The model successfully identified the sentiment attributes of 723,864 Weibo posts, filtered out 216,707 irrelevant entries (e.g., “Before and after the beginning of summer, the climate changes significantly, with large temperature differences between morning and evening, which makes people prone to illness…”), and then calculated the proportion of each sentiment category’s posts relative to the number of active Weibo users in the corresponding month. This led to the construction of the environmental affect indices for this study.
To ensure the scientific rigor and robustness of the constructed indices, both the validity and reliability of the sentiment classification process were carefully considered. The model was trained using a publicly validated corpus—the Weibo Sentiment Corpus from Fudan University—and was further supplemented with manually annotated neutral and irrelevant texts to align with the specific research context. Dual annotations with expert arbitration were adopted to enhance labeling consistency. After comparing multiple model performances, Convolutional Neural Network (CNN) was selected for its highest classification accuracy (75.31%). In addition, robustness tests using an alternative dependent variable produced consistent results, further confirming the internal consistency and empirical reliability of the environmental affect indices.
6. Discussion and Conclusions
This study aims to empirically examine whether environmental affect, triggered by environmental events discussed on social media over the past decade, has influenced purchase behavior in the world-leading EV market.
Contrary to the findings of Austmann and Vigne (2021) in Europe [
26], we found that environmental affect is positive and significant in predicting EV sales in China for only one of the three sentiment variables, namely, the Positive Environmental Affect Index. This provides evidence of changes in public awareness over the past decade. Public support for green and sustainable consumption plays an important role from a macroeconomic perspective, which reveals a distinct regional pattern. While previous short-term research reported relationships for both positive and negative emotions, the long-term positive environmental affect is proved to be more directly linked to sustainable behavior than the negative and neutral types, reconciling prior studies.
A second conclusion is that EV-specific factors were discovered to be significant when checking control variables, namely, subsidy, disparity, charger, patent, and price. China has gradually phased out national and local subsidy policies since 2016. Subsidy reduction was all negative and significant, confirming that the subsequent three stages of subsidy reduction hindered EV sales and were conducive to bringing EV development back to market-driven mechanisms. Four market factors (disparity, charger, patent, and price) were all significant and substantial, corresponding with the findings of prior studies about determinants of EVs, such as Austmann and Vigne (2021) [
26]. Our results further indicate that market forces promoted EV sales while government subsidies were being phased out. This also shows that the Chinese EV market itself was upgraded with measurable improvements during the transition from government-led to market-driven growth.
To accelerate this transition while sustaining growth, policymakers should strengthen positive environmental messaging through targeted social media campaigns that showcase EV benefits and real-world impact. The reallocation of subsidy savings toward critical infrastructure development, particularly the strategic expansion of fast-charging networks in urban hubs and along major transportation corridors, would address key adoption barriers. Simultaneously, establishing regional innovation hubs with tax incentives for technology commercialization could bridge the gap between patent development and market implementation. These measures would collectively reinforce the market-driven evolution of China’s EV sector while maintaining its global leadership position. The implementation of tiered electricity pricing for off-peak charging, coupled with public awareness initiatives demonstrating cost savings, would further incentivize consumer adoption. By aligning these policy interventions with the empirical findings on environmental affect and market forces, China can ensure a smooth transition from policy-led support to sustainable, market-driven growth in its EV industry.
The present study exhibits several limitations that warrant acknowledgment. First, certain constraints exist in data investigation. Although this research employs a decade-long longitudinal dataset, the measurement of environmental affect predominantly relies on social media text analysis, which may introduce sample bias. Specifically, Weibo users skew toward younger (aged 18–35), urban, and educated demographics compared to the general Chinese population, potentially underrepresenting older and rural consumers’ environmental attitudes. The generalizability of findings could be compromised by the representativeness of social media user demographics and the authenticity of posting behaviors. Furthermore, the environmental affect indices utilized in this study may not fully capture consumers’ latent emotional fluctuations, particularly manifesting limitations in measuring emotional intensity and temporal persistence.
Second, inadequacies emerge in variable control methodology. When analyzing the impact of environmental affect on purchasing behavior, this study potentially fails to comprehensively account for confounding effects from other influential factors. These include macroeconomic conditions, competitive brand strategies, and individual consumption patterns, among other variables. The presence of these unmeasured covariates may undermine the robustness of the research conclusions.
Building upon the identified limitations, future studies could address these gaps through methodological and theoretical advancements. First, researchers could enhance data robustness by integrating multi-source data collection approaches. Combining social media text analysis with survey-based measures, behavioral tracking (e.g., sensor data or purchase records), and qualitative interviews may mitigate sample bias and improve the granularity of environmental affect measurement. Advanced text mining techniques, such as sentiment-aware deep learning models, could further refine the assessment of emotional intensity and temporal dynamics. Longitudinal designs with higher-frequency sampling intervals or panel data tracking individual-level changes over time could also elucidate the persistence and evolution of environmental emotions.
Second, future work should prioritize rigorous variable control and causal inference frameworks. Employing quasi-experimental designs (e.g., natural experiments or instrumental variable approaches) could help disentangle the causal relationship between environmental affect and consumer behavior while accounting for confounding factors like macroeconomic shifts and competitive market dynamics. Multi-level modeling or machine learning methods that incorporate heterogeneous consumer characteristics (e.g., demographics and cultural values) and contextual variables (e.g., policy interventions and brand campaigns) would strengthen the generalizability and specificity of findings.
Critically, the current findings on the link between positive environmental affect and electric vehicle sales in China may not be generalized across diverse cultural and market contexts. To address this, cross-cultural comparative studies should be systematically integrated into future research. For example, parallel analyses in markets with distinct regulatory frameworks (e.g., the EU’s stringent emissions standards), cultural values (e.g., individualist vs. collectivist societies), or economic structures (e.g., mature vs. emerging EV markets) could reveal how environmental affect interacts with local factors to shape consumer behavior. Such comparisons would not only test the universality of the observed effects but also identify context-specific mechanisms—offering actionable insights for policymakers and industry stakeholders aiming to promote sustainable consumption globally.