1. Introduction
The modern world faces environmental problems and anthropogenic climate change, forcing global institutions and local governments to develop climate change adaptation and mitigation policies [
1,
2,
3]. In the near future, these current policies may transform the actions of people in a local territory to meet their needs [
4,
5], and a set of these actions can be defined as social practices [
6]. However, it is still unclear whether social practices are becoming more environmentally friendly, and how they could be changed to overcome the environmental crisis.
Social practices that aim to harmonize the relationship between people and the natural environment by reducing resource use, waste generation, pollution, and emissions are defined as green practices [
7]. Embracing the idea of the distinction of the repertoires of environmental changes regarding consumption [
8], green practices can be divided into adaptive and transformative in terms of their impact on consumption. Adaptive practices are a societal response to a deteriorating environmental state, but they do not imply a reduction in consumption. Transformative practices are designed to reduce the production of goods and services and society’s consumption of materials and energy. Focusing on green waste practices, adaptive practices are associated with the treatment of generated waste in contrast to transformative practices that aim to prevent waste.
Figure 1 shows the division of green waste practices into transformative and adaptive practices. Transformative practices include exchanging, refusing purchases, sharing, repairing, and participating in actions to promote responsible consumption. Adaptive practices include waste sorting, studying the product labeling, waste recycling, and signing petitions. Researchers study green practices to offer citizens, eco-activists, and government agencies new ways to scale these practices [
9,
10].
At present, there is insufficient knowledge regarding the existing green waste practices in Russian society. Only a few studies focus on some green practices, such as waste sorting, and describe them by eliciting data from questionnaires and interviews [
11,
12]. Meanwhile, the content of online environmental community posts on social media is a valuable source of information about the characteristics and activities of online communities [
13,
14]. Social networks have a significant impact on changing social practices [
15,
16]. Internet communities perform the functions of recruitment, attribution, aggregation of interests, and mobilization of resources [
17,
18,
19]. In addition, virtual communities promote participation in real activities, which can be both constructive and protesting in nature [
20,
21,
22]. For further research, it is necessary to collect and systematize a large amount of social information about the green waste practices prevalent in Russian society, and the important source of such information is social networks using different methods [
14,
23,
24,
25]. Since the manual identification of information in posts published in online communities is a time-consuming and long-term process, it requires the use of the automated analysis of such posts. However, to date, there are very few studies on environmental practices in the context of their risks and opportunities based on big data analytics, for example, deep learning or content analysis methods [
13,
26].
This paper presents the GreenRu dataset for detecting the mentions of green practices in social media posts. The dataset consists of Russian social media posts from the VKontakte network, the most popular social network in Russia [
27]. The dataset contains the mentions of nine green practices (
Figure 1), which were manually annotated. GreenRu is marked up at the sentence level; in other words, each sentence is labeled in terms of the practices it contains. Several machine learning models in the form of both multi-label and binary classifications of sentences are applied to the present dataset. Since some green practices are currently more common than others [
28], the classes, that is, the mentions of different types of green practices, are unevenly distributed in the dataset. It is expected to cause lower quality regarding the machine learning models for rarely represented classes. This paper compares several methods of data augmentation for the minority classes. It was demonstrated that augmentation can significantly improve the quality of the detection of the mention of a few common green practices. The main contributions of this work can be summarized as follows:
GreenRu, the first dataset for detecting the mentions of green practices in Russian social media posts, is described. The paper presents our annotation scheme for green practice mentions that can be easily adapted for application in other languages and domains.
The multi-label and one-versus-rest current state-of-the-art models for text classification are evaluated while performing the task of detecting the mentions of green practices.
The performance of several data augmentation methods is estimated to handle class imbalance since the distribution of the mentions of green practices is unequal. The results are provided both in terms of classification metrics and human evaluation. The results can be used in other tasks related to imbalanced multi-class text classification.
The paper is organized as follows.
Section 2 describes the processes of data collection and annotation. The section includes the subsections presenting the types of green practices, the details of the collecting posts, and the annotation guidelines.
Section 3 describes the experimental setup. It contains the dataset statistics, presents the models and the techniques for handling class imbalance that were used, as well as the metrics utilized for this study.
Section 4 provides the results of our experiments for the dataset and discusses the results and limitations of the study.
Section 5 concludes this paper.
4. Results and Discussion
Table 4 and
Table 5 demonstrate the results of the evaluation of BERT and RoBERTa. The sign
+ w indicates class weighting. The best result in each column is highlighted. If the average
F1-score obtained for the augmented dataset outperforms the same metric for the original dataset, the value is marked with ↑.
As can be seen from
Table 4 and
Table 5, the one-versus-rest models fine-tuned on the original dataset outperform the multi-label models in terms of all the averaged metrics. For RoBERTa, the result of the model with class weighting slightly surpassed the model without class weighting (81.65% vs. 81.16%). For BERT, the opposite situation is observed (78.89% vs. 79.62%). In most cases, the scores obtained for the augmented dataset are higher than the scores for the original data. However, in our experiments, this effect is more pronounced for BERT. All the augmentation methods improve the results of BERT in terms of
and
. The highest scores were achieved using backtranslation (84.74%) and ChatGPT (84.9%) for
and
, respectively. In the case of RoBERTa, the highest average scores for the original data were demonstrated by the one-versus-rest model with class weighting (81.00% and 85.55% for
and
, respectively). In terms of
, the value obtained for the original dataset was improved by all the methods except RuGPT3. The highest result was shown by ChatGPT (86.52%). In terms of
, the best result on the original dataset was outperformed using simple duplication (86.00%), RuT5 (86.07%), and ChatGPT (86.02% and 85.98% without and with class weighting, respectively). The highest result was achieved with RuT5 using a decoding masked sentences procedure (86.07%). In our experiments, none of the data augmentation methods were the absolute best across all the metrics. Such a case is common for machine learning studies, for example, in works [
49,
50,
51].
Table 6 contains the human evaluation results. The ratings demonstrate the ability of the augmentation methods to preserve the class labels. They show how many of the texts generated by the model for green practices really contain the mention of such practices. For the evaluation, 100 generated texts containing various practices were randomly chosen for each model.
The highest result was achieved by ChatGPT (94 out of 100), followed by RuT5 (89), backtranslation (82), and RuGPT3 (59). The result of RuGPT3 was substantially lower than the scores of the other methods. Probably, this is because the texts were truncated to the first five tokens for augmentation using RuGPT3. The goal of the fine-tuned model was to continue the truncated sequence of tokens. However, the truncation of the texts did not allow the model to correctly detect the types of green practices mentioned in them. In our experiments, RuGPT3 was the worst at generating texts corresponding to a certain green practice among all the considered data augmentation methods.
Three types of model errors were identified during the human evaluation.
Different green waste practices. The model generates a text that contains the mention of another practice instead of the required green practice. For example,
Or bring yours to exchange (“Или принoсите свoи на oбмен”) → Or bring yours for recycling (“Или принoсите свoи на перерабoтку”).
Absence of green waste practices. The model generates a text with no mentions of green practices. For example,
You can post even the most insignificant, only at first glance, actions that will immediately affect the climate footprint: you went to the store with a rag bag, and not with a plastic bag; poured coffee in a thermos cup, and not in a plastic cup; cleared up your mess and collected books for disposal (“Мoжете выкладывать даже самые незначительные, лишь на первый взгляд, действия, кoтoрые сразу же пoвлияют на климатический след: пoшел в магазин с тряпичнoй сумкoй, а не с пакетoм; налил кoфе в термoкружку, а не в пластикoвый стаканчик; расхламился и сoбрал книги на утилизацию”) → You can post even the most insignificant, in your opinion, changes in the promotion program - please do it as soon as possible (“Мoжете выкладывать даже самые незначительные, на ваш взгляд, изменения в прoграмме акции – пoжалуйста, делайте этo как мoжнo раньше”).
Negation of green waste practices. The model generates a text that contains a negation of the required green practice. For example,
The volunteer association supported by [club46977103|Paketa net] organizes an event for parents, where it is possible to lend or give away something and/or to get a new toy without buying. (“Дoбрoвoльческoе oбъединение Кругoвoрoт при пoддержке [club46977103|Пакета нет] oрганизует мерoприятие для рoдителей, где вoзмoжнo oтдать вещь на время или навсегда и/или пoлучить, не пoкупая, нoвую игрушку”) → The volunteer association does not organize an event for parents, where you can give something and/or get for a child without buying a new (“Ассoциация дoбрoвoльцев не oрганизует сoбытие для рoдителей, где мoжнo дать чтo-тo и/или пoлучить для ребенка, не пoкупая нoвую”).
Most of the RuGPT3 and RuT5 errors were associated with the generation of texts related to other green practices. Most of the backtranslation and ChatGPT errors regarded the generation of texts that do not contain mentions of green practices. An error related to the negation of green practices was generated only once using backtranslation.
Even though the initial experiments show promise, there is still room for improvement. For the rarest practice (repairing, 9), in many cases, the results are very poor. Thus, models without data augmentation cannot cope with the detection of this practice due to the small number of texts. However, the use of augmented data in most cases enables better classification of this practice. Further experiments on data augmentation with different amounts of generated texts may improve the current results for minority practices. In addition, the models trained on the current version of GreenRu might be used to search for posts containing mentions of rare green practices. These posts can be checked by experts and subsequently utilized to train classification models. Another possible direction for further improvement is the use of the context of the sentence as additional information.
Some potential issues could limit the applicability of GreenRu. The dataset only contains the texts posted in the online green communities of Tyumen, Russia. The recent works on deep learning show that the generalization ability of the fine-tuned transformer-based models can be influenced by domain bias [
52,
53,
54]. Thus, the performance of the models for detecting the mentions of green practices may deteriorate for the posts of green communities of other regions and the posts of other domains. This issue is typical for most domain-specific text corpora; in particular, this limitation is discussed by [
55,
56,
57]. The currently used domain adaptation methods can be broadly classified into those employing deep architectures and rule-based techniques, such as instance-based and feature-based approaches to align the domain distributions [
52]. Applying these techniques for the detection of green practices would be a fruitful area for further work. Another potential limitation of this research is related to the set of practices reflected in this study. GreenRu contains mentions of green practices only in waste management as one of the key aspects of harmonizing the relationship between people and the natural environment. The further extension of the dataset may include other green practices, such as caring for stray animals, cleaning and landscaping land, planting greenery, and some others.
5. Conclusions
The texts posted in green communities on social media contain multiple mentions of green practices. The automated detection of green practices on social media facilitates the comprehensive analysis of their prevalence, efficacy, and scalability, thereby informing potential strategies for expansion. The paper describes the GreenRu dataset of social media posts annotated with mentions of green waste practices at the sentence level. The dataset contains mentions of nine green waste practices covering both the adaptive and transformative types. Our baseline experiments conducted for GreenRu demonstrate that fine-tuned transformer-based models can be applied for the detection of the mentions of green practices. Considering the task of detecting the mentions of green practices as a text classification task, the multi-label and one-versus-rest approaches were compared. Moreover, several ways to handle class imbalance using data augmentation methods were assessed, both in terms of classification metrics and human evaluation. The study shows that the use of data augmentation can significantly improve the performance of detecting the mentions of rare green practices.
The theoretical significance of the study lies in the fact that it is the first to develop an approach to the automated identification of nine green practices on Russian social media. Thus, researchers will be able to monitor the prevalence of these practices on Russian social media, and the results of this monitoring will be used for developing environmental policies. Furthermore, using this approach, other researchers may identify additional green practices, thereby contributing to a more comprehensive understanding of the current processes of Russian society’s greening.
The algorithms developed to identify green practices may also be employed to enhance the functionality of search engines. GreenRu has the potential to support the creation of machine learning models aimed at extracting the mentions of green practices from textual data. This makes it possible to analyze vast amounts of social media content, assess the current prevalence and effectiveness of different types of green practices, and identify potential pathways for scaling up these practices.