Value Assessment of UGC Short Videos through Element Mining and Data Analysis

: UGC short videos play a crucial role in sharing information and disseminating content in the era of new information technology. Accurately assessing the value of UGC short videos is highly signiﬁcant for the sustainable development of self-media platforms and the secure governance of cyberspace. This study proposes a method for assessing the value of UGC short videos from the perspective of element mining and data analysis. The method involves three steps. Firstly, the text clustering algorithm and topic mapping visualization technology are utilized to identify elements for assessing the value of UGC short videos and construct an assessment index system. Secondly, structured data indexes are quantiﬁed using platform data statistics, while unstructured data indexes are quantiﬁed using the LSTM ﬁne-grained sentiment analysis model. Lastly, the VIKOR model, incorporating an improved gray correlation coefﬁcient, is employed to effectively evaluate the value of UGC short videos. The empirical results indicate that the value of current domestic UGC short videos is primarily associated with three dimensions: the creators, the platforms, and the users. It encompasses 11 value elements, including fan popularity, economic returns of creation, and frequency of interaction. Additionally, we assess the value of short videos within the mainstream partitions of the Bilibili platform and generate a value radar chart. Our ﬁndings reveal that short videos in game partitions generate higher revenue for creators and platforms but may neglect users’ needs for knowledge, culture, and other content. Conversely, short videos in the knowledge, food, and music partitions demonstrate speciﬁc distinctions in fulﬁlling users’ requirements. Ultimately, we offer personalized recommendations for the future development of high-value UGC short videos within the mainstream partitions.


Introduction
During the era of Web 2.0, the field of digital media has experienced rapid growth, with various forms of digital media constantly evolving.One such form is UGC short videos, which serve as a prominent means for creating digital resources and engaging in information interaction [1].UGC short videos refer to short video content created and shared by users on digital platforms.Users typically create these videos to capture life moments, express creativity, share opinions, or showcase their talents [2].In recent years, UGC short videos have gained significant worldwide success and influence due to their ease of distribution and diverse creativity.This has resulted in the emergence of UGC short video sharing platforms such as YouTube, TikTok, Bilibili, etc.The 2023 China Network Audiovisual Development Research Report, released on 29 March 2023, reveals that as of December 2022, the number of Chinese short video users reached 1.012 billion.This reflects a year-on-year increase of 77 million users, corresponding to a growth rate of 8.3% [3].
However, the increasing number of UGC short videos poses significant challenges for the content management of video sharing platforms.On one hand, the platform witnesses the widespread dissemination of UGC short videos with negative effects, including defaming others' images, spreading false information, and violating privacy.Users may doubt the platform's content filtering and auditing mechanisms and question the authenticity of the video content, thus harming the platform's reputation and credibility [4].On the other hand, the platform has overly focused on the traffic generated by UGC short videos, disregarding their inherent value and hindering the development of other videos that offer more creative and valuable content, thereby restricting the diversity of UGC short videos on the platform.Consequently, this limitation hampers the diversity of UGC short videos on the platform, which undermines effective content strategy optimization and hinders improvements in the user experience [5].Therefore, it is crucial to assess the value of UGC short videos to enhance the operational capacity of the video sharing platform and foster its sustainable development in the future.For example, a user on the TikTok platform uploaded a short video demonstrating how to prepare a specialty dish at home in a vivid and engaging manner.The platform can accurately understand which types of content users are more interested in by evaluating various dimensions.For instance, they might be more interested in cooking skills conveyed through videos or in local Chinese cultural specialties that are popularized by science.This is crucial for platforms to optimize their content recommendation strategies.By adjusting their recommendation algorithms according to users' feedback on different types of food videos, platforms can provide content that better matches users' preferences, ultimately leading to increased user activity and retention.
This paper aims to address two questions and provide theories and methods for evaluating the value of user-generated content (UGC) short videos on video sharing platforms in mainland China.(1) What elements contribute to the value of UGC short videos in mainland China?(2) How can we scientifically establish a value assessment model for UGC short videos?
This paper proposes a method for assessing the value of UGC short videos.First, the value elements for assessing the value of UGC short videos in the professional field are identified, and an index system for UGC short video value assessment is constructed.Second, the LSTM fine-grained sentiment analysis model is employed to calculate the sentiment value for the unstructured data indexes in the assessment system, achieving index quantification.The quantification of structured data indicators is based on statistics from multi-source platforms, and all indicators are assigned values using a combination of methods.Finally, the value of UGC short videos is evaluated using an improved VIKOR method with gray correlation coefficients.A radar chart of value dimensions is then created to provide decision support for the platform's evaluation of UGC short videos.

Literature Review 2.1. Study on the Value Assessment of UGC Short Videos Using Technical Methods
Two main methods are commonly used to evaluate the value of UGC short videos: traditional technical methods and deep learning [6].The traditional technical approach involves utilizing computer vision and signal processing techniques to process and analyze UGC short videos, aiming to determine their value level.For instance, Sun et al. [7] developed a VQA model that effectively assesses the value of UGC videos.They achieved this by training an end-to-end spatial feature extraction network, enabling the model to directly learn value-aware spatial features from the original pixels of UGC videos.Tu et al. [8] enhanced the performance of the reference-free VQA video value assessment model (BVQA) by incorporating a feature selection strategy, which demonstrated favorable results.Nanne et al. [9] compared the performance of three mainstream computer vision technology models (YOLOV2, Google Cloud Vision, and Clarifai) for evaluating UGC value.Their findings revealed that the Google Cloud Vision model outperformed the others, especially in assessing brand promotional UGCs.Deep learning methods, being the prevailing AI technology, enable automatic learning and extraction of high-dimensional and nonlinear data features through the construction of deep neural network models.This facilitates the efficient evaluation of UGC's value.For instance, Wang et al. [10] introduced a framework based on deep neural networks (DNNs) that considers features like technical value and video compression level.This framework aims to perceive the significance of UGC video value and ultimately assign a value score for assessment.Su et al. [11] developed a hyper-network-based NR-IQA model that can dynamically adjust video value prediction parameters.Their model exhibited superior performance in differentiating UGC values.

Study on Value Assessment of UGC Short Videos through the Construction of an Index System
The value assessment method based on constructing an index system refers to the process of developing a UGC value assessment index system.This method involves enhancing the existing index system through qualitative or quantitative methods and subsequently utilizing the index system for assessing the value of UGC short videos.It serves as a crucial foundation for evaluating the value of UGC short videos.For instance, Meng et al. [12] developed a comprehensive six-dimensional UGC short video value assessment index system that encompasses attributes of publishers, viewer behavior, video content, and more.They conducted a questionnaire survey and identified video quality, title, and viewer interest as the primary factors influencing the value of UGC short videos.Chen et al. [13] applied rooting theory to identify the fundamental value elements of rural landscape short videos.Additionally, Yu et al. [14] employed data analysis and gathered short video value factors using a randomly administered online questionnaire.Their findings revealed that enhancing user stickiness and increasing user scale can substantially boost the value of short UGC videos.Mohammad et al. [15] developed an online brandbased UGC value assessment system and employed partial least squares-structural equation modeling (PLS-SEM) to analyze empirical data.They revealed that content and technology are the primary elements that influence the value of UGC short videos.Zhang et al. [16] analyzed the value elements of short video communication using the all-information emotion theory and the super IP theory.They employed the fuzzy set theory-DEMATEL model to identify the key influencing factors on the secondary indicators and identified 11 value elements, which include public expression and popularity demand, among others.

Study on the Value Assessment of UGC Short Videos Using Data Modeling
The value assessment method based on data modeling involves automating the assessment of UGC short videos by constructing data models and algorithms using usergenerated content data.For example, Cai [17] proposed a value assessment model for UGC short videos based on random forest and BP neural networks.Through experiments, it was found that the constructed model significantly improved the accuracy of UGC short video value assessment compared to a single model.Manikandan et al. [18] compared multiple models, including linear regression, random forest, and XGBoost, for assessing the value and performance of UGC short videos.They discovered that the random forest model outperformed other machine learning models.Leszczuk et al. [19] used the random forest model to train and test using professional content data from UGC short videos, along with video technology data.They achieved better performance in evaluating the value of IP-based UGC videos.Rui et al. [20] evaluated the value of UGC short videos using online gradient descent and least squares methods, respectively.They compared the methods and found that the least squares method yielded more accurate value outputs.Gupta et al. [21] compiled a comprehensive list of machine learning and artificial intelligence techniques used in assessing the value of UGC short videos.They discovered that these techniques aid creators in assessing the authenticity of their content and enhance the production of high-quality short video resources.Mekouar et al. [22] employed a simple machine learning model to assess the distribution value of UGC short videos based on short video parameters and a proposed popularity function.They conducted an empirical study using YouTube short videos as an example and found that their proposed model exhibited good assessment performance.

Research Gap
In summary, recent years have seen some progress in research assessing the value of UGC short videos, but further improvements can still be made.
The research methods primarily focus on evaluating the value of UGC short videos in the technical fields of computer vision and signal processing.These methods excessively emphasize the technical elements of UGC short videos while neglecting the content attributes and value attributes of UGC short videos as a form of user-generated content information product.Additionally, value assessment methods based on constructing index systems have started to investigate the content and emotional characteristics of UGC short videos.However, the data modeling-based approach allows for the quantification of various attributes of UGC short videos.Moreover, the machine learning model demonstrates excellent performance in evaluation.This aspect has practical application.Nevertheless, the following problems persist: First, the evaluation index system is typically constructed using questionnaires and a literature review.However, it lacks multidimensional value element exploration, correlation analysis, and model representation of UGC short videos through the integration of multiple authoritative data sources.Second, the existing indexes quantify the factors in a singular manner and fail to effectively capture the impact of unstructured data indicators on the value of UGC short videos.This limitation makes it challenging to fully capture the emotional characteristics of users within the indexes.Third, the current evaluation models mostly rely on machine learning techniques, which do not adequately consider the correlation and weightage between different indicators.Consequently, it becomes difficult to comprehensively assess the variations in value among the research subjects.Finally, most of these models are black-box models, and interpreting the evaluation results is often challenging.Thus, additional statistical analysis and visualization methods are required to understand the underlying patterns.
Considering these issues, this paper aims to address the strong subjectivity of the index system, the insufficient quantification of key features, and the limited interpretation of evaluation results in the evaluation process by mining the value elements of UGC short videos.The research focuses on the following aspects: (1) Mining the value elements for assessing UGC short videos by utilizing text clustering algorithms, visualization technology for theme mapping, and incorporating professional literature and expert commentary texts.This will facilitate the construction of a comprehensive UGC short video value assessment index system; (2) employing a diverse processing approach for multi-dimensional index data: structured data indicators are statistically quantified from UGC short video operation platforms and third-party platforms, while unstructured data indicators, such as user comments, are quantified through LSTM fine-grained sentiment analysis; and (3) utilizing the AHP method and CRITIC method to combine normalized index data for weighting, which is then input into an improved VIKOR model with gray correlation coefficients.This enables the realization of value assessments for UGC short videos.

Methodology
This paper aims to assess the value of UGC short videos.It utilizes authoritative texts, including literature collections in the field and expert commentary articles, as the basis.Text clustering algorithms and visualization techniques are employed for topic mapping.Through this process, the UGC short video value elements are identified, and a comprehensive UGC short video value assessment index system is constructed.Subsequently, a UGC value assessment model is established to achieve effective value assessment of UGC short videos, as illustrated in Figure 1.
UGC value assessment model is established to achieve effective value assessment of UGC short videos, as illustrated in Figure 1.

Process 1: Constructing an Index System for Assessing the Value of UGC Short Videos Based on Element Mining
This study initially retrieves relevant literature on UGC short video value assessment from databases such as CNKI, Wanfang, and WOS.The collected literature includes fields such as title, abstract, keywords, and publication time.Additionally, expert review texts related to UGC short videos are collected as Supplementary Data.The collected data undergoes pre-processing, which involves text data normalization, merging, and the removal of duplicate and invalid samples.As a result of this process, a dataset for UGC short video value mining is obtained.
Text clustering algorithms and visualization techniques, such as topic mapping, are employed to mine UGC short video value elements and construct the UGC short video value evaluation index system.The specific process is illustrated in Figure 2.This study initially retrieves relevant literature on UGC short video value assessment from databases such as CNKI, Wanfang, and WOS.The collected literature includes fields such as title, abstract, keywords, and publication time.Additionally, expert review texts related to UGC short videos are collected as Supplementary Data.The collected data undergoes pre-processing, which involves text data normalization, merging, and the removal of duplicate and invalid samples.As a result of this process, a dataset for UGC short video value mining is obtained.
Text clustering algorithms and visualization techniques, such as topic mapping, are employed to mine UGC short video value elements and construct the UGC short video value evaluation index system.The specific process is illustrated in Figure 2.
After pre-processing the data, the text dataset undergoes entity and inter-entity relationship extraction.Specifically, this process includes: (a) The extraction of entities.This paper performs data cleaning on the text and employs the jieba word splitting tool to accurately segment the text data based on specific word splitting patterns, thereby eliminating redundant data.Additionally, the TF-IDF strategy in natural language processing is utilized to remove punctuation and stop words, minimizing interference with word segmentation and obtaining the subject words for each analyzed content; (b) The calculation of inter-entity relationships.The K-means method is utilized to cluster the subject words, and then the strength of entity association, inter-entity spacing, and inter-entity position are calculated.The inter-entity association relationship is visually presented by controlling the entity color; and (c) The construction of topic mapping.The topic mapping is constructed using VOSviewer to visualize the association relationship between elements.After pre-processing the data, the text dataset undergoes entity and inter-entity relationship extraction.Specifically, this process includes: (a) The extraction of entities.This paper performs data cleaning on the text and employs the jieba word splitting tool to accurately segment the text data based on specific word splitting patterns, thereby eliminating redundant data.Additionally, the TF-IDF strategy in natural language processing is utilized to remove punctuation and stop words, minimizing interference with word segmentation and obtaining the subject words for each analyzed content; (b) The calculation of inter-entity relationships.The K-means method is utilized to cluster the subject words, and then the strength of entity association, inter-entity spacing, and inter-entity position are calculated.The inter-entity association relationship is visually presented by controlling the entity color; and (c) The construction of topic mapping.The topic mapping is constructed using VOSviewer to visualize the association relationship between elements.
Subsequently, theme clustering and key element mining are conducted.This process includes: (a) The analysis of clusters.Utilizing the constructed topic mapping, cluster analysis and representative entity word extraction are performed to analyze the characteristics of entity words within each cluster category; and (b) The mining of key elements.The clusters that fulfill the value assessment criteria for UGC short videos are selected, and these selected clusters are renamed by incorporating the value characteristics of UGC short videos.
Lastly, utilizing classical theory or a priori knowledge, the mined value assessment elements are classified and organized hierarchically.The structure is then optimized and adjusted based on the similarity and difference of indicators, resulting in the construction of a UGC short video value assessment index system.

Process 2: Quantifying the Assessment Index for UGC Short Video Value
This paper focuses on quantifying indicators using heterogeneous data from multiple sources.The main sources include structured data from UGC short video operation platforms and third-party platform statistics, as well as unstructured data from user comments.The data processing steps are outlined as follows: Step 1: Processing of structured data indicators.To mitigate the impact of indicator data dimensions and enable comparability among different indicators, the indicator data Lastly, utilizing classical theory or a priori knowledge, the mined value assessment elements are classified and organized hierarchically.The structure is then optimized and adjusted based on the similarity and difference of indicators, resulting in the construction of a UGC short video value assessment index system.

Process 2: Quantifying the Assessment Index for UGC Short Video Value
This paper focuses on quantifying indicators using heterogeneous data from multiple sources.The main sources include structured data from UGC short video operation platforms and third-party platform statistics, as well as unstructured data from user comments.The data processing steps are outlined as follows: Step 1: Processing of structured data indicators.To mitigate the impact of indicator data dimensions and enable comparability among different indicators, the indicator data obtained from various platforms should undergo individual maximum-minimum normalization; Step 2: Processing of unstructured data indexes.In the social media environment, users express their emotions, opinions, and attitudes through the creation and publication of UGC short videos.These videos encapsulate the intense emotional catharsis and clash of ideas experienced by users [23].Thus, user attributes related to emotional tendencies become fundamental attributes of UGC short video value [24].The index data for these user attributes cannot be directly quantified.Typically, quantification is accomplished through text data analysis techniques such as word frequency statistics [25] and hierarchical analysis [26] based on feature words.However, these methods do not originate from the user's inherent attributes, thus making it less likely to explore the user's genuine emotional expressions and emotional tendencies.Consequently, accurate quantification of UGC short video value indexes that encompass user attributes cannot be achieved through these methods.
The investigation of user review text sentiment intensity has gained significant attention in the academic, business, and political spheres in recent years.Numerous studies have indicated the crucial role of calculating user review text sentiment intensity in optimizing domain-specific sentiment lexicons [27] and devising brand development strategies.Hence, this paper begins with user review texts and achieves precise quantification of metrics related to user attributes by computing sentiment values extracted from user reviews.
It has been observed that computer information technology performs effectively in processing textual information.However, traditional neural network methods can suffer from issues like gradient explosion and gradient disappearance with an increasing number of learning layers.On the other hand, LSTM models exhibit robustness and generalization capabilities [28,29], which can effectively address these challenges in textual information processing [30][31][32] of ideas experienced by users [23].Thus, user attributes related to emotional tendencies become fundamental attributes of UGC short video value [24].The index data for these user attributes cannot be directly quantified.Typically, quantification is accomplished through text data analysis techniques such as word frequency statistics [25] and hierarchical analysis [26] based on feature words.However, these methods do not originate from the user's inherent attributes, thus making it less likely to explore the user's genuine emotional expressions and emotional tendencies.Consequently, accurate quantification of UGC short video value indexes that encompass user attributes cannot be achieved through these methods.
The investigation of user review text sentiment intensity has gained significant attention in the academic, business, and political spheres in recent years.Numerous studies have indicated the crucial role of calculating user review text sentiment intensity in optimizing domain-specific sentiment lexicons [27] and devising brand development strategies.Hence, this paper begins with user review texts and achieves precise quantification of metrics related to user attributes by computing sentiment values extracted from user reviews.
It has been observed that computer information technology performs effectively in processing textual information.However, traditional neural network methods can suffer from issues like gradient explosion and gradient disappearance with an increasing number of learning layers.On the other hand, LSTM models exhibit robustness and generalization capabilities [28,29], which can effectively address these challenges in textual information processing [30][31][32]  (1) Data processing.Prior to extracting feature words and special evidence sentences from the comment text, it is necessary to perform data cleaning and de-weighting on the comment data.This step significantly improves the quality and quantity of the comment (1) Data processing.Prior to extracting feature words and special evidence sentences from the comment text, it is necessary to perform data cleaning and de-weighting on the comment data.This step significantly improves the quality and quantity of the comment text while preventing interference or reduction of accuracy in calculating the sentiment value of indicators due to irrelevant or duplicate comments; (2) Feature word extraction.Term Frequency (TF) represents the frequency of words, which is the number of times a word appears in the text.The processed result is the word frequency, TF.The importance of a word in the text increases with its frequency.The feature word frequency is obtained by utilizing the counter library to create a candidate library of feature words.The feature words are classified into the UGC short video value assessment index system keyword table through manual screening, with reference to the constructed UGC short video value assessment index system; (3) Feature sentence extraction.After performing word-by-word traversal using jieba splitting, the Stanford NPL is utilized for dependency sentence analysis.The result is then compared with the keyword table of the UGC short video value assessment index system.The matching feature words are considered the feature attributes of the comment text, thus extracting the comment as a special evidence sentence; (4) LSTM-based model training and testing.To reduce the manual sentiment labeling time, this study labels the extracted idiosyncratic sentences with sentiments (−1 for negative, 0 for neutral, and 1 for positive) using a sentiment dictionary.The Boson NLP sentiment lexicon, constructed based on microblogging, news, forums, and other data sources, serves as the base lexicon, supplemented by the KnowNet HowNet negative word and degree adverb lexicon, ultimately forming the domain sentiment lexicon.
For model training and testing, the Long Short-Term Memory (LSTM) neural network is used.First, the data is divided into a training set and a test set with a ratio of 4:1.Second, the tanh function is chosen as the activation function for the LSTM model, commonly used for transforming input continuous real values into output values between −1 and 1.The word vector dimension is set to 100, representing the semantic information of each word as a 100-dimensional vector.The data batch size is 32, ensuring computational efficiency and providing sufficient data for the model to learn.To prevent overfitting, the dropout technique is introduced, randomly disabling a portion of neurons during training.After cross-validation, a dropout value of 0.5 is selected as it generates the most diverse network structures; Step 3, the index data obtained from the normalization process in Steps 1 and 2 are combined together to form the UGC short video value index matrix; Step 4, the combination of indicators is determined using the AHP method and the CRITIC method.This approach integrates the subjective experience of decision-makers with the objective attributes of data information.
Hierarchical analysis (AHP) is a multi-criteria decision-making method introduced by Saaty, an American operations researcher [33].This method simplifies multi-objective decision-making problems and enables fuzzy quantification of qualitative indicators, making it simple and practical.However, it heavily relies on subjective judgment and does not offer novel solutions for decision-makers, thus being categorized as a subjective empowerment method [34].The CRITIC method, proposed by Diakoulaki et al. in 1995 [35], is an objective empowerment method.It provides a comprehensive assessment of the objective weights of indicators by considering the comparative strengths of evaluation indicators and the conflicts between them.However, the objective weighting method demands a significant amount of data, involves complex calculations, and does not consider the rater's importance, leading to a notable discrepancy between the assessed weights and the actual importance [36].
In this study, the AHP method and the CRITIC method are integrated with subjective and objective assignments to conduct a comprehensive assessment of the weights of evaluation indices.This approach addresses the limitations of single assignment methods [37,38] and provides comprehensive weights for each index.It effectively incorporates both the subjective intentions of decision-makers and the objective attributes of data information.
The subjective weight of the jth indicator calculated using the AHP method is denoted as W Pj , and the objective weight of the jth indicator calculated using the CRITIC method is denoted as W V j , assuming a total of i indicators.Since there exists a certain interdependence among UGC short video value indicators, this paper utilizes the multiplicative synthesis method to calculate the combined weights of the indicators [39].
Step 5, the UGC short video value weighting index matrix is determined based on the UGC short video value index matrix and the combination weights of the UGC short video value assessment indices.This matrix serves as the input data for the subsequent value assessment model.

Process 3: Assessment of UGC Short Video Value
This study effectively evaluates the value of UGC short videos using the improved VIKOR model incorporating gray correlation coefficients.
The VIKOR (multicriteria compromise optimal solution) method, proposed by Opricovic and Tzeng [40], is a compromise-based multicriteria decision-making method.It obtains the feasible compromise solution that is closest to the multi-attribute positive ideal solution by making concessions between attributes, thereby determining the relative superiority or inferiority of the evaluated object based on the evaluation index [41,42].This approach enables the value assessment of UGC short videos.
Recognizing that utilizing a standardized method for calculating the gray correlation coefficients of each assessment index can significantly enhance the model's accuracy and convergence speed [43,44], this study integrates the gray correlation coefficients into the VIKOR model to enhance the validity of decision results.The calculation steps are outlined below: (1) Unquantified processing of indicators.The unquantified processing of gray correlation involves determining the reference sequence , and the normalized index f ij (correlation coeffi- cients).The unquantified formula is as follows: the quantization adopts a resolution coefficient ρ ∈ (0, 1], set to 0.5.The variables m and n indicate the starting and ending positions of the subsequence, respectively, where both m and n are integers satisfying 1 ≤ m ≤ n.The resolution coefficient ρ is employed in gray correlation analysis to adjust the similarity between sequences.The value range of this parameter is (0, 1].When ρ approaches 1, the correlation coefficient of all sequences becomes close to 1, indicating reduced dissimilarity between the sequences.Conversely, as ρ approaches 0, the dissimilarity between sequences increases.In this paper, ρ is set to 0.5.This choice ensures a suitable balance between difference and similarity, allowing the correlation coefficients between sequences to reflect their similarity while also showing their difference to a certain extent.Additionally, setting ρ to 0.5 helps avoid potential computational fluctuations; (2) Calculate the positive ideal solution f + j and the negative ideal solution f − j for each indicator as follows: where I 1 represents a benefit-based indicator and I 2 represents a cost-based indicator; (3) Calculate the group benefit value S i and the individual regret value R i for each assessment object as follows: where S i represents the group effect of the assessed object and a smaller S i value indicates a lower group effect value.R i represents the individual regret value and a smaller R i value indicates a lower individual regret.ω j is used to indicate the weight of the jth evaluation indicator; (4) Calculate the interest ratio of the appraisal object, which serves as a crucial basis for assessing the value of UGC short videos: where S + = minS i , S − = maxS i , R + = minR i , and R − = maxR i .v is the decision mechanism coefficient, which is taken as 0.5.The rationale behind this choice is that v aims to strike a reasonable balance between S i and R i , determining whether the evaluation strategy favors the majority-criterion optimal solution (minimum S i ) or the maximum group consensus solution (minimum R i ).Typically, v has values between 0 and 1.A value of v = 0 considers only the group benefit value, while v = 1 considers only the individual regret value.When v is set to 0.5, both the group benefit and the individual regret carry equal weight, representing a typical equal decision-making mechanism.This approach ensures that the evaluation and comparison of the short video's value account for both the overall benefit and the impact of individual elements.The choice of 0.5 is common because it achieves a well-balanced consideration of group and individual impacts.Following the aforementioned process, the value of UGC short videos as empirical objects is assessed.Additionally, a radar chart depicting the UGC short video value dimensions is generated, aiding the platform in discerning the value characteristics of various UGC short video categories.

Data Acquisition and Processing
In mainland China, the main databases selected for literature searches were CNKI and Wanfang, supplemented by expert review texts.The keywords were chosen through literature research, resulting in the following search formula: Topic/Keyword/Abstract = (short video and user-generated content) AND (short video and UGC) AND (short video and content generation) AND (TikTok and content generation) AND (Bilibili and content generation).For foreign databases, Web of Science was selected as the primary data source, and the search formula used was: TS = ("short video" AND "UGC") OR TS = ("short video" AND "user generated content") OR TS = ("short video" AND "content production") OR TS = ("Tiktok" AND "user generated content") OR TS = ("short video" AND "consumer generated media").The search period spanned from 1 January 2010 to 1 July 2022, focusing on papers and journals related to the research topic of "UGC short video value".Irrelevant literature was excluded, resulting in a total of 212 valid related articles.Additionally, 41 articles were collected from reputable platforms such as People's Daily Online and Guangming.comthat were relevant to the research topic.After the data pre-processing step referred to as "Process 1", a total of 253 text data pieces were obtained.

Element Mining Based on Text Clustering Algorithm and Topic Mapping
The commentary text data is processed using the relationship extraction technique between entities in Process 1.The standardized association strength selection method is employed with default values.The random parameter starts with a value of 1, and the maximum number of iterations is set to 1000.The initial step size is 1, and the step size convergence coefficient is 0.001.The clustering resolution and minimum number of clustering items are both set to 1.The process is repeated 10 times to obtain the topic mapping of feature words shown in Figure 4.The mapping represents the location and associations of feature words using various visual cues such as color, size, ring thickness, and perspective.
employed with default values.The random parameter starts with a value of 1 maximum number of iterations is set to 1000.The initial step size is 1, and the convergence coefficient is 0.001.The clustering resolution and minimum numb tering items are both set to 1.The process is repeated 10 times to obtain the topic of feature words shown in Figure 4.The mapping represents the location and as of feature words using various visual cues such as color, size, ring thickness, and tive.The mapping resulted in the formation of 24 clusters consisting of 358 enti These clusters include thematic categories such as UGC short video revenue, th of fans, and platform interaction.By analyzing the clusters within the topic map identifying representative entity words, a detailed analysis of the characteristics within each topic category is conducted.For this study, 11 clusters were selected quantifiability, ease of understanding, and applicability of indicators, aligning value assessment of UGC short videos.The selected clusters were renamed to r inherent value characteristics of UGC short videos.The mapping resulted in the formation of 24 clusters consisting of 358 entity words.These clusters include thematic categories such as UGC short video revenue, the number of fans, and platform interaction.By analyzing the clusters within the topic mapping and identifying representative entity words, a detailed analysis of the characteristics of entities within each topic category is conducted.For this study, 11 clusters were selected based on quantifiability, ease of understanding, and applicability of indicators, aligning with the value assessment of UGC short videos.The selected clusters were renamed to reflect the inherent value characteristics of UGC short videos.Table 1 below presents the renamed clusters: Based on the statistical data, it is evident that the accuracy of the 11 topic clusters related to the value assessment of UGC short videos is above 0.5.This indicates the effectiveness of the text clustering model in producing high-quality clusters and sets the stage for constructing the subsequent UGC short video value assessment index system.

Construction of the UGC Short Video Value Assessment Index System
The information ecological chain theory, introduced in the 1980s, focuses on the interconnectedness of information flow among various information subjects in information systems [45,46].Analysis shows that the application scenarios based on this theory involve information flow activities among three types of information subjects: information generators, information disseminators, and information consumers.These subjects exhibit a certain degree of interdependence, forming a chain-like relationship.In the context of this study, we posit that UGC short videos also involve interactions among three key subjects during the dissemination process: creators, platforms, and users.Creators primarily undertake the creation and production of UGC short videos, while platforms serve as intermediaries between creators and users, handling the operation and management of UGC short videos.Users, in turn, receive and engage with UGC's short video content.
Thus, this study integrates the 11 identified UGC short video value features and establishes a hierarchical relationship among indicators based on three dimensions: creators, platforms, and users.The study analyzes the similarities and differences between the indicators to optimize and adjust them, ultimately constructing the UGC short video value assessment index system.Refer to Table 2 for more information.Reflecting that UGC short videos can meet users' entertainment needs in terms of audio and visual Sentiment value of the special evidence sentences related to "audiovisuality" in user comments Fun X11 Reflecting UGC short video content and its presentation has the quality of attracting viewers' interest and human touch, which can meet users' interesting needs The sentiment value of the special testimonials related to "interesting" in user comments 4.2.Quantification of the UGC Short Video Value Assessment Index 4.2.1.Data Collection Bilibili (referred to as B station) is a prominent UGC short video creation and sharing platform in mainland China, offering a wide range of video resources and data.For this study, we utilized a Python crawler tool to randomly select four popular video categories on B station in 2022: "Game Zone", "Knowledge Zone", "Culinary Zone", and "Music Zone".Specifically, we randomly chose videos with traffic exceeding 1 million views as empirical objects, collected between 1 July 2022, and 31 December 2022.The collected data, presented in Table 3, includes the UGC video title, user comments, and the creator's number of fans, among other information.In total, 46,234 comments were obtained.

Indicator Processing Process
Utilizing the LSTM fine-grained sentiment analysis model, we calculated the sentiment value for the indicators related to user comments in the UGC short video value assessment index system.The calculation results are presented in Table 4.The structured data indicators are quantified as follows: the "FeiGua Data" platform provides statistics on the number of new fans gained by creators after releasing a video, enabling quantification of "fan popularity" of the creator; the popular revenue conversion formula of B station, "1w plays can earn 30 RMB", is used to quantify the "economic return on creation" for each video by the creator; the interaction index of each UGC short video on the "FeiGua Data" platform is calculated by considering users' interactive behaviors such as likes, favorites, and coins, which quantifies the "frequency of interaction" on the platform; the "CMHI" data detection and analysis platform calculates the interaction index of third-party platforms for each UGC short video released through the B station platform.This is performed by counting the total number of reposts and reports by third-party platforms, which quantifies the "information repost volume" of the platform.To quantify the platform's "cultural carrying capacity", video content is converted into textual information, and the TF-IDF method is applied to identify culture-related keywords for quantification.The total count of these keywords in the text data determines the quantification.
To facilitate the comparison and assignment of indicators, all quantified indicators were separately normalized before the combination assignment, resulting in the UGC short video value indicator matrix.By considering subjective and objective assignments and calculating the weights for indicator combinations, the results in Table 5 are obtained.Based on the quantified indexes and weights of the UGC short video value assessment indexes, the final weighting matrix for UGC short video value is derived.The gray correlation coefficient of each index is calculated and inputted into the VIKOR model to obtain the group effect value S, individual regret value R, and benefit ratio Q for the 20 UGC short videos.

Assessment Principles
The VIKOR method calculates the benefit ratio value (Q) by measuring the distances between the empirical UGC videos and the positive and negative ideal solutions.This ratio serves as the value of the UGC videos.The group effect value (S) represents the sum of the distance ratios to the positive ideal solution.A smaller value indicates that the research object is closer to the "optimal solution".The individual regret rate (R) represents the maximum distance ratio between the evaluation object and the positive ideal solution.A smaller value suggests that the research object is closer to its own "optimal solution".A lower value signifies a higher value of the research object itself, serving as an important criterion for judging the value of UGC short videos.

Analysis of Assessment Results
In this paper, we utilize element mining technology to extract the components of UGC short video value indexes and build the UGC short video value assessment index system.We obtained the UGC short video value assessment results and the weighted ranking results of UGC short video indexes, which are presented in Figures 5 and 6.

Analysis of Assessment Results
In this paper, we utilize element mining technology to extract the components of UGC short video value indexes and build the UGC short video value assessment index system.We obtained the UGC short video value assessment results and the weighted ranking results of UGC short video indexes, which are presented in Figures 5 and 6.Based on the principle of UGC short video value assessment, the benefit (Q) reflects the magnitude of the video's corresponding value, with smaller v cating higher video value and higher ranking.As shown in Figure 5, the valu (using video numbers instead of video names) results for the 20 selected UGC sh in this paper are as follows: The results indicate that the weighted ranking of UGC short video indicato ranking of UGC short video value show a similar trend, with slight differen ranking order of individual videos.This suggests that the UGC short video va ment model proposed in this paper focuses on the mutual influence among the in the UGC short video value assessment system while ensuring evaluation acc Based on the principle of UGC short video value assessment, the benefit ratio value (Q) reflects the magnitude of the video's corresponding value, with smaller values indicating higher video value and higher ranking.As shown in Figure 5, the value ranking (using video numbers instead of video names) results for the 20 selected UGC short videos in this paper are as follows: The results indicate that the weighted ranking of UGC short video indicators and the ranking of UGC short video value show a similar trend, with slight differences in the ranking order of individual videos.This suggests that the UGC short video value assessment model proposed in this paper focuses on the mutual influence among the indicators in the UGC short video value assessment system while ensuring evaluation accuracy.The comparison and analysis of the best and worst result items in the assessment object demonstrate the superiority of the VIKOR model in handling multi-attribute decision problems.
This paper selected 20 UGC short videos, which were categorized into four partitions: game, knowledge, food, and music.To analyze the value performance of UGC short videos in each partition, the weighted quantification results of the videos were divided based on the creator dimension, platform dimension, and user dimension.Radar charts were created using the average values of the three dimensions for each thematic partition, as presented in Figure 7.These radar charts effectively illustrate the value and performance of short videos in different partitions.

Discussion
In this study, we develop a comprehensive set of models for the scientific evaluati of the value of UGC short videos.This involves three processes: (1) constructing the ind system for evaluating the value of UGC short videos based on element mining; (2) qua tifying the indexes used to evaluate the value of UGC short videos; and (3) conducting evaluation of the value of UGC short videos.
Firstly, we initiate the process by analyzing the professional text of UGC short vide We employ text clustering algorithms and visualization technology for theme mapping thoroughly explore the value characteristics of UGC short videos, thus constructing t UGC short video value assessment index system.Secondly, we quantitatively evalu the indexes using heterogeneous data from multiple sources.For structured data indi tors, we statistically quantify them using data from various platforms, while the LST fine-grained sentiment analysis model calculates the sentiment value of unstructured d indicators to enable precise quantification.Finally, by assigning weights to the subjectiv objective combinations of all indicators, we utilize the gray correlation coefficient of t improved VIKOR model to accurately assess the value of UGC short videos.The empiri analysis yields the following findings: Finding 1: Deconstructing the Value Characteristics of UGC Short Videos.

Discussion
In this study, we develop a comprehensive set of models for the scientific evaluation of the value of UGC short videos.This involves three processes: (1) constructing the index system for evaluating the value of UGC short videos based on element mining; (2) quantifying the indexes used to evaluate the value of UGC short videos; and (3) conducting an evaluation of the value of UGC short videos.
Firstly, we initiate the process by analyzing the professional text of UGC short videos.We employ text clustering algorithms and visualization technology for theme mapping to thoroughly explore the value characteristics of UGC short videos, thus constructing the UGC short video value assessment index system.Secondly, we quantitatively evaluate the indexes using heterogeneous data from multiple sources.For structured data indicators, we statistically quantify them using data from various platforms, while the LSTM fine-grained sentiment analysis model calculates the sentiment value of unstructured data indicators to enable precise quantification.Finally, by assigning weights to the subjective-objective combinations all indicators, we utilize the gray correlation coefficient of the improved VIKOR model to accurately assess the value of UGC short videos.The empirical analysis yields the following findings: Finding 1: Deconstructing the Value Characteristics of UGC Short Videos.
In contrast to single-dimensional value evaluation, this study employed value element mining technology to discover ecological value elements in UGC short videos and establish a multi-dimensional value evaluation index system.We identified that the current value elements of UGC short videos in China mainly encompass three dimensions: the creators, the platforms, and the users.These dimensions consist of the following value elements: The creator dimension includes fan popularity and creative economic return; the platform dimension includes interaction frequency, information reprint volume, and cultural carrying capacity; and the user dimension includes knowledge, skills, cultural identity, community identity, audio-visual appeal, and entertainment value.The value of UGC short videos should not be assessed solely based on their content and technical attributes.Instead, it is an organic unity resulting from the interactions among UGC short video creators, communication platforms, and users [47].This system aligns with the interactive nature of UGC short videos as a form of Internet communication [48] and holds valuable implications for the future value evaluation of UGC short videos; Finding 2: Differentiated Value Performance of UGC Short Videos in Mainstream Categories.
1. Game Zone: Among the mainstream video divisions, the game partition exhibits the highest indicator weighting results in the creator and platform dimensions, suggesting that game videos in this category can generate more tangible benefits and value for creators and platforms [49].However, compared to other partitions, the weighted results of indicators in the user dimension are average, indicating a relatively lower value contribution of game short videos to users.This may be attributed to the emphasis placed on short-term revenue and video traffic growth, which overlooks the genuine needs and experiences of users.Consequently, the overall short video value score of the division is relatively low [50].In the future, practitioners should create game-short videos with a user-centric approach, focusing on delivering new audiovisual experiences while addressing the diverse knowledge and cultural needs of users to enhance their overall experience.It is noteworthy that a domestic game-themed short video, "Genshin impact", ranks second in value.Compared to other game-short videos, it stands out in terms of "cultural carrying capacity" and "cultural identity" index values.This highlights the importance of integrating cultural elements into the creation of high-value game-short videos, as it enhances the artistic and aesthetic value of the videos, attracting greater user attention and engagement [51].Moreover, cultural elements serve as a valuable source of inspiration for game-short video creation [52], fostering more innovative ideas and expanding the imaginative space, thereby further enhancing the value and distinctiveness of video content; 2. Knowledge Zone: The high user value scores of short videos in the knowledge partition indicate that these videos effectively fulfill viewers' knowledge and skill-related needs.This exceptional user value performance also reflects the growing demand for quality knowledge content among users today.With the widespread popularity of mobile internet, people increasingly turn to innovative media such as short videos to access information and acquire knowledge [53], further bolstering the prominence of short videos in the knowledge domain.Additionally, knowledge-based short videos provide an excellent platform for knowledge creators to share their expertise and insights, serving as a conduit for the dissemination of wisdom.Popularizing knowledge through short videos not only facilitates rapid learning and mastery for viewers but also enhances the reputation and influence of knowledge creators [2].For instance, "Luo Xiang speaks criminal law", a knowledge-focused up-owner examined in this study, has garnered the distinction of being the fastest video creator to reach 10 million fans in the history of B-station.However, in terms of the creator dimension and the platform dimension, the performance of knowledgebased short videos remains average.This indicates the need for creators to align their video content with current social and produce more targeted videos [54].Moreover, creators should delve deeper into professional knowledge to create videos with both depth and breadth, thereby delivering a more comprehensive and diverse video content experience.Additionally, platforms should enhance the promotion of high-quality content and provide robust resource support for outstanding creators, ensuring that viewers find delight in their knowledge acquisition journey.In today's society, knowledge-based short videos have become a preferred leisure choice for individuals, contributing to the accumulation of knowledge and fostering innovation for societal development.Consequently, creators and platforms should continuously optimize and improve content value while enhancing viewers' overall viewing experience; 3. Gourmet Zone: Short videos in the food category receive high user scores, indicating that the creators of these videos effectively showcase their cooking skills and professionalism from a user experience perspective while teaching viewers cooking techniques [55].Additionally, the data demonstrates that these food-related short videos outperform other indicators in terms of "cultural carrying capacity", suggesting that they provide viewers with rich cultural content in the context of China's culinary culture [56], further strengthening their cultural identity.The study's findings also highlight the exceptional performance of short videos in the food category in terms of "interestingness".Creators are gradually departing from traditional cooking processes, employing humorous and witty copywriting as well as exquisite editing skills to create highly engaging audio-visual experiences [57].The content covers various topics such as "Production", "Taste Exploration", and "Judging", allowing viewers to appreciate the beauty of cooking and adding diversity and richness to the video topics.Looking ahead, creators should continue to explore new topics and formats in food-related short videos, showcasing unique ingredients and production methods.Additionally, incorporating interspersed storytelling elements can bring food to life, offering viewers a fresh and unprecedented experience [58]; 4. Music Zone: Short videos in the music category receive relatively high scores in the user dimension.Compared to other mainstream video divisions, these videos demonstrate higher quantitative results in the "audio-visual" index.This indicates the importance of enhancing the musical presentation in music videos [59], showcasing the performers' virtuosity, deep emotions, and excellent artistic performances.Creators can utilize camera work, editing techniques, and post-production to further emphasize the inherent power of the music [60].The study's results also suggest that the performance of music videos is moderate in the creator dimension and platform dimension, indirectly highlighting the increasing significance of music subject matter and values in future music video production [61].Creators should ensure that the music aligns with the video's theme, fully expresses their own emotions, resonates with the audience's values, and reflects contemporary social trends [62].Additionally, creators should continuously expand their video production skills by incorporating storytelling elements [63], enriching the content and viewing value of music videos.This approach will make short music videos more captivating and resonant with viewers, thereby contributing to the overall enhancement of video value.
This study holds both theoretical and practical significance.Theoretically, it establishes a new paradigm for assessing the value of UGC short videos.By mining key value elements from authoritative texts, employing differentiated indicator quantification methods for multi-source data, and conducting value assessment, the study effectively addresses the issues of strong subjectivity, insufficient quantification of key features, and weak interpretability of assessment results in the current UGC short video value assessment indicators.Consequently, it provides a novel assessment theory and direction for future evaluations of the value of short videos and other related Internet content communication media.In practical terms, this study introduces a new and efficient video value assessment tool for short video operation platforms.By applying our study, platforms can effectively identify high-value short videos and curtail the dissemination of videos influenced by misguided opinions, thereby enhancing their reputation and credibility.

Figure 1 .
Figure 1.UGC short video value assessment model.

Figure 1 .
Figure 1.UGC short video value assessment model.

3. 1 .
Process 1: Constructing an Index System for Assessing the Value of UGC Short Videos Based on Element Mining

Figure 2 .
Figure 2. The process of constructing the UGC short video value assessment index system.

Figure 2 .
Figure 2. The process of constructing the UGC short video value assessment index system.Subsequently, theme clustering and key element mining are conducted.This process includes: (a) The analysis of clusters.Utilizing the constructed topic mapping, cluster analysis and representative entity word extraction are performed to analyze the characteristics of entity words within each cluster category; and (b) The mining of key elements.The clusters that fulfill the value assessment criteria for UGC short videos are selected, and these selected clusters are renamed by incorporating the value characteristics of UGC short videos.Lastly, utilizing classical theory or a priori knowledge, the mined value assessment elements are classified and organized hierarchically.The structure is then optimized and adjusted based on the similarity and difference of indicators, resulting in the construction of a UGC short video value assessment index system.
. Therefore, this paper integrates the domain sentiment lexicon with the LSTM deep learning model to calculate the sentiment value of indicators and achieve the quantification of user comment data.The specific steps consist of four parts: data processing, feature word extraction, feature sentence extraction, and training and testing based on the LSTM model.Refer to Figure 3 for detailed information.
. Therefore, this paper integrates the domain sentiment lexicon with the LSTM deep learning model to calculate the sentiment value of indicators and achieve the quantification of user comment data.The specific steps consist of four parts: data processing, feature word extraction, feature sentence extraction, and training and testing based on the LSTM model.Refer to Figure 3 for detailed information.

Figure 3 .
Figure 3. Emotional intensity calculation model based on the LSTM model.

Figure 3 .
Figure 3. Emotional intensity calculation model based on the LSTM model.

Figure 4 .
Figure 4. Clustering mapping of UGC short video value elements.

Figure 4 .
Figure 4. Clustering mapping of UGC short video value elements.

Figure 5 .
Figure 5. UGC short video value assessment result chart.Figure 5. UGC short video value assessment result chart.

Figure 5 .
Figure 5. UGC short video value assessment result chart.Figure 5. UGC short video value assessment result chart.

Figure 6 .
Figure 6.Bar chart of weighted results of UGC short video indicators.

Figure 6 .
Figure 6.Bar chart of weighted results of UGC short video indicators.

Figure 7 .
Figure 7. Performance of UGC short video values in different partitions.

Figure 7 .
Figure 7. Performance of UGC short video values in different partitions.

Table 1
below presents the clusters:

Table 2 .
UGC short video value assessment index system.

Table 3 .
Name of the empirical video.

Table 4 .
Quantification of the sentiment value of user review data indicators.

Table 5 .
Weighting weights for indicator combinations.