A Novel Approach of Integrating Natural Language Processing Techniques with Fuzzy TOPSIS for Product Evaluation

Abstract: Product designers need to fully understand consumers’ emotional preferences and responses to product forms in order to improve products. However, users and designers understand and conceptualize products differently during evaluation, which leads to cognitive asymmetry in the product design and evaluation process. This asymmetry prevents designers from grasping users’ needs and increases the risk of product development failure. To this end, this paper proposes a product evaluation method that integrates natural language processing techniques with fuzzy multi-criteria decision-making to reduce the cognitive difference between users and designers and thereby address cognitive asymmetry. First, product reviews written by users are collected from the Internet with a web crawler, and word vectors are constructed with natural language processing techniques to express the Kansei image parametrically. Second, a statistical method is used to extract the product schemes that meet the preferences of both users and designers, and the relationship between product form and Kansei image is quantified by grey relational analysis (GRA). Finally, the indicator weights are calculated with the entropy method, and the fuzzy TOPSIS method is used to prioritize the product design alternatives in view of the Kansei needs of users. The feasibility and effectiveness of the method are verified using a smart capsule coffee machine as an example. In particular, the proposed method not only enables different cognitive subjects to achieve cognitive symmetry, but also filters out product forms that meet the cognitive needs of users.
Moreover, this study provides a theoretical basis and practical guidance for reducing the cognitive differences between cognitive subjects throughout the product design process, and offers a systematic framework for the industry to connect customer needs with product design decisions effectively. It also introduces a new method for Kansei engineering.


Introduction
Due to increasingly fierce market competition, many companies find that their manufacturing and market research have reached the same level as those of their competitors, so the only remaining competitive weapon is design innovation and improved design quality [1]. Since product form has been freed from the constraints of production and function in the post-modern era, the lifestyle and emotional value of a product are considered more pivotal in product design [2]. Product design must not only meet functional requirements, but also pay more attention to the meaning and value created by the product, so as to satisfy consumers' functional and psychological expectations [3]. For example, Apple's smartphone research and development often produces a variety of styles in different materials and colors for different user groups, expanding its consumer base across age groups and extending the product life cycle [4]. With increasing industrial integration, product differentiation is becoming difficult [5]. Therefore, users' perceptions and preferences must be incorporated into the production and promotion of differentiated products to assist product design decision-making [6]. The user's emotions and preferences are a customer's psychological response to the product's Kansei design details (such as the shape), and they are also the basis for the formation of human values and judgments [7]. In fact, the essence of a product's Kansei design is to design functions that meet the various expectations and needs of users, so as to make the product emotionally engaging [8]. Therefore, in the product design and development process, it is necessary to consider product design attributes based on the emotional needs of customers, and then to design products with corresponding characteristics to win the favor of users [9,10].
In order to carry out the Kansei design of products, it is necessary to start with two stages of product concept generation and evaluation. Specifically, the concept generation and evaluation of a product design are two key steps to obtain the optimal design result in the product design stage, where the former could generate possible design concepts and the latter could determine the final selection of candidate design schemes [11]. As a gatekeeper, the impact of design concept evaluation on the novelty, feasibility and quality of the final product is extremely important [12], which is also a big challenge. In order to select the ideal design concept, the design team needs to consider various factors from customer needs, technical attributes and design constraints, and then develop appropriate evaluation models to determine the priority ranking of the candidate design options [11]. Moreover, in order to reduce the impact of the cognitive biases of decision-makers, group decision-making strategies are becoming increasingly common in design concept evaluation [13]. By inviting experts to discuss and determine the evaluation criteria, and give their evaluation information, the ranking of an alternative design plan can be determined.
Kansei engineering (KE) [14,15] is a theory based on design science, psychology, cognition, and other related disciplines that introduces human perceptual analysis into the field of engineering technology [16]. KE aims to establish a database of consumer perceptions, which captures the mapping between Kansei vocabulary and design elements from the perspective of consumers [17]. It is a conversion technology that transforms consumers' feelings and imaginations of products into design elements [18], and generates product solutions and alternative concepts based on the investigation and analysis of visual stimuli [19]. Kansei is a Japanese word that expresses users' psychological feelings and imaginations of new products [20]. KE pays attention to how people perceive images or objects of a product, studies the effect of their personal Kansei preferences or cultural background on their psychological feelings, and then identifies the emotional attributes of customers and associates them with design elements [18].
In essence, selecting a product innovation design scheme is a typical multi-attribute comprehensive decision-making (MCDM) problem, that is, a multi-objective, multi-level, and uncertain complex decision-making process. Efforts to establish a more precise mapping between user needs and product development have therefore never stopped. In this regard, techniques such as the fuzzy analytic hierarchy process (FAHP) [21], artificial neural networks (ANN) [22], quality function deployment (QFD) [10], and TOPSIS [23] have been widely integrated with fuzzy sets [24] to support decisions among various solutions. However, these models have certain limitations. ANNs are considered "black box" models, because their performance depends only on the size and quality of the data and the structural characteristics of the model [7]. In addition, there are two limitations in the application of fuzzy mathematics: (1) fuzzy mathematics is used to simulate the cognitive process of human beings, but the relational functions used are usually determined by the experience and intuition of experts; (2) the boundary interval of a fuzzy number expands rapidly under fuzzy arithmetic operations, which lowers the identifiability of the resulting fuzzy number and makes it deviate from the correct prior result of the customer's needs [25]. Grey system theory (GST) is an effective alternative [26]. Grey theory provides a technique for transferring information from black (unknown) to white (known); it has the advantage of using uncertain or scattered information to handle complex tasks [27], and its features suit research with limited samples [28]. Moreover, GST takes grey systems with "small samples and little information" as its research object [7], so it is well suited to handling fuzzy or grey data in KE.
In addition, most of the previous KE literature uses questionnaire surveys to obtain Kansei attributes and product attributes [29]. This traditional method provides high-quality results, but it has disadvantages, such as its small scale and one-off nature, and it is limited in terms of data updating and collection efficiency [30]. Moreover, traditional survey questions are designed based on expert thinking rather than customer opinions [31], so the survey results cannot reflect users' true emotional factors [32]. With the development of computer and Internet technology, the role of the computer as an information exchange medium has expanded rapidly, and online shopping has become a trend that shapes consumption patterns [33]. People can choose their favorite products online and give their own opinions. These data constitute an information network that connects product attributes and user experience, and contains accurate and authentic information about users' Kansei images [34]. This information can help designers understand consumers' experience and expectations of services and products [35]. Some researchers have used online consumer reviews to gain insights into research issues [31,32,34,36,37]. Although previous text mining studies have explored ways to help customers understand review summaries, few studies help designers clearly identify customer needs to support the product development process. Therefore, this study uses web crawlers to collect online product reviews, uses text mining technology to extract users' real Kansei vocabulary from the reviews, and introduces this authentic evaluation information from web-crawled big data into KE through an automated method.
In this way, the acquired Kansei demands are brought into symmetry with real user needs, which solves the asymmetry problem in the process of demand acquisition; this is the focus of this paper. In addition, this research uses a t-test to quantify the cognitive differences between designers and users, to extract the products that meet the cognitive needs and preferences of both groups, and to screen out product-form design schemes that show symmetrical affective cognition between users and designers. Finally, evaluation indices are extracted from the high-quality product design schemes and their weights are quantified based on entropy, while the fuzzy TOPSIS method is used to evaluate the product design schemes and prioritize products according to user needs. The innovative design and evaluation of products thus become more objective with the proposed method. Moreover, the method provides a quantitative reference for companies and designers to screen out high-satisfaction product design plans. The overall comparison between this article and previous studies is shown in Table 1. The main contributions of this article are as follows:

1. Mining users' Kansei information from online review big data, and using natural language processing technology to introduce users' authentic evaluation information into the Kansei image, which effectively avoids the deviation between expressed preferences and real emotional appeals in traditional consumer surveys.

2. Extracting products with symmetrical cognitive information between designers and users, and analyzing the contribution degree between the core modeling items and the Kansei intentions of the product, so that the selection of key product indicators is completed in a user-centric manner.

3. Making reasonable evaluations based on user needs to determine the priority of alternative product schemes, so as to optimize multi-attribute decision-making for design schemes and address the uncertainty and subjectivity of design scheme evaluation in a group decision-making environment, thereby providing better theoretical support for selecting the best product plan.

In particular, this study adopts a novel combination of methods to conduct systematic, consumer-centric decision-making research on products, using a hybrid analytical approach that combines natural language processing, KE, GRA, entropy, and fuzzy TOPSIS to support effective decisions on product innovation schemes. First, natural language processing is used to analyze the Kansei needs of users qualitatively and quantitatively, the semantic relationships between words are expressed as vectors, and the resulting Kansei needs are further reduced in dimensionality by factor analysis (FA). Then, a t-test and correlation analysis are used to extract the products whose Kansei images agree between users and designers, and GRA is used to determine the contribution degree of each styling item to the Kansei image, so as to extract the key styling items of the product. On this basis, an entropy and fuzzy TOPSIS model is proposed to rank the potential products, so that products that meet users' emotional demands are systematically identified and selected. The Kansei demand assessment framework provided in this study thus optimizes multi-attribute group decision-making for design schemes and provides a better basis for scheme selection. To illustrate its effectiveness, the proposed method is verified using a smart capsule coffee machine as an example.
The rest of this article is structured as follows. Section 2 briefly reviews the concepts of GST and fuzzy TOPSIS. Section 3 introduces the proposed framework. Section 4 verifies the effectiveness of the framework through a case study. The last part summarizes this research and puts forward the problems to be further researched and solved in the future.

Table 1. Overall comparison between this article and previous studies (rows recoverable from the source).
[45]: QFD, CPA; TRIZ
Hsiao et al. [46]: AHP; QT-I, GA
Wang [7]: SD, KJ, K-means; GRA; SVR, ANN
Shi et al. [47]: KE; RST, ARM
Lin et al. [48]: Focus group; Kano; Fuzzy QFD
QFD: quality function deployment; CA: conjoint analysis; FA: factor analysis; KE: Kansei engineering; EGM: evaluation grid method; NLP: natural language processing; ANN: artificial neural network; RST: rough set theory; ARM: association rule mining; GA: genetic algorithm; SVR: support vector regression; TRIZ: the theory of inventive problem solving; DT: decision tree.

GST
Grey system theory (GST) was proposed by Professor Deng Julong of Huazhong University of Science and Technology in 1982 [26]. Its research object is the uncertain system, i.e., a system in which part of the information is known and part is unknown [49]. In GST, the concept of a grey box is used to describe the characteristics of information in the actual environment. In grey events, the completeness of the information is expressed by the degree of greyness. A system is white when its information is completely known and black when its information is completely unknown; it is grey when the information is partially known and partially unknown. The entire grey system can change dynamically with time [1]. For the factors between systems, the measure of correlation that varies with different objects is called the degree of correlation. The grey correlation analysis method is based on the degree of similarity or difference between the development trends of factors in grey system theory, that is, the "grey correlation degree", which quantitatively measures how closely the changes and development trends of factors are related [49]. Furthermore, it relies on the mathematical foundation of space theory to determine the geometric similarity between the reference factor sequence and the related factor sequences, and thus the degree of relevance between the various factors of the system; it is essentially a dynamic analysis process. This method has several advantages, such as simple calculation, clear sorting, and no special requirements for data distribution types [50]. In essence, the main difference between grey relational analysis (GRA) and traditional mathematical analysis is that GRA provides a framework for analysis when only a small amount of information is available.
Therefore, this method has been widely used in system analysis, management decision-making, and KE [1,28,50-52]. However, in KE, GRA has so far been used only to analyze the importance of product color and styling items; the experimental results have not been integrated into an overall aesthetic priority evaluation of the product. Hence, this research explores products that are emotionally compatible with both consumers and designers, uses GRA to analyze the contribution degree of product form elements based on the Kansei demand, and then extracts key modeling features and quantifies them as evaluation indicators, which serve as the evaluation carrier of a product's comprehensive satisfaction ranking. In this way, the evaluation result becomes more objective and accurate, thus effectively assisting designers in design decision-making.
Therefore, in order to solve the problem of the same item weight in the traditional product design and evaluation process, a method of assigning different item attributes with different weights is proposed, and the key product form items are extracted so as to make the results of the product evaluation more objective and accurate. In this study, the GRA method is adopted to calculate the degree of correlation between the product modeling items and Kansei image, and the priority of modeling elements that affect the product Kansei image is explored precisely so as to guide the design decisions and evaluation during the product development process.

The Fuzzy TOPSIS
Hwang and Yoon (1981) [53] proposed the Technique for Order Preference by Similarity to an Ideal Solution (TOPSIS). Its basic idea is to select, in the decision-making process, the alternative with the shortest distance to the positive ideal solution (PIS) and the furthest distance from the negative ideal solution (NIS) [54]. TOPSIS has become a mature and effective method in multi-attribute decision-making, and has been widely used in supplier selection [39], comprehensive ranking of energy supply systems [55], business and market management research [56], and product development decision-making and configuration [57]. TOPSIS has a straightforward execution process, is convenient to use, and has lower computational requirements than other management decision methods [58]. However, the traditional TOPSIS method can only handle evaluation indices with quantitative values, whereas the evaluation of morphological design plans in the product design stage involves indices that cannot be quantified directly. Therefore, it is necessary to introduce fuzzy theory into TOPSIS and use a transformation scale to convert linguistic variables into triangular fuzzy numbers so as to quantify the qualitative indicators; the final ranking of the schemes is then obtained with the fuzzy TOPSIS method. Determining the weight of each product design criterion is also an important part of the fuzzy TOPSIS method. Once the weights of the indicators are determined, all alternatives can be compared based on their overall performance on all criteria.
Subjective weighting methods based on the preferences of experts, including AHP, Delphi, FARE, and BWM, become harder to apply as the number of evaluation indicators increases, and experts may not be able to assign precise weights to each criterion. This problem can be avoided with objective weighting methods, among which entropy is an effective choice: the weight of each evaluation index is obtained by analyzing the dispersion of the data.
A review of current KE research shows that the fuzzy TOPSIS method is rarely applied to product design and evaluation. Therefore, to overcome the limitation that the traditional TOPSIS method can only deal with quantitative evaluation indicators, this study uses the fuzzy TOPSIS method to evaluate products and to reduce the subjectivity and complexity of decision makers' evaluations. At the same time, the entropy method is used to obtain objective weights for the evaluation indices. The final ranking of the product sample plans is thus obtained, making the evaluation result more accurate and precise.

Method
In order to meet the needs of users and speed up the product development process, this research proposes a product form design decision-making method driven by user Kansei images, based on natural language processing and grey theory. This method not only saves time in identifying customer needs but also helps designers develop customer-oriented products. Specifically, we first apply the TF-IDF feature extraction method to extract product feature words from user review data, which makes up for the weakness of Word2Vec in feature extraction. We then use Word2Vec to map the words into a vector space and obtain word vectors, which compensates for the inability of TF-IDF to capture the semantic information of words. The combination of TF-IDF and Word2Vec thus avoids possible over-fitting problems while fully considering the semantic information of words. Furthermore, we use the semantic differential (SD) method, t-test, and correlation analysis to select the products that are highly compatible with the emotions of both users and designers, so that products with cognitive symmetry are obtained. To accurately obtain users' key demands on product styling items, expert interviews are used to extract the core styling features, grey relational analysis is used to obtain the relationship between styling features and Kansei images, and the key styling items are then screened out. Finally, the fuzzy TOPSIS method is applied to the preference analysis of customer demand solutions, and the product design solutions that meet the Kansei needs of users are selected, thus guiding the positioning and development of the product conceptual design. As shown in Figure 1, the operational procedure includes the following steps:

1. First, product feature words are extracted through natural language processing, so as to mine accurate and authentic user Kansei image appeals; this overcomes the disadvantages of traditional methods in data scale, data credibility, data updating, and collection efficiency. Consequently, the deviation between the preferences captured by consumer surveys and real preferences is reduced, which improves the success rate of new product development.

2. Then, to determine the difference in product perception between designers and customers, the SD method is used to quantify emotions, the t-test is used to compare the Kansei image scores of all samples given by designers and customers, and Pearson correlation analysis is used to obtain the product samples that meet both the Kansei needs of users and the emotional appeals of designers. Such products are high-quality product forms with symmetrical cognitive preferences between users and designers.

3. Thereafter, grey relational analysis is used to quantitatively analyze the relationship between users' Kansei needs and product styling feature factors. From the experimental results, the contribution of each styling item can be determined, and the core styling evaluation items can then be selected, effectively identifying the main design elements that affect the product's Kansei image.

4. Finally, entropy and fuzzy TOPSIS are used to obtain the final ranking of the sample plans and to evaluate and optimize the product form development scheme. Because emotional information is vague and uncertain, the fuzzy TOPSIS evaluation method is used to reduce the subjectivity and complexity of decision-makers' evaluations of product form schemes, minimizing the impact of personal subjective factors on the results and making the evaluation results more precise.
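
The comparison in step 2 can be illustrated with a minimal Python sketch. The text does not say whether the t-test is paired or independent, so the sketch assumes two independent groups and computes Welch's t statistic; the function names and all scores are hypothetical and serve only as an illustration.

```python
import math
from statistics import mean, variance

def welch_t(a, b):
    """Welch's t statistic for two independent samples (assumed test variant)."""
    se2 = variance(a) / len(a) + variance(b) / len(b)
    return (mean(a) - mean(b)) / math.sqrt(se2)

def pearson_r(x, y):
    """Pearson correlation coefficient between two equally long score lists."""
    mx, my = mean(x), mean(y)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    sxx = sum((xi - mx) ** 2 for xi in x)
    syy = sum((yi - my) ** 2 for yi in y)
    return sxy / math.sqrt(sxx * syy)

# Hypothetical SD-scale scores for ONE product sample on ONE Kansei word.
designer_scores = [4, 5, 3, 4, 4]
user_scores = [4, 4, 3, 5, 4]
t_stat = welch_t(designer_scores, user_scores)

# Hypothetical mean scores of four product samples, per group.
designer_means = [4.0, 2.5, 3.5, 1.5]
user_means = [3.8, 2.7, 3.6, 1.9]
r = pearson_r(designer_means, user_means)
```

A t statistic near zero together with a high correlation would suggest that the two groups perceive the samples similarly. Turning the statistic into a p-value needs the t distribution, which a statistics package would normally provide.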

Natural Language Processing
The main tasks of product review mining are the extraction of product feature words and the emotional evaluation of those feature words [59]. It mainly relies on natural language processing [60], computational linguistics, statistical analysis, and other technologies that automatically identify and extract useful information from texts [61]; in essence, it is a process of using natural language processing and machine learning techniques to obtain high-quality information from unstructured texts [31,62]. Representing words in a form that machines can process is the basis of natural language processing. The most intuitive representation is the one-hot vector, but this high-dimensional, sparse representation easily runs into the "curse of dimensionality". It also leaves the machine unable to judge the degree of relevance between word vectors or to store the semantic information of words. Hinton (1986) proposed distributed representations to represent semantics, which addresses the above problems. Among such methods, Word2Vec is a prominent one; it was developed by Tomas Mikolov et al. at Google and proposed in 2013 [63]. It uses the middle-layer parameters of the model to convert natural language vocabulary from the one-hot form into word vectors represented by fixed-length dense vectors. The semantic and logical relationships between words are reflected by the correlations between word vectors. Essentially, the model is a word prediction model built on a neural network (Figure 2). Its training samples are words in the text (target words) and the words in their context, and the target words are predicted from their context words. In addition, Word2Vec can quickly create a vector space, typically composed of several hundred dimensions [63].
This semantic vector space reflects the distributional hypothesis, that is, words that appear in the same context often have similar meanings [64]. Two methods are most commonly used when building Word2Vec models, namely, continuous bag-of-words (CBOW) and skip-gram (Figure 3). In the CBOW method, surrounding words are used to predict the current word [64], while skip-gram tries to predict the words in a window of size c around the current word. Skip-gram models tend to perform better on smaller data sets [64]. Therefore, this study uses a skip-gram model.

This study uses web crawlers to collect and filter review data from the Internet, then uses the Jieba word segmentation tool to segment the text, and finally draws the word cloud and calculates the importance of each word through TF-IDF. Term frequency-inverse document frequency (TF-IDF) [66] is a commonly used weighting technique in information retrieval and text mining; here it quantifies the strength of each Kansei word. The importance of a word increases in proportion to the number of times it appears in a document, but decreases in inverse proportion to the frequency of its appearance across the corpus. The specific steps are as follows. For a set D containing M Chinese texts, first use the Jieba tool to segment each text in D and remove the stop words, and then use the TF-IDF algorithm to calculate the weight $\mathrm{TF\text{-}IDF}_{i,j}$ of the word $t_i$ in the text $D_j$ (j = 1, 2, 3, ..., M) by Equation (1):

$$\mathrm{TF\text{-}IDF}_{i,j} = \mathrm{TF}_{i,j} \times \mathrm{IDF}_i \tag{1}$$

$$\mathrm{TF}_{i,j} = \frac{n_{i,j}}{\sum_{k} n_{k,j}} \tag{2}$$

In Equation (2), $n_{i,j}$ is the number of occurrences of the word $t_i$ in the text $D_j$, and the denominator is the sum of the number of occurrences of all words in the text $D_j$.

$$\mathrm{IDF}_i = \log \frac{|D|}{|\{j : t_i \in D_j\}|} \tag{3}$$

In Equation (3), $|D|$ is the total number of texts in the corpus and $|\{j : t_i \in D_j\}|$ is the number of texts containing the word $t_i$, that is, the number of texts with $n_{i,j} \neq 0$; if the word is not in the corpus, the denominator is zero. Therefore, $1 + |\{j : t_i \in D_j\}|$ is generally used. A high word frequency in a particular text together with a low text frequency of the word in the entire collection produces a high TF-IDF weight. Therefore, TF-IDF tends to filter out common words and retain important words.
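
As an illustration of the TF-IDF weighting described above, the following minimal Python sketch computes the weights for texts that have already been segmented and stripped of stop words. The function name `tf_idf` and the sample tokens are hypothetical; in the study the tokens would come from Jieba segmentation of Chinese reviews. The smoothed denominator (1 + document frequency) follows the text.

```python
import math
from collections import Counter

def tf_idf(docs):
    """docs: list of token lists (segmented, stop words already removed).
    Returns one {word: weight} dict per text."""
    n_docs = len(docs)
    # Document frequency: number of texts that contain each word.
    df = Counter()
    for tokens in docs:
        df.update(set(tokens))
    weights = []
    for tokens in docs:
        counts = Counter(tokens)
        total = sum(counts.values())  # occurrences of all words in this text
        weights.append({
            # TF * IDF, with the smoothed denominator 1 + df
            word: (c / total) * math.log(n_docs / (1 + df[word]))
            for word, c in counts.items()
        })
    return weights

# Hypothetical, pre-segmented review snippets (illustrative only).
docs = [
    ["compact", "elegant", "compact"],
    ["elegant", "quiet"],
    ["quiet", "sturdy"],
    ["sturdy", "modern"],
]
weights = tf_idf(docs)
```

In the first text, "compact" is both frequent locally and rare across the collection, so it receives a higher weight than "elegant". Note that with the smoothed denominator a word occurring in every text gets a slightly negative weight, which some implementations clip to zero.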
Since TF-IDF only extracts high-weight Kansei words, it cannot account for the semantic information of words in their overall context. The word vectors produced by the skip-gram model of Word2Vec retain the main semantic information: the neural network takes a word $w_t$ as input and tries to predict the words $w_{t+i}$ around it. For each input word $w_t$, a random index value i is selected from the skip-gram window c, this sampling is repeated k times, and all sampled values of i form the set J. The training goal is then defined as Equation (4) [64], and finally the word vectors of the Kansei vocabulary, which retain the semantic information of the words, are obtained.
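
For reference, the skip-gram training objective is commonly written in the form below, using the symbols $w_t$, $w_{t+i}$, and J introduced above. The number of training words T and the conditional probability p (usually a softmax over the vocabulary) are assumptions that the text does not define, so this is a sketch of the standard form rather than a verbatim restatement of Equation (4):

```latex
\frac{1}{T} \sum_{t=1}^{T} \sum_{i \in J} \log p\left(w_{t+i} \mid w_t\right)
```

Training maximizes this average log-probability, so that words sharing contexts end up with similar vectors.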

Grey Relational Analysis
Grey theory provides a metric called the "grey correlation coefficient" to measure the closeness between the compared alternatives and the benchmark alternative (reference series). For convenience, we have adopted a five-level scoring standard, and the details are as follows [41]:

Establishing the comparison series: assume that X is an m × n matrix collecting the performance ratings of n dimension elements in m alternatives, where $x_i(k)$ represents element k of alternative i, m is the number of series (alternatives), and n denotes the number of criteria (attributes):

$$X = \begin{bmatrix} x_1(1) & x_1(2) & \cdots & x_1(n) \\ x_2(1) & x_2(2) & \cdots & x_2(n) \\ \vdots & \vdots & \ddots & \vdots \\ x_m(1) & x_m(2) & \cdots & x_m(n) \end{bmatrix}$$

Setting the reference series: the elements of the reference series are set by their corresponding maximum rating scales (similar to an ideal solution):

$$x_r(j) = \begin{cases} \max_i x_i(j), & j \in J \\ \min_i x_i(j), & j \in J' \end{cases}$$

where $J$ denotes the set of benefit (the-larger-the-better) terms and $J'$ denotes the set of cost (the-smaller-the-better) terms.

Computing the "degree of greyness" between the target series and the reference series via the following deviation matrix:

$$\Delta_i(j) = \left| x_r(j) - x_i(j) \right|$$

where $x_i(j)$ and $x_r(j)$ represent the jth element of the target and the reference series, respectively.

The attribute-based grey correlation coefficient is calculated as follows:

$$\gamma\left(x_r(j), x_i(j)\right) = \frac{\min_i \min_j \Delta_i(j) + \xi \max_i \max_j \Delta_i(j)}{\Delta_i(j) + \xi \max_i \max_j \Delta_i(j)}$$

where the distinguishing coefficient $\xi \in (0, 1)$ is generally set to 0.5. Averaging the degree of grey relation to prioritize the product varieties:

$$\Gamma_i = \frac{1}{n} \sum_{j=1}^{n} \gamma\left(x_r(j), x_i(j)\right)$$
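
The GRA steps above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions: the reference series takes the best observed value per criterion (maximum for benefit terms, minimum for cost terms), the grades are a plain unweighted mean, and the function name and ratings are hypothetical.

```python
def grey_relational_grades(X, benefit, xi=0.5):
    """X: m alternatives x n criteria (list of lists of ratings).
    benefit[j] is True for larger-the-better criteria, False for cost criteria.
    xi is the distinguishing coefficient."""
    m, n = len(X), len(X[0])
    # Reference series: best observed value per criterion (assumption).
    ref = [max(row[j] for row in X) if benefit[j] else min(row[j] for row in X)
           for j in range(n)]
    # Deviation matrix: absolute difference from the reference series.
    delta = [[abs(ref[j] - X[i][j]) for j in range(n)] for i in range(m)]
    d_min = min(min(row) for row in delta)
    d_max = max(max(row) for row in delta)
    if d_max == 0:  # every alternative coincides with the reference
        return [1.0] * m
    grades = []
    for i in range(m):
        coeffs = [(d_min + xi * d_max) / (delta[i][j] + xi * d_max)
                  for j in range(n)]
        grades.append(sum(coeffs) / n)  # unweighted mean of the coefficients
    return grades

# Hypothetical five-level ratings: 3 alternatives, 3 criteria (last one is a cost).
scores = [[5, 3, 2], [4, 4, 1], [2, 5, 4]]
grades = grey_relational_grades(scores, benefit=[True, True, False])
```

With these ratings the second alternative obtains the highest grade, because it is never far from the reference series on any criterion.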

The Fuzzy TOPSIS
The idea of fuzzy TOPSIS is to find a positive ideal solution (PIS) and a negative ideal solution (NIS) as the comparison criteria for each alternative. By comparing the Euclidean distances between the alternatives and the ideal solutions, the closeness of each alternative is obtained, and the alternatives are then ranked. First, the fuzzy ratings of the target image given by the decision makers for each brand-new sample are aggregated: if the fuzzy rating of the k-th decision maker is $\tilde{x}_{ijk} = (a_{ijk}, b_{ijk}, c_{ijk})$, an aggregate fuzzy rating is computed for each new sample [67]. The aggregated ratings are then normalized to give the standardized fuzzy decision matrix, with B and C denoting the set of benefit criteria and the set of cost criteria, respectively. Taking into account the different importance of the criteria, the weighted normalized fuzzy decision matrix is constructed. The fuzzy optimal ideal solution $A^+$ and the fuzzy worst ideal solution $A^-$ are then defined, and for each new sample the distances between the candidate solution and $A^+$ and $A^-$ are calculated. Finally, the relative closeness of each new sample to the ideal solution is calculated as the final criterion to determine the sort order of all alternatives.
Finally, the new samples are arranged according to the descending order of closeness, and the new sample that ranked first is the best solution, which is the closest to the optimal solution and the farthest to the worst solution.
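The procedure can be illustrated with a short Python sketch. It assumes the widely used formulation of fuzzy TOPSIS for triangular fuzzy numbers, which may differ in detail from the equations of this study: ratings are aggregated as (minimum a, mean b, maximum c), benefit criteria are normalized by the largest upper bound, cost criteria by the smallest lower bound, and distances use the vertex method. All function names, ratings, and weights are hypothetical.

```python
import math


def aggregate(ratings):
    """Aggregate the triangular fuzzy ratings (a, b, c) of K decision makers."""
    return (min(r[0] for r in ratings),
            sum(r[1] for r in ratings) / len(ratings),
            max(r[2] for r in ratings))


def vertex_distance(p, q):
    """Vertex-method distance between two triangular fuzzy numbers."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(p, q)) / 3)


def fuzzy_topsis(matrix, weights, benefit):
    """Return the relative closeness of each alternative.

    matrix  : matrix[i][j] is the aggregated rating of alternative i on criterion j
    weights : one triangular fuzzy weight per criterion
    benefit : True for benefit criteria (set B), False for cost criteria (set C)
    """
    n_crit = len(weights)
    weighted = []
    for row in matrix:
        new_row = []
        for j, (a, b, c) in enumerate(row):
            if benefit[j]:
                c_star = max(r[j][2] for r in matrix)
                norm = (a / c_star, b / c_star, c / c_star)
            else:
                # Cost criteria assume strictly positive ratings.
                a_min = min(r[j][0] for r in matrix)
                norm = (a_min / c, a_min / b, a_min / a)
            # Weighted normalized rating (component-wise product).
            new_row.append(tuple(n * w for n, w in zip(norm, weights[j])))
        weighted.append(new_row)

    # Fuzzy optimal (A+) and worst (A-) ideal solutions per criterion.
    best = [(max(r[j][2] for r in weighted),) * 3 for j in range(n_crit)]
    worst = [(min(r[j][0] for r in weighted),) * 3 for j in range(n_crit)]

    closeness = []
    for row in weighted:
        d_plus = sum(vertex_distance(v, p) for v, p in zip(row, best))
        d_minus = sum(vertex_distance(v, n) for v, n in zip(row, worst))
        closeness.append(d_minus / (d_plus + d_minus))
    return closeness


# Hypothetical example: two decision makers, two alternatives, two benefit criteria.
BAD, GENERAL, GOOD, VERY_GOOD = (1, 3, 5), (3, 5, 7), (5, 7, 9), (7, 9, 9)
matrix = [
    [aggregate([VERY_GOOD, GOOD]), aggregate([GOOD, GOOD])],     # alternative 0
    [aggregate([GENERAL, BAD]), aggregate([GENERAL, GENERAL])],  # alternative 1
]
weights = [(5, 7, 9), (3, 5, 7)]  # hypothetical fuzzy criterion weights
closeness = fuzzy_topsis(matrix, weights, benefit=[True, True])
```

Sorting `closeness` in descending order yields the ranking; in this toy example alternative 0 dominates alternative 1 on every criterion and therefore receives the larger value.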

Information Entropy
Entropy refers to the degree of chaos in a system and measures the probability of the system being in a certain macroscopic state. In information theory, entropy has become a powerful indicator for measuring the amount of uncertainty in information [57]. Assuming that the uncertainty of the information is represented by a discrete probability distribution over k categories (p 1 , p 2 , . . . , p k ), a distribution whose probabilities p i are spread evenly carries more uncertainty than one concentrated on a few categories. Specifically, information entropy is defined as [57] $E(p_1, \ldots, p_k) = -\phi_k \sum_{i=1}^{k} p_i \ln p_i$, where $\phi_k = 1/\ln(k)$ is a positive constant that ensures that the entropy is limited to the interval [0, 1]. Then, the weight w j of the target attribute in the product evaluation process can be derived as $w_j = (1 - E(x_j)) / \sum_{l=1}^{n} (1 - E(x_l))$, where E(x j ) represents the entropy of attribute j and n is the number of attributes.
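A minimal Python sketch of the entropy weighting follows. It assumes the usual entropy weight method: each attribute column is normalized into a probability distribution, its entropy is scaled by 1/ln(k), and the weight is proportional to one minus the entropy. The function names and the score matrix are hypothetical.

```python
import math


def entropy(probabilities):
    """Normalized Shannon entropy in [0, 1] of a discrete distribution (k >= 2)."""
    k = len(probabilities)
    phi_k = 1 / math.log(k)
    # Terms with p = 0 contribute nothing (limit of p * ln p as p -> 0).
    return -phi_k * sum(p * math.log(p) for p in probabilities if p > 0)


def entropy_weights(matrix):
    """Return one weight per attribute.

    matrix[i][j] is the non-negative score of sample i on attribute j.
    Assumes every column has a positive sum and that at least one column
    is not perfectly uniform (otherwise the denominator would be zero).
    """
    entropies = []
    for column in zip(*matrix):
        total = sum(column)
        entropies.append(entropy([x / total for x in column]))
    diversification = [1 - e for e in entropies]
    denominator = sum(diversification)
    return [d / denominator for d in diversification]


# Hypothetical scores of three samples on two attributes.
scores = [[4, 1], [5, 4], [6, 7]]
weights = entropy_weights(scores)
```

The second attribute separates the samples more strongly than the first, so its entropy is lower and it receives the larger weight.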

Construction of the Semantic Space of Product Modeling
In this section, a smart capsule coffee machine is taken as the research object. The shape design of the smart capsule coffee machine is relatively complicated, and many features interact with each other to convey a unique Kansei image to users. Therefore, this study selected the shape of the smart capsule coffee machine as the design object to verify the feasibility of the method. Firstly, popular smart capsule coffee machine products sold on the internet were collected to capture the product Kansei semantic image space. Customer reviews were collected from two mainstream e-commerce websites (Taobao and Jingdong). The smart capsule coffee machines that users prefer were identified from the most recent year's sales and the number of reviews and were used as test samples. Ten products were selected from each platform, giving 20 smart capsule coffee machine products for this case study, and the review pages of each product were used as the text data for analysis. On the review pages, a web crawler was used to capture the text content of the reviews, excluding the user's name, date, rating, and pictures, so that the results of the text mining only reflect the customer's Kansei image. To ensure that the selected products are sought after by consumers, at least 10 pages of comments were crawled for each sample on the Taobao platform and at least 15 pages for each sample on the Jingdong platform. The collected comments comprised 127,765 words in total and were exported to a .TXT file for subsequent text mining analysis.
Then, the Jieba tool in Python was used to process the user review text: word segmentation was performed, irrelevant stop words were removed, and a word cloud of the segmented text was created. It was found that most users evaluate smart capsule coffee machine products from a comprehensive perspective, covering aspects such as appearance, style, and quality. Feature words were then extracted from the word segmentation results based on Equation (1) to obtain the keywords and weights of the online user comments for these 20 samples, and the top 100 user evaluation keywords were selected in descending order of weight. Part of the results is shown in Table 2. According to Table 2, the 100 user evaluation keywords include some adjectives related to the user's emotional feelings after experiencing a product, as well as some usage-related words, such as operation, Starbucks, coffee shop, cleaning, sound, milk froth, taste, making, and so on. To capture the users' inner emotional feelings about the product form, the words that reflect these feelings were selected from the 100 keywords, and 18 Kansei words were extracted. In this way, the users' real perceptual demand information was mined from the online review data. However, capturing only the users' perceptual appeal to the product is often insufficient. Product form design begins with the designer's ideographic communication and ultimately depends on the user's feelings and understanding; its core lies in expression and communication.
Previous studies on the tactile comparison of textile materials [68], wine aroma [69], pearl quality [70], the aesthetic characteristics of artistic works [71], and especially sensory evaluation [72][73][74] have demonstrated that emotion perception and evaluation vocabulary vary with professional level. In fact, the designer's demands are built into the product design, and the resulting meaning differs from the meaning of the product in the eyes of the user. This contradiction can lead to the failure of the product design. To this end, it is necessary to investigate the demands of experts and designers on the product in order to broaden the dimensions of the Kansei mining. Specifically, interviews with experts and designers were used to investigate the semantic space of the perceptual images corresponding to the samples. The 20 smart capsule coffee machine product samples were processed at A4 size with a uniform resolution of 100 dpi, and the background was removed to make the samples clear. A group of eight experienced product designers (four males and four females, aged 21-40) participated in the survey of the product perceptual image factors. The subjects were asked to evaluate and describe the product samples perceptually, the designers' perceptual semantic vocabulary for the product was extracted from the results, and the KJ method was used to merge the results further, finally yielding seven Kansei image words. These seven words also appear in the web crawler corpus of the smart capsule coffee machine products but are not among the top 100 keywords. The designer Kansei vocabulary was then merged with the user Kansei vocabulary obtained from the online reviews.
Finally, 25 composite Kansei image words for the smart capsule coffee machine products were obtained (see Table 3).

Extraction of Key Semantic Features
According to Table 3, directly investigating the cognition of users and experts on these 25 Kansei adjectives would increase the cognitive load on both groups. Additionally, these 25 Kansei words may partially overlap in meaning, and describing the product style as a 25-dimensional vector also increases the difficulty of data computation, which is not conducive to obtaining effective experimental results. To this end, it is necessary to reduce the dimensionality of the product styling image and identify the key semantic adjectives for the smart capsule coffee machine. Firstly, to take the overall semantic context into account, the 25 semantic words were converted into word vectors with Word2Vec so as to express their meaning accurately. Specifically, Word2Vec was trained on the crawled online user review data with the word vector dimension set to 200, which realizes the parameterized expression of the Kansei words and reveals the semantic features of the vocabulary (see Table 4). A dimensionality reduction operation was then performed on the obtained Kansei semantic matrix in order to condense the 25 image semantic words without losing explanatory power. Firstly, the KMO test of sampling adequacy and Bartlett's test of sphericity were used to determine whether the Kansei semantics are suitable for factor analysis: KMO = 0.866, Bartlett's test value = 9220.807, degrees of freedom = 253, and significance = 0.000. The KMO value is greater than 0.5, which means that the sampling is adequate. The significance is 0.000, which is less than 0.05, so the correlations among the variables are significant (see Table 5).
Therefore, the data can be further analyzed by factor analysis. The principal component analysis method was used to extract the common factors, with the selection rule that the eigenvalue is greater than 1. The maximum variance (varimax) method was then used to rotate the factors orthogonally, which makes the meaning of each factor more obvious and easier to explain, and the rotated component matrix was obtained (see Table 6). As a result, three factors were extracted (see Table 6), which together explain 82.706% of the total variance. This suggests that these three factors describe the modeling style characteristics of the experimental samples well. The loadings of the first factor range from 0.611 to 0.858, and it is named exquisite and simple. The loadings of the second factor range from 0.676 to 0.875, and it is named personalized and fashionable. The loadings of the third factor range from 0.324 to 0.699, and it is named dynamic and flexible.

Comparison of Cognitive Preferences between Designers and Users
Taking the three groups of Kansei words (exquisite and simple, personalized and fashionable, and dynamic and flexible) as the evaluation carrier, the designers' and users' perceptions of the product were quantified through an experimental investigation, and their emotional preferences for the smart capsule coffee machine were then analyzed. Firstly, through the semantic differential (SD) method, 30 designers and 30 users rated the 20 samples on these three semantic spaces using a 7-point Likert scale (see Table 7). For each of the two groups (designers and users), the ratings were averaged per sample. Next, a t-test was used to explore whether there are statistically significant differences between the designers and the users in the average Kansei image value of each product. Taking the exquisite and simple Kansei image as an example, an independent-samples t-test was performed, and the results are shown in Table 8. According to Table 8, p = 0.001 < 0.05. Therefore, the designers and the users differ significantly at the 95% confidence level on the exquisite and simple Kansei image of the product. Likewise, there is a significant difference between the designers and the users at the 95% confidence level for the dynamic and flexible Kansei image (p = 0.033), while there is no significant difference for the personalized and fashionable Kansei image (p = 0.06).
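The comparison can be illustrated with a small Python sketch that computes the independent-samples t statistic. It assumes the pooled-variance (Student) form of the test; the paper does not state which variant was used. The function name and the per-sample mean scores are hypothetical.

```python
import math
import statistics


def independent_t(sample_a, sample_b):
    """Return the pooled-variance t statistic and its degrees of freedom."""
    n_a, n_b = len(sample_a), len(sample_b)
    mean_a, mean_b = statistics.mean(sample_a), statistics.mean(sample_b)
    var_a, var_b = statistics.variance(sample_a), statistics.variance(sample_b)
    df = n_a + n_b - 2
    # Pooled estimate of the common variance of the two groups.
    pooled = ((n_a - 1) * var_a + (n_b - 1) * var_b) / df
    t = (mean_a - mean_b) / math.sqrt(pooled * (1 / n_a + 1 / n_b))
    return t, df


# Hypothetical mean Kansei scores of six samples, rated by each group.
designer_means = [5.2, 4.8, 5.5, 5.0, 4.9, 5.3]
user_means = [4.1, 4.4, 3.9, 4.6, 4.2, 4.0]
t_stat, df = independent_t(designer_means, user_means)
```

With 10 degrees of freedom, the two-sided critical value at the 0.05 level is about 2.228, so a t statistic whose absolute value exceeds it indicates a significant difference between the groups. A statistics package such as SciPy (`scipy.stats.ttest_ind`) returns the p-value directly.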

Designer and User Emotion Matching Analysis
In order to study how the emotional responses to the smart capsule coffee machine perceived by the designers and the users relate to each other, a correlation analysis was carried out. If the feeling that the product evokes in consumers is consistent with the emotional feeling the designer intends to promote, the design can be considered to have a high matching quality, which effectively reduces the cognitive difference between designers and users. Firstly, five design experts with more than 5 years of design experience were recruited to form a designer group, which then classified the smart capsule coffee machines promoted in the market. According to their suggestions, the 20 products were grouped into five categories, which cover the most popular and typical smart capsule coffee machines on the market (see Table 9). Based on these five product modeling categories, a semantic differential experiment was carried out to quantify the experts' semantic preference for the Kansei vocabulary, and the corresponding perceptual values were obtained. Then, to quantify the quality of the emotional matching between the designers and users, a correlation analysis was carried out on the two sets of quantitative data collected from the users and the experts. The experts' category image scores were entered into SPSS as the target standard, and Pearson's correlation coefficient (Pearson's r) between the users' and the experts' scores was calculated; it represents the strength of the correlation between the designers and the users and thus the quality level of the emotional matching. A positive correlation means that the consumers' feelings are consistent with the effects promoted by the design strategy, while a negative correlation means the opposite. A correlation close to zero is considered irrelevant (Table 10).
To define thresholds for the correlation coefficient, some experts have proposed that a correlation greater than 0.8 is usually described as strong, and a correlation less than 0.5 as weak [75]. Hence, the product form design of each smart capsule coffee machine is classified as follows: (1) a design with a correlation coefficient above 0.8 is a high-quality design; (2) a design with a correlation coefficient between 0.5 and 0.8 is a medium-quality design; and (3) a design with a correlation coefficient below 0.5 is a low-quality design. Except for sample 11, the seven smart capsule coffee machines in Table 10 are all high-quality matching products, whereas the Pearson's r of sample 11 is −0.998, which indicates a negative correlation. Accordingly, the seven high-quality form matching design products were compared and analyzed (Table 11) to explore the factors that affect the matching quality, which will help industrial designers and manufacturers re-evaluate their product designs to improve consumer satisfaction.
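The matching rule can be sketched in Python as follows. The sketch computes Pearson's r directly from its definition and applies the three thresholds stated above; the function names and the example scores are hypothetical, and the study itself used SPSS for this step.

```python
import math


def pearson_r(x, y):
    """Pearson's correlation coefficient of two equally long score lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def matching_quality(r):
    """Classify the designer-user matching quality from Pearson's r.

    Negative coefficients fall into the "low" class under this rule; the
    text additionally describes such products as negatively correlated.
    """
    if r > 0.8:
        return "high"
    if r >= 0.5:
        return "medium"
    return "low"


# Hypothetical scores of one product on the three Kansei images.
expert_scores = [5.8, 4.2, 3.5]
user_scores = [5.5, 4.0, 3.9]
r = pearson_r(expert_scores, user_scores)
quality = matching_quality(r)
```

Because each product is scored on only three Kansei images, coefficients computed this way are based on very few points and should be read as a coarse indicator.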

Analyzing the Influence Ranking of the Product Components
To quickly and accurately identify the key product form items that affect the user's Kansei image, the influence of the product form items on the Kansei words was ranked by priority. Grey relational analysis was chosen because it takes the grey system of "small sample, poor information" as its research object. Therefore, based on the morphological evaluation indicators of the seven products explored above, a grey analysis model was constructed to establish the quantitative relationship between the morphological characteristics of the smart capsule coffee machine and the users' Kansei image. Firstly, to analyze the product form characteristics, a research team of 10 people was formed, including three designers, five industrial design graduate students, and two design teachers. The morphological analysis method was used to analyze and extract the design elements of the seven research samples, and the design elements and category characteristics of the smart capsule coffee machine products were summarized. According to the survey results, six main morphological items (denoted X1~X6) and the related specific modeling design elements were extracted (see Table 12). Furthermore, one product (sample 17) was added at the suggestion of the research team, which increased the number of samples to eight. Then, 30 users and designers were surveyed with a 7-level semantic differential (SD) questionnaire on the six styling items of the eight samples for the three product Kansei semantic spaces: exquisite and simple, personalized and fashionable, and dynamic and flexible. The survey was conducted through an online questionnaire website, and the ratings were averaged per sample. Subsequently, the results of the Kansei image survey for personalized and fashionable and the corresponding scores of each sample modeling element were calculated (see Table 13).
Accordingly, the grey correlation degree between the products' modeling items and the Kansei images was calculated and prioritized. Following the previous literature [51], the distinguishing coefficient ξ was set to 0.5 in this study to obtain the importance ranking of the modeling items (see Figures 4 and 5). For the first Kansei word pair, exquisite and simple, the resultant rank sequence of the form items shown in Figure 4a is X6 > X2 > X1 > X5 > X4 > X3; for the second Kansei word pair, personalized and fashionable, the rank sequence shown in Figure 4b is X6 > X2 > X1 > X5 > X3 > X4; and for the third Kansei word pair, dynamic and flexible, the rank sequence shown in Figure 4c is X6 > X2 > X5 > X1 > X4 > X3. These rank sequences show that X1, X2, and X6 are closely related to all three Kansei images. X6 has the highest influence in all three, and the influence of X1 and X2 is always smaller than that of X6; the other form elements show varying degrees of influence, depending on the Kansei words considered. Since X5 ranks fourth for the exquisite and simple and the personalized and fashionable Kansei images and third for the dynamic and flexible Kansei image, there is a certain degree of relevance between its characteristics and the product Kansei image. Therefore, X5 is also a key factor in the design of the coffee machine. In contrast, X3 and X4 contribute relatively little to the product's Kansei images, and these lower-ranked form elements may be ignored. Accordingly, for smart capsule coffee machine products to present the exquisite and simple, personalized and fashionable, and dynamic and flexible Kansei images, the design should focus on the X1, X2, X5, and X6 styling feature items.

Product Form Evaluation Index Construction
The X1, X2, X5, and X6 form items of the smart capsule coffee machine have the most significant influence on the overall form of the product. Therefore, five designers were interviewed about the key design strategies for these four parts in order to explore the design method of the product. To avoid similar or repeated items among the indicators obtained from the interviews, which would affect the effectiveness of the evaluation, the KJ method was used to merge similar categories, and 19 evaluation indicators for the morphological characteristic elements of the smart capsule coffee machine were constructed (see Table 14). Based on these 19 indicators, the pros and cons of the product schemes can be further analyzed.
Table 14. Evaluation index of the morphological elements of the smart capsule coffee machine.

Components Category    Code    Evaluation Index
Upper body (S1)        S11     Upper morphological image
                       S12     The ratio of the upper body to the fuselage
                       S13     Interface button layout
                       S14     Interface location
                       S15     Outlet shape style
                       S16     The ratio of the water outlet to the upper part of the whole
Fuselage (S2)          S21     The outline of the fuselage
                       S22     The shape style of the fuselage
                       S23     Morphological image of the fuselage
                       S24     The ratio of the fuselage to the overall shape
                       S25     Decorative pattern
Overall style (S3)     S31     Main body style
                       S32     Body trend
                       S33     Body-scale relationship
                       S34     Morphological image
                       S35     Unity of function and style
Water tank (S4)        S41     The ratio of the water tank to the whole body
                       S42     The degree of coordination between the shape of the water tank and the overall style
                       S43     The position relationship between the water tank and the whole

Computing the Importance of Indicators
A decision-making group of 10 users and design experts evaluated the above 19 indicators for the 8 samples. The questionnaire results were normalized to construct a decision matrix, the entropy of each indicator was computed, and the weights of the 19 indicators were finally obtained (see Table 15). For convenience, the importance of the core features is shown in Figure 6. The target image weights were then converted into triangular fuzzy numbers according to the conversion scale in Table 16, so that the results are more in line with the cognitive thinking characteristics of the users.
Table 16 (partial): (5, 7, 9); 0.1-0.2, (7, 9, 9)

Product Priority Decision-Making
According to the fuzzy rating standards corresponding to the linguistic variables in Table 16, 33 decision makers (6 design teachers, 5 product managers, and 22 industrial design graduate students) evaluated 10 smart capsule coffee machine product design schemes (P 1 , P 2 , P 3 , P 4 , P 5 , P 6 , P 7 , P 8 , P 9 , P 10 ) on the 19 evaluation indicators (see Table 14). To minimize the uncertainty of the information and make the evaluation results more objective and accurate, the linguistic variables were directly converted into triangular fuzzy numbers (see Table 17), and the weighted overall evaluation of the 33 decision makers was obtained according to Equations (13)-(19).
Table 17. Fuzzy rating of the schemes corresponding to the linguistic variables.

Linguistic Variables    Fuzzy Rating
Very bad                (1, 1, 3)
Bad                     (1, 3, 5)
General                 (3, 5, 7)
Good                    (5, 7, 9)
Very good               (7, 9, 9)

According to Equations (22) and (23), the distances between each new sample and the fuzzy optimal solution and the worst solution can be obtained (see Tables 18 and 19). According to Equation (25), the relative closeness between each new sample and the ideal solution can be obtained, as shown in Table 20 (see also Figure 7). The priority order of the 10 smart capsule coffee machine product samples is P 8 > P 7 > P 9 > P 1 > P 5 > P 6 > P 2 > P 3 > P 4 > P 10 . The best solution for product modeling design is therefore P 8 , followed by P 7 and then P 9 , and P 8 is established as the optimal form design scheme of the smart capsule coffee machine.

Sensitivity Analysis
In order to further examine the robustness of the entropy-based fuzzy TOPSIS evaluation, a sensitivity analysis was carried out by changing the weights of the product evaluation criteria. Specifically, the criterion weights obtained from the entropy-based fuzzy evaluation were perturbed. In each round of the sensitivity analysis, the weights of two evaluation criteria were exchanged, while the weights of the other criteria remained the same. The new criterion weights were then combined with fuzzy TOPSIS to rank the candidates. Following the previously published literature [11], only the weight of the first evaluation criterion was exchanged sequentially with those of the other criteria. In the first round (Z1), the original weights w 1 , w 2 , ..., w 18 , w 19 were used. In the second round (Z2), the weights of criteria S 11 and S 12 were exchanged, so the criterion weights became w 2 , w 1 , ..., w 18 , w 19 . Similarly, in the 19th round (Z19), the weights of criteria S 11 and S 43 were exchanged, and the new criterion weights were w 19 , w 2 , ..., w 18 , w 1 . The detailed sensitivity analysis results are shown in Table 21 and Figure 8. It can be seen from Table 20 and Figure 8 that the relative closeness (Ci) of P 8 , P 7 , and P 9 is the highest, although under most weight changes the values of these options are very close. Exchanging the weights of two evaluation criteria does not change the ranking result much. Specifically, scheme P 8 ranked first in 13 of the 19 rounds, that is, in 68.4% of the cases. Therefore, the proposed entropy-based fuzzy TOPSIS method is relatively stable and reliable for the evaluation of design concepts.
In addition, there are many different change strategies in the sensitivity analysis; we only list the exchanges between criterion S 11 and other criteria. Furthermore, more extension methods can be introduced to further enhance the sensitivity analysis.
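The weight-exchange scheme can be sketched in Python as follows. The sketch generates rounds Z1 to Zn by swapping the first criterion weight with each other weight in turn and records which alternative ranks first in each round. To stay self-contained it scores the alternatives with a simple weighted sum, which is only a stand-in for the fuzzy TOPSIS closeness used in the study; all names, scores, and weights are hypothetical.

```python
def swap_rounds(weights):
    """Return (label, weights) pairs for the rounds Z1..Zn.

    Z1 keeps the original weights; round Z(j+1) exchanges the weight of
    the first criterion with the weight of criterion j (0-based index).
    """
    rounds = [("Z1", list(weights))]
    for j in range(1, len(weights)):
        swapped = list(weights)
        swapped[0], swapped[j] = swapped[j], swapped[0]
        rounds.append((f"Z{j + 1}", swapped))
    return rounds


def weighted_scores(scores, weights):
    """Stand-in scoring rule (weighted sum), NOT the fuzzy TOPSIS closeness."""
    return [sum(s * w for s, w in zip(row, weights)) for row in scores]


# Hypothetical crisp scores: two alternatives rated on three criteria.
scores = [[0.9, 0.2, 0.5], [0.3, 0.8, 0.6]]
weights = [0.5, 0.3, 0.2]

rounds = swap_rounds(weights)
winners = []
for label, round_weights in rounds:
    totals = weighted_scores(scores, round_weights)
    winners.append(totals.index(max(totals)))  # index of the top-ranked alternative

# Share of rounds in which alternative 0 ranks first.
share_first = winners.count(0) / len(winners)
```

In the study, the same share is reported for scheme P 8 : it ranks first in 13 of the 19 rounds, that is, 13/19 ≈ 68.4%.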

Discussion and Conclusions
To survive under global competition and in a fast-changing environment, product development must deliver innovations that are popular with consumers, that is, innovations that meet the demands of the target users. This product strategy attempts to customize the best product to adapt to the different needs of different markets. To realize the symmetry of consumer and designer perceptions and preferences for smart capsule coffee machines, this research proposes a new product design evaluation method that combines natural language processing and the fuzzy TOPSIS method based on Kansei engineering (KE) to support product design decision-making and optimization. The Kansei needs of consumers are crawled from online reviews, and the task of capturing the users' Kansei images is completed automatically with natural language processing. Furthermore, the product design elements are quantified, and the key design elements of the product are extracted with GRA to establish the product evaluation indicators. The fuzzy TOPSIS method is then used to rank the products by priority based on user preferences. As a result, the design work can be accelerated through the automatic extraction of users' Kansei needs, the uncertainty and subjectivity of design concept evaluation in a group decision-making environment can be reduced, and the objectivity and credibility of the ranking of the design alternatives can be improved. The main contributions of this article are summarized as follows:

1. To meet the needs of users and speed up the product development process, this research proposes a user-driven automated product design framework that integrates text mining and KE to identify products with symmetrical cognitive information between users and designers, which helps companies and designers to better complete product customization and decision-making.

2. To capture customers' perception of a product's emotional characteristics effectively, a product Kansei image acquisition method based on online user review data is proposed. It mines the users' real Kansei evaluation information from large volumes of online reviews more quickly, without relying on the interviews and questionnaires of traditional KE, and thus establishes a more efficient Kansei design approach.

3. Parameterizing the Kansei vocabulary as semantic vectors through NLP can reduce the deviation between real preferences and the preferences shown in traditional questionnaire surveys, while saving research time so that designers can perform related value-added work, consequently improving the symmetry between the extracted needs and the real needs of users.

4. Utilizing GRA to identify the associations between the user's preferred modeling elements and the Kansei image makes it possible to predict the importance priority of each design element of the product.

5. Introducing fuzzy theory into the TOPSIS method and using a conversion scale converts the linguistic variables into triangular fuzzy numbers. Fuzzy logic addresses the uncertainty and ambiguity of human thinking, so the accuracy of the experimental results is improved and the subjectivity of judgment is reduced.

6. By adopting a t-test and correlation analysis, the products that fit the psychological preferences of both designers and users are extracted precisely, so as to realize the symmetry between the designers' and the users' cognitive information.
On the other hand, the limitations of this study are as follows:

1. The current system only collects information from online reviews on e-commerce platforms, which is limited. Information from other sources, such as consumer reports and social media, should be included so as to explore customer needs from more comprehensive dimensions.

2. The method used in this research makes decisions on the design plan based on the evaluation of existing products in the market. However, there are differences between the evaluation results for marketed products and an actual product design plan. In the future, product evaluation and decision-making in the design concept stage should be explored.

3. It is necessary to collect and classify more background information from online customers to provide a research path for market segmentation and for the personalized development and design of products.

In addition, in the process of acquiring users' demands, relatively old reviews should be given less weight. Therefore, in the future, the quantification of the time factor should be integrated into the user demand analysis to make the acquisition of user demands more effective. Moreover, the evaluation of products should start from a multi-dimensional perspective, including structure and material, so as to make the evaluation results more comprehensive for product design.
Funding: This work was supported by the Anhui University talent introduction research start-up funding project (No. S020318019).

Institutional Review Board Statement: Not applicable.
Informed Consent Statement: Not applicable.

Data Availability Statement: The experimental data used to support the findings of this study are included in the article.

Conflicts of Interest: The author declares no conflict of interest.