Which Influencers Can Maximize PCR of E-Commerce?
Abstract
:1. Introduction
1.1. Social Influencers as a Marketing Medium
1.2. Related Works
Name of Thesis | Published in | Research Topics | Characteristics | Dataset | |
1 | CID: Categorical Influencer Detection on microtext-based social media | Emerald Insight, Vol. 44 No. 5, pp. 1027–1055. | Proposed a system, Categorical Influencer Detection (CID), to identify categorical influencers on social media channels | Uses a Variational Autoencoder (VAE) to simulate the LDA process | Facebook, Twitter, YouTube, Instagram datasets |
2 | A Graph Based Approach for Effective Influencer Marketing | International Journal of Data Mining and Knowledge Management Process (IJDKP) Vol. 9, No. 4, 2019 | Proposed an algorithm to select the minimum number of influencers that can reach the desired audience | Uses a greedy algorithm to rationalize the minimum number of influencers | 500 influencers from Pakistan with 5000 to 300,000 followers |
3 | Measuring Time-Sensitive and Topic-Specific Influence on Social Networks with LSTM and Self-Attention | 2020, IEEE Access, Vol. 8, pp. 82,481–82,492 | Proposed an advanced framework to measure social network influence in terms of both time and topic | Uses SeededLDA, advanced GAT, and matrix-based LSTM to measure time and topic simultaneously | Three labeled datasets from Twitter and Reddit |
4 | Influencer Attribute Analysis-based Recommendation System | Journal of the Korea Institute of Information and Communication Engineering, Vol. 23, No. 11: 1321~1329, Nov. 2019 | Proposed a system to recommend the best influencer for marketing or categorical users | Uses a TF-IDF, LDA algorithm to make recommendations | Numerical data from YouTube videos which advertise the application “Swipe”. |
5 | Analyzing Impacts of Country, Product, and Influencer on Purchase Intention in the Chinese Cosmetics Market | International Commerce and Information Review, Volume 22, Number 2, June, 2020: pp. 309~329 | Proposed the advertisement effect which can influence customer buying intentions | Uses a structural equation model to analyze relationships and uses an SPSS program to check the validity of data for comparison | Survey of online customers that buy cosmetic products |
6 | Metrological Analysis of Online Consumption Evaluation Influence Commodity Marketing Decision Based on Data Mining | Mathematical Problems in Engineering, Volume 2020 | Proposed a model suitable for any platform and any commodity to mine the potential information behind ratings and reviews of online products | Creates a prediction model using an intelligent algorithm BP neural network and a fuzzy comprehensive evaluation model based on principal component analysis | Scores and comments on Amazon Market |
1.3. Our Works
2. Background
2.1. RNN and LSTM Background
2.1.1. Recurrent Neural Network (RNN)
2.1.2. Long Short-Term Memory (LSTM)
2.2. Transformer Model
2.3. TF-IDF
2.4. Collaborative Filtering
3. Research Method
3.1. Post Data Crawling of a Naver Blog
Algorithm 1 Naver Blog Crawler |
INPUT: product name save search url as baseurl + product name WHILE end of the page page scroll down END WHILE get url link from each block FOR url IN url list search each url make dataframe set column ‘Title’, ‘Blogger’, ‘Post URL’, ‘Post’, “Image_num’, ‘Paragraph num’, ‘Comment num’, ‘Video num’, ‘weekly viewer mean’, ‘Sympathy num’, ‘AD’, ‘Posting Date’ scroll each element according to column make keyword list that implies AD use image tesseract to recognize character IF keyword in Post or Tesseract post is AD END IF END FOR save dataframe into csv file |
3.2. Survey (i.e., Human Evaluation)
3.3. Click Data
3.4. Review Data Crawling and Labeling of Web Shopping Mall (i.e., Coupang and Naver Smartstore)
3.5. Review Data Crawling (Oliveyoung)
3.6. TF-IDF Analysis
Algorithm 2 TF-IDF |
INPUT: D < - blog posts TF < - term frequency dictionary DF < - DF dictionary initialized all 0 for each post in D do tokenizing for token in tokenized post do If text length < 1 then delete text end if end for stemming end for for tokenized post in D do for token in tokenized post do if token not in TF then TF[token] < - TF else TF[token] < - TF[token] + TF end if end for end for for term in TF do for tokenized post in D do if term in tokenized post then DF[term] < - DF[term] + 1 end if end for end for get TF-IDF from TF and DF |
3.7. Word Frequency Analysis
Algorithm 3 Text Word Frequency |
read csv file FOR post IN csv file Post use konlpy to extract each words count each word’s frequency filter it by most common word get font of each post END FOR make a histogram about word frequency |
3.8. Noun-Josa Ratio and Word Count Analysis
4. The Proposed Scheme
4.1. Framework
4.2. Quantitative Analysis
4.2.1. Text Score (Readability)
4.2.2. Statistical Information Score (Influencer Power)
4.3. Qualitative Analysis
4.4. Influencer Recommender System
5. Results
6. Conclusions
6.1. Contribution
6.2. Limitations and Etical Considerations
6.3. Future Plan
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Huynh, T.; Nguyen, H.; Zelinka, I.; Dinh, D.; Pham, X.H. Detecting the Influencer on Social Networks Using Passion Point and Measures of Information Propagation. Sustainability 2020, 12, 3064. [Google Scholar] [CrossRef] [Green Version]
- Quan, T.T.; Mai, D.T.; Tran, T.D. CID: Categorical Influencer Detection on microtext-based social media. Online Inf. Rev. 2020, 44, 1027–1055. [Google Scholar] [CrossRef]
- Zheng, C.; Zhang, Q.; Long, G.; Zhang, C.; Young, S.D.; Wang, W. Measuring Time-Sensitive and Topic-Specific Influence in Social Networks with LSTM and Self-Attention. IEEE Access 2020, 8, 82481–82492. [Google Scholar] [CrossRef] [PubMed]
- Park, J.R.; Park, J.; Kim, M.; Oh, H. Influencer Attribute Analysis based Recommendation System. J. Korea Inst. Inf. Commun. Eng. 2019, 23, 1321–1329. [Google Scholar] [CrossRef]
- CHEN Qing-Yu, CHO Hyuk-So Analyzing Impacts of Country, Product and Influencer on Purchase Intention in the Chinese Cosmetics Market. Int. Commer. Inf. Rev. 2020, 22, 309–332. [CrossRef] [Green Version]
- Xu, Y.-H.; Huang, L.-F.; Guo, R.-R.; Zhang, X.-Y.; Zhu, J.-M. Metrological Analysis of Online Consumption Evaluation Influence Commodity Marketing Decision Based on Data Mining. Hindawi Math. Probl. Eng. 2020, 2020, 9345901. [Google Scholar] [CrossRef]
- Wu, S.; Wingate, N.; Wang, Z.; Liu, Q. The Influence of Fake Reviews on Consumer Perceptions of Risks and Purchase Intentions. J. Mark. Dev. Compet. 2019, 13. [Google Scholar] [CrossRef]
- Das, R.K.; Dash, S.S.; Das, K.; Panda, M. Detection of Spam in YouTube Comments Using Different Classifiers. In Advanced Computing and Intelligent Engineering; Springer: Singapore, 2020; pp. 201–214. [Google Scholar]
- Samsudin, N.M.; Foozy, C.F.M.; Alias, N.; Shamala, P.; Othman, N.F.; Din, W. Youtube spam detection framework usingnaïve bayes and logistic regression. Indones. J. Electr. Eng. Comput. Sci. 2019, 14, 1508. [Google Scholar] [CrossRef]
- Ezpeleta, E.; Iturbe, M.; Garitano, I.; de Mendizabal, I.V.; Zurutuza, U. A mood analysis on youtube comments and a methodfor improved social spam detection. In Proceedings of the Hybrid Artificial Intelligent Systems: 13th International Conference, HAIS 2018, Oviedo, Spain, 20–22 June 2018. [Google Scholar] [CrossRef]
- Hussain, N.; Turab Mirza, H.; Rasool, G.; Hussain, I.; Kaleem, M. Predilection decoded: Spam Review Detection Techniques: A Systematic Literature Review. Appl. Sci. 2019, 9, 987. [Google Scholar] [CrossRef] [Green Version]
- Cauteruccio, F.; Corradini, E.; Terracina, G.; Ursino, D.; Virgili, L. Extraction and analysis of text patterns from NSFW adult content in Reddit. Data Knowl. Eng. 2022, 138, 101979. [Google Scholar] [CrossRef]
High/Low (w/Video) | |
---|---|
Rate | 46.167 |
Gif num | 4.2 |
Weekly viewer mean | 1.693 |
Sticker num | 1.571 |
Image num | 1.545 |
Sympathy num | 1.094 |
Word count | 0.961 |
Video num | 0.980 |
Buddy num | 0.733 |
Comments (Korean Translated to English) | Label | |
---|---|---|
0 | Wow~ After putting it together like this, it really is the end of the year!! It’s so cool, teacher ^^ (smile face emoticon) I feel like I’ve seen the exhibition without giving money. | 1 |
1 | Each work is full of sincerity. These are wonderful works. | 1 |
2 | Every time I look at the pretty doll house very well. I think I should express my gratitude once a year, so I left a little comment. | 1 |
3 | Miniatures are overflowing with imagination! I’ve been looking for some good info. Have a relaxing evening~~!! | 1 |
4 | It’s really not easy to upload every single day, but what a great writer!! The works are also very healing and good:) | 1 |
5 | This is a good post ^^ (smile face emoticon) Like it~ I hope you always have a happy time^^ I would appreciate it if you could visit my blog too haha | 0 |
6 | What about China? When it comes to Korea, it’s kimchi. | 0 |
Feature | Ratio |
---|---|
Word frequency score | 0.05 |
TF-IDF score | 0.15 |
Noun/josa score | 0.05 |
Word count score | 0.10 |
Image score | 0.10 |
Video score | 0.10 |
GIF score | 0.20 |
Sticker score | 0.15 |
Img word score | 0.10 |
Terms | Meaning |
---|---|
x | Values obtained through quantitative analysis (0~1) |
y | Values obtained through qualitative analysis (0~1) |
α | Ratio of x(α + β = 1) |
β | Ratio of y(α + β = 1) |
I1(x) | 1 if x in an IT-related article 0 if it is a non-IT-related article |
I2(x) | 1 if x in a non-IT-related post 0 if it is an IT-related post |
f1(x) | Value obtained according to the ratio of objective sentences (0~1) |
f2(x) | Value obtained according to the ratio of subjective sentences (0~1) |
PCR | Purchase Conversion Rate |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Oh, H.; Lee, J.; Lee, J.-S.; Kim, S.-M.; Lim, S.; Jung, D. Which Influencers Can Maximize PCR of E-Commerce? Electronics 2023, 12, 2626. https://doi.org/10.3390/electronics12122626
Oh H, Lee J, Lee J-S, Kim S-M, Lim S, Jung D. Which Influencers Can Maximize PCR of E-Commerce? Electronics. 2023; 12(12):2626. https://doi.org/10.3390/electronics12122626
Chicago/Turabian StyleOh, Hayoung, Jiyoon Lee, Joo-Sik Lee, Sung-Min Kim, Sechang Lim, and Dongha Jung. 2023. "Which Influencers Can Maximize PCR of E-Commerce?" Electronics 12, no. 12: 2626. https://doi.org/10.3390/electronics12122626
APA StyleOh, H., Lee, J., Lee, J.-S., Kim, S.-M., Lim, S., & Jung, D. (2023). Which Influencers Can Maximize PCR of E-Commerce? Electronics, 12(12), 2626. https://doi.org/10.3390/electronics12122626