Applying Neural Networks in Wineinformatics with the New Computational Wine Wheel
Abstract
:1. Introduction
2. Materials and Methods
2.1. Classification Algorithms
2.1.1. SVM
2.1.2. Neural Networks
2.1.3. Naïve Bayes
- P(A|B) is the posterior probability of A given B
- P(B|A) is the likelihood probability of B given A
- P(A) is the prior probability of A
- P(B) is the prior probability of B
2.2. Wine Review Data from Wine Spectator and Robert Parker Wine Advocate
2.3. Computational Wine Wheel
2.4. Computational Wine Wheel 3.0
2.4.1. New Attribute Extraction from Robert Parker’s Elite Bordeaux Data Set Process
2.4.2. CWW2.0 vs. CWW3.0
2.5. Data Sets
2.5.1. Dataset1: Elite Bordeaux Data Set (513 Wines)
2.5.2. Dataset2: Big Data Set
2.6. Data Preprocessing
2.7. Classification Evaluations
3. Experimental Results
- -
- Three inner layers, each with a number of nodes equal to the number of attributes of the data set;
- -
- Each inner layer is followed by a drop-off layer set at a 20% ratio;
- -
- A five-fold cross-validation is used;
- -
- Since neural networks use the random initial weight for each node for each data set, the reported accuracy is the average of five independent runs.
3.1. Performances on Dataset1: Elite Bordeaux
3.1.1. Results of Robert Parker’s review in Dataset1
3.1.2. Results from Wine Spectator
3.2. Performances on Dataset2: Big Dataset
3.2.1. Results of Robert Parker’s Review in Dataset2
3.2.2. Results of Wine Spectator’s Reviews in Dataset2
3.3. Performances of the Computational Wine Wheel 2.0 vs. 3.0
4. Discussion
5. Conclusions
- Adopting neural networks on wine reviews processed by the Computational Wine Wheel and comparing performances directly with Naïve Bayes and SVM classification results;
- Proposing a new Computation Wine Wheel evolved into the third generation by including Robert Parker’s reviews;
- A new data set, which consists of more than 10,000 wines was collected and tested for a comprehensive comparison between two different review sites (Wine Spectator and Robert Parker Wine Advocate) using all three algorithms and two Computational Wine Wheels.
6. Future Works
Supplementary Materials
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Chen, B.; Velchev, V.; Palmer, J.; Atkison, T. Wineinformatics: A Quantitative Analysis of Wine Reviewers. Fermentation 2018, 4, 82. [Google Scholar] [CrossRef] [Green Version]
- Palmer, J.; Chen, B. Wineinformatics: Regression on the Grade and Price of Wines through Their Sensory Attributes. Fermentation 2018, 4, 84. [Google Scholar] [CrossRef] [Green Version]
- Cortez, P.; Cerdeira, A.; Almeida, F.; Matos, T.; Reis, J. Modeling wine preferences by data mining from physicochemical properties. Decis. Support Syst. 2009, 47, 547–553. [Google Scholar] [CrossRef] [Green Version]
- Chen, M.-C.; Chen, L.-S.; Hsu, C.-C.; Zeng, W.-R. An information granulation based data mining approach for classifying imbalanced data. Inf. Sci. 2008, 178, 3214–3227. [Google Scholar] [CrossRef] [Green Version]
- Capece, A.; Romaniello, R.; Siesto, G.; Pietrafesa, R.; Massari, C.; Poeta, C.; Ro-mano, P. Selection of indigenous Saccharomyces cerevisiae strains for Nero d’Avola wine and evaluation of selected starter implantation in pilot fermentation. Int. J. Food Microbiol. 2010, 144, 187–192. [Google Scholar] [CrossRef] [PubMed]
- Edelmann, A.; Diewok, J.; Schuster, K.C.; Lendl, B. Rapid Method for the Discrimination of Red Wine Cultivars Based on Mid-Infrared Spectroscopy of Phenolic Wine Extracts. J. Agric. Food Chem. 2001, 49, 1139–1145. [Google Scholar] [CrossRef] [PubMed]
- Chen, B.; Rhodes, C.; Crawford, A.; Hambuchen, L. Wineinformatics: Applying data mining on wine sensory reviews pro-cessed by the computational wine wheel. In Proceedings of the 2014 IEEE International Conference on Data MiningWork-Shop, Shenzhen, China, 14 December 2014; pp. 142–149. [Google Scholar]
- Chen, B.; Rhodes, C.; Yu, A.; Velchev, V. The computational wine wheel 2.0 and the TriMax triclustering in wineinformatics. In Proceedings of the Industrial Conference on Data Mining, Barcelona, Spain, 12–15 December 2016; Springer: Cham, Switzerland, 2016; pp. 223–238. [Google Scholar]
- McCune, J.; Riley, A.; Chen, B. Clustering in Wineinformatics with Attribute Selection to Increase Unique-ness of Clusters. Fermentation 2021, 7, 27. [Google Scholar] [CrossRef]
- Hommerberg, C. Persuasiveness in the Discourse of Wine: The Rhetoric of Robert Parker. Ph.D. Thesis, Linnaeus University Press, Kalmar, Sweden, 2011. [Google Scholar]
- Tian, Q.; Whiting, B.; Chen, B. Wineinformatics: Comparing and Combining SVM Models Built by Wine Reviews from Robert Parker and Wine Spectator for 95 + Point Wine Prediction. Fermentation 2022, 8, 164. [Google Scholar] [CrossRef]
- Loyola-Gonzalez, O. Black-Box vs. White-Box: Understanding Their Advantages and Weaknesses from a Practical Point of View. IEEE Access 2019, 7, 154096–154113. [Google Scholar] [CrossRef]
- Cherkassky, V.; Dhar, S. Interpretation of Black-Box Predictive Models; Springer: Cham, Switzerland, 2015; pp. 267–286. [Google Scholar] [CrossRef]
- Riul, A., Jr.; de Sousa, H.C.; Malmegrim, R.R.; Santos, D.S.D., Jr.; Carvalho, A.C.P.F.; Fonseca, F.J.; Oliveira, O.N., Jr.; Mattoso, L.H.C. Wine classification by taste sensors made from ultra-thin films and using neural networks. Sens. Actuators B Chem. 2004, 98, 77–82. [Google Scholar] [CrossRef]
- Aguilera, T.; Lozano, J.; Paredes, J.A.; Alvarez, F.J.; Suárez, J.I. Electronic nose based on inde-pendent component analysis combined with partial least squares and artificial neural networks for wine prediction. Sensors 2012, 12, 8055–8072. [Google Scholar] [CrossRef]
- Katumullage, D.; Yang, C.; Barth, J.; Cao, J. Using Neural Network Models for Wine Review Classification. J. Wine Econ. 2022, 17, 27–41. [Google Scholar] [CrossRef]
- Chen, Y.-L.; Hung, L.T.-H. Using decision trees to summarize associative classification rules. Expert Syst. Appl. 2009, 36, 2338–2351. [Google Scholar] [CrossRef]
- Weisberg, S. Applied Linear Regression; John Wiley & Sons: Hoboken, NJ, USA, 2005; Volume 528. [Google Scholar] [CrossRef]
- Ben-Gal, I. Bayesian networks. In Encyclopedia of Statistics in Quality and Reliability; Wiley Online Library: New York, NY, USA, 2008; Volume 1. [Google Scholar]
- Webb, G.I.; Keogh, E.; Miikkulainen, R. Naïve Bayes. Encycl. Mach. Learn. 2010, 15, 713–714. [Google Scholar]
- Chen, B.; Le, H.; Atkison, T.; Che, D. A Wineinformatics Study for White-box Classification Algorithms to Understand and Evaluate Wine Judges. Trans. Mach. Learn. Data Min. 2017, 10, 3–24. [Google Scholar]
- Boser, B.E.; Guyon, I.M.; Vapnik, V.N. A training algorithm for optimal margin classifiers. In Proceedings of the Fifth Annual Workshop on Computational Learning Theory, Pittsburgh, PN, USA, 27–29 July 1992. [Google Scholar]
- Karimi, N.; Kazem, S.; Ahmadian, D.; Adibi, H.; Ballestra, L. On a generalized Gaussian radial basis function: Analysis and applications. Eng. Anal. Bound. Elem. 2020, 112, 46–57. [Google Scholar] [CrossRef]
- Joachims, T. Making large-Scale SVM Learning Practical. In Advances in Kernel Methods—Support Vector Learning; Schölkopf, B., Burges, C., Smola, A., Eds.; MIT-Press: Cambridge, MA, USA, 1999. [Google Scholar]
- Abadi, M.; Barham, P.; Chen, J.; Chen, Z.; Davis, A.; Dean, J.; Devin, M.; Ghemawat, S.; Irving, G.; Isard, M.; et al. Tensorflow: A system for large-scale machine learning. In Proceedings of the 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI’16), Savannah, GA, USA, 2–4 November 2016; pp. 265–283. [Google Scholar]
- Medsker, L.R.; Jain, L.C. Recurrent neural networks. Des. Appl. 2001, 5, 64–67. [Google Scholar]
- Palmer, J.; Sheng, V.S.; Atkison, T.; Chen, B. Classification on grade, price, and region with multi-label and multi-target methods in wineinformatics. Big Data Min. Anal. 2019, 3, 1–12. [Google Scholar] [CrossRef]
- Roca, P. State of the Vitiviniculture World in 2020; International Organization of Vine and Wine: Dijon, France, 2021. [Google Scholar]
- Goodhue, R.; Green, R.; Heien, D.; Martin, P. California wine industry evolving to compete in 21st century. Calif. Agric. 2008, 62, 12–18. [Google Scholar] [CrossRef] [Green Version]
- Buccafusco, C.; Masur, J.S.; Whalen, R. How Many Latours Is Too Many? Measuring Brand Name Congestion in Bordeaux Wine. J. Wine Econ. 2021, 16, 419–428. [Google Scholar] [CrossRef]
- Capehart, K.W. Expensive and Cheap Wine Words Revisited. J. Wine Econ. 2021, 16, 411–418. [Google Scholar] [CrossRef]
- Yang, C.; Barth, J.; Katumullage, D.; Cao, J. Wine Review Descriptors as Quality Predictors: Evidence from Language Processing Techniques. J. Wine Econ. 2022, 17, 64–80. [Google Scholar] [CrossRef]
- Gu, J.; Wang, Z.; Kuen, J.; Ma, L.; Shahroudy, A.; Shuai, B.; Liu, T.; Wang, X.; Wang, G.; Cai, J.; et al. Recent advances in convolutional neural networks. Pattern Recognit. 2018, 77, 354–377. [Google Scholar] [CrossRef] [Green Version]
- Cheng, J.; Dong, L.; Lapata, M. Long Short-Term Memory-Networks for Machine Reading. arXiv 2016, arXiv:1601.06733. [Google Scholar]
- LeCun, Y.; Bengio, Y.; Hinton, G. Deep learning. Nature 2015, 521, 436–444. [Google Scholar] [CrossRef] [PubMed]
- Chen, B.; Buck, K.H.; Lawrence, C.; Moore, C.; Yeatts, J.; Atkison, T. Granular computing in wineinformatics. In Proceedings of the 2017 13th International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery (ICNC-FSKD), Guilin, China, 29–31 July 2017; 2017; pp. 1228–1232. [Google Scholar]
- Mehta, S.; Azarnoush, B.; Chen, B.; Saluja, A.; Misra, V.; Bihani, B.; Kumar, R. Simplify-then-translate: Automatic preprocessing for black-box translation. In Proceedings of the AAAI Conference on Artificial Intelligence, New York, NY, USA, 7–12 February 2020; Volume 34, pp. 8488–8495. [Google Scholar]
- Miyata, R.; Fujita, A. Understanding Pre-Editing for Black-Box Neural Machine Translation. arXiv 2021, arXiv:2102.02955. [Google Scholar]
Robert Parker’s Review | Wine Spectator’s Review |
---|---|
Made with organically grown clones of Michel and Lampia, the E Pira-Chiara Boschis 2017 Barolo Mosconi softly presents bright berry aromas, raspberry, wild cherry, crushed limestone and delicate floral tones of lilacs and violets. Like other wines from this hot vintage, this expression from the Mosconi cru in Monforte d’Alba has that unique floral signature that is precious and unexpected. The wine shows great depth and balance with a pretty intensity that spreads over the palate. The tannins are dry with some dustiness, but the mouthfeel is spot-on in terms of length and polish. | Cherry and plum fruit flavors are accented by vanilla, toast, hay, white pepper and tar notes in this expressive, solidly built Barolo, which is fluid, with a dense matrix of tannins shoring up the long finish, showing fine complexity, balance and length. Best from 2025 through 2048. |
Robert Parker’s Score Meaning | Wine Spectator’s Score Meaning |
---|---|
96–100 Extraordinary: worth a special effort to find, purchase, and consume. 90–95 Outstanding: terrific wines 80–89 Barely above average and very good 70–79 Average: a straightforward, innocuous wine. 60–69 Below average 50–59 Unacceptable | 95–100 Classic: a great wine 90–94 Outstanding: a wine of superior character and style 85–89 Very good: a wine with special qualities 80–84 Good: a solid, well-made wine 75–79 Mediocre: a drinkable wine that may have minor flows 50–74 Not recommended |
Hand-Extracted Attributes | Program-Extracted Attributes | Common Attributes |
---|---|---|
powerful, blackcurrant, black raspberries, blueberry, pie, melted chocolate, aniseed, camphor, kirsch, subtle, floral, full-bodied, concentrated, bold, seductive, fine-grained, silt-like tannins, jam-packed, tightly wound, fruit, layers, finishing, wonderful, mineral, sparks, magic | powerful, black raspberries, blueberry, pie, melted chocolate, kirsch, subtle, floral, full-bodied, concentrated, bold, seductive, jam-packed, tightly wound, fruit, layers, finishing, wonderful, mineral, sparks, purple color, tannins, explodes | powerful, black raspberries, blueberry, pie, melted chocolate, kirsch, subtle, floral, full-bodied, concentrated, bold, seductive, jam-packed, tightly wound, fruit, layers, finishing, wonderful, mineral, sparks, |
Total count: 26 | Total count: 23 | Total count: 20 |
CWW2.0 | CWW3.0 | |
---|---|---|
Data Source | Wine Spectator | Wine Spectator + Robert Parker |
Categories | 14 | 14 |
Subcategories | 34 | 34 |
Specific Terms | 1932 | 2589 |
Normalized Attributes | 986 | 1191 |
CATEGORY_NAME | SUBCATEGORY_NAME | SPECIFIC_NAME | SPECIFIC_NAME Added | NORMALIZED_NAME | NORMALIZED_NAME Added |
---|---|---|---|---|---|
CARAMEL | CARAMEL | 97 | 26 | 56 | 16 |
CHEMICAL | PETROLEUM | 11 | 2 | 6 | 1 |
SULFUR | 11 | 0 | 10 | 0 | |
PUNGENT | 4 | 0 | 4 | 1 | |
EARTHY | EARTHY | 128 | 56 | 47 | 16 |
MOLDY | 2 | 0 | 2 | 0 | |
FLORAL | FLORAL | 87 | 26 | 45 | 6 |
FRUITY | BERRY | 84 | 35 | 39 | 11 |
CITRUS | 56 | 19 | 35 | 2 | |
DRIED FRUIT | 76 | 9 | 65 | 5 | |
FRUIT | 42 | 20 | 16 | 7 | |
OTHER | 22 | -3 | 9 | -9 | |
TREE FRUIT | 55 | 16 | 40 | 9 | |
TROPICAL FRUIT | 67 | 19 | 36 | 9 | |
FRESH | FRESH | 75 | 34 | 44 | 15 |
DRIED | 50 | 25 | 39 | 18 | |
CANNED/COOKED | 18 | 2 | 17 | 2 | |
MEAT | MEAT | 36 | 11 | 21 | 8 |
MICROBIOLOGICAL | YEASTY | 5 | 0 | 4 | 0 |
LACTIC | 14 | 0 | 6 | 0 | |
NUTTY | NUTTY | 27 | 2 | 20 | 5 |
OVERALL | TANNINS | 124 | 34 | 6 | 2 |
BODY | 61 | 11 | 17 | -6 | |
STRUCTURE | 51 | 11 | 2 | 0 | |
ACIDITY | 61 | 21 | 4 | 1 | |
FINISH | 233 | 49 | 13 | 8 | |
FLAVOR/DESCRIPTORS | 899 | 240 | 467 | 35 | |
OXIDIZED | OXIDIZED | 2 | 1 | 2 | 1 |
PUNGENT | HOT | 3 | 0 | 2 | 0 |
COLD | 1 | 0 | 1 | 0 | |
SPICY | SPICE | 85 | 2 | 53 | 9 |
WOODY | RESINOUS | 31 | 7 | 12 | 3 |
PHENOLIC | 6 | 0 | 5 | 1 | |
BURNED | 51 | 4 | 28 | 2 |
Evaluation Matrix | Predicted (Positive) | Predicted (Negative) |
---|---|---|
Actual (Positive) | TP | FN |
Actual (Negative) | FP | TN |
Key Attributes from Positive Label (95+) | Key Attributes from Negative Label (94-) |
---|---|
GREAT, BLACK CURRANT, PURPLE, FULL-BODIED, LAYER, RIPE, FRUIT, FLORAL, FIRM, BLACK FRUIT | MEDIUM-BODIED, WELL-BALANCED, VELVET, YOUNG, PURE, AROMA, DENSE, STYLE, TANNINS_DECENT, TOBACCO |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Le, L.; Hurtado, P.N.; Lawrence, I.; Tian, Q.; Chen, B. Applying Neural Networks in Wineinformatics with the New Computational Wine Wheel. Fermentation 2023, 9, 629. https://doi.org/10.3390/fermentation9070629
Le L, Hurtado PN, Lawrence I, Tian Q, Chen B. Applying Neural Networks in Wineinformatics with the New Computational Wine Wheel. Fermentation. 2023; 9(7):629. https://doi.org/10.3390/fermentation9070629
Chicago/Turabian StyleLe, Long, Pedro Navarrete Hurtado, Ian Lawrence, Qiuyun Tian, and Bernard Chen. 2023. "Applying Neural Networks in Wineinformatics with the New Computational Wine Wheel" Fermentation 9, no. 7: 629. https://doi.org/10.3390/fermentation9070629
APA StyleLe, L., Hurtado, P. N., Lawrence, I., Tian, Q., & Chen, B. (2023). Applying Neural Networks in Wineinformatics with the New Computational Wine Wheel. Fermentation, 9(7), 629. https://doi.org/10.3390/fermentation9070629