Understanding 21st Century Bordeaux Wines from Wine Reviews Using Naïve Bayes Classifier
Abstract
:1. Introduction
2. Bordeaux Wine Dataset
2.1. Wine Spectator
- 95–100 Classic: a great wine
- 89–94 Outstanding: a wine of superior character and style
- 85–89 Very good: a wine with special qualities
- 80–84 Good: a solid, well-made wine
- 75–79 Mediocre: a drinkable wine that may have minor flaws
- 50–74 Not recommended
2.2. The Computational Wine Wheel
2.3. Datasets
2.3.1. ALL Bordeaux Wine Dataset
2.3.2. 1855 Bordeaux Wine Official Classification Dataset
3. Methods
3.1. Classification Algorithms
3.1.1. Naïve Bayes
- P(Y|X): The posterior probability of Y belongs to a particular class when X happens;
- P(X|Y): The prior probability of certain feature value X when Y belongs to a certain class;
- P(Y): prior probability of Y;
- P(X): prior probability of X.
3.1.2. SMV
3.2. Evaluations
- TP:
- The real condition is true (1) and predicted as true (1); 90+ wine correctly classified as 90+ wine;
- TN:
- The real condition is false (−1) and predicted as false (−1); 89− wine correctly classified as 89− wine;
- FP:
- The real condition is false (−1) but predicted as true (1); 89− wine incorrectly classified as 90+ wine;
- FN:
- The real condition is true (1) but predicted as false (−1); 90+ wine incorrectly classified as 89− wine.
4. Results
4.1. ALL Bordeaux Wine Dataset
4.2. 1855 Bordeaux Wine Official Classification Dataset
4.3. Comparison of Two Datasets
4.4. Visualization of 1855 Bordeaux Wine Official Classification Dataset
4.5. Top 20 Keywords
5. Conclusions
Author Contributions
Funding
Conflicts of Interest
Appendix A. The 1855 Classification, Revised in 1973
Appendix A.1. Red Wines
- Château Haut-Brion, Pessac, AOC Pessac-Léognan
- Château Lafite-Rothschild, Pauillac, AOC Pauillac
- Château Latour, Pauillac, AOC Pauillac
- Château Margaux, Margaux, AOC Margaux
- Château Mouton Rothschild, Pauillac, AOC Pauillac
- Château Brane-Cantenac, Cantenac, AOC Margaux
- Château Cos-d’Estournel, Saint-Estèphe, AOC Saint-Estèphe
- Château Ducru-Beaucaillou, Saint-Julien-Beychevelle, AOC Saint-Julien
- Château Durfort-Vivens, Margaux, AOC Margaux
- Château Gruaud-Larose, Saint-Julien-Beychevelle, AOC Saint-Julien
- Château Lascombes, Margaux, AOC Margaux
- Château Léoville-Barton, Saint-Julien-Beychevelle, AOC Saint-Julien
- Château Léoville-Las-Cases, Saint-Julien-Beychevelle, AOC Saint-Julien
- Château Léoville-Poyferré, Saint-Julien-Beychevelle, AOC Saint-Julien
- Château Montrose, Saint-Estèphe, AOC Saint-Estèphe
- Château Pichon-Longueville-Baron-de-Pichon, Pauillac, AOC Pauillac
- Château Pichon-Longueville-Comtesse-de-Lalande, Pauillac, AOC Pauillac
- Château Rauzan-Ségla, Margaux, AOC Margaux
- Château Rauzan-Gassies, Margaux, AOC Margaux
- Château Boyd-Cantenac, Cantenac, AOC Margaux
- Château Calon-Ségur, Saint-Estèphe, AOC Saint-Estèphe
- Château Cantenac-Brown, Cantenac, AOC Margaux
- Château Desmirail, Margaux, AOC Margaux
- Château Ferrière, Margaux, AOC Margaux
- Château Giscours, Labarde, AOC Margaux
- Château d’Issan, Cantenac, AOC Margaux
- Château Kirwan, Cantenac, AOC Margaux
- Château Lagrange, Saint-Julien-Beychevelle, AOC Saint-Julien
- Château La Lagune, Ludon, AOC Haut-Médoc
- Château Langoa-Barton, Saint-Julien-Beychevelle, AOC Saint-Julien
- Château Malescot-Saint-Exupéry, Margaux, AOC Margaux
- Château Marquis-d’Alesme, Margaux, AOC Margaux
- Château Palmer, Cantenac, AOC Margaux
- Château Beychevelle, Saint-Julien-Beychevelle, AOC Saint-Julien
- Château Branaire-Ducru, Saint-Julien-Beychevelle, AOC Saint-Julien
- Château Duhart-Milon, Pauillac, AOC Pauillac
- Château Lafon-Rochet, Saint-Estèphe, AOC Saint-Estèphe
- Château Marquis-de-Terme, Margaux, AOC Margaux
- Château Pouget, Cantenac, AOC Margaux
- Château Prieuré-Lichine, Cantenac, AOC Margaux
- Château Saint-Pierre, Saint-Julien-Beychevelle, AOC Saint-Julien
- Château Talbot, Saint-Julien-Beychevelle, AOC Saint-Julien
- Château La Tour-Carnet, Saint-Laurent-de-Médoc, AOC Haut-Médoc
- Château d’Armailhac, Pauillac, AOC Pauillac
- Château Batailley, Pauillac, AOC Pauillac
- Château Belgrave, Saint-Laurent-de-Médoc, AOC Haut-Médoc
- Château Camensac, Saint-Laurent-de-Médoc, AOC Haut-Médoc
- Château Cantemerle, Macau, AOC Haut-Médoc
- Château Clerc-Milon, Pauillac, AOC Pauillac
- Château Cos-Labory, Saint-Estèphe, AOC Saint-Estèphe
- Château Croizet-Bages, Pauillac, AOC Pauillac
- Château Dauzac, Labarde, AOC Margaux
- Château Grand-Puy-Ducasse, Pauillac, AOC Pauillac
- Château Grand-Puy-Lacoste, Pauillac, AOC Pauillac
- Château Haut-Bages-Libéral, Pauillac, AOC Pauillac
- Château Haut-Batailley, Pauillac, AOC Pauillac
- Château Lynch-Bages, Pauillac, AOC Pauillac
- Château Lynch-Moussas, Pauillac, AOC Pauillac
- Château Pédesclaux, Pauillac, AOC Pauillac
- Château Pontet-Canet, Pauillac, AOC Pauillac
- Château du Tertre, Arsac, AOC Margaux
Appendix A.2. White Wines
- Château d’Yquem, Sauternes, AOC Sauternes
- Château Climens, Barsac, AOC Barsac
- Clos Haut-Peyraguey, Bommes, AOC Sauternes
- Château Coutet, Barsac, AOC Barsac
- Château Guiraud, Sauternes, AOC Sauternes
- Château Lafaurie-Peyraguey, Bommes, AOC Sauternes
- Château Rabaud-Promis, Bommes, AOC Sauternes
- Château Rayne-Vigneau, Bommes, AOC Sauternes
- Château Rieussec, Fargues-de-Langon, AOC Sauternes
- Château Sigalas-Rabaud, Bommes, AOC Sauternes
- Château Suduiraut, Preignac, AOC Sauternes
- Château La Tour-Blanche, Bommes, AOC Sauternes
- Château d’Arche, Sauternes, AOC Sauternes
- Château Broustet, Barsac, AOC Barsac
- Château Caillou, Barsac, AOC Barsac
- Château Doisy-Daëne, Barsac, AOC Barsac
- Château Doisy-Dubroca, Barsac, AOC Barsac
- Château Doisy-Védrines, Barsac, AOC Barsac
- Château Filhot, Sauternes, AOC Sauternes
- Château Lamothe (Despujols), Sauternes, AOC Sauternes
- Château Lamothe-Guignard, Sauternes, AOC Sauternes
- Château de Malle, Preignac, AOC Sauternes
- Château de Myrat, Barsac, AOC Barsac
- Château Nairac, Barsac, AOC Barsac
- Château Romer-du-Hayot, Fargues-de-Langon, AOC Sauternes
- Château Romer, Fargues-de-Langon, AOC Sauternes
- Château Suau, Barsac, AOC Barsac
Appendix B. The List of Wine and Vintages We Can’t Find
- CHÂTEAU PÉDESCLAUX Pauillac (2005,2004,2003,2002,2001)
- CHÂTEAU CLIMENS Barsac (2000)
- CHÂTEAU RABAUD-PROMIS Sauternes (2016,2015,2014,2010,2008)
- CHÂTEAU RIEUSSEC Sauternes (2012)
- CHÂTEAU SUDUIRAUT Sauternes (2012)
- CHÂTEAU LA TOUR BLANCHE Sauternes (2000)
- CHÂTEAU BROUSTET Barsac (2012,2008,2007,2005,2004,2000)
- CHÂTEAU CAILLOU Barsac(2016,2015,2014,2010,2008,2000)
- CHÂTEAU LAMOTHE-DESPUJOLS Sauternes (2016,2015,2014,2013,2012,2011,2010,2009,2006,2005,2004,2002,2000)
- CHÂTEAU NAIRAC Barsac (2016,2000)
- CHÂTEAU ROMER DU HAYOT Sauternes (2016,2015,2014,2010)
- CHÂTEAU ROMER Sauternes (2016,2010,2008,2006,2004,2002,2001,2000)
- CHÂTEAU SUAU Barsac (2014,2010,2007)
- CHÂTEAU D’YQUEM Sauternes (2012)
- CHÂTEAU D’ARCHE Sauternes (2016,2015,2014,2012,2010)
- Château Durfort-Vivens Margaux (2016,2015,2014)
- Château Pichon-Longueville-Baron-de-Pichon, Pauillac, AOC Pauillac(Château Pichon-Longueville Baron Pauillac Les Griffons de Pichon Baron (2016,2015,2013,2011,2010,2009,2008,2007,2006,2005,2004,2003,2002,2001,2000))
- Château Pichon-Longueville-Comtesse-de-Lalande, Pauillac, AOC Pauillac(Château Pichon Longueville Lalande Pauillac Réserve de la Comtesse (2013,2007))
- Château Rauzan-Gassies Margaux (2007,2004)
- Château Boyd-Cantenac Margaux (2016,2015,2014,2013,2012)
- Château Desmirail Margaux (2007,2006,2005,2004,2003,2002,2001,2000)
- CHÂTEAU MARQUIS D’ALESME BECKER Margaux (2004)
- CHÂTEAU BEYCHEVELLE St.-Julien Amiral de Beychevelle (2013,2011,2004,2003,2002,2001)
- CHÂTEAU MARQUIS DE TERME Margaux (2003)
- CHÂTEAU POUGET Margaux (2016,2015,2014,2013,2012)
- CHÂTEAU DE CAMENSAC Haut-Médoc (2016,2015,2014,2008)
- Château La Lagune Haut-Médoc (2016,2015,2013)
- CHÂTEAU COS LABORY St.-Estèphe (2016,2015,2014,2013)
- CHÂTEAU CROIZET-BAGES Pauillac (2007)
- Château d’Issan, Cantenac, AOC Margaux (not Found)
- Château Doisy-Dubroca, Barsac (not found)
- Château Lamothe-Guignard, Sauternes (2016)
References
- Combris, P.; Lecocq, S.; Visser, M. Estimation of a hedonic price equation for Bordeaux wine: Does quality matter? Econ. J. 1997, 107, 389–402. [Google Scholar] [CrossRef]
- Cardebat, J.-M.; Figuet, J. What explains Bordeaux wine prices? Appl. Econ. Lett. 2004, 11, 293–296. [Google Scholar] [CrossRef]
- Ashenfelter, O. Predicting the quality and prices of Bordeaux wine. Econ. J. 2008, 118, F174–F184. [Google Scholar] [CrossRef]
- Shanmuganathan, S.; Sallis, P.; Narayanan, A. Data mining techniques for modelling seasonal climate effects on grapevine yield and wine quality. In Proceedings of the 2010 2nd International Conference on Computational Intelligence, Communication Systems and Networks, Liverpool, UK, 28–30 July 2010; pp. 84–89. [Google Scholar]
- Noy, F.N.; Sintek, M.; Decker, S.; Crubézy, M.; Fergerson, R.W.; Musen, M.A. Creating semantic web contents with protege-2000. IEEE Intell. Syst. 2001, 16, 60–71. [Google Scholar] [CrossRef]
- Noy, F.N.; McGuinness, D.L. Ontology Development 101: A Guide to Creating Your First Ontology. Stanford Knowledge Systems Laboratory Technical Report KSL-01-05 and Stanford Medical Informatics Technical Report SMI-2001-0880. March 2001. Available online: http://www.corais.org/sites/default/files/ontology_development_101_aguide_to_creating_your_first_ontology.pdf (accessed on 1 January 2020).
- Quandt, R.E. A note on a test for the sum of ranksums. J. Wine Econ. 2007, 2, 98–102. [Google Scholar] [CrossRef] [Green Version]
- Ashton, R.H. Improving experts’ wine quality judgments: Two heads are better than one. J. Wine Econ. 2011, 6, 135–159. [Google Scholar] [CrossRef]
- Ashton, R.H. Reliability and consensus of experienced wine judges: Expertise within and between? J. Wine Econ. 2012, 7, 70–87. [Google Scholar] [CrossRef]
- Bodington, J.C. Evaluating wine-tasting results and randomness with a mixture of rank preference models. J. Wine Econ. 2015, 10, 31–46. [Google Scholar] [CrossRef]
- Cardebat, J.M.; Livat, F. Wine experts’ rating: A matter of taste? Int. J. Wine Bus. Res. 2016, 28, 43–58. [Google Scholar] [CrossRef] [Green Version]
- Cardebat, J.M.; Figuet, J.M.; Paroissien, E. Expert opinion and Bordeaux wine prices: An attempt to correct biases in subjective judgments. J. Wine Econ. 2014, 9, 282–303. [Google Scholar] [CrossRef]
- Cao, J.; Stokes, L. Evaluation of wine judge performance through three characteristics: Bias, discrimination, and variation. J. Wine Econ. 2010, 5, 132–142. [Google Scholar] [CrossRef] [Green Version]
- Cardebat, J.M.; Paroissien, E. Standardizing expert wine scores: An application for Bordeaux en primeur. J. Wine Econ. 2015, 10, 329–348. [Google Scholar] [CrossRef]
- Hodgson, R.T. An examination of judge reliability at a major US wine competition. J. Wine Econ. 2008, 3, 105–113. [Google Scholar] [CrossRef]
- Hodgson, R.T. An analysis of the concordance among 13 US wine competitions. J. Wine Econ. 2009, 4, 1–9. [Google Scholar] [CrossRef]
- Hodgson, R.; Cao, J. Criteria for accrediting expert wine judges. J. Wine Econ. 2014, 9, 62–74. [Google Scholar] [CrossRef]
- Hopfer, H.; Heymann, H. Judging wine quality: Do we need experts, consumers or trained panelists? Food Qual. Prefer. 2014, 32, 221–233. [Google Scholar] [CrossRef]
- Ashenfelter, O.; Goldstein, R.; Riddell, C. Do expert ratings measure quality? The case of restaurant wine lists. In Proceedings of the 4th Annual AAWE Conference at the University of California at Davis, Davis, CA, USA, 20 June 2010. [Google Scholar]
- Cardebat, J.M.; Corsinovi, P.; Gaeta, D. Do Top 100 wine lists provide consumers with better information? Econ. Bull. 2018, 38, 983–994. [Google Scholar]
- Reuter, J. Does advertising bias product reviews? An analysis of wine ratings. J. Wine Econ. 2009, 4, 125–151. [Google Scholar] [CrossRef]
- Chen, B.; Rhodes, C.; Crawford, A.; Hambuchen, L. Wineinformatics: Applying data mining on wine sensory reviews processed by the computational wine wheel. In Proceedings of the 2014 IEEE International Conference on Data Mining Workshop, Shenzhen, China, 14–14 December 2014; pp. 142–149. [Google Scholar]
- Chen, B.; Rhodes, C.; Yu, A.; Velchev, V. The Computational Wine Wheel 2.0 and the TriMax Triclustering in Wineinformatics. In Industrial Conference on Data Mining; Springer: Cham, Switzerland, 2016; pp. 223–238. [Google Scholar]
- Chen, B.; Velchev, V.; Palmer, J.; Atkison, T. Wineinformatics: A Quantitative Analysis of Wine Reviewers. Fermentation 2018, 4, 82. [Google Scholar] [CrossRef] [Green Version]
- Palmer, J.; Chen, B. Wineinformatics: Regression on the Grade and Price of Wines through Their Sensory Attributes. Fermentation 2018, 4, 84. [Google Scholar] [CrossRef] [Green Version]
- Wine Spectator. Available online: https://www.winespectator.com (accessed on 1 January 2020).
- Bordeaux Wine Official Classification of 1855. Available online: https://www.bordeaux.com/us/Our-Terroir/Classifications/Grand-Cru-Classes-en-1855 (accessed on 1 January 2020).
- Wine Spectator’s 100-Point Scale | Wine Spectator, Winespectator.com. 2019. Available online: https://www.winespectator.com/articles/scoring-scale (accessed on 1 January 2020).
- Chen, B.; Le, H.; Rhodes, C.; Che, D. Understanding the Wine Judges and Evaluating the Consistency Through White-Box Classification Algorithms. In Advances in Data Mining. Applications and Theoretical Aspects. ICDM 2016; Perner, P., Ed.; Lecture Notes in Computer Science; Springer: Cham, Switzerland, 2016; Volume 9728. [Google Scholar]
- Rish, I. An empirical study of the naive Bayes classifier. In Proceedings of the IJCAI 2001 Workshop on Empirical Methods in Artificial Intelligence; 2001; Volume 3, pp. 41–46. Available online: https://www.cc.gatech.edu/~isbell/reading/papers/Rish.pdf (accessed on 1 January 2020).
- Suykens, K.J.A.; Vandewalle, J. Least squares support vector machine classifiers. Neural Process. Lett. 1999, 9, 293–300. [Google Scholar] [CrossRef]
- Thorsten, J. Svmlight: Support Vector Machine. Available online: https://www.researchgate.net/profile/Thorsten_Joachims/publication/243763293_SVMLight_Support_Vector_Machine/links/5b0eb5c2a6fdcc80995ac3d5/SVMLight-Support-Vector-Machine.pdf (accessed on 1 January 2020).
- Robert Parker Wine Advocate. Available online: https://www.robertparker.com/ (accessed on 1 January 2020).
- Wine Enthusiast. Available online: https://www.wineenthusiast.com/ (accessed on 1 January 2020).
- Decanter. Available online: https://www.decanter.com/ (accessed on 1 January 2020).
- Chateau Latour 2009 Wine Reviews. Available online: https://www.wine.com/product/chateau-latour-2009/119875 (accessed on 1 January 2020).
Classifier | Accuracy | Precision | Recall | F-Score |
---|---|---|---|---|
Naïve Bayes Laplace | 85.17% | 73.22% | 79.03% | 76.01% |
SVM | 86.97% | 80.68% | 73.80% | 77.10% |
Classifier | Accuracy | Precision | Recall | F-Score |
---|---|---|---|---|
Naïve Bayes Laplace | 84.62% | 86.79% | 90.02% | 88.38% |
SVM | 81.38% | 86.84% | 84.12% | 85.46% |
CATEGORY | 90+ WINES AND 89− WINES | |||
---|---|---|---|---|
FLAVOR/DESCRIPTORS | GREAT | FLAVORS | ||
FRUITY | FRUIT | PLUM | BLACKBERRY | CURRENT |
BODY | FULL-BODIED | CORE | ||
FINISH | FINISH | |||
HERBS | TOBACCO | |||
TANNINS | TANNINS_LOW |
CATEGORY | 90+ WINES | |||
---|---|---|---|---|
FLAVOR/DESCRIPTORS | LONG | RANGE | RIPE | |
FRUITY | BLACK CURRANT | APPLE | RASPERBERRY | FIG |
BODY | SOLID | |||
SPICE | LICORICE |
CATEGORY | 89− WINES | ||
---|---|---|---|
FLAVOR/DESCRIPTORS | CHARACTER | FRESH | GOOD |
FRUITY | CHERRY | BERRY | |
BODY | MEDIUM-BODIED | LIGHT-BODIED | |
TANNINS | TANNINE_MEDIUM |
CATEGORY | 90+ WINES AND 89− WINES | |||
---|---|---|---|---|
FLAVOR/DESCRIPTORS | GREAT | FLAVORS | SWEET | |
FRUITY | FRUIT | PLUM | BLACKBERRY | CURRENT |
BODY | FULL-BODIED | CORE | ||
FINISH | FINISH | |||
HERBS | TOBACCO |
CATEGORY | 90+ WINES | ||
---|---|---|---|
FLAVOR/DESCRIPTORS | LONG | STYLE | LOVELY |
FRUITY | BLACK CURRENT | FIG | APPLE |
EARTHY | IRON | ||
TANNINS | TANNINS_LOW | ||
SPICE | SPICE |
CATEGORY | 89− WINES | |||
---|---|---|---|---|
FLAVOR/DESCRIPTORS | CHARACTER | FRESH | RANGE | GOOD |
FRUITY | BERRY | |||
BODY | MEDIUM-BODIED | LIGHT-BODIED | ||
TANNINS | TANNIS_MEDIUM |
© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
Share and Cite
Dong, Z.; Guo, X.; Rajana, S.; Chen, B. Understanding 21st Century Bordeaux Wines from Wine Reviews Using Naïve Bayes Classifier. Beverages 2020, 6, 5. https://doi.org/10.3390/beverages6010005
Dong Z, Guo X, Rajana S, Chen B. Understanding 21st Century Bordeaux Wines from Wine Reviews Using Naïve Bayes Classifier. Beverages. 2020; 6(1):5. https://doi.org/10.3390/beverages6010005
Chicago/Turabian StyleDong, Zeqing, Xiaowan Guo, Syamala Rajana, and Bernard Chen. 2020. "Understanding 21st Century Bordeaux Wines from Wine Reviews Using Naïve Bayes Classifier" Beverages 6, no. 1: 5. https://doi.org/10.3390/beverages6010005
APA StyleDong, Z., Guo, X., Rajana, S., & Chen, B. (2020). Understanding 21st Century Bordeaux Wines from Wine Reviews Using Naïve Bayes Classifier. Beverages, 6(1), 5. https://doi.org/10.3390/beverages6010005