Next Article in Journal
Effect of Industrial Processing on the Volatile Organic Compound Fingerprint of Dry-Cured Tuna
Next Article in Special Issue
Sensory Properties of Fermented Blends of Sunflower Press Cake and Whey
Previous Article in Journal
Exploring the Sensory Typicity of Timorasso Wines: Physicochemical and Sensory Characteristics of Seven Consecutive Vintages
Previous Article in Special Issue
The Role of Bitter-Tasting Substances in Salivation and Swallowing: Results of the Pilot Study
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Cross-Cultural Comparison of the Espresso Protocol Repeatability

1
Department of Food Science and Nutrition, College of Human Ecology, Pusan National University, Busan 46241, Republic of Korea
2
Center for Sensory Analysis and Consumer Behavior, Kansas State University, Manhattan, KS 66502, USA
*
Author to whom correspondence should be addressed.
Foods 2025, 14(4), 593; https://doi.org/10.3390/foods14040593
Submission received: 28 December 2024 / Revised: 20 January 2025 / Accepted: 7 February 2025 / Published: 11 February 2025
(This article belongs to the Special Issue Sensory and Consumer Testing of Novel Methods and Novel Foods)

Abstract

:
The Espresso Protocol (TEP) was used to assess the quality of coffee beans through espresso extraction incorporating a sensory approach. TEP includes overall quality evaluation and attribute evaluation using check-all-that-apply (CATA). This study aims to evaluate the repeatability of TEP when used by experts and to compare cross-cultural assessments to determine its applicability across different countries with diverse coffee cultures. Experts with over three years of experience in the coffee industry from five countries—France (n = 7), India (n = 12), Italy (n = 7), the Republic of Korea (n = 10), and the USA (n = 10)—participated in our study. The experiment was conducted in three replications using eight different single-origin coffee samples over two or three consecutive days. Cluster analysis using CATA data was performed to verify the repeatability of individual participants in the characterization of espresso samples, revealing that most participants were repeatable in their three-time evaluations. Moreover, a significant homogeneity index demonstrated a high degree of similarity in the sensory characteristics used by experts from each country, although cultural differences were observed in the terminology used to describe coffee. In conclusion, the repeatability of individual experts and the reliability of TEP were successfully demonstrated. However, some differences in sensory evaluations were noted across cultures; these were likely influenced by differences in the use of terminology, which emphasizes the need for training in the coffee lexicon.

1. Introduction

One of the most widely consumed beverage is coffee, with a record consumption of 176.6 million bags of coffee in 2021/22 [1]. There is also a growing interest in flavor components that have an important role in marketing the product [2]. Brewing and extraction methods vary across geographic, cultural, and social contexts, as well as individual preferences. Furthermore, coffee is a complex beverage that contains over 1000 compounds contributing to its distinctive flavor and aroma [3]. Espresso is a brewed beverage produced by extracting ground coffee using high-pressure hot water. It is commonly used as the base for various coffee-based beverages [4,5,6]. The quality of espresso can be influenced by several factors, including the coffee variety, blending formulations, grind size, water temperature, and the inherent quality of the coffee itself [7,8].
To evaluate coffee quality in the coffee industry, the cupping method was usually used to measure the sensory properties of coffee. However, some authors have shown that cupping scores vary among cuppers and that there is no correlation between the cupping scores of importers and exporters [2]. The Specialty Coffee Association of America (SCAA) introduced a protocol in 1984 to define the sensory quality of coffee. This protocol is detailed and adaptable to standardization, meeting the needs of producers [2,8]. However, recent concerns have highlighted several limitations of the cupping method. While cupping is widely recognized as a reliable method, it is not suitable for all situations [9]. The cupping committee of SCAA (2005) stated that the purpose of sensory evaluation is (1) to identify actual sensory differences between samples, (2) to describe the taste of the samples, and (3) to determine product preference. In the past, the value of coffee was often determined by the preferences of the cupper through cupping [10]. Since 2020, the SCA has recognized the need to evolve cupping protocols. While recent protocols emphasize quality scoring, they fall short in capturing descriptive information. With the increasing importance of descriptive evaluation, the SCA proposed a new cupping format to collect such information, which was previously lacking. This includes a descriptive evaluation for objective flavor assessment and an emotional evaluation based on overall quality impressions [11]. As a result, the SCA introduced the Coffee Value Assessment (CVA) as the new cupping standard in 2024 [12].
Similarly, in a prior study on TEP [13], ‘The Espresso Protocol’ (TEP) was used to evaluate espresso, including sensory methods such as check-all-that-apply, intensity rating, and just-about-right scale. The early versions of TEP provided a basic framework for assessing espresso, but they had limitations in fully describing the complexity of sensory characteristics.
Furthermore, cultural differences may influence the evaluation results in this study. Kim (2015) reviewed various cross-cultural studies, noting that differences observed in consumer preferences or perceived characteristics are primarily attributable to cultural factors rather than inherent individual differences [14]. As such, cultural differences between countries participating in this study could affect the results. Participants may be reluctant to use negative descriptors and/or extreme values and may have difficulties with translation. In addition, it is essential to recognize and account for cultural customs, manners, religious beliefs, knowledge, and attitudes toward products [15]. To date, most coffee studies have merely compared differences based on elements such as roasting level [4,16], storage [17,18], brewing methods [2,5,19], serving temperatures [20,21], and geographic origin [22,23] of brewed coffee. Furthermore, there have only been a few cross-cultural studies on coffee, where consumers’ coffee purchase and consumption behavior were compared between Chinese and Swedish university students [24], and TEP usage was compared between the United States or Canada (US/CA) and Australia or New Zealand (AU/NZ) [13].
The objectives of our study were (1) to determine the repeatability of the modified TEP, and (2) to identify cross-cultural differences between five countries (France, India, Italy, the Republic of Korea, and the USA) in evaluating espresso using TEP.

2. Materials and Methods

2.1. Samples

Eight coffee bean types sold wholesale in the U.S. were selected based on prior knowledge, origin, the quality of coffee beans, and the price range of these coffee samples (Table 1). One sample, called Pre-Packaged in our study, is a product that has already been roasted at the medium level and sold, while the other seven samples—Colombia, Ethiopia, Brazil, Honduras, Costa Rica, Guatemala, and Brazil Cerrado—were green beans from each country of origin. A total of eight coffee bean varieties were shipped with the alphabetical labels A through G, each containing 3 KG (6.6 lb); the Pre-Packaged was a bag of 1.5 KG (3.3 lb) of roasted coffee labeled H.

2.1.1. Sample Preparation

In each country, a professional roaster and a barista were recruited for sample preparation. Additionally, experts participating in espresso evaluation were prevented from sample preparation because sample information may cause bias.
The first step of sample preparation was the roasting of the seven green bean samples. As the quantity of coffee beans to be roasted is small (3 KG), the roaster had to use a small batch or large sample roaster (e.g. Probatino, Bellwether Series 2, HB600 or 2 KG, etc.). The roaster was free to determine the appropriate roasting level for each sample considering the country-of-origin information. Although not all countries used the same roast level, most were roasted at a light-medium or medium level. The professional roaster provided details about roasting profiles used in coffee beans, such as roast time, roast level, peak temperature, and other information to describe the roast (Supplementary Table S1). Then, each coffee bean sample was divided into three new bags, one for each replication. The bags used for repacking were provided and labeled with the letters A to H instead of coffee origin, with three-digit random codes that were different for each replication. Approximately 1.1 lb (0.5 KG) of coffee per bag was packed for the three sample bags, with an additional ‘Excess Coffee’ bag for extra coffee in case of problems. Roasting was completed at least 3 days before, but not more than 10 days before the evaluation. The exception to the roasting was India, where roasting was done 20 days before the evaluation due to outsourcing the roasting.
The grinding and brewing were the last step of sample preparation, taken care of by a trained barista. To meet the grind specifications required by TEP, each country had to use a commercial quality grinder, as factors such as grinder type, burr design, and the age and usage (wear) of the burrs all affect the outcome. Also, to meet the brewing standards (Table 2), espresso machines had to be of commercial quality with essential features, including (1) the ability to measure and adjust brewing temperature (setting temperature with a PID, which stands for a Proportional-Integral-Differential controller; a thermostat is optimal), (2) the ability to measure and adjust pump pressure (a rotary pump is optimal), (3) consistent brew temperature throughout the brew cycle, and (4) the use of water that meets the water standards specified in this study (Table 3). The espresso machine had to undergo temperature and pressure checks, and slight adjustments to the grinder settings were required for each sample, prompting the professional barista to repeatedly monitor the coffee preparation process.
In addition, professional baristas were guided to provide samples in accordance with the extraction guidelines of this study. They served each sample promptly for evaluation to maintain the temperature of each serving. For the coffee dose, a range of 15–17 g was selected as the optimal range for the test standard, as it produces a concentrated sensory result suitable for evaluation and meets the preparation standard.

2.1.2. Sample Evaluation Procedure

The eight coffees were evaluated three times each, i.e., three replications. The sample serving orders were balanced, and experts in each country evaluated them in the same order due to constraints in sample preparation for espresso equipment settings. Each sample evaluation took approximately 20 to 30 min to taste and evaluate. Eight to sixteen espresso samples were evaluated in the session, depending on the testing country. France, Italy, and India evaluated the samples over two consecutive days. In France, 16 samples and 8 samples were evaluated in each session, Italy evaluated 12 samples per day; and in India, on the first day, two sessions were held in the afternoon and in the evening, where 8 samples were evaluated in each session. On the next day, one session was held to evaluate the third replication of the 8 samples. Thus, in India, a total of three evaluation sessions were held over two consecutive days. The other countries (the Republic of Korea and the USA) evaluated eight samples per day for three consecutive days. Although these numbers of samples might seem large for typical sensory descriptive studies [25], in Issanchou (2018) [26], it is suggested that evaluating more than 10 samples per session should be avoided, as it may lead to sensory fatigue. Additionally, Kemp (2018) recommended performing 2 to 4 repetitions of the analysis for statistical reliability [27].
For each sample evaluation, one cup of espresso (70–90 mL) was served. Experts evaluated the prepared samples following five steps: (1) On serving the espresso sample, observe the crema and evaluate it; (2) Place a demitasse spoon at the bottom of the cup, stir the espresso back and forth, and then inhale the aroma with the nose over the cup five times and evaluate the aroma; (3) Wait for one minute, then take the first sip to evaluate the flavor and body/mouthfeel; (4) Wait about 30 s, then take a second sip to evaluate the basic taste, continuity, aftertaste, and holistic/hedonic, and then evaluating balance/amplitude and defects; (5) When the espresso has cooled to room temperature, take a third sip, check its stability, and confirm the evaluations. Finally, the questionnaire was completed by providing an open-ended comment for the coffee sample or TEP, evaluating the overall quality score, and checking the willingness to use it as an espresso.

2.2. The Espresso ProtocolTM (TEP)

TEP is a coffee quality evaluation tool that aims to allow users to quantify and indicate the characteristics and attributes of espresso coffees in an efficient, precise, and repeatable way across panelists and geographies. In our study, TEP is composed of questions in the order of appearance during espresso experiences using multiple check-all-that-apply questions, two 100-point scale questions, several “check-one” questions, two yes/no questions, two expository text questions, and a section for open-ended comments. More details about the questionnaires are shown in Table 4. An online survey tool, QualtricsXM (Seattle, WA, USA), was used to execute TEP. The evaluation was carried out using personal hand-held mobile devices. Once experts scanned the QR code with the electronic device, they were directed to the corresponding questionnaires. In each replication, experts were given a placemat with the QR code and sample code displayed on it.
To evaluate aroma, flavor, tactile impact, basic tastes, and overall impact, TEP is a questionnaire developed using the World Coffee Research Sensory Lexicon [28]. The initial TEP [13] used the 5-point JAR scale in the protocol, but the questionnaires in this current study were revised by adding more sensory characteristics used in CATA and adding them as drop-down options. The composition of the TEP used in previous studies can be found in Table 1 of Kim (2025) [13]. The same 13 sensory characteristic terms (descriptions) were used in the categories of ‘Aroma’ and ‘Flavor’, and these 13 terms have a more detailed drop-down menu (Table 4). For example, if one selects the description of ‘Fruity’, ‘Nutty’, and ‘Sweet’ in the category of ‘Aroma’, then one will see a screen that selects the detailed description of each of the items. The same logic applies to the ‘Flavor’ category.

2.3. Participants

The Espresso Protocol (TEP) participants were composed of a total of 46 experts from France (n = 7), India (n = 12), Italy (n = 7), the Republic of Korea (n = 10), and the USA (n = 10), who currently resided in the designated countries (Table 5). Experts who evaluated coffee using the TEP in this study were recruited as adults over 19 years of age, with at least 3 years of experience in the coffee industry, no caffeine sensitivity, and were not pregnant or lactating.
Before the experiment, the experts who would evaluate the coffee samples were provided with the WCR Sensory Lexicon and the evaluation guidelines, but no separate training sessions were conducted. On the day of the experiment, in order to ensure a smooth online survey process, a detailed explanation of the survey flow and evaluation methods was given to the participants prior to the coffee evaluation, ensuring they had a clear understanding of the procedure. Additionally, the Coffee Taster’s Flavor Wheel (2016) from the Specialty Coffee Association (SCA) and World Coffee Research (WCR) was provided for reference during the evaluation process.

2.4. Data Analysis

CLUSCATA was conducted using CATA sensory characterization data to determine whether replications from individual experts were grouped together and whether experts were divided into separate clusters. The homogeneity index of each country was confirmed by the results of CLUSCATA. Cochran’s Q test was performed to verify the frequency of CATA terms and significant differences between eight samples in each evaluation category. Correspondence analysis (CA) was carried out using the same CATA data. The results of CA visually represented the relationship between samples and characteristics, and the similarity of sensory perception was also compared across samples in different countries using RV-coefficients. One-way ANOVA with Tukey post-hoc tests (p < 0.05) was conducted to examine the differences in the overall quality of samples evaluated in each country. Two-way ANOVA with Tukey post-hoc tests (p < 0.05) was performed to determine which countries and samples were statistically significantly different. CLUSCATA, Cochran’s Q test (performed in CATA data analysis), and RV-coefficients were analyzed using XLSTAT software (version 2024.1.0; Addinsoft Inc., New York, NY, USA). Correspondence analysis, One-way ANOVA, and Two-way ANOVA were analyzed using SAS software (version 9.4; SAS Institution Inc., Cary, NC, USA).

3. Results

3.1. Repeatability of TEP

CLUSCATA is a method designed to segment subjects such that the homogeneity within a cluster is maximized. It is based on the same principles as CATATIS, which calculates group mean data with down-weighted subjects [29,30]. Building on this, our study sought to verify that the results of TEP were evaluated consistently through CLUSCATA, which we interpret as ensuring reliable evaluation data from experts. The data from each country were the sensory attributes selected in the CATA portion of the study. In XLSTAT, the class number selection was automatically maintained to ensure more accurate cluster analysis, and the integration of classes obtained through the layering algorithm was applied [31]. The homogeneity of the experts in our study, together with the repeatability of TEP, was verified. Based on the data evaluated by the participants, both the homogeneity of individuals and cross-cultural consistency can be confirmed.

3.1.1. Verification of Homogeneity in Expert Evaluation Through CLUSCATA

Table 6 lists the experts grouped together for each of the five clusters. Participant numbers were used as follows: 101–110 for the USA, 201–208 for Italy (excluding 202 due to missing data), 301–310 for the Republic of Korea, 401–412 for India, and 801–807 for France. The four-digit numbers are identifiers used to distinguish the evaluation data, with each number representing one of the three evaluation sessions per participant.
Data from each individual expert’s three replications were grouped into the same cluster. For example, in Cluster A, the first, second, and third replications of expert 302 (1302, 2302, 3302) were grouped in the same cluster, showing consistency for that expert. Overall, five clusters of CATA terms were identified. For Korean experts, two distinct clusters emerged: Clusters A and B. However, the other four countries formed three mixed clusters. This suggests that the Republic of Korea experts, although generally consistent within themselves, use notably distinct sensory terminology compared to those of the other countries.
Figure 1 presents the dendrogram of CLUSCATA results from each country. It was confirmed that the experts’ evaluations were consistent, as the individual data from three replications were generally grouped into individual clusters. In the USA, as shown in Figure 1a, 9 out of 10 experts showed repeatability. In Italy, Figure 1b, 5 out of 7 experts showed repeatability.
In the Republic of Korea, shown in Figure 1c, the homogeneity of individual evaluations was clearly demonstrated, as clusters were formed for each of the 10 participants. Among them, participants 307 and 301 exhibited the highest homogeneity, with scores of 0.7 or higher. In India, as indicated in Figure 1d, 9 out of 12 experts showed repeatability. In France, as shown in Figure 1e, most experts formed a single cluster except for expert 804, leading to 6 out of 7 experts showing repeatability.
In many studies, when CLUSCATA is conducted, very low in similarity and atypical responses are often excluded from the analysis, and on average, approximately one-third of responses are classified as “K + 1 clusters” or “noise clusters” [32]. It has been demonstrated that when atypical respondents were excluded, the homogeneity index of clusters increased [30]. The preceding studies [30] found that the homogeneity index improved to 0.482 and 0.407 as the K + 1 cluster was excluded. However, in this study, there were no data considered atypical respondents; thus, none were excluded as the K + 1 cluster, indicating a good level of homogeneity across all five countries regardless of cross-cultural differences. Although some participants did not show consistent evaluation tendencies across all three evaluations, CLUSCATA analyses showed that most participants evaluated repeatedly.

3.1.2. Comparison of Homogeneity Index by Countries

CLUSCATA calculated the homogeneity index, which results from verifying the similarity of CATA responses among experts. The homogeneity index for each group indicates the similarity in their use of sensory characteristics. Table 7 shows that the global homogeneity index for all evaluation data (n = 138) is 0.364. In particular, the homogeneity index was found to increase as the clusters were further subdivided in each country, suggesting that the terms used within each country were more similar. A recently published chocolate study found homogeneity indices of 0.550 and 0.538 in the results after excluding the K + 1 cluster, with the lowest homogeneity index being 0.402 when evaluated by consumers [30]. In our study, a homogeneity index of 0.5 or higher was observed in four countries, with the exception of Italy (Table 7). In addition, the number of terms provided in TEP was much higher, which can reduce the homogeneity index. In contrast to the findings where the homogeneity index became more pronounced after the removal of atypical respondents [32], our study found that no evaluation data were classified as “K + 1”. This indicates that there were no data negatively impacting the homogeneity index.
Moreover, in Figure 1, the horizontal axis, D (delta), represents the value reflecting the increase in internal heterogeneity when clusters merge and should be kept as small as possible [29]. The values on the y-axis represent the distance or difference between clusters as they are merged, facilitating an initial analysis of the clustering level of the data. Specifically, a larger D value indicates a greater difference between clusters, while a smaller value suggests higher similarity. This pattern is consistent with the homogeneity index presented in Table 7. The D values increase in the following order: Republic of Korea, France, India, USA, and Italy, which corresponds to the same ranking observed in the homogeneity indices.

3.2. Cross-Cultural Comparison

3.2.1. Cross-Cultural Comparison of RV Coefficients for Sample Configurations

Table 8 presents the results of comparing sample configurations across five countries based on the CA of sensory characteristics assessed using the CATA method. The closer an RV coefficient is to 1, the more similar the sample and attribute configurations are [33,34,35]. The eight samples were classified using CATA (Check-All-That-Apply) terms that exhibited slight differences by country. The sample configurations in the first and second dimensions of CA for the RV coefficient were higher than 0.7 and significant for five out of ten comparisons. An acceptable degree of similarity was observed in half of the comparisons, despite potential cultural differences (e.g., preference, consumption tendency, type of response, etc.).
There are varying perspectives on the stability of the RV coefficient, and it is important to assess its significance and adjust the threshold according to the required precision [36,37,38]. For instance, an RV value of 0.85 is considered indicative of ‘good test-retest repeatability’ [39], while RV coefficients ranging from 0.65 to 0.71 have been reported as reflecting adequate agreement between consumers and trained panels [40]. Considering these values, five out of ten comparisons in our study were statistically significant, suggesting that the RV coefficients are stable.
Figure 2 presents a visualization of the correspondence analysis (CA) used in the comparison. It must be noted that the proximity of characteristics to each other does not indicate a direct relationship, as this is not represented in the layout [41]. However, the proximity of terms to a sample typically does indicate that some of those terms may be found at higher levels in those closer samples than in other samples. Cochran’s Q test was performed to identify the significant differences between the sample frequencies for each attribute [42], and the results were used to explain CA biplots.

3.2.2. Difference Between Country and Sample on Overall Quality

Table 9 shows the results of a One-way ANOVA on the overall quality score, based on a 0–100 point scale, evaluated in three replicates separately for five countries.
By country, there were significant differences in the samples from all countries except the USA. A common trend emerged across all countries, with Ethiopia being rated as having the highest overall quality and Pre-Packaged coffee as the lowest overall quality. The Pre-Packaged coffee sample had been roasted much earlier before the evaluation, which might have influenced the lower quality score. These common results suggest that the quality evaluation of these two samples was not influenced by cross-cultural factors. However, the rankings of the other six samples varied slightly between countries, which is believed to reflect cultural differences, such as consumer preferences in each country.
Figure 3 shows significant interactions of the overall quality score (0–100) between countries and samples (p < 0.0001).
The USA did not distinguish between coffee samples, whereas Italy and the Republic of Korea employed a broader range of the scale to differentiate samples based on the overall quality score. At first glance, this data suggests that the experts in most countries were more discriminating among samples than those in the USA. However, because almost all the USA experts were consistent in their evaluations, the data may suggest that the USA group of experts simply found all samples to be of equal quality. Differences in quality evaluation are exemplified by the ‘Brazil Cerrado’ and ‘Brazil’ coffee samples, which differed between countries, likely reflecting what is accepted as good quality coffee in each country.

4. Discussion

Analysis of the homogeneity index of CATA data has not been performed frequently so far, and prior research on this topic is limited. Although consumers’ chocolate characterization using CATA research was one of few studies that examined the homogeneity index [30], we report here that experts in our study resulted in higher homogeneity scores without excluding atypical data. In fact, the homogeneity index from a couple of countries was the highest reported thus far.
No atypical data were found in our study, meaning that all expert evaluation data were analytically significant. Although homogeneity among the data was relatively low in Italy, this suggests not only a difference in the use of sensory characteristics but also indicates that these differences may be influenced by the role of the expert in the coffee industry. Based on demographic information from Italy, four out of seven participants were working as roasters. The remaining individuals in Italy included a researcher, a technician, and a staff member. This suggests that it may have been relatively new for them to evaluate coffee using the sensory characteristics or terms that describe coffee. In contrast, nine out of ten Republic of Korea participants took on barista roles, with additional responsibilities in other roles. Only one participant served as a roaster. The Republic of Korea exhibited the highest homogeneity index among the five countries, with the data forming a single cluster, reflecting much higher homogeneity in using terms to describe specific coffee samples. This suggests that homogeneity in using sensory characteristics may be influenced by the participants’ roles.
Although various factors, such as roasting degree, evaluation environment, extraction ratio, extraction yield, and water quality, differed across countries, this present study focused on validating the repeatability of TEP despite these variations. However, there were some differences in the terminology used to describe and assess the samples across countries. Regarding the overall quality score, there was a common trend of evaluation by country. Ethiopia was evaluated as having the highest overall quality coffee, while Pre-Packaged was rated as having the lowest overall quality. These two samples were evaluated similarly across all five countries, suggesting that the quality evaluation for these two samples was not influenced by cross-cultural factors. However, it was not possible to rule out the possibility that the other six samples were evaluated with slight differences in ranking. Additionally, in the open-ended questions asking about the preferred country of origin, Ethiopia was the most frequently mentioned, followed by Kenya, Colombia, India, and Panama (Geisha). Other origins mentioned included various regions and varieties, such as Africa, Bourbon, Brazil, Cidra, Congo, Costa Rica (Villa Sarchi), Ecuador, Honduras, Indonesia, Rwanda, and the USA. Based on these responses, it can be inferred that the overall higher quality score for the Ethiopian coffee, even when tested in a blind study, was likely influenced by the evaluators’ familiarity with and preferences for those sensory characteristics.
In this study, we expanded the terminology for describing sensory characteristics when evaluating coffee using TEP, allowing for a more effective and diverse description of the coffee than in previous TEP studies [13]. This approach enables a more detailed tracking of the characteristics of espresso extraction as required by buyers and producers. Although the roasting degree was not identical across countries, the individual reproducibility and repeatability of TEP were sufficiently demonstrated, meeting the objectives of this study.
The experts were ‘untrained’ in the specific terms used in this study, although they had received training in the overall concepts of the WCR lexicon [28] and the coffee flavor wheel [43]. That makes them more ‘trained’ than consumers, but, like consumers, experts may face challenges in evaluating coffee due to its complex flavor compounds. Several studies have demonstrated that meaningful results can be achieved through targeted training for coffee evaluation [9,44,45,46,47,48]. By developing educational tools tailored to consumers’ level of understanding of the sensory lexicon, the use of TEP can be made more effective, allowing the coffee industry to better align its products with consumers’ perspectives, ultimately enhancing consumers’ espresso experience.

5. Limitations

There are several limitations in this study. First, the number of participants varied by country. Although we initially planned to recruit 10 experts from each country, the actual number of participants ranged from 7 to 12, depending on the circumstances in each country. Additionally, differences in terminology understanding and usage across countries and individuals impacted the repeatability and homogeneity of the evaluations. It is believed that training on sensory characteristics, using a standardized framework such as the WCR sensory lexicon, is necessary to improve the homogeneity and clarity of espresso evaluations. Finally, participants were unfamiliar with the protocol and, thus, it would have taken some time for familiarization. Repeated use of TEP in future studies should help increase the homogeneity of the evaluation results.

6. Conclusions

In conclusion, our study successfully demonstrated the repeatability of TEP, a protocol for evaluating espresso with experts. Despite variables such as roasting level, extraction recipes, and water quality varying across countries, the homogeneity of sensory characteristics and individual repeatability were high in most countries, yielding meaningful results. Although there were some differences in the terminology used to describe the samples, it could be improved through training on the sensory lexicon. Notably, the Ethiopia and Pre-Packaged samples were not influenced by cross-cultural factors, considering both the overall quality score and CATA characterizations, while differences in terminology and the overall quality score ranking of the other six samples were observed.
In future studies, the effect of coffee lexicon training preceding TEP use should be considered. In addition, determining whether consumers can use TEP would be important because TEP could be utilized as a communication tool between the coffee industry and their direct consumers’ perspectives on quality coffee.
Additionally, by classifying roles within the coffee industry and identifying the specific factors that contribute to differences between these roles, utilizing TEP can generate meaningful data that would also offer valuable information to the coffee industry.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/foods14040593/s1, Table S1: The Roasting Information.

Author Contributions

Conceptualization, J.L. and E.C.IV; methodology, J.C. and J.L.; software, J.C.; validation, J.L. and E.C.IV; formal analysis, J.C.; investigation, J.C. and J.L.; resources, J.L.; data curation, J.C.; writing—original draft preparation, J.C.; writing—review and editing, E.C.IV and J.L.; visualization, J.C.; supervision, J.L.; project administration, J.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding. The APC was funded by the BK21 Four Program: Precision Nutrition Program for Future Global Leaders, Pusan National University.

Institutional Review Board Statement

This study was approved by the Institutional Review Board at Pusan National University (PNU IRB/2023_209_HR) for the Republic of Korea on 22 November 2023 and the Committee on Research with Human Subjects at Kansas State University (proposal number 5930) for the USA, Italy, India, and France on 15 April 2020.

Informed Consent Statement

All consumers were given informed consent and were allowed to opt out of the research at any time.

Data Availability Statement

The datasets presented in this article are not readily available because of ethical restrictions to protect participants’ privacy.

Acknowledgments

International taste trials of this research were funded by the Calderia O’Connor DeFaria Foundation.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. ICO. International Coffee Organization. Available online: https://www.icocoffee.org/documents/cy2023-24/annual-review-2022-2023-e.pdf (accessed on 7 February 2025).
  2. Worku, M.; Duchateau, L.; Boeckx, P. Reproducibility of coffee quality cupping scores delivered by cupping centers in Ethiopia. J. Sens. Stud. 2016, 31, 423–429. [Google Scholar] [CrossRef]
  3. Nijssen, L.M.; Visscher, C.A.; Maarse, H.; Willemsense, L.C.; Boelens, M.H. Volatile Compounds in Food; TNO Nutrition and Food Research Institute: Zeist, The Netherlands, 1996. [Google Scholar]
  4. Hasni, D.; Muziafa, M.; Widayat, H.P.; Ariska, N.; Maulidar; Rahmad, D. Physicochemical properties and cupping quality of Gayo Espresso Coffee based on blending ratio and roasting techniques. Int. J. Adv. Sci. Eng. Inf. Technol. 2023, 13, 1378–1386. [Google Scholar] [CrossRef]
  5. Angeloni, G.; Guerrini, L.; Masella, P.; Parenti, A.; Bellumori, M.; Daluiso, S.; Innocenti, M. What kind of coffee do you drink? An investigation on effects of eight different extraction methods. Food Res. Int. 2019, 116, 1327–1335. [Google Scholar] [CrossRef] [PubMed]
  6. Parenti, A.; Guerrini, L.; Masella, P.; Spinelli, S.; Calamai, L.; Spugnoli, P. Comparison of espresso coffee brewing techniques. J. Food Eng. 2014, 121, 112–117. [Google Scholar] [CrossRef]
  7. Cordoba, N.; Femandez-Alduenda, M.; Moreno, F.L.; Ruiz, Y. Coffee extraction: A review of parameters and their influence on the physicochemical characteristics and flavour of coffee brews. Trends Food Sci. Technol. 2020, 96, 45–60. [Google Scholar] [CrossRef]
  8. Specialty Coffee Association. 2018. Available online: https://www.scith.coffee/wp-content/uploads/2021/03/SCA-Protocols-_-Best-Practices.pdf (accessed on 7 February 2025).
  9. Di Donfrancesco, B.; Gutierrez Guzman, N.; Chambers, E., IV. Comparison of results from cupping and descriptive sensory analysis of Colombian brewed coffee. J. Sens. Stud. 2014, 29, 301–311. [Google Scholar] [CrossRef]
  10. SCAA. Specialty Coffee Association of America. 2005. Available online: https://atlanticspecialtycoffee.com/wp-content/uploads/SCAA-Cupping-Protocols-2005.pdf (accessed on 7 February 2025).
  11. SCA. Specialty Coffee Association. 2022. Available online: https://sca.coffee/sca-news/read/evolving-the-sca-cupping-protocol-and-form-an-overview-of-the-pilot-testing-process (accessed on 7 February 2025).
  12. SCA. Specialty Coffee Association. 2024. Available online: https://sca.coffee/sca-news/sca-new-cva-cupping-standards (accessed on 7 February 2025).
  13. Kim, Y.; An, J.; Lee, J. The espresso protocol as a tool for sensory quality evaluation. Food Res. Int. 2025, 202, 115670. [Google Scholar] [CrossRef]
  14. Kim, Y.K.; Jombart, L.; Valentin, D.; Kim, K.O. Familiarity and liking playing a role on the perception of trained panelists: A cross-cultural study on teas. Food Res. Int. 2015, 71, 155–164. [Google Scholar] [CrossRef]
  15. Goldman, A. Important considerations in cross-cultural sensory and consumer research—Introductory overview. Food Qual. Pref. 2006, 17, 647. [Google Scholar] [CrossRef]
  16. Charles, M.; Romano, A.; Yener, S.; Barnabà, M.; Navarini, L.; Märk, T.D.; Biasoli, F.; Gásperi, F. Understanding flavour perception of espresso coffee by the combination of a Dynamic sensory method and in vivo nosespace analysis. Food Res. Int. 2015, 69, 9–20. [Google Scholar] [CrossRef]
  17. Kwon, D.J.; Lee, M.J.; Park, O.J. Consumer's sensory evaluation of specialty grade coffee during storage. Korean J. Food Nutr. 2012, 25, 1033–1038. [Google Scholar] [CrossRef]
  18. Zuritz, C.A.; Foster, K.; Baributsa, D.; Parra, C.A.; Natalie, D. Coffee storage in hermetic bags and its influence on prices and sensory scores. Rev. Cienc. Agrícolas 2022, 39, 14–32. [Google Scholar] [CrossRef]
  19. Nicola, C.; Alessandro, G.; Mariana, D.C.; Alberto, C.; Raffaele, S. Neapolitan coffee brew chemical analysis in comparison to espresso, moka and American brews. Food Res. Int. 2014, 61, 152–160. [Google Scholar] [CrossRef]
  20. Nóbrega, I.C.C.; Costa, I.H.L.; Macedo, A.C.; Ishihara, Y.M.; Lachenmeier, D.W. Serving temperatures of bestselling coffees in two segments of the Brazilian foods service industry are “Very Hot”. Foods 2020, 9, 1047. [Google Scholar] [CrossRef]
  21. Steen, I.; Waehrens, S.S.; Petersen, M.A.; Münchow, M.; Bredie, W.L.P. Influence of serving temperature on flavour perception and release of Bourbon Caturra coffee. Food Chem. 2017, 219, 61–68. [Google Scholar] [CrossRef]
  22. Maeztu, L.; Andueza, S.; Ibañez, C.; De Peña, M.P.; Bello, J.; Cid, C. Multivariate methods for characterization and classification of espresso coffees from different botanical varieties and types of roast by foam, taste, and mouthfeel. J. Agric. Food Chem. 2001, 49, 4743–4747. [Google Scholar] [CrossRef]
  23. Yener, S.; Romano, A.; Cappellin, L.; Del Pulgar, J.S.; Gasperi, F.; Biasioli, F.; Märk, T.D.; Navarinic, L. PTR-ToF-MS characterisation of roasted coffees (C. arabica) from different geographic origins. J. Mass Spectrom. 2014, 49, 929–935. [Google Scholar] [CrossRef]
  24. Yan, M.; Li, Q. Consumer Behavior in Boffee Drinking: Comparison Between Chinese and Swedish University Utudents. University of Gävle, Sweden, 26-08-2016. Available online: https://hig.diva-portal.org/smash/record.jsf?pid=diva2%3A954525&dswid=-2903 (accessed on 7 February 2025).
  25. Nascimento, M.O.; Ombredane, A.S.; de Oliveira, L.L. Descriptive sensory tests for evaluating Coffea arabica: A systematic review. Coffee Sci. 2024, 19, e192204. [Google Scholar] [CrossRef]
  26. Issanchou, S. General considerations. In Descriptive Analysis in Sensory Evaluation; Kemp, S.E., Hort, J., Hollowood, T., Eds.; Wiley Blackwell: Hoboken, NJ, USA, 2018; pp. 41–80. ISBN 9781118991671. [Google Scholar]
  27. Kemp, S.E.; Ng, M.; Hollowood, T.; Hort, J. Introduction to descriptive analysis. In Descriptive Analysis in Sensory Evaluation; Kemp, S.E., Hort, J., Hollowood, T., Eds.; Wiley Blackwell: Hoboken, NJ, USA, 2018; pp. 1–39. ISBN 9781118991671. [Google Scholar]
  28. Chambers, E., IV; Sanchez, K.; Phan, U.X.; Miller, R.; Civille, G.V.; Di Donfrancesco, B. Development of a “living” lexicon for descriptive sensory analysis of brewed coffee. J. Sens. Stud. 2016, 31, 465–480. [Google Scholar] [CrossRef]
  29. Llobell, F.; Cariou, V.; Vigneau, E.; Labenne, A.; Qannari, E.M. A new approach for the analysis of data and the clustering of subjects in a CATA experiment. Food Qual. Pref. 2019, 72, 31–39. [Google Scholar] [CrossRef]
  30. An, J.; Lee, J. Consumers’ sensory perception homogeneity and liking of chocolate. Food Qual. Pref. 2024, 118, 105178. [Google Scholar] [CrossRef]
  31. XLSTAT. 2023. Available online: https://help.xlstat.com/6515-clustering-tables-means-clustatis-excel (accessed on 7 February 2025).
  32. Llobell, F.; Giacalone, D.; Labenne, A.; Qannari, E.M. Assessment of the agreement and cluster analysis of the respondents in a CATA experiment. Food Qual. Pref. 2019, 77, 184–190. [Google Scholar] [CrossRef]
  33. Robert, P.; Excoufier, Y. A unifying tool for linear multivariate statistical methods: The RV coefficient. J. R. Stat. Soc. C Appl. Stat. 1976, 25, 257–265. [Google Scholar] [CrossRef]
  34. Abdi, H. Congruence: Congruence coeffcient, RV-coefficient, and Mantel coefficient. In Encyclopedia of Research Design; Salkine, N., Ed.; Sage: Thousand Oaks, CA, USA, 2010; pp. 222–229. Available online: https://personal.utdallas.edu/~herve/abdi-congruence2010-pretty.pdf (accessed on 7 February 2025).
  35. El Ghaziri, A.; Qannari, E.M. Measures of association between two datasets; application to sensory data. Food Qual. Pref. 2015, 40 Pt A, 116–124. [Google Scholar] [CrossRef]
  36. Blancher, G.; Parcon, J.; Clavier, B.; Egoroff, C.; Duineveld, K. A method to investigate the stability of a sorting map. Food Qual. Pref. 2012, 23, 36–43. [Google Scholar] [CrossRef]
  37. Ares, G.; Tárrega, A.; Izquierdo, L.; Jaeger, S.R. Investigation of the number of consumers necessary to obtain stable sample and descriptor configurations from check-all-that-apply (CATA) questions. Food Qual. Pref. 2014, 31, 135–141. [Google Scholar] [CrossRef]
  38. Vidal, L.; Antúnez, L.; Ares, G.; Tárrega, A.; Jaeger, S.R. Comparison of Correspondence Analysis based on Hellinger and chi-square distances to obtain sensory spaces from check-all-that-apply (CATA) questions. Food Qual. Pref. 2015, 43, 106–112. [Google Scholar] [CrossRef]
  39. Lawless, H.T.; Glatter, S. Consistency of multidimensional scaling models derived from odor sorting. J. Sens. Stud. 1990, 5, 217–230. [Google Scholar] [CrossRef]
  40. Lelièvre, M.; Chollet, S.; Abdi, H.; Valentin, D. What is the validity of the sorting task for describing beers? A study using trained and untrained assessors. Food Qual. Pref. 2008, 19, 697–703. [Google Scholar] [CrossRef]
  41. Meyners, M.; Castura, J.C.; Carr, B.T. Existing and new approaches for the analysis of CATA data. Food Qual. Pref. 2013, 30, 309–319. [Google Scholar] [CrossRef]
  42. Jaeger, S.R.; Cadena, R.S.; Torres-Moreno, M.; Antúnez, L.; Vidal, L.; Giménez, A.; Hunter, D.C.; Beresfor, M.K.; Kam, K.; Yin, D.; et al. Comparison of check-all-that-apply and forced-choice Yes/No question formats for sensory characterisation. Food Qual. Pref. 2014, 35, 32–40. [Google Scholar] [CrossRef]
  43. Spencer, M.; Sage, E.; Velez, M.; Guinard, J.-X. Using single free sorting and multivariate exploratory methods to design a new coffee taster’s flavor wheel. J. Food Sci. 2016, 81, S2997–S3005. [Google Scholar] [CrossRef]
  44. Lawless, H.T. Descriptive analysis of complex odors: Reality, model or illusion? Food Qual. Pref. 1999, 10, 325–332. [Google Scholar] [CrossRef]
  45. Alaoui-Ismaïli, O.; Robin, O.; Rada, H.; Dittmar, A.; Vermet-Maury, E. Basic emotions evoked by odorants: Comparison between autonomic responses and self-evaluation. Physiol. Behav. 1997, 62, 713–720. [Google Scholar] [CrossRef]
  46. Chollet, S.; Valentin, D. Impact of training on beer flavor perception and description: Are trained and untrained subjects really different? J. Sens. Stud. 2001, 16, 601–618. [Google Scholar] [CrossRef]
  47. Köster, E.P. The psychology of food choice: Some often encountered fallacies. Food Qual. Pref. 2003, 14, 359–373. [Google Scholar] [CrossRef]
  48. Labbe, D.; Rytz, A.; Hugi, A. Training is a critical step to obtain reliable product profiles in a real food industry context. Food Qual. Pref. 2004, 15, 341–348. [Google Scholar] [CrossRef]
Figure 1. Dendrograms of CLUSCATA results 1 for each country based on sensory characteristics. Results of (a) USA; (b) Italy; (c) Republic of Korea; (d) India; and (e) France. 1 The horizontal axis, D (delta), represents the value reflecting the increase in internal heterogeneity when clusters merge.
Figure 1. Dendrograms of CLUSCATA results 1 for each country based on sensory characteristics. Results of (a) USA; (b) Italy; (c) Republic of Korea; (d) India; and (e) France. 1 The horizontal axis, D (delta), represents the value reflecting the increase in internal heterogeneity when clusters merge.
Foods 14 00593 g001aFoods 14 00593 g001b
Figure 2. Correspondence analysis 1 in each country: (a) France; (b) India; (c) Italy; (d) Republic of Korea; (e) USA.; 1 Empty circles (○) indicate attributes; Filled rhombuses (♦) indicate samples.
Figure 2. Correspondence analysis 1 in each country: (a) France; (b) India; (c) Italy; (d) Republic of Korea; (e) USA.; 1 Empty circles (○) indicate attributes; Filled rhombuses (♦) indicate samples.
Foods 14 00593 g002aFoods 14 00593 g002b
Figure 3. Cross-cultural comparison of significant interaction between eight samples and overall quality scores (p < 0.0001).
Figure 3. Cross-cultural comparison of significant interaction between eight samples and overall quality scores (p < 0.0001).
Foods 14 00593 g003
Table 1. Coffee bean information.
Table 1. Coffee bean information.
LabelOriginInformation
RegionFarmerProcessing MethodVariety
AColombiaNariñoAponte CommunityWashedCaturra
BEthiopiaGuji and YirgacheffeSmallholdersWashedHeirloom
CBrazilSul di Minas-NaturalCauai
DHonduras----
ECosta RicaTarrazu, San JoseTarrazu SmallholdersWashedCaturra, Catuai
FGuatemalaHuehertenangoSmallholdersWashedCaturra, Marsellesa, Catuai, and Bourbon
GBrazil CerradoCerrado MineiroHidetoshi Alberto OtaguriExtended Fermentation-
HPre-PackagedUp to nine originsVariesMixedArabica
Table 2. Brewing standards.
Table 2. Brewing standards.
Parameters
Brewing Pressure9 Bar
Temperature92–94 °C (198–201 °F)
Filter Basket Volume15–17 g ± 1 g) to make two servings
(Use the filter basket that matches the size of the dose, either 15 g or 17 g.)
Liquid Weight70–90 mL, two servings of 35–45 mL each
Extraction Time25 s ± 3 s
Extraction Ratio50–55% coffee weight to extraction time
Table 3. Water standards 1.
Table 3. Water standards 1.
CharacteristicTargetAcceptable Range
OdorClean, Fresh/Odor FreeClean, Fresh/Odor Free
ColorClear ColorClear Color
Total ChlorineNoneNone
TDS150 mg/L75–250 mg/L
Calcium Hardness50–175 ppm CaCO3
(4 grains or 68 mg/L)
50–175 ppm CaCO3
(1–5 grains or 17 mg/L–85 mg/L)
Total Alkalinity40 ppmAt or near 40–70 ppm
pH7.06.0–8.0
Sodium10 mg/LAt or near 10 mg/L
1 The specifications are based on those set by the SCAA Standards Committee (Water Standards Version:21NOV2009A, by the Specialty Coffee Association of America) for use in brewing specialty coffee with a few modifications for TEP based on industry recommendations (shown in italics).
Table 4. The Espresso ProtocolTM (TEP) questionnaire composition.
Table 4. The Espresso ProtocolTM (TEP) questionnaire composition.
Evaluation CategoryScaleTerms in CATA
A. Description of the espresso
CremaCATANone, Hole in crema, Dark, Brown, Light Brown, Off Color, Multi-hued, Tiger Stripe, Persistent, Dissipated Quickly, Splotchy, Bubbles, Thick, Thin
Aroma, Flavor 1)CATAFruity 4), Floral 4), Nutty 4), Cereal 4), Cocoa 4), Sweet 4), Earthy 4), Roasted 4), Spices 4), Vegetative 4), Stale/Papery 4), Chemical 4), Alcohol/Fermented 4)
Fruity 1)CATACitrus fruit 4), Lemon 4), Grapefruit 4), Orange 4), Lime 4), Berry 4), Strawberry 4), Raspberry 4), Blackberry 4), Dried Fruit 4), Raisin 4), Prune 4), Other Fruit 4), Apple 4), Pear 4), Peach 4), Grape 4), Cherry 4), Pomegranate 4), Coconut 4), Pineapple 4), Other 2), Not Sure 3)
Floral 1)CATABlack tea 4), Jasmine 4), Rose 4), Chamomile 4), Other 2), Not Sure 3)
Nutty 1)CATAPeanuts 4), Almond 4), Hazelnut 4), Other 2), Not Sure 3)
Cereal 1)CATAGrain 4), Malt 4), Other 2), Not Sure 3)
Cocoa 1)CATACocoa 4), Chocolate 4), Dark Chocolate 4), Other 2), Not Sure 3)
Sweet 1)CATASweet 4), Molasses 4), Maple Syrup 4), Brown Sugar 4), Caramelized 4), Honey 4), Vanilla 4), Other 2), Not Sure 3)
Earthy 1)CATAMusty/Earthy 4), Meaty/Brothy 4), Animalic 4), Moldy/Damp 4), Smoky 4), Phenolic 4), Other 2), Not Sure 3)
Roasted 1)CATAAcrid 4), Brown roast 4), Burnt 4), Ashy 4), Tobacco 4), Pipe Tobacco 4), Other 2), Not Sure 3)
Spices 1)CATABrown spice 4), Pepper 4), Nutmeg 4), Clove 4), Cinnamon 4), Anise 4), Other 2), Not Sure 3)
Vegetative 1)CATAHay-like 4), Herb-like 4), Beany 4), Under-ripe 4), Green 4), Dark Green 4), Peapod 4), Olive Oil 4), Raw 4), Fresh 4), Other 2), Not Sure 3)
Stale/Papery 1)CATAStale 4), Woody 4), Cardboard 4), Papery 4), Other 2), Not Sure 3)
Chemical 1)CATABitter 4), Medicinal 4), Petroleum 4), Skunky 4), Rubber 4), Salty 4), Other 2), Not Sure 3)
Alcohol/Fermented 1)CATAFermented 4), Overripe/Near Fermented 4), Winey 4), Whiskey 4), Other 2), Not Sure 3)
Flavor IntensityCATAMild, Pleasant, Prominent, Strong, Overbearing
B. Tactile Impact
Body/MouthfeelCATAThickness 4), Tea Like, Mouthwatering, Smooth, Grainy, Creamy, Soft, Round, Bold, Thin, Metallic 4), Mouth Drying 4), Oily
Body/Mouthfeel IntensityCheck-OneHigh, Medium, Low
C. Taste Basics
SweetnessCheck-OneSweet 4), Molasses 4), Maple Syrup 4), Brown Sugar 4), Caramelized 4), Honey 4), Vanilla 4)
SweetnessCheck-OnePleasant, Unpleasant
AcidityCATACrisp, Clean, Biting, Juicy, Winey 4), Vibrant, Tart, Citric Acid 4), Bright, Sparkling, Puckering
AcidityCheck-OnePleasant, Unpleasant
Bitter Intensity0–100 points scale
BitterCheck-OnePleasant, Unpleasant
UmamiCATAMouthwatering, Complex, Tongue Coating, Savory, Long Lasting
D. Overall Impact
Continuity of Taste/FlavorYes/No(If evaluators choose ‘Yes’, it will move on to the below menu.)
Continuity of Taste/FlavorCATAInitial, Middle, Late, Finish
AftertasteCATAJuicy, Clean, Persistent, Simple, Complex, Astringent, Flat, Elegant
AftertasteCheck-OneShort, Medium, Long
Holistic/HedonicCATAAromatic, Pleasant, Balanced/Blended 4), Harmonious, Full/Bold 4), Lively, Stable, Irregular/Disjointed, Bland/Flat
Key Memorable ExperienceExpository Text
Unique CharacteristicsExpository Text
Balance/AmplitudeCATAComplex, Elegant, Fullness, Satisfying, Blended, Harmonious, Balanced, Stable Through Cooling, Bland/Flat, Falls Apart Through Cooling, Disjointed, Unbalanced
DefectsCATAStale 4), Papery/Cardboard 4), Fermented 4), Metallic 4), Burnt 4), Rubber 4), Medicinal 4), Potato, Musty 4), Underdeveloped, Moldy 4), Petroleum 4), Woody 4), Rancid, Over Fermented 4), Smoky 4), Salty 4), Other 2), Not Sure 3)
Stability (through cooling)Yes/No
Additional CommentsOpen-ended comments
E. Final Questions
Overall Quality0–100 points scale
Willing to use for espresso extractionYes/No
Final CommentsOpen-ended comments
1) The ‘Aroma’ and ‘Flavor’ evaluations contained the same set of characteristic terms. 2) The ‘Other’ selection is optional, allowing participants to provide additional characteristics if those they perceive are not on the list during the evaluation. 3) The ‘Not Sure’ selection is also available to indicate that there were no defining characteristics; however, when ‘Not Sure’ was chosen, other characteristics could not be selected. 4) Terms in bold are taken from the World Coffee Research Sensory Lexicon [28].
Table 5. Demographic information.
Table 5. Demographic information.
Total
(n = 46)
France
(n = 7)
India
(n = 12)
Italy
(n = 7)
Republic of Korea
(n = 10)
USA
(n = 10)
Frequency (%)Frequency (%)Frequency (%)Frequency (%)Frequency (%)Frequency (%)
Gender
Man33 (71.7)5 (71.4)8 (66.7)6 (85.7)8 (80.0)6 (60.0)
Woman12 (26.1)1 (14.3)4 (33.3)1 (14.3)2 (20.0)4 (40.0)
Transgender0 (0.0)0 (0.0)0 (0.0)0 (0.0)0 (0.0)0 (0.0)
Non-binary/non-conforming1 (2.2)1 (14.3)0 (0.0)0 (0.0)0 (0.0)0 (0.0)
Prefer not to respond0 (0.0)0 (0.0)0 (0.0)0 (0.0)0 (0.0)0 (0.0)
Age
19–252 (4.3)0 (0.0)0 (0.0)0 (0.0)0 (0.0)2 (20.0)
26–3517 (37.0)4 (57.1)5 (41.7)0 (0.0)8 (80.0)0 (0.0)
36–4518 (39.1)3 (42.9)7 (58.3)4 (57.1)2 (20.0)2 (20.0)
46–555 (10.9)0 (0.0)0 (0.0)3 (42.9)0 (0.0)2 (20.0)
56–652 (4.3)0 (0.0)0 (0.0)0 (0.0)0 (0.0)2 (20.0)
66–752 (4.3)0 (0.0)0 (0.0)0 (0.0)0 (0.0)2 (20.0)
76 and above0 (0.0)0 (0.0)0 (0.0)0 (0.0)0 (0.0)0 (0.0)
Prefer not to respond0 (0.0)0 (0.0)0 (0.0)0 (0.0)0 (0.0)0 (0.0)
Role in Coffee Industry (Check-all-that-apply)
Barista20 (43.5)6 (85.7)2 (16.7)0 (0.0)9 (90.0)3 (30.0)
Roaster23 (50.0)6 (85.7)5 (41.7)4 (57.1)5 (50.0)3 (30.0)
Grower4 (8.7)0 (0.0)4 (33.3)0 (0.0)0 (0.0)0 (0.0)
Buyer13 (28.3)4 (57.1)4 (33.3)0 (0.0)4 (40.0)1 (10.0)
Other16 (34.8)0 (0.0)8 (66.7)3 (42.9)1 (10.0)4 (40.0)
Table 6. Cluster composition of CLUSCATA results 1.
Table 6. Cluster composition of CLUSCATA results 1.
Cluster A
(n = 21)
Cluster B
(n = 9)
Cluster C
(n = 44)
Cluster D
(n = 42)
Cluster E
(n = 22)
1302 2230813011409240234073401110812043208280514083102
1303230913071404240334061104110512053203280224083109
1305230513101402240934051103110712083207280411103106
1306330523011412241234092103110112031803380311063110
1304330623071406241134112104210822041807380511091201
1309330823101405240734083103210722051805380711021206
1308330933071401240634103104210122031804380621021207
2303330333101407240134121801210522081806380421102201
2304330233011410241034022801310722071802380221062206
23023304 1411240534042806310132042807 21093201
2306 1403240434033801310532052803 31083206
1 CLUSCATA was conducted for all five countries, including each expert’s data from three replications. 2 Numbers are comprised of replication (1st digit = 1–3), country (2nd digit: 1 = USA, 2 = Italy, 3 = Republic of Korea, 4 = India, and 8 = France) and the expert within a country (3rd and 4th digit, e.g., the 1st expert in a country is coded 01).
Table 7. The homogeneity index by country 1.
Table 7. The homogeneity index by country 1.
Total
(n = 138)
France
(n = 21)
India
(n = 36)
Italy
(n = 21)
Republic of Korea
(n = 30)
USA
(n = 30)
Homogeneity Index0.3640.6060.5720.3930.6520.562
Percent of experts being grouped with themselves (%)91.385.775.071.4100.090.0
1 CLUSCATA was conducted in each country for the three-replication data.
Table 8. The RV coefficient on sample configurations from correspondence analysis 1.
Table 8. The RV coefficient on sample configurations from correspondence analysis 1.
FranceIndiaItalyRepublic of KoreaUSA
France1----
India0.828 21--
Italy0.8930.8201--
Republic of Korea0.4830.4730.7011-
USA0.3380.3880.5700.7451
1 Correspondence analysis was performed for sample configurations. 2 Bold indices indicate that the RV coefficient was significant (p < 0.05).
Table 9. The average overall quality scores by sample 1.
Table 9. The average overall quality scores by sample 1.
ColombiaEthiopiaBrazilHondurasCosta RicaGuatemalaBrazil
Cerrado
Pre-
Packaged
MSDp-Value
France77.1 a 283.2 a79.7 a76.2 a80.5 a79.8 a81.5 a61.3 b9.3752<0.0001
India78.1 a78.3 a73.6 a74.8 a72.7 a74.7 a73.4 a66.1 b6.5176<0.0001
Italy55.5 a67.0 a57.7 a54.2 a51.6 a55.8 a61.0 a27.0 b17.6620<0.0001
Republic of Korea57.6 bc81.0 a59.6 bc65.1 b63.1 b67.0 ab57.9 bc44.8 c15.7660<0.0001
USA70.175.475.272.572.974.573.971.711.61500.8634
Total68.3 a77.3 a69.4 b69.3 b68.6 b70.9 b69.5 b56.0 c6.3571<0.0001
1 One-way ANOVA was performed for overall quality scores within each country. 2 Different letters within a row indicate significant differences for the Tukey post-hoc test with minimum significant difference (MSD) (p < 0.05).
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Choi, J.; Lee, J.; Chambers, E., IV. Cross-Cultural Comparison of the Espresso Protocol Repeatability. Foods 2025, 14, 593. https://doi.org/10.3390/foods14040593

AMA Style

Choi J, Lee J, Chambers E IV. Cross-Cultural Comparison of the Espresso Protocol Repeatability. Foods. 2025; 14(4):593. https://doi.org/10.3390/foods14040593

Chicago/Turabian Style

Choi, Jisoo, Jeehyun Lee, and Edgar Chambers, IV. 2025. "Cross-Cultural Comparison of the Espresso Protocol Repeatability" Foods 14, no. 4: 593. https://doi.org/10.3390/foods14040593

APA Style

Choi, J., Lee, J., & Chambers, E., IV. (2025). Cross-Cultural Comparison of the Espresso Protocol Repeatability. Foods, 14(4), 593. https://doi.org/10.3390/foods14040593

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop