Next Article in Journal
Shifting Perceptions and Behaviors: The Impact of Digitalization on Banking Services
Previous Article in Journal
The Impact of Artificial Intelligence Service Competency Among Korean Citizens on Digital Utilization Outcomes in the Context of Digital Trade Expansion: The Mediating Role of E-Commerce
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Encourage Online Consumers to Embrace Voice Shopping: Roles of Tasks, Technology, and Individual Characteristics

1
College of Economics and Management, Qingdao Agricultural University, Qingdao 266109, China
2
Department of Industrial & Management Engineering, Hanbat National University, Daejeon 34158, Republic of Korea
*
Author to whom correspondence should be addressed.
J. Theor. Appl. Electron. Commer. Res. 2025, 20(4), 293; https://doi.org/10.3390/jtaer20040293 (registering DOI)
Submission received: 5 July 2025 / Revised: 17 October 2025 / Accepted: 26 October 2025 / Published: 1 November 2025

Abstract

Voice shopping brings consumers convenience and retailers a new channel to reach buyers, which is an important component of online shopping. However, it has received a tepid response recently. Two issues must first be addressed to promote voice shopping: why consumers have a low acceptance of voice shopping and how to motivate their use. Since technology is intended to serve practical purposes, it is necessary to achieve a fit among voice shopping, online shopping tasks, and consumers. Accordingly, this study elaborates on fit and integrates the task-technology fit (TTF) theory (for objective fit) and the technology acceptance model (TAM, for subjective fit) to build a research model in which task, technology, and individual characteristics affect fit that will stimulate voice shopping intention. Using Structural Equation Modeling to analyze the data collected from 425 consumers who do not accept voice shopping, the results show that both objective fit and subjective fit can determine voice shopping intention; however, some critical task, technology, and individual characteristics cannot affect either aspect of fit, indicating that consumers still do not believe voice is workable in online shopping. These findings make suggestions for a purposeful upgrade of the voice shopping experience, which helps promote voice shopping and ultimately contributes to the prosperity of online shopping. This study also offers insights into what constitutes fit and its roles in the integrated model of the TTF theory and TAM.

1. Introduction

Since the computer was introduced to the public, typing has been a common way to interact with devices. For a long time, people have become accustomed to opening a website and typing a few words when they need information. Recently, the rapid development of voice technology has changed this situation—it serves users via their most familiar way of interaction, i.e., speaking in a natural language. The voice assistant, one of the most typical applications, such as Siri, Google Assistant, and Alexa, stands out. With Speech Recognition, Natural Language Processing, and Text-to-Speech functions, they can elaborate on human speech, understand their intent, and respond in a human-like voice. Increasingly, people are showing enthusiasm for speaking to check the weather, send messages, and control home appliances. By 2026, over 150 million voice assistants are projected to be found in the US alone [1].
While being popular among users, voice assistants also garner attention in business. Retail giants like Amazon and Alibaba have incorporated them into shopping functions, which enables consumers to make purchases through voice commands. This new trend—voice shopping—supports an on-the-go shopping experience and creates another touchpoint for businesses to reach buyers [2], showing a disruptive potential in e-commerce. Studies have found that smart speakers can encourage users to make more purchases [3,4] and be more loyal to sellers [5]. The voice shopping market share is predicted to rise by USD 80.21 billion from 2024 to 2029 at an increasing CAGR of 22.7% [6]. Apparently, voice shopping is becoming an important component of the online shopping experience. However, it is worth noting that over the past few years, the adoption of voice shopping has been slower than expected and even stagnated [7,8]. About 60% of online buyers made daily or weekly purchases via smart home voice assistants in 2021, but a 2024 study reported that only 24.5% of consumers conducted voice shopping every day or every week [9,10]. Although professional service providers, such as Amazon and Apple, have been racing to an overhaul to improve voice shopping [11], this “most overlooked innovation” in e-commerce still faces challenges in acceptance [12], which raises the following two research questions:
RQ1 
: Why do many consumers not use voice shopping currently?
RQ2 
: How can the voice shopping experience be improved to encourage more usage?
Despite growing interest in voice assistants, little is known about consumer behaviors in the voice shopping context. Most existing studies focus on the determinants of consumers’ purchasing behavior or final decision (e.g., [2,3,4,7,13,14,15,16,17]), contributing to the improvement of voice shopping. However, they were mostly conducted under the assumption that consumers have already accepted or regularly used voice shopping, without considering the situation where consumers have not accepted it. Suggestions from the above works cannot directly apply to voice shopping promotion. Scholars took notice and started discussing the acceptance of voice shopping. They have investigated technology characteristics, such as privacy concerns, message interactivity, and human–AI interaction fluency [7,18,19,20,21,22,23,24,25,26,27], and individual characteristics, such as innovativeness and personal attitudes [8,28]. However, to date, many technology characteristics of interest pertain to the general features of voice assistants rather than attributes specific to shopping purposes. Moreover, only a little research examines the influence of individual characteristics and considers them with technology characteristics simultaneously. In general, technology has practical purposes [29], whose acceptance in the long term is not only an issue of accepting technology per se but an issue of accepting its specific functions for certain tasks [30,31]. Voice shopping requires consumers to actively interact with the voice assistant for conducting shopping tasks, indicating that the roles of task-related, technology-related, and consumer-related characteristics should not be ignored. Therefore, to fill the research gap, this study aims to investigate the acceptance of voice shopping by simultaneously considering task, technology, and individual characteristics specific to the voice shopping context.
Fit, the concept that aligns the interrelationships among technologies, tasks, and individuals [32], can be understood as the extent to which technology features meet the requirements of the task and abilities of an individual. Studies have emphasized that AI applications, such as voice assistants, should balance the human–computer roles they play as a consumer touchpoint when serving consumers (e.g., [7,8,33]), implying that the fit among voice-assistant-supported shopping technology, online shopping tasks, and consumers is crucial. Therefore, this study leverages the concept of fit to understand and encourage the acceptance of voice shopping. In the fit area, the task-technology fit (TTF) theory and technology acceptance model (TAM) are the two best-known theories, but they have different focuses. The TTF theory posits that matching technology capabilities and task demands contributes to technology use. It is task-focused without much consideration for the users’ beliefs and attitudes [34], which lays particular stress on objective fit. The TAM proposes that users’ perceptions of the technology are essential, which can potentially present the subjective fit between users and technology [32,34,35]. To maximize their advantages, this study integrates the TTF theory (for objective fit) and the TAM (for subjective fit) to develop a research framework in which task, technology, and individual characteristics affect the two aspects of fit that determine the voice shopping intention.
This work expects to make two major contributions. First, leveraging fit to construct the research model for voice shopping acceptance will not only enhance the theoretical basis for integrating the TTF theory and TAM but also enrich the concept of fit. This work introduces fit among task, technology, and consumers into the voice shopping domain and applies it to strengthen the theoretical foundation of integrating the TTF theory and TAM. It further examines the validity of the integrated model by comparing the data analysis results of the integrated model with two individual models, which is expected to offer a robust basis for future research on technology usage in voice shopping or other AI application settings. Moreover, in the process of clarifying the influences of fit on voice shopping acceptance, this study breaks fit into objective fit and subjective fit, which provides insights into the concept of fit, as divergence still exists regarding what constitutes it. Second, focusing on the influences of important task, technology, and individual characteristics specific to the voice shopping context can yield precise and actionable suggestions for service providers to encourage technology acceptance. This study aims to address two research questions: the reasons many consumers do not use voice shopping currently and solutions to improve the voice shopping experience for wider adoption. The results can provide a practical basis for service providers and retailers to upgrade and promote voice shopping, ultimately fueling the growth of online shopping.

2. Conceptual Framework Development

2.1. Theoretical Foundation for Fit

The concept of fit has continued to receive attention in technology acceptance studies. Technology can be broadly understood as “a designed, material means to an end” [36]. That means technology has its practical purposes [29]. Accordingly, scholars considered that technology acceptance in the long term is not only an issue of accepting technology per se but also, of primary importance, an issue of accepting its specific functions [30,31], emphasizing the significance of fit.
Initially, fit was introduced to explore the interaction between task and technology [37,38]. It refers to the degree to which technology functionality satisfies the task requirements (e.g., [39]). Goodhue and Thompson’s TTF theory precisely explains this interaction [40]. The theory posits that, regardless of people’s attitude toward the technology, they will not accept it if a mismatch exists between the technology’s functionality and task demands, indicating that task characteristics and technology characteristics are important. In this theory, the degree of fit between tasks and technology is considered the determinant of the user’s performance and technology utilization (see Figure 1). As the significant role of the original task-technology fit has been proven, scholars have gradually expanded its scope to include task-individual fit and individual-technology fit (e.g., [32,35,41]). Therefore, the concept of fit was extended to the alignment of interrelationships between tasks, technology, and individuals [32,35,42]. In recent years, new terms such as environmental-technology fit and experience-technology fit have also been proposed [43,44]. However, very few studies can test all dimensions of task-technology fit due to the model complexity. Consequently, even if fit has been applied to explain the acceptance and usage of various technology tools, divergence still exists about what constitutes it [37,42].
Another approach to operationalizing fit offers insights into a better understanding of this concept—some scholars consider that the support fit is also important. Aspects such as ease of use and accessibility are often used to describe the fit [47]. Such an operation was also recognized by Goodhue et al. [41]. They introduced user evaluations of fit (e.g., ease of use) to make up for the potential shortage of task-technology fit. As a result, studies treat task-technology fit as actual and rational, but user evaluations of fit are sensitive and depend on the users [34,35,41]. Among the studies, the most typical approach is to combine the TAM and the original task-technology fit (see Figure 1), as the TTF theory is primarily task-focused, whereas the TAM focuses on users’ beliefs and attitudes toward the technology [34]. Figure 1 shows that perceived usefulness (PU) and perceived ease of use (PEOU) are the two crucial factors of the TAM. Both of them reflect an individual’s subjective evaluation of the technology for being applied to specific tasks. PU refers to the probability that a prospective user believes using the technology will help them enhance their performance, and PEOU means the extent to which a prospective user considers the technology to be effortless [45]. The results have proven that integrating the TTF theory and TAM can assist in better understanding why people use certain technologies for specific tasks (e.g., [34,48,49]).
The above indicates that fit has two main aspects: objective fit and subjective fit. The objective fit shows the extent to which technology is suitable for performing a portfolio of tasks; the subjective fit is the degree to which a user believes that technology can help them conduct the task.
Like other technologies, voice shopping also has its practical purposes—it should support online shopping tasks. In addition, when interacting with voice assistants for shopping, users can also evaluate them [22], emphasizing the importance of subjective evaluation. Thus, when applying the fit concept to understand the acceptance of voice shopping, the researchers chose to integrate the objective fit and subjective fit. The objective fit refers to the fit between online shopping tasks and voice shopping technology, which the task-technology fit in the TTF theory can measure. The subjective fit refers to the degree to which a user believes voice shopping can help them conduct online purchase activities, as reflected by perceived usefulness and perceived ease of use in the TAM in this study. According to Davis et al. [45], PU is further defined as the degree to which a person believes that using voice shopping would increase his or her performance in online shopping and PEOU as the degree to which a person expects that using voice shopping would be free of effort. Based on the TTF theory and TAM, it is reasonable to propose that the higher the task-technology fit, PEOU, and PU, the more likely individuals are to accept voice shopping. Thus
H1. 
Task-technology fit positively influences the intention to use voice shopping.
H2. 
PEOU positively influences the intention to use voice shopping.
H3. 
PU positively influences the intention to use voice shopping.
As mentioned, studies have demonstrated that the combined model of the TTF theory and TAM has a greater explanatory power for technology use than the two individual models. Some researchers also directly applied the integrated model to explain technology acceptance and discussed the relationship among task-technology fit, PEOU, and PU [50,51,52,53]. Task-technology fit is often expected to predict a user’s perception of technology utilization. For example, Goodhue and Thompson [40] claimed that task-technology fit could determine the evaluation of technology’s advantages, like usefulness. Park et al. [53] hypothesized that task-technology fit could positively influence an individual’s expectancy of performance (i.e., PU) and effort (i.e., PEOU). Similarly, if the voice shopping method fits well with one’s online shopping needs or tasks, they are more likely to perceive the technology as easy to use and useful. In addition, the TAM claims that PEOU can affect the acceptance intention directly or through PU [45] because technology that is easy to use can increase an individual’s behavior performance [54]. Based on the above, it is reasonable to infer the relationship as follows:
H4. 
Task-technology fit positively influences the PEOU of voice shopping.
H5. 
Task-technology fit positively influences the PU of voice shopping.
H6. 
PEOU positively influences the PU of voice shopping.

2.2. Voice Shopping and Related Task, Technology, and Individual Characteristics

According to the definition of fit, it is closely related to task, technology, and individual characteristics. However, when integrating the TTF theory and TAM, research usually discusses technology characteristics from an overall perspective, i.e., measuring it by one construct. In practice, however, technology often has multiple features affecting user acceptance. Analyzing their effects can help service providers promote technology usage more effectively. In addition, individual characteristics have not received much attention in the integrated model. For voice shopping, individual initiatives could be crucial, as users should speak with their devices actively. Thus, this study proceeds to clarify the important task, technology, and individual characteristics in the voice shopping context.

2.2.1. Studies Related to Voice Shopping Acceptance

Voice shopping is one of the novel trends in retailing. It enables individuals to conduct online shopping simply by speaking to voice assistants embedded in smartphones or smart speakers. This new shopping method is promising for improving the online shopping experience, thereby benefiting both retailers and consumers. However, compared with other voice-activated functions, such as texting messages and asking for the weather, voice shopping is still in its initial stage [8].
As mentioned, in the voice shopping setting, antecedents of a consumer’s buying behavior or decision have attracted much attention [2,3,4,7,13,14,15,16,17]. Research on the determinants of voice shopping acceptance is still relatively scarce [19]. Early studies often asked consumers about their perceptions of voice-activated devices and then categorized reasons into benefits and costs [22,23,24,25,26]. To date, the technology characteristics of voice assistants and individual characteristics of consumers have become the focus of published literature (see Table 1).
Table 1 reveals the following: (1) the existing studies paid more attention to the technology characteristics of voice assistants. Different from initial research, the current studies mainly examine either the positive factors [19,21,27] or the negative ones [7,18,20], such as interaction advantages and privacy risks. Most of them belong to the general characteristics of voice assistants rather than attributes specific to shopping purposes. However, technology often has practical purposes [29]. Its acceptance is not merely an issue of accepting technology per se but an issue of accepting its specific functions [30,31]. Thus, it is necessary to delve into the characteristics of voice technology for shopping purposes. (2) The influences of personal factors need further investigation. Only a few scholars have discussed the role of personal factors in voice shopping acceptance [8,28]. Moreover, little research considers the impact of both technology and individual characteristics in this setting simultaneously. Voice shopping requires consumers to use their voice to actively interact with the voice assistant for conducting shopping tasks, indicating the significant roles of shopping tasks, technology, and consumer-related factors. Therefore, this study focuses on the task, technology, and individual characteristics specific to the voice shopping context and examines how these factors affect consumers’ acceptance of voice shopping.

2.2.2. Task Characteristics

Tasks refer to the actions conducted by individuals in turning inputs into outputs to satisfy their needs [40]. It is a piece of work that people perform through a series of actions to achieve a goal [61]. Thus, the task characteristics can be regarded as requirements, perceived needs, or dimensions related to those that a user might apply technology to perform [49,61,62]. This study defines the task as the activities in which individuals engage during online shopping. Accordingly, the task characteristics are an individual’s online shopping requirements. With the development of online shopping, recent studies have found that online shoppers increasingly face trends such as choice and information overload [63,64] and multiple tasks [65,66]. As a result, they inevitably expect an adaptable, immediately accessible, and custom “do it for me” shopping experience [18,66].
The TTF theory hypothesizes that task characteristics influence the task-technology fit because the corresponding requirements encourage users to be more dependent on the technology [40]. Several studies have provided evidence for this relationship [52,61,67]. Regarding PU and PEOU, some work proposes that task characteristics can influence PU but not PEOU. As Lee et al. [21] claimed, examining the direct relationship between task characteristics and PEOU may not be plausible because PEOU is contingent on whether the target system is easy to use or not. Similar to voice shopping, it can satisfy users’ needs when it is inconvenient to type for shopping. That helps users perform online shopping tasks anytime and anywhere, making them perceive that voice shopping is useful. However, it cannot change the operation process of voice shopping. Thus, this study posits the following:
H7a. 
Task characteristics positively influence task-technology fit.
H7b. 
Task characteristics positively influence the PU of voice shopping.

2.2.3. Technology Characteristics

Technology characteristics refer to the attributes of tools individuals use in performing tasks [68]. According to the TAM and TTF theory, technology attributes can influence its acceptance through task-technology fit, PEOU, and PU [34,52,69]. In practice, technology has both positive and negative attributes. This work focuses on the positive aspects of voice assistants for shopping. There are two main reasons. First, recent studies have disclosed that positive attributes or feelings toward voice shopping are taking precedence over the negative ones (e.g., [8,19,70]). Second, the primary purpose is to uplift the voice shopping experience through possible technologies or strategies. Therefore, this study continues to extract the positive features of voice shopping compared with other popular shopping methods.
Voice technology has tremendously changed how people interact with information sources, altering the way they purchase products. Before the Internet era, people were accustomed to searching for products and buying them in an offline context. They usually walked to the physical store, checked products with a sales clerk in most cases, and paid. It may provide more accurate product information, but this shopping method is usually inconvenient and offers limited choice. With the popularity of the Internet, people can navigate the sea of information. They start to search for products on the computer by typing on the keyboard. In this period, people have more choices, but they still need to sit in front of computers. As mobile technology has developed, users have gradually obtained timely access to product information, but typing on mobile phones cannot separate them from their devices physically, and they become more used to being “silent” with their phones. With the arrival of voice technology, consumers have found speech-based communication to be more natural and effective [24,71]. Voice shopping enables users to obtain information by simply asking their voice assistants in spoken language anytime and anywhere [72,73]. Based on the above, this study identifies three prominent characteristics of voice shopping: convenience (one-step search and possible purchase), accessibility (the ability to enable consumers to obtain shopping information anywhere and anytime), and context awareness (the ability to understand what a user really wants in spoken language, including meaning and intent). The following will elaborate on them and their possible influence on two aspects of fit.
  • Convenience
Despite much attention to convenience in various areas, studies have not reached a consensus on its definition and dimensions [74,75]. Since time and effort utilities have mainly been discussed [74], this study defines convenience as the extent to which voice shopping could provide consumers with time and effort utility. It is the most critical characteristic of voice shopping [19].
The convenience of information technology tools can impact task-technology fit and users’ perception of ease of use and usefulness [69,76]. Similarly, voice shopping enables users to perform shopping tasks by speaking to their voice assistants without steps like clicking or filling out forms [73,77]. That means voice technology automatically realizes the one-step search for shopping information. Additionally, spoken language is the most natural approach for humans [24]. Compared with typing on a computer or smartphone, voice shopping is often easier to conduct [18] because it does not require additional education, effort, and skill related to typing [77,78]. As a result, voice shopping can save users time and effort [16], leading to perceived ease of use and usefulness. Related advantages can also match the shopping needs of current consumers who seek to quickly and easily find what they want in the era of choice and information overload [64]. Thus, this study hypothesizes that
H8a-1. 
Convenience positively influences task-technology fit.
H8a-2. 
Convenience positively influences the PEOU of voice shopping.
H8a-3. 
Convenience positively influences the PU of voice shopping.
  • Accessibility
Accessibility can be briefly understood as the ability to access a resource, such as an entity, information, or service. For voice shopping, this study considers accessibility as the ability to allow users to reach shopping information and services anytime, anywhere, and in any situation [18]. As mentioned before, previous shopping methods, including offline shopping, online shopping on computers, and mobile shopping, have limitations in time and/or space. On the contrary, voice technology can physically separate users from their devices. Voice control and the innovation of far-field voice capture technology can enable users to initiate the shopping journey without many limitations. That means users can remain reachable and engaged in shopping activities at all times.
Despite limited direct evidence for the effect of accessibility on task-technology fit, PEOU, and PU (e.g., [79,80]), factors with a similar concept, such as reachability [69] and portability [81], have shown positive influences on these three constructs. By prioritizing accessibility, voice shopping can meet the increasing demand for reaching and interacting with shopping systems across time, space, and contexts, especially among those who often have to or prefer to handle multiple tasks [66,82]. In addition, it ensures equal access to online shopping for diverse consumer groups [83]. Thus, this study proposes the following:
H8b-1. 
Accessibility positively influences task-technology fit.
H8b-2. 
Accessibility positively influences the PEOU of voice shopping.
H8b-3. 
Accessibility positively influences the PU of voice shopping.
  • Context awareness
Context means the information that can be applied to characterize an entity’s situation [84]. A system is context-aware when it can provide information and services, automatically execute these services, and tag information for later retrieval based on context [85]. A context-aware information technology device or application would improve its ease of use, usefulness, and efficiency rating (e.g., [85,86]).
Voice assistants leverage conversational AI to communicate with users. AI refers to the study of agents that receive percepts from the environment and perform actions [87]. Thus, high context awareness can be a distinctive feature and the future of voice assistants for shopping [88]. With this sophisticated feature, voice assistants can grasp what a user says, including his or her location, mood, and even intent, and respond to commands based on different situations. Such a deep understanding of conversation could guarantee a high-quality shopping process, which enables a personalized experience for consumers [18,89]. Accordingly, it is reasonable to assume that context awareness can influence the task-technology fit, PEOU, and PU.
H8c-1. 
Context awareness positively influences task-technology fit.
H8c-2. 
Context awareness positively influences the PEOU of voice shopping.
H8c-3. 
Context awareness positively influences the PU of voice shopping.

2.2.4. Individual Characteristics

Individual characteristics are essential variables in studies on acceptance behavior. Factors like personal innovativeness, self-efficacy, and experience have received considerable attention (e.g., [50,80,90,91,92]). Voice technology shifts the way people engage in online shopping—they need to open their mouths [71]. Therefore, it is reasonable to assume that individual initiative can affect their acceptance of this new shopping method. That could also explain why studies in this area start from the motivation or attitude of users. Among the characteristics above, personal innovativeness is closely related to one’s initiative [92], a trait that also attracts interest in the adoption of voice assistants [90]. In addition, an increasing number of people use human metaphors to describe their voice assistants. Human-like cues give them a sense of social presence through conversation [71], rendering the human–voice assistant interaction a social interaction [93]. In particular, when it comes to shopping, users are often expected to initiate the interaction and express their needs. However, people still feel creepy and socially awkward about voice shopping, revealing that they are not fully comfortable asking voice assistants to perform this task [8]. That is, people are still embarrassed about talking to a machine when they are on their own, not to mention when they are in public. According to the American Psychological Association, feeling creepy or awkward during social interaction could be more due to the trait of shyness. This personality trait can easily cause people to talk less [94] and limit their initiatives [95], which deviates from the basic requirements of voice shopping. As a result, this study focuses on two individual characteristics, personal innovativeness in information technology and shyness.
  • Shyness
Shyness refers to an individual’s tendency to avoid social interactions and fail to participate appropriately in social situations [96]. It reflects the actual frequency of an individual talking, which represents a behavioral pattern [94]. Scholars imagined that compared to face-to-face, shy people might prefer communicating in online settings [97,98]. Compared with less shy individuals, however, shy people are more inclined to avoid social behaviors like interaction [99,100,101], are more afraid of negative evaluation, and are more likely to poorly self-evaluate and show deficiencies in self-presentational skills [97,102,103]. As a result, shy people tend to talk less [94]. However, voice shopping requires users to actively speak with devices. Due to the fear of talking in others’ presence and/or being negatively evaluated [96,97], shy users may consider typing to purchase more suitable and easier. Thus, this work proposes the following:
H9a-1. 
Shyness negatively influences task-technology fit.
H9a-2. 
Shyness negatively influences the PEOU of voice shopping.
H9a-3. 
Shyness negatively influences the PU of voice shopping.
  • Personal innovativeness in information technology (PIIT)
Personal innovativeness, the degree to which people accept an innovation before others in a social system [104], is an essential determinant of acceptance behaviors [90]. An individual can be regarded as innovative if he or she is early to adopt an innovation [105]. With the development of information technology, researchers specified personal innovativeness in information technology and defined it as an individual’s willingness to experience new information technology (e.g., [105]). PIIT is usually applied in technology acceptance research (e.g., [50,90,92]). Innovative people are open to change and expect to realize an innovation’s potential [91] because they tend to be curious about new things, active in searching for innovation-related information, willing to take risks, and communicative [69,106]. Thus, they are more likely to feel that it is not hard to learn about a new thing [107]. A positive relationship between PIIT and PEOU can exist for these users; however, PIIT may not directly influence the utilities of voice shopping [107], especially the objective fit between tasks and technology. Thus, the study proposes that
H9b. 
PIIT positively influences the PEOU of voice shopping.
Based on the above, this study finally constructs the research model in Figure 2. To reveal the possible influences of task, technology, and individual characteristics on voice shopping acceptance, this model introduces and clarifies the concept of fit. In this process, the TTF theory and TAM are integrated to explain the two aspects of fit. That is, task-technology fit from the TTF theory can indicate the objective fit that the available functionality of voice shopping technology is suitable for online shopping tasks; perceived ease of use and perceived usefulness present the subjective fit that consumers believe the voice shopping technology can help them purchase online. Meanwhile, compared to Figure 1, the research model forges links between the two aspects of fit. It proposes that the objective fit can trigger subjective fit. In addition, the research model also makes theoretical improvements in two aspects. First, the current work classifies the technology characteristics of voice shopping into convenience, accessibility, and context awareness rather than a single construct. Second, the model adds individual characteristics that may influence technology acceptance. As mentioned, the two kinds of characteristics have received attention in both the TTF theory and TAM. However, their contributions to the integrated model remain unclear. Understanding their impact on subjective and objective fit in the voice shopping setting enables more precise marketing implications. In summary, the current research model can amplify the benefits of integrating the TTF theory and TAM.

3. Research Methods

3.1. Data Collection

This work aims to understand why consumers do not accept voice shopping and how to promote voice shopping. Therefore, the consumers who do not accept voice shopping are the research target, including those who have once experienced voice shopping but no longer use it and those who have not experienced voice shopping. To test the research model, the researchers conducted an anonymous questionnaire survey among them. For consumers who have tried voice shopping, the selection criterion was set as “has once experienced voice shopping, but has not used it in the past six months”. Since there is no need to explain too much about voice shopping, a brief explanatory text was attached to the front of the questionnaire. The potential respondents were requested to read the following: “Speech recognition technology assists us in speaking with devices. A representative example is talking with voice assistants, like Apple’s Siri, Xiaomi’s Xiaoai, and Alibaba’s Tmall Genie. Instead of typing on the keyboard or screen, we can directly ask our voice assistants to search for the weather, phone number, and time. What is more, voice assistants can help us make purchases online. Please recollect the experience when asking voice assistants to open a shopping app, check product information, and/or pay for the product, and answer the following questions according to your real situation.” The questionnaire was posted on the online survey platform Wenjuanxing, a professional survey website. It was distributed through social networking sites and online shopping enthusiasts’ chat groups. For the consumers who had never experienced voice shopping, it was necessary to give them an explanation so they could understand the voice-shopping-related items in the questionnaire. Therefore, the researchers chose to distribute the questionnaire offline. The busiest shopping mall was the main location where the survey was conducted. After ensuring that the voluntary respondent had not experienced voice shopping, the researchers showed him or her a 30 s video about voice shopping in a neutral manner and encouraged him or her to try it. After that, they were requested to answer the questionnaire. Invalid responses with short response times and responses that were the same answer to all questions were removed. There were 425 valid responses, including 204 responses from consumers who have once experienced voice shopping and 221 responses from those who have not. To ensure that the data from the two groups could be combined, this study conducted a chi-square test for homogeneity. The results showed that the two groups have the same underlying characteristics in gender (Pearson chi-square = 0, Sig. = 0.229), age (Pearson chi-square = 6.582, Sig. = 0.160), occupation (Pearson chi-square = 2.414, Sig. = 0.660), and education (Pearson chi-square = 3.210, Sig. = 0.523). Finally, Table 2 summarizes the demographic statistics. The demographic characteristics of the sample (e.g., age, gender, and frequency of using voice search) are similar to those of the population of interest. In addition, the number of responses exceeds general benchmarks for Structural Equation Modeling (e.g., [108]), indicating the sample is adequate and suitable for the research setting.

3.2. Instruments and Measures

The measurement items were selected from previous research and revised to fit the current voice shopping context. All items were measured with a five-point Likert scale. Items and sources are shown in Appendix Table A1 (Appendix A). To increase the validity and reliability, the researchers invited three professors specializing in information technology to check and revise the questionnaire. A pilot test was conducted with 30 undergraduates who have experienced voice shopping to help slightly modify the instruments.
Finally, items for ten constructs were obtained. The construct of task characteristics was measured by four items (TC1-4). For technology characteristics, both convenience (CO1-3) and accessibility (AC1-3) were tested by three items, and context awareness was tested by four items (CA1-4). For individual characteristics, both shyness (SH1-4) and personal innovativeness in information technology (PIIT1-4) were measured by four items. While task-technology fit (TTF1-3) was explained by three items, perceived ease of use (PEOU1-4), perceived usefulness (PU1-4), and voice shopping intention (VSI1-4) were tested by four items. Appendix Table A1 (Appendix A) includes the items.

4. Results

4.1. Measurement Model

The CFA was conducted in AMOS 18.0 to validate the measurement model. According to the criteria of factor loadings and modification indices (MI) [52,109,110,111], the measurement model was refined. Ten items were dropped because their standardized factor loadings were lower than 0.6. To ensure adequate fit with the ratio of chi-square to d.f., this study further checked MI to examine if the overall model fit can be improved. The final model encompassed 26 items (see Appendix Table A2 (Appendix A)). The values, including the ratio of chi-square (767.860) to d.f. (254) at 3.023, GFI at 0.879, CFI at 0.941, IFI at 0.942, TLI at 0.925, RMR at 0.028, and RMSEA at 0.069 indicate a good fit statistics for the measurement model.
According to the criteria of convergent validity, reliability, and discriminant validity [112,113,114], the validity and reliability of all measures are acceptable in this study. First, the Appendix shows that all factor loadings exceed 0.6 at a significant level, revealing strong convergent validity. Second, reliability was tested by composite reliability (CR) and average variance extracted (AVE). According to Table 3, the CRs of all constructs exceed the threshold of 0.6, and all AVEs are more than 0.5. The reliability can be accepted. Finally, the discriminant validity was measured by comparing correlations between constructs with the square root of AVE for each construct. Table 3 shows that the square root of AVE for each construct (on the diagonal) is greater than its correlation coefficients with other constructs. Thus, it is possible to argue that a reasonable extent of discriminant validity is examined.

4.2. Structural Model

AMOS 18.0 was used to assess the structural model. The fit indices (χ2/d.f. = 3.104, p < 0.001, GFI = 0.873, CFI = 0.937, IFI = 0.937, TLI = 0.922, RMR = 0.034, and RMSEA = 0.07) suggest a good model fit. Thus, this study could further examine the path coefficients of the structural model. Table 4 summarizes the final results.
For task, technology, and individual characteristics, the influences of accessibility (β = 0.458, p < 0.001), context awareness (β = 0.132, p < 0.05), and shyness (β = −0.248, p < 0.001) on task-technology fit, influences of accessibility (β = 0.313, p < 0.001), context awareness (β = 0.175, p < 0.01), and shyness (β = −0.239, p < 0.001) on PEOU, and influences of accessibility (β = 0.254, p < 0.001) and context awareness (β = 0.117, p < 0.05) on PU are significant; H8b-1, H8c-1, H9a-1, H8b-2, H8c-2, H9a-2, H8b-3, and H8c-3 are supported. The coefficients indicate that accessibility is worthy of much attention. While context awareness plays an important role in fit, the individual characteristic shyness exhibits a stronger negative impact, causing consumers to hesitate in voice shopping usage. The remaining paths are not significant, indicating that H7a, H7b, H8a-1, H8a-2, H8a-3, H9a-3, and H9b are rejected. Among them, there are two most surprising results. First, task characteristics cannot affect task-technology fit and perceived usefulness, indicating that consumers currently may not believe voice shopping is a good choice for conducting shopping tasks. Second, PIIT cannot significantly affect PEOU. The possible reason for this might lie in the operational complexity and defects of the early-stage voice shopping. Both of them can offer insights into reasons consumers do not use voice shopping and how to improve the corresponding experience, which will be further investigated in the Discussion section.
For the three constructs measuring objective and subjective fit, all of them—task-technology fit (β = 0.316, p < 0.001), PEOU (β = 0.159, p < 0.05), and PU (β = 0.445, p < 0.001)—positively influence the intention to use voice shopping. Thus, H1, H2, and H3 are supported. The above results reveal that the subjective fit between technology and individuals plays a more critical role in voice shopping acceptance, emphasizing the importance of a nuanced experience. In addition, the influences of task-technology fit on PEOU (β = 0.353, p < 0.001), task-technology fit on PU (β = 0.163, p < 0.05), and PEOU on PU (β = 0.37, p < 0.001) are proven, supporting H4, H5, and H6.
Moreover, the explained variances (R2) of task-technology fit, PEOU, PU, and voice shopping intention are 0.628, 0.633, 0.645, and 0.695, respectively. The integrated model can strongly explain user acceptance of voice shopping.
To verify whether uniting the TTF theory and TAM can offer a better understanding of voice shopping intention than the individual model alone, the researchers further summarized the results from the evaluation of the TTF theory and TAM, which are shown in Table 5. The analysis shows that for the TTF theory, the explained variance (R2) of voice shopping intention is 0.596; for TAM, the explained variance (R2) of voice shopping intention is 0.666. Thus, the integrated model provides a better result in the voice shopping setting.
The 425 respondents consist of consumers who have once experienced voice shopping (n = 204) and those who have not (n = 221). Studies on technology acceptance have found that individuals may pay attention to different factors over time with increasing experience [115]. To further test whether there is a difference in the path coefficients between the two groups, this study conducted a multi-group analysis and a t-test (see Table 6). Distinctions do exist between the two groups based on the path coefficients, but only the influence of convenience on PEOU showed a statistically significant difference.

5. Discussion and Implications

5.1. Discussion of Findings

To improve the online shopping experience, retailers continue to leverage emerging and promising technologies, including voice technology. However, a high percentage of consumers still rarely use voice shopping to date. This study applies the concept of fit among task, technology, and individual characteristics to figure out this phenomenon and help encourage the acceptance of voice shopping. It puts forward that the voice shopping tool should achieve both an objective fit with online shopping tasks and a subjective fit with users. Based on this, this study ultimately integrates the TTF theory and TAM to establish a research model for the acceptance of voice shopping. The data analysis results provide the following insights into the two research questions proposed in the introduction.
For RQ1: Why do many consumers not use voice shopping currently? The results offer clues from the following three aspects.
First, task characteristics cannot significantly influence task-technology fit and usefulness, suggesting that average consumers still do not believe that voice shopping can meet their shopping needs. Task characteristics are the requirements that people want to or have to perform in online shopping [61]. As summarized, an increasing number of consumers expect adaptable, immediately accessible, and customized shopping experiences. Voice shopping is expected to support corresponding requirements since it enables hands-free search, on-the-go purchases, and spoken language understanding. However, unlike many findings [52,67,116], results suggest that task characteristics do not lead to either the task-technology fit or the perceived usefulness. It indicates that people do not consider voice shopping a viable option for meeting their online shopping requirements.
Second, the convenience of voice shopping fails to persuade consumers. Generally, technology that brings convenience can positively influence its use (e.g., [69]). However, this study shows that convenience, one of the most important characteristics of voice shopping [19], has no effect on task-technology fit, PEOU, and PU. Convenience shows the degree to which voice shopping can provide time and effort utility, primarily by enabling purchases without typing. The conversational interface makes it easier to use, which is commonly recognized as the most important advantage of voice technology [117]. However, current consumers may have been used to typing for their online purchases. Compared with waiting for the speech-to-text process and potential interruptions caused by technological limitations, they easily feel it faster and more comfortable to open a shopping app and search with their hands. Furthermore, when conducting voice shopping, consumers often cannot directly view product pictures, as text and links are frequently used to display product information. The above causes less fit between online shopping tasks and voice shopping, as well as perceived ease of use and perceived usefulness. Therefore, service providers need to create a close match-up between shopping scenarios and voice shopping. To help users adapt to voice shopping, they may start with the products that require less browsing, such as daily necessities, food, and electronics. For other products, they can enrich search results, like adding pictures.
Third, important personal factors cannot play positive roles in encouraging user acceptance. A notable finding is that PIIT cannot influence PEOU, which is inconsistent with many studies (e.g., [50,61]). A possible reason is that consumers with highly innovative characteristics are generally open to voice shopping and are active in trying it. However, voice shopping is more complex than other voice commands, such as navigation. Moreover, early-stage voice shopping still needs further improvement. As a result, consumers must make efforts to have a better understanding of it, hindering PIIT from having a positive influence on ease of use. The result suggests that service providers should fully consider the feedback from early users. They can invite consumers to participate in voice shopping testing and optimization. Another remarkable finding is related to shyness. Despite no effect on PU, shyness negatively influences task-technology fit and PEOU, delaying user acceptance. Thus, helping people overcome embarrassment calls for extra attention. As a personality characteristic [94], shyness is hard to change directly and rapidly. Service providers may leverage external effects, such as others’ adoptions, to attract shy consumers. For example, they can attach unique tags like “coupons for voice shopping” to the retailer and “voice shopping user” to the consumers. In addition, not only shy individuals but many people can feel embarrassed when talking to those they are not familiar with. According to Rhee and Choi [17], the voice assistant’s social role as a friend can positively influence a consumer’s attitude toward the recommended product. Thus, service providers can emphasize the voice assistant’s role as a private friend. For example, users can be allowed to design personalities or shopping styles for their voice assistants.
For RQ2: How can the voice shopping experience be improved to encourage more usage? The rest of the results provide insights from the following three aspects.
First, service providers of voice assistants and retailers should establish a cooperation mechanism. The analysis reveals that task-technology fit, PEOU, and PU positively affect voice shopping intention, with PU exerting a greater effect than the other two factors. That means both objective fit between tasks and technology and subjective fit between individual and technology matter; the final acceptance of voice shopping lies more in the subjective perception of consumers. Therefore, service providers should create a more nuanced experience. Generally, consumers expect basics such as finding what they want quickly and buying it at a fair price. Voice assistants then must understand their users. To do so, service providers and retailers need to cooperate regarding the sharing of customer data. In addition, to date, many voice assistants cannot support a seamless and smooth shopping trip [17,118], especially those on mobile phones. Service providers of voice assistants and retailers should collaborate to overcome this limitation. In particular, when they are not the same entity, they need to carefully negotiate matters, such as on whose platform the products can be browsed.
Second, technical improvement requires attention. Significant relationships between technology characteristics and fit can offer clues for voice shopping upgrades. The results show that both accessibility and context awareness are catalysts for task-technology fit, PEOU, and PU. Accessibility refers to the ability that voice shopping allows consumers to reach shopping information and services anytime, anywhere, and in any situation. Thus, keeping users connected to their voice assistants is important. Service providers can enhance the ease of activation, expand the voice recognition range, and improve listening skills in more complex work environments, such as driving environments and noisy settings. Context awareness refers to the ability to understand what users say, including environmental factors, mood, and even intent. High context awareness often generates a personalized shopping process, which stimulates the use of voice shopping. Thus, voice assistants should learn to obtain more details from their users and enhance comprehension ability. To achieve this, service providers must develop the skills of voice assistants to handle longer sentences, continuous conversations, and emotion recognition.
Third, it is necessary to segment consumers based on the experience of voice shopping. Data analysis presents differences between online consumers who have once experienced voice shopping and those who have not. Service providers and retailers should give special attention to the convenience feature. It positively influences task-technology fit but hinders the PEOU of consumers who have once experienced voice shopping. This result highlights the importance of encouraging consumers to use voice shopping more frequently and helping them adapt to this new shopping method again. There is no statistically significant difference in other relationships between the two groups. However, it is still worth noting the positive influences of task characteristics on task-technology fit and PU, as well as the influence of PEOU on voice shopping intention among consumers who have not experienced voice shopping. The impact of task characteristics suggests that service providers need to attract those consumers by evoking their demand for an adaptable, immediately accessible, and customized shopping experience. The impact of PEOU stresses that service providers need to provide facilitating conditions to make voice shopping easy to operate. For example, a guided interaction, such as “You may speak like…”, could be more workable than a direct “Sorry, I do not understand.”

5.2. Theoretical and Practical Implications

5.2.1. Theoretical Implications

This work provides theoretical implications mainly from two aspects. First, it investigates the acceptance of voice shopping by considering task, technology, and individual characteristics simultaneously, which fills the research gap in this domain. As mentioned above, studies on the acceptance of voice shopping are still scarce. Although they emphasized the importance of technology and individual characteristics, most technology characteristics are general characteristics of voice assistants rather than those for shopping purposes, and little is known about the influences of individual characteristics. Under these circumstances, the current study elaborates on task, technology, and individual characteristics and investigates their roles in voice shopping acceptance. The results enrich the understanding of the factors influencing voice shopping acceptance. In particular, the surprising findings, such as the insignificant effects of task characteristics, convenience, and PIIT, challenge previous research. Future studies can explore whether there are hidden moderating variables. While carefully examining the impacts of technology characteristics and individual characteristics, this study also updates the integrated model of the TTF theory and TAM. Moreover, the data analysis result provides evidence for the robustness of the integrated model in the voice shopping setting.
Second, this study categorizes fit into objective and subjective fit and extends this concept into the voice shopping context. Fit is a critical basis for understanding technology acceptance, but divergence still exists about what constitutes fit [42]. The current study proposes that fit can be understood from two main aspects: objective fit between task and technology and subjective fit between individual and technology. To measure them, this work leverages task-technology fit, PEOU, and PU. It provides a fresh theoretical perspective not only for understanding fit but also for integrating the TTF theory and TAM to discover technology acceptance. The results indicate that one of the subjective fit aspects—PU—plays a more critical role in voice shopping acceptance. This further highlights the necessity of more nuanced investigations into consumers’ individual characteristics in this setting. In addition, fit so far has been mainly used in information technology, not digital technologies [37] or AI applications. This study brings it into the voice shopping area, paving the way for further investigation on adoption and continuous use behavior in emerging technology fields. For example, scholars can apply the fit concept and the research model to explore human–AI collaboration.

5.2.2. Practical Implications

The practical implications of this study can be understood from two streams. First, this study provides insights into why consumers still use voice shopping less. Data analysis infers three reasons: (a) average consumers still do not consider that their online shopping needs can be satisfied by voice shopping. (b) The convenience of voice shopping fails to persuade consumers. (c) Important personal initiatives do not play positive roles in user acceptance. Accordingly, service providers and retailers should make voice technology more suitable for online shopping tasks. In addition, they must make efforts to encourage consumers to try voice shopping and take advantage of the power of early adopters. When promoting emerging technologies or tools, marketers should also pay attention to such issues, as consumers do not always behave as expected in terms of technology use. It is important to figure out why they do not accept the technology or tool in the early stage.
Second, suggestions are given for increasing the acceptance of voice shopping. The results highlight that retailers and service providers should attach importance to the following three aspects: (a) the cooperation mechanism between the two parties. Updating the voice shopping experience is essential to persuade consumers that speaking is a suitable and useful method for online shopping. Thus, service providers and retailers must collaborate to understand their users better and overcome the current limitations of voice shopping. To achieve this, they must consider some issues seriously, such as data sharing and the platform for displaying information search results due to incompatibilities. (b) The next aspect is technical improvements. Voice shopping is still in the early stages, and its performance requires further improvement. Relationships between technology characteristics and task-technology fit, PEOU, and PU give clues. For example, to enhance the accessibility of voice assistants, service providers must expand their voice recognition range and improve listening skills in more complex work environments. (c) Last is customer segmentation based on the experience of voice shopping. This study reveals that there are differences between online consumers who have experienced voice shopping and those who have not. Service providers and retailers can draw those consumers via distinct strategies. For example, they should first evoke the demand for adaptable, immediately accessible, and customized shopping experiences among consumers who have not experienced voice shopping.

6. Future Research Directions

This study has leveraged fit among tasks, technology, and individuals to understand consumers’ intention to use voice shopping. It proposes a research model that integrates the TTF theory for objective fit and the TAM for subjective fit, where task, technology, and individual characteristics lead to the two aspects of fit, which ultimately influence voice shopping intention. Based on the results, this study explains why many consumers do not accept voice shopping and offers practical suggestions to improve the voice shopping experience to encourage more usage. Overall, the findings from this work enhance the understanding of voice shopping acceptance.
This study has certain limitations. First, as a relatively novel study on voice shopping acceptance, this study focuses more on the various determinants and their influences but pays limited attention to the mediating effects of subjective fit and objective fit. A future study can collect more survey data or real data from consumers in all walks of life to examine their influences, as well as verify and improve the current model. In addition, the technology and individual characteristics may need further investigation. For example, voice assistants exhibit human-like cues that might make consumers consider them as shopping companions. The current study only applies context awareness to represent this aspect, which could be enriched with other human-like characteristics. Furthermore, the results suggest that context awareness cannot significantly affect the task-technology fit and PU of consumers who have once experienced voice shopping, which may be due to the privacy risks that context awareness poses. Therefore, although studies have disclosed that positive attributes or feelings toward voice shopping are taking precedence over negative ones (e.g., [8,19,70]), important negative technology characteristics, like privacy invasion, may need further investigation.
Second, this study focuses on the task, technology, and individual characteristics without much consideration of possible environmental features. The environment-technology fit has not received much attention [44], which may provide a potential perspective for improving the current integrated model. For example, this study finds that shyness hinders voice shopping intention, indicating that the role of social influence deserves attention. Future work can introduce environmental factors as moderators.
Third, some demographic and contextual variables are worth noting. For example, (a) cultural differences might matter. Previous studies on technology acceptance also took cultural differences into account. Taking Pavlou and Chai [119] as an example, China is high on collectivism and power distance, while the U.S. is relatively low on collectivism and power distance, which may vary determinants of their voice shopping acceptance. (b) The product category and shopping scenarios can also be important. Different perspectives have been applied to discuss categories like fashion [28] and grocery [20]. The researchers aim to investigate whether there is heterogeneity in determinants across various product categories and shopping scenarios.

Author Contributions

Conceptualization, S.B. and L.W.; methodology, L.W.; software, L.W.; validation, S.B.; data curation, S.B. and L.W.; writing—original draft preparation, L.W.; writing—review and editing, S.B. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Shandong Higher Education “Youth Innovation Team Plan” (No. 2023RW039); Shandong Philosophy and Social Sciences Youth Talent Team (No. 2024-QNRC-40); and QAU Special Support Program for Talents.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data is contained within the article.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A

Table A1. Items used in this study.
Table A1. Items used in this study.
FactorItemsWordingSource
Task characteristicsTC1I want to avoid some messy operations when typing for shopping.Self-developed
TC2I want to find the product more quickly.
TC3I want to check shopping information when I want.
TC4I want to find the product without typing by myself.
ConvenienceCO1Voice shopping frees our hands.[120]
CO2Voice shopping helps purchase online more quickly.
CO3Voice search helps purchase online more easily (e.g., reducing mistyping)
AccessibilityAC1When it is needed, voice shopping can be easily activated and run.[69]
AC2Voice shopping helps to purchase online anywhere (e.g., when devices like my phone are not in my hands).
AC3Voice shopping helps to purchase online anytime.
Context awarenessCA1When conducting voice shopping, the device can understand what I want with simple formal words.Self-developed
CA2When conducting voice shopping, the device can understand what I want when I speak naturally.
CA3When conducting voice shopping, the device can recognize what I said literally.
CA4When conducting voice shopping, the device can continuously understand what I mean in the context.
ShynessSH1I feel tense when I’m with people I don’t know well.[121]
SH2I do not find it difficult to ask other people for information.
SH3I do not feel uncomfortable at parties and other social functions.
SH4I do not find it hard to talk to strangers.
Personal innovativeness in information technologyPIIT1If I heard about a new information technology, I would look for ways to experiment with it.[92]
PIIT2Among my peers, I am usually the first to try out new information technology.
PIIT3In general, I am active in trying out new information technology.
PIIT4I like to experiment with new information technology.
Task-technology fit TTF1The voice shopping method is enough to help people purchase online.[122]
TTF2The voice shopping method is appropriate to help people purchase online.
TTF3In general, voice shopping method can meet needs for purchasing online.
Perceived ease of usePEOU1I think the voice shopping method is easy to learn.[52]
PEOU2I think it is easy to employ the voice shopping method in practice.
PEOU3It would be easy to conduct voice shopping expertly.
PEOU4Overall, I think it is easy to conduct voice shopping.
Perceived usefulnessPU1I believe that the voice shopping method can help me easily find the products that interest me.[34,52]
PU2I believe that voice shopping improves online purchase performance of mine.
PU3I believe that voice shopping enhances online purchase effectiveness of mine.
PU4Overall, I believe that voice shopping is useful for my online purchase.
Voice shopping intentionVSI1Given the opportunity, I will use voice shopping.[123]
VSI2I expect to use voice shopping in the near future.
VSI3I will frequently use voice shopping in the near future.[76]
VSI4I will recommend others to use voice shopping.
Table A2. Confirmatory factory analysis.
Table A2. Confirmatory factory analysis.
FactorItemsFactor LoadingsCRAVE
TotalExperienceNo ExperienceExperience/
No Experience
Experience/
No Experience
Task characteristicsTC20.7190.7410.6630.653/0.7410.486/0.593
TC40.7140.6500.864
ConvenienceCO10.8170.8380.7970.878/0.8420.783/0.728
CO20.9210.9290.906
AccessibilityAC10.8390.8100.8590.865/0.9270.681/0.81
AC20.8470.7850.901
AC30.9070.8780.938
Context awarenessCA10.8090.7900.8160.776/0.8680.635/0.768
CA20.8630.8030.933
ShynessSH20.7580.7800.7070.732/0.6790.577/0.514
SH40.7240.7390.727
Personal innovativeness in information technologyPIIT10.8690.8550.8820.876/0.8750.701/0.7
PIIT20.8250.8270.826
PIIT40.8170.8300.8
Task-technology fit TTF10.9120.8870.9290.865/0.9070.762/0.83
TTF20.8750.8590.893
Perceived ease of usePEOU10.7660.7950.740.925/0.920.756/0.744
PEOU20.8930.8730.908
PEOU30.8790.8920.861
PEOU40.9220.9140.929
Perceived usefulnessPU20.8790.8710.8840.908/0.9410.767/0.841
PU30.9170.8840.941
PU40.8990.8720.926
Voice shopping intentionVSI20.9180.8990.9350.911/0.9450.774/0.851
VSI30.9040.8880.91
VSI40.890.8510.922
Note: CR and AVE values of total responses are not present in this table since they have been mentioned before.

References

  1. Statista. Virtual Assistant Technology—Statistics & Facts. Available online: https://www.statista.com/topics/5572/virtual-assistants/#topicOverview (accessed on 5 December 2024).
  2. Mari, A.; Mandelli, A.; Algesheimer, R. Empathic voice assistants: Enhancing consumer responses in voice commerce. J. Bus. Res. 2024, 175, 114566. [Google Scholar] [CrossRef]
  3. Son, Y.; Oh, W.; Im, I. The voice of commerce: How smart speakers reshape digital content consumption and preferences. MIS Q. 2023, 47, 857. [Google Scholar] [CrossRef]
  4. Sun, C.; Shi, Z.; Liu, X.; Ghose, A. The effect of voice AI on digital commerce. Inf. Syst. Res. 2025, 36, 1147–1166. [Google Scholar] [CrossRef]
  5. Moriuchi, E. Okay, Google!: An empirical study on voice assistants on consumer engagement and loyalty. Psychol. Mark. 2019, 36, 489–501. [Google Scholar] [CrossRef]
  6. Technavio. Voice commerce market analysis North America, Europe, APAC, South America, Middle East and Africa—US, UK, China, Canada, Germany, UK, Japan, France, Brazil, Italy, India—Size and forecast 2025–2029. Available online: https://www.technavio.com/report/voice-commerce-market-industry-analysis (accessed on 14 May 2025).
  7. Rabassa, V.; Sabri, O.; Spaletta, C. Conversational commerce: Do biased choices offered by voice assistants’ technology constrain its appropriation? Technol. Forecast. Soc. Change 2022, 174, 121292. [Google Scholar] [CrossRef]
  8. Kamoonpuri, S.Z.; Sengar, A. Love it or hate it? Deconstructing consumers’ attitudes towards AI enabled voice shopping. J. Consum. Behav. 2024, 23, 2395–2412. [Google Scholar] [CrossRef]
  9. Statista. Share of Online Consumers Who Regularly Bought Products Via the Following Shopping Channels in the United States as of March 2021. Available online: https://www.statista.com/statistics/1242517/united-states-consumers-preferred-purchase-channels/ (accessed on 6 October 2024).
  10. PYMNTS. How the World Does Digital: A Deep Dive into Global Digital Engagement. Available online: https://www.pymnts.com/consumer-insights/2024/new-report-how-the-world-does-digital-a-deep-dive-into-global-digital-engagement/ (accessed on 14 May 2025).
  11. CNBC. Amazon Plans to Give Alexa an AI Overhaul—And a Monthly Subscription Price. Available online: https://www.cnbc.com/2024/05/22/amazon-plans-to-give-alexa-an-ai-overhaul-monthly-subscription-price.html (accessed on 6 July 2024).
  12. Sigh, S.; Vijay, T.S. Technology roadmapping for the e-commerce sector: A text-mining approach. J. Retail. Consum. Serv. 2024, 81, 103977. [Google Scholar]
  13. Bigne, E.; Ruiz, C.; Curras-Perez, R. Furnishing your home? The impact of voice assistant avatars in virtual reality shopping: A neurophysiological study. Comput. Hum. Behav. 2024, 153, 108104. [Google Scholar] [CrossRef]
  14. Halbauer, I.; Jacob, S.; Klarmann, M. Brand presentation order in voice shopping: Understanding the effects of sequential product presentation. J. Retail. 2022, 98, 759–778. [Google Scholar] [CrossRef]
  15. Hu, P.; Lu, Y.; Wang, B. Experiencing power over AI: The fit effect of perceived power and desire for power on consumers’ choice for voice shopping. Comput. Hum. Behav. 2022, 128, 107091. [Google Scholar] [CrossRef]
  16. Huh, J.; Whang, C.; Kim, H.Y. Building trust with voice assistants for apparel shopping: The effects of social role and user autonomy. J. Glob. Fash. Mark. 2023, 14, 5–19. [Google Scholar] [CrossRef]
  17. Rhee, C.E.; Choi, J. Effects of personalization and social role in voice shopping: An experimental study on product recommendation by a conversational voice agent. Comput. Hum. Behav. 2020, 109, 106359. [Google Scholar] [CrossRef]
  18. Bawack, R.E.; Wamba, S.F.; Carillo, K.D.A. Exploring the role of personality, trust, and privacy in customer experience performance during voice shopping: Evidence from SEM and fuzzy set qualitative comparative analysis. Int. J. Inf. Manag. 2021, 58, 102309. [Google Scholar] [CrossRef]
  19. Calahorra-Candao, G.; Martín-de Hoyos, M.J. The effect of anthropomorphism of virtual voice assistant on perceived safety as an antecedent to voice shopping. Comput. Hum. Behav. 2024, 153, 108124. [Google Scholar] [CrossRef]
  20. Carsten, C.; Philipp, B. Acceptance of digital voice assistants for grocery shopping. In Proceedings of the 50th Annual EMAC Conference, Madrid, Spain, 25–28 May 2021. [Google Scholar]
  21. Lee, S.; Oh, J.; Moon, W.K. Adopting voice assistants in online shopping: Examining the role of social presence, performance risk, and machine heuristic. Int. J. Hum.-Comput. Interact. 2023, 39, 2978–2992. [Google Scholar] [CrossRef]
  22. Muthukumaran, A.P.K.; Vani, H. Optimizing the usage of voice assistants for shopping. Indian J. Sci. Technol. 2020, 13, 4407–4416. [Google Scholar] [CrossRef]
  23. Park, J.; Ha, S.; Jeong, S. Voice commerce: Exploring consumer expectations of smart speakers for shopping. In Proceedings of the International Textile and Apparel Association Annual Conference Proceedings, Denver, CO, USA, 31 December 2022. [Google Scholar]
  24. Rawool, V.; Foroudi, P.; Palazzo, M. AI-powered voice assistants: Developing a framework for building consumer trust and fostering brand loyalty. Electron. Commer. Res. 2024, 1–33. [Google Scholar] [CrossRef]
  25. Rzepka, C.; Berger, B.; Hess, T. Why another customer channel? Consumers’ perceived benefits and costs of voice commerce. In Proceedings of the 53rd Hawaii International Conference on System Sciences, Maui, HI, USA, 7–10 January 2020. [Google Scholar]
  26. Sven, T.; Stefanie, P. Alexa—What’s on my shopping list? Investigating consumer perceptions of voice-controlled devices. In Proceedings of the SERVSIG Conference Proceedings 2018 Paris: Opportunities for Services in a Challenging World, Paris, France, 14–16 June 2018. [Google Scholar]
  27. Yang, S.; Xie, W.; Chen, Y.; Li, Y.; Jiang, H.; Zhou, W. Warmth or competence? Understanding voice shopping intentions from Human-AI interaction perspective. Electron. Commer. Res. 2024, 1–30. [Google Scholar] [CrossRef]
  28. Kautish, P.; Purohit, S.; Filieri, R.; Dwivedi, Y.K. Examining the role of consumer motivations to use voice assistants for fashion shopping: The mediating role of awe experience and eWOM. Technol. Forecast. Soc. Change 2023, 190, 122407. [Google Scholar] [CrossRef]
  29. Czaja, S.J.; Charness, N.; Fisk, A.D.; Hertzog, C.; Nair, S.N.; Rogers, W.A.; Sharit, J. Factors predicting the use of technology: Findings from the center for research and education on aging and technology enhancement (CREATE). Psychol. Aging 2006, 21, 333–352. [Google Scholar] [CrossRef]
  30. Aljukhadar, M.; Senecal, S.; Nantel, J. Is more always better? Investigating the task-technology fit theory in an online user context. Inf. Manag. 2014, 51, 391–397. [Google Scholar] [CrossRef]
  31. Larsen, T.J.; Sørebø, A.M.; Sørebø, Ø. The role of task-technology fit as users’ motivation to continue information system use. Comput. Hum. Behav. 2009, 25, 778–784. [Google Scholar] [CrossRef]
  32. Parkes, A. The effect of task-individual-technology fit on user attitude and performance: An experimental investigation. Decis. Support Syst. 2013, 54, 997–1009. [Google Scholar] [CrossRef]
  33. Guha, A.; Grewal, D.; Kopalle, P.K.; Haenlein, M.; Schneider, M.J.; Jung, H.; Moustafa, R.; Hegde, D.R.; Hawkins, G. How artificial intelligence will affect the future of retailing. J. Retail. 2021, 97, 18–41. [Google Scholar] [CrossRef]
  34. Dishaw, M.T.; Strong, D.M. Extending the technology acceptance model with task-technology fit constructs. Inf. Manag. 1999, 36, 9–21. [Google Scholar] [CrossRef]
  35. Ammenwerth, E.; Iller, C.; Mahler, C. IT-adoption and the interaction of task, technology and individuals: A fit framework and a case study. BMC Med. Inform. Decis. Mak. 2006, 6, 3. [Google Scholar] [CrossRef]
  36. Agar, J. What is technology? Ann. Sci. 2020, 77, 377–382. [Google Scholar] [CrossRef]
  37. Muchenje, G.; Seppänen, M. Unpacking task-technology fit to explore the business value of big data analytics. Int. J. Inf. Manag. 2023, 69, 102619. [Google Scholar] [CrossRef]
  38. Alter, S. 18 reasons why IT-reliant work systems should replace “The IT artifact” as the core subject matter of the IS field. Commun. AIS 2003, 12, 365–394. [Google Scholar] [CrossRef]
  39. Goodhue, D.L. Supporting Users of Corporate Data: The Effect of I/S Policy Choices. Ph.D. Thesis, Sloan School of Management, Massachusetts Institute of Technology, Cambridge, MA, USA, 1988. [Google Scholar]
  40. Goodhue, D.L.; Thompson, R.L. Task-technology fit and individual performance. MIS Q. 1995, 19, 213–236. [Google Scholar] [CrossRef]
  41. Goodhue, D.L.; Klein, B.D.; March, S.T. User evaluation of IS as surrogates for objective performance. Inf. Manag. 2000, 38, 87–101. [Google Scholar] [CrossRef]
  42. Davern, M.J. Towards a unified theory of fit: Task, technology and individual. In Information Systems Foundations: Theory, Representation and Reality; Hart, D.N., Gregor, S.D., Eds.; ANU Press: Canberra, Australia, 2007; pp. 49–69. [Google Scholar]
  43. Jung, T.; Bae, S.; Moorhouse, N.; Kwon, O. The effects of Experience-Technology Fit (ETF) on consumption behavior: Extended Reality (XR) visitor experience. Inf. Technol. People 2024, 37, 2006–2034. [Google Scholar] [CrossRef]
  44. Liu, K.; Yao, J.; Tao, D.; Yang, T. Influence of individual-technology-task-environment fit on university student online learning performance: The mediating role of behavioral, emotional, and cognitive engagement. Educ. Inf. Technol. 2023, 28, 15949–15968. [Google Scholar] [CrossRef]
  45. Davis, F.D.; Bagozzi, R.P.; Warshaw, P.R. User acceptance of computer technology: A comparison of two theoretical models. Manag. Sci. 1989, 35, 982–1003. [Google Scholar] [CrossRef]
  46. Davis, F.D. A Technology Acceptance Model for Empirically Testing New End-User Information Systems: Theory and Results. Ph.D. Thesis, Sloan School of Management, Massachusetts Institute of Technology, Cambridge, MA, USA, 1986. [Google Scholar]
  47. Liu, Y.; Lee, Y.; Chen, A.N.K. Evaluating the effects of task-individual-technology fit in multi-DSS models context: A two-phase view. Decis. Support Syst. 2011, 51, 688–700. [Google Scholar] [CrossRef]
  48. Klopping, I.M.; McKinney, E. Extending the technology acceptance model and the task-technology fit model to consumer e-commerce. Inf. Technol. Learn. Perform. J. 2004, 22, 35–48. [Google Scholar]
  49. Shih, Y.Y.; Chen, C.Y. The study of behavioral intention for mobile commerce: Via integrated model of TAM and TTF. Qual. Quant. 2013, 47, 1009–1020. [Google Scholar] [CrossRef]
  50. Chang, H.S.; Lee, S.C.; Ji, Y.G. Wearable device adoption model with TAM and TTF. Int. J. Mob. Commun. 2016, 14, 518–537. [Google Scholar] [CrossRef]
  51. Dishaw, M.T.; Strong, D.M.; Bandy, D.B. Extending the task-technology fit model with self-efficacy constructs. In Proceedings of the AMCIS 2002 Proceedings, Dallas, TX, USA, 9–11 August 2002. [Google Scholar]
  52. Yen, D.C.; Wu, C.-S.; Cheng, F.-F.; Huang, Y.-W. Determinants of users’ intention to adopt wireless technology: An empirical study by integrating TTF with TAM. Comput. Hum. Behav. 2010, 26, 906–915. [Google Scholar] [CrossRef]
  53. Park, J.; Gunn, F.; Lee, Y.H.; Shim, S. Consumer acceptance of a revolutionary technology-driven products: The role of adoption in the industrial design development. J. Retail. Consum. Serv. 2015, 26, 115–124. [Google Scholar] [CrossRef]
  54. Venkatesh, V.; Davis, F.D. A theoretical extension of the technology acceptance model: Four longitudinal field studies. Manag. Sci. 2000, 46, 186–204. [Google Scholar] [CrossRef]
  55. Bawack, R.E.; Wamba, S.F.; Carillo, K.D.A. Adoption of smart speakers for voice shopping. In Exploring Innovation in a Digital World; Ceici, F., Prencipe, A., Spagnoletti, P., Eds.; Springer: Cham, Switzerland, 2021; Volume 51, pp. 21–35. [Google Scholar]
  56. Jang, Y.T.J.; Liu, A.Y.; Ke, W.Y. Exploring smart retailing: Anthropomorphism in voice shopping of smart speaker. Inf. Technol. People 2023, 36, 2894–2913. [Google Scholar] [CrossRef]
  57. Calahorra-Candao, G.; Martín-de Hoyos, M.J. From typing to talking: Unveiling AI’s role in the evolution of voice assistant integration in online shopping. Information 2024, 15, 202. [Google Scholar] [CrossRef]
  58. Yang, S.; Xie, W.; Chen, Y.; Li, Y.; Wei, J. Understanding voice shopping value co-creation: A Sem-Fsqca method. J. Comput. Inf. Syst. 2024, 1–19. [Google Scholar] [CrossRef]
  59. Gupta, A.S.; Mukherjee, J. Exploring drivers of customer engagement with voice interface in E-retail. Int. J. Retail. Distrib. Manag. 2025, 53, 297–311. [Google Scholar] [CrossRef]
  60. Sharma, H.; Sharma, D. Does humanizing help? Exploring the tole of anthropomorphism and privacy concerns in shoppers adoption of voice-commerce. Mark. Intell. Plan. 2025, 24, 1220–1250. [Google Scholar] [CrossRef]
  61. Lu, H.P.; Yang, Y.W. Toward an understanding of the behavioral intention to use a social networking site: An extension of task-technology fit to social-technology fit. Comput. Hum. Behav. 2014, 34, 323–332. [Google Scholar] [CrossRef]
  62. D’Ambra, J.; Wilson, C.S.; Akter, S. Application of the task-technology fit model to structure and evaluate the adoption of e-books by academics. J. Am. Soc. Inf. Sci. Technol. 2013, 64, 48–64. [Google Scholar] [CrossRef]
  63. Nagar, K.; Gandotra, P. Exploring choice overload, Internet shopping anxiety, variety seeking and online shopping adoption relationship: Evidence from online fashion stores. Glob. Bus. Rev. 2016, 17, 851–869. [Google Scholar] [CrossRef]
  64. Wassel, B. Online Shopping Takes Excessive Effort for Many Customers, Study Finds. Available online: https://www.customerexperiencedive.com/news/ecommerce-excessive-effort-personalization-accenture-study/715727/ (accessed on 6 July 2024).
  65. PYMNTS. Multitasking Consumers Want to Shop—And Work—At the Same Time. Available online: https://www.pymnts.com/study_posts/multitasking-consumers-want-to-shop-and-work-at-the-same-time/ (accessed on 6 July 2024).
  66. Walmart Global Tech. Walmart Releases First Annual State of Adaptive Retail Report. Available online: https://www.linkedin.com/posts/walmartglobaltech_walmart-releases-first-annual-state-of-adaptive-activity-7221566166090047488-pbOm (accessed on 10 September 2024).
  67. Tam, C.; Oliveira, T. Understanding the impact of m-banking on individual performance: DeLone & McLean and TTF perspective. Comput. Hum. Behav. 2016, 61, 233–244. [Google Scholar]
  68. D’Ambra, J.; Wilson, C.S. Explaining perceived performance of the world wide web: Uncertainty and the task-technology fit model. Internet Res. 2004, 14, 294–310. [Google Scholar] [CrossRef]
  69. Kim, C.; Mirusmonov, M.; Lee, I. An empirical examination of factors influencing the intention to use mobile payment. Comput. Hum. Behav. 2010, 26, 310–322. [Google Scholar] [CrossRef]
  70. Sattarapu, P.K.; Wadera, D.; Nguyen, N.P.; Kaur, J.; Kaur, S.; Mogaji, E. Tomeito or Tomahto: Exploring consumer’s accent and their engagement with artificially intelligent interactive voice assistants. J. Consum. Behav. 2024, 23, 278–298. [Google Scholar] [CrossRef]
  71. Hu, P.; Lu, Y.; Gong, Y.; Ding, A.W. Speaking vs. listening: Balancing conversation attributes of voice assistants for better voice marketing. Int. J. Res. Mark. 2023, 40, 109–127. [Google Scholar] [CrossRef]
  72. Choudhary, S.; Kaushik, N.; Sivathanu, B.; Rana, N.P. Assessing factors influencing customers’ adoption of AI-based voice assistants. J. Comput. Inf. Syst. 2025, 65, 592–609. [Google Scholar] [CrossRef]
  73. Kumar, V.; Ashraf, A.R.; Nadeem, W. AI-powered marketing: What, where, and how? Int. J. Inf. Manag. 2024, 77, 102783. [Google Scholar] [CrossRef]
  74. Berry, L.L.; Seiders, K.; Grewal, D. Understanding service convenience. J. Mark. 2002, 66, 1–17. [Google Scholar] [CrossRef]
  75. Srinivasan, S.S.; Anderson, R.; Ponnavolu, K. Customer loyalty in e-commerce: An exploration of its antecedents and consequences. J. Retail. 2002, 78, 41–50. [Google Scholar] [CrossRef]
  76. Yoon, C.; Kim, S. Convenience and TAM in a ubiquitous computing environment: The case of wireless LAN. Electron. Commer. Res. Appl. 2007, 6, 102–112. [Google Scholar] [CrossRef]
  77. Prohorchik, K. Voice Commerce Could be the Next Disruptive Tech in Retail. Available online: https://rishisoftwareservices.com/blog/voice-commerce.html (accessed on 6 July 2024).
  78. Ruan, S.; Wobbrock, J.O.; Liou, K.; Ng, A.; Landay, J. Speech is 3x Faster Than Typing for English and Mandarin Text Entry on Mobile Devices. Available online: https://arxiv.org/abs/1608.07323 (accessed on 6 July 2024).
  79. Lee, J.; Ryu, M.H.; Lee, D. A study on the reciprocal relationship between user perception and retailer perception on platform-based mobile payment service. J. Retail. Consum. Serv. 2019, 48, 7–15. [Google Scholar] [CrossRef]
  80. Park, N.; Roman, R.; Lee, S.; Chung, J.E. User acceptance of a digital library system in developing countries: An application of the technology acceptance model. Int. J. Inf. Manag. 2009, 29, 196–209. [Google Scholar] [CrossRef]
  81. Okazaki, S.; Mendez, F. Exploring convenience in mobile commerce: Moderating effects of gender. Comput. Hum. Behav. 2013, 29, 1234–1242. [Google Scholar] [CrossRef]
  82. Rowe, S.D. Majority of Online Shoppers are Multitasking, Study Finds. Available online: https://www.destinationcrm.com/Articles/CRM-Insights/CRM-Blog/Majority-of-Online-Shoppers-Are-Multitasking-Study-Finds-125137.aspx (accessed on 11 October 2024).
  83. Instandart. Voice-Activated Shopping: Revolution in The E-Commerce. Available online: https://instandart.com/blog/e-commerce-development/voice-activated-shopping-revolution-in-the-e-commerce/ (accessed on 1 November 2024).
  84. Dey, A.K.; Abowd, G.D.; Salber, D. A conceptual framework and a toolkit for supporting the rapid prototyping of context-aware applications. Hum. -Comput. Interact. 2001, 16, 97–166. [Google Scholar] [CrossRef]
  85. Dey, A.K.; Häkkilä, J. Context-awareness and mobile devices. In Handbook of Research on User Interface Design and Evaluation for Mobile Technology; Lumsden, J., Ed.; IGI Global: Hershey, PA, USA, 2008; pp. 205–217. [Google Scholar]
  86. Cao, Q.; Niu, X. Integrating context-awareness and UTAUT to explain Alipay user adoption. Int. J. Ind. Ergon. 2019, 69, 9–13. [Google Scholar] [CrossRef]
  87. Russel, S.J.; Norvig, P. Artificial Intelligence: A Modern Approach, 3rd ed.; Pearson Education: Upper Saale River, NJ, USA, 2010; p. viii. [Google Scholar]
  88. Hebert, J. Needed for Voice-Assisted Technologies: Context. Available online: https://www.enterpriseai.news/2019/05/07/needed-for-voice-assisted-technologies-context/ (accessed on 11 October 2024).
  89. Lan, H.; Tang, X.; Ye, Y.; Zhang, H. Abstract or concrete? The effects of language style and service context on continuous usage intention for AI voice assistants. Humanit. Soc. Sci. Commun. 2024, 11, 99. [Google Scholar] [CrossRef]
  90. Aiolfi, S. How shopping habits change with artificial intelligence: Smart speakers’ usage intention. Int. J. Retail. Distrib. Manag. 2023, 51, 1288–1312. [Google Scholar] [CrossRef]
  91. Jackson, J.D.; Yi, M.Y.; Park, J.S. An empirical test of three mediation models for the relationship between personal innovativeness and user acceptance of technology. Inf. Manag. 2013, 50, 154–161. [Google Scholar] [CrossRef]
  92. Lu, Y.; Cao, Y.; Wang, B.; Yang, S. A study on factors that affect users’ behavioral intention to transfer usage from the offline to the online channel. Comput. Hum. Behav. 2011, 27, 355–364. [Google Scholar] [CrossRef]
  93. Mahmood, A.; Wang, J.; Wang, D.; Huang, C.M. User interaction patterns and breakdowns in conversing with LLM-powered voice assistants. Int. J. Hum.-Comput. Stud. 2025, 195, 103406. [Google Scholar] [CrossRef]
  94. McCroskey, J.C.; Richmond, V.P. Communication apprehension and shyness: Conceptual and operational distinctions. Commun. Stud. 1982, 33, 458–468. [Google Scholar] [CrossRef]
  95. Zimbardo, P.G. The Stanford Shyness Project. In Shyness: Perspectives on Research and Treatment; Jones, W.H., Cheek, J.M., Brigges, S.R., Eds.; Plenum: New York, NY, USA, 1986; pp. 17–25. [Google Scholar]
  96. Pilkonis, P.A. The behavioral consequences of shyness. J. Personal. 1977, 45, 596–611. [Google Scholar] [CrossRef]
  97. Chak, K.; Leung, L. Shyness and locus of control as predictors of Internet addiction and Internet use. CyberPsychology Behav. 2004, 7, 559–570. [Google Scholar] [CrossRef]
  98. Hammick, J.K.; Lee, M.J. Do shy people feel less communication apprehension online? The effects of virtual reality on the relationship between personality characteristics and communication outcomes. Comput. Hum. Behav. 2014, 33, 302–310. [Google Scholar] [CrossRef]
  99. Asendorpf, J.B.; Wilpers, S. Personality effects on social relationships. J. Personal. Soc. Psychol. 1998, 74, 1531–1544. [Google Scholar] [CrossRef]
  100. Baker, L.R.; Oswald, D.L. Shyness and online social networking services. J. Soc. Pers. Relatsh. 2010, 27, 873–889. [Google Scholar] [CrossRef]
  101. Nelson, L.J.; Padilla-Walker, L.M.; Badger, S.; Barry, C.M.; Carroll, J.S.; Madsen, S.D. Associations between shyness and internalizing behaviors, externalizing behaviors, and relationships during emerging adulthood. J. Youth Adolesc. 2008, 37, 605–615. [Google Scholar] [CrossRef]
  102. Madell, D.; Muncer, S. Internet communication: An activity that appeals to shy and socially phobic people? CyberPsychology Behav. 2006, 9, 618–622. [Google Scholar] [CrossRef]
  103. Miler, R.S. On the nature of embarrassability: Shyness, social evaluation, and social skill. J. Personal. 1995, 63, 339–351. [Google Scholar]
  104. Rogers, E.M.; Shoemaker, E.F. Communication of Innovations; The Free Press: New York, NY, USA, 1971. [Google Scholar]
  105. Agarwal, R.; Prasad, J. A conceptual and operational definition of personal innovativeness in the domain of information technology. Inf. Syst. Res. 1998, 9, 204–215. [Google Scholar] [CrossRef]
  106. Bhatti, T. Exploring factors influencing the adoption of mobile commerce. J. Internet Bank. Commer. 2007, 12, 1–13. [Google Scholar]
  107. Lu, J. Are personal innovativeness and social influence critical to continue with mobile commerce? Internet Res. 2014, 24, 134–159. [Google Scholar] [CrossRef]
  108. Kline, R.B. Principles and Practice of Structural Equation Modeling, 5th ed.; The Guilford Press: New York, NY, USA; pp. 156–179.
  109. Awang, Z.; Afthanorhan, A.; Mohamad, M.; Asri, M.A.M. An evaluation of measurement model for medical tourism research: The confirmatory factor analysis approach. Int. J. Tour. Policy 2015, 6, 29–45. [Google Scholar] [CrossRef]
  110. Hair, J.F.; Black, W.C.; Babin, B.J.; Anderson, R.E. Multivariate Data Analysis, 8th ed.; Cengage Leaning, EMEA: Hampshire, UK, 2019; pp. 278–280. [Google Scholar]
  111. Kang, J.Y.M.; Mun, J.M.; Johnson, K.K.P. In-store mobile usage: Downloading and usage intention toward mobile location-based retail apps. Comput. Hum. Behav. 2015, 46, 210–217. [Google Scholar] [CrossRef]
  112. Bagozzi, R.P.; Yi, Y. On the evaluation of structural equation models. J. Acad. Mark. Sci. 1988, 16, 74–94. [Google Scholar] [CrossRef]
  113. Campbell, D.T.; Fiske, D.W. Convergent and discriminant validation by the multi-trait-multi-method matrix. Psychol. Bull. 1959, 56, 81–105. [Google Scholar] [CrossRef]
  114. Fornell, C.; Larcker, D.F. Evaluating structural equation models with unobservable variables and measurement error. J. Mark. Res. 1981, 18, 39–50. [Google Scholar] [CrossRef]
  115. Venkatesh, V.; Morris, M.G.; Davis, G.B.; Davis, F.D. User acceptance of information technology: Toward a unified view. MIS Q. 2003, 27, 425–478. [Google Scholar] [CrossRef]
  116. Kim, B.G.; Park, S.C.; Lee, K.J. A structural equation modeling of the Internet acceptance in Korea. Electron. Commer. Res. Appl. 2007, 6, 425–432. [Google Scholar] [CrossRef]
  117. Jain, S.; Basu, S.; Dwivedi, Y.K.; Kaur, S. Interactive voice assistants—Does brand credibility assuage privacy risks? J. Bus. Res. 2022, 139, 701–717. [Google Scholar] [CrossRef]
  118. Johansson, S.H. Voice Commerce—Part 1: An Online Shopping Trend Getting Louder. Available online: https://ecom.nets.eu/blog-posts/voice-commerce-part-1-an-online-shopping-trend-getting-louder/ (accessed on 1 October 2024).
  119. Pavlou, P.A.; Chai, L. What drives electronic commerce across cultures? A cross-cultural empirical investigation of the theory of planned behavior. J. Electron. Commer. Res. 2002, 3, 240–253. [Google Scholar]
  120. Johnson, T.J.; Kaye, B.K. Webelievability: A path model examining how convenience and reliance predict online credibility. Journal. Mass Commun. Q. 2002, 79, 619–642. [Google Scholar] [CrossRef]
  121. Cheek, J.M. The revised Cheek and Buss shyness scale. Unpublished Manuscript; Wellesley College: Wellesley, MA, USA, 1983. [Google Scholar]
  122. Zhou, T.; Lu, Y.; Wang, B. Integrating TTF and UTAUT to explain mobile banking user adoption. Comput. Hum. Behav. 2010, 26, 760–767. [Google Scholar] [CrossRef]
  123. Schierz, P.G.; Schilke, O.; Wirtz, B.W. Understanding consumer acceptance of mobile payment services: An empirical analysis. Electron. Commer. Res. Appl. 2010, 9, 209–216. [Google Scholar] [CrossRef]
Figure 1. General relationships related to the TTF theory and TAM. Note: The figure is adapted from the TTF theory [40] and TAM ([45,46]). The relationships represented by solid lines are included in the TTF theory, and those by dashed lines are in the TAM.
Figure 1. General relationships related to the TTF theory and TAM. Note: The figure is adapted from the TTF theory [40] and TAM ([45,46]). The relationships represented by solid lines are included in the TTF theory, and those by dashed lines are in the TAM.
Jtaer 20 00293 g001
Figure 2. Research model of consumers’ acceptance of voice shopping.
Figure 2. Research model of consumers’ acceptance of voice shopping.
Jtaer 20 00293 g002
Table 1. A summary of factors in several representative voice shopping studies.
Table 1. A summary of factors in several representative voice shopping studies.
ReferencesCharacteristicsFinal Outcome VariablesMethodology
Technology
Characteristics
Individual
Characteristics
[18]Privacy concernsExtroversion, agreeableness, conscientiousness, emotional instability, intellect, and prior experienceCustomer experience performanceRegression analysis
[55]Perceived risk, privacy concern, and technology learnabilityN/AVoice shopping behaviorRegression analysis
[20]Privacy concernsTechnology anxietyBehavioral intentionRegression analysis
[7]Biased offersN/AN/AInterview
[56]AnthropomorphismN/AIntention to adopt voice shoppingRegression analysis
[21]Message interactivityMachine heuristicIntention to useRegression analysis
[19]Anthropomorphism, i.e., human-like voice, social presence, and friendlinessN/AWord of mouth and voice shopping intentionRegression analysis
[57]Enjoyment, performance expectancy, and perceived safetyN/AVoice shopping intentionRegression analysis
[27]Human–AI interaction fluencyN/AVoice shopping intentionRegression analysis
[58]Perceived competence and perceived warmthN/AVoice shopping intention and value co-creation intentionRegression analysis
[28]N/AFunctional, hedonic, social, and cognitive motivated consumers innovativenesseWOM and purchase intentionRegression analysis
[8]N/ACognitive beliefs, affective feelings, and conative behavioral intentionsN/AFuzzy Analytical Hierarchy Process
[59]Positive and negative characteristicsN/ACustomer engagement and enhanced usageInterview
[60]Anthropomorphism, privacy concerns, and functional intelligenceN/ABehavioral intentionRegression analysis
Note: Only technology characteristics, individual characteristics, and final outcome variables are included for a clear view.
Table 2. Demographic statistics of respondents.
Table 2. Demographic statistics of respondents.
CategoriesFreq.%
ExperienceNo ExperienceTotal
GenderMale938818142.6
Female11113324457.4
Age≤173140.9
18–24525911126.1
25–30829517741.7
31–4020325212.2
≥4147348119.1
OccupationStudent545711126.1
Business employee567212830.1
Institution staff525310524.7
Freelancer30245412.7
Others1215276.4
Education≤Junior middle school104143.3
(Vocational) High school24275112
College graduate25295412.7
Undergraduate10411622051.8
≥Master41458620.2
Frequency of using voice searchNever0440.9
Sometimes17218936185
Every week1721388.9
Everyday157225.2
Number of respondents204221425
Table 3. Construct reliability and discriminant validity.
Table 3. Construct reliability and discriminant validity.
CRAVETCCOACCASHPIITTTFPEOUPUVSI
TC0.6790.5130.716
CO0.8620.7580.5280.871
AC0.8990.7480.4980.6820.865
CA0.8230.70.4610.5470.5660.837
SH0.7090.549−0.383−0.4−0.362−0.3510.741
PIIT0.8750.7010.3940.4380.5460.425−0.5880.837
TTF0.8880.7990.5140.6020.7130.553−0.5060.5240.894
PEOU0.9230.7520.4080.520.6720.57−0.5320.4890.7290.867
PU0.9260.8070.470.5380.6970.572−0.3850.4790.6930.7330.898
VSI0.9310.8170.4040.6260.7060.589−0.5390.5230.7250.710.7770.904
Note: TC (task characteristics), CO (convenience), AC (accessibility), CA (context awareness), SH (shyness), PIIT (personal innovativeness in information technology), TTF (task-technology fit), PEOU (perceived ease of use), PU (perceived usefulness), and VSI (voice shopping intention).
Table 4. Results of path coefficients (integrated model based on TTF theory and TAM).
Table 4. Results of path coefficients (integrated model based on TTF theory and TAM).
PathβResultR2
H7aTask characteristicsTask-technology fit0.075Not supported0.628
H8a-1Convenience0.09Not supported
H8b-1Accessibility0.458 ***Supported
H8c-1Context awareness0.132 *Supported
H9a-1Shyness−0.248 ***Supported
H8a-2ConveniencePerceived ease of use−0.08Not supported0.633
H8b-2Accessibility0.313 ***Supported
H8c-2Context awareness0.175 **Supported
H9a-2Shyness−0.239 ***Supported
H9bPersonal innovativeness in information technology−0.047Not supported
H4Task-technology fit0.353 ***Supported
H7bTask characteristicsPerceived usefulness0.069Not supported0.645
H8a-3Convenience−0.007Not supported
H8b-3Accessibility0.254 ***Supported
H8c-3Context awareness0.117 *Supported
H9a-3Shyness0.036Not supported
H5Task-technology fit0.163 *Supported
H6Perceived ease of use0.37 ***Supported
H1Task-technology fitVoice shopping intention0.316 ***Supported0.695
H2Perceived ease of use0.159 **Supported
H3Perceived usefulness0.445 ***Supported
Note: * p < 0.05, ** p < 0.01, *** p < 0.001.
Table 5. Results of path coefficients (separating TTF theory and TAM).
Table 5. Results of path coefficients (separating TTF theory and TAM).
PathβR2
TTF theoryTask characteristicsTask-technology fit0.1110.633
Convenience0.141 *
Accessibility0.3496 ***
Context awareness0.188 ***
Task-technology fitVoice shopping intention0.772 ***0.596
TAMConveniencePerceived ease of use−0.0290.594
Accessibility0.473 ***
Context awareness0.223 ***
Shyness−0.331 ***
Personal innovativeness in information technology−0.039
ConveniencePerceived usefulness0.0360.636
Accessibility0.327 ***
Context awareness0.143 **
Shyness−0.008
Perceived ease of use0.406 ***
Perceived ease of useVoice shopping intention0.313 ***0.666
Perceived usefulness0.559 ***
Note: * p < 0.05, ** p < 0.01, *** p < 0.001.
Table 6. Path coefficient comparison for people with prior experience and no experience.
Table 6. Path coefficient comparison for people with prior experience and no experience.
PathExperienceNo Experiencet-TestDifference
ββ
TCTTF−0.0010.173 *0.857No
CO0.275 *−0.0091.954No
AC0.335 *0.475 ***0.837No
CA0.1960.0980.636No
SH−0.198 *−0.338 ***1.062No
COPEOU−0.334 *0.0092.055 *YES
AC0.42 *0.323 ***0.468No
CA0.239 *0.1260.698No
SH−0.208 *−0.297 **0.493No
PIIT−0.086−0.0660.138No
TTF0.499 ***0.264 **1.256No
TCPU0.0580.154 *0.546No
CO−0.0060.0060.089No
AC0.337 *0.226 **0.69No
CA0.0650.126 *0.111No
SH0.102−0.0711.329No
TTF0.1120.1070.032No
PEOU0.426 ***0.347 ***0.65No
TTFVSI0.492 ***0.255 ***1.461No
PEOU0.0150.194 **1.23No
PU0.412 ***0.461 ***0.327No
Note: TC (task characteristics), CO (convenience), AC (accessibility), CA (context awareness), SH (shyness), PIIT (personal innovativeness in information technology), TTF (task-technology fit), PEOU (perceived ease of use), PU (perceived usefulness), and VSI (voice shopping intention). * p < 0.05, ** p < 0.01, *** p < 0.001.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Wang, L.; Bae, S. Encourage Online Consumers to Embrace Voice Shopping: Roles of Tasks, Technology, and Individual Characteristics. J. Theor. Appl. Electron. Commer. Res. 2025, 20, 293. https://doi.org/10.3390/jtaer20040293

AMA Style

Wang L, Bae S. Encourage Online Consumers to Embrace Voice Shopping: Roles of Tasks, Technology, and Individual Characteristics. Journal of Theoretical and Applied Electronic Commerce Research. 2025; 20(4):293. https://doi.org/10.3390/jtaer20040293

Chicago/Turabian Style

Wang, Li, and SungMin Bae. 2025. "Encourage Online Consumers to Embrace Voice Shopping: Roles of Tasks, Technology, and Individual Characteristics" Journal of Theoretical and Applied Electronic Commerce Research 20, no. 4: 293. https://doi.org/10.3390/jtaer20040293

APA Style

Wang, L., & Bae, S. (2025). Encourage Online Consumers to Embrace Voice Shopping: Roles of Tasks, Technology, and Individual Characteristics. Journal of Theoretical and Applied Electronic Commerce Research, 20(4), 293. https://doi.org/10.3390/jtaer20040293

Article Metrics

Back to TopTop