Recent statistics indicate that, in 2020, online retail accounted for about 21.3% of total retail spending in the U.S., which constitutes a 44% increase from the previous year [
1]. The convenience of online shopping, and the health concerns associated with in-store shopping during the pandemic, were major factors in the expansion of online markets. This is happening at the same time that the technology of communication, tracking, data collection, and processing is developing at a fast pace.
There are several cases where sellers used cookies to collect information on buyers’ online behavior. For example, in 2000, Amazon charged people different prices for the same DVD [
4]. Similarly, online travel agencies have relied on cookies and more advanced technologies to identify when buyers are most ready to buy [
5]. The extent to which user information is used by online sellers to individualize search results and prices is not well understood, as sellers are not explicit about the type of data they collect or the way these data are used. Yet, research indicates that firms have an incentive to excessively track and monitor buyers’ online behavior as this results in higher profits. Shiller [
6] estimated the demand for Netflix plans and found that using individual web browsing behavior to set prices is more effective than relying on demographic information only.
The purpose of this paper is to take a first step towards understanding the extent of information use and individualization that exists online by analyzing search results on Amazon.com. We investigate whether Amazon’s prices and products are (1) standardized across buyers as in brick-and-mortar stores, or (2) individualized based on buyers’ information. Since our findings showed that search results are not standardized across buyers, we proceeded to test whether individualization exists and, if so, analyze whether it is based on consumers’ demographic and geolocation characteristics.
Literature Review
There is a growing literature on online markets. The advances in the technology that allows firms to obtain buyers’ information online raises privacy concerns and highlights the need for policies ensuring transparency pertaining to the use of consumer information and regulating the extent of information use. Borgesius [
7], Steppe [
8], and Sears [
9] outline the extent to which personal information is utilized by online retailers and the role played by the current data-use regulations in Europe. Borgesius [
10] found that algorithmic pricing can discriminate against particular groups of people, which is a problem that cannot be fully addressed by the current European competition, data protection, and non-discrimination laws.
Incidences of price discrimination by retailers such as Staples, Home Depot, and Amazon have been identified in which the seller uses the buyer’s IP address or location to individualize search results and prices [
4,
5,
10,
11,
12,
13]. Several papers in the literature have attempted to track price discrimination practices in online markets more widely, and to identify the variables upon which prices and search outcomes are based. Mikians et al. [
14,
15] studied the effect of the location, system specifics (operating system and browser), user persona (affluent vs. budget-conscious customers), and originating webpage on search results and prices. Hannak et al. [
16] analyzed price discrimination and steering (where product ranking is changed across buyers). They collected information on users’ location, the operating system used, and the purchase history. Badmaeva and Hullmann [
17] tested for individualization of German online retailers through collecting user information including demographics, operating system, and purchase history.
The techniques for data collection varied across the papers. Mikians et al. [
14] used a controlled experiment with PlantLab computing nodes. Mikians et al. [
15] used crowd-sourcing through a browser extension that enabled them to collect data on 1500 queries of 600 retailers by 350 users across 5 months. They concluded that price and search discrimination may have happened. Hannak et al. [
16] recruited 100 users through Amazon’s Mechanical Turk, and they collected the results of searches undertaken by these users of a predefined set of retailers. The results obtained from each user were compared to a control account. The user/control difference was compared to the control/control (twin account) to control for noise, which was a significant contribution of their paper. In their study, the price steering was measured through the Jaccard index, Kendall’s τ, and nDGG. To study factors that might affect price discrimination and steering, they conducted controlled experiments where they controlled the variables to be analyzed, namely, account information, operating system and browser, and click and purchase history. Similarly, Badmaeva and Hullmann [
17] compared search results across students and trained personas, and computed price differences.
In general, there is some evidence of individualization in online markets, yet there is some inconsistency regarding the variables upon which prices are based. Hinderman [
13] found that individualization online is exercised by large retailers more than small ones, and when individualization was detected, it was based on either location, operating system, or some consumer characteristics. Mikians et al. [
14], Mikians et al. [
15], and Hannak et al. [
16] found evidence of individualization. Mikians et al. [
14] found that individualization is based on the origin URL of the user, location, and the spending behavior of the trained personas, but not on the operating system or the browser used. Mikians et al. [
15] found price variability to be in the range of 10 to 30 percent. They found that location plays a role in prices and that some retailers varied prices across the US while others kept US prices constant and varied prices internationally. There was no difference in prices observed by affluent and budget-constrained individuals (unlike in their earlier paper), and some price difference was observed between users who logged in and those who did not when buying Kindle e-books. Hannak et al. [
16] found evidence suggesting that individualization is based on operating system, browser, account, and purchase history. On the contrary, Vissers et al. [
18] and Azzolina et al. [
19] could not find evidence of individualization in the market for airline tickets. Badmaeva and Hüllmann [
17] found no evidence of individualization in German online retail.
In this paper we ask whether Amazon individualizes search results. Our goal is to test the following two propositions.
Proposition 1. Amazon offers a standardized menu, where it does not change the search results across different users.
Proposition 2. Amazon individualizes search results based on the user’s characteristics.
We proceeded to test these two propositions using the data we collected.
To test these two propositions, we investigated (1) whether participants reported identical search results, (2) whether products were offered at the same prices, (3) whether there is the relation between the results order and the products’ prices, and (4) whether participants observed the same coupon menu. We found that search results varied significantly across participants in terms of content, order, and prices net of coupons. However, we could not find consistent evidence that variation in users’ demographic variables, geolocation variables, and account characteristics explained variation in the search results observed after controlling for fixed effects of dynamic pricing through the day. We also found persistence in pricing and search results patterns; for example, users who receive coupons when they search for one product are more likely to receive coupons when they search for other products. The combination of these results suggests that individualization exists and is not simply based on demographic, geolocation, or account information. Instead, it is based on more complex user information e.g., browsing history and online behavior.