Session-Based Recommender System for Sustainable Digital Marketing

: Many companies operate e-commerce websites to sell fashion products. Some customers want to buy products with intention of sustainability and therefore the companies need to suggest appropriate fashion products to those customers. Recommender systems are key applications in these sustainable digital marketing strategies and high performance is the most necessary factor. This research aims to improve recommendation systems’ performance by considering item session and attribute session information. We suggest the Item Session-Based Recommender (ISBR) and the Attribute Session-Based Recommenders (ASBRs) that use item and attribute session data independently, and then we suggest the Feature-Weighted Session-Based Recommenders (FWSBRs) that combine multiple ASBRs with various feature weighting schemes. Our experimental results show that FWSBR with chi-square feature weighting scheme outperforms ISBR, ASBRs, and Collaborative Filtering Recommender (CFR). In addition, it is notable that FWSBRs overcome the cold-start item problem, one signiﬁcant limitation of CFR and ISBR, without losing performance.


Introduction
Modern companies are interested in sustainable marketing, where they aim to propose value to the consumer and satisfy the customer's needs in a sustainable way [1].Providing appropriate products and services that meet customers' needs and wants is one of major factors that enable sustainable marketing.With digital technology, modern customers can access various product and service information on the web and usually experience difficulty in choosing right products.Many web analytics studies have been conducted to address this problem in the digital marketing context [2,3].In this sense, recommender systems are an important web analytic application for sustainable digital marketing, because they can propose products and services in a personalized manner satisfying individual customers' sustainability preferences [4].
Recommender systems can be applied in many areas, including various products, books, news, etc.In this research, we propose recommender systems for fashion products, which might be closely related to sustainability needs, namely sustainable fashion [5].Fashion products are a major category of products sold in online shopping malls.According to Statista, the worldwide revenue of the e-commerce fashion industry is expected to rise from $481.2 billion in 2018 to $712.9 billion by 2022 [6].In online shopping malls, customers express their preferences implicitly and thus we do not know which customers have sustainable preferences.It is possible to achieve goals of sustainability marketing by developing recommender systems that perform accurately.recommendations are generated for the user using items preferred by the similar users.Item-Based CFRs determine similar items using user interactions with items.Item-Based CFRs recommend similar items precomputed in advance using interaction data when a user selects an item [29].
Although CFRs are successfully used in many situations, they cannot be used with new users and new items.If a user has no interactions with items, CFRs cannot recommend any items for this user; alternatively, if an item has not been interacted with by users, CFRs cannot recommend them to any user.This problem is called the cold-start problem.Usually this limitation of CFRs is addressed by considering content and developing hybrid recommender systems [18,19,22,30,31].In addition, CFRs do not consider the order or sequence of user interactions.Although some temporal recommender systems include a time factor when suggesting items [32,33], they do not focus on sequential patterns.

Session-Based Recommender Systems
SBRs explicitly consider the order of user interactions, namely sessions, when recommending items.The Association Rules Recommender (ARR) is a representative approach among SBRs.Originally it was developed to discover user consumption patterns within a large transaction database regardless of the order of their appearance [34,35].Later was extended to consider sequence patterns in the transactions [36][37][38].In the ARR, the usefulness of a sequence pattern is usually assessed using measures such as support, confidence and lift.Each measure is defined as follows: con f idence = P(Y|X) = P(X ∩ Y) P(X) Since the sequence pattern X ⇒ Y is regarded as significant only if the conditional probability P(Y|X) is greater than P(X), the lift of the sequence pattern X ⇒ Y should be larger than 1 [39,40].The ARR is easy to understand, such that it can be easily deployed in practice.The ARR has been used to find next webpage [41,42], next book [41], next mobile application [43] and next song [44].However, similar to CFR, ARR cannot suggests recommendation with new items and new users, because the user and the item has no interaction related to them.This research proposes SBRs that use attribute session data and item session data to suggest recommendations.While item SBRs still encounter the cold-start problem similar to the CFR, the attribute SBRs can overcome the cold-start problem, which will be discussed in Section 5 in detail.

Feature Weighting
Users generally consider multiple attributes simultaneously with different importance when they interact with fashion items, so it is natural to consider different weights when we create recommendation models by combining different ASBRs.In order to find weights that reflect attribute importance, this research adopted the following wrapper feature weighting methods to evaluate the goodness of a selected feature subset [45]: chi-square score, PCA, gain ratio, RELIEF and SVM.

Chi-square score
This score utilizes the test of independence to assess whether the feature is independent of the class label [46].Given a particular feature f i with class r different feature values, the chi-square (CHI) score of that feature can be computed as: where n js is the number of examples with the j th feature value given feature f i .In addition, µ js = n * s n j * n , where n j * is the number of examples with the j th feature value given feature f i , n * s denotes the number of examples in class r.A higher CHI score indicates that the feature is relatively more important [47].

PCA score
Principal Components Analysis (PCA) is a predominant linear dimensionality reduction technique.PCA seeks to map or embed examples from a high dimensional space to a low dimensional space, while keeping all the relevant linear structure intact.During feature reduction process, PCA calculates the contribution of the j th feature as follows: where V pj denote the j th entry of V p , the eigenvectors corresponding to the first m largest eigenvalues [48].
In PCA, principal components are linear combinations of existing features, where the weights used in the linear combination can be used to show the importance of attributes [49].

Gain Ratio
The information gain (IG) measure is used to select the best attribute at each node of the decision tree.Gain ratio (GR) enhances IG as it offers a normalized score of a feature's contribution to an optimal information gain based classification decision [45].Let p i be the probability that an arbitrary example in dataset D belongs to class C ik , estimated by |Ci,D| |D| .Expected information (entropy) needed to classify an example in D is defined as: Let it be assumed that D is divided by feature f i , which has v values.Information needed to classify is defined as: Thus, information gain (IG), which shows information gained by branching on feature f i , is defined as: Since information gain is biased towards features with a large number of values, it is better to normalize to resolve this problem.Gain ratio is defined as: where Gain ratio is used as one of disparity measures and a high gain ratio for a selected feature implies that the feature will be useful for classification [45].

RELIEF
This algorithm is considered one of the most successful ones due to its simplicity and effectiveness [50].RELIEF iteratively estimates feature scores according to their ability to discriminate between neighboring patterns.Specifically, it tries to find a good estimate of the following probability to assign as the weight for each feature f i .
x r ∈NM( j,y) d(x( j, i) − X(r, i))) (10) where NH( j) and NM( j, y) are the nearest examples of x j in the same class and in class y, respectively.Their sizes are m j and h jy , respectively.p(y) is the ratio of examples in class y [47].RELIEF cannot not deal with categorical attributes.Therefore, we convert the categorical values into numerical values using dummy coding.

SVM
Unlike other feature selection algorithms, Support Vector Machines (SVM) is a wrapper feature selection method which uses the classifiers as a black box and the classifier performance as the objective function to evaluate the feature subset.We choose SVM as a representative wrapper feature selection method because it helps to avoid estimating the statistical distributions of different classes in hyper-dimensional feature space based on a margin maximization principle [51].

Sustainable Marketing and Recommenders for Fashion Products
Sustainable marketing aims to implement the concept of sustainability in marketing.Digital marketing is regarded as a key factor that can fill the gap between the behavior and beliefs of society and market regarding sustainability [52].Recommender systems are major applications that support digital marketing via recommending products reflecting user preference.Recommenders for fashion products have been sought after to improve customer satisfaction [9][10][11]15,16,[53][54][55][56][57][58].The fashion product recommenders usually use interaction logs between users and products and visual features [8,15,16].In relation to visual features, an interesting characteristic of fashion recommenders is that systems consider fashion coordination between multiple products whether it is based on interaction logs or visual features [7][8][9]15,[59][60][61].When choosing clothes, people consider various aesthetic factors such as color, type, material, style and season [60].Shen et al. [10] suggest a scenario-oriented recommender that suggests recommendations by mapping users' daily scenarios to fashion product attributes.
The collaborative filtering approach has been used for exploiting interaction logs [15,16,62].However, most studies consider visual features of fashion products.Hu et al. [15] suggested a collaborative filtering recommendation that suggests multiple fashion items together, which implies that their system considers fashion coordination.He and McAuley [58] suggested a one-class collaborative filtering fashion recommender that combines images and user interaction to capture temporal drifts of fashion and personal preferences.
Recent studies in fashion recommendation employed deep learning techniques such as convolutional neural networks (CNN) in order to enhance feature processing.Yu et al. [60] introduced Brain-inspired Deep Networks (BDN), a deep CNN structure consisting of several parallel pathways and high level synthesis networks.They used this to extract the aesthetic features of clothing from images.Gu et al. [8] suggested a deep learning modeling algorithm, called Autoencoder, which differs from CNN and used it to extract features from fashion product images.He and McAuley [58] also used CNN for extracting visual features as well as temporal dynamics.
Most studies gave attention to the visual aspect of fashion products and coordination of fashion products.This research aims to develop SBRs to address the fashion product recommendation problem by reflecting users' sequential behavior on websites.When developing the SBRs, we will focus on not only item sequences but also item attribute sequences because the attributes of fashion items are important decision factors for users, and the cold-start problem can be resolved by using attribute SBRs.

Method
We propose various SBRs using interaction session and attribute session data.First, we constructed interaction and attribute session datasets from the user interaction data.With these data we then constructed an ISBR with interaction session dataset and ASBRs with attribute session datasets.Then we integrated the multiple ASBRs into a single recommender by using attribute weights by using various feature weighting algorithms.

Item
The term item refers to fashion product/s listed on the online shopping mall.A collection of items is denoted by I = {item 1 , item 2 , . . ., item n }.

Attribute
The term attribute refers to values that describe an item.An item i is described by k attributes item i = {att 1 , att 2 , . . ., att k }, where att x represents the x th attribute.

User
The term user refers to the customer/s who use the online shopping mall.A user x is denoted by user x and all m users are denoted by U = {user 1 , user 2 , . . ., user m }.In this research, the user has no attribute except a unique user identifier.

Interaction
The term interaction refers to a user's click on one of the items listed on the online shopping mall website.Implicitly an interaction presents a user's preference for a specific item.An interaction between user x (user x ) and item i (item i ) is denoted by int id = user x , item i , ts , where timestamp, ts, shows when the interaction occurs.

Session
The term session refers to sequential items or attributes selected by a user within a specific minimum and maximum time interval.An item session is denoted by item_session id = user i , item t , item t+1 , min, max , where item t comes before item t+1 in the interactions of user i .Similarly an attribute session is denoted by att_session id = user i , att t , att t+1 , min, max , where att t comes before att t+1 in the interactions of user i .

Conversion Probability
For a given item session, the item conversion probability between item t and item t+1 , denoted by CP(item t , item t+1 ), is obtained by dividing the number of sessions between item t and item t+1 with the number of sessions from item t .Similarly the attribute conversion probability between attribute t and attribute t+1 , denoted by CP(attribute t , attribute t+1 ), is obtained by dividing the number of sessions between attribute t and attribute t+1 with the number of sessions from attribute t .

Construct Session Data
We generated an item session dataset and attribute session dataset from an interaction dataset collected from the online shopping mall.There were two constraints: First, maximum and minimum interval time were set by analyzing historical data as discussed in Section 4. Second, the sessions that had the same precedent item and attribute were not included in the session data.

Session-Based Recommenders
Item conversion probability and attribute conversion probability of five attributes were calculated according to the definition in Section 3.1.We assumed that items are recommended to the user when they click an item in the online shopping mall.For each item, top-N items were selected based on item and attribute conversion probability.If multiple items had the same conversion probability, the item that had been clicked most recently was recommended to the user.

Feature-Weighted Session-Based Recommenders
Since users usually select items considering different attributes of a fashion item at once, it is logical to integrate individual ASBRs.For this purpose, we consider the aggregation of individual attribute conversion probabilities with different weights to reflect different importance of attributes.This research uses the recommendation results of the item session recommender to obtain aggregation weights of individual attribute conversion probabilities.The k th recommendation R k of ISBR can be represented as follows: where item i is an item in test, item j is a recommended item and outcome is a recommendation result (success or fail).Based on attributes of item i and item j , we can derive attribute conversion probability data as follows: where ACP att x,item i , att x,item j is attribute conversion probability between attribute x of item i and attribute x of item j .Since the problem is binary classification, various weighting algorithms, such as gain ratio, PCA, chi-square score, were used to obtain normalize attribute importance weights.Finally, the aggregated attribute conversion probability was defined as: where w x is the weight of attribute x derived from a feature weighting algorithm and ACP att x,item i , att x,item j is the conversion probability between a value of attribute x th of item i and a value of attribute x th of item j .Note that it is possible to calculate the integrated attribute conversion probability for attribute pairs, including new items which appeared in the testing.Recommendation worked as follows: for each item in the testing dataset, top-N items were selected based on the integrated attribute conversion probability.Similar to the item conversion probability recommender and individual attribute recommenders, we used the recency criterion to determine what items to display when items with the same conversion probability exist.

Baseline Recommender
In this experiment, the Item-Based CF recommender (Item-Based CFR) was employed as a baseline recommender and its performance was compared with the proposed SRSs.Item-Based CFR finds similar k-nearest items by calculating similarities between items.The top-N recommendations were generated by using a weighted average of the user's ratings of these similar items.In our experiment, the user's rating is a binary value-0 or 1.We use Item-Based CFR as a baseline recommender because it is conventional and used as baseline in previous recommendation studies [28,63].

Data Sets
Data used in this research was collected from the online shopping mall website of a leading fashion company in Korea, henceforth referred to as Company K. Data consisted of interactions between users of the website and fashion items listed on the website.A total of 51,079 unique customers were users and a total 117,791 unique fashion items were listed on the website.While items had an item identifier and brand, color, style, season and price attributes, no information was recorded about users aside from their user identifier.A total of 3,980,457 interactions were collected over one year from August 2015 to August 2016.Each entry of click data consisted of a user identifier, an item identifier, and creation date time.
Figure 1 illustrates distribution of clicks per user and clicks per item.Average clicks per user was 77.9, median click per user was 26 and the standard deviation was 127.5.Average clicks per item was 113.7, median click per item was 64 and standard deviation was 137.7.Users who only clicked once and items clicked only once by the user could not be used for this research evaluation.Each fashion product has five attributes-brand, style, color, season and price.A total of twenty brands supply fashion items to Company K. Items from two major brands (19 and 20) accounted for a large proportion of all fashion items listed on the online shopping mall website (see Figure 2a).There were 192 style type classification for all fashion items listed on the website.However, the product classification was skewed because the top 20 products accounted for 71% of all items (see Figure 2b).There were 448 color classifications for fashion items, but similar to style type classification, the top 20 colors accounted for majority of all items (81.1%) (see Figure 2c).There were five classifications for season-spring, summer, fall, winter and non-seasonal.Non-seasonal products accounted for the largest proportion of all items (33.7%), followed by summer (24.3%) and winter (19.7%) (see Figure 2d).Fashion item prices are continuous values and had a large range.We normalized the price by using min-max normalization.Figure 2e illustrates the normalized price distribution when applying equal-width discretization.In this case, most items belong to bin 1. Figure 2f illustrates normalized price distribution when applying equal-frequency discretization.Since the latter approach shows a reasonable distribution, we used it as the discretization for the fashion product prices.
(a) (b) Each fashion product has five attributes-brand, style, color, season and price.A total of twenty brands supply fashion items to Company K. Items from two major brands (19 and 20) accounted for a large proportion of all fashion items listed on the online shopping mall website (see Figure 2a).There were 192 style type classification for all fashion items listed on the website.However, the product classification was skewed because the top 20 products accounted for 71% of all items (see Figure 2b).There were 448 color classifications for fashion items, but similar to style type classification, the top 20 colors accounted for majority of all items (81.1%) (see Figure 2c).There were five classifications for season-spring, summer, fall, winter and non-seasonal.Non-seasonal products accounted for the largest proportion of all items (33.7%), followed by summer (24.3%) and winter (19.7%) (see Figure 2d).Fashion item prices are continuous values and had a large range.We normalized the price by using min-max normalization.Figure 2e illustrates the normalized price distribution when applying equal-width discretization.In this case, most items belong to bin 1. Figure 2f illustrates normalized price distribution when applying equal-frequency discretization.Since the latter approach shows a reasonable distribution, we used it as the discretization for the fashion product prices.
(19.7%) (see Figure 2d).Fashion item prices are continuous values and had a large range.We normalized the price by using min-max normalization.Figure 2e illustrates the normalized price distribution when applying equal-width discretization.In this case, most items belong to bin 1. Figure 2f illustrates normalized price distribution when applying equal-frequency discretization.Since the latter approach shows a reasonable distribution, we used it as the discretization for the fashion product prices.Based on the interaction data, we derived session data, which represents the sequence of interaction data.We considered the following two constraints: if the interval time between two sequential interactions is too large, the session is not regarded as a true session.If the interval time between two sequential interactions is too short (e.g., less than one second), the session cannot be human activity; it must be a crawler.Sessions belonging to these two type of sessions were removed from the session data.In order to set the minimum and maximum interval thresholds, we used the following heuristics: First, we set 60 min as the maximum threshold because we viewed that if the interval is greater than one hour, it cannot not be seen as a sequential behavior.Figure 3a illustrates the distribution of sessions and cumulated session ratio by session interval.Sessions within 60 min accounts for 83.1% of all sessions, but most sessions occur within 10 min (78.9% of all sessions).Secondly, we set 2 s (≤2 s) as the minimum interval because that is too short to be regarded as true expression of the user's preferences.Of all session data, 2.0% was removed by setting this minimum threshold (see Figure 3b).Based on the interaction data, we derived session data, which represents the sequence of interaction data.We considered the following two constraints: if the interval time between two sequential interactions is too large, the session is not regarded as a true session.If the interval time between two sequential interactions is too short (e.g., less than one second), the session cannot be human activity; it must be a crawler.Sessions belonging to these two type of sessions were removed from the session data.In order to set the minimum and maximum interval thresholds, we used the following heuristics: First, we set 60 min as the maximum threshold because we viewed that if the interval is greater than one hour, it cannot not be seen as a sequential behavior.Figure 3a illustrates the distribution of sessions and cumulated session ratio by session interval.Sessions within 60 min accounts for 83.1% of all sessions, but most sessions occur within 10 min (78.9% of all sessions).Secondly, we set 2 s (≤2 s) as the minimum interval because that is too short to be regarded as true expression of the user's preferences.Of all session data, 2.0% was removed by setting this minimum threshold (see Figure 3b).
human activity; it must be a crawler.Sessions belonging to these two type of sessions were removed from the session data.In order to set the minimum and maximum interval thresholds, we used the following heuristics: First, we set 60 min as the maximum threshold because we viewed that if the interval is greater than one hour, it cannot not be seen as a sequential behavior.Figure 3a illustrates the distribution of sessions and cumulated session ratio by session interval.Sessions within 60 min accounts for 83.1% of all sessions, but most sessions occur within 10 min (78.9% of all sessions).Secondly, we set 2 s (≤2 s) as the minimum interval because that is too short to be regarded as true expression of the user's preferences.Of all session data, 2.0% was removed by setting this minimum threshold (see Figure 3b).Since timestamps for data were available, we collected training and testing datasets for our evaluation from the full dataset reflecting temporal order.The training dataset (Table 1) was collected for six weeks from 03-JAN-16 to 13-FEB-16 and the test dataset was collected for one week from 14-FEB-16 to 20-FEB-16.Note that some items in the test period did not exist in the training period.We call these items 'cold-start items' in the following discussion.We generated session data, consisting of a precedent item and a subsequent item.There were 433,515 sessions created by 17,364 users in the training period and 72,104 sessions created by 5821 users in the test period.Note that there were precedent cold-start items (1307) and subsequent cold-start items (1297).

Procedures for Performance Evaluation
The performance evaluation was conducted using the following steps: (1) Construct session data for training and testing; (2) Calculate the item conversion probability, attribute conversion probability and integrated conversion probability using the training dataset; (3) Suggest top-N recommendation items for each precedent item in the test dataset.N was varied from five to 25 in increments of five in this study; (4) Check whether the top-N recommendation is successful.If a precedent item's subsequent is in the recommendation, the recommendation is regarded as a 'success'; otherwise, 'fail' [64].
In Step 3, the top-N list includes the items of the first N highest preference levels.Then, the performance of the proposed recommender system was evaluated by determining how effective the top-N list was in finding the hidden items.A similar evaluation procedure was used in previous studies (e.g., Sarwar et al. [65]).

Evaluation Metrics
Recall, precision and f1 measures are used to evaluate the performance of a recommender system [64,65].They are respectively defined as: where |T| is the number of test data and N is the number of recommended products for each product.
In addition to these two measures, we also used the F1 measure, which combines recall and precision, because these two measures are inversely related.For instance, as N increases, recall also increases but precision generally decreases.F1 is defined as the harmonic mean of recall and precision as follows [65].

ISBR and
Table 2 summarizes top-N recommendation results by the item k-NN CFR as well as ISBR and ASBRs.For each top-N recommendation, ISBR and ASBRs exhibited significantly better performance compared to Item-Based CFR in all performance measures.ISBR showed higher recall compared to ASBRs when top-5 and top-10 items are recommended.However, ASBRs achieved comparable recalls with ISBR when top-15, top-20 and top-25 items are recommended.In particular, ASBR of 'style' attribute outperformed ISBR in these recommendations (see Table 2a).Precisions were also similar to recalls.When small number of items were suggested (top-5 and top-10), ISBR outperformed ASBRs.However, when a large number of items were recommended (e.g., top-15, top-20, and top-25), ASBRs outperformed ISBR (see Table 2b).As ISBR showed better performance in recall and precision when suggesting small numbers of items (e.g., top-5 and top-10), its f1-measure was also better than ASBRs.The ASBR of 'style' attribute showed better performance compared to the item SRS and other attribute SRSs (see Table 2c).

Feature-Weighted Session-Based Recommenders
Table 3 summarizes attribute importance weights derived by feature weight algorithms.The results show that the importance weights were significantly different amongst algorithms.GR suggested 'style' was the most important, followed by 'color' and 'brand', and 'season' was the least important.PCA suggested 'brand' was the most important but other than that, weight differences were not significant compared to other algorithms.PCA suggested 'color' was the least important.CHI showed 'style' was the most important but 'color' was the least important.Interestingly RELIEF regarded 'price' as the most important, while 'style' was the least important.Finally SVM suggested 'brand' was the most important while 'Price' was the least important.Since there were significant differences in the weights, it was expected that the recommender systems that used different weights would produce different performance levels.FWSBRs achieved significant performance improvements when using attribute importance weights to combine ASBRs.In recall, all FWSBRs outperformed ASBRs.In particular, FWSBR with CHI outperformed ISBR as well as other FWSBRs (see Table 4a).Similarly, in precision, all FWSBRs outperformed any individual ASBR, and FWSBR with CHI outperformed ISBR as well as other FWSBRs (see Table 4b).Finally, F1-measures of FWSBRs with CHI exhibited the best performance among all recommender systems (see Table 4c).

FWSBRs with Cold-Start Items
The performance of FWSBRs with cold-start items was significantly better than FWSBRs without cold-start items.Recalls of FWSBR with chi-square scores exhibited higher performance compared to other recommender systems (see Table 5a).However, unlike FWSBRs without the cold-start items, the precision of FWSBR with PCA exhibited higher performance compared to other recommender systems, except FWSBR with GR.Finally, FWSBR with PCA exhibited the highest performance in the F1-measure.

Discussion
This paper examines various fashion item recommendation methods that assist customers' selection of fashion items in an e-commerce website.Recommender systems are regarded as one of the major types of web analytics systems in sustainable digital marketing, because they help the customers who have specific personal preferences, including needs for sustainability.
Sustainable fashion has diverse definitions and may cause confusion among writers.Lundblad and Davies [66] define sustainable fashion as "clothing that incorporate fair trade principles with sweatshop-free labor conditions; that does not harm the environment or workers by using biodegradable and organic cotton, and designed for a longer lifetime use; that is produced in an ethical production system, perhaps even locally; that which causes little or no environmental impact and makes use of eco-labeled or recycled materials".
In e-commerce, customers usually express their preference for sustainable fashion but consumers do not have a real choice since large quantities of such fashion products doesn't appear in the market [67].In this context, the recommenders are essential in promoting consumption of sustainable fashion products [4].In order to achieve this goal, the recommenders should promote consumption of sustainable fashion products according to the customers' preference.This implies that the recommender should work with high level performance.
This research suggested various SBRs in order to improve recommenders' performance.There are existing studies that focus on session information [34][35][36][37][38].Our proposed SBR methods are unique compared to previous research because we consider item sessions as well as attribute sessions.In addition, we suggest SBRs that combine item and attribute sessions by using various feature-weighting schemes.Our experimental demonstrates that the item and session combined SBRs perform better than baseline CFR as well as ISBR.The cold-start problem is a significant problem in the interaction-based recommenders, including various CFRs [18], Matrix Factorization recommender [19,68] and ISBR.Our proposed FWSBRs successfully suggest the cold-start items to the customers with significant performance improvement.

Conclusions
This research proposes new fashion product recommendation methods for the sustainable digital marketing using item and attribute session data.Our research suggests theoretical contributions, detailed as follows.Recommender systems have been researched to resolve information overflow problems.Content-based recommender systems and collaborative-based recommender systems have been extensively researched in academia as well as in commercial areas and have exhibited significant success in practice.These recommender systems are based on timeless information, such as item/user content and user behaviors with items.User activities and preferences depend on time factors and the recommender systems should include them when creating recommendation models.Recent temporal or time sensitive recommender systems such as [32,69,70] consider time factor and gained substantial improvement in recommendation performance.However, these studies do not consider sequential patterns when they consider temporal factors.Our major theoretical contribution is that time factor is explicitly considered when creating recommendation models.Our proposed methods also include not only sequential behavioral patterns but also sequential attribute patterns when creating session-based recommenders.In addition, we combined various ASBRs with feature weighted schemes, resulting in superior performance over pure ISBR and conventional CFR.This allows the recommenders of cold-start items to users without a loss of performance.
In addition to theoretical contributions, our research has practical implications, detailed as follows.Recently, sustainable digital marketing has gained interest from many industries.The meanings of sustainable digital marketing are two-fold.One the one hand, marketing efforts have a sustained impact on a company's results.On the other hand, the system and processes allow continuous improvement by maximizing existing assets.In this sense, recommender systems are an appropriate technical medium for sustainable digital marketing because they allow the company to provide accurate information regarding what the users want to see.Sustainability of recommender systems largely depends on their performance.Since our proposed methods achieved higher performance compared to conventional recommenders, we believe that our recommenders can have a positive impact on sustainable digital marketing results for the fashion products.
Although this research shows that various SBRs can suggest recommendation successfully, there are several limitations.First, this research uses session information, which is a special type of sequence pattern that consider only one step further from the current status to the next status.Since a series of sequence can show users' preference more completely, it is better to consider multiple sequences together [17].Second, our proposed method uses conversion probability, but there are better ways to consider sequence conversion, for example deep learning approaches.Therefore, our further research will consider these alternative approaches.

Sustainability 2019 , 18 Figure 1 Figure 1 .
Figure 1 illustrates distribution of clicks per user and clicks per item.Average clicks per user was 77.9, median click per user was 26 and the standard deviation was 127.5.Average clicks per item was 113.7, median click per item was 64 and standard deviation was 137.7.Users who only clicked once and items clicked only once by the user could not be used for this research evaluation.

Figure 1 .
Figure 1.Clicks per user and clicks per item.(a) Interactions per user; (b) interactions per item.

Figure 2 .
Figure 2. Product distribution by brand, style, color and season.(a) Item distribution by brand, (b) item distribution by style, (c) item distribution by color, (d) item distribution by season, (e) item distribution by price (equal-width discretization) and (f) item distribution by price (equal-frequency discretization).

Figure 2 .
Figure 2. Product distribution by brand, style, color and season.(a) Item distribution by brand, (b) item distribution by style, (c) item distribution by color, (d) item distribution by season, (e) item distribution by price (equal-width discretization) and (f) item distribution by price (equal-frequency discretization).

Table 1 .
Dataset for evaluation.

Table 2 .
Individual attribute session recommendation results.
(a) Recall of top-N recommendations.

Table 4 .
Recommendation results of FWSBRs.(a) Recall of top-N recommendations.

Table 5 .
Recommendation results of FWSBR with cold-tart items.(a) Recall of top-N recommendations.