recommender by time shift

Upload: ezekill

Post on 03-Apr-2018

218 views

Category:

Documents


0 download

TRANSCRIPT

  • 7/29/2019 Recommender By Time Shift

    1/6

    Detection of Preference Shift Timing

    using Time-Series Clustering

    Fuyuko Ito, Tomoyuki Hiroyasu, Mitsunori Miki and Hisatake Yokouchi

    Abstract Recommendation methods help online users topurchase products more easily by presenting products thatare likely to match their preferences. In these methods, userprofiles are constructed according to past activities on the site.When a user accesses an e-commerce site, the user preferencesmay change during the course of web shopping. We calledthis a preference shift in this paper. However, conventionalrecommendation methods suppose that user profiles are static,and therefore these methods cannot follow the preference shift.Here, a novel product recommendation method is proposed,which responds to the preference shift. With use of thisrecommendation method, the users remain at the site longerthan before. This paper discusses the detection method for

    finding the preference shift timing using time-series clustering.In the proposed method, the products preferred by a user areclustered and the preference shift timing is detected as thechange in the clustering results.

    I. INTRODUCTION

    There is increasing demand for e-commerce sites because

    they allow larger numbers of products to be presented than

    physical stores, and provide vendors with increased sales

    opportunities and greater choice for consumers. Recom-

    mendation methods used in these services extract a users

    profile based on the users activity and present information

    that suits the obtained profile. For example, Amazon.com1

    has attempted to increase sales opportunities by presentingproducts that are likely to be purchased by the user based on

    their purchase history.

    However, the users preference may change during shop-

    ping on the web, a situation that we refer to here as a

    preference shift. Conventional recommendation methods

    cannot follow the preference shift because these methods

    assume that user profiles are static. Moreover, increasing

    the time a user spends on the e-commerce site can increase

    sales opportunities. To lead users to remain at a site longer

    than before, it is necessary to update the users preferences

    constantly and be able to induce a preference shift by

    presenting certain products because a users purchase can be

    changed by visual priming of the e-commerce site[9], [11],

    [12].

    In this paper, a method to detect the timing of the

    preference shift using time-series clustering is discussed. In

    the proposed method, the products preferred by a user are

    Fuyuko Ito is with the Graduate School of Enigineering, DoshishaUniversity, Kyoto, Japan and she is the research fellow of the Japan societyfor the promotion of science. e-mail: [email protected]

    Tomoyuki Hiroyasu and Hisatake Yokouchi are with the Department ofLife and Medical Sciences, Doshisha University.

    Mitsunori Miki is with the Department of Science and Engineering,Doshisha University.

    1http://amazon.com/

    clustered and the timing of the preference shift is determined

    from changes in the features of obtained clusters. The outline

    of this paper is as follows. The next section describes the

    user preference model in the proposed method. In section

    III, the preference shift is defined and we discuss ways of

    detecting the timing of the preference shift. In section IV, we

    discuss an experiment performed to investigate which cluster

    features can be used to determine the preference shift timing

    with some artificial data. Finally, we present our conclusions

    in section V.

    II. USE RS PREFERENCE MODEL FOR PRODUCTRECOMMENDATION

    A. Preference Model in Conventional Recommendation

    Methods

    In general, recommendation methods are classified into

    the following three types[1]: collaborative filtering[7], [19],

    content-based filtering, and hybrid approaches. In content-

    based filtering, it is necessary to build a model of a users

    profile based on the preference information acquired from

    his/her purchase history. First, a target product is represented

    as a vector consisting of a number of features. There are

    several approaches to model a users preference in the

    feature space, i.e., detecting preferred regions in the featurespace and representing suitability according to the users

    preference as the fitness function. In the former approach,

    preferred regions are detected based on the users preferred

    products and the suitability of a given product according to

    this preference is predicted by the similarity between that

    product and another preferred product. On the other hand, the

    selection of products to present is optimized by maximization

    of fitness. The input of the fitness function of the preference

    is a product and the output is the fitness of the product for the

    preference. However, the fitness function as the preference

    model is not known a priori. Therefore, some methods opti-

    mize product presentation by predicting the fitness function

    interactively based on the users preferences[2], [10], [20]. In

    this study, the preference in the e-commerce site was defined

    as the tendency toward the users ideal product. Hence, the

    preferred regions in the product feature space were identified

    interactively based on the products preferred by the user.

    B. Detection of Users Preference in Feature Space using

    Clustering

    In this study, the users preference is defined as a set of

    preferred products, which are defined as those clicked by

    the user as a metric of user interest. Meanwhile, a product

    is described as a feature vector of the feature space. For

    FUZZ-IEEE 2009, Korea, August 20-24, 2009

    978-1-4244-3597-5/09/$25.00 2009 IEEE 1585

  • 7/29/2019 Recommender By Time Shift

    2/6

    Fig. 2. Detect regions corresponding to users preference by clustering.

    example, when clothes are the target products, a product is

    represented as a combination of the values of various fea-

    tures, such as color, material, sleeve length, etc. Nevertheless,

    the users preference may have multiple tendencies (see Fig.

    1). For this reason, a set of preferred products is clustered

    in the feature space and multiple tendencies of the users

    preferences can be obtained. The capability of this method

    has already been confirmed in a subjective experiment in

    which application of clustering to the preferred productswas shown to be able to acquire the multiple preferences

    of the user[5], [6]. In that experiment, the multiple preferred

    regions were specified in the feature space from the clustering

    result and the products included in the specified regions were

    presented (see Fig. 2). The results verified that the multiple

    preferences can be obtained appropriately based on clustering

    of the preferred products and the products that are presented

    from the specified regions suited the users preference.

    III. USE RS PREFERENCE SHIFT ON PRODUCT

    RECOMMENDATION

    A. Time-Series Clustering

    The preference shift on product recommendation is defined

    as the change in a users tendency regarding the ideal product

    during web shopping. For example, a user may be looking

    for a dress to wear to a party on an e-commerce site. She may

    initially begin looking for a black dress, but notice vividly

    colored dresses while shopping. She may then begin to search

    for dresses that are pink, orange, green, blue, etc. If a dress

    is represented as a vector consisting of two elements, color

    and price, all dresses clicked by the user can be mapped

    to the feature space as shown in Fig. 3.

    Fig. 1. Each region corresponds to each preference.

    Fig. 3. An example of the preference shift.

    As mentioned above, the preference shift is represented as

    the change in clustering result with clicking on a product.

    Therefore, the clustering result of clicked dresses varies as

    the search advances (see Fig. 3). However, the following

    items must be considered when clustering is applied to the

    time-series data.

    How to select the data for clustering

    How to suppress drastic changes in the clustering result

    The phrase time-series clustering in this paper means

    applying clustering to the data per unit of time as shown in

    Fig. 3 and differs from the concept of the clustering of time-series data such as waves[17]. One of the simplest methods

    of time-series clustering is the application of clustering to all

    stored preferred data as the user clicks a product. However,

    if a set of data is stored for a long time, the clustering result

    may not be changed, although small amounts of new data

    with different characteristics from most of the stored data

    may be added. Therefore, it is necessary to select data for

    clustering. Here, the sliding window technique was used and

    a certain amount of the newest data is selected as the sample

    data of the window.

    Moreover, when clustering is applied to the stored data

    FUZZ-IEEE 2009, Korea, August 20-24, 2009

    1586

  • 7/29/2019 Recommender By Time Shift

    3/6

    independently as the user clicks a product, it is possible that

    the cluster structure obtained before clicking maybe changed

    dramatically. Hence, the constraints of past clustering results

    are added to clustering of the current data in the proposed

    method to avoid drastic changes in the clustering result.

    B. Detection of Preference Shift Timing

    When the data of the preferred products are stored and allstored data are clustered, the clustering result is compared

    with the former to find the preference shift timing in the

    proposed method. Nevertheless, it is not known which feature

    of the cluster we should compare to detect the preference

    shift timing. For this reason, the features of the cluster are

    discussed to judge when the clustering result has changed in

    this paper.

    IV. DISCUSSION OF CLUSTER FEATURES FOR

    DETECTION OF THE PREFERENCE SHIFT TIMING

    A. Experimental Overview

    In this experiment, clustering was applied to the incremen-

    tal time-series data of the preferred products and the features

    of the cluster were examined to determine the preference shift

    timing. The experimental data, the clustering algorithm, the

    method for identification of the relevance between two states

    of the same cluster, and the features of the cluster were as

    described below.

    1) Experimental Data: The feature space of the data is

    a two-dimensional space and datum x is described as x =(x0, x1) when x0(0 x0 16) and x1(0 x1 16) arereal numbers. Three test data including 24 data (1 t 24)are generated by an agent implementing the following three

    preferences. Each of the following preferences represents a

    possible model of the preference shift on an e-commerce site.

    Test data (1): Preference shift of a single preference

    As shown in Fig. 4(a), the preferred region was set

    as region (1) and region (2) in the first and second

    halves of the search, respectively. First, twelve data

    were generated randomly and uniformly in region (1)

    and then an additional twelve data were generated in

    region (2) in the same way. Therefore, the preference

    shift timing of these test data was t = 13.

    Test data (2): Preference shift of one of two preferences

    In the first two thirds of the search, the preferred regions

    were set as regions (1) and (2) (see Fig. 4(b)). In theremaining third, regions (2) and (3) were set as the

    preferred regions. Thus, region (2) was preferred by the

    agent for the whole search. First, the agent generated

    eight and four data in regions (1) and (2), respec-

    tively. The order of generation in these regions was

    randomized. Then, four and eight data were generated

    in regions (2) and (3), respectively. The preference shift

    timing of these test data was t = 14 because the firstdata generated in region (3) appeared at t = 14.

    Test data (3): Simultaneous preference shifts of two

    preferences toward a new preference

    Fig. 4. The agent generates data on the regions defined as a userspreference.

    The preferred regions were set as region (1) and region

    (2) in the first two thirds of the search, and eight data

    were generated randomly in each region (see Fig. 4(c)).

    In the remaining third, the preferred region was set

    as region (3) and the last eight data were generated

    randomly in region (3). Therefore, the preference shift

    timing of this test data was t = 17.

    Test data (4): Without preference shift

    The agent randomly generated 24 data in region (1)

    throughout the whole search. In this test data, the

    preference did not change (see Fig. 4(d)).

    2) Clustering Algorithm: The algorithm to detect com-

    munities in a network, as proposed by Newman[14], was

    employed and extended to handle the weighted network in

    this experiment. This method is a hierarchical clustering

    algorithm and can obtain an optimal division of nodes in

    a network with a high density of within-cluster edges and

    a lower density of between-cluster edges by maximizing

    quality function modularity Q. Therefore, this method can

    automatically determine the number of clusters. Here, a kksymmetric matrix e whose element eij is the number of all

    edges that link nodes in cluster Ai to nodes in cluster Ajis defined. Then, ai =

    j eij is calculated. Therefore, eii

    indicates the number of edges that link nodes in cluster Aito nodes in the same cluster and ai describes the number of

    all edges emerging from nodes in cluster Ai. Q is designed

    to emphasize the connection within a cluster and diminish

    the connection between clusters as shown in the following

    equation.

    FUZZ-IEEE 2009, Korea, August 20-24, 2009

    1587

  • 7/29/2019 Recommender By Time Shift

    4/6

    Fig. 5. dc(t) is the distance between the centroids of two clusters. dS(t)is the difference between the spaces of two clusters.

    Q =

    i

    (eii a2

    i ) (1)

    In the proposed method, the clustering method mentioned

    above is applied to the weighted network whose weight is the

    degree of relevance between each of two products, whereas

    the relevance between two nodes is described as the existence

    of an edge in Newmans original method. Therefore, the

    degree of relevance between two data, xi and xj , is defined

    as the inverse of the distance between them in the feature

    space, as shown in the following equation.

    Similarity(xi,xj) =1

    Distance(xi,xj)(2)

    Meanwhile, the latest n data are utilized as samples of a

    window for clustering. In this experiment, n, the number of

    sample data for a window, was set to 9. The constraint of

    past clustering result was not added in this experiment.

    3) Identification of the Relevance between Two States of

    the Same Cluster: To verify the time-series variation of

    a certain cluster, it is necessary to identify which cluster

    Aj(t0 + t) at t = t0 + t is most relevant to the clusterAi(t0) at t = t0. In this study, the similarity between two

    clusters was computed by the auto-correlation function[16]as shown below. Moreover, |Ai(t0) Aj(t0 + t)| is thenumber of data in common between Ai(t0) and Aj(t0+t),and |Ai(t0)Aj(t0+t)| is the number of nodes in the unionof Ai(t0) and Aj(t0 + t). C Aij(t0 + t) is computed forall pairs of two clusters at t = t0 and t = t0 + t, andeach pair is defined as the same cluster in decreasing order

    of similarity. Here, t is set as t = 1.

    C Aij(t0 + t) |Ai(t0) Aj(t0 + t)|

    |Ai(t0) Aj(t0 + t)|(3)

    Fig. 6. Transitions of sum ofdc(t), sum of dS(t) and C(t) of test data(1).

    4) Features of Clusters: The following features of the

    cluster A(t) are discussed to find the preference shift timing

    in this experiment. dc(t), dS(t) and C(t) are features of thecluster A(t) in transition from t 1 to t. The concepts ofdc(t) and dS(t) are shown in Fig. 5.

    dc(t): Distance between the centroids of A(t 1) andA(t)

    dS(t): Difference between the spaces occupied by thedata of A(t 1) and A(t)

    C(t): Similarity of data between A(t 1) and A(t)

    B. Experimental Results and Discussion

    First, the time-series variation of each feature of clusters

    of the test data (1), representing a preference shift of a singlepreference, is discussed. Transitions of the sum of dc(t),sum of dS(t), and C(t) are shown in Fig. 6. The horizontalaxes in Fig. 6 describe the time t. Meanwhile, clustering is

    applied from t = 9 because the number of sample data forthe window is nine.

    Figure 6(a) shows that dc(t) and dS(t) increased rapidlyat t = 13. The preference shift timing of test data (1) wasset at t = 13. Therefore, the variations of dc(t) and dS(t)may indicate the change in clustering result. On the other

    hand, dc(t) and dS(t) were also increased at t = 20 becausethe data that suit the preference in early steps disappeared

    FUZZ-IEEE 2009, Korea, August 20-24, 2009

    1588

  • 7/29/2019 Recommender By Time Shift

    5/6

    Fig. 7. Transitions of sum ofdc(t) and dS(t) of test data (2).

    from the window. For this reason, the number of sample data

    for a window should be discussed further in future studies.

    Moreover, it is difficult to determine the preference shift

    appropriately with C(t) because the data included in each

    cluster change rapidly (see Fig. 6(b)).Second, we discuss the variation in the sum of dc(t) and

    sum of dS(t) of test data (2) as shown in Fig. 7. In the testdata, dc(t) and dS(t) increased simultaneously at t = 14when the preference shifted. However, these increments were

    small in comparison with the increments at t = 17. Two datain region (2) were very close to each other in a cluster at

    t = 13 (see Fig. 4), and a datum in region (3) was added tothis cluster at t = 14 due to the preference shift. However,the centroid of the cluster moved over slightly and the

    space covered by the data of the cluster was approximately

    the same as before because these three data were close to

    each other. This result indicated that the distribution or the

    covariance of the data in each cluster must be considered in

    future studies.

    Next, the time-series variation of each feature of clusters

    of test data (3) is shown in Fig. 8. The test data represent

    simultaneous preference shifts of two preferences toward a

    single preference. dc(t) and dS(t) increased rapidly at t = 17when the preference shift timing of this test data was set. In

    the same way as the test data (1), it is possible to detect the

    preference shift timing based on the time-series variation of

    dc(t) and dS(t). However, the number of sample data for awindow and the constraint of the past clustering result must

    be considered because the last datum that suits the initial

    preference is merged into other clusters, and dc(t) and dS(t)increased at t = 20 and t = 24.

    Finally, the time-series variation of each feature of clusters

    of test data (4) is shown in Fig. 9. The test data showed

    consistent preference with no preference shift, and it must be

    confirmed whether dc(t) and dS(t) increased or not. Figure9 shows that v were consistent in comparison with Figs. 6,

    7, and 8.

    Overall, it is possible to detect the preference shift tim-

    ing according to the rapid increases in distance between

    the centroids dc(t) and the difference between the spacesdS(t). Moreover, these features would not change when a

    Fig. 8. Transitions of sum ofdc(t) and dS(t) of the test data (3).

    Fig. 9. Transitions of sum ofdc(t) and dS(t) of test data (4).

    preference shift does not occur. Nevertheless, the distribution

    or the covariance of data within a cluster must be discussed

    further in future studies.

    V. CONCLUSIONS

    The purpose of this study was to increase sales oppor-

    tunities by detection of the preference shift on e-commerce

    sites and its triggers. In this paper, a method that applies

    time-series clustering on preferred products was proposed

    to detect the preference shift timing. The features of the

    cluster were also discussed using three sets of artificial test

    data to determine when the clustering result had changed. As

    an experimental result, the preference shift timing could be

    detected according to the time-series variations of distance

    between the centroids and the difference between the spacesof two states of the same cluster. In future studies, the

    number of sample data for a window and application of

    constraints of past clustering results should be discussed.

    Eventually, the capability of the proposed method to detect

    the preference shift timing of actual users should also be

    assessed in subjective experiments.

    REFERENCES

    [1] Adomavicius, G., Tuzhilin, E.: Toward the next generation of recom-mender systems: A survey of the state-of-the-art and possible exten-sions. IEEE Transactions on Knowledge and Data Engineering, Vol.17,No.6, pp.734749 (2005)

    FUZZ-IEEE 2009, Korea, August 20-24, 2009

    1589

  • 7/29/2019 Recommender By Time Shift

    6/6

    [2] Aoki, K., Takagi, H.: Interactive ga-based design support system forlighting design in 3-d computer graphics. The transactions of theInstitute of Electronics, Information and Communication Engineers,Vol.81, No.7, pp.16011608 (1998)

    [3] Fukui, K., Saito, K., Kimura, M., Numao, M.: Visualizing Dynamics ofthe Hot Topics Using Sequence-Based Self-Organizing Maps. LectureNotes in Artificial Intelligence, Vol.3684, pp.745751 (2005)

    [4] Fukui, K., Saito, K., Kimura, M., Numao, M.: Compilation to Visu-alize the Dynamic Clusters by the Adapted Self-Organizing Network.

    Transactions of the Japanese Society for Artificial Intelligence, Vol.23,No.5, pp.319329 (2008)[5] Ito, F., Hiroyasu, T., Miki, M., Yokouchi, H.: Discussion of Offspring

    Generation Method for Interactive Genetic Algorithms with Consider-ation of Multimodal Preference. Simulated Evolution and Learning,Lecture Notes in Computer Science, Springer, Vol.5361, pp.349359(2008)

    [6] Ito, F., Hiroyasu, T., Miki, M., Yokouchi, H.: Offspring GenerationMethod for interactive Genetic Algorithm considering Multimodal Pref-erence. Transactions of the Japanese Society for Artificial Intelligence,Vol.24, No.1, pp.127135 (2009)

    [7] Konstan, J.A., Miller, B.N., Maltz, D., Herlocker, J.L., Gordon, L.R.,Riedl, J.: Grouplens: applying collaborative filtering to usenet news.Communications of the ACM, Vol.40, No.3, pp.7787 (1997)

    [8] Kleinberg, J., Karypis, G., Konstan, J., Reidl, J.: Bursty and hierarchicalstructure in streams. In: Proceedings of the 8th ACM SIGKDDInternational Conference on Knowledge Discovery and Data Mining,

    pp.91101 (2002)[9] Koufaris, M., Kambil, A., LaBarbera, P.A.: Consumer Behavior in

    Web-Based Commerce: An Empirical Study. International Journal ofElectronic Commerce, Vol.6, No.2, pp.115138 (2002)

    [10] Llora, X., Sastry, K., Goldberg, D.E., Gupta, A., Lakshmi, L.: Combat-ing user fatigue in igas: partial ordering, support vector machines, andsynthetic fitness. In: Proceedings of Genetic Evolutionary ComputationConference, pp.13631370 (2005)

    [11] Mandel, N., Johnson, E.J.: When Web Pages Influence Choice:Effects of Visual Primes on Experts and Novices. Journal of ConsumerResearch, Vol.29, No.2, pp.235245 (2002)

    [12] Mandel, N, Nowlis, S.M.: The Effect of Making a Prediction aboutthe Outcome of a Consumption Experience on the Enjoyment of ThatExperience. Journal of Consumer Research, Vol.35, No.1, pp.920(2008)

    [13] Newman, M.E.J., Girvan, J.: Finding and evaluating communitystructure in networks. Physics Review E, Vol.69, Issue 2, 026113 (2004)

    [14] Newman, M.E.J.: Fast algorithm for detecting community structure innetworks. Physics Review E, Vol.69, Issue 6, 066133 (2004)

    [15] Palla, G., Derenyi, I., Farkas, I., Vicsek, T.: Uncovering the overlap-ping community structure of complex networks in nature and society.Nature, Vol.435, No.7043, pp.814818 (2005)

    [16] Palla, G., Barabasi, A.L., Vicsek, T.: Quantifying social groupevolution. Nature, Vol.446, No.7136, pp.664667 (2007)

    [17] Ratanamahatana, C.A. and Keogh, E.: Making Time-series Classi-fication More Accurate Using Learned Constraints. In: Proceedingsof SIAM International Conference on Data Mining (SDM 04), LakeBuena Vista, Florida, pp.1122 (2004)

    [18] Sakaki, T., Matsuo, Y., Ishizuka, M.: Topic Extraction from ScientificPaper Database. In: Proceedings of the Annual Conference on theJapanese Society for Artificial Intelligence, Vol.20, 1A1-1 (2006)

    [19] Sarwar, B., Karypis, G., Konstan, J., Reidl, J.: Item-based collaborativefiltering recommendation algorithms. In: Proceedings of the 10th

    international conference on World Wide Web, pp.285295 (2001)[20] Takagi, H.: Interactive evolutionary computation: Fusion of the

    capabilities of ec optimization and human evaluation. In: Proceedingsof the IEEE, Vol.89, No.9, pp.12751296 (2001)

    FUZZ-IEEE 2009, Korea, August 20-24, 2009

    1590