poster v4 - stat · title: poster_v4 author: patrick bachmann created date: 12/9/2018 12:57:22 pm

2
CUSTNO ORDER_DATE PRICE 10000094635 2005-01-09 69.95 10000094635 2005-01-09 89.95 10000094635 2005-10-22 129.95 --- 9999920974 2010-05-20 9999920974 2011-02-07 9999920974 2011-02-11 Create a CLV Object createCLVObject(…) Add time-invariant contextual factors setStaticCovariates(…) Add time-varying contextual factors setDynamicCovariates(…) Estimate the model parameters estimate(…) Predict future customer behavior predict(…) Transactional Data Time-varying information (e.g. direct marketing) Time-invariant information (e.g. demographics) 1 2 3 Summary summary(…) Plot plot(…) optional Estimating Individual Customer Lifetime Values with R: The CLVTools Package Patrick Bachmann 1 , Markus Meierer 1 , Jeffrey Näf 2 , Patrik Schilter 1 , René Algesheimer 1 1 Department of Business Administration, URPP Social Networks, University of Zurich 2 Department of Mathematics, ETH Zurich 1. CLV Assign every customer an individual value In general, CLV describes the long-term economic value of customers and gives managers an idea of how customers evolved over time. With individual CLV firms may… § Identify future most valuable customer. § Minimize Spending for unprofitable customer. § Optimize and benchmark customer development. Using aggregated CLV firms may… § Evaluate relational marketing actions. § Improve financial valuation of a firm (due diligence). 5. CLVTools package The CLVTools package is a toolbox for various probabilistic customer attrition models. It provides a framework which is easy to use but a the same time also easy to extend with further model and model features. § Class-based (S4) framework that is easy to extend as… § the package provides base classes for all models and § model specific inherited class(es). § Easy to use by implementing… § base methods for the generic functions (summary, plot, estimate, predict …). § intuitive error handling and validity checks. § Handler for the optimizer is provided (allows for restrictions & extensions). Contact Patrick Bachmann [email protected] www.socialnetworks.uzh.ch 3. Probabilistic models are widely used to calculate CLVs Among the various approaches to assess CLV, probabilistic customer attrition models stand out due to their ability to simultaneously forecast a customer’s actual lifetime and future transactions (Gupta and Zeithaml 2006). This is especially valuable in non-contractual settings, where customer attrition is not observed. Probabilistic customer attrition models build up not only on a strong tradition in marketing (Fader and Hardie 2009), but they perform well in out of sample settings to predict CLV in manifold business environments (Romero, van der Lans, and Wierenga 2013). Their real-world applicability has been shown for many non-contractual businesses e.g. medical supply firms, insurances (Schmittlein and Peterson 1994), high-tech B2B manufacturers (W. J. Reinartz and Kumar 2003) and retailers (Abe 2009; Fader, Hardie, and Lee 2005a; Platzer and Reutterer 2016). Probabilistic modeling approaches in marketing use stochastic processes to model observed customer behavior. The modeler assumes that customers’ behavior varies across the population according to a probability distribution. By using a negative binomial distribution (NBD) to model customers’ purchase behavior, Ehrenberg (1959) was the first to start the long tradition of probabilistic models in marketing. In his model the ”random” mean transaction rate of customers is characterized by a Poisson distribution. Variation of the mean transaction rate across customers, customer heterogeneity, is modeled by a Gamma distribution. The combination of both distributions results in a negative binomial distribution. The first probabilistic models focused on customers’ purchase behavior only, but ignored customer attrition completely. References Abe M (2009) “Counting Your Customers” One by One: A Hierarchical Bayes Extension to the Pareto/NBD Model. Mark. Sci. 28(3):541–553. Ehrenberg AS (1959) The Pattern of Consumer Purchases. Appl. Stat. 8(March):26–41. Fader PS, Hardie BGS (2009) Probability Models for Customer-Base Analysis. J. Interact. Mark. 23(1):61–69. Fader PS, Hardie BGS, Lee LK (2005) Computing P (alive) using the BG/NBD model. Gupta S, Zeithaml V (2006) Customer Metrics and Their Impact on Financial Performance. Mark. Sci. 25(6):718–739. Platzer M, Reutterer T (2016) Ticking away the moments: Timing regularity helps to better predict customer activity. Mark. Sci. 35(5):779–799. Reinartz WJ, Kumar V (2003) The Impact of Customer Relationship Characteristics on Profitable Lifetime Duration. J. Mark. 67(1):77–99. Romero J, van der Lans R, Wierenga B (2013) A Partially Hidden Markov Model of Customer Dynamics for CLV Measurement. J. Interact. Mark. 27(3):185–208. Schmittlein DC, Peterson RA (1994) Customer Base Analysis: An Industrial Purchase Process Application. Mark. Sci. 13(1):41–67. btyd Btydplus CLVTools Usage of S4 classes & generic methods no no yes Option for time- invariant cont. factors no yes yes Option for time- varying cont. factors no no yes Optional Model Extensions no no yes Process correlation no no yes Regularization no no yes Equality constraints no no yes Available on CRAN yes yes soon 2. Non-contractual settings are most challenging In contrast to contractual settings, where customer “announce” when they leave the company, in non-contractual settings customer churn is not observed. Two different non-contractual settings have to be distinguished: § Discrete non-contractual: Churning customers are not observed. Purchase occasions are limited to specific discrete events. Examples are: movie theater, donations § Continuous non-contractual: Churning customers are not observed. The customer can freely choose when to purchase. Examples: Retailing, hotel industry. 4. Many model variations exist To cope with the different settings and different customer characteristics many different variations of probabilistic customer attrition models have been developed. While the basic concept remains the similar, different underlying distributions are used to capture customer behavior. In addition, extensions to cope with contextual factors or relax model assumptions where added. For many models an implementation exist, however a generalized framework unifying the power of the many variations is missing. CLVTools:a universal R framework for probabilistic customer attrition models 6. Three steps to predict individual CLV When valuing customers, 4 different settings have to be distinguished: 1) Add Data: We start with transactional data that is easy to collect: Customer Id, Purchase Date and Price. Contextual factors may be added optionally. Select the model to be estimated. 2) Estimate model parameters: Add restrictions or extensions such as process correlation through optional layers to the optimization process. 3) Predict: Define the prediction horizon. Use summary() and plot() at any stage of the process to obtain an overview of the data or the results. Select the model to be estimated here. Add optional model extensions Continuous Non-contractual Discrete Non-contractual Use generic functions throughout the package.

Upload: others

Post on 24-Aug-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Poster v4 - Stat · Title: Poster_v4 Author: Patrick Bachmann Created Date: 12/9/2018 12:57:22 PM

CUSTNO ORDER_DATE PRICE10000094635 2005-01-09 69.9510000094635 2005-01-09 89.9510000094635 2005-10-22 129.95---9999920974 2010-05-20 59.959999920974 2011-02-07 119.959999920974 2011-02-11 119.95

Create a CLV Object

createCLVObject(…)

Add time-invariant contextual factors

setStaticCovariates(…)

Add time-varying contextual factors

setDynamicCovariates(…)

Estimate the model parameters

estimate(…)

Predict future customer behavior

predict(…)

Transactional Data

Time-varying information(e.g. direct marketing)

Time-invariant information(e.g. demographics)

1 2 3

Summary

summary(…)

Plot

plot(…)

optional

Estimating Individual Customer Lifetime Values with R: The CLVTools PackagePatrick Bachmann1, Markus Meierer1, Jeffrey Näf2, Patrik Schilter1, René Algesheimer1

1Department of Business Administration, URPP Social Networks, University of Zurich2Department of Mathematics, ETH Zurich

1. CLV Assign every customer an individual valueIn general, CLV describes the long-term economic value ofcustomers and gives managers an idea of how customersevolved over time.

With individual CLV firms may…

§ Identify future most valuable customer.

§ Minimize Spending for unprofitable customer.

§ Optimize and benchmark customer development.

Using aggregated CLV firms may…

§ Evaluate relational marketing actions.

§ Improve financial valuation of a firm (due diligence).

5. CLVTools packageThe CLVTools package is a toolbox for various probabilisticcustomer attrition models. It provides a framework which iseasy to use but a the same time also easy to extend withfurther model and model features.

§ Class-based (S4) framework that is easy to extend as…

§ the package provides base classes for all models and

§ model specific inherited class(es).

§ Easy to use by implementing…

§ base methods for the generic functions (summary, plot, estimate, predict …).

§ intuitive error handling and validity checks.

§ Handler for the optimizer is provided (allows for restrictions & extensions).

ContactPatrick Bachmann

[email protected]

www.socialnetworks.uzh.ch

3. Probabilistic models are widely used to calculate CLVsAmong the various approaches to assess CLV, probabilisticcustomer attrition models stand out due to their ability tosimultaneously forecast a customer’s actual lifetime andfuture transactions (Gupta and Zeithaml 2006). This isespecially valuable in non-contractual settings, where customerattrition is not observed. Probabilistic customer attrition modelsbuild up not only on a strong tradition in marketing (Fader andHardie 2009), but they perform well in out of sample settingsto predict CLV in manifold business environments (Romero,van der Lans, and Wierenga 2013). Their real-worldapplicability has been shown for many non-contractualbusinesses e.g. medical supply firms, insurances (Schmittleinand Peterson 1994), high-tech B2B manufacturers (W. J.Reinartz and Kumar 2003) and retailers (Abe 2009; Fader,Hardie, and Lee 2005a; Platzer and Reutterer 2016).

Probabilistic modeling approaches in marketing use stochasticprocesses to model observed customer behavior. Themodeler assumes that customers’ behavior varies across thepopulation according to a probability distribution. By using anegative binomial distribution (NBD) to model customers’purchase behavior, Ehrenberg (1959) was the first to start thelong tradition of probabilistic models in marketing. In his modelthe ”random” mean transaction rate of customers ischaracterized by a Poisson distribution. Variation of the meantransaction rate across customers, customer heterogeneity, ismodeled by a Gamma distribution. The combination of bothdistributions results in a negative binomial distribution. The firstprobabilistic models focused on customers’ purchase behavioronly, but ignored customer attrition completely.

ReferencesAbe M (2009) “Counting Your Customers” One by One: A Hierarchical Bayes Extension to the

Pareto/NBD Model. Mark. Sci. 28(3):541–553.Ehrenberg AS (1959) The Pattern of Consumer Purchases. Appl. Stat. 8(March):26–41.

Fader PS, Hardie BGS (2009) Probability Models for Customer-Base Analysis. J. Interact. Mark. 23(1):61–69.

Fader PS, Hardie BGS, Lee LK (2005) Computing P (alive) using the BG/NBD model.

Gupta S, Zeithaml V (2006) Customer Metrics and Their Impact on Financial Performance. Mark. Sci. 25(6):718–739.

Platzer M, Reutterer T (2016) Ticking away the moments: Timing regularity helps to better predict customer activity. Mark. Sci. 35(5):779–799.

Reinartz WJ, Kumar V (2003) The Impact of Customer Relationship Characteristics on Profitable Lifetime Duration. J. Mark. 67(1):77–99.

Romero J, van der Lans R, Wierenga B (2013) A Partially Hidden Markov Model of Customer Dynamics for CLV Measurement. J. Interact. Mark. 27(3):185–208.

Schmittlein DC, Peterson RA (1994) Customer Base Analysis: An Industrial Purchase Process Application. Mark. Sci. 13(1):41–67.

btyd Btydplus CLVTools

Usage of S4 classes & generic methods

no no yes

Option for time-invariant cont. factors

no yes yes

Option for time-varying cont. factors

no no yes

Optional Model Extensions

no no yes

Process correlation no no yes

Regularization no no yes

Equality constraints no no yes

Available on CRAN yes yes soon

2. Non-contractual settings are most challengingIn contrast to contractual settings, where customer “announce”when they leave the company, in non-contractual settingscustomer churn is not observed.

Two different non-contractual settings have to be distinguished:

§ Discrete non-contractual: Churning customers are notobserved. Purchase occasions are limited to specific discreteevents. Examples are: movie theater, donations

§ Continuous non-contractual: Churning customers are notobserved. The customer can freely choose when topurchase. Examples: Retailing, hotel industry.

4. Many model variations existTo cope with the different settings and different customercharacteristics many different variations of probabilisticcustomer attrition models have been developed. While thebasic concept remains the similar, different underlyingdistributions are used to capture customer behavior. In addition,extensions to cope with contextual factors or relax modelassumptions where added. For many models animplementation exist, however a generalized frameworkunifying the power of the many variations is missing.

CLVTools:a universal R framework for probabilistic customer attrition models

6. Three steps to predict individual CLVWhen valuing customers, 4 different settings have to bedistinguished:

1) Add Data: We start with transactional data that is easy tocollect: Customer Id, Purchase Date and Price. Contextualfactors may be added optionally. Select the model to beestimated.

2) Estimate model parameters: Add restrictions orextensions such as process correlation through optionallayers to the optimization process.

3) Predict: Define the prediction horizon.

Use summary() and plot() at any stage of the process to

obtain an overview of the data or the results.

Select the model to be estimated here.

Add optional model extensions

ContinuousNon-contractual

DiscreteNon-contractual

Use generic functions throughout the package.

Page 2: Poster v4 - Stat · Title: Poster_v4 Author: Patrick Bachmann Created Date: 12/9/2018 12:57:22 PM

University unit

How to Create a Poster1) Preparation: Making Guides Visible

PC:

- Right click in gray area outside slide frame

- Select “Grids and Guides”

- Select “Display drawing guides on screen”

Mac:

- Right (alt) click in gray area outside slide frame

- Select “Guides”

- Select “Static guides”

2) Designing Poster: Copy/Paste

Whenever practical, copy layout elements (text, image,

legends) from the basic style sheet, then replace content with new data.

It is highly recommended not to squeeze too much into the

frames – a looser fit with enough empty space looks better and

is easier to read.

3) Spell Check

The language selection in PowerPoint applies to the entire program, but may also be separately selected for individual

objects.

To specify the standard language:

PC:

- Menu “Review”

- “Language”

- “Set Proofing Language”

- Click “Standard” to confirm

Mac:

- Menu “Extras”

- “Language”

- Select language and click “Standard” to confirm

It is recommended that you set a standard language for all

objects:

- Select all objects on the slide- Same procedure as with choosing a standard language, but

click “OK” (instead of “Standard”)

- Repeat procedure for every slide

Fig.4: Nulla consequat massa quis enim. Donec pede justo, fringilla

vel, aliquet nec. Ullo to optur, eum nim int ex eritati orposapidem.

Ihil experchitium quatur sunt dio.

Pudiciliquae pel eum facitin niam quam, voluptati to iuntusFicit

expernam quas digent.

3. Seasonality is common inmany businesses• Repeating pattern of events. Mostly in equidistant time

periods.

• Pattern is constant over time

• Affects the level of transactions and lifetime of a customer

ConclusionBy incorporating seasonality the predictive performance

provides:

• a better in-sample fit

• a better out-of-sample performance

The approach presented is not only limited to model

seasonality but can also be used to include any other time-

varying determinants in a continuous non-contractual business

setting.