![Page 1: Consumer Behavior Prediction using Parametric and Nonparametric Methods](https://reader030.vdocument.in/reader030/viewer/2022033014/56814746550346895db482a6/html5/thumbnails/1.jpg)
Consumer Behavior Prediction using Parametric and
Nonparametric Methods
Elena EnevaCALD Masters Presentation
19 August 2002
Advisors: Alan Montgomery, Rich Caruana,
Christos Faloutsos
![Page 2: Consumer Behavior Prediction using Parametric and Nonparametric Methods](https://reader030.vdocument.in/reader030/viewer/2022033014/56814746550346895db482a6/html5/thumbnails/2.jpg)
Outline
Introduction Data Economics Overview Baseline Models New Hybrid Models Results Conclusions and Future Work
![Page 3: Consumer Behavior Prediction using Parametric and Nonparametric Methods](https://reader030.vdocument.in/reader030/viewer/2022033014/56814746550346895db482a6/html5/thumbnails/3.jpg)
Background
Retail chains are aiming to customize prices in individual stores
Pricing strategies should adapt to the neighborhood demand
Stores can increase operating profit margins by 33% to 83%
![Page 4: Consumer Behavior Prediction using Parametric and Nonparametric Methods](https://reader030.vdocument.in/reader030/viewer/2022033014/56814746550346895db482a6/html5/thumbnails/4.jpg)
Price Elasticity
consumer’s response to price change
Ppercent
Qpercent E
inelastic elastic
Q is quantity purchased
P is price of product
![Page 5: Consumer Behavior Prediction using Parametric and Nonparametric Methods](https://reader030.vdocument.in/reader030/viewer/2022033014/56814746550346895db482a6/html5/thumbnails/5.jpg)
Data Example
0
20000
40000
60000
80000
100000
0.02 0.03 0.04 0.05 0.06price
quantity
![Page 6: Consumer Behavior Prediction using Parametric and Nonparametric Methods](https://reader030.vdocument.in/reader030/viewer/2022033014/56814746550346895db482a6/html5/thumbnails/6.jpg)
2.75
3.25
3.75
4.25
4.75
5.25
-1.58 -1.53 -1.48 -1.43 -1.38 -1.33 -1.28ln(price)
ln(quant)
Data Example – Log Space
![Page 7: Consumer Behavior Prediction using Parametric and Nonparametric Methods](https://reader030.vdocument.in/reader030/viewer/2022033014/56814746550346895db482a6/html5/thumbnails/7.jpg)
Assumptions
Independence– Substitutes: fresh fruit, other juices– Other Stores
Stationarity– Change over time– Holidays
![Page 8: Consumer Behavior Prediction using Parametric and Nonparametric Methods](https://reader030.vdocument.in/reader030/viewer/2022033014/56814746550346895db482a6/html5/thumbnails/8.jpg)
“The” ModelCategory
Price of Product 1
Price of Product 2
Price of Product 3
Price of Product N
. . .
“I know your
customers”
PredictorPredictor
Quantity bought of Product 1
. . .
Quantity bought of Product 2
Quantity bought of Product 3
Quantity bought of Product N
Need to multiply this across many stores, many categories.
),0(~
))(ln(~)ln(2
N
pfq
conv
ert t
o ln
spa
ce
conv
ert t
o or
igin
al s
pace
![Page 9: Consumer Behavior Prediction using Parametric and Nonparametric Methods](https://reader030.vdocument.in/reader030/viewer/2022033014/56814746550346895db482a6/html5/thumbnails/9.jpg)
Converting to Original Space
),0(~
))(ln(~)ln(2
N
pfq
))(ln(ˆ)ln( pfq
),(~))(ln(|)ln( 2Npfq
2^
2
1)ln(
ˆq
eq
eqE2
2
1
][
![Page 10: Consumer Behavior Prediction using Parametric and Nonparametric Methods](https://reader030.vdocument.in/reader030/viewer/2022033014/56814746550346895db482a6/html5/thumbnails/10.jpg)
Existing Methods
Traditionally – using parametric models (linear regression)
Recently – using non-parametric models (neural networks)
![Page 11: Consumer Behavior Prediction using Parametric and Nonparametric Methods](https://reader030.vdocument.in/reader030/viewer/2022033014/56814746550346895db482a6/html5/thumbnails/11.jpg)
Our Goal
Advantage of LR: known functional form (linear in log space), extrapolation ability
Advantage of NN: flexibility, accuracy
robustness
acc
ura
cy
NNnew
LR
Take Advantage: use the known functional form to bias the NN
Build hybrid models from the baseline models
![Page 12: Consumer Behavior Prediction using Parametric and Nonparametric Methods](https://reader030.vdocument.in/reader030/viewer/2022033014/56814746550346895db482a6/html5/thumbnails/12.jpg)
Datasets
weekly store-level cash register data at the product level
Chilled Orange Juice category
2 years 12 products 10 random stores selected
![Page 13: Consumer Behavior Prediction using Parametric and Nonparametric Methods](https://reader030.vdocument.in/reader030/viewer/2022033014/56814746550346895db482a6/html5/thumbnails/13.jpg)
Evaluation Measure
Root Mean Squared Error (RMS) the average deviation between the
predicted quantity and the true quantity
N
iii qq
NRMSerror
1
2ˆ
1
![Page 14: Consumer Behavior Prediction using Parametric and Nonparametric Methods](https://reader030.vdocument.in/reader030/viewer/2022033014/56814746550346895db482a6/html5/thumbnails/14.jpg)
Models
Hybrids– Smart Prior– MultiTask Learning– Jumping Connections– Frozen Jumping
Connections
Baselines–Linear Regression–Neural Networks
![Page 15: Consumer Behavior Prediction using Parametric and Nonparametric Methods](https://reader030.vdocument.in/reader030/viewer/2022033014/56814746550346895db482a6/html5/thumbnails/15.jpg)
Baselines
Linear Regression
Neural Networks
![Page 16: Consumer Behavior Prediction using Parametric and Nonparametric Methods](https://reader030.vdocument.in/reader030/viewer/2022033014/56814746550346895db482a6/html5/thumbnails/16.jpg)
q is the quantity demanded pi is the price for the ith product K products overall The coefficients a and bi are determined by
the condition that the sum of the square residuals is as small as possible.
Linear Regression
),0(~
)ln()ln(
2
1
N
pbaq i
K
ii
![Page 17: Consumer Behavior Prediction using Parametric and Nonparametric Methods](https://reader030.vdocument.in/reader030/viewer/2022033014/56814746550346895db482a6/html5/thumbnails/17.jpg)
Linear Regression
![Page 18: Consumer Behavior Prediction using Parametric and Nonparametric Methods](https://reader030.vdocument.in/reader030/viewer/2022033014/56814746550346895db482a6/html5/thumbnails/18.jpg)
Results RMS
0
2000
4000
6000
8000
10000
12000
LR NN SmPr MTL JC FJC Vote WAV
![Page 19: Consumer Behavior Prediction using Parametric and Nonparametric Methods](https://reader030.vdocument.in/reader030/viewer/2022033014/56814746550346895db482a6/html5/thumbnails/19.jpg)
Neural Networks
generic nonlinear function approximators
a collection of basic units (neurons), computing a (non)linear function of their input
backpropagation
![Page 20: Consumer Behavior Prediction using Parametric and Nonparametric Methods](https://reader030.vdocument.in/reader030/viewer/2022033014/56814746550346895db482a6/html5/thumbnails/20.jpg)
Neural Networks
1 hidden layer, 100 units, sigmoid activation function
![Page 21: Consumer Behavior Prediction using Parametric and Nonparametric Methods](https://reader030.vdocument.in/reader030/viewer/2022033014/56814746550346895db482a6/html5/thumbnails/21.jpg)
Results RMS
0
2000
4000
6000
8000
10000
12000
LR NN SmPr MTL JC FJC Vote WAV
![Page 22: Consumer Behavior Prediction using Parametric and Nonparametric Methods](https://reader030.vdocument.in/reader030/viewer/2022033014/56814746550346895db482a6/html5/thumbnails/22.jpg)
Hybrids
Smart Prior MultiTask Learning Jumping Connections Frozen Jumping Connections
![Page 23: Consumer Behavior Prediction using Parametric and Nonparametric Methods](https://reader030.vdocument.in/reader030/viewer/2022033014/56814746550346895db482a6/html5/thumbnails/23.jpg)
Smart Prior
Idea: start the NN at a “good” set of weights, help it start from a “smart” prior.
Take this prior from the known “linearity” NN first trained on synthetic data generated
by the LR model NN then trained on the real data
![Page 24: Consumer Behavior Prediction using Parametric and Nonparametric Methods](https://reader030.vdocument.in/reader030/viewer/2022033014/56814746550346895db482a6/html5/thumbnails/24.jpg)
Smart Prior
![Page 25: Consumer Behavior Prediction using Parametric and Nonparametric Methods](https://reader030.vdocument.in/reader030/viewer/2022033014/56814746550346895db482a6/html5/thumbnails/25.jpg)
Results RMS
0
2000
4000
6000
8000
10000
12000
LR NN SmPr MTL JC FJC Vote WAV
![Page 26: Consumer Behavior Prediction using Parametric and Nonparametric Methods](https://reader030.vdocument.in/reader030/viewer/2022033014/56814746550346895db482a6/html5/thumbnails/26.jpg)
Multitask Learning
Idea: learning an additional related task in parallel, using a shared representation
Adding the output of the LR model (built over the same inputs) as an extra output to the NN
Make the net share its hidden nodes between both tasks
Custom halting function Custom RMS function
![Page 27: Consumer Behavior Prediction using Parametric and Nonparametric Methods](https://reader030.vdocument.in/reader030/viewer/2022033014/56814746550346895db482a6/html5/thumbnails/27.jpg)
MultiTask Learning
![Page 28: Consumer Behavior Prediction using Parametric and Nonparametric Methods](https://reader030.vdocument.in/reader030/viewer/2022033014/56814746550346895db482a6/html5/thumbnails/28.jpg)
0
2000
4000
6000
8000
10000
12000
LR NN SmPr MTL JC FJC Vote WAV
Results RMS
![Page 29: Consumer Behavior Prediction using Parametric and Nonparametric Methods](https://reader030.vdocument.in/reader030/viewer/2022033014/56814746550346895db482a6/html5/thumbnails/29.jpg)
Jumping Connections
Idea: fusing LR and NN
change architecture add connections which “jump” over the
hidden layer Gives the effect of simulating a LR and
NN all together
![Page 30: Consumer Behavior Prediction using Parametric and Nonparametric Methods](https://reader030.vdocument.in/reader030/viewer/2022033014/56814746550346895db482a6/html5/thumbnails/30.jpg)
Jumping Connections
![Page 31: Consumer Behavior Prediction using Parametric and Nonparametric Methods](https://reader030.vdocument.in/reader030/viewer/2022033014/56814746550346895db482a6/html5/thumbnails/31.jpg)
Results RMS
0
2000
4000
6000
8000
10000
12000
LR NN SmPr MTL JC FJC Vote WAV
![Page 32: Consumer Behavior Prediction using Parametric and Nonparametric Methods](https://reader030.vdocument.in/reader030/viewer/2022033014/56814746550346895db482a6/html5/thumbnails/32.jpg)
Frozen Jumping Connections
Idea: you have the linearity, now use it!
same architecture as Jumping Connections, plus really emphasizing the linearity
freeze the weights of the jumping layer, so the network can’t “forget” about the linearity
![Page 33: Consumer Behavior Prediction using Parametric and Nonparametric Methods](https://reader030.vdocument.in/reader030/viewer/2022033014/56814746550346895db482a6/html5/thumbnails/33.jpg)
Frozen Jumping Connections
![Page 34: Consumer Behavior Prediction using Parametric and Nonparametric Methods](https://reader030.vdocument.in/reader030/viewer/2022033014/56814746550346895db482a6/html5/thumbnails/34.jpg)
Frozen Jumping Connections
![Page 35: Consumer Behavior Prediction using Parametric and Nonparametric Methods](https://reader030.vdocument.in/reader030/viewer/2022033014/56814746550346895db482a6/html5/thumbnails/35.jpg)
Frozen Jumping Connections
![Page 36: Consumer Behavior Prediction using Parametric and Nonparametric Methods](https://reader030.vdocument.in/reader030/viewer/2022033014/56814746550346895db482a6/html5/thumbnails/36.jpg)
Results RMS
0
2000
4000
6000
8000
10000
12000
LR NN SmPr MTL JC FJC Vote WAV
![Page 37: Consumer Behavior Prediction using Parametric and Nonparametric Methods](https://reader030.vdocument.in/reader030/viewer/2022033014/56814746550346895db482a6/html5/thumbnails/37.jpg)
Models
Hybrids– Smart Prior– MultiTask Learning– Jumping Connections– Frozen Jumping
Connections
Baselines:–Linear Regression–Neural Networks
Combinations–Voting–Weighted Average
![Page 38: Consumer Behavior Prediction using Parametric and Nonparametric Methods](https://reader030.vdocument.in/reader030/viewer/2022033014/56814746550346895db482a6/html5/thumbnails/38.jpg)
Combining Models
Idea: Ensemble Learning
Committee Voting – equal weights for each model’s prediction
Weighted Average – optimal weights determined by a linear regression model
2 baseline and 3 hybrid models (Smart Prior, MultiTask Learning, Frozen Jumping Conections)
![Page 39: Consumer Behavior Prediction using Parametric and Nonparametric Methods](https://reader030.vdocument.in/reader030/viewer/2022033014/56814746550346895db482a6/html5/thumbnails/39.jpg)
Committee Voting
Average the predictions of the models
![Page 40: Consumer Behavior Prediction using Parametric and Nonparametric Methods](https://reader030.vdocument.in/reader030/viewer/2022033014/56814746550346895db482a6/html5/thumbnails/40.jpg)
0
2000
4000
6000
8000
10000
12000
LR NN SmPr MTL JC FJC Vote WAV
Results RMS
![Page 41: Consumer Behavior Prediction using Parametric and Nonparametric Methods](https://reader030.vdocument.in/reader030/viewer/2022033014/56814746550346895db482a6/html5/thumbnails/41.jpg)
Weighted Average – Model Regression
Linear regression on baselines and hybrid models to determine vote weights
![Page 42: Consumer Behavior Prediction using Parametric and Nonparametric Methods](https://reader030.vdocument.in/reader030/viewer/2022033014/56814746550346895db482a6/html5/thumbnails/42.jpg)
Results RMS
0
2000
4000
6000
8000
10000
12000
LR NN SmPr MTL JC FJC Vote WAV
![Page 43: Consumer Behavior Prediction using Parametric and Nonparametric Methods](https://reader030.vdocument.in/reader030/viewer/2022033014/56814746550346895db482a6/html5/thumbnails/43.jpg)
Normalized RMS Error
Compare model performance across stores Stores of different sizes, ages, locations, etc Need to normalize Compare to baselines
Take the error of the LR benchmark as unit error
![Page 44: Consumer Behavior Prediction using Parametric and Nonparametric Methods](https://reader030.vdocument.in/reader030/viewer/2022033014/56814746550346895db482a6/html5/thumbnails/44.jpg)
Normalized RMS Error
0.75
0.80
0.85
0.90
0.95
1.00
1.05
1.10
LR NN SmPr MTL JC FJC Vote WAV
![Page 45: Consumer Behavior Prediction using Parametric and Nonparametric Methods](https://reader030.vdocument.in/reader030/viewer/2022033014/56814746550346895db482a6/html5/thumbnails/45.jpg)
Conclusions
Clearly improved models for customer choice prediction
Will allow stores to price the products more strategically and optimize profits
Maintain better inventories Understand product interaction
![Page 46: Consumer Behavior Prediction using Parametric and Nonparametric Methods](https://reader030.vdocument.in/reader030/viewer/2022033014/56814746550346895db482a6/html5/thumbnails/46.jpg)
Future Work Ideas
analyze Weighted Average model compare extrapolation ability of new
models use other domain knowledge
– shrinkage model – a “super” store model with data pooled across all stores
![Page 47: Consumer Behavior Prediction using Parametric and Nonparametric Methods](https://reader030.vdocument.in/reader030/viewer/2022033014/56814746550346895db482a6/html5/thumbnails/47.jpg)
Acknowledgements
I would like to thank my advisors
and
my CALDling friends and colleagues
![Page 48: Consumer Behavior Prediction using Parametric and Nonparametric Methods](https://reader030.vdocument.in/reader030/viewer/2022033014/56814746550346895db482a6/html5/thumbnails/48.jpg)
The Most Important Slide
for this presentation and the paper:
www.cs.cmu.edu/~eneva/research.htm
![Page 49: Consumer Behavior Prediction using Parametric and Nonparametric Methods](https://reader030.vdocument.in/reader030/viewer/2022033014/56814746550346895db482a6/html5/thumbnails/49.jpg)
References
Montgomery, A. (1997). Creating Micro-Marketing Pricing Strategies Using Supermarket Scanner Data
West, P., Brockett, P. and Golden, L (1997) A Comparative Analysis of Neural Networks and Statistical Methods for Predicting Consumer Choice
Guadagni, P. and Little, J. (1983) A Logit Model of Brand Choice Calibrated on Scanner data
Rossi, P. and Allenby, G. (1993) A Bayesian Approach to Estimating Household Parameters