c. candia-castro-vallejos, c. jara-figueroa, carlos...

1
Pablo Neruda “Es tan corto el amor y tan largo el olvido” “Love is so short forgetting is so long” THE UNIVERSAL DECAY OF HUMAN COLLECTIVE MEMORY C. Candia-Castro-Vallejos, C. Jara-Figueroa, Carlos Rodriguez-Sickert, Albert-László Barabási, César A. Hidalgo Collective Learning Group, MIT Media Lab, Massachusetts Institute of Technology, Cambridge, United States Network Science Institute, Northeastern University, Boston, MA 02115, USA Centro de Investigación en Complejidad Social (CICS), Facultad de Gobierno, Universidad del Desarrollo, Santiago, Chile The literature on human collective memory proposes that memory decays through two mechanisms, one involving communicative memory--the memory sustained by oral communication--and another involving cultural memory--the memory sustained by the physical recording of information. Yet, there is no statistical evidence supporting the decay of collective memory through these two mechanisms, or exploring the universality of the decay function. Here, we use time series data on papers and patents' citations, and on the popularity of songs, movies, and biographies, to test the hypotheses that the decay of human collective memory involves the decay of communicative and cultural memory, and that the decay function is universal. We derive a mathematical model from first principles by formalizing these two mechanisms and we compare the bi-exponential function, predicted by this model, with other decay functions proposed in the literature. Our results support the hypotheses that the decay of human collective memory involves the combined decay of communicative and cultural memory, and that the decay function is universal across multiple cultural domains. These findings allow us to explain the dynamics of the attention received by a piece of cultural content during its lifetime, and suggest that the dynamics of human collective memory follows a universal decay mechanism. ABSTRACT MECHANISMS OF FORGETTING A 0.01 0.10 1.00 0 10 20 30 40 Age S(t) p q r 0.75 0.05 0.1 0.75 0.05 0.03 0.75 0.005 0.1 0.25 0.005 0.1 B 0.01 0.10 1.00 1 10 100 Age S(t) C Bi-exponential Exponential Log-normal 0.01 0.10 1.00 0 10 20 30 40 50 60 70 80 Age S(t) Communicative memory Cultural memory r p q S t+1 t+1 t+1 v t+1 S t v t u t t r p q u t+1 Decay processes Modelling collective memory. A) The red curve shows the bi-exponential function predicted by our model in log-lin scale. The light-blue and light-green curves show the two exponential of communicative and cultural memory. The inset illustrates the basic mechanics of the model. At any time point t the total memory is the sum of communicative memory u and cultural memory v. Both communicate and cultural memory decay with their own respective decay rates p and q. B) The bi-exponential model (Eq. 1) for various parameters p, q, and r, can account for a wide range of decays. C) Comparison between the bi-exponential model (in red), and the exponential and log-normal models in log-log scale. DATA & RESULTS 10 2 10 1.5 10 1 10 0.5 10 0 10 0.5 10 0 10 0.5 10 1 Age [t-1980] Average Number of new citations 10 3 10 2 10 1 10 0 10 1 10 0 10 0.5 10 1 10 1.5 Age [t-1980] Average Number of new citations k*=3 k*=7 k*=20 k*=3 k*=7 k*=20 10 3 10 2 10 1 10 0 10 1 10 0 10 0.5 10 1 10 1.5 Age [t-1980] Average Number of new citations 1.5 1.0 0.5 0.0 0.5 1.0 1.5 2.0 2.5 10 0 10 0.5 10 1 10 1.5 Standarized popularity Spotify popularity 1.5 1.0 0.5 0.0 0.5 1.0 1.5 2.0 2.5 10 0 10 0.5 10 1 10 1.5 Age [2017- t] Age [2017- t] Age [2017- t] Age [2017- t] Age [2017- t] Age [2017- t] Standarized popularity YouTube views 1.5 1.0 0.5 0.0 0.5 1.0 1.5 2.0 2.5 10 0 10 0.5 10 1 10 1.5 Standarized popularity Last.fm playcounts 1.5 1.0 0.5 0.0 0.5 1.0 1.5 2.0 2.5 10 0 10 0.5 10 1 10 1.5 10 2 Standarized popularity 1.5 1.0 0.5 0.0 0.5 1.0 1.5 2.0 2.5 10 0 10 0.5 10 1 10 1.5 Standarized popularity 10 2 10 1.5 10 1 10 0.5 10 0 10 0.5 10 0 10 0.5 10 1 Age [t-1980] Average Number of new citations k*=1 k*=3 k*=6 1.5 1.0 0.5 0.0 0.5 1.0 1.5 2.0 2.5 10 0 10 0.5 1 10 1.5 10 2 Standarized popularity Wikipedia Views Wikipedia Views 10 2 10 1 10 0 10 1 10 0 10 0.5 10 1 10 1.5 Age [t-1980] Average Number of new citations J Billboard Songs D E F PRB 1980 PRD 1980 PRL 1990 G H I Patents in CAT 1 granted in 1985 Patents in CAT 5 granted in 1990 Billboard Songs IMDB Movies K L Tennis Players Basketball Players Olympic Medalists A B C 10 3 10 2 10 1 10 0 10 1 10 0 10 0.5 10 1 10 1.5 Age [t-1980] Average Number of new citations k*=2 Bi-exponential Exponential Log-normal k*=7 k*=20 k*=2 k*=5 k*=13 k*=1 k*=2 k*=5 Bi-exponential Exponential Log-normal Bi-exponential Exponential Log-normal Bi-exponential Exponential Log-normal Bi-exponential Exponential Log-normal Bi-exponential Exponential Log-normal Bi-exponential Wikipedia Views Exponential Log-normal Bi-exponential Exponential Log-normal Bi-exponential Exponential Log-normal Bi-exponential Exponential Log-normal Bi-exponential Exponential Log-normal Bi-exponential Exponential Log-normal PRL 1980 The universal decay of collective memory. Average number of new citations received by All papers published in A) Physical Review B in 1980, B) Physical Review D in 1980, C) Physical Review Letters in 1990, D) Physical Review L in 1980, E) All Mechanical patents granted in 1990, and F) All Chemical patents granted in 1985. Here we show that this fast decay followed by a mild decline is a universal bi-exponential curve that can be derived from first principles from two concepts already in the literature on collective memory: the ideas of communicative and cultural memory. We show that this bi- exponential decay offers a more accurate description of the decay function than previously proposed models, validating these mechanisms, and also, that it produces parameters that can be readily interpretable. Our results empirically validate the concepts of communicative and cultural memory and show that the decay of collective memory is universal across a variety of domains. DISCUSSION q r p Tc [Years] Spotify Last.fm Youtube T ennis Players Olympic Medalists Basketball Players 0.00 0.05 0.10 0.0 0.1 0.2 0.3 0.0 0.2 0.4 0.6 0.8 0 10 20 30 Coefficient Estimate Songs Movies Biographies Patents Papers CAT 1 1985 k*=1 CAT 1 1985 k*=3 CAT 1 1985 k*=6 CAT 5 1990 k*=1 CAT 5 1990 k*=2 CAT 5 1990 k*=5 PRB 1980 k*=2 PRB 1980 k*=7 PRB 1980 k*=20 PRD 1980 k*=3 PRD 1980 k*=7 PRD 1980 k*=20 PRL 1980 k*=2 PRL 1980 k*=5 PRL 1980 k*=13 PRL 1990 k*=3 PRL 1990 k*=7 PRL 1990 k*=20 Model's parameters described by Eq. 1, and showed on figure 2. Each box correspond to a model's parameter and colors represent the type of cultural piece. The critical time, T c , is calculated by Eq. 2, and it is measured in years. Bars represent the standard deviation of the coefficient estimation. Next, for cultural pieces we have the standardized popularity of G) Songs based on spotify's popularity index (y-axis) as a function of the date the song first appeared in the Billboard ranking (x-axis). H) Songs based on Last.fm's play counts (y-axis) as a function of the date the song first appeared in the Billboard ranking (x-axis). I) Movies based on YouTube's view counts (y-axis) as a function of the date the movie was released (x-axis). J) Tennis players based on Wikipedia's page views (y-axis) as a function of the date that the tennis player was included in the Top 600 International males singles tennis player (x-axis). K) Olympic medalist based on Wikipedia's page views (y-axis) as a function of the date of the middle of the career of the Olympic medalist. L) Basketball players based on Wikipedia's page views (y-axis) as a function of the date that the Basketball player starts his career (x-axis). Red lines show our bi-exponential model fit, whereas the dashed lines and dotted lines show a log-normal decay and a exponential decay. Error bars represent standard errors. (2) (1) [email protected]

Upload: dotu

Post on 19-Nov-2018

214 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: C. Candia-Castro-Vallejos, C. Jara-Figueroa, Carlos ...dccs.udd.cl/files/2018/03/FINAL_Poster_Forgetting.pdf · Pablo Neruda “Es tan corto el amor y tan largo el olvido” “Love

Pablo Neruda“Es tan corto el amor y tan largo el olvido”

“Love is so short forgetting is so long”

THE UNIVERSAL DECAY OF HUMAN COLLECTIVE MEMORYC. Candia-Castro-Vallejos, C. Jara-Figueroa, Carlos Rodriguez-Sickert, Albert-László Barabási, César A. Hidalgo

Collective Learning Group, MIT Media Lab, Massachusetts Institute of Technology, Cambridge, United StatesNetwork Science Institute, Northeastern University, Boston, MA 02115, USA

Centro de Investigación en Complejidad Social (CICS), Facultad de Gobierno, Universidad del Desarrollo, Santiago, Chile

The literature on human collective memory proposes that memory decays through two mechanisms, one involving communicative memory--the memory sustained by oral communication--and another involving cultural memory--the memory sustained by the physical recording of information. Yet, there is no statistical evidence supporting the decay of collective memory through these two mechanisms, or exploring the universality of the decay function. Here, we use time series data on papers and patents' citations, and on the popularity of songs, movies, and biographies, to test the hypotheses that the decay of human collective memory involves the decay of communicative and cultural memory, and that the decay function is universal. We derive a mathematical model from first principles by formalizing these two mechanisms and we compare the bi-exponential function, predicted by this model, with other decay functions proposed in the literature. Our results support the hypotheses that the decay of human collective memory involves the combined decay of communicative and cultural memory, and that the decay function is universal across multiple cultural domains. These findings allow us to explain the dynamics of the attention received by a piece of cultural content during its lifetime, and suggest that the dynamics of human collective memory follows a universal decay mechanism.

ABSTRACT

MECHANISMS OF FORGETTINGA

0.01

0.10

1.00

0 10 20 30 40Age

S(t)

p

q

r

0.75 0.05 0.1 0.75 0.05 0.03

0.75 0.005 0.1 0.25 0.005 0.1

B

0.01

0.10

1.00

1 10 100Age

S(t)

C Bi-exponential Exponential Log-normal

0.01

0.10

1.00

0 10 20 30 40 50 60 70 80Age

S(t)

Communicative memory Cultural memory

r

p q

St+1

t+1t+1

vt+1

Stvt

ut

t

r

p

q

ut+1

Decay processes

Modelling collective memory. A) The red curve shows the bi-exponential function predicted by our model in log-lin scale. The light-blue and light-green curves show the two exponential of communicative and cultural memory. The inset illustrates the basic mechanics of the model. At any time point t the total memory is the sum of communicative memory u and cultural memory v. Both communicate and cultural memory decay with their own respective decay rates p and q. B) The bi-exponential model (Eq. 1) for various parameters p, q, and r, can account for a wide range of decays. C) Comparison between the bi-exponential model (in red), and the exponential and log-normal models in log-log scale.

DATA & RESULTS

10 2

10 1.5

10 1

10 0.5

100

100.5

100 100.5 101

Age [t-1980]

Aver

age

Num

ber o

f new

cita

tions

10 3

10 2

10 1

100

101

100 100.5 101 101.5

Age [t-1980] Av

erag

e N

umbe

r of n

ew c

itatio

ns

k*=3

k*=7k*=20

k*=3k*=7k*=20

10 3

10 2

10 1

100

101

100 100.5 101 101.5

Age [t-1980]

Aver

age

Num

ber o

f new

cita

tions

1.5

1.0

0.5

0.0

0.5

1.0

1.5

2.0

2.5

100 100.5 101 101.5

Stan

dariz

ed p

opul

arity

Spotify popularity

1.5

1.0

0.5

0.0

0.5

1.0

1.5

2.0

2.5

100 100.5 101 101.5

Age [2017- t] Age [2017- t]

Age [2017- t]

Age [2017- t]

Age [2017- t] Age [2017- t]

Stan

dariz

ed p

opul

arity

YouTube views

1.5

1.0

0.5

0.0

0.5

1.0

1.5

2.0

2.5

100 100.5 101 101.5

Stan

dariz

ed p

opul

arity

Last.fm playcounts

1.5

1.0

0.5

0.0

0.5

1.0

1.5

2.0

2.5

100 100.5 101 101.5 102

Stan

dariz

ed p

opul

arity

1.5

1.0

0.5

0.0

0.5

1.0

1.5

2.0

2.5

100 100.5 101 101.5

Stan

dariz

ed p

opul

arity

10 2

10 1.5

10 1

10 0.5

100

100.5

100 100.5 101

Age [t-1980]

Aver

age

Num

ber o

f new

cita

tions

k*=1

k*=3

k*=6

1.5

1.0

0.5

0.0

0.5

1.0

1.5

2.0

2.5

100 100.5 1 101.5 102

Stan

dariz

ed p

opul

arity

Wikipedia Views Wikipedia Views

10 2

10 1

100

101

100 100.5 101 101.5

Age [t-1980]

Aver

age

Num

ber

of n

ew c

itatio

ns

J

Billboard Songs

D E F

PRB 1980 PRD 1980 PRL 1990

G H I

Patents in CAT 1 granted in 1985Patents in CAT 5 granted in 1990

Billboard Songs IMDB Movies

K L

Tennis Players Basketball PlayersOlympic Medalists

A B C

10 3

10 2

10 1

100

101

100 100.5 101 101.5

Age [t-1980]

Aver

age

Num

ber o

f new

cita

tions

k*=2

Bi-exponential Exponential Log-normal

k*=7 k*=20

k*=2

k*=5 k*=13

k*=1

k*=2

k*=5 Bi-exponential Exponential Log-normal

Bi-exponential Exponential Log-normal

Bi-exponential Exponential Log-normal

Bi-exponential Exponential Log-normal

Bi-exponential Exponential Log-normal

Bi-exponential Wikipedia Views

Exponential Log-normal

Bi-exponential Exponential Log-normal

Bi-exponential Exponential Log-normal

Bi-exponential Exponential Log-normal

Bi-exponential Exponential Log-normal

Bi-exponential Exponential Log-normal

PRL 1980

The universal decay of collective memory. Average number of new citations received by All papers published in A) Physical Review B in 1980, B) Physical Review D in 1980, C) Physical Review Letters in 1990, D) Physical Review L in 1980, E) All Mechanical patents granted in 1990, and F) All Chemical patents granted in 1985.

Here we show that this fast decay followed by a mild decline is a universal bi-exponential curve that can be derived from first principles from two concepts already in the literature on collective memory: the ideas of communicative and cultural memory. We show that this bi-exponential decay offers a more accurate description of the decay function than previously proposed models, validating these mechanisms, and also, that it produces parameters that can be readily interpretable.

Our results empirically validate the concepts of communicative and cultural memory and show that the decay of collective memory is universal across a variety of domains.

DISCUSSION

qr

pTc [Years]

Spoti

fyLa

st.fm

Youtu

beTe

nnis

Play

ers

Olym

pic M

edali

stsBa

sketb

all P

layer

s

0.00

0.05

0.10

0.0

0.1

0.2

0.3

0.0

0.2

0.4

0.6

0.8

0

10

20

30

Coef

ficie

nt E

stim

ate

Songs Movies Biographies Patents Papers

CAT

1 198

5 k*=

1CA

T 1 1

985 k

*=3

CAT

1 198

5 k*=

6CA

T 5 1

990 k

*=1

CAT

5 199

0 k*=

2CA

T 5 1

990 k

*=5

PRB

1980

k*=2

PRB

1980

k*=7

PRB

1980

k*=2

0PR

D 19

80 k*

=3PR

D 19

80 k*

=7PR

D 19

80 k*

=20

PRL 1

980 k

*=2

PRL 1

980 k

*=5

PRL 1

980 k

*=13

PRL 1

990 k

*=3

PRL 1

990 k

*=7

PRL 1

990 k

*=20

Model's parameters described by Eq. 1, and showed on figure 2. Each box correspond to a model's parameter and colors represent the type of cultural piece. The critical time, Tc, is

calculated by Eq. 2, and it is measured in years. Bars represent the standard deviation of the coefficient estimation.

Next, for cultural pieces we have the standardized popularity of G) Songs based on spotify's popularity index (y-axis) as a function of the date the song first appeared in the Billboard ranking (x-axis). H) Songs based on Last.fm's play counts (y-axis) as a function of the date the song first appeared in the Billboard ranking (x-axis). I) Movies based on YouTube's view counts (y-axis) as a function of the date the movie was released (x-axis). J) Tennis players based on Wikipedia's page views (y-axis) as a function of the date that the tennis player was included in the Top 600 International males singles tennis player (x-axis). K) Olympic medalist based on Wikipedia's page views (y-axis) as a function of the date of the middle of the career of the Olympic medalist. L) Basketball players based on Wikipedia's page views (y-axis) as a function of the date that the Basketball player starts his career (x-axis). Red lines show our bi-exponential model fit, whereas the dashed lines and dotted lines show a log-normal decay and a exponential decay. Error bars represent standard errors.

(2)

(1)

[email protected]