what can reveal 1,018 speeches of fidel castro?

25
What can Reveal 1,018 Speeches of Fidel Castro? Sergio Peignier ? , Patricia Zapata ? CMLA, ENS Paris-Saclay, CNRS UMR 8536, Université Paris-Saclay, France Carrera de Lingüística e Idiomas, Universidad Mayor de San Andrés, La Paz, Bolivia 8 June 2018 1

Upload: others

Post on 19-Jan-2022

4 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: What can Reveal 1,018 Speeches of Fidel Castro?

What can Reveal 1,018 Speeches of Fidel Castro?

Sergio Peignier?, Patricia Zapata⇧

?CMLA, ENS Paris-Saclay, CNRS UMR 8536, Université Paris-Saclay, France

⇧Carrera de Lingüística e Idiomas, Universidad Mayor de San Andrés, La Paz, Bolivia

8 June 2018

1

Page 2: What can Reveal 1,018 Speeches of Fidel Castro?

Introduction

Fidel Castro Discourse Analysis

#Understand his Political Success

Fidel CastroMajor Cuban politician.Half-century government(1959-2008).Well known rhetoric skills.29/09/1960 UN longest speech (4h39min).

CorpusAvailable onlinea.1,018 speeches.+7,500,000 words.+4,000,000 non-empty words.

ahttp://www.cuba.cu/gobierno/discursos/

2

Page 3: What can Reveal 1,018 Speeches of Fidel Castro?

Previous WorksSystematic non-lexicometric approaches

Work Corpus size Methodology[Joyner 1964] 3 discourses Aristotelian Rhetoric Principles

[Nieto et al. 2002] 1 conversation Conversational Analysis

[Belisario 2016] 3 discourses Critical Discourse Analysis

[Reyes 2011] 6 months speeches (⇠ dozen) Sociolinguistics / Critical Discourse Analysis

Complex and time consuming task#

Use small (and possibly non-representative) samples#

Possibly biased results

3

Page 4: What can Reveal 1,018 Speeches of Fidel Castro?

Key Points

Corpus1,018 speeches

CharaudeauLinguisticDiscourseAnalysis

PrototypicDiscourses(Trees)

MachineLearningFramework

Machine Learning framework

#More representative systematic study.

4

Page 5: What can Reveal 1,018 Speeches of Fidel Castro?

Outline

1 Machine Learning Framework

2 Discourse Analysis Methodology

3 Results

4 Conclusion and Perspectives

5

Page 6: What can Reveal 1,018 Speeches of Fidel Castro?

Outline

1 Machine Learning Framework

2 Discourse Analysis Methodology

3 Results

4 Conclusion and Perspectives

6

Page 7: What can Reveal 1,018 Speeches of Fidel Castro?

Overall FrameworkCorpus1,018 speeches

Word-Embedding

CharaudeauLinguisticDiscourseAnalysis

SubspaceClustering

Directed Trees

Framework stepsPre-processing: data cleaning and empty words filtering.Word-Embedding ! Word Vector Representations.Subspace Clustering ! Contextually Linked Word Clusters.(Discursive Contexts)Directed Trees ! Intelligible Structure(Prototypic Discourses).Python 2.7 (re, nltk, gensim Word2Vec, SubCMedians)

7

Page 8: What can Reveal 1,018 Speeches of Fidel Castro?

Word-EmbeddingWord2Vec [Mikolov et al. 2013]

Distributional Hypothesis: "Words that occur in similar contextstend to have similar meanings" [Harris 1954]

Example (Fidel Castro)

” . . .

Context (width=2)z }| {Dentro ⇢⇢de �la revolucion todo contra�la revolucion nada . . . ”

Words ! D-dimensional Vector Space

Word : w = todo ! x = (0.3, 0.4,�0.9, . . . )

Context :c = {dentro, revolucion, ...} ! z = (0.1,�0.3, 0.2, . . . )

Probability that hw , ci existsP(exists|w , c) ⇡ p(x , z) = 1/(1 + e

�x

>z)

Objective function:Maximize for existing hw,ciz }| {

log(P(exists|w, c)) +

Maximize for k random words w̃z }| {k ⇥ E(log(P(exists|w̃, c)))

euros

producteconomy

trade

people

rights

freedom

left

vote

Dim 1

Dim

2

Vector

Space

Model

X

X

X

X

X

X

X

XX

8

Page 9: What can Reveal 1,018 Speeches of Fidel Castro?

Subspace Clustering: Discursive Context ClustersSubCMedians [Peignier et al. 2018]

Find clusters of words in the

Vector Space Model

Subspace Clustering is moreadapted to high-dimensionalspaces than clustering.

euros

producteconomy

X

trade

people

rights

freedomleft

vote

Dim 1

Dim

2

Vector

Space

Model

Economic

Context

Politic

Context

X

m1

m2 X

X

X

X

X

X

X

X

Word vectors X = {x1, x2, . . . }.Subspace Centers H = {m1,m2, . . . }.Minimise: SAE (S,H) =

Ps

i

2S min

m

j

2H(distance(xi

,mj

)).

9

Page 10: What can Reveal 1,018 Speeches of Fidel Castro?

Build Directed-Trees

Peoplexpeople = (0.2,0.3,...)

Votexvote = (0.3,0.29,...)

dist(xvote, xpeople)

Leftxleft = (0.33,0.4,...)

Rightsxvote = (0.2,0.31,...)

dist(xrights, xpeople)

dist(xvote, xleft)

Freedomxfreedom = (0.18,0.3,...)

Compute

Distances

Most frequent word from cluster ! root.Compute distances between cluster words and those in the tree.

Include the closest word to the tree.10

Page 11: What can Reveal 1,018 Speeches of Fidel Castro?

Outline

1 Machine Learning Framework

2 Discourse Analysis Methodology

3 Results

4 Conclusion and Perspectives

11

Page 12: What can Reveal 1,018 Speeches of Fidel Castro?

Charaudeau Discourse Analysis Methodology

Semio-pragmatic discourse analysis [Charaudeau 1995].

Based on the Aristotelian classification of rhetoric art.

Discourse Analysis

Ethos Pathos Logos

12

Page 13: What can Reveal 1,018 Speeches of Fidel Castro?

Ethos: How the politician depicts himself

Discursive Identity"Me"Personal commitment."Us"Identification with audience."Me-Spokesman"Incarnates an ideology.

Speech SituationCandidate.e.g., Political party leader.Ruler.e.g., President / Minister

Embody Truthful CharactersLeader (power).Expert (knowledge).Witness (veracity).Committed person.Charismatic person.

13

Page 14: What can Reveal 1,018 Speeches of Fidel Castro?

Pathos (1/2)

Persuade through feelings and passions.Recruitment process

Lead the audience to accept the speaker’s project willingly.

Moral Values.Social Welfare.Progress (economy, technology).

Religious belonging.Ethnic belonging.Nationalism.

Rhetoric of effectsCreate feelings and passions in

the audience.HopePrideCourage

FearHatredOutrage

14

Page 15: What can Reveal 1,018 Speeches of Fidel Castro?

Pathos (2/2)

Triadic Scenario

Disastrous scenariopeople = victim

EnemyExternal/internal

HeroSpeaker/Ideology

Fights

Causes

Solves

15

Page 16: What can Reveal 1,018 Speeches of Fidel Castro?

Logos

Convince through logical reasoning and argumentation.

Simplifying strategiesSingularizationReduce the number of ideas described.EssentializationExpress a complex idea with few words.Formula = Singularization + Essentialization.Slogans.

Argumentative strategiesSimplified causal arguments.Analogy with past events/characters.Detailed descriptions ! Veracity.

16

Page 17: What can Reveal 1,018 Speeches of Fidel Castro?

Outline

1 Machine Learning Framework

2 Discourse Analysis Methodology

3 Results

4 Conclusion and Perspectives

17

Page 18: What can Reveal 1,018 Speeches of Fidel Castro?

Cluster Model Structure

Four largest clusters: 466, 93, 70, 344⇠ 60% Number of words.⇠ 50% Vocabulary.⇠ Analysis of 595 discourses out of the 1,018.Small clusters ! Specific vocabulary or general words.

18

Page 19: What can Reveal 1,018 Speeches of Fidel Castro?

Cluster 466Political Marxist Project

Futur PlanMarxist RevolutionDefence of the project

Identification with PeopleDetails ! Veracity

Moral Values

MarxistRevolution

FuturProject

19

Page 20: What can Reveal 1,018 Speeches of Fidel Castro?

Cluster 344Cold War Triadic Scenario

EnemiesOligarchy, USA

Cold War conflicts

HeroesCommunism, Cubanarmy, people

FeelingsFear, Hatred,Outrage

HerosEnemy

Negative Scenario

20

Page 21: What can Reveal 1,018 Speeches of Fidel Castro?

Cluster 70Social welfare, audience as beneficiary

Society as actor and beneficiarySocial welfare: Education, employment, healthFeelings ! Hope, Pride

Welfare

21

Page 22: What can Reveal 1,018 Speeches of Fidel Castro?

Cluster 93Economic and Technological Progress

Progress

Economic progressInfrastructures

Needs/ObligationsFeelingsHope, Pride

EthosExpert, Authority

Details ! Veracity

22

Page 23: What can Reveal 1,018 Speeches of Fidel Castro?

Castro Discursive Strategies

Pathos (51%)Triadic scenarioProgress Welfare NationalismHope Pride Fear Heroism

Ethos (34%)Authority ExpertiseCommitment Identification

Logos (15%)Detailed descriptions !Veracity

23

Page 24: What can Reveal 1,018 Speeches of Fidel Castro?

Outline

1 Machine Learning Framework

2 Discourse Analysis Methodology

3 Results

4 Conclusion and Perspectives

24

Page 25: What can Reveal 1,018 Speeches of Fidel Castro?

Conclusion and Perspectives

Broader and more representative characterisation ofCastro’s speeches.

Study relationship with historical events.

Study influence on other politicians’ discourses.

Extension to Automatic Discourse Analysis.

25