Computing Opinions and Controversiality (Calcolare Opinioni e Controversialità)
valeriobasile.github.io/presentations/dipinfo2019.pdf


Page 1

Computing Opinions and Controversiality

Valerio Basile

[email protected]
http://valeriobasile.github.io/

Page 2

> whoami

Content-Centered Computing group
http://hatespeech.di.unito.it/

AI, NLP, Semantics, Sentiment, Hate Speech

Page 3

Sentiment Analysis a.k.a. Opinion Mining

“Given a set of evaluative text documents D that contain opinions (or sentiments) about an object, opinion mining aims to extract attributes and components of the object that have been commented on in each document d ∈ D and to determine whether the comments are positive, negative or neutral.”

Bing Liu

High demand from industry
Particular attention to social media

Page 4

Sentiment Analysis

Page 5

Sentiment Analysis

Page 6

Affective lexicons

Basile and Nissim (2013): Sentiment Analysis on Italian Tweets, WASSA 2013
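
As a rough illustration of how an affective lexicon can drive polarity classification, the sketch below sums per-word scores and maps the total to a label. It is not the method of the cited paper: the lexicon entries, the `TOY_LEXICON` name, and the `polarity` function are all hypothetical and chosen only for the example.

```python
# Toy affective lexicon: word -> polarity score.
# ASSUMPTION: these entries are invented for illustration only.
TOY_LEXICON = {
    "good": 1.0,
    "great": 1.0,
    "love": 1.0,
    "bad": -1.0,
    "awful": -1.0,
    "hate": -1.0,
}


def polarity(text, lexicon=TOY_LEXICON):
    """Label a text by summing the lexicon scores of its tokens."""
    tokens = text.lower().split()
    # Strip common punctuation so that "great," still matches "great".
    score = sum(lexicon.get(token.strip(".,!?;:"), 0.0) for token in tokens)
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"


if __name__ == "__main__":
    print(polarity("I love this great phone!"))
    print(polarity("Oh great, another delay"))
```

The second call is labelled positive because the only lexicon word it contains is "great", even though a human reader would likely take the sentence as ironic. Pure word counting has no access to that reading.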

Page 7

Affective lexicons

?

Page 8

Supervised Learning

Page 9

Supervised Learning

PhD Endang Wahyu Pamungkas, Alessandra Cignarella

Support Vector Machine, Recurrent Neural Network, ...

Page 10

Supervised Learning

Page 11

Manual annotation

● Experts
● Crowdsourcing
● Distant Supervision
● Gamification
● ...

Page 12

Manual annotation

Groningen Meaning Bank http://gmb.let.rug.nl
Sentipolc 2014-2016 http://www.di.unito.it/~tutreeb/sentipolc-evalita16/index.html
ABSITA 2018 http://sag.art.uniroma2.it/absita/
HatEval 2019 https://competitions.codalab.org/competitions/19935

Page 13

Manual annotation

Page 14

Inter-rater agreement

(or reliability)

● % agreement
● Cohen’s Kappa
● Fleiss’ Kappa
● Krippendorff's alpha
● ...

Annotation → Harmonization → Agreement → Gold standard data
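
The first two measures in the list can be sketched in a few lines for the two-annotator case. The function names and the example labels below are illustrative assumptions, and `majority_vote` is only one possible way to derive a gold label from several annotations; the slides do not say which harmonization procedure was used.

```python
from collections import Counter


def percent_agreement(labels_a, labels_b):
    """Fraction of items on which two annotators chose the same label."""
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("need two non-empty label lists of equal length")
    matches = sum(a == b for a, b in zip(labels_a, labels_b))
    return matches / len(labels_a)


def cohen_kappa(labels_a, labels_b):
    """Cohen's kappa: observed agreement corrected for chance agreement."""
    observed = percent_agreement(labels_a, labels_b)
    n = len(labels_a)
    counts_a = Counter(labels_a)
    counts_b = Counter(labels_b)
    # Chance agreement: probability that both annotators pick the same label
    # if each draws independently from their own label distribution.
    expected = sum(counts_a[label] * counts_b[label] for label in counts_a) / (n * n)
    if expected == 1.0:
        # Kappa is undefined here (both used one identical label throughout);
        # returning 1.0 is a convention chosen for this sketch.
        return 1.0
    return (observed - expected) / (1.0 - expected)


def majority_vote(item_labels):
    """Assumed gold-label rule: the most frequent label for one item."""
    # On a tie, Counter.most_common keeps the label encountered first.
    return Counter(item_labels).most_common(1)[0][0]


if __name__ == "__main__":
    a = ["pos", "pos", "neg", "neg", "pos", "neg"]
    b = ["pos", "neg", "neg", "neg", "pos", "pos"]
    print(percent_agreement(a, b), cohen_kappa(a, b))
    print(majority_vote(["pos", "neg", "pos"]))
```

In the usage example the annotators agree on 4 of 6 items, but half of that agreement is expected by chance given their label distributions, so kappa comes out much lower than the raw percentage.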

Page 15

Crowdsourcing

Page 16

Crowdsourcing

https://www.figure-eight.com/

Page 17

Gamification

https://www.wordrobe.org

Page 18

Do we agree?

Page 19

Highly Subjective Phenomena

• part of speech (syntax)
• senses, entities (semantics)
• sentiments, emotions
• irony, sarcasm
• racism, misogyny, homophobia

pragmatics

subjectivity

Page 20

Highly Subjective Phenomena

syntax

Valerio sale le scale ("Valerio climbs the stairs"; "sale" is also the Italian noun for "salt")

noun or verb?

Page 21

Highly Subjective Phenomena

polarity

Page 22

Highly Subjective Phenomena

irony

Page 23

Highly Subjective Phenomena

hate speech

Page 24

Online homophobia

Paola Franco (Master's thesis)
http://accept.arcigay.it/

Page 25

Online homophobia

Paola Franco (Master's thesis)
http://accept.arcigay.it/

Page 26

Controversiality

A measure that:
● maximizes agreement within groups
● minimizes agreement between groups

$g_1 = \{1, 2, 3\}$; $g_2 = \{4, 5, 6\}$

$(\mathrm{agreement}(g_1) + \mathrm{agreement}(g_2)) \cdot (1 - \mathrm{agreement}(g_1 \cup g_2))$

Computed for each instance

Page 27

Controversiality

$(\mathrm{agreement}(g_1) + \mathrm{agreement}(g_2)) \cdot (1 - \mathrm{agreement}(g_1 \cup g_2))$

E.g.:

(1, 1, 1) (1, 1, 1) → 0.0
(0, 0, 0) (1, 0, 0) → 0.27
(1, 0, 1) (0, 0, 0) → 0.55
(1, 1, 1) (0, 0, 0) → 1.0
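
The formula can be sketched per instance as follows. The slides do not say which agreement function or normalization is used, so this sketch assumes plain pairwise agreement (the share of annotator pairs giving the same label) and applies no rescaling. Its scores therefore differ from the example values above, although it ranks the four examples in the same order. The function names are hypothetical.

```python
from itertools import combinations


def pairwise_agreement(labels):
    """Share of annotator pairs that gave the same label to one instance.

    ASSUMPTION: stands in for the unspecified agreement() of the slides.
    """
    pairs = list(combinations(labels, 2))
    if not pairs:
        return 1.0  # a single annotator trivially agrees with itself
    return sum(a == b for a, b in pairs) / len(pairs)


def controversiality(group1, group2, agreement=pairwise_agreement):
    """(agreement(g1) + agreement(g2)) * (1 - agreement(g1 ∪ g2)), one instance.

    group1 and group2 hold the labels given by the annotators of each group.
    """
    within = agreement(group1) + agreement(group2)
    # The union of the two annotator groups is the concatenation of their labels.
    across = agreement(list(group1) + list(group2))
    return within * (1.0 - across)


if __name__ == "__main__":
    examples = [
        ((1, 1, 1), (1, 1, 1)),
        ((0, 0, 0), (1, 0, 0)),
        ((1, 0, 1), (0, 0, 0)),
        ((1, 1, 1), (0, 0, 0)),
    ]
    for g1, g2 in examples:
        print(g1, g2, round(controversiality(g1, g2), 2))
```

Passing the agreement function as a parameter keeps the sketch open to other choices, such as a chance-corrected measure, without changing the controversiality formula itself.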

Page 28

Controversiality

Gender theory
Nature vs. nurture
Schools

Page 29

Controversiality

PhD Sohail Akhtar

Tweets about Brexit

Page 30

Controversiality

● Controversiality as a feature (à la CrowdTruth)

● What if we know nothing about the annotators? (PhD Komal Florio)

● Can we predict controversiality?