usfd at semeval-2016 - stance detection on twitter with autoencoders

Isabelle Augenstein, Andreas Vlachos, Kalina Bontcheva [email protected], {a.vlachos | k.bontcheva}@sheffield.ac.uk USFD at SemEval-2016 Task 6: Any-Target Stance Detection on Twitter with Autoencoders Stance Detection Subtask B Classify attitude of tweet towards target as “favor”, “against”, “none” Tweet: “No more Hillary Clinton” Target: Donald Trump Stance: FAVOR Subtask A training targets: Climate Change is a Real Concern, Feminist Movement, Atheism, Legalization of Abortion, Hillary Clinton Subtask B testing target: Donald Trump Challenges • Labelled data not available for the test target • Manual labelling of training data not allowed • Target does not always appear in tweet Feature Extraction • Aut-twe: Tweet auto-encoded tweet,100d feature vector • targetInTweet: is (shortened) target contained in tweet • Good indicator for non-neutral stance • Other features tested (not used for final run): WordNet- Affect gazetteers, emoticon detection • Baselines: bag of word, word2vec (trained on same data as autoencoder) Results Model Comparison (Hillary Clinton, dev) Model Comparison (Donald Trump, test) 0 0.05 0.1 0.15 0.2 0.25 0.3 0.35 0.4 0.45 Macro F1 BoW BoW+inTwe Word2Vec Aut-twe Aut-twe+inTwe Conclusions • It is important to detect if the target is mentioned in the tweet • Hillary Clinton: 0.4538 F1 (inTwe) vs 0.3243 F1 (not inTwe) • Donald Trump: 0.3745 F1 (inTwe) vs 0.2377 F1 (not inTwe) • Autoencoder can help to detect stance towards unseen targets • Developing method for new targets without labelled training data is challenging - discrepancies between what works for dev vs. test set • Future work: better incorporate the target for stance detection Acknowledgements This work was partially supported by the European Union, grant agreement No. 611233 PHEME (http://www.pheme.eu ) Data • 5 628 labelled train tweets about Subtask A targets • 1 278 about Hillary Clinton, used for dev • 278 013 unlabelled Donald Trump tweets • 395 212 collected unlabelled tweets about all targets • Keywords: hillary, clinton, trump, climate, femini, aborti • 707 Donald Trump test tweets Preprocessing • Phrase detection: Train phrase detection model on unlabelled +labelled tweets, e.g. “donald”, “trump” → “donald trump” Autoencoder • Bag-of-word autoencoder, using 50 000 most frequent words • trained on unlabelled+labelled tweets • Input vector: dimensionality 50 000. For each word in vocabulary, does tweet contain the word or not • One hidden layer (size 100), output size 100 • Trained encoder is applied to labelled train and test data to obtain 100d features, decoder not used Model Macro F1 Majority class (oﬃcial) 0.2972 SVM n-grams (oﬃcial) 0.2843 BoW 0.3453 Aut-twe (submi6ed) 0.3307 References • Code: https://github.com/sheffieldnlp/stance-semeval2016 • Phrases: Mikolov et al. (2013). Distributed Representations of Words and Phrases and Their Compositionality. NIPS. Tweets “No more Hillary Clinton”, “Donald Trump”, “FAVOR” Preprocessing: [“No”, “more”, “Hillary_Clinton”] Autoencoder Training [america: 0, …, Hillary_Clinton: 1] 50 000d input [0, 0, …, 1] 100d hidden layer [0, 1, …, 1] 100d output layer Feature Extraction Autoencoder inTwe [0, 1, …, 1] 0 Logistic Regression Model Predictions “#voteTrump (…)”, “Donald Trump”, “FAVOR” “youre fired (…)” “Donald Trump”, “AGAINST”

Upload: isabelle-augenstein

Post on 12-Apr-2017

363 views

Category:

Technology

2 download

Report

Download

Embed Size (px):

TRANSCRIPT

Page 1: USFD at SemEval-2016 - Stance Detection on Twitter with Autoencoders

Isabelle Augenstein, Andreas Vlachos, Kalina Bontcheva [email protected], {a.vlachos | k.bontcheva}@sheffield.ac.uk

USFD at SemEval-2016 Task 6: Any-Target Stance Detection on Twitter with Autoencoders

Stance Detection Subtask B Classify attitude of tweet towards target as “favor”, “against”, “none”

Tweet: “No more Hillary Clinton” Target: Donald Trump Stance: FAVOR

Subtask A training targets: Climate Change is a Real Concern, Feminist Movement, Atheism, Legalization of Abortion, Hillary Clinton

Subtask B testing target: Donald Trump

Challenges •  Labelled data not available for the test target •  Manual labelling of training data not allowed •  Target does not always appear in tweet

Feature Extraction •  Aut-twe: Tweet auto-encoded tweet,100d feature vector •  targetInTweet: is (shortened) target contained in tweet

•  Good indicator for non-neutral stance •  Other features tested (not used for final run): WordNet-

Affect gazetteers, emoticon detection •  Baselines: bag of word, word2vec (trained on same data

as autoencoder)

Results Model Comparison (Hillary Clinton, dev)

Model Comparison (Donald Trump, test)

0.05

0.1

0.15

0.2

0.25

0.3

0.35

0.4

0.45

MacroF1

BoWBoW+inTweWord2VecAut-tweAut-twe+inTweConclusions

•  It is important to detect if the target is mentioned in the tweet •  Hillary Clinton: 0.4538 F1 (inTwe) vs 0.3243 F1 (not inTwe) •  Donald Trump: 0.3745 F1 (inTwe) vs 0.2377 F1 (not inTwe)

•  Autoencoder can help to detect stance towards unseen targets •  Developing method for new targets without labelled training

data is challenging - discrepancies between what works for dev vs. test set

•  Future work: better incorporate the target for stance detection Acknowledgements

This work was partially supported by the European Union, grant agreement No. 611233 PHEME (http://www.pheme.eu)

Data •  5 628 labelled train tweets about Subtask A

targets •  1 278 about Hillary Clinton, used for dev

•  278 013 unlabelled Donald Trump tweets •  395 212 collected unlabelled tweets about all

targets •  Keywords: hillary, clinton, trump, climate,

femini, aborti •  707 Donald Trump test tweets

Preprocessing •  Phrase detection: Train phrase detection model on unlabelled

+labelled tweets, e.g. “donald”, “trump” → “donald trump”

Autoencoder •  Bag-of-word autoencoder, using 50 000 most

frequent words •  trained on unlabelled+labelled tweets •  Input vector: dimensionality 50 000. For each word

in vocabulary, does tweet contain the word or not •  One hidden layer (size 100), output size 100 •  Trained encoder is applied to labelled train and

test data to obtain 100d features, decoder not used

Model MacroF1Majorityclass(official) 0.2972SVMn-grams(official) 0.2843BoW 0.3453Aut-twe(submi6ed) 0.3307

References •  Code: https://github.com/sheffieldnlp/stance-semeval2016 •  Phrases: Mikolov et al. (2013). Distributed Representations

of Words and Phrases and Their Compositionality. NIPS.

Tweets

“No more Hillary Clinton”, “Donald Trump”, “FAVOR” Preprocessing: [“No”, “more”, “Hillary_Clinton”]

Autoencoder Training

[america: 0, …, Hillary_Clinton: 1] 50 000d input [0, 0, …, 1] 100d hidden layer [0, 1, …, 1] 100d output layer

Feature Extraction

Autoencoder inTwe [0, 1, …, 1] 0

Logistic Regression

Model

Predictions

“#voteTrump (…)”, “Donald Trump”, “FAVOR” “youre fired (…)” “Donald Trump”, “AGAINST”

USFD Manual

SemEval-2018 Task 1: Affect in Tweets

3. Research Planning and Supervision - Eunice Lawton (USFD)

SemEval - Aspect Based Sentiment Analysis

Topological Autoencoders - arXiv

SemEval-2010 Task 8: Multi-Way Classiﬁcation of Semantic ...iris/Pubs/sew09_relations.pdf · SemEval-2010 Task 8: Multi-Way Classiﬁcation of Semantic Relations Between Pairs of

Ladder Variational Autoencoders

Lecture 10: Autoencoders

Winner-Take-All Autoencoders

Autoencoders and Generative Adversarial Nets€¦ · Autoencoders and Generative Adversarial Nets Chapter 1 [ 5 ] Fixing corrupted data with denoising autoencoders The autoencoders

Semeval Deep Learning In Semantic Similarity

Autoencoders - Purdue University

SemEval-2015 Task 3: Answer Selection in Community Question

Multiresolution Convolutional Autoencoders

SemEval-2019 Task 10: Math Question Answering

Stacked Denoising Autoencoders: Learning Useful

Variational Autoencoders

SemEval-2015 Task 3: Answer Selection in …groups.csail.mit.edu/sls/publications/2015/Glass_SemEval...Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval

Autoencoders and Representation Learning

Deep Trajectory Clustering with Autoencoders

Proceedings of SemEval-2016 · PDF fileTask 8: Meaning Representation Parsing ... Haris Papageorgiou, ... Proceedings of SemEval-2016

SemEval 2016 Task 5 Aspect Based Sentiment Analysis (ABSA

SemEval-2021 Task 4: Reading Comprehension of Abstract Meaning

AutoEncoders& Kernels

Variational Autoencoders for Collaborative Filtering

Autoencoders - University at Buffalo

Autoencoders for image_classification

SemEval 2016 Task 5 Aspect Based Sentiment Analysis …alt.qcri.org/semeval2016/task5/data/uploads/absa2016_annotation... · 1 SemEval 2016 Task 5 Aspect Based Sentiment Analysis

Spatial Role Labeling Task for SemEval 2012

Intro to Deep learning - Autoencoders

USFD activity in WP11 NND Internal Review/USFD-CISTIB/February 2015 Method developed for DXA and delivered in D11.1. Volumetric mesh shape and intensity

USFD - New

Introduction to variational autoencoders · Introduction to variational autoencoders Abstract Variational autoencoders are interesting generative models, which combine ideas from

SemEval-2012 Task 2: Measuring Degrees of Relational …jurgens.people.si.umich.edu/docs/semeval-2012-task-2...SemEval-2012 Task 2: Measuring Degrees of Relational Similarity David

Proceedings of the 11th International Workshop on Semantic ...Welcome to SemEval-2017 The Semantic Evaluation (SemEval) series of workshops focuses on the evaluation and comparison