online debate summarization using topic directed sentiment analysis

Post on 01-Jul-2015

283 Views

Category:

Data & Analytics

2 Downloads

Preview:

Click to see full reader

DESCRIPTION

Online debate summarization using topic directed sentiment analysis @ WISDOM'13

TRANSCRIPT

+

Online Debate Summarization Using Topic Directed Sentiment Analysis

2014/10/31 (Fri.)�Chang Wei-Yuan @ MakeLab Group Meeting

Sarvesh Ranade, Jayant Gupta, Vasudeva Varma, Radhika Mamidi �WISDOM ‘13

+Outline

n Introduction �

n Method �

n Experiment �

n Conclusion �

n Thought

2

+Outline

n Introduction �

n Method �

n Experiment �

n Conclusion �

n Thought

3

+ Introduction 4

+ Introduction

n Online debate forum�n take a stance and argue debate topics �n dynamic and increase rapidly�

n This paper aims to summarize online debates. �n extracting highly topic relevant �n sentiment rich sentences �

n Effective opinion summarization without going through the entire debate.

5

+Outline

n Introduction �

n Method �

n Experiment �

n Conclusion �

n Thought

6

+Method

n Extractive summaries are generated by ranking the Dialogue Acts (DAs) from the original documents. �n DA is a smallest unit of debate �

7

+ 8

Score

Score

Score

Score

Score

+

n Most highly ranked DAs are chosen until summary length constraint is satisfied. �

n Scores of DAs n 

9

Equation.(Scores of DAs)�

where λ is weighted, s is a DA of the Document, D is the Document.

+

Feature Category Feature Names

Topic Relevance Topic Directed Sentiment Score Topic Co-occurrence

Document Relevance tf-idf Sentiment Score

Sentiment Relevance

Number of Sentiment Words Sentiment Strength

Context Relevance

Sentence position Sentence length

10

n Argument Structure Examples

+Word Sentiment Score �

n Word Sentiment Score �n Parsing dependency1 parse of the DAs. �n Calculating word sentiment score2. �n Updating word sentiment score. �

n  good student, great warrior���1Stanford dependency parse http://nlp.stanford.edu:8080/parser/ �2Sentiment lexicon SentiWordNet http://sentiwordnet.isti.cnr.it/�

11

+Word Sentiment Score �

n Parsing dependency parse of the DAs. �n “ A large company needs a sustainable

business model. ”

12

+Word Sentiment Score �

n Calculating word sentiment score. �

13

+0

+0

+0

+0

+0 +1 +0 +1

+Word Sentiment Score �

n Updating word sentiment score. �

14

+0

+0

+0 +1 +0 +1

+1 +1

+Extended Targets

n Extended Targets (ET)�n Extended targets are the entities closely

related to debate topics. �n To extract the extended targets, we capture

named entities (NE) from Wikipedia page of the debate topic.

15

+Topic Relevance

n Topic Directed Sentiment Score

16

Equation.(Topic Directed Sentiment Score)�

where w is a word in DA, ET() is Extended Targets.

+Topic Relevance

n Topic Co-occurrence �

17

Equation.(Topic Co-occurrence Score)�

+Document Relevance

n tf-idf Sentiment Score �

18

Equation.(Topic Co-occurrence Score)�

+Sentiment Relevance �

n Number of Sentiment Words �n  �

n Sentiment Strength

19

Equation.(Sentiment Strength Score)�

+Context Relevance �

n Sentence position �n In debates, initial and ending DAs of the

debate posts are more important than the middle ones. �

n Sentence length �n As the longer sentences tend to contain more

information, we have used sentence length as document context feature.

20

+Method

n Scores of DAs

21

Equation.(Scores of DAs)�

where λ is weighted, s is a DA of the Document, D is the Document.

+Outline

n Introduction �

n Method �

n Experiment �

n Conclusion �

n Thought

22

+Experiment

n This paper extracted 10 online debate discussions from www.convinceme.net. �n Number of users:1168 �n Number of posts :1945 �n Number of DA:23681

23

+Experiment

n Following values gave the best results as indicated by ROUGE results by grid search. �n λtopicRel = 0.3 �n λdocRel = 0.1 �n λsentiRel = 0.5 �n λconRel = 0.1

24

+Experiment

n We compared our system to the following systems: �n Max-length �n Lead �n pHAL �n tf-Idf�n OpinionSumm�

n  document similarity, topic relevance, sentiment and length

25

+Result 26

+Result 27

+Outline

n Introduction �

n Method �

n Experiment �

n Conclusion �

n Thought

28

+Conclusion

n This paper focuses on summarizing the on-line debates. �n topic directed sentiment �n topic related information �

n The results show that our system beats all these systems comprehensively.

29

+Future Work

n Sentiment scores �n word sense disambiguation �n domain specific sentiment analysis �

n Creating users' profile by capturing their intention. �

30

+Outline

n Introduction �

n Method �

n Experiment �

n Conclusion �

n Thought

31

+

Thought Debate Comparison

32

+ Introduction 33

+Thanks for listening. 2014 / 10 / 31 (Fri.) @ MakeLab Group Meeting �v123582@gmail.com�

top related