multiconvis: a visual text analytics system for exploring ... · •the audience is rising...

27
MultiConVis: A Visual Text Analytics System for Exploring a Collection of Online Conversations Enamul Hoque, Giuseppe Carenini {enamul, carenini}@cs.ubc.ca NLP group @ UBC Department of Computer Science University of British Columbia

Upload: others

Post on 09-Oct-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: MultiConVis: A Visual Text Analytics System for Exploring ... · •The audience is rising exponentially ... Bottom-up approach: 10 Collection-level topics ... Generate topics for

MultiConVis: A Visual Text Analytics System for Exploring a Collection of Online Conversations

Enamul Hoque, Giuseppe Carenini{enamul, carenini}@cs.ubc.ca

NLP group @ UBC

Department of Computer ScienceUniversity of British Columbia

Page 2: MultiConVis: A Visual Text Analytics System for Exploring ... · •The audience is rising exponentially ... Bottom-up approach: 10 Collection-level topics ... Generate topics for

Rise of Text Conversations

• People engage in asynchronous conversations frequently

• e.g., blogs, forums.

Blogs:

• More than 100 millions of blogs

• The audience is rising exponentially

• Many different categories: politics, technology, business, sports,…2

Page 3: MultiConVis: A Visual Text Analytics System for Exploring ... · •The audience is rising exponentially ... Bottom-up approach: 10 Collection-level topics ... Generate topics for

Problem Scenario

• Lot of articles and comments were posted on Macrumors.

• John is interested about buying iPhone6.

• He decides to explore blogs about this issue to verify whether the bending issue is serious.

3

Page 4: MultiConVis: A Visual Text Analytics System for Exploring ... · •The audience is rising exponentially ... Bottom-up approach: 10 Collection-level topics ... Generate topics for

Problem Scenario

4

4

Existing Interfaces•Lack of high-level abstraction• Only show conversations/comments

as paginated lists ordered by recency

•Too many conversations•Too many comments=> Information Overload

Users• Focus on most recent

conversations/comments• Generate short responses• Leave conversations prematurely

Page 5: MultiConVis: A Visual Text Analytics System for Exploring ... · •The audience is rising exponentially ... Bottom-up approach: 10 Collection-level topics ... Generate topics for

Our Solution

tightly integrate text analysis and interactive visualization to support users in exploring collection of online conversations.

NLPInteractive visualization

5

Page 6: MultiConVis: A Visual Text Analytics System for Exploring ... · •The audience is rising exponentially ... Bottom-up approach: 10 Collection-level topics ... Generate topics for

ConVis: Exploring a Long Conversation

Conversation Overview

Topics Authors Conversation view

6

highly negative highly positive

comment length

Enamul Hoque and Giuseppe Carenini (EuroVis 2014, IUI 2015).

Page 7: MultiConVis: A Visual Text Analytics System for Exploring ... · •The audience is rising exponentially ... Bottom-up approach: 10 Collection-level topics ... Generate topics for

MultiConVis: Exploring a Collection of Conversations• Large number of topics-> organize topics into hierarchy• Designed on top of ConVis: switch from exploring a collection of conversations to a single

conversation

7

Page 8: MultiConVis: A Visual Text Analytics System for Exploring ... · •The audience is rising exponentially ... Bottom-up approach: 10 Collection-level topics ... Generate topics for

Contributions

• Hierarchical topic modeling method• organizes large set of topics from multiple

conversations

• User-centered design of MultiConVis.• multi-scale exploration of a collection of

conversations

• Evaluation of MultiConVis :• user performance and subjective opinions

compared to a traditional interface

Page 9: MultiConVis: A Visual Text Analytics System for Exploring ... · •The audience is rising exponentially ... Bottom-up approach: 10 Collection-level topics ... Generate topics for

Contributions

• Hierarchical topic modeling method• organizes large set of topics from multiple

conversations

• User-centered design of MultiConVis.• multi-scale exploration of a collection of

conversations

• Evaluation of MultiConVis :• user performance and subjective opinions

compared to a traditional interface

Page 10: MultiConVis: A Visual Text Analytics System for Exploring ... · •The audience is rising exponentially ... Bottom-up approach: 10 Collection-level topics ... Generate topics for

Topic Hierarchy Generation for Multiple Conversations

Bottom-up approach:

10

Collection-level topics

Conversation C1 Conversation Ci Conversation Cn… …

… …T1

TiTn

Generate topics for each conversationTaking conversational features into account(Joty et al., 2013)

The sets of topics {T1, Ti, Tn} are clustered into a hierarchical topic structure

1

2

Page 11: MultiConVis: A Visual Text Analytics System for Exploring ... · •The audience is rising exponentially ... Bottom-up approach: 10 Collection-level topics ... Generate topics for

1) Create a weighted undirected graph:

Nodes: Topics from conversations

Edge weight w(x,y): Similarity between two topics x and y

Sum of the pairwise similarity between their sentences

Smaller iPhoneStructural parts

Topic Hierarchy Generation for Multiple Conversations

Apple customer care

Thin metal

Apple responses

11

� (�,�)

Page 12: MultiConVis: A Visual Text Analytics System for Exploring ... · •The audience is rising exponentially ... Bottom-up approach: 10 Collection-level topics ... Generate topics for

1) Create a weighted undirected graph:

2) Apply Graph based clustering• Normalized cut criteria (Shi & Malik, 2000)

Num. of topics:

Maximize:

3) Label each cluster

Smaller iPhone

Structural parts

Apple customer care

Thin metal

Apple responsesCustomer care

12

Structural issues

Topic Hierarchy Generation for Multiple Conversations

� (�,�)

�� � = �∑ � �,��∈��,�∈��

∑ � �,��∈�,�∈�

− (∑ �(�,�)�∈��,�∈�

∑ �(�,�)�∈�,�∈�

)�

���

(Newman and Girvan, 2004)

Page 13: MultiConVis: A Visual Text Analytics System for Exploring ... · •The audience is rising exponentially ... Bottom-up approach: 10 Collection-level topics ... Generate topics for

Contributions

• Hierarchical topic modeling method• organizes large set of topics from multiple

conversations

• User-centered design of MultiConVis.• multi-scale exploration of a collection of

conversations

• Evaluation of MultiConVis :• user performance and subjective opinions

compared to a traditional interface

Page 14: MultiConVis: A Visual Text Analytics System for Exploring ... · •The audience is rising exponentially ... Bottom-up approach: 10 Collection-level topics ... Generate topics for

User Requirements Analysis

14

Why and how people explore a collection of conversations?

• Information seeking• Fact checking• Guidance seeking• Keep track of arguments and evidences

• When aspect: Find out what are people thinking or feeling about X over time”

• Have fun and enjoyment

(Hearst 08)

Topics

Sentiment

Time

Authors

Page 15: MultiConVis: A Visual Text Analytics System for Exploring ... · •The audience is rising exponentially ... Bottom-up approach: 10 Collection-level topics ... Generate topics for

User Requirements Analysis

15

Why and how people explore a collection of conversations?

•Variety seeking behaviour:•Read various sub-topics of a topic

•Skimming behaviour: Explore vs. focused reading

•Switching between multiple-levels of granularity:

Various levels

All Conversations

Subset of relevant Conversations

One Conversation

-> Comments

Page 16: MultiConVis: A Visual Text Analytics System for Exploring ... · •The audience is rising exponentially ... Bottom-up approach: 10 Collection-level topics ... Generate topics for

Data Abstractions

Levels

FacetsCollection of Conversations One Conversation

TopicsHierarchy with all topics from all conversations

List of topics

Time- Start day/time

- Volume of comments over timecomments are ordered chronologically

Sentiment

- Sentiment distribution for each

conversation

- Sentiment evolution over time

for each conversation

Sentiment distribution for each comment

AuthorsNumber of authors for each

conversation List of authors

16

Page 17: MultiConVis: A Visual Text Analytics System for Exploring ... · •The audience is rising exponentially ... Bottom-up approach: 10 Collection-level topics ... Generate topics for

Topic hierarchy

Visual Encoding: Set of Conversations

Conversation List

TimelineSearch

17

Page 18: MultiConVis: A Visual Text Analytics System for Exploring ... · •The audience is rising exponentially ... Bottom-up approach: 10 Collection-level topics ... Generate topics for

Visual Encoding: Set of Conversations

Conversation List

Timeline

• Topic hierarchy- node labels are more important,

- Links are less important,

- Indented tree representation: compact

- Can show 50 nodes without vertical scrolling, sufficient for most datasets

- font size: How much this topic has been discussed

18

Page 19: MultiConVis: A Visual Text Analytics System for Exploring ... · •The audience is rising exponentially ... Bottom-up approach: 10 Collection-level topics ... Generate topics for

Conversation List

Sentiment distribution

Title Text snippetCount (topics) Count (authors)

Volume of comments over time

Visual Encoding: Set of Conversations

19

Information scent

Page 20: MultiConVis: A Visual Text Analytics System for Exploring ... · •The audience is rising exponentially ... Bottom-up approach: 10 Collection-level topics ... Generate topics for

Video Demo

20

Page 21: MultiConVis: A Visual Text Analytics System for Exploring ... · •The audience is rising exponentially ... Bottom-up approach: 10 Collection-level topics ... Generate topics for

Contributions

• Hierarchical topic modeling method• organizes large set of topics from multiple

conversations

• User-centered design of MultiConVis.• multi-scale exploration of a collection of

conversations

• Evaluation of MultiConVis :• user performance and subjective opinions

compared to a traditional interface

Page 22: MultiConVis: A Visual Text Analytics System for Exploring ... · •The audience is rising exponentially ... Bottom-up approach: 10 Collection-level topics ... Generate topics for

User Evaluation

Case studies:• Participants explored the datasets according to their information needs

• Regular blog reader: iPhone bending• Journalist: ObamaCare health reform• Business analyst: iWatch release

• In follow-up interviews: topic hierarchy was extremely useful

Laboratory study:• Compare with a traditional interface • Task: Explore the given set of conversations, write a summary of major

keypoints

22

Page 23: MultiConVis: A Visual Text Analytics System for Exploring ... · •The audience is rising exponentially ... Bottom-up approach: 10 Collection-level topics ... Generate topics for

Evaluation: Lab Study

• 16 subjects (aged 18-37, 6 females)

•Within subjects

Traditional interface

MultiConVis

23

Page 24: MultiConVis: A Visual Text Analytics System for Exploring ... · •The audience is rising exponentially ... Bottom-up approach: 10 Collection-level topics ... Generate topics for

User Study: Selected Results

• Time-to-task completion: No significant difference

• Subjective ratings:

• Preference:- MultiConVis (75%): topic organization, visual overview of conversations

- Traditional interface (25%): simplicity and familiarity

24

0 1 2 3 4 5

Usefulness

Ease of use

Enjoyable

Find major points

Find more insightful comments

Write a more informative summary

MultiConVis Traditional Interface

Page 25: MultiConVis: A Visual Text Analytics System for Exploring ... · •The audience is rising exponentially ... Bottom-up approach: 10 Collection-level topics ... Generate topics for

Conclusions

1) Hierarchical topic modeling for a collection of online conversations• consider unique features of conversations.

2) Design of MultiConVis.• Multi-scales exploration of a collection of conversation

• Consistency of encoding among various scales

3) Evaluation• MultiConVis was preferred by majority of participants

• Assessment of different interface features

25

Page 26: MultiConVis: A Visual Text Analytics System for Exploring ... · •The audience is rising exponentially ... Bottom-up approach: 10 Collection-level topics ... Generate topics for

Future Work

• Interactive topic hierarchy revisions• Allow user to modify topic hierarchy

• Apply and tailor to specific conversational genres• Community question answering forums

• MOOC forums

• ….

• Online longitudinal study• For ecologically validity

26

Page 27: MultiConVis: A Visual Text Analytics System for Exploring ... · •The audience is rising exponentially ... Bottom-up approach: 10 Collection-level topics ... Generate topics for

For More Information…www.cs.ubc.ca/cs-research/lci/research-groups/natural-language-processing/

27

Raymond T. NgTamara Munzner

Thanks: