sca2013 presentation: a web-based content analysis tool
DESCRIPTION
This is a presentation at SCA2013, Karlsruhe, Germany. This shows the design (architecture, database, sketch, wireframe, prototype) of a simple web-based tool that supports asynchronous collaboration among researchers when conducting content analysis on qualitative social media data.TRANSCRIPT
![Page 1: SCA2013 Presentation: A Web-Based Content Analysis Tool](https://reader034.vdocument.in/reader034/viewer/2022051616/553837d84a79596f718b46a5/html5/thumbnails/1.jpg)
A Web-Based Tool for Collaborative Social Media Data Analysis
Xin Chen, Mihaela Vorvoreanu, Krishna Madhavan{chen654, mihaela, cm}@purdue.edu
![Page 2: SCA2013 Presentation: A Web-Based Content Analysis Tool](https://reader034.vdocument.in/reader034/viewer/2022051616/553837d84a79596f718b46a5/html5/thumbnails/2.jpg)
Motivation
Social Media Discourse
![Page 3: SCA2013 Presentation: A Web-Based Content Analysis Tool](https://reader034.vdocument.in/reader034/viewer/2022051616/553837d84a79596f718b46a5/html5/thumbnails/3.jpg)
Motivation
Hidden Insights On Human Behaviors & Social Phenomenon
![Page 4: SCA2013 Presentation: A Web-Based Content Analysis Tool](https://reader034.vdocument.in/reader034/viewer/2022051616/553837d84a79596f718b46a5/html5/thumbnails/4.jpg)
Human generated textual data on social media are:
Qualitative Data
Large-scale Data
Motivation
![Page 5: SCA2013 Presentation: A Web-Based Content Analysis Tool](https://reader034.vdocument.in/reader034/viewer/2022051616/553837d84a79596f718b46a5/html5/thumbnails/5.jpg)
Human generated textual data on social media are:
Qualitative Data
Large-scale Data
requires qualitative interpretation
Motivation
![Page 6: SCA2013 Presentation: A Web-Based Content Analysis Tool](https://reader034.vdocument.in/reader034/viewer/2022051616/553837d84a79596f718b46a5/html5/thumbnails/6.jpg)
Human generated textual data on social media are:
Qualitative Data
Large-scale Data
requires qualitative interpretation
requires large-scale data mining techniques
Motivation
![Page 7: SCA2013 Presentation: A Web-Based Content Analysis Tool](https://reader034.vdocument.in/reader034/viewer/2022051616/553837d84a79596f718b46a5/html5/thumbnails/7.jpg)
Goal
To build a tool that:
![Page 8: SCA2013 Presentation: A Web-Based Content Analysis Tool](https://reader034.vdocument.in/reader034/viewer/2022051616/553837d84a79596f718b46a5/html5/thumbnails/8.jpg)
Goal
Acquire social media data.
To build a tool that:
![Page 9: SCA2013 Presentation: A Web-Based Content Analysis Tool](https://reader034.vdocument.in/reader034/viewer/2022051616/553837d84a79596f718b46a5/html5/thumbnails/9.jpg)
Goal
Acquire social media data.
Integrate qualitative content analysis and data mining techniques to analyze textual data on social media.
To build a tool that:
![Page 10: SCA2013 Presentation: A Web-Based Content Analysis Tool](https://reader034.vdocument.in/reader034/viewer/2022051616/553837d84a79596f718b46a5/html5/thumbnails/10.jpg)
Goal
Acquire social media data.
Integrate qualitative content analysis and data mining techniques to analyze textual data on social media.
Support asynchronous collaboration among researchers.
To build a tool that:
![Page 11: SCA2013 Presentation: A Web-Based Content Analysis Tool](https://reader034.vdocument.in/reader034/viewer/2022051616/553837d84a79596f718b46a5/html5/thumbnails/11.jpg)
Social Media Analytics & Monitoring Tools
Existing Tools
Focus on marketing
Do not usually incorporate human input
![Page 12: SCA2013 Presentation: A Web-Based Content Analysis Tool](https://reader034.vdocument.in/reader034/viewer/2022051616/553837d84a79596f718b46a5/html5/thumbnails/12.jpg)
Qualitative Analysis Tools
Existing Tools
Complicated to use
Expensive
Do not acquire social media data
![Page 13: SCA2013 Presentation: A Web-Based Content Analysis Tool](https://reader034.vdocument.in/reader034/viewer/2022051616/553837d84a79596f718b46a5/html5/thumbnails/13.jpg)
Social Media Content
API or Web Crawler
Researchers
Computation Server
Web UI
Web Server
SWAB (Social Web Analytics Buddy)
1
2
3
4
5
Data Server
![Page 14: SCA2013 Presentation: A Web-Based Content Analysis Tool](https://reader034.vdocument.in/reader034/viewer/2022051616/553837d84a79596f718b46a5/html5/thumbnails/14.jpg)
Social Media Content
API or Web Crawler
Researchers
SWAB (Social Web Analytics Buddy)
1
2
3
4
5
Data Server
Computation Server
Web UI
Web Server
![Page 15: SCA2013 Presentation: A Web-Based Content Analysis Tool](https://reader034.vdocument.in/reader034/viewer/2022051616/553837d84a79596f718b46a5/html5/thumbnails/15.jpg)
Social Media Content
API or Web Crawler
Researchers
SWAB (Social Web Analytics Buddy)
1
2
3
4
5
Twitter search API
Data Server
Computation Server
Web UI
Web Server
![Page 16: SCA2013 Presentation: A Web-Based Content Analysis Tool](https://reader034.vdocument.in/reader034/viewer/2022051616/553837d84a79596f718b46a5/html5/thumbnails/16.jpg)
Social Media Content
Data Server
API or Web Crawler
Researchers
SWAB (Social Web Analytics Buddy)
1
2
3
4
5
Computation Server
Web UI
Web Server
MySQL
![Page 17: SCA2013 Presentation: A Web-Based Content Analysis Tool](https://reader034.vdocument.in/reader034/viewer/2022051616/553837d84a79596f718b46a5/html5/thumbnails/17.jpg)
Social Media Content
Data Server
API or Web Crawler
Researchers
SWAB (Social Web Analytics Buddy)
1
2
3
4
5
Computation Server
Web UI
Web Server
MySQL
![Page 18: SCA2013 Presentation: A Web-Based Content Analysis Tool](https://reader034.vdocument.in/reader034/viewer/2022051616/553837d84a79596f718b46a5/html5/thumbnails/18.jpg)
Social Media Content
Data Server
API or Web Crawler
Researchers
SWAB (Social Web Analytics Buddy)
1
2
3
4
5
Computation Server
Web UI
Web Server
MySQL
![Page 19: SCA2013 Presentation: A Web-Based Content Analysis Tool](https://reader034.vdocument.in/reader034/viewer/2022051616/553837d84a79596f718b46a5/html5/thumbnails/19.jpg)
Social Media Content
Data Server
API or Web Crawler
Researchers
SWAB (Social Web Analytics Buddy)
1
2
3
4
5
Computation Server
Web UI
Web Server
MySQLClassification & Detection Modeling
Inter-rater Agreement Computation
![Page 20: SCA2013 Presentation: A Web-Based Content Analysis Tool](https://reader034.vdocument.in/reader034/viewer/2022051616/553837d84a79596f718b46a5/html5/thumbnails/20.jpg)
Social Media Content
Data Server
API or Web Crawler
Researchers
SWAB (Social Web Analytics Buddy)
1
2
3
4
5
Computation Server
Web UI
Web Server
MySQL
![Page 21: SCA2013 Presentation: A Web-Based Content Analysis Tool](https://reader034.vdocument.in/reader034/viewer/2022051616/553837d84a79596f718b46a5/html5/thumbnails/21.jpg)
Social Media Content
Data Server
API or Web Crawler
Researchers
SWAB (Social Web Analytics Buddy)
1
2
3
4
5
Computation Server
Web UI
Web Server
MySQL
Sample tweets for researchers to analyze
Send results back to data server
Communicate with computation server
![Page 22: SCA2013 Presentation: A Web-Based Content Analysis Tool](https://reader034.vdocument.in/reader034/viewer/2022051616/553837d84a79596f718b46a5/html5/thumbnails/22.jpg)
Social Media Content
Data Server
API or Web Crawler
Researchers
SWAB (Social Web Analytics Buddy)
1
2
3
4
5
Computation Server
Web UI
Web Server
MySQL
![Page 23: SCA2013 Presentation: A Web-Based Content Analysis Tool](https://reader034.vdocument.in/reader034/viewer/2022051616/553837d84a79596f718b46a5/html5/thumbnails/23.jpg)
Web UI DesignSketch
![Page 24: SCA2013 Presentation: A Web-Based Content Analysis Tool](https://reader034.vdocument.in/reader034/viewer/2022051616/553837d84a79596f718b46a5/html5/thumbnails/24.jpg)
Web UI DesignWireframes: the “Overview” tab
Project title
![Page 25: SCA2013 Presentation: A Web-Based Content Analysis Tool](https://reader034.vdocument.in/reader034/viewer/2022051616/553837d84a79596f718b46a5/html5/thumbnails/25.jpg)
Web UI Design
User account
Wireframes: the “Overview” tab
![Page 26: SCA2013 Presentation: A Web-Based Content Analysis Tool](https://reader034.vdocument.in/reader034/viewer/2022051616/553837d84a79596f718b46a5/html5/thumbnails/26.jpg)
Web UI Design
Collaborator List
Wireframes: the “Overview” tab
![Page 27: SCA2013 Presentation: A Web-Based Content Analysis Tool](https://reader034.vdocument.in/reader034/viewer/2022051616/553837d84a79596f718b46a5/html5/thumbnails/27.jpg)
Web UI Design
Datasource
Wireframes: the “Overview” tab
![Page 28: SCA2013 Presentation: A Web-Based Content Analysis Tool](https://reader034.vdocument.in/reader034/viewer/2022051616/553837d84a79596f718b46a5/html5/thumbnails/28.jpg)
Web UI Design
Multiple datasets streamed using different criteria.
Wireframes: the “Overview” tab
![Page 29: SCA2013 Presentation: A Web-Based Content Analysis Tool](https://reader034.vdocument.in/reader034/viewer/2022051616/553837d84a79596f718b46a5/html5/thumbnails/29.jpg)
Web UI Design
Export data and graphs.
Wireframes: the “Overview” tab
![Page 30: SCA2013 Presentation: A Web-Based Content Analysis Tool](https://reader034.vdocument.in/reader034/viewer/2022051616/553837d84a79596f718b46a5/html5/thumbnails/30.jpg)
Web UI Design
Multiple visualizations and charts to provide data overview.
Wireframes: the “Overview” tab
![Page 31: SCA2013 Presentation: A Web-Based Content Analysis Tool](https://reader034.vdocument.in/reader034/viewer/2022051616/553837d84a79596f718b46a5/html5/thumbnails/31.jpg)
Web UI DesignWireframes: the “Analyze” tab
Themes emerged from exploring the data.
![Page 32: SCA2013 Presentation: A Web-Based Content Analysis Tool](https://reader034.vdocument.in/reader034/viewer/2022051616/553837d84a79596f718b46a5/html5/thumbnails/32.jpg)
Web UI DesignWireframes: the “Analyze” tab
Choose Sample size.
![Page 33: SCA2013 Presentation: A Web-Based Content Analysis Tool](https://reader034.vdocument.in/reader034/viewer/2022051616/553837d84a79596f718b46a5/html5/thumbnails/33.jpg)
Web UI DesignWireframes: the “Analyze” tab
Analyze tweets and write comment.
![Page 34: SCA2013 Presentation: A Web-Based Content Analysis Tool](https://reader034.vdocument.in/reader034/viewer/2022051616/553837d84a79596f718b46a5/html5/thumbnails/34.jpg)
Web UI DesignWireframes: the “Result” tab
All researchers’ results are aggregated in the background. Collaboration happens asynchronously. Reliability measures are computed.
![Page 35: SCA2013 Presentation: A Web-Based Content Analysis Tool](https://reader034.vdocument.in/reader034/viewer/2022051616/553837d84a79596f718b46a5/html5/thumbnails/35.jpg)
Web UI DesignWireframes: the “Result” tab
Classification models can be trained based on the qualitative input.
![Page 36: SCA2013 Presentation: A Web-Based Content Analysis Tool](https://reader034.vdocument.in/reader034/viewer/2022051616/553837d84a79596f718b46a5/html5/thumbnails/36.jpg)
Web UI DesignWireframes: the “Model Application” tab
Apply the trained model to a new dataset to detect similar data as in dataset1 from dataset2.
![Page 37: SCA2013 Presentation: A Web-Based Content Analysis Tool](https://reader034.vdocument.in/reader034/viewer/2022051616/553837d84a79596f718b46a5/html5/thumbnails/37.jpg)
Web UI DesignWireframes: the “Model Application” tab
Choose how to explore the detected data from the new dataset: view list of tweets, user accounts, or geomap.
![Page 38: SCA2013 Presentation: A Web-Based Content Analysis Tool](https://reader034.vdocument.in/reader034/viewer/2022051616/553837d84a79596f718b46a5/html5/thumbnails/38.jpg)
Simple Working Prototype of “Analyze” Tab Feature
!Demo
![Page 39: SCA2013 Presentation: A Web-Based Content Analysis Tool](https://reader034.vdocument.in/reader034/viewer/2022051616/553837d84a79596f718b46a5/html5/thumbnails/39.jpg)
Future Work
![Page 40: SCA2013 Presentation: A Web-Based Content Analysis Tool](https://reader034.vdocument.in/reader034/viewer/2022051616/553837d84a79596f718b46a5/html5/thumbnails/40.jpg)
Future Work
Design features to better support data exploration.
![Page 41: SCA2013 Presentation: A Web-Based Content Analysis Tool](https://reader034.vdocument.in/reader034/viewer/2022051616/553837d84a79596f718b46a5/html5/thumbnails/41.jpg)
Future Work
Design features to better support data exploration.
Explore NoSQL database to handle large datasets.
![Page 42: SCA2013 Presentation: A Web-Based Content Analysis Tool](https://reader034.vdocument.in/reader034/viewer/2022051616/553837d84a79596f718b46a5/html5/thumbnails/42.jpg)
Future Work
Design features to better support data exploration.
Explore NoSQL database to handle large datasets.
Implement more sophisticated data mining and visualization features.
![Page 43: SCA2013 Presentation: A Web-Based Content Analysis Tool](https://reader034.vdocument.in/reader034/viewer/2022051616/553837d84a79596f718b46a5/html5/thumbnails/43.jpg)
Thank you!
Q & A