Analysis of scientific research Mario Sangiorgio Giordano Tamburrelli

Post on 21-Dec-2015

Page 1:

Analysis of scientific research

Mario Sangiorgio, Giordano Tamburrelli

Page 2:

The origin of this work

Carlo Ghezzi’s keynote: “Reflections on 40+ years of software engineering research and beyond: an insider’s view”

Analysis based on papers

Lack of tools to perform the analysis

WHAT: research topics

WHO: contributors

HOW/WHEN: trends

Page 3:

The origin of this work

Time consuming

Boring

Requires an expert

Lack of tools to perform the analysis

Page 4:

Automatic analysis

Faster

Scalable

General method

One-click (after training)

Feasible with data mining techniques

BUT still not perfect (it is not semantic-based)

Page 5:

Steps of the analysis

Identification of subtopics

Interpretation of paper content

Trend analysis (so far)

CLUSTERING

CLASSIFICATION

CLUSTERING

STATISTICS
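
The statistics behind the trend-analysis step can be as simple as counting, per year, which share of the papers falls into each topic. The sketch below assumes that every paper has already been given a topic by the classification step; the function name `topic_shares_by_year` and the sample records are hypothetical and are not data from the talk.

```python
from collections import Counter, defaultdict


def topic_shares_by_year(papers):
    """Return {year: {topic: share of that year's papers}}.

    `papers` is an iterable of (year, topic) pairs; the topic is assumed
    to come from the classification step.
    """
    per_year = defaultdict(Counter)
    for year, topic in papers:
        per_year[year][topic] += 1
    shares = {}
    for year in sorted(per_year):
        total = sum(per_year[year].values())
        shares[year] = {t: n / total for t, n in per_year[year].items()}
    return shares


# Hypothetical labelled papers, used only to exercise the function.
papers = [
    (2000, "formal methods"), (2000, "formal methods"), (2000, "models"),
    (2010, "web-services"), (2010, "formal methods"),
    (2010, "web-services"), (2010, "empirical studies"),
]
shares = topic_shares_by_year(papers)
for year, by_topic in shares.items():
    print(year, by_topic)
```

Plotting these shares over the years of a journal or conference would give trend curves of the kind the analysis aims at.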

Page 6:

Clustering

Page 7:

Clustering

Hierarchical Expectation Maximization algorithm

The tool used is Crossbow

Thanks to Gianluca Staffiero and Gabriele Valentini

Abstracts of papers from both general and specific conferences and journals
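
As a rough illustration of the clustering step, the sketch below runs a flat Expectation Maximization over a mixture of multinomials on tokenized abstracts. It is a simplification: it is not hierarchical and it is not Crossbow's implementation. The function name `em_cluster`, the seeding of each cluster with one document, and the smoothing constant `alpha` are assumptions made for the example.

```python
import math
from collections import Counter


def em_cluster(docs, k, iterations=20, alpha=1.0):
    """Soft-cluster tokenized documents; returns one weight row per doc."""
    vocab = sorted({w for d in docs for w in d})
    counts = [Counter(d) for d in docs]
    n = len(docs)
    # Assumed initialization: document j seeds cluster j, the rest are uniform.
    resp = [[1.0 / k] * k for _ in range(n)]
    for j in range(min(k, n)):
        resp[j] = [1.0 if c == j else 0.0 for c in range(k)]
    for _ in range(iterations):
        # M-step: smoothed cluster priors and per-cluster word distributions.
        prior = [(sum(r[c] for r in resp) + alpha) / (n + k * alpha)
                 for c in range(k)]
        word_logp = []
        for c in range(k):
            mass = {w: alpha for w in vocab}
            for r, cnt in zip(resp, counts):
                for w, x in cnt.items():
                    mass[w] += r[c] * x
            total = sum(mass.values())
            word_logp.append({w: math.log(m / total) for w, m in mass.items()})
        # E-step: responsibilities from log-likelihoods (log-space for stability).
        for i, cnt in enumerate(counts):
            logs = [math.log(prior[c])
                    + sum(x * word_logp[c][w] for w, x in cnt.items())
                    for c in range(k)]
            top = max(logs)
            weights = [math.exp(v - top) for v in logs]
            z = sum(weights)
            resp[i] = [v / z for v in weights]
    return resp


# Hypothetical mini-abstracts, already tokenized.
docs = [
    "model checking temporal logic".split(),
    "web service composition middleware".split(),
    "temporal logic model verification".split(),
    "middleware web service discovery".split(),
]
resp = em_cluster(docs, k=2)
labels = [max(range(2), key=row.__getitem__) for row in resp]
print(labels)
```

Each cluster is then read as a candidate subtopic; naming the subtopic still needs a person to look at its most probable words.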

Page 8:

The clustering process

Page 9:

Classification

Page 10:

Classification

Bayesian classifier

Ad hoc tool using Mallet

Analysis based on the abstract of the papers
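
To make the idea of a Bayesian classifier over abstracts concrete, here is a minimal multinomial naive Bayes in plain Python. It does not use Mallet and is not the authors' tool; the class name, the whitespace tokenization, the Laplace smoothing constant, and the training abstracts are all assumptions for illustration.

```python
import math
from collections import Counter, defaultdict


class NaiveBayesClassifier:
    """Multinomial naive Bayes over bag-of-words abstracts."""

    def __init__(self, alpha=1.0):
        self.alpha = alpha  # Laplace smoothing constant (an assumption)

    def fit(self, abstracts, topics):
        self.topic_counts = Counter(topics)
        self.word_counts = defaultdict(Counter)
        for text, topic in zip(abstracts, topics):
            self.word_counts[topic].update(text.lower().split())
        self.vocab = {w for c in self.word_counts.values() for w in c}
        return self

    def predict(self, abstract):
        # Words never seen in training carry no evidence and are skipped.
        words = [w for w in abstract.lower().split() if w in self.vocab]
        n_docs = sum(self.topic_counts.values())
        best_topic, best_score = None, -math.inf
        for topic, n in self.topic_counts.items():
            counts = self.word_counts[topic]
            denom = sum(counts.values()) + self.alpha * len(self.vocab)
            score = math.log(n / n_docs)  # log prior of the topic
            for w in words:
                score += math.log((counts[w] + self.alpha) / denom)
            if score > best_score:
                best_topic, best_score = topic, score
        return best_topic


# Hypothetical training abstracts; topic names are taken from the slides.
abstracts = [
    "model checking of temporal logic properties",
    "composition of web services in a service oriented architecture",
    "formal verification with temporal logic",
    "discovery of web services through middleware",
]
topics = ["formal methods", "web-services", "formal methods", "web-services"]
clf = NaiveBayesClassifier().fit(abstracts, topics)
print(clf.predict("temporal logic verification"))
```

The topics used for training would come from the clustering step, which is why the classifier only becomes "one-click" after training.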

Page 11:

Result evaluation

Clustering was iterated until the results were good

Classification performs well: high precision and recall values

A human expert agrees with the classifier
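
For reference, precision and recall for one topic can be computed by comparing the classifier's labels with those of a human expert. The sketch below assumes two parallel lists of labels; the function name and the sample labels are hypothetical.

```python
def precision_recall(gold, predicted, topic):
    """Precision and recall of `predicted` against `gold` for one topic."""
    pairs = list(zip(gold, predicted))
    tp = sum(1 for g, p in pairs if g == topic and p == topic)
    fp = sum(1 for g, p in pairs if g != topic and p == topic)
    fn = sum(1 for g, p in pairs if g == topic and p != topic)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall


# Hypothetical expert labels and classifier output for five papers.
gold = ["models", "models", "education", "education", "models"]
predicted = ["models", "education", "education", "education", "models"]
for topic in ("models", "education"):
    print(topic, precision_recall(gold, predicted, topic))
```

Precision drops when the classifier assigns a topic too freely; recall drops when it misses papers the expert put in that topic.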

Page 12:

Outcomes

Research analysis: trends on main conferences and journals

Tools to support research: automatic bidding

Page 13:

Some trends found

Data from IEEE Transactions on Software Engineering

Page 14:

Some trends found

Data from IEEE Transactions on Software Engineering

Page 15:

Automatic bidding

Builds upon the analysis methodologies and results

Page 16:

Bidding process

Grouping the submissions by topic

Creation of a profile for the reviewers

Matching papers’ topic with reviewers’ interests

CLASSIFICATION

CLASSIFICATION

SELECTION
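
The three steps above can be sketched as follows, assuming that both the submissions and each reviewer's past papers have already been labelled with a topic by the classifier. The helper names `build_profile` and `suggest_bids`, the `min_share` threshold, and the paper identifiers are hypothetical; the talk does not say how profiles are cut off.

```python
from collections import Counter


def build_profile(past_paper_topics, min_share=0.2):
    """Keep the topics that cover at least `min_share` of a reviewer's papers."""
    counts = Counter(past_paper_topics)
    total = sum(counts.values())
    return {t for t, n in counts.items() if n / total >= min_share}


def suggest_bids(profile, submissions):
    """Select the submissions whose topic appears in the reviewer's profile."""
    return sorted(pid for pid, topic in submissions.items() if topic in profile)


# Hypothetical reviewer history and submissions (topic names from the slides).
past_topics = ["formal methods"] * 3 + ["web-services"] * 2 + ["software mining"]
submissions = {
    "p1": "formal methods",
    "p2": "software mining",
    "p3": "web-services",
    "p4": "empirical studies",
}
profile = build_profile(past_topics)
print(sorted(profile))
print(suggest_bids(profile, submissions))
```

A threshold on the topic share keeps one-off papers from widening the profile; a fixed number of top topics would be an equally plausible choice.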

Page 17:

Grouping the submissions

Page 18:

Creation of the reviewer profile

Page 19:

Matching profiles and submissions

Page 20:

Result evaluation

ICSM 2010

Page 21:

Reviewers’ profiles

Carlo Ghezzi

Profile: web-services, formal methods, middleware for distributed systems, models, software components, education

CONFIRMED

Harald Gall

Profile: software mining, middleware for distributed systems, models, empirical studies

Do you agree?

Page 22:

Comparison with actual bids

Results apparently not so good: recall is about 53%

BUT

The actual bid is not an oracle

We are suggesting papers for the most relevant topics

Page 23:

Live Testing: ICSE 2011

Propose our bids to the reviewers

Get feedback on our suggestions, based on reviewers’ impressions

Page 24:

Future work

Improvement of the system

Ranking of the suggested papers

Deeper statistical analysis

Paper assignment based on Genetic Algorithms