InSciTe Project

Copyright © 2013, KISTI MSRA Meeting (2013.1) InSciTe Project Hanmin Jung Head of the Dept. of Computer Intelligence Research

Upload: hanmin-jung

Post on 16-May-2015

TRANSCRIPT

Page 1: InSciTe Project

InSciTe Project

Hanmin Jung, Head of the Dept. of Computer Intelligence Research

Page 2: InSciTe Project

KISTI

Institute of Advanced Information

S/W Research Center

Dept. of Computer Intelligence Research

Page 3: InSciTe Project

Human vs. Machine Intelligence

Page 4: InSciTe Project

Machine Intelligence

http://powet.tv/powetblog/wp-content/uploads/2011/02/watson_the_computer_beats_ken_jennings_and_brad_rutter_at_jeopardy_full.jpg

• IBM Watson

Page 5: InSciTe Project

Machine Intelligence

http://cdn3.digitaltrends.com/wp-content/uploads/2011/10/1200-siri.jpg

• Stanford’s Robotic Car

Page 6: InSciTe Project

Machine Intelligence

http://cdn3.digitaltrends.com/wp-content/uploads/2011/10/1200-siri.jpg

• Apple Siri

Page 7: InSciTe Project

Web Evolution

Page 8: InSciTe Project

Size of Data in the World

http://www.ektron.com/billcavablog/Big-Data-Big-Content-Big-Challenges/

Q: How about humans?

A: Our brain has the capacity to store information in the hundreds of terabytes to petabyte range.

Page 9: InSciTe Project

Effect of Big Data

• Search Evaluation

http://static.googleusercontent.com/external_content/untrusted_dlcp/research.google.com/en//pubs/archive/40491.pdf

Page 10: InSciTe Project

Value Pyramid

Search
Clustering
Extracting
Decision Support
Forecasting
Scenario Planning
Advising

InSciTe Advanced (2011)
InSciTe Adaptive (2012)

Modified from D. Bousfield & P. Fooladi, “STM Information: 2009 Final Market Size and Share Report”, 2010.

Page 11: InSciTe Project

Needs of Experts

Relationship between technologies
Leading companies
Technology gap
New entries
Social information
Technology hierarchy
Standard patents
Product information
Trend reports
Search history
Partner candidates recommendation
Significance of papers/patents
Market shares
Citation information
Key players in group
Core technologies
Market size
Information verification

Page 12: InSciTe Project

Technology Intelligence

R. Rohrbeck, H. Arnold, and J. Heuer, “Strategic Foresight in Multimedia Enterprises”, 2007.

Page 13: InSciTe Project

Quantitative Analytics

Page 14: InSciTe Project

Quantitative Analytics

http://www.google.com/insights/search/

• Insights for Search

Page 15: InSciTe Project

TI Projects

• FUSE
  • Funded by IARPA (early 2011 ~ early 2016)
  • Kick-off meeting in summer 2011
  • Foresight and Understanding from Scientific Exposition Program
  • Seeks to develop automated methods that aid in the systematic, continuous, and comprehensive assessment of technical emergence using information found in the published scientific, technical, and patent literature
  • Partners
    • BAE Systems, Brandeis Univ., New York Univ., 1790 Analytics, …

Page 16: InSciTe Project

TI Projects

• CUBIST
  • Funded by the European Commission (late 2010 ~ late 2013)
  • 1st CUBIST workshop in July 2011
  • Combining and Uniting Business Intelligence with Semantic Technologies Program
  • Aims to develop new ways not only to interrogate the massive volume of data on the Internet, but also to analyze the different formats it exists in, such as blogs, wikis, and video
  • Partners
    • SAP, Ontotext, Sheffield Hallam Univ., …

Page 17: InSciTe Project

TI Projects

• Common Technologies
  • Semantic technologies
    • Ontology, reasoning, URI scheme
  • Analytics model
    • BYOM (e.g. technology opportunity discovery model, technology evolution model, formal concept analysis model)
  • Information extraction (InSciTe, FUSE)
    • Named entities and events/relations in textual documents
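
Of the analytics models named above, formal concept analysis is compact enough to illustrate. The following is a minimal sketch, not the model used in CUBIST or InSciTe: the context (three display technologies and their attributes) is hypothetical, and the brute-force enumeration is only practical for toy inputs.

```python
from itertools import combinations

# Hypothetical formal context: objects (technologies) mapped to their attributes.
CONTEXT = {
    "OLED": {"display", "flexible"},
    "LCD": {"display"},
    "e-paper": {"display", "flexible", "low-power"},
}


def common_attributes(objects, context, all_attributes):
    """Attributes shared by every object in `objects` (all attributes if empty)."""
    shared = set(all_attributes)
    for obj in objects:
        shared &= context[obj]
    return shared


def objects_having(attributes, context):
    """Objects that carry every attribute in `attributes`."""
    return {obj for obj, attrs in context.items() if attributes <= attrs}


def formal_concepts(context):
    """Enumerate all (extent, intent) pairs by closing every subset of objects.

    Brute force: exponential in the number of objects, so toy contexts only.
    """
    all_attributes = set().union(*context.values())
    names = sorted(context)
    concepts = set()
    for size in range(len(names) + 1):
        for subset in combinations(names, size):
            intent = common_attributes(subset, context, all_attributes)
            extent = objects_having(intent, context)
            concepts.add((frozenset(extent), frozenset(intent)))
    return concepts


if __name__ == "__main__":
    for extent, intent in sorted(formal_concepts(CONTEXT), key=lambda c: len(c[0])):
        print(sorted(extent), sorted(intent))
```

Each resulting pair groups a maximal set of technologies with the maximal set of attributes they all share, which is the kind of grouping a concept lattice is built from.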

Page 18: InSciTe Project

InSciTe Advanced (2011)

Page 19: InSciTe Project

InSciTe Advanced (2011)

• Data Fact Sheet
  • Articles: 15.4 million (6.7 million papers, 8.7 million patents)
    • IEEE proceedings/journals (2001~2011)
    • Papers for all technical areas (2009~2011)
    • US/EU/Japan patents (2001~2011)
  • Technical terms: 68 thousand
  • Institutions: 340 thousand

Page 20: InSciTe Project

InSciTe Adaptive (2012)

Page 21: InSciTe Project

InSciTe Adaptive (2012)

• Crawling Web Data by RSS & Google API
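
The slide does not describe how the crawler works. As a rough illustration of the RSS half only, here is a minimal sketch that extracts items from an RSS 2.0 feed with Python's standard library. The sample feed, its URLs, and the function name `parse_rss_items` are hypothetical; a real crawler would first download the feed text over HTTP.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample feed standing in for a downloaded RSS 2.0 document.
SAMPLE_RSS = """<?xml version="1.0"?>
<rss version="2.0">
  <channel>
    <title>Example technology news</title>
    <item>
      <title>New battery material announced</title>
      <link>http://example.com/news/1</link>
      <pubDate>Mon, 07 Jan 2013 09:00:00 GMT</pubDate>
    </item>
    <item>
      <title>Flexible display prototype shown</title>
      <link>http://example.com/news/2</link>
      <pubDate>Tue, 08 Jan 2013 10:30:00 GMT</pubDate>
    </item>
  </channel>
</rss>"""


def parse_rss_items(rss_text):
    """Return one dict per <item> with its title, link, and publication date."""
    root = ET.fromstring(rss_text)
    items = []
    for item in root.iter("item"):
        items.append({
            "title": item.findtext("title", default="").strip(),
            "link": item.findtext("link", default="").strip(),
            "published": item.findtext("pubDate", default="").strip(),
        })
    return items


if __name__ == "__main__":
    for entry in parse_rss_items(SAMPLE_RSS):
        print(entry["published"], entry["title"], entry["link"])
```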

Page 22: InSciTe Project

InSciTe Adaptive (2012)

• Data Fact Sheet
  • Articles: 22.6 million (9.8 million for papers, 7.6 million for patents, 5.3 million for Web data)
    • All technical areas (2001~2011)
  • Named entities: 1.9 million
  • Authority dictionary: 1.5 million entries
  • Linked Data: 290 GB (will be connected)

Page 23: InSciTe Project

InSciTe Adaptive (2012)

• Big Data Test Bed

Page 24: InSciTe Project

Case Studies

• Ministry of Justice (2007~)

Page 25: InSciTe Project

Case Studies

• Korea Customs Service (2010~2011)

Page 26: InSciTe Project

Case Studies

• Defense Agency for Technology and Quality (2011~2012)

Page 27: InSciTe Project

Case Studies

• ISTIC, China
  • For national digital library based on analytics

Page 28: InSciTe Project

InSciTe Architecture

Analytics Models
ETD Model: Emerging Technology Discovery Model
TLCD Model: Technology Life Cycle Discovery Model
TLC Model: Technology Life Cycle Model
OntoRelFinder®: Relationship Path Finder
OntoReasoner®: Reasoning Engine
OntoURI®: Semantic Knowledge Manager
OntoPipeliner®: Semantic Service Composer
SS&AE: Semantic Search & Analytics Engine
OntoURIResolver®: Identity Resolver
SINDI-CORE/LINK: Entity & Relationship Extractor
TUC Model: Terminology Use Cycle Model
Ontology
Linked Data
OntoFrame
OntoVerifier®: Reasoning Verifier
Web Data Crawler: RSS/Google API
Web Data
Literature
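
The slide names a "Relationship Path Finder" component without describing its method. To give a feel for what relationship path finding means in general, here is a minimal sketch that runs a breadth-first search over entity-relationship triples. It is an assumption for illustration, not OntoRelFinder's algorithm; the triples and function names are hypothetical.

```python
from collections import deque

# Hypothetical (subject, relation, object) triples; not taken from InSciTe data.
TRIPLES = [
    ("Researcher A", "authored", "Paper 1"),
    ("Paper 1", "mentions", "Technology X"),
    ("Company B", "holds", "Patent 7"),
    ("Patent 7", "mentions", "Technology X"),
]


def build_graph(triples):
    """Index triples as an undirected adjacency map: node -> [(relation, neighbor)]."""
    graph = {}
    for subject, relation, obj in triples:
        graph.setdefault(subject, []).append((relation, obj))
        graph.setdefault(obj, []).append((relation, subject))
    return graph


def shortest_relationship_path(graph, start, goal):
    """Breadth-first search returning [node, relation, node, ...] or None."""
    if start not in graph or goal not in graph:
        return None
    queue = deque([[start]])
    visited = {start}
    while queue:
        path = queue.popleft()
        node = path[-1]  # the last element of a path is always a node
        if node == goal:
            return path
        for relation, neighbor in graph[node]:
            if neighbor not in visited:
                visited.add(neighbor)
                queue.append(path + [relation, neighbor])
    return None


if __name__ == "__main__":
    graph = build_graph(TRIPLES)
    print(shortest_relationship_path(graph, "Researcher A", "Company B"))
```

Relations are treated as undirected here so that a researcher and a company can be linked through a shared technology; a production system would likely keep direction and rank paths by more than length.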

Page 29: InSciTe Project

InSciTe Project

• Goal & Tasks (2013)
  • Development of S&T Literature Big Data Analytics/Application Platform
    • Big Data mining technology
    • Semantic analytics technology
    • Big Data relationship analytics/application technology
  • Technologies
    • Text mining
    • Multimedia mining
    • Semantic integration
    • Reasoning and graph analysis
    • Modeling and assessment for relationship analytics and application

Page 30: InSciTe Project

InSciTe Project

• Partners (2013)
  • OVUM, UK
    • Building analytics model
    • Understanding business needs
    • Planning InSciTe service
  • MSRA, China
    • TBD
  • GESIS & Hildesheim Univ., Germany
    • Analyzing patent trends
    • Assessing InSciTe service platform
  • …

Page 31: InSciTe Project

Homepage

http://semantics.kisti.re.kr

Page 32: InSciTe Project

Thank you

[email protected]

“A lot of times, people don’t know what they want until you show it to them.”

by Steve Jobs

“Many people won’t be convinced until they’ve seen it for themselves.”

by Jakob Nielsen