32n2386-reference model for bigdata - jtc1 sc32jtc1sc32.org/doc/n2351-2400/32n2386-reference...

20
Jangwon Gim Sungjoon Lim Hanmin Jung ISO/IEC JTC1 SC32 Ad-hoc meeting May 29, 2013, Gyeongju Korea 32N2386

Upload: others

Post on 26-May-2020

4 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: 32N2386-Reference Model for BigData - JTC1 SC32jtc1sc32.org/doc/N2351-2400/32N2386-Reference Model_for_BigDat… · 32N2386-Reference Model_for_BigData.pptx Author: Timothy Schoechle

Jangwon Gim Sungjoon Lim Hanmin Jung

ISO/IEC JTC1 SC32 Ad-hoc meeting May 29, 2013, Gyeongju Korea

32N2386

Page 2: 32N2386-Reference Model for BigData - JTC1 SC32jtc1sc32.org/doc/N2351-2400/32N2386-Reference Model_for_BigDat… · 32N2386-Reference Model_for_BigData.pptx Author: Timothy Schoechle

Contents

  Background    Brief  history  of  discussions    Case  study    Procedure  for  developing  standardiza9ons  for  Big  Data    Reference  model  for  Big  Data  

  Conclusions

2

Page 3: 32N2386-Reference Model for BigData - JTC1 SC32jtc1sc32.org/doc/N2351-2400/32N2386-Reference Model_for_BigDat… · 32N2386-Reference Model_for_BigData.pptx Author: Timothy Schoechle

Discussion  of  Big  Data

  Data  analy9cs    Data  analysis    Baba:  Vocabulary,  Use-­‐case,  and  so  on     Stabilize  Architecture   Define  Interfaces   Standardiza6on  opportuni6es  

  Jim:  The  aspect  of  Big  Data  is  “There  is  many  different  forms”  

  Krishna:  Refers  to  Wikipedia  defini9on  

  Keith  Gorden:  Volume,  Complex,  Velocity  

  Keith  W.  Hare:  Open  Big  Data  

   Volume,  Variety,  Velocity,  Value,  Veracity  Any  combina6on  is  OK.  

3

Page 4: 32N2386-Reference Model for BigData - JTC1 SC32jtc1sc32.org/doc/N2351-2400/32N2386-Reference Model_for_BigDat… · 32N2386-Reference Model_for_BigData.pptx Author: Timothy Schoechle

Background

  Emerging  Technologies  For  Big  Data   In  2012,  The  hype  cycle  of  Gartner  

 Diverse  defini6ons  of  technologies  and  services,  having  different  views  of  data  

4

Page 5: 32N2386-Reference Model for BigData - JTC1 SC32jtc1sc32.org/doc/N2351-2400/32N2386-Reference Model_for_BigDat… · 32N2386-Reference Model_for_BigData.pptx Author: Timothy Schoechle

Background

  Big  Data  on  hype  cycle  

  A  general  and  common  reference  model  for  Big  Data  is  needed

5

Page 6: 32N2386-Reference Model for BigData - JTC1 SC32jtc1sc32.org/doc/N2351-2400/32N2386-Reference Model_for_BigDat… · 32N2386-Reference Model_for_BigData.pptx Author: Timothy Schoechle

Brief  history  of  discussions  

6

Issue  Date   Summary  

16  November  2011.  [SC32N2181]  ISO/IEC  JTC  1/SC  32  N2181,  “Resolu6ons  and  topics  from  the  recent  JTC  1  mee6ng  of  par6cular  interest  to  SC  32  par6cipants”,  SC32  Chair  –  Jim  Melton  

12  January  2012.  

[SC32N2198]   ISO/IEC   JTC  1/SC  32  N  2198,   “Analysis  of  2012  Gartner  Technology  Trends”,  JTC1  SWG-­‐P  -­‐  Mario  Wendt  –  Convener     SC  6  Telecommunica6ons  and  informa6on  exchange  between  systems       SC  32  Data  management  and  interchange     SC  39  Sustainability  for  and  by  Informa6on  Technology  

19  March  2012.  [SC32N2199]ISO/IEC   JTC   1/SC   32   N   2199,   “Discussion:   SC   32   Response   to   2011   JTC   1  Resolu6on  33”,  SC32  Chair  –  Jim  Melton  

6  June  2012.   [SC32N2241]  Ad-­‐hoc  on  “Next  gen  analy6cs”  -­‐  Keith  Hair  -­‐  Chair  

Page 7: 32N2386-Reference Model for BigData - JTC1 SC32jtc1sc32.org/doc/N2351-2400/32N2386-Reference Model_for_BigDat… · 32N2386-Reference Model_for_BigData.pptx Author: Timothy Schoechle

The  view  of  Next-­‐Genera9on  Analy9cs  of  SC32  

  Referencing  from    [SC32N2241]  

  Need  a  reference  model  for  Big  Data  to  enhance  interoperability    

7

Next-Generation Analytics Social Analytics

From Baba

Architectural

Mechanisms

Metadata

Raw Storage

Page 8: 32N2386-Reference Model for BigData - JTC1 SC32jtc1sc32.org/doc/N2351-2400/32N2386-Reference Model_for_BigDat… · 32N2386-Reference Model_for_BigData.pptx Author: Timothy Schoechle

Case  Study  (1)

  Korea  Ins9tute  of  Science  and  Technology  (KISTI)   Dept.  of  Computer  Intelligence  Research  

8

Page 9: 32N2386-Reference Model for BigData - JTC1 SC32jtc1sc32.org/doc/N2351-2400/32N2386-Reference Model_for_BigDat… · 32N2386-Reference Model_for_BigData.pptx Author: Timothy Schoechle

Case  Study  (2)

  Architecture  of  InSciTe  Adap9ve  Service

9

Page 10: 32N2386-Reference Model for BigData - JTC1 SC32jtc1sc32.org/doc/N2351-2400/32N2386-Reference Model_for_BigDat… · 32N2386-Reference Model_for_BigData.pptx Author: Timothy Schoechle

Case  Study  (3)

  Seman9c  Analysis   Text  Data  to  Ontology

10

Page 11: 32N2386-Reference Model for BigData - JTC1 SC32jtc1sc32.org/doc/N2351-2400/32N2386-Reference Model_for_BigDat… · 32N2386-Reference Model_for_BigData.pptx Author: Timothy Schoechle

Case  Study  (4)

  Seman9c  Analysis   Ontology  Schema

11

Page 12: 32N2386-Reference Model for BigData - JTC1 SC32jtc1sc32.org/doc/N2351-2400/32N2386-Reference Model_for_BigDat… · 32N2386-Reference Model_for_BigData.pptx Author: Timothy Schoechle

Case  Study  (5)

  Seman9c  Analysis   Example  of  Seman6c  Analysis  

12

Page 13: 32N2386-Reference Model for BigData - JTC1 SC32jtc1sc32.org/doc/N2351-2400/32N2386-Reference Model_for_BigDat… · 32N2386-Reference Model_for_BigData.pptx Author: Timothy Schoechle

Case  Study  (6)

  InSciTe  Service  Func9ons  –  (Hybrid  Vehicle)

13

Technology Navigation

Technology Trend

Core Element Technology

Convergence Technology

Agent Level Agent Partner Integrated Roadmap

Report

Page 14: 32N2386-Reference Model for BigData - JTC1 SC32jtc1sc32.org/doc/N2351-2400/32N2386-Reference Model_for_BigDat… · 32N2386-Reference Model_for_BigData.pptx Author: Timothy Schoechle

Case  Study  (7)

  In  2013,  About  10  Billion  triples  from  diverse  sites  will  be  extracted

14

Sites The number of Count

Freebase 1,015,762,951

Yago 224,949,079

DBPedia 449,383,705

DBLP 81,986,947

baseKB 147,549,529

Etc (WhoisWho,NYTimes,LinkedObervedData,…) 2,296,838,760

Total 4,216,470,971

Page 15: 32N2386-Reference Model for BigData - JTC1 SC32jtc1sc32.org/doc/N2351-2400/32N2386-Reference Model_for_BigDat… · 32N2386-Reference Model_for_BigData.pptx Author: Timothy Schoechle

Case  Study  (8)

  In  2013,  System  Architecture  of  InSciTe  Adap9ve  Service

15

Page 16: 32N2386-Reference Model for BigData - JTC1 SC32jtc1sc32.org/doc/N2351-2400/32N2386-Reference Model_for_BigDat… · 32N2386-Reference Model_for_BigData.pptx Author: Timothy Schoechle

Procedure  for  developing  a  reference  model  for  Big  Data  

4. Deriving use-cases for applying the Big Data

3. Defining a concept model / a reference model / a framework for Big Data

2. Establishing visions and strategies for achieving the goal of Big Data

1. Eliciting requirements and analyzing the environment of Big Data

16

We are here

Page 17: 32N2386-Reference Model for BigData - JTC1 SC32jtc1sc32.org/doc/N2351-2400/32N2386-Reference Model_for_BigDat… · 32N2386-Reference Model_for_BigData.pptx Author: Timothy Schoechle

A lifecycle of Big Data  

17

1. 2.

3. 4.

•  Collection/Identification •  Repository/Registry •  Semantic

Intellectualization •  Integration

•  Data Curation •  Data Scientist •  Data Engineer

•  Workflow •  Data Quality

•  Analytics / Prediction

•  Visualization

Page 18: 32N2386-Reference Model for BigData - JTC1 SC32jtc1sc32.org/doc/N2351-2400/32N2386-Reference Model_for_BigDat… · 32N2386-Reference Model_for_BigData.pptx Author: Timothy Schoechle

Reference  Model  for  Big  Data  

  A  Reference  Model  for  Big  Data

18

Data Layer

Platform Layer Data Semantic Intellectualization

Data Integration

Data Quality Management

Big Data Management

Data Curation

Service Layer Analysis & Prediction

Security

Data Visualization

Service Support Layer

Workflow Management

Interface

Data Collection

Data Identification (Data Mining & Metadata Extraction)

Data Registry Data Repository

Interface

Interface

Page 19: 32N2386-Reference Model for BigData - JTC1 SC32jtc1sc32.org/doc/N2351-2400/32N2386-Reference Model_for_BigDat… · 32N2386-Reference Model_for_BigData.pptx Author: Timothy Schoechle

Reference  Model  for  Big  Data  

  A  Reference  Model  for  Big  Data

19

Data Layer

Platform Layer Data Semantic Intellectualization

Data Integration

Data Quality Management

Big Data Management

Data Curation

Service Layer Analysis & Prediction

Security

Data Visualization

Service Support Layer

Workflow Management

Interface

Data Collection

Data Identification (Data Mining & Metadata Extraction)

Data Registry Data Repository

Interface

Interface

9075

13249

11179

19763

???

Page 20: 32N2386-Reference Model for BigData - JTC1 SC32jtc1sc32.org/doc/N2351-2400/32N2386-Reference Model_for_BigDat… · 32N2386-Reference Model_for_BigData.pptx Author: Timothy Schoechle

Conclusions

  Summary   Analyzing  the  circumstance  of  Big  Data  

 Building  a  framework  for  Big  Data  

 Define  detail  procedure  to  create  the  Big  Data  

  Discussion   Possible  sugges6ons  

•  New  Working  Group  for  the  reference  model  of  Big  Data    New  Work  Items  could  be  derived  from  the  model  

•  New  Study  Group  

  Future  work   Discussion  of  the  concept  of  NWI  

•  2013.  11.  Interim  mee6ngs  

 Propose  extended  the  reference  model  of  Big  Data  (NWI)  •  2014.  5  Plenary  mee6ng  

20