1 core bringing the gsbpm to life! j. linnerud & j.-p. kent

33
1 CORE Bringing the GSBPM to life! J. Linnerud & J.-P. Kent

Upload: amberly-banks

Post on 08-Jan-2018

222 views

Category:

Documents


0 download

DESCRIPTION

3 Statistics: How? Specify a statistic Design a process that will produce this statistic Build a system that will execute this process

TRANSCRIPT

Page 1: 1 CORE Bringing the GSBPM to life! J. Linnerud & J.-P. Kent

1

CORE

Bringing the GSBPM to life!

J. Linnerud & J.-P. Kent

Page 2: 1 CORE Bringing the GSBPM to life! J. Linnerud & J.-P. Kent

2

Main points

1. An ideal development process for a statistical system

2. Why this ideal usually is not met

3. How CORE aims at supporting this ideal development process

Page 3: 1 CORE Bringing the GSBPM to life! J. Linnerud & J.-P. Kent

3

Statistics: How?

• Specify a statistic• Design a process that will

produce this statistic• Build a system that will execute

this process

Page 4: 1 CORE Bringing the GSBPM to life! J. Linnerud & J.-P. Kent

4

What is the product?• Define a statistic– What does it say?

• Measures, dimensions, explanations…– What does it look like?

• Tables, Press release, Analytic paper…– What is the input?

• Population, variables, data sources…– What is the relation between input and

output?• Methods to apply

Page 5: 1 CORE Bringing the GSBPM to life! J. Linnerud & J.-P. Kent

5

Statistics: How?

• Specify a statistic• Design a process that will

produce this statistic• Build a system that will execute

this process

Page 6: 1 CORE Bringing the GSBPM to life! J. Linnerud & J.-P. Kent

6

How to produce the statistic

• Model the data– Input, output, intermediary results

• Specify process steps to apply the chosen statistical methods

• Integrate these steps in a process flow

Page 7: 1 CORE Bringing the GSBPM to life! J. Linnerud & J.-P. Kent

7

Statistics: How?

• Specify a statistic• Design a process that will

produce this statistic• Build a system that will execute

this process

Page 8: 1 CORE Bringing the GSBPM to life! J. Linnerud & J.-P. Kent

8

Let the machine do it

• Implement the data models• Implement the process steps• Implement the process flow

Page 9: 1 CORE Bringing the GSBPM to life! J. Linnerud & J.-P. Kent

9

Why is this approach good? (1)

• Variability vs. stability– Statistical products are specific

• There is a great variety of products• A given product will vary in time

– Statistical processes are generic• The same method can be applied to many products• Process steps implementing methods can be reused• A significant change in the product can be

implemented with some simple changes in some process steps

Page 10: 1 CORE Bringing the GSBPM to life! J. Linnerud & J.-P. Kent

10

Why is this approach good? (2)

• It allows a clean specification of the product– In terms of what it is– In terms of what is used– In terms of what the relation is

between input and output

Page 11: 1 CORE Bringing the GSBPM to life! J. Linnerud & J.-P. Kent

11

• It separates product design from IT– The product is defined in terms of what

it is (and not how it is produced)– The process is defined in terms of what

it does (and not how it is implemented)– Only the system is defined in technical

terms

Why is this approach good? (3)

Page 12: 1 CORE Bringing the GSBPM to life! J. Linnerud & J.-P. Kent

12

• It supports optimalisation of process development– Possibility of developing

standardised, re-usable process steps– Generic process steps are not defined

for an actual statistic, but for use in different statistics

Why is this approach good? (4)

Page 13: 1 CORE Bringing the GSBPM to life! J. Linnerud & J.-P. Kent

13

Main points

1. An ideal development process for a statistical system

2. Why this ideal usually is not met

3. How CORE aims at supporting this ideal development process

Page 14: 1 CORE Bringing the GSBPM to life! J. Linnerud & J.-P. Kent

14

The usual approach

• Statisticians present a project in which product and process are combined

• IT people specify and build a system that creates the product by performing the process

Page 15: 1 CORE Bringing the GSBPM to life! J. Linnerud & J.-P. Kent

15

Why is the usual approach inefficient?

• Complexity• Process & product are tightly coupled

• Rigidity• Maintenance is labour-intensive

• Specificity• It is not easy to devise a generic solution

when developing for a specific product

Page 16: 1 CORE Bringing the GSBPM to life! J. Linnerud & J.-P. Kent

16

Main points

1. An ideal development process for a statistical system

2. Why this ideal usually is not met

3. How CORE aims at supporting this ideal development process

Page 17: 1 CORE Bringing the GSBPM to life! J. Linnerud & J.-P. Kent

17

Promoting the better approach

1. The CORA and CORE projects (Jenny)

2. Bringing the results into practice (Jean-Pierre)

Page 18: 1 CORE Bringing the GSBPM to life! J. Linnerud & J.-P. Kent

18

CORACORA

CORA ESSnet• COmmon Reference Architecture

(CORA)Financed by Eurostat under 2009 Statistical WorkprogrammeCountries involved: it (coordinator), ch, dk, lv, nl, no, seDuration: October 2009 - October 2010

Page 19: 1 CORE Bringing the GSBPM to life! J. Linnerud & J.-P. Kent

19

CORACORA

CORA deliverables• Questionnaire• Set of Requirements• State of the Art• Definition of the Layered Model• Technical Annex• Instruction Manual• Commercial and Legal Foundations for

the Exchange of Software between Statistical Offices

• Requirements Checklist for CORA Tools• Recommendations for CORA Tools

Page 20: 1 CORE Bringing the GSBPM to life! J. Linnerud & J.-P. Kent

20

After CORA … CORE!

COmmon Reference Environment (CORE)Financed by Eurostat under 2010 Statistical WorkprogrammeCountries involved: it (coordinator), fr, nl, no, pt, se Duration: December 2010 - January 2012

Page 21: 1 CORE Bringing the GSBPM to life! J. Linnerud & J.-P. Kent

21

CORE Workpackages• Design of the information model according to

GSBPM and alignment with NSI's information models

• Generic interface design for interconnecting GSBPM sub-processes

• Research workflow solutions for process management

• Implementation library for generic interface and production chain for .NET

• Implementation library for generic interface and production chain for Java

Page 22: 1 CORE Bringing the GSBPM to life! J. Linnerud & J.-P. Kent

22

Practical usage of CORA / CORE

• Modeling a process in terms of services (CORA)

• Classifying services (CORA)• Making services platform-

independent (CORE)

Page 23: 1 CORE Bringing the GSBPM to life! J. Linnerud & J.-P. Kent

Figures

Time series

Statistic

Population

Unit

Variable

Value

3Build

1Specify Needs

2Design

6Analyse

4Collect

5Process

9Evaluate

7Disseminate

8Archive

CORACORA

Page 24: 1 CORE Bringing the GSBPM to life! J. Linnerud & J.-P. Kent

Figures

Time series

Statistic

Population

Unit

Variable

Value

3Build

1Specify Needs

2Design

6Analyse

4Collect

5Process

9Evaluate

7Disseminate

8Archive

An example process

• A transport statistic– Input:

• Loading reports• Unloading reports

– Date, time, place, type & quantity goods, type vehicle

– Output:• Monthly transport data

– Same data also used for time series

CORACORA

Page 25: 1 CORE Bringing the GSBPM to life! J. Linnerud & J.-P. Kent

Figures

Time series

Statistic

Population

Unit

Variable

Value

3Build

1Specify Needs

2Design

6Analyse

4Collect

5Process

9Evaluate

7Disseminate

8Archive

Modeling approach

• Use the CORA space grid

CORACORA

Page 26: 1 CORE Bringing the GSBPM to life! J. Linnerud & J.-P. Kent

Figures

Time series

Statistic

Population

Unit

Variable

Value

3Build

1Specify Needs

2Design

6Analyse

4Collect

5Process

9Evaluate

7Disseminate

8Archive

Microdata

Macrodata

Page 27: 1 CORE Bringing the GSBPM to life! J. Linnerud & J.-P. Kent

Figures

Time series

Statistic

Population

Unit

Variable

Value

3Build

1Specify Needs

2Design

6Analyse

4Collect

5Process

9Evaluate

7Disseminate

8Archive

Modeling approach

• Use the CORA space grid• Display statistical services in the

appropriate cells

CORACORA

Page 28: 1 CORE Bringing the GSBPM to life! J. Linnerud & J.-P. Kent

Figures

Time series

Statistic

Population

Unit

Variable

Value

3Build

1Specify Needs

2Design

6Analyse

4Collect

5Process

9Evaluate

7Disseminate

8Archive

Aggregate

Macroediting

Page 29: 1 CORE Bringing the GSBPM to life! J. Linnerud & J.-P. Kent

Figures

Time series

Statistic

Population

Unit

Variable

Value

3Build

1Specify Needs

2Design

6Analyse

4Collect

5Process

9Evaluate

7Disseminate

8Archive

Modeling approach

• Use the CORA space grid• Display statistical services in the

appropriate cells• Join services with arrows to show

the dependencies

CORACORA

Page 30: 1 CORE Bringing the GSBPM to life! J. Linnerud & J.-P. Kent

Figures

Time series

Statistic

Population

Unit

Variable

Value

3Build

1Specify Needs

2Design

6Analyse

4Collect

5Process

9Evaluate

7Disseminate

8Archive

Aggregate

Macroediting

Page 31: 1 CORE Bringing the GSBPM to life! J. Linnerud & J.-P. Kent

Figures

Time series

Statistic

Population

Unit

Variable

Value

3Build

1Specify Needs

2Design

6Analyse

4Collect

5Process

9Evaluate

7Disseminate

8Archive

Monthly Transport Publication Confidentialty control

Select period data

Integrate data

ArchivePublication data

Archive TimeSeries data

Supply period data

AggregateArchive Statistic data

Macroediting

Microediting ArchiveUnit data

Compute distance

Combine

Archiveobs. vars.

Download

Outlier detection

Error detection

Correct outliers

Correct variables

?

?

Page 33: 1 CORE Bringing the GSBPM to life! J. Linnerud & J.-P. Kent

33

CORACORAA CORA service

Tool X

Model (X) Model (X)

Script (X)

Input (X) Output (X)

Model (CORA) Model (CORA)

Script (CORA)

CV CV CVCV CV

Input (CORA) Output (CORA)

CV = Convertor

Logging

Tool Y

Model (Y) Model (Y)

Script (Y)

Input (Y) Output (Y)

CV CV CVCV CV