application of service oriented architecture in statistics new zealand unsc modernisation of the...

20
Application of Service Oriented Architecture in Statistics New Zealand UNSC Modernisation of the Statistical Process Seminar New York, February 24, 2010 Geoff Bascand & Matjaz Jug

Upload: holly-alexander

Post on 23-Dec-2015

215 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Application of Service Oriented Architecture in Statistics New Zealand UNSC Modernisation of the Statistical Process Seminar New York, February 24, 2010

Application of Service Oriented Architecture in Statistics New Zealand

UNSC Modernisation of the Statistical Process Seminar

New York, February 24, 2010

Geoff Bascand & Matjaz Jug

Page 2: Application of Service Oriented Architecture in Statistics New Zealand UNSC Modernisation of the Statistical Process Seminar New York, February 24, 2010

Drivers for IT Architecture

• Agility: transformational changes like shift towards the increased use of administrative data, more automated data processing etc.

• Cost & Reuse: standardisation and reducing high costs of development and maintenance of statistical production systems.

• Integration: need for integration of outsourced statistical tools and legacy application assets

• Configuration: response to frequent changes in data sources, questionnaires, methodology and classifications.

Page 3: Application of Service Oriented Architecture in Statistics New Zealand UNSC Modernisation of the Statistical Process Seminar New York, February 24, 2010

SOA Definition• The Open Group describes Service Oriented

Architecture (SOA) as a:– “style of IT architecture that delivers agility and Boundaryless

Information Flow™. It is deployed on an increasing scale in enterprises today.”

• SOA is a message-based, independent component architecture where:– communication between components is managed by a

“service (or process) manager” that mediates communication, coordination and cooperation among components through messages. The message carries data and process data.

Page 4: Application of Service Oriented Architecture in Statistics New Zealand UNSC Modernisation of the Statistical Process Seminar New York, February 24, 2010

SOA Benefits

• Increased agility: organisations should be able to more quickly respond to changes in business process and external environment.

• Reduction of cost through reuse: new IT systems should be able to leverage the most readily available code and services from across the organization and externally.

• Better possibilities for integration using loosely coupled framework and orchestration.

• Configuration rather than programming

Page 5: Application of Service Oriented Architecture in Statistics New Zealand UNSC Modernisation of the Statistical Process Seminar New York, February 24, 2010

Situation in Statistical Organizations

• Many lessons learnt from early adopters• Even now there are not a lot of statistical

organisations implementing SOA on a large scale

• We are “behind” compared with some other sectors like the Airline Industry

• WHY? Are we really so different?

Page 6: Application of Service Oriented Architecture in Statistics New Zealand UNSC Modernisation of the Statistical Process Seminar New York, February 24, 2010

In Some Areas We are Different!

• Many semantically diverse data structures• Frequent change in data structure, sources,

questionnaires• Specific requirements like data confidentiality• Many stove-piped legacy application assets• Mainly non-transactional processing• End-user processing environments

Page 7: Application of Service Oriented Architecture in Statistics New Zealand UNSC Modernisation of the Statistical Process Seminar New York, February 24, 2010

Learning from Data Warehousing and Metadata-Driven Projects1. High degree of organisational change is required

which is usually slow process.

2. It is difficult to establish new governance.

3. New architecture usually requires complete replacement of legacy application assets portfolio.

4. Software development capability is difficult to upgrade and maintain in-house

5. Common challenge organisations often face involves effectively managing metadata.

6. Lack of standardisation – it appears every new paradigm requires more of it.

Page 8: Application of Service Oriented Architecture in Statistics New Zealand UNSC Modernisation of the Statistical Process Seminar New York, February 24, 2010

Additional lessons from early SOA attempts

• Standardisation of services and data structures is vital

• Too broad a business or services scope, then costs of generality & development are high

• Too specific a service or business request, then benefits of re-usability are limited

• Performance degrades with volume

Page 9: Application of Service Oriented Architecture in Statistics New Zealand UNSC Modernisation of the Statistical Process Seminar New York, February 24, 2010

Architecture in Stats NZ now – Platform approach and Shared Services (SOA)

Page 10: Application of Service Oriented Architecture in Statistics New Zealand UNSC Modernisation of the Statistical Process Seminar New York, February 24, 2010

CollectionCollection

CAPICAPI

CATICATI

ImagingImaging

Administrative DataAdministrative Data

DisseminationDissemination

Table

Builder

Table

Builder

InfoshareInfoshare

Business

Toolbox

Business

Toolbox

FutureFuture

Content Management

(www.stats.govt.nz)

Content Management

(www.stats.govt.nz)

Processing - Micro Economic StatisticsProcessing - Micro Economic Statistics

Processing - Macro Economic StatisticsProcessing - Macro Economic Statistics

Processing - Social/Household StatisticsProcessing - Social/Household Statistics

Statistical InfrastructureStatistical Infrastructure

IT InfrastructureIT Infrastructure

Platform for Micro economic statistics (BESt)Platform for Micro economic statistics (BESt)

Platform for HH statistics (POSS)Platform for HH statistics (POSS)

Other systems (mostly legacy)

Other systems (mostly legacy)

Other systems (mostly legacy)

Platform for National Accounts (DNA)Platform for National Accounts (DNA)

Frames and RegistersFrames and Registers Classification ManagementClassification Management Metadata ManagementMetadata Management MethodologiesMethodologies

HardwareHardware Server Software (OS, email, SQL DB, OLAP, CRM, CMS)

Server Software (OS, email, SQL DB, OLAP, CRM, CMS)

Applications & ToolsApplications & Tools Desktop Software (MS Office, Lotus Notes)

Desktop Software (MS Office, Lotus Notes)

Census PlatformCensus Platform

Future

(Web)

Future

(Web)

Page 11: Application of Service Oriented Architecture in Statistics New Zealand UNSC Modernisation of the Statistical Process Seminar New York, February 24, 2010

SOA in Data Collection• Description: data collected through CATI, CAPI and

Imaging are loaded (pushed) using messaging infrastructure to production databases. The grain is individual questionnaire response. Load service was built to deliver data to Legolution and POSS Input Data Environment (now Social Input Store).

• Challenges: We have dropped this approach in Process phase due to difficulties in moving large amounts of data as a messages. Requirement to pass process-metadata was overlooked so additional metadata transfer had to be used

• Benefits: infrastructure required for transactional data collection where every response can be pushed to production systems. This approach is anticipated as a result of Standard Business Reporting project.

Page 12: Application of Service Oriented Architecture in Statistics New Zealand UNSC Modernisation of the Statistical Process Seminar New York, February 24, 2010

SOA in Data Processing

• Description: Data is now transferred using ETL packages (pull). Service is used to initiate ETL packages. Configuration store is a central place where process is configured (metadata) and is currently used by two systems: BESt platform and SOFIE processing system.

• Challenges: Reuse of ETL packages is limited to the single platform (BESt) but some components (configuration store) can be used by other systems as well (as part of statistical infrastructure).

• Benefits: Highly configurable process workflow enabling WHAT-IF scenarios.

Page 13: Application of Service Oriented Architecture in Statistics New Zealand UNSC Modernisation of the Statistical Process Seminar New York, February 24, 2010

v1 18/02/2010 11

A user created pipeline of SAS operations – note the small scale of

each step

Settings for a BANFF SAS macro (Historic Imputation for Bad Debt

variable) Users can change the steps and

order in the pipeline – The system records changes and provides the

ability to rerun using an older version. Multiple configuration sets can be made and tested at one time

Page 14: Application of Service Oriented Architecture in Statistics New Zealand UNSC Modernisation of the Statistical Process Seminar New York, February 24, 2010

SOA in Data Dissemination

• Description: Dissemination tool Business Toolbox is using SDMX query service to get aggregated data from dissemination data warehouse OECD.stat and present it in a customized user friendly way.

• Challenges: integration of data warehouse with output production (legacy) systems.

• Benefits: Presentation of information is not dependent on the physical structure in data warehouse, possibility to easily add new SDMX-based web components as well as new data.

Page 15: Application of Service Oriented Architecture in Statistics New Zealand UNSC Modernisation of the Statistical Process Seminar New York, February 24, 2010
Page 16: Application of Service Oriented Architecture in Statistics New Zealand UNSC Modernisation of the Statistical Process Seminar New York, February 24, 2010

SOA in Statistical Infrastructure

• Description: Coding is the first example of statistical infrastructure to be offered through the service interface (internally and externally). CCS coder will offer automated coding service based on classification metadata in CARS.

• Challenges: metadata management & standardisation.

• Benefits: Statistical infrastructure (metadata management systems, registers) can provide services to internal and external platforms and individual systems.

Page 17: Application of Service Oriented Architecture in Statistics New Zealand UNSC Modernisation of the Statistical Process Seminar New York, February 24, 2010

How to Start? Areas Where SOA Can Deliver Significant Value

• Metadata services: a good candidate for reuse in many stovepipe and corporate applications.

• Statistical tools/components: making them more interoperable using service interface would significantly improve the possibilities to integrate them in different IT environments and therefore increase their shared usage and collaboration.

Page 18: Application of Service Oriented Architecture in Statistics New Zealand UNSC Modernisation of the Statistical Process Seminar New York, February 24, 2010

Summary• Iterative development (low hanging fruit first)

and proofs of the concepts• No emphasis on any particular approach:

SOA, DW and metadata-driven architecture are used together in a way which maximizes benefits and minimizes risk

• Strong focus on use of standards (SDMX)• Common IT Infrastructure is enabling

additional consolidation (MS SQL Server & Analysis Services, SAS Server, Blaise, .NET)

Page 19: Application of Service Oriented Architecture in Statistics New Zealand UNSC Modernisation of the Statistical Process Seminar New York, February 24, 2010

Annex: Systems Architecture and SOA Use – Detailed Version

• The following slide is a detailed picture of our systems architecture– Box 1 highlights SOA in the collections area– Box 2 highlights SOA in the processing area– Box 3 highlights SOA in the dissemination area– Box 4 highlights SOA in statistical infrastructure

Page 20: Application of Service Oriented Architecture in Statistics New Zealand UNSC Modernisation of the Statistical Process Seminar New York, February 24, 2010

Statistical Infrastructure

CARSCCS Coder

(SOA)

IT Infrastructure

Collection (Contact platform)

LoadService(SOA)

Load area

Micro-economic platform (BESt)

Load(ETL)

Main Store

Process Store

Run E&I Scenario

(ETL)Write back

(ETL)

Configuration Store

Initiate ETL package

(SOA)

Social-household platform (POSS)

Social InputStore

Dissemination

OECD.stat

Translate &Configure

(ETL)

Beyond 20/20

PC-Axis

New Table Builder

ContentManagement

Web Coder

Other statistical production systems (including legacy stovepipes)

Sprocet

Legolution

Sofie process

SAS & Banff

StatisticalOutputs

InternalSystem

Other systemsOther

systemsOther systemsOther

systems

Administrative data

Tax data

Process Store

SDMXData Query

(SOA)

Futurecomponents

Table Builder

Infoshare

Business Toolbox

External SystemBF LBF LEED

CATI

CAPI

Imaging

Respondent management

Collection Management

Architecture and SOA Use – Detailed Version