world meteorological organization data representation system xml jan w. noteboom royal netherlands...

26
World Meteorological Organization Data Representation System XML Jan W. Noteboom Royal Netherlands Meteorological Institute (KNMI) [email protected] CBS OPAG-ISS Expert Team on the Assessment of Data Representation Systems (ET-ADRS) Washington DC, USA, 23 - 25 April, 2008

Upload: reginald-west

Post on 27-Dec-2015

219 views

Category:

Documents


2 download

TRANSCRIPT

World Meteorological Organization

Data Representation System

XMLData Representation System

XML

Jan W. NoteboomRoyal Netherlands Meteorological Institute (KNMI)[email protected]

Jan W. NoteboomRoyal Netherlands Meteorological Institute (KNMI)[email protected]

CBS OPAG-ISS

Expert Team on the Assessment of Data Representation Systems (ET-ADRS)

Washington DC, USA, 23 - 25 April, 2008

ET-ADRS, 23-25 April 2008, Washington DC, USA 2

World Meteorological Organization

PresentationPresentation

• XML Overview

• XML SWOT Analysis

• XML Practical Experiences

• Discussion

• XML Overview

• XML SWOT Analysis

• XML Practical Experiences

• Discussion

ET-ADRS, 23-25 April 2008, Washington DC, USA 3

World Meteorological Organization

XML Overview

IntroductionXML Overview

Introduction

XML is:

• eXtensible Markup Language

• W3C Recommendation since feb 1998

• subset of Standard Generalized Markup Language (SGML)

• meta-language, used to create markup languages

• designed to represent and exchange data as structured documents across information systems particulary via the Internet

• human readable (text based, unicode support)

• open standard, licence free

XML is:

• eXtensible Markup Language

• W3C Recommendation since feb 1998

• subset of Standard Generalized Markup Language (SGML)

• meta-language, used to create markup languages

• designed to represent and exchange data as structured documents across information systems particulary via the Internet

• human readable (text based, unicode support)

• open standard, licence free

ET-ADRS, 23-25 April 2008, Washington DC, USA 4

World Meteorological Organization

XML Overview

Structure and SemanticsXML Overview

Structure and Semantics

Basic components• elements, attributes, comments, PCDATA, processing

information (e.g. declaration, namespaces)

Well-formed• one root element (hierarchical structure)• no open tags, proper nesting:

<parentTag><childTag1> </childTag1></parentTag> • attributes must be quoted• element names are case sensitive

Valid• document conforms to some semantic rules

(e.g. Document Type Definition DTD or XML Schema XSD)

Basic components• elements, attributes, comments, PCDATA, processing

information (e.g. declaration, namespaces)

Well-formed• one root element (hierarchical structure)• no open tags, proper nesting:

<parentTag><childTag1> </childTag1></parentTag> • attributes must be quoted• element names are case sensitive

Valid• document conforms to some semantic rules

(e.g. Document Type Definition DTD or XML Schema XSD)

ET-ADRS, 23-25 April 2008, Washington DC, USA 5

World Meteorological Organization

XML Overview

Structure and SemanticsXML Overview

Structure and Semantics

Semantic Rules - XML Schema XSD (W3C, 2001)• uses XML syntax (well-formed)• structural definitions, type definitions, defaults• very powerful and flexible• facilitates creation of own libraries with exchange data typesSchemas are also useful for:• prior agreements between parties for data exchange• application development that process data

Alternatives: RELAX NG (OASIS)*, Schematron* *Supported by DSDL: Document Schema Definition Languages (ISO 19757)

Namespaces (W3C, 2006):• Identify your vocabularies (usage: xmlns:prefix=“URI”) • qualify element and attribute names to avoid name collisions• Allows modularization of schemas

– Mix and match elements from multiple schemas in document instances– Import or include from one XML Schema into another (re-use)

Semantic Rules - XML Schema XSD (W3C, 2001)• uses XML syntax (well-formed)• structural definitions, type definitions, defaults• very powerful and flexible• facilitates creation of own libraries with exchange data typesSchemas are also useful for:• prior agreements between parties for data exchange• application development that process data

Alternatives: RELAX NG (OASIS)*, Schematron* *Supported by DSDL: Document Schema Definition Languages (ISO 19757)

Namespaces (W3C, 2006):• Identify your vocabularies (usage: xmlns:prefix=“URI”) • qualify element and attribute names to avoid name collisions• Allows modularization of schemas

– Mix and match elements from multiple schemas in document instances– Import or include from one XML Schema into another (re-use)

ET-ADRS, 23-25 April 2008, Washington DC, USA 6

World Meteorological Organization

XML ExampleXML Example

<?xml version="1.0"?> <swe:CompositePhenomenon xmlns:swe="http://www.opengis.net/swe/1.0.1" xmlns:gml="http://www.opengis.net/gml" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xlink="http://www.w3.org/1999/xlink" xsi:schemaLocation="http://www.opengis.net/swe/1.0.1 http://schemas.opengis.net/sweCommon/1.0.1/swe.xsd" gml:id="weather1" dimension="6"> <gml:name codeSpace="urn:ietf:rfc:2141" >urn:ogc:def:phenomenon:SEEGrid:weather1</gml:name> <swe:base xlink:href="urn:ogc:def:phenomenon:OGC:Weather"/> <swe:component xlink:href="urn:ogc:def:phenomenon:OGC:AirTemperature"/> <swe:component xlink:href="urn:ogc:def:phenomenon:OGC:WindSpeed"/> <swe:component xlink:href="urn:ogc:def:phenomenon:OGC:WindDirection"/> <swe:component xlink:href="http://sweet.jpl.nasa.gov/ontology/property.owl#Visibility"/></swe:CompositePhenomenon>

<?xml version="1.0"?> <swe:CompositePhenomenon xmlns:swe="http://www.opengis.net/swe/1.0.1" xmlns:gml="http://www.opengis.net/gml" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xlink="http://www.w3.org/1999/xlink" xsi:schemaLocation="http://www.opengis.net/swe/1.0.1 http://schemas.opengis.net/sweCommon/1.0.1/swe.xsd" gml:id="weather1" dimension="6"> <gml:name codeSpace="urn:ietf:rfc:2141" >urn:ogc:def:phenomenon:SEEGrid:weather1</gml:name> <swe:base xlink:href="urn:ogc:def:phenomenon:OGC:Weather"/> <swe:component xlink:href="urn:ogc:def:phenomenon:OGC:AirTemperature"/> <swe:component xlink:href="urn:ogc:def:phenomenon:OGC:WindSpeed"/> <swe:component xlink:href="urn:ogc:def:phenomenon:OGC:WindDirection"/> <swe:component xlink:href="http://sweet.jpl.nasa.gov/ontology/property.owl#Visibility"/></swe:CompositePhenomenon>

ET-ADRS, 23-25 April 2008, Washington DC, USA 7

World Meteorological Organization

XML Overview

Processing XMLXML Overview

Processing XML

Programming APIs• DOM - Document Object Model• SAX - Simple API for XML• StAX - Streaming API for XML

Data binding• JAXB - Java Architecture for XML Binding• Hibernate - relational/object/XML mapping tool

Programming APIs• DOM - Document Object Model• SAX - Simple API for XML• StAX - Streaming API for XML

Data binding• JAXB - Java Architecture for XML Binding• Hibernate - relational/object/XML mapping tool

ApplicationXMLdoc

XML validator

XML parser

XMLschema

APIimplementor

XML Infoset

ET-ADRS, 23-25 April 2008, Washington DC, USA 8

World Meteorological Organization

XML Overview

XML extensionsXML Overview

XML extensions

• XPath, XPointer: for addressing XML subdocuments• XLink : to create hyperlinks between resources• XSLT : for rearranging & restructuring XML docs• XQuery : for querying• SOAP : XML-Protocol for message and object

serialization and remote procedure calls• RDF : to describe resource metadata• XForms : for Web forms• XMI : XML Metadata Interchange

…..etc….

• XPath, XPointer: for addressing XML subdocuments• XLink : to create hyperlinks between resources• XSLT : for rearranging & restructuring XML docs• XQuery : for querying• SOAP : XML-Protocol for message and object

serialization and remote procedure calls• RDF : to describe resource metadata• XForms : for Web forms• XMI : XML Metadata Interchange

…..etc….

ET-ADRS, 23-25 April 2008, Washington DC, USA 9

World Meteorological Organization

XML Overview

DialectsXML Overview

Dialects

• Geography Markup Language (GML, ISO19136) • Keyhole Markup Language (KML)• Digital Weather Markup Language (DWML) • Climate Data Markup Language (CDML) • Weather Markup Language (WxML) • Emergency Data Exchange Language (EDXL)• Water Markup Language (WaterML) • Chemical Markup Language (CML),• Electronic Business XML Initiative (ebXML),• Scalable Vector Graphics (SVG), . . .

• Geography Markup Language (GML, ISO19136) • Keyhole Markup Language (KML)• Digital Weather Markup Language (DWML) • Climate Data Markup Language (CDML) • Weather Markup Language (WxML) • Emergency Data Exchange Language (EDXL)• Water Markup Language (WaterML) • Chemical Markup Language (CML),• Electronic Business XML Initiative (ebXML),• Scalable Vector Graphics (SVG), . . .

ET-ADRS, 23-25 April 2008, Washington DC, USA 10

World Meteorological Organization

XML SWOT AnalysisXML SWOT Analysis

ET-ADRS, 23-25 April 2008, Washington DC, USA 11

World Meteorological Organization

XML SWOT Analysis

CriteriaXML SWOT Analysis

Criteria• Ability/suitability to present WMO data

includes also: pictorial data, textual information (e.g. warnings), metadata

• Ability/suitability to exchange dataoperational data between NMHSs and information to users outside NMHSs

• Ability/suitability for store datausage in storage systems

• Compliance with and status of existing standards.

• Available support skills and technology (tooling)

• Other abilitiesability to translate back and forward to other DRSsability/suitability to envelope objects or act as a pseudo-carrier

• Ability/suitability to present WMO dataincludes also: pictorial data, textual information (e.g. warnings), metadata

• Ability/suitability to exchange dataoperational data between NMHSs and information to users outside NMHSs

• Ability/suitability for store datausage in storage systems

• Compliance with and status of existing standards.

• Available support skills and technology (tooling)

• Other abilitiesability to translate back and forward to other DRSsability/suitability to envelope objects or act as a pseudo-carrier

ET-ADRS, 23-25 April 2008, Washington DC, USA 12

World Meteorological Organization

XML SWOT Analysis

SummaryXML SWOT Analysis

Summary

Criteria Score Remarks

Ability/suitability to present WMO dataIncludes also: pictorial data, textual information, metadata

Not suitable for raster data (pictures)

Ability/suitability to exchange dataoperational data between NMHSsinformation to users outside NMHSs

Substantial bandwidth and processing overhead

Ability/suitability to store datausage in storage systems

Mapping schemes to RDBMS tables difficult.

Compliance with and status of existing standards

Only a Core Metadata XML scheme (WMO)

Available Supportskills & technology (tooling)

High level of support

Other abilitiesability to translate back and forward to other DRSs

ability to envelope objects or act as a pseudo-carrier

No BUFR translation tooling found

excellent poor

ET-ADRS, 23-25 April 2008, Washington DC, USA 13

World Meteorological Organization

XML SWOT Analysis

Presenting WMO data(weather, climate, water, atmospheric constituents, oceanography, aviation e.a)

XML SWOT Analysis

Presenting WMO data(weather, climate, water, atmospheric constituents, oceanography, aviation e.a)

Strengths:+ structured text format (hierarchical)+ self-documenting+ can represent common data structures: records, lists, trees+ human-readable+ very flexible – you can define and mix other languages (GML)+ supports modularity (namespaces)

Weaknesses:- fairly verbose and partially redundant- binary data- expressing non-hierarchical relationships is difficult

Opportunities & Threats:• many languages and schemes available that are useful to describe

weather, climate related aspects• potentially complex parsing (many namespaces)

Strengths:+ structured text format (hierarchical)+ self-documenting+ can represent common data structures: records, lists, trees+ human-readable+ very flexible – you can define and mix other languages (GML)+ supports modularity (namespaces)

Weaknesses:- fairly verbose and partially redundant- binary data- expressing non-hierarchical relationships is difficult

Opportunities & Threats:• many languages and schemes available that are useful to describe

weather, climate related aspects• potentially complex parsing (many namespaces)

ET-ADRS, 23-25 April 2008, Washington DC, USA 14

World Meteorological Organization

XML SWOT Analysis

Presenting WMO data XML SWOT Analysis

Presenting WMO data

Pictorial data:+ feature data, vector data (GML, KML)- presenting raster dataRemark• GML (ISO19136) enables OGC services (WFS, WMS WCS)

Text information (e.g. warnings):+ XML is human readable and multi-lingual (unicode)+ EDXL (Emergency Data Exchange Language) by OASIS+ rearranging & restructuring abilities XSL (to text, XHTML, pdf etc)Remark• CAP – Common Alert Protocol (Oasis), open, based on EDXL

Metadata:+ ISO 191xx support (ISO 19139 Metadata Schema)Remark• Available WMO Core Metadata Profile (extension to ISO 19139)

Pictorial data:+ feature data, vector data (GML, KML)- presenting raster dataRemark• GML (ISO19136) enables OGC services (WFS, WMS WCS)

Text information (e.g. warnings):+ XML is human readable and multi-lingual (unicode)+ EDXL (Emergency Data Exchange Language) by OASIS+ rearranging & restructuring abilities XSL (to text, XHTML, pdf etc)Remark• CAP – Common Alert Protocol (Oasis), open, based on EDXL

Metadata:+ ISO 191xx support (ISO 19139 Metadata Schema)Remark• Available WMO Core Metadata Profile (extension to ISO 19139)

ET-ADRS, 23-25 April 2008, Washington DC, USA 15

World Meteorological Organization

XML SWOT Analysis

Exchanging operational data (NMHS and centers)

XML SWOT Analysis

Exchanging operational data (NMHS and centers)

Strenghts:+ strict syntax and parser requirements+ XML schemes, useful for

• validating• defining exchange formats• writing applications that process the data

+ web enabling

Weaknesses:- verbose, bandwidth consumption- processing overhead (complex parsing, validation)

Opportunities & Threats:• data compression (gzip) • no agreed WMO schemas, except for Core Metadata

Usage: barely, e.g. Cyclone XML

Strenghts:+ strict syntax and parser requirements+ XML schemes, useful for

• validating• defining exchange formats• writing applications that process the data

+ web enabling

Weaknesses:- verbose, bandwidth consumption- processing overhead (complex parsing, validation)

Opportunities & Threats:• data compression (gzip) • no agreed WMO schemas, except for Core Metadata

Usage: barely, e.g. Cyclone XML

ET-ADRS, 23-25 April 2008, Washington DC, USA 16

World Meteorological Organization

XML SWOT Analysis

Transmitting information (Users outside NMHSs and centers)

XML SWOT Analysis

Transmitting information (Users outside NMHSs and centers)

Strenghts:+ human readable (text)+ language support (unicode)+ web enabling (Webservices, SOA support, e-Business)+ interoperability+ translation to text or HTML or other XML (XSL)+ open standard, licence free+ technology support (tools)

Weaknesses:- verbose, bandwidth consumption- transmitting binary data

Opportunities & Threats:• usage for metadata exchange is common practice (catalogue data)• too many proprietary schemas to transmit data

Usage: growing practice , e.g. Road Weather Information Network, Canada

Strenghts:+ human readable (text)+ language support (unicode)+ web enabling (Webservices, SOA support, e-Business)+ interoperability+ translation to text or HTML or other XML (XSL)+ open standard, licence free+ technology support (tools)

Weaknesses:- verbose, bandwidth consumption- transmitting binary data

Opportunities & Threats:• usage for metadata exchange is common practice (catalogue data)• too many proprietary schemas to transmit data

Usage: growing practice , e.g. Road Weather Information Network, Canada

ET-ADRS, 23-25 April 2008, Washington DC, USA 17

World Meteorological Organization

XML SWOT Analysis

Usage in storage systems XML SWOT Analysis

Usage in storage systems

Strenghts:+ Native XML DBMSs available+ Many XML enabled relational databases + technology such as XPath and XQuery, SQL/XML

Weaknesses:- Normalizing/mapping XML data into RDBMS tables can be

difficult- A native XML DBMS requires more space (less efficient)

Opportunities & Threats:• No much experience with XML for storage in NMHSs

Usage: writing XML queries iso SQL (growing practice)

Strenghts:+ Native XML DBMSs available+ Many XML enabled relational databases + technology such as XPath and XQuery, SQL/XML

Weaknesses:- Normalizing/mapping XML data into RDBMS tables can be

difficult- A native XML DBMS requires more space (less efficient)

Opportunities & Threats:• No much experience with XML for storage in NMHSs

Usage: writing XML queries iso SQL (growing practice)

ET-ADRS, 23-25 April 2008, Washington DC, USA 18

World Meteorological Organization

XML SWOT Analysis

Standards XML SWOT Analysis

Standards

Strenghts:+ XML is open standard (W3C)+ ISO 191xx standards are supported by XML schemas+ GML (ISO 19136) – XML for geospatial aspects + OGC OM model supported by GML schemas+ Ability to implement UML conceptual models (ISO 19103) in XML

schemes (using XMI)

Weaknesses:- No agreed WMO schemas available (except for Core Metadata)

Opportunities (& Threats):• All harmonization initiatives are based on ISO, W3C and OGC

standards (RA VI: EU-INSPIRE) • OGC Observation and Measurement model used to develop WXXM

(weather exchange model) for avaition (Eurocontrol) • HollowWorld applied to develop WMO Core metadata profile.

Strenghts:+ XML is open standard (W3C)+ ISO 191xx standards are supported by XML schemas+ GML (ISO 19136) – XML for geospatial aspects + OGC OM model supported by GML schemas+ Ability to implement UML conceptual models (ISO 19103) in XML

schemes (using XMI)

Weaknesses:- No agreed WMO schemas available (except for Core Metadata)

Opportunities (& Threats):• All harmonization initiatives are based on ISO, W3C and OGC

standards (RA VI: EU-INSPIRE) • OGC Observation and Measurement model used to develop WXXM

(weather exchange model) for avaition (Eurocontrol) • HollowWorld applied to develop WMO Core metadata profile.

ET-ADRS, 23-25 April 2008, Washington DC, USA 19

World Meteorological Organization

XML SWOT Analysis

Other AbilitiesXML SWOT Analysis

Other Abilities

Translating XML from and to other DRSs • NetCDF <> XML tooling:

– (to) NetCDF Markup Language (NcML), and NcML-GML– (to&from) LeoNetCDF

• BUFR <> XML tooling: unknown• HDF5 <> XML tooling: (to) d5dump

Combining XML with other data formats (e.g. envelope, pseudo carrier)• XML metadata /header for HDF5, NetCDF or BUFR datasets• SOAP (XML protocol) for exchanging data

Translating XML from and to other DRSs • NetCDF <> XML tooling:

– (to) NetCDF Markup Language (NcML), and NcML-GML– (to&from) LeoNetCDF

• BUFR <> XML tooling: unknown• HDF5 <> XML tooling: (to) d5dump

Combining XML with other data formats (e.g. envelope, pseudo carrier)• XML metadata /header for HDF5, NetCDF or BUFR datasets• SOAP (XML protocol) for exchanging data

ET-ADRS, 23-25 April 2008, Washington DC, USA 20

World Meteorological Organization

XML SWOT Analysis

Available support(skills & technology/tools)

XML SWOT Analysis

Available support(skills & technology/tools)

• Numerous XML extensions(XSL, XQuery XForms, XPath, KML, DWML, SOAP etc.) • XMI to translate data models (UML) into XML schemes • GML for geospatial aspects (GIS systems)

• XML technology is widespread, easily available and cheap

• Increasing number of individuals with XML skills

• Numerous XML extensions(XSL, XQuery XForms, XPath, KML, DWML, SOAP etc.) • XMI to translate data models (UML) into XML schemes • GML for geospatial aspects (GIS systems)

• XML technology is widespread, easily available and cheap

• Increasing number of individuals with XML skills

ET-ADRS, 23-25 April 2008, Washington DC, USA 21

World Meteorological Organization

XML SWOT Analysis

SummaryXML SWOT Analysis

Summary

Criteria Score Remarks

Ability/suitability to present WMO dataIncludes also: pictorial data, textual information, metadata

Not suitable for raster data (pictures)

Ability/suitability to exchange dataoperational data between NMHSsinformation to users outside NMHSs

Substantial bandwidth and processing overhead

Ability/suitability to store datausage in storage systems

Mapping schemes to RDBMS tables difficult.

Compliance with and status of existing standards

Only a Core Metadata XML scheme (WMO)

Available Supportskills & technology (tooling)

High level of support

Other abilitiesability to translate back and forward to other DRSs

ability to envelope objects or act as a pseudo-carrier

No BUFR translation tooling found

excellent poor

ET-ADRS, 23-25 April 2008, Washington DC, USA 22

World Meteorological Organization

XML practical experiencesXML practical experiences

• Cyclone XML, TIGGE project• RWIN, Canada• WXXM, EUROCONTROL

• Cyclone XML, TIGGE project• RWIN, Canada• WXXM, EUROCONTROL

ET-ADRS, 23-25 April 2008, Washington DC, USA 23

World Meteorological Organization

XML practical experiencesXML practical experiences

Cyclone XML• XML format for cyclone analyses and forecasts (CXML)• to improve sharing of cyclone information with other users than

NHMSs• alternative for the BUFR/CREX format• development recently started – TIGGE project• details: http://www.bom.gov.au/bmrc/projects/THORPEX/CXML/index.html

RWIN (Road Weather Information Network) • XML format for road weather observations (CMML)• Interchange between Canadian transportation ministries and

contractors (network maintenance) • 200 observing sites every 20 minutes• Operational • details: http://www.clarusinitiative.org/

Cyclone XML• XML format for cyclone analyses and forecasts (CXML)• to improve sharing of cyclone information with other users than

NHMSs• alternative for the BUFR/CREX format• development recently started – TIGGE project• details: http://www.bom.gov.au/bmrc/projects/THORPEX/CXML/index.html

RWIN (Road Weather Information Network) • XML format for road weather observations (CMML)• Interchange between Canadian transportation ministries and

contractors (network maintenance) • 200 observing sites every 20 minutes• Operational • details: http://www.clarusinitiative.org/

ET-ADRS, 23-25 April 2008, Washington DC, USA 24

World Meteorological Organization

XML practical experiencesXML practical experiences

WXXM Weather Exchange model • for data and objects related to weather for aviation• Conceptual Model (WXCM) based on OGC

Observations and Measurements model • Following ISO 19100 principles and OGC

recommendations• using GML for compatibility with third-party GML

applications• Under development • Details:http://www.eurocontrol.int/aim/public/standard_page/met.html

WXXM Weather Exchange model • for data and objects related to weather for aviation• Conceptual Model (WXCM) based on OGC

Observations and Measurements model • Following ISO 19100 principles and OGC

recommendations• using GML for compatibility with third-party GML

applications• Under development • Details:http://www.eurocontrol.int/aim/public/standard_page/met.html

2

AIM (2)

• AIXM – Aeronautical Information Exchange Model

• AMXM – Airport Mapping Exchange Model

WXXM

ANXM

ENXM

TERRAIN

DMEAN

FLTOBJECTS

?

AMXMAIXM

INCOBST

Established

Taking profit of other developments

Not included in AIM Domain & aero data elements only

• WXXM – Weather Information Exchange Model

• ANXM – Airport Network Information Exchange Model

• ENXM – Environmental Information Exchange Model

• DMEAN - Dynamic Management of the European Airspace Network

Prototype released

Under development

Aeronautical Information Exchange Model (AIXM)

ET-ADRS, 23-25 April 2008, Washington DC, USA 25

World Meteorological Organization

DiscussionDiscussion

ET-ADRS, 23-25 April 2008, Washington DC, USA 26

World Meteorological Organization

DiscussionDiscussion

• Is there a need to develop a “WMO Markup language”?

• How to benefit from the extensive and cheap XML support?

• What synergy XML <> HDF5/BUFR/NetCDF is achievable?

• What governance should WMO offer to support XML?

• Is there a need to develop a “WMO Markup language”?

• How to benefit from the extensive and cheap XML support?

• What synergy XML <> HDF5/BUFR/NetCDF is achievable?

• What governance should WMO offer to support XML?