world meteorological organization data representation system xml jan w. noteboom royal netherlands...
TRANSCRIPT
World Meteorological Organization
Data Representation System
XMLData Representation System
XML
Jan W. NoteboomRoyal Netherlands Meteorological Institute (KNMI)[email protected]
Jan W. NoteboomRoyal Netherlands Meteorological Institute (KNMI)[email protected]
CBS OPAG-ISS
Expert Team on the Assessment of Data Representation Systems (ET-ADRS)
Washington DC, USA, 23 - 25 April, 2008
ET-ADRS, 23-25 April 2008, Washington DC, USA 2
World Meteorological Organization
PresentationPresentation
• XML Overview
• XML SWOT Analysis
• XML Practical Experiences
• Discussion
• XML Overview
• XML SWOT Analysis
• XML Practical Experiences
• Discussion
ET-ADRS, 23-25 April 2008, Washington DC, USA 3
World Meteorological Organization
XML Overview
IntroductionXML Overview
Introduction
XML is:
• eXtensible Markup Language
• W3C Recommendation since feb 1998
• subset of Standard Generalized Markup Language (SGML)
• meta-language, used to create markup languages
• designed to represent and exchange data as structured documents across information systems particulary via the Internet
• human readable (text based, unicode support)
• open standard, licence free
XML is:
• eXtensible Markup Language
• W3C Recommendation since feb 1998
• subset of Standard Generalized Markup Language (SGML)
• meta-language, used to create markup languages
• designed to represent and exchange data as structured documents across information systems particulary via the Internet
• human readable (text based, unicode support)
• open standard, licence free
ET-ADRS, 23-25 April 2008, Washington DC, USA 4
World Meteorological Organization
XML Overview
Structure and SemanticsXML Overview
Structure and Semantics
Basic components• elements, attributes, comments, PCDATA, processing
information (e.g. declaration, namespaces)
Well-formed• one root element (hierarchical structure)• no open tags, proper nesting:
<parentTag><childTag1> </childTag1></parentTag> • attributes must be quoted• element names are case sensitive
Valid• document conforms to some semantic rules
(e.g. Document Type Definition DTD or XML Schema XSD)
Basic components• elements, attributes, comments, PCDATA, processing
information (e.g. declaration, namespaces)
Well-formed• one root element (hierarchical structure)• no open tags, proper nesting:
<parentTag><childTag1> </childTag1></parentTag> • attributes must be quoted• element names are case sensitive
Valid• document conforms to some semantic rules
(e.g. Document Type Definition DTD or XML Schema XSD)
ET-ADRS, 23-25 April 2008, Washington DC, USA 5
World Meteorological Organization
XML Overview
Structure and SemanticsXML Overview
Structure and Semantics
Semantic Rules - XML Schema XSD (W3C, 2001)• uses XML syntax (well-formed)• structural definitions, type definitions, defaults• very powerful and flexible• facilitates creation of own libraries with exchange data typesSchemas are also useful for:• prior agreements between parties for data exchange• application development that process data
Alternatives: RELAX NG (OASIS)*, Schematron* *Supported by DSDL: Document Schema Definition Languages (ISO 19757)
Namespaces (W3C, 2006):• Identify your vocabularies (usage: xmlns:prefix=“URI”) • qualify element and attribute names to avoid name collisions• Allows modularization of schemas
– Mix and match elements from multiple schemas in document instances– Import or include from one XML Schema into another (re-use)
Semantic Rules - XML Schema XSD (W3C, 2001)• uses XML syntax (well-formed)• structural definitions, type definitions, defaults• very powerful and flexible• facilitates creation of own libraries with exchange data typesSchemas are also useful for:• prior agreements between parties for data exchange• application development that process data
Alternatives: RELAX NG (OASIS)*, Schematron* *Supported by DSDL: Document Schema Definition Languages (ISO 19757)
Namespaces (W3C, 2006):• Identify your vocabularies (usage: xmlns:prefix=“URI”) • qualify element and attribute names to avoid name collisions• Allows modularization of schemas
– Mix and match elements from multiple schemas in document instances– Import or include from one XML Schema into another (re-use)
ET-ADRS, 23-25 April 2008, Washington DC, USA 6
World Meteorological Organization
XML ExampleXML Example
<?xml version="1.0"?> <swe:CompositePhenomenon xmlns:swe="http://www.opengis.net/swe/1.0.1" xmlns:gml="http://www.opengis.net/gml" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xlink="http://www.w3.org/1999/xlink" xsi:schemaLocation="http://www.opengis.net/swe/1.0.1 http://schemas.opengis.net/sweCommon/1.0.1/swe.xsd" gml:id="weather1" dimension="6"> <gml:name codeSpace="urn:ietf:rfc:2141" >urn:ogc:def:phenomenon:SEEGrid:weather1</gml:name> <swe:base xlink:href="urn:ogc:def:phenomenon:OGC:Weather"/> <swe:component xlink:href="urn:ogc:def:phenomenon:OGC:AirTemperature"/> <swe:component xlink:href="urn:ogc:def:phenomenon:OGC:WindSpeed"/> <swe:component xlink:href="urn:ogc:def:phenomenon:OGC:WindDirection"/> <swe:component xlink:href="http://sweet.jpl.nasa.gov/ontology/property.owl#Visibility"/></swe:CompositePhenomenon>
<?xml version="1.0"?> <swe:CompositePhenomenon xmlns:swe="http://www.opengis.net/swe/1.0.1" xmlns:gml="http://www.opengis.net/gml" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xlink="http://www.w3.org/1999/xlink" xsi:schemaLocation="http://www.opengis.net/swe/1.0.1 http://schemas.opengis.net/sweCommon/1.0.1/swe.xsd" gml:id="weather1" dimension="6"> <gml:name codeSpace="urn:ietf:rfc:2141" >urn:ogc:def:phenomenon:SEEGrid:weather1</gml:name> <swe:base xlink:href="urn:ogc:def:phenomenon:OGC:Weather"/> <swe:component xlink:href="urn:ogc:def:phenomenon:OGC:AirTemperature"/> <swe:component xlink:href="urn:ogc:def:phenomenon:OGC:WindSpeed"/> <swe:component xlink:href="urn:ogc:def:phenomenon:OGC:WindDirection"/> <swe:component xlink:href="http://sweet.jpl.nasa.gov/ontology/property.owl#Visibility"/></swe:CompositePhenomenon>
ET-ADRS, 23-25 April 2008, Washington DC, USA 7
World Meteorological Organization
XML Overview
Processing XMLXML Overview
Processing XML
Programming APIs• DOM - Document Object Model• SAX - Simple API for XML• StAX - Streaming API for XML
Data binding• JAXB - Java Architecture for XML Binding• Hibernate - relational/object/XML mapping tool
Programming APIs• DOM - Document Object Model• SAX - Simple API for XML• StAX - Streaming API for XML
Data binding• JAXB - Java Architecture for XML Binding• Hibernate - relational/object/XML mapping tool
ApplicationXMLdoc
XML validator
XML parser
XMLschema
APIimplementor
XML Infoset
ET-ADRS, 23-25 April 2008, Washington DC, USA 8
World Meteorological Organization
XML Overview
XML extensionsXML Overview
XML extensions
• XPath, XPointer: for addressing XML subdocuments• XLink : to create hyperlinks between resources• XSLT : for rearranging & restructuring XML docs• XQuery : for querying• SOAP : XML-Protocol for message and object
serialization and remote procedure calls• RDF : to describe resource metadata• XForms : for Web forms• XMI : XML Metadata Interchange
…..etc….
• XPath, XPointer: for addressing XML subdocuments• XLink : to create hyperlinks between resources• XSLT : for rearranging & restructuring XML docs• XQuery : for querying• SOAP : XML-Protocol for message and object
serialization and remote procedure calls• RDF : to describe resource metadata• XForms : for Web forms• XMI : XML Metadata Interchange
…..etc….
ET-ADRS, 23-25 April 2008, Washington DC, USA 9
World Meteorological Organization
XML Overview
DialectsXML Overview
Dialects
• Geography Markup Language (GML, ISO19136) • Keyhole Markup Language (KML)• Digital Weather Markup Language (DWML) • Climate Data Markup Language (CDML) • Weather Markup Language (WxML) • Emergency Data Exchange Language (EDXL)• Water Markup Language (WaterML) • Chemical Markup Language (CML),• Electronic Business XML Initiative (ebXML),• Scalable Vector Graphics (SVG), . . .
• Geography Markup Language (GML, ISO19136) • Keyhole Markup Language (KML)• Digital Weather Markup Language (DWML) • Climate Data Markup Language (CDML) • Weather Markup Language (WxML) • Emergency Data Exchange Language (EDXL)• Water Markup Language (WaterML) • Chemical Markup Language (CML),• Electronic Business XML Initiative (ebXML),• Scalable Vector Graphics (SVG), . . .
ET-ADRS, 23-25 April 2008, Washington DC, USA 10
World Meteorological Organization
XML SWOT AnalysisXML SWOT Analysis
ET-ADRS, 23-25 April 2008, Washington DC, USA 11
World Meteorological Organization
XML SWOT Analysis
CriteriaXML SWOT Analysis
Criteria• Ability/suitability to present WMO data
includes also: pictorial data, textual information (e.g. warnings), metadata
• Ability/suitability to exchange dataoperational data between NMHSs and information to users outside NMHSs
• Ability/suitability for store datausage in storage systems
• Compliance with and status of existing standards.
• Available support skills and technology (tooling)
• Other abilitiesability to translate back and forward to other DRSsability/suitability to envelope objects or act as a pseudo-carrier
• Ability/suitability to present WMO dataincludes also: pictorial data, textual information (e.g. warnings), metadata
• Ability/suitability to exchange dataoperational data between NMHSs and information to users outside NMHSs
• Ability/suitability for store datausage in storage systems
• Compliance with and status of existing standards.
• Available support skills and technology (tooling)
• Other abilitiesability to translate back and forward to other DRSsability/suitability to envelope objects or act as a pseudo-carrier
ET-ADRS, 23-25 April 2008, Washington DC, USA 12
World Meteorological Organization
XML SWOT Analysis
SummaryXML SWOT Analysis
Summary
Criteria Score Remarks
Ability/suitability to present WMO dataIncludes also: pictorial data, textual information, metadata
Not suitable for raster data (pictures)
Ability/suitability to exchange dataoperational data between NMHSsinformation to users outside NMHSs
Substantial bandwidth and processing overhead
Ability/suitability to store datausage in storage systems
Mapping schemes to RDBMS tables difficult.
Compliance with and status of existing standards
Only a Core Metadata XML scheme (WMO)
Available Supportskills & technology (tooling)
High level of support
Other abilitiesability to translate back and forward to other DRSs
ability to envelope objects or act as a pseudo-carrier
No BUFR translation tooling found
excellent poor
ET-ADRS, 23-25 April 2008, Washington DC, USA 13
World Meteorological Organization
XML SWOT Analysis
Presenting WMO data(weather, climate, water, atmospheric constituents, oceanography, aviation e.a)
XML SWOT Analysis
Presenting WMO data(weather, climate, water, atmospheric constituents, oceanography, aviation e.a)
Strengths:+ structured text format (hierarchical)+ self-documenting+ can represent common data structures: records, lists, trees+ human-readable+ very flexible – you can define and mix other languages (GML)+ supports modularity (namespaces)
Weaknesses:- fairly verbose and partially redundant- binary data- expressing non-hierarchical relationships is difficult
Opportunities & Threats:• many languages and schemes available that are useful to describe
weather, climate related aspects• potentially complex parsing (many namespaces)
Strengths:+ structured text format (hierarchical)+ self-documenting+ can represent common data structures: records, lists, trees+ human-readable+ very flexible – you can define and mix other languages (GML)+ supports modularity (namespaces)
Weaknesses:- fairly verbose and partially redundant- binary data- expressing non-hierarchical relationships is difficult
Opportunities & Threats:• many languages and schemes available that are useful to describe
weather, climate related aspects• potentially complex parsing (many namespaces)
ET-ADRS, 23-25 April 2008, Washington DC, USA 14
World Meteorological Organization
XML SWOT Analysis
Presenting WMO data XML SWOT Analysis
Presenting WMO data
Pictorial data:+ feature data, vector data (GML, KML)- presenting raster dataRemark• GML (ISO19136) enables OGC services (WFS, WMS WCS)
Text information (e.g. warnings):+ XML is human readable and multi-lingual (unicode)+ EDXL (Emergency Data Exchange Language) by OASIS+ rearranging & restructuring abilities XSL (to text, XHTML, pdf etc)Remark• CAP – Common Alert Protocol (Oasis), open, based on EDXL
Metadata:+ ISO 191xx support (ISO 19139 Metadata Schema)Remark• Available WMO Core Metadata Profile (extension to ISO 19139)
Pictorial data:+ feature data, vector data (GML, KML)- presenting raster dataRemark• GML (ISO19136) enables OGC services (WFS, WMS WCS)
Text information (e.g. warnings):+ XML is human readable and multi-lingual (unicode)+ EDXL (Emergency Data Exchange Language) by OASIS+ rearranging & restructuring abilities XSL (to text, XHTML, pdf etc)Remark• CAP – Common Alert Protocol (Oasis), open, based on EDXL
Metadata:+ ISO 191xx support (ISO 19139 Metadata Schema)Remark• Available WMO Core Metadata Profile (extension to ISO 19139)
ET-ADRS, 23-25 April 2008, Washington DC, USA 15
World Meteorological Organization
XML SWOT Analysis
Exchanging operational data (NMHS and centers)
XML SWOT Analysis
Exchanging operational data (NMHS and centers)
Strenghts:+ strict syntax and parser requirements+ XML schemes, useful for
• validating• defining exchange formats• writing applications that process the data
+ web enabling
Weaknesses:- verbose, bandwidth consumption- processing overhead (complex parsing, validation)
Opportunities & Threats:• data compression (gzip) • no agreed WMO schemas, except for Core Metadata
Usage: barely, e.g. Cyclone XML
Strenghts:+ strict syntax and parser requirements+ XML schemes, useful for
• validating• defining exchange formats• writing applications that process the data
+ web enabling
Weaknesses:- verbose, bandwidth consumption- processing overhead (complex parsing, validation)
Opportunities & Threats:• data compression (gzip) • no agreed WMO schemas, except for Core Metadata
Usage: barely, e.g. Cyclone XML
ET-ADRS, 23-25 April 2008, Washington DC, USA 16
World Meteorological Organization
XML SWOT Analysis
Transmitting information (Users outside NMHSs and centers)
XML SWOT Analysis
Transmitting information (Users outside NMHSs and centers)
Strenghts:+ human readable (text)+ language support (unicode)+ web enabling (Webservices, SOA support, e-Business)+ interoperability+ translation to text or HTML or other XML (XSL)+ open standard, licence free+ technology support (tools)
Weaknesses:- verbose, bandwidth consumption- transmitting binary data
Opportunities & Threats:• usage for metadata exchange is common practice (catalogue data)• too many proprietary schemas to transmit data
Usage: growing practice , e.g. Road Weather Information Network, Canada
Strenghts:+ human readable (text)+ language support (unicode)+ web enabling (Webservices, SOA support, e-Business)+ interoperability+ translation to text or HTML or other XML (XSL)+ open standard, licence free+ technology support (tools)
Weaknesses:- verbose, bandwidth consumption- transmitting binary data
Opportunities & Threats:• usage for metadata exchange is common practice (catalogue data)• too many proprietary schemas to transmit data
Usage: growing practice , e.g. Road Weather Information Network, Canada
ET-ADRS, 23-25 April 2008, Washington DC, USA 17
World Meteorological Organization
XML SWOT Analysis
Usage in storage systems XML SWOT Analysis
Usage in storage systems
Strenghts:+ Native XML DBMSs available+ Many XML enabled relational databases + technology such as XPath and XQuery, SQL/XML
Weaknesses:- Normalizing/mapping XML data into RDBMS tables can be
difficult- A native XML DBMS requires more space (less efficient)
Opportunities & Threats:• No much experience with XML for storage in NMHSs
Usage: writing XML queries iso SQL (growing practice)
Strenghts:+ Native XML DBMSs available+ Many XML enabled relational databases + technology such as XPath and XQuery, SQL/XML
Weaknesses:- Normalizing/mapping XML data into RDBMS tables can be
difficult- A native XML DBMS requires more space (less efficient)
Opportunities & Threats:• No much experience with XML for storage in NMHSs
Usage: writing XML queries iso SQL (growing practice)
ET-ADRS, 23-25 April 2008, Washington DC, USA 18
World Meteorological Organization
XML SWOT Analysis
Standards XML SWOT Analysis
Standards
Strenghts:+ XML is open standard (W3C)+ ISO 191xx standards are supported by XML schemas+ GML (ISO 19136) – XML for geospatial aspects + OGC OM model supported by GML schemas+ Ability to implement UML conceptual models (ISO 19103) in XML
schemes (using XMI)
Weaknesses:- No agreed WMO schemas available (except for Core Metadata)
Opportunities (& Threats):• All harmonization initiatives are based on ISO, W3C and OGC
standards (RA VI: EU-INSPIRE) • OGC Observation and Measurement model used to develop WXXM
(weather exchange model) for avaition (Eurocontrol) • HollowWorld applied to develop WMO Core metadata profile.
Strenghts:+ XML is open standard (W3C)+ ISO 191xx standards are supported by XML schemas+ GML (ISO 19136) – XML for geospatial aspects + OGC OM model supported by GML schemas+ Ability to implement UML conceptual models (ISO 19103) in XML
schemes (using XMI)
Weaknesses:- No agreed WMO schemas available (except for Core Metadata)
Opportunities (& Threats):• All harmonization initiatives are based on ISO, W3C and OGC
standards (RA VI: EU-INSPIRE) • OGC Observation and Measurement model used to develop WXXM
(weather exchange model) for avaition (Eurocontrol) • HollowWorld applied to develop WMO Core metadata profile.
ET-ADRS, 23-25 April 2008, Washington DC, USA 19
World Meteorological Organization
XML SWOT Analysis
Other AbilitiesXML SWOT Analysis
Other Abilities
Translating XML from and to other DRSs • NetCDF <> XML tooling:
– (to) NetCDF Markup Language (NcML), and NcML-GML– (to&from) LeoNetCDF
• BUFR <> XML tooling: unknown• HDF5 <> XML tooling: (to) d5dump
Combining XML with other data formats (e.g. envelope, pseudo carrier)• XML metadata /header for HDF5, NetCDF or BUFR datasets• SOAP (XML protocol) for exchanging data
Translating XML from and to other DRSs • NetCDF <> XML tooling:
– (to) NetCDF Markup Language (NcML), and NcML-GML– (to&from) LeoNetCDF
• BUFR <> XML tooling: unknown• HDF5 <> XML tooling: (to) d5dump
Combining XML with other data formats (e.g. envelope, pseudo carrier)• XML metadata /header for HDF5, NetCDF or BUFR datasets• SOAP (XML protocol) for exchanging data
ET-ADRS, 23-25 April 2008, Washington DC, USA 20
World Meteorological Organization
XML SWOT Analysis
Available support(skills & technology/tools)
XML SWOT Analysis
Available support(skills & technology/tools)
• Numerous XML extensions(XSL, XQuery XForms, XPath, KML, DWML, SOAP etc.) • XMI to translate data models (UML) into XML schemes • GML for geospatial aspects (GIS systems)
• XML technology is widespread, easily available and cheap
• Increasing number of individuals with XML skills
• Numerous XML extensions(XSL, XQuery XForms, XPath, KML, DWML, SOAP etc.) • XMI to translate data models (UML) into XML schemes • GML for geospatial aspects (GIS systems)
• XML technology is widespread, easily available and cheap
• Increasing number of individuals with XML skills
ET-ADRS, 23-25 April 2008, Washington DC, USA 21
World Meteorological Organization
XML SWOT Analysis
SummaryXML SWOT Analysis
Summary
Criteria Score Remarks
Ability/suitability to present WMO dataIncludes also: pictorial data, textual information, metadata
Not suitable for raster data (pictures)
Ability/suitability to exchange dataoperational data between NMHSsinformation to users outside NMHSs
Substantial bandwidth and processing overhead
Ability/suitability to store datausage in storage systems
Mapping schemes to RDBMS tables difficult.
Compliance with and status of existing standards
Only a Core Metadata XML scheme (WMO)
Available Supportskills & technology (tooling)
High level of support
Other abilitiesability to translate back and forward to other DRSs
ability to envelope objects or act as a pseudo-carrier
No BUFR translation tooling found
excellent poor
ET-ADRS, 23-25 April 2008, Washington DC, USA 22
World Meteorological Organization
XML practical experiencesXML practical experiences
• Cyclone XML, TIGGE project• RWIN, Canada• WXXM, EUROCONTROL
• Cyclone XML, TIGGE project• RWIN, Canada• WXXM, EUROCONTROL
ET-ADRS, 23-25 April 2008, Washington DC, USA 23
World Meteorological Organization
XML practical experiencesXML practical experiences
Cyclone XML• XML format for cyclone analyses and forecasts (CXML)• to improve sharing of cyclone information with other users than
NHMSs• alternative for the BUFR/CREX format• development recently started – TIGGE project• details: http://www.bom.gov.au/bmrc/projects/THORPEX/CXML/index.html
RWIN (Road Weather Information Network) • XML format for road weather observations (CMML)• Interchange between Canadian transportation ministries and
contractors (network maintenance) • 200 observing sites every 20 minutes• Operational • details: http://www.clarusinitiative.org/
Cyclone XML• XML format for cyclone analyses and forecasts (CXML)• to improve sharing of cyclone information with other users than
NHMSs• alternative for the BUFR/CREX format• development recently started – TIGGE project• details: http://www.bom.gov.au/bmrc/projects/THORPEX/CXML/index.html
RWIN (Road Weather Information Network) • XML format for road weather observations (CMML)• Interchange between Canadian transportation ministries and
contractors (network maintenance) • 200 observing sites every 20 minutes• Operational • details: http://www.clarusinitiative.org/
ET-ADRS, 23-25 April 2008, Washington DC, USA 24
World Meteorological Organization
XML practical experiencesXML practical experiences
WXXM Weather Exchange model • for data and objects related to weather for aviation• Conceptual Model (WXCM) based on OGC
Observations and Measurements model • Following ISO 19100 principles and OGC
recommendations• using GML for compatibility with third-party GML
applications• Under development • Details:http://www.eurocontrol.int/aim/public/standard_page/met.html
WXXM Weather Exchange model • for data and objects related to weather for aviation• Conceptual Model (WXCM) based on OGC
Observations and Measurements model • Following ISO 19100 principles and OGC
recommendations• using GML for compatibility with third-party GML
applications• Under development • Details:http://www.eurocontrol.int/aim/public/standard_page/met.html
2
AIM (2)
• AIXM – Aeronautical Information Exchange Model
• AMXM – Airport Mapping Exchange Model
WXXM
ANXM
ENXM
TERRAIN
DMEAN
FLTOBJECTS
?
AMXMAIXM
INCOBST
Established
Taking profit of other developments
Not included in AIM Domain & aero data elements only
• WXXM – Weather Information Exchange Model
• ANXM – Airport Network Information Exchange Model
• ENXM – Environmental Information Exchange Model
• DMEAN - Dynamic Management of the European Airspace Network
Prototype released
Under development
Aeronautical Information Exchange Model (AIXM)
ET-ADRS, 23-25 April 2008, Washington DC, USA 25
World Meteorological Organization
DiscussionDiscussion
ET-ADRS, 23-25 April 2008, Washington DC, USA 26
World Meteorological Organization
DiscussionDiscussion
• Is there a need to develop a “WMO Markup language”?
• How to benefit from the extensive and cheap XML support?
• What synergy XML <> HDF5/BUFR/NetCDF is achievable?
• What governance should WMO offer to support XML?
• Is there a need to develop a “WMO Markup language”?
• How to benefit from the extensive and cheap XML support?
• What synergy XML <> HDF5/BUFR/NetCDF is achievable?
• What governance should WMO offer to support XML?