the visibility information exchange web system (views): an approach to air quality data management...

1
The Visibility Information Exchange Web System (VIEWS): The Visibility Information Exchange Web System (VIEWS): An Approach to Air Quality Data Management and Presentation An Approach to Air Quality Data Management and Presentation In a broader sense, VIEWS facilitates the research and understanding of air quality issues in general. To fulfill these goals, the VIEWS team developed a generalized relational data model for air quality data and implemented a database system, website, and supporting software infrastructure for importing, managing, and presenting air quality data from a wide variety of sources. A primary focus of these efforts was to design a system capable of integrating diverse data sets into a common schematic and semantic framework in order to more easily manage and compare the constituent data. The data model was designed to support the mapping of source metadata onto a common collection of integrated metadata where possible, and an extensive software system was developed to import and transform the source data into a common relational schema while performing the associated metadata mappings. A carefully designed system of relational constraints and database rules was developed to ensure the accuracy, integrity, and relational consistency of all imported data. To make the data readily available for browsing, download, and analysis, the VIEWS team implemented a suite of online tools and resources in the form of the VIEWS website. Scientists, researchers, and policy makers from a broad range of organizations now use the website as a primary source of air quality data and resources. The VIEWS team intends to continually import new air quality data and improve its suite of tools for accessing and viewing this data. Web Address : http://vista.cira.colostate.edu/views Sponsor : Five EPA Regional Planning Organizations (RPOs) Guiding Body : VIEWS Steering Committee Location : Cooperative Institute for Research in the Atmosphere (CIRA), Colorado State University, Fort Collins, CO Staff : Scientists, researchers, and IT professionals Affiliations : Interagency Monitoring of Protected Visual Environments (IMPROVE) The Visibility Information Exchange Web System (VIEWS) is a database and website system that provides access to air quality data, data summaries, and research tools in support of the Regional Haze Rule enacted by the U.S. Environmental Protection Agency (EPA) to reduce regional haze in national parks and wilderness areas. Introduction Wet Deposition (NADP/NTN) Aqueous rain water concentration (mg/L). Precipitation weighted mean concentration (mg/L). Deposition (kg/ha) - the product of aqueous SO4 2- concentration in collected rain water and total precipitation over a given time period (e.g. season, year). Dry Deposition (CASTNet) Dry deposition (kg/ha) for atmospheric particles and gas phase species (e.g. SO2, HNO3, NH3) - the product of the species’ deposition velocity and the ambient air concentration integrated over time (e.g. season, year). Air Concentrations (IMPROVE, CASTNet, STN, other speciated networks) Aerosol and gas phase air concentrations (ug/m3). Inter-comparisons (Sulfur) Compare raw concentration data (ug/m3 to mg/L) Compare slopes in respective trends, for example S (SO2 plus SO4 2- ) air concentrations to S deposition expressed as %/season, %/yr. AIR D ATA _O LTP Data Acquisition System AIRDATA_SOURCE M etadata Im portSystem D ata Im portSystem Validation Validation D ata W arehouse G eneration System Validation AIRDATA_DSS AIR D ATA_O LTP AIRDATA_DSS AIR D ATA_O LTP AIRDATA_SOURCE Backup and Restore System Replication And Archival System Sites Data SOP SQ L SeverD TS N TN Sites N TN Param eters NTN M ethods N TN Flags -C hecksum s -R ecord C ounts Transform ation -Duplication -D B Integrity Mapping Validation ResultTable Validation Extraction Entry SO URCE DB O LTP Source O LTP O LTP Entry Form s Conclusion With deposition data now incorporated into a common schematic and semantic format with aerosol data in the VIEWS database system, the issues involved in performing meaningful comparisons between the two types of data can be more easily identified, explored, and resolved. Data Acquisition System: Data Acquisition System: Accepts submission of new data in a variety of formats Can automatically extract data from known online sources Uses database replication where possible Initially imports data “as-is” into the source database Metadata Import System: Facilitates the entry of new metadata Validates new metadata entries Detects overlap with existing metadata Data Import System: Extracts data from the source database Scrubs data and performs conversions Maps source metadata to integrated metadata Transforms the data into an integrated schema Verifies and validates imported data Loads data into the back-end OLTP system OLTP: OLTP: Functions as the “back-end” database Fully relational and in third normal form Used for data import, validation, and management Data Warehouse Generation System: Extracts data from the OLTP De-normalizes and transforms data Archives snapshots of existing data Loads data into the data warehouse Builds indexes on relevant tables Data Warehouse: Functions a the “front-end” database Uses a de-normalized star schema Used for querying and archiving data Automatically generated from OLTP Backup and Restore System: Backup and Restore System: Automatically and periodically backs-up critical VIEWS databases Restores database backups on demand Replication and Archival System: Vertically partitions the warehouse by time period Takes a full “snapshot” of the data warehouse at regular intervals Creates a historical audit trail for verifying archive integrity Data Acquisition: DTS Wizard used to import NTN Sites table DTS Wizard used to import NTN Data table NTN SOP information entered manually Metadata Import: Used SQL scripts to extract unique metadata from source data set Created new records for NTN metadata using the extracted source metadata Data Import: Used stored procedures and VB routines to transform the data from its source format Applied DB integrity constraints to verify the transformations Mapped source codes to relational primary keys (IDs) Loaded the results into a new table and validated the data using a series of row and column checksums and record counts NADP NTN Data Import Browsing NTN Metadata: A Basic Comparison between NTN and IMPROVE: Notes and Issues Regarding Comparisons Between Aerosol and Deposition Data: A SC IID ata File C ollection: A collection ofdata and m etadata in ASC IItextfile form at. Site B row ser: Forexploring detailed m onitoring site m etadata, history,photographs. Third Party Tools: A collection ofrelevantairquality research tools provided by various organizations and institutions. C lass IA rea W ebcam s: A grow ing collection oflinks to C lass IArea w ebcam s thatprovide a visual m ethod for assessing visibility and general airquality in national parks and w ilderness areas. D atabase Q uery W izard: Forbuilding,formatting,and subm itting queries to the VIEW S airquality database. M etadata B row ser: Forexploring the attributes, history,and organization of m onitoring netw orks and sites. A irQ uality C atalogs: A com pilation ofcarefully selected and described air-quality-related resources. C harts H TM L Text IM PRO VE AnnualSum m ary: D ata sum m aries based upon R egional H aze R ule-adjusted IM PRO VE data and intended to guide the R PO s in assessing theircom pliance progress. C om position C ontour M aps O ther A nalyses Some VIEWS Tools: Some VIEWS Tools: Site Browser ASCII Files Metadata Browser Query Wizard Third Party Tools Webcams Resource Catalogs Annual Summary

Post on 20-Dec-2015

212 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: The Visibility Information Exchange Web System (VIEWS): An Approach to Air Quality Data Management and Presentation In a broader sense, VIEWS facilitates

The Visibility Information Exchange Web System (VIEWS):The Visibility Information Exchange Web System (VIEWS):An Approach to Air Quality Data Management and PresentationAn Approach to Air Quality Data Management and Presentation

The Visibility Information Exchange Web System (VIEWS):The Visibility Information Exchange Web System (VIEWS):An Approach to Air Quality Data Management and PresentationAn Approach to Air Quality Data Management and Presentation

In a broader sense, VIEWS facilitates the research and understanding of air quality issues in general. To fulfill these goals, the VIEWS team developed a generalized relational data model for air quality data and implemented a database system, website, and supporting software infrastructure for importing, managing, and presenting air quality data from a wide variety of sources. A primary focus of these efforts was to design a system capable of integrating diverse data sets into a common schematic and semantic framework in order to more easily manage and compare the constituent data. The data model was designed to support the mapping of source metadata onto a common collection of integrated metadata where possible, and an extensive software system was developed to import and transform the source data into a common relational schema while performing the associated metadata mappings. A carefully designed system of relational constraints and database rules was developed to ensure the accuracy, integrity, and relational consistency of all imported data. To make the data readily available for browsing, download, and analysis, the VIEWS team implemented a suite of online tools and resources in the form of the VIEWS website. Scientists, researchers, and policy makers from a broad range of organizations now use the website as a primary source of air quality data and resources. The VIEWS team intends to continually import new air quality data and improve its suite of tools for accessing and viewing this data.

Web Address: http://vista.cira.colostate.edu/views Sponsor: Five EPA Regional Planning Organizations

(RPOs) Guiding Body: VIEWS Steering Committee Location: Cooperative Institute for Research in the

Atmosphere (CIRA), Colorado State University, Fort Collins, CO

Staff: Scientists, researchers, and IT professionals Affiliations: Interagency Monitoring of Protected

Visual Environments (IMPROVE)

The Visibility Information Exchange Web System (VIEWS) is a database and website system that provides access to air quality data, data summaries, and research tools in support of the Regional Haze Rule enacted by the U.S. Environmental Protection Agency (EPA) to reduce regional haze in national parks and wilderness areas.

Introduction

Wet Deposition (NADP/NTN) Aqueous rain water concentration (mg/L). Precipitation weighted mean concentration (mg/L). Deposition (kg/ha) - the product of aqueous SO42- concentration in collected rain water and total precipitation over a

given time period (e.g. season, year).

Dry Deposition (CASTNet) Dry deposition (kg/ha) for atmospheric particles and gas phase species (e.g. SO2, HNO3, NH3) - the product of the

species’ deposition velocity and the ambient air concentration integrated over time (e.g. season, year).

Air Concentrations (IMPROVE, CASTNet, STN, other speciated networks) Aerosol and gas phase air concentrations (ug/m3).

Inter-comparisons (Sulfur) Compare raw concentration data (ug/m3 to mg/L) Compare slopes in respective trends, for example S (SO2 plus SO42-) air concentrations to S deposition expressed as

%/season, %/yr.

AIRDATA_OLTP

Data Acquisition

System

AIRDATA_SOURCE

Metadata Import System

Data Import System

Validation

Validation

Data Warehouse Generation System

Validation

AIRDATA_DSSAIRDATA_OLTP

AIRDATA_DSS

AIRDATA_OLTP

AIRDATA_SOURCE

Backupand

Restore System

ReplicationAnd

ArchivalSystem

Sites Data

SOP

SQL Sever DTS

NTN Sites

NTN Parameters

NTN Methods

NTN Flags

- Checksums - Record Counts

Transformation

- Duplication - DB Integrity

MappingValidation

Result Table

Validation

Extraction Entry

SOURCE DB

OLTP

Source

OLTP OLTP

Entry Forms

ConclusionWith deposition data now incorporated into a common schematic and semantic format with aerosol data in the VIEWS database system, the issues involved in performing meaningful comparisons between the two types of data can be more easily identified, explored, and resolved.

Data Acquisition System:Data Acquisition System:• Accepts submission of new data in a variety of formats• Can automatically extract data from known online sources• Uses database replication where possible• Initially imports data “as-is” into the source database

Metadata Import System:• Facilitates the entry of new metadata• Validates new metadata entries• Detects overlap with existing metadata

Data Import System:• Extracts data from the source database• Scrubs data and performs conversions• Maps source metadata to integrated metadata• Transforms the data into an integrated schema• Verifies and validates imported data• Loads data into the back-end OLTP system

OLTP:OLTP:• Functions as the “back-end” database• Fully relational and in third normal form• Used for data import, validation, and management

Data Warehouse Generation System:• Extracts data from the OLTP• De-normalizes and transforms data• Archives snapshots of existing data• Loads data into the data warehouse• Builds indexes on relevant tables

Data Warehouse:• Functions a the “front-end” database• Uses a de-normalized star schema• Used for querying and archiving data• Automatically generated from OLTP

Backup and Restore System:Backup and Restore System:• Automatically and periodically backs-up critical VIEWS

databases• Restores database backups on demand

Replication and Archival System:• Vertically partitions the warehouse by time period• Takes a full “snapshot” of the data warehouse at

regular intervals• Creates a historical audit trail for verifying archive

integrity

Data Acquisition:• DTS Wizard used to import NTN Sites table• DTS Wizard used to import NTN Data table• NTN SOP information entered manually

Metadata Import:• Used SQL scripts to extract unique

metadata from source data set• Created new records for NTN metadata

using the extracted source metadata

Data Import:• Used stored procedures and VB routines to

transform the data from its source format• Applied DB integrity constraints to verify the

transformations• Mapped source codes to relational primary

keys (IDs)• Loaded the results into a new table and

validated the data using a series of row and column checksums and record counts

NADP NTN Data Import

Browsing NTN Metadata: A Basic Comparison between NTN and IMPROVE:

Notes and Issues Regarding Comparisons Between Aerosol and Deposition Data:

ASCII Data File Collection:A collection of data and metadata in ASCII text file format.

Site Browser:For exploring detailed monitoring site metadata, history, photographs.

Third Party Tools:A collection of relevant air quality research tools provided by various organizations and institutions.

Class I Area Webcams:A growing collection of links to Class I Area webcams that provide a visual method for assessing visibility and general air quality in national parks and wilderness areas.

Database Query Wizard:For building, formatting, and submitting queries to the VIEWS air quality database.

Metadata Browser:For exploring the attributes, history, and organization of monitoring networks and sites.

Air Quality Catalogs:A compilation of carefully selected and described air-quality-related resources.

Charts

HTML

Text

IMPROVE Annual Summary:Data summaries based upon Regional Haze Rule-adjusted IMPROVE data and intended to guide the RPOs in assessing their compliance progress.

Composition

Contour Maps

Other Analyses

Some VIEWS Tools:Some VIEWS Tools:

• Site Browser

• ASCII Files

• Metadata Browser

• Query Wizard

• Third Party Tools

• Webcams

• Resource Catalogs

• Annual Summary