The ICOS Carbon Portal Service Portfolio


Data Discovery services

• Sensor data from >120 stations
• Data and metadata archived in a central trusted repository based on the EUDAT/CDI B2SAFE service
• All ICOS data products get a unique persistent identifier (PID)

• Standardized data processing by experts at Thematic Centers
• Quality-controlled & aggregated data available from Carbon Portal
• Centralized metadata catalog for data discovery & usage tracking

• User-supplied “elaborated” data products distributed by Carbon Portal
• ICOS data staged to HTC & HPC facilities (e.g. EGI FedCloud/EOSC)
• All ICOS data free, available under the Creative Commons Attribution 4.0 (CC BY 4.0) license

What is ICOS?

ICOS (Integrated Carbon Observation System) is a pan-European research infrastructure for observing and understanding the greenhouse gas (GHG) balance of Europe and its adjacent regions. ICOS brings together researchers from three domains – atmosphere, ecosystem and ocean. As of July 2016, ICOS has 11 member countries and performs measurements at a total of 130 stations on land and at sea.

The major mission of ICOS is to collect and make available high-quality observational data from its state-of-the-art measurement stations, which are operated with a long-term perspective. To achieve this, ICOS is developing a series of sustainable data management and computing services based on open-source technology.

• Advanced, faceted search
• Visualize any combination of parameters from the selected datasets
• Inspect metadata of data at “landing pages”
• Download selected datasets placed in “Data Cart”
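The faceted search and “Data Cart” workflow listed above can be illustrated with a minimal Python sketch. It is not the Carbon Portal's implementation: the `DatasetRecord` fields, the `faceted_search` and `facet_counts` helpers, and all catalogue entries are hypothetical names and made-up values chosen only to show the idea that values within one facet are OR-ed while different facets are AND-ed.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DatasetRecord:
    """Hypothetical, simplified metadata record (not the portal's schema)."""
    pid: str       # placeholder identifier
    theme: str     # "atmosphere", "ecosystem" or "ocean"
    station: str   # placeholder station code
    variable: str  # measured quantity


def faceted_search(records, **facets):
    """Return the records that match every given facet.

    Each facet maps a field name to a set of accepted values: values inside
    one facet are OR-ed, separate facets are AND-ed.
    """
    return [
        r for r in records
        if all(getattr(r, field) in accepted for field, accepted in facets.items())
    ]


def facet_counts(records, field):
    """Count how many records fall under each value of one facet."""
    counts = {}
    for r in records:
        value = getattr(r, field)
        counts[value] = counts.get(value, 0) + 1
    return counts


# Made-up catalogue entries, for illustration only.
CATALOGUE = [
    DatasetRecord("pid-0001", "atmosphere", "STN-A", "co2"),
    DatasetRecord("pid-0002", "atmosphere", "STN-B", "ch4"),
    DatasetRecord("pid-0003", "ecosystem", "STN-C", "co2"),
    DatasetRecord("pid-0004", "ocean", "STN-D", "co2"),
]

# Select CO2 datasets from two domains, then place the hits in a "data cart".
hits = faceted_search(CATALOGUE, theme={"atmosphere", "ecosystem"}, variable={"co2"})
data_cart = [r.pid for r in hits]
print(data_cart)
print(facet_counts(CATALOGUE, "theme"))
```

The per-facet counts are what a search interface would typically display next to each filter option, so that users can see how many datasets remain before narrowing the selection.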

• Raster data visualizations
• Time-series of multiple selected variables
• Suitable for any “elaborated” 2D data product, here “atmospheric station emission sensitivity footprints”
• Animations show evolution over time

• Interactive graphs of data produced in the ICOS Jupyter Notebook Virtual Research Environment
• Full Python functionality available
• Visualizations, calculations of statistics etc. based on ICOS-provided code or user scripts
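As a rough idea of the kind of user script that could run in such a notebook environment, the sketch below computes summary statistics for a short time series using only the Python standard library. The `summarize` helper is a hypothetical name, and the sample values are synthetic placeholders, not ICOS measurements.

```python
import statistics

# Synthetic values for illustration only; these are not ICOS measurements.
# Each entry is (hour of day, CO2 mole fraction in ppm).
samples = [(0, 412.0), (6, 414.0), (12, 408.0), (18, 410.0)]


def summarize(values):
    """Return basic descriptive statistics for a sequence of numbers."""
    return {
        "n": len(values),
        "mean": statistics.mean(values),
        "min": min(values),
        "max": max(values),
        "stdev": statistics.stdev(values),  # sample standard deviation
    }


summary = summarize([value for _, value in samples])
for name, value in summary.items():
    print(f"{name}: {value}")
```

In a notebook, the same `summary` dictionary could be passed on to a plotting library of the user's choice to produce the interactive graphs mentioned above.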

Services summary

ICOS operates different services to support both its internal data management and its end users:
• Data ingestion & storage
• Minting of persistent identifiers
• Metadata management & cataloguing
• Search & discovery services
• Visualizations of time series and raster data
• Download service using Data Cart
• Measurement station registry
• Virtual Research Environments: data processing
• Single sign-on & user profile management
• Content & document management system

Technologies used

ICOS Carbon Portal uses the following software and technologies:
• Frontend: ES6 JavaScript, React/Redux, Leaflet, OpenLayers, Bootstrap
• Backend: Scala/Java, Akka
• Metadata: RDF, OWL, SPARQL, PostgreSQL
• Deployment: Linux, Docker, JVM, EGI FedCloud
• Data storage: Linux RAID50, EUDAT/CDI Services
• CMS: Drupal, Alfresco
• Scientific computing: Python, R, Fortran

Trust & reproducibility

• Environmental and climate observation data are unique and cannot be reproduced – extra care is needed to secure long-term storage of the data and their associated metadata
• Datasets should be citable in an unambiguous manner – both in scientific literature and in provenance contexts such as workflows
• ICOS is implementing an ontology-based metadata catalogue describing all aspects of the project: sensors, measurements, data objects, people and publications
• Persistent identifiers are assigned to all data objects: ePIC PIDs for sensor data, DataCite DOIs for finalized and elaborated data products
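To make the link between persistent identifiers and unambiguous citation concrete, here is a minimal Python sketch. It assumes that ePIC PIDs, being Handle-based, resolve through the global Handle proxy at hdl.handle.net and that DOIs resolve through doi.org. The `DataObject` class, the `landing_url` and `cite` helpers, the citation layout, and every identifier and name in the example are hypothetical placeholders, not real ICOS records or the portal's own code.

```python
from dataclasses import dataclass

# Public resolvers: the Handle proxy (assumed here for ePIC PIDs) and the DOI resolver.
RESOLVERS = {
    "epic": "https://hdl.handle.net/",
    "doi": "https://doi.org/",
}


@dataclass(frozen=True)
class DataObject:
    """Hypothetical minimal description of a citable data object."""
    title: str
    creators: tuple
    year: int
    pid: str
    pid_kind: str  # "epic" for sensor data, "doi" for finalized/elaborated products


def landing_url(obj):
    """Build the resolvable URL for a data object's persistent identifier."""
    try:
        return RESOLVERS[obj.pid_kind] + obj.pid
    except KeyError:
        raise ValueError(f"unknown PID kind: {obj.pid_kind!r}") from None


def cite(obj):
    """Format a simple citation string (illustrative layout, not a standard)."""
    authors = ", ".join(obj.creators)
    return f"{authors} ({obj.year}). {obj.title}. {landing_url(obj)}"


# Placeholder objects; the identifiers below do not point to real data.
raw = DataObject("Example sensor time series", ("Example Author",), 2016,
                 "00000/example-suffix", "epic")
product = DataObject("Example elaborated product", ("A. Author", "B. Author"), 2016,
                     "10.0000/example", "doi")

print(cite(raw))
print(cite(product))
```

Because the citation is built from the identifier rather than from a file name or server path, it stays valid when the data are moved, which is what allows the same reference to be used in publications and in workflow provenance records.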

Also see: Poster 137, Karstens et al.
Also see: Poster 142, Van der Laan et al.

The ICOS Carbon Portal Service Portfolio

Alex Vermeulen*,1,2, Maggie Hellström2, Ute Karstens2, Ingrid van der Laan-Luijkx3, Harry Lankreijer2, Oleg Mirzov2, Andre Bjärby2, Roger Groth2, Lars Harrie2, Paul Hedberg2, Mitch Selander2

1 ICOS ERIC
2 ICOS Carbon Portal @ Lund University, Sölvegatan 12, 223 62 Lund, Sweden
3 Wageningen University & Research, Netherlands
*[email protected]