Wide access to spatial Citizen Science data - ECSA Berlin 2016
![Page 1: Wide access to spatial Citizen Science data - ECSA Berlin 2016](https://reader031.vdocument.in/reader031/viewer/2022022203/587807ea1a28ab971e8b4f39/html5/thumbnails/1.jpg)
Wide access to spatial Citizen Science data
ECSA 2016, Berlin
Paul van Genuchten, Lieke Verhelst, Clemens Portele
![Page 2: Wide access to spatial Citizen Science data - ECSA Berlin 2016](https://reader031.vdocument.in/reader031/viewer/2022022203/587807ea1a28ab971e8b4f39/html5/thumbnails/2.jpg)
![Page 3: Wide access to spatial Citizen Science data - ECSA Berlin 2016](https://reader031.vdocument.in/reader031/viewer/2022022203/587807ea1a28ab971e8b4f39/html5/thumbnails/3.jpg)
About the authors
Paul van Genuchten is a software engineer at “GeoCat BV”, helping governments publish (spatial/open) data on the web.
Lieke Verhelst is the owner of “Linked Data Factory”. Lieke is a linked data expert and has developed multiple ontologies in the domains of food safety, soil science, nature reserves and water management.
Clemens Portele is managing director of “interactive instruments GmbH”. interactive instruments is a software engineering company in the spatial data infrastructure domain and an active contributor to multiple OGC standards.
![Page 4: Wide access to spatial Citizen Science data - ECSA Berlin 2016](https://reader031.vdocument.in/reader031/viewer/2022022203/587807ea1a28ab971e8b4f39/html5/thumbnails/4.jpg)
COBWEB
COBWEB is a research project to empower citizens with the ability to collect environmental information using mobile devices, which will then be made suitable for use in research, decision making and policy formation.
GeoCat improves GeoNetwork opensource, targeting citizen science data discovery and visualisation in the scope of the COBWEB FP7 project.
The project has received funding from the European Union under grant agreement No 308513
![Page 5: Wide access to spatial Citizen Science data - ECSA Berlin 2016](https://reader031.vdocument.in/reader031/viewer/2022022203/587807ea1a28ab971e8b4f39/html5/thumbnails/5.jpg)
![Page 6: Wide access to spatial Citizen Science data - ECSA Berlin 2016](https://reader031.vdocument.in/reader031/viewer/2022022203/587807ea1a28ab971e8b4f39/html5/thumbnails/6.jpg)
![Page 7: Wide access to spatial Citizen Science data - ECSA Berlin 2016](https://reader031.vdocument.in/reader031/viewer/2022022203/587807ea1a28ab971e8b4f39/html5/thumbnails/7.jpg)
![Page 8: Wide access to spatial Citizen Science data - ECSA Berlin 2016](https://reader031.vdocument.in/reader031/viewer/2022022203/587807ea1a28ab971e8b4f39/html5/thumbnails/8.jpg)
![Page 9: Wide access to spatial Citizen Science data - ECSA Berlin 2016](https://reader031.vdocument.in/reader031/viewer/2022022203/587807ea1a28ab971e8b4f39/html5/thumbnails/9.jpg)
![Page 10: Wide access to spatial Citizen Science data - ECSA Berlin 2016](https://reader031.vdocument.in/reader031/viewer/2022022203/587807ea1a28ab971e8b4f39/html5/thumbnails/10.jpg)
The open data challenges
- Discovery; people can’t find the data
- Format; the data is exposed in complex services/formats
- License; the license is restrictive
- Aggregation level; “raw data now” *
* Rufus Pollock, 2007 http://blog.okfn.org/2007/11/07/give-us-the-data-raw-and-give-it-to-us-now/
![Page 11: Wide access to spatial Citizen Science data - ECSA Berlin 2016](https://reader031.vdocument.in/reader031/viewer/2022022203/587807ea1a28ab971e8b4f39/html5/thumbnails/11.jpg)
Background
One of the objectives of COBWEB is to publish citizen science data to GEOSS
GEOSS has a focus on spatial standards (CSW, SensorWeb, WMS/WFS)
A major part of the citizen science community is not aware of these standards
Average users rely on search engines to discover data and on common formats to analyse it
How can the gap between the services in GEOSS and search engines be bridged?
![Page 12: Wide access to spatial Citizen Science data - ECSA Berlin 2016](https://reader031.vdocument.in/reader031/viewer/2022022203/587807ea1a28ab971e8b4f39/html5/thumbnails/12.jpg)
![Page 13: Wide access to spatial Citizen Science data - ECSA Berlin 2016](https://reader031.vdocument.in/reader031/viewer/2022022203/587807ea1a28ab971e8b4f39/html5/thumbnails/13.jpg)
![Page 14: Wide access to spatial Citizen Science data - ECSA Berlin 2016](https://reader031.vdocument.in/reader031/viewer/2022022203/587807ea1a28ab971e8b4f39/html5/thumbnails/14.jpg)
![Page 15: Wide access to spatial Citizen Science data - ECSA Berlin 2016](https://reader031.vdocument.in/reader031/viewer/2022022203/587807ea1a28ab971e8b4f39/html5/thumbnails/15.jpg)
Geonovum testbed
The gap between OGC and web standards is a general challenge
W3C and OGC have set up a joint working group to develop best practices
At the start of 2016 Geonovum (Dutch national government) organised a testbed to move the ‘spatial data on the web’ best practices forward.
![Page 16: Wide access to spatial Citizen Science data - ECSA Berlin 2016](https://reader031.vdocument.in/reader031/viewer/2022022203/587807ea1a28ab971e8b4f39/html5/thumbnails/16.jpg)
What search engines expect
HTML (text) output on unique, persistent URLs
An index that links to every URL that should be discovered
HTML documents annotated with “schema.org”-markup transform web pages into structured data
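The three expectations above can be sketched in a few lines of Python. This is only an illustration under assumptions: the function names, the example URL and the choice of JSON-LD as the embedding syntax for the schema.org markup are not taken from the slides.

```python
import json
from html import escape
from typing import Iterable
from xml.etree import ElementTree as ET


def dataset_page(title: str, description: str, url: str) -> str:
    """Render a minimal HTML page carrying schema.org Dataset markup as JSON-LD."""
    markup = {
        "@context": "https://schema.org",
        "@type": "Dataset",
        "name": title,
        "description": description,
        "url": url,  # the unique, persistent URL of this page
    }
    # Escape "</" so that field values cannot close the script element early.
    payload = json.dumps(markup, indent=2).replace("</", "<\\/")
    return (
        "<!DOCTYPE html>\n<html><head>\n"
        f"<title>{escape(title)}</title>\n"
        f'<script type="application/ld+json">\n{payload}\n</script>\n'
        "</head><body>\n"
        f"<h1>{escape(title)}</h1>\n<p>{escape(description)}</p>\n"
        "</body></html>"
    )


def sitemap(urls: Iterable[str]) -> str:
    """Build a sitemap document that lists every page a crawler should visit."""
    urlset = ET.Element("urlset", xmlns="http://www.sitemaps.org/schemas/sitemap/0.9")
    for page_url in urls:
        ET.SubElement(ET.SubElement(urlset, "url"), "loc").text = page_url
    return ET.tostring(urlset, encoding="unicode")
```

A catalogue would call `dataset_page` once per record and pass the same URLs to `sitemap`, so that the crawler index and the pages stay in step.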
![Page 17: Wide access to spatial Citizen Science data - ECSA Berlin 2016](https://reader031.vdocument.in/reader031/viewer/2022022203/587807ea1a28ab971e8b4f39/html5/thumbnails/17.jpg)
![Page 18: Wide access to spatial Citizen Science data - ECSA Berlin 2016](https://reader031.vdocument.in/reader031/viewer/2022022203/587807ea1a28ab971e8b4f39/html5/thumbnails/18.jpg)
![Page 19: Wide access to spatial Citizen Science data - ECSA Berlin 2016](https://reader031.vdocument.in/reader031/viewer/2022022203/587807ea1a28ab971e8b4f39/html5/thumbnails/19.jpg)
Schema.org and Citizen Science
The Schema.org ontology currently does not provide classes for citizen science projects and observations
An extension to schema.org can be proposed to model citizen science communities and observations, for example based on schema.org/Measurement
![Page 20: Wide access to spatial Citizen Science data - ECSA Berlin 2016](https://reader031.vdocument.in/reader031/viewer/2022022203/587807ea1a28ab971e8b4f39/html5/thumbnails/20.jpg)
![Page 21: Wide access to spatial Citizen Science data - ECSA Berlin 2016](https://reader031.vdocument.in/reader031/viewer/2022022203/587807ea1a28ab971e8b4f39/html5/thumbnails/21.jpg)
A proxy approach
A proxy layer transforms WFS/CSW requests to HTML annotated with schema.org
The CSW proxy approach is implemented in GeoNetwork opensource
For the WFS proxy approach a new open source product has been released by interactive instruments, called ‘LDproxy’
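The core of such a proxy is a mapping from a feature to an annotated HTML page. The sketch below is not the GeoNetwork or LDproxy implementation; it assumes the feature has already been fetched from the WFS as a GeoJSON point feature, and the sample feature, the base URL and the mapping to schema.org `Place` are hypothetical choices made for illustration.

```python
import json
from html import escape

# Hypothetical feature, in the shape a WFS with GeoJSON output might return.
SAMPLE_FEATURE = {
    "type": "Feature",
    "id": "obs-1",
    "geometry": {"type": "Point", "coordinates": [13.4, 52.5]},
    "properties": {"name": "Observation 1", "species": "Sturnus vulgaris"},
}


def feature_to_jsonld(feature: dict, base_url: str) -> dict:
    """Map a GeoJSON point feature to a schema.org Place description."""
    lon, lat = feature["geometry"]["coordinates"][:2]  # GeoJSON order is lon, lat
    props = feature.get("properties", {})
    return {
        "@context": "https://schema.org",
        "@type": "Place",
        "@id": f"{base_url}/{feature['id']}",
        "name": props.get("name", str(feature["id"])),
        "geo": {"@type": "GeoCoordinates", "latitude": lat, "longitude": lon},
    }


def feature_to_html(feature: dict, base_url: str) -> str:
    """Render the feature as an HTML page with embedded JSON-LD."""
    doc = feature_to_jsonld(feature, base_url)
    # Escape "</" so property values cannot close the script element early.
    payload = json.dumps(doc, indent=2).replace("</", "<\\/")
    rows = "".join(
        f"<tr><th>{escape(str(key))}</th><td>{escape(str(value))}</td></tr>"
        for key, value in feature.get("properties", {}).items()
    )
    return (
        "<!DOCTYPE html>\n<html><head>\n"
        f"<title>{escape(str(doc['name']))}</title>\n"
        f'<script type="application/ld+json">\n{payload}\n</script>\n'
        "</head><body>\n"
        f"<table>{rows}</table>\n"
        "</body></html>"
    )
```

Calling `feature_to_html(SAMPLE_FEATURE, "https://example.org/observations")` yields one page per feature, each with its own URL in the `@id` field.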
![Page 22: Wide access to spatial Citizen Science data - ECSA Berlin 2016](https://reader031.vdocument.in/reader031/viewer/2022022203/587807ea1a28ab971e8b4f39/html5/thumbnails/22.jpg)
![Page 23: Wide access to spatial Citizen Science data - ECSA Berlin 2016](https://reader031.vdocument.in/reader031/viewer/2022022203/587807ea1a28ab971e8b4f39/html5/thumbnails/23.jpg)
![Page 24: Wide access to spatial Citizen Science data - ECSA Berlin 2016](https://reader031.vdocument.in/reader031/viewer/2022022203/587807ea1a28ab971e8b4f39/html5/thumbnails/24.jpg)
![Page 25: Wide access to spatial Citizen Science data - ECSA Berlin 2016](https://reader031.vdocument.in/reader031/viewer/2022022203/587807ea1a28ab971e8b4f39/html5/thumbnails/25.jpg)
![Page 26: Wide access to spatial Citizen Science data - ECSA Berlin 2016](https://reader031.vdocument.in/reader031/viewer/2022022203/587807ea1a28ab971e8b4f39/html5/thumbnails/26.jpg)
{image of google structured data testing tool}
![Page 27: Wide access to spatial Citizen Science data - ECSA Berlin 2016](https://reader031.vdocument.in/reader031/viewer/2022022203/587807ea1a28ab971e8b4f39/html5/thumbnails/27.jpg)
![Page 28: Wide access to spatial Citizen Science data - ECSA Berlin 2016](https://reader031.vdocument.in/reader031/viewer/2022022203/587807ea1a28ab971e8b4f39/html5/thumbnails/28.jpg)
A proxy approach to reach other communities
A similar approach can be used to expose OGC services to other communities, such as the citizen science developer community
- CSW/ISO 19139 metadata exposed as DCAT/VoID in RDFa or RDF/XML
- SOS/WFS/GML exposed as Darwin Core in RDFa or JSON-LD
- A JSON API for web developers
It would also be interesting to look at the reverse approach, in which a proxy is used to expose unstructured citizen science data to the GEOSS community as WFS/SOS.
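As an illustration of the first mapping (catalogue metadata exposed as DCAT in RDF/XML), here is a minimal Python sketch. It assumes the ISO 19139 record has already been parsed into a plain dictionary; the keys `uri`, `title`, `abstract`, `keywords` and `links` are hypothetical names chosen for this example and do not come from any of the tools mentioned.

```python
from xml.etree import ElementTree as ET

RDF = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
DCAT = "http://www.w3.org/ns/dcat#"
DCT = "http://purl.org/dc/terms/"

# Use readable prefixes instead of generated ones (ns0, ns1, ...) in the output.
ET.register_namespace("rdf", RDF)
ET.register_namespace("dcat", DCAT)
ET.register_namespace("dct", DCT)


def record_to_dcat(record: dict) -> str:
    """Serialise a parsed metadata record as a dcat:Dataset in RDF/XML."""
    root = ET.Element(f"{{{RDF}}}RDF")
    dataset = ET.SubElement(
        root, f"{{{DCAT}}}Dataset", {f"{{{RDF}}}about": record["uri"]}
    )
    ET.SubElement(dataset, f"{{{DCT}}}title").text = record["title"]
    ET.SubElement(dataset, f"{{{DCT}}}description").text = record["abstract"]
    for keyword in record.get("keywords", []):
        ET.SubElement(dataset, f"{{{DCAT}}}keyword").text = keyword
    # Each online resource becomes a dcat:Distribution with an access URL.
    for link in record.get("links", []):
        wrapper = ET.SubElement(dataset, f"{{{DCAT}}}distribution")
        distribution = ET.SubElement(wrapper, f"{{{DCAT}}}Distribution")
        ET.SubElement(
            distribution, f"{{{DCAT}}}accessURL", {f"{{{RDF}}}resource": link}
        )
    return ET.tostring(root, encoding="unicode")
```

A real proxy would cover many more metadata elements (licence, extent, contact); the point here is only that the transformation is a field-by-field mapping between two vocabularies.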
![Page 29: Wide access to spatial Citizen Science data - ECSA Berlin 2016](https://reader031.vdocument.in/reader031/viewer/2022022203/587807ea1a28ab971e8b4f39/html5/thumbnails/29.jpg)
Privacy and the search engines
Some of the search engines are generally perceived as a challenge for privacy
However, in this case it is the campaign organiser that should take measures
A complicating factor is that citizens tend to like to advertise that they made a contribution, or even claim ownership of a contribution
![Page 30: Wide access to spatial Citizen Science data - ECSA Berlin 2016](https://reader031.vdocument.in/reader031/viewer/2022022203/587807ea1a28ab971e8b4f39/html5/thumbnails/30.jpg)
Privacy by design
Minimise the transport and storage (timespan) of data that could be used to derive identity (minimise, separate, aggregate & hide*)
Communicate transparently about the transport and storage strategy
Offer users the ability to review and remove their personal data
Transport location and timestamp only at the level of detail that is required for the use case
Use a wallet with reliability-credits instead of keeping a user history for reliability assessment
* https://www.pilab.nl/wp-content/uploads/2013/12/Privacy-design-strategies-JHH-5-12-2013.pdf
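The point about level of detail can be made concrete with a small Python sketch that coarsens an observation before it leaves the device. The function names and default precisions are assumptions made for illustration; the appropriate precision depends on the use case.

```python
from datetime import datetime


def coarsen_location(lat: float, lon: float, decimals: int = 2) -> tuple:
    """Round coordinates; two decimals is roughly 1 km in latitude."""
    return round(lat, decimals), round(lon, decimals)


def coarsen_timestamp(ts: datetime, level: str = "day") -> datetime:
    """Truncate a timestamp to the start of its hour or day."""
    if level == "hour":
        return ts.replace(minute=0, second=0, microsecond=0)
    if level == "day":
        return ts.replace(hour=0, minute=0, second=0, microsecond=0)
    raise ValueError(f"unsupported level: {level!r}")
```

For a campaign that only needs to know in which neighbourhood and on which day something was observed, storing the coarsened values rather than the raw GPS fix removes detail that could otherwise help derive a contributor's identity.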
![Page 31: Wide access to spatial Citizen Science data - ECSA Berlin 2016](https://reader031.vdocument.in/reader031/viewer/2022022203/587807ea1a28ab971e8b4f39/html5/thumbnails/31.jpg)
“Privacy awareness is growing,
it’s comparable with the stage of environmental awareness 40 years ago” *
*Jaap-Henk Hoepman, Privacy & Identity Lab, Radboud University Nijmegen
![Page 32: Wide access to spatial Citizen Science data - ECSA Berlin 2016](https://reader031.vdocument.in/reader031/viewer/2022022203/587807ea1a28ab971e8b4f39/html5/thumbnails/32.jpg)
Conclusions
A proxy approach for CSW is a good way to make existing published datasets more widely discoverable via alternative channels
A proxy approach for WFS/SOS has the potential to bridge the gap between OGC services and search engines; however, search engines currently offer only limited support for schema.org annotations
Adopting an established standard helps in making data more widely available.
A growing number of tools is available to facilitate engagement with open data