open data open science open search · 2020. 6. 10. · open data. open scienc. e. open search @...
TRANSCRIPT
CERN – IT DepartmentCH-1211 Genève 23
Switzerlandwww.cern.ch/it
Dr Tim SMITH, Dr Andreas Wagner - CERN
OSSYM - Web-meeting on Open Internet SearchCERN – 2020/05/26
Open DataOpen Science
Open Search@ CERN
@TimSmithCH
CERN
Science for Peace
27km
-271.3oC
99.999999%23 Member States
70 Countries120 Nationalities600 Universities
Collaboration
FundamentalResearch
@TimSmithCH
Big Data !
40 million collisions /s
150 M sensors
Trigger 100 k events /s
Record 10 GB /s
Calibrate
Reconstruct
10 M lines of code
320 PBAnalysis at 167 sites around world
@TimSmithCH
Data Driven Science
1903 1995
Opal Experiment
@TimSmithCH
Research Iceberg kB
1,000
MB1,000,000
GB1,000,000,000
TB1,000,000,000,000
PB1,000,000,000,000,000
@TimSmithCH
The Web Free Market
Share ≠ Publish ≠ Preserve
@TimSmithCH
CERN Open Data
http://opendata.cern.chhttp://inveniosoftware.org
@TimSmithCH
Analysis Preservation
InternalPeer Review
ExternalPeer Review
http://github.com/cernanalysispreservation
@TimSmithCH
Open Access Mandate
Enabling Open Science
Open Data Pilot
Open Access Pilot
ORD & DMP by default
@TimSmithCH
Open Research as a Service
http://zenodo.org
@TimSmithCH
Open Research as a Service
@TimSmithCH
Share ≠ Publish ≠ Preserve
@TimSmithCH
Towards Digital Age Scholarship
Peer Reviewers
Article Centric
Research Centric
@TimSmithCH
Preservation ↔ Reusability
http://reana.io
@TimSmithCH
Open Science…
science
Reproduce
Public good
Efficiency
Economy
Impact
Innovation
Measure
Quality
Equality of access
…to advance knowledge, technology, society
Validate
Reuse
@TimSmithCH
Search @ CERN Moving from a commercial Enterprise Search solution …
... to an Open Source Enterprise Search solution
@TimSmithCH
Core components
• Invenio• open source framework for
large-scale digital repositories• Elasticsearch
Open Source Enterprise Search solution
• Multiple micro-services• Decentralized• Global aggregator
What’s new?
@TimSmithCH
Open Source Enterprise Search solution
• JSON documents (extensible)• Full text indexing (PDF, PPT, ...)• Access control and isolation• Document level ACLs• Web UI + RESTful API• Python, Invenio Framework• Docker, OpenShift• Elasticsearch
@TimSmithCH
Open Source Enterprise Search solution
• Presently working on migrating of CERN’s main document repositories:
• EDMS• CERN’s Engineering and Equipment Data Management System
• Indico • Provides support for organising events of all sizes
• JACoW• Joint Accelerator Conferences Website
@TimSmithCH
Looking forward to:
• Continue our exchange of knowledge and experience• Evaluate technical solutions• Support the activities of the Open Search Foundation• Foster the scientific exchange
• Open Search Symposium 2020 and beyond
Open Search – Search @ CERN
@TimSmithCH
Open Search…
Open Search
Public good
Efficiency
Economy
Impact
Innovation
Measure
Quality
Equality of access
…to advance knowledge, technology, society
@TimSmithCH
Thank You ! Questions ?CERN Open Datahttp://opendata.cern.chhttp://github.com/cernopendatacernopendata
CERN Analysis Preservationhttp://analysispreservation.cern.chhttp://github.com/cernanalysispreservationanalysispreserv
REANA http://www.reanahub.iohttp://github.com/reanahubreanahub
Inveniohttp://inveniosoftware.orghttp://github.com/ inveniosoftwareinveniosoftware
Zenodohttp://zenodo.orghttp://github.com/zenodozenodo_org
http://orcid.org/0000-0002-1567-7116 http://cern.ch/Tim.Smith
http://openaire.eu