support kit for tdm topics - openmintedopenminted.eu/wp-content/uploads/2017/12/open... ·...
TRANSCRIPT
Support Kit for TDM
topics December 12, 2017
Deliverable Code: D3.4
Version: 1.0 – Final
Dissemination level: Public
This deliverable is a continuation of deliverable 3.1, 3.2 and 3.3 and contains a support kit for
TDM topics for the various OpenMinTeD stakeholders. It contains a set of materials, FAQs on
legal issues, briefing papers and a description of training activities. The intended use of
deliverable 3.1-3.3 was for the project partners to discover what the content of the Knowledge
Base (OMTD-KB) would be. The use of deliverable 3.4 has shifted more towards an overview of
‘building blocks’ that the partners can use for their own courses and tutorials in the OMTD-KB.
H2020-EINFRA-2014-2015 / H2020-EINFRA-2014-2
Topic: EINFRA-1-2014
Managing, preserving and computing with big research data
Research & Innovation action
Grant Agreement 654021
Support Kit for TDM topics
• • •
Public Page 1 of 12
Document Description
D3.4 Support Kit for TDM topics
WP3 – Support and Training
WP participating organizations: OU, LIBER, ARC, University of Manchester, UKP-TUDA, INRA, EMBL,
AgroKnow I.K.E., EPFL, BSC, USFD, GESIS, Frontiers, UoG
Contractual Delivery Date: 11/2017 Actual Delivery Date: 12/2017
Nature: Other Version: 1.0 (Final)
Public Deliverable
Preparation slip
Name Organization Date
From Martine Oudenhoven LIBER 29-11-2017
Edited by Nancy Pontika, Panagiotis
Zervas
OU, AK 30-11-2017
Reviewed by Andrea Zielinski, Sophie Aubin GESIS, INRA 11-12-2017
Approved by Androniki Pavlidou ARC 12-12-2017
For delivery Mike Hatzopoulos ARC 12-12-2017
Document change record
Issue Item Reason for Change Author Organization
V0.1 Previous
version
To be updated for D3.4 Martine
Oudenhoven
LIBER
V0.2 Draft version Initial version Martine
Oudenhoven
LIBER
V 0.3 Second draft After comments Nancy Pontika and
Panagiotis Zervas
Martine
Oudenhoven
LIBER
V1.0 Final version After comments reviewers Martine
Oudenhoven
LIBER
Support Kit for TDM topics
• • •
Public Page 2 of 12
Table of Contents
1. INTRODUCTION .......................................................................................................................................4
PROJECT BACKGROUND ...............................................................................................................................4
PROJECT GOAL ..........................................................................................................................................4
INTRODUCTION TO WP3: “SUPPORT AND TRAINING” ........................................................................................4
2. DELIVERABLE CONTENT ...........................................................................................................................6
APPENDIX ......................................................................................................................................................8
OVERVIEW .......................................................................................................................................................8
0. BACKGROUND: TDM STORIES ...........................................................................................................................9
1. GENERAL INTRODUCTION: RESOURCES TO INTRODUCE TDM .....................................................................................9
2. BARRIERS, LEGAL AND POLICY ASPECTS ............................................................................................................... 10
3. TDM CONCEPTS AND AREAS ............................................................................................................................ 10
4. HANDS-ON COURSES AND GUIDELINES ON TDM................................................................................................ 11
5. TDM TOOLS, SERVICES AND REPOSITORIES ....................................................................................................... 11
WORKING FOLDER: RESOURCES FOR USE CASE TUTORIALS .......................................................................................... 12
Support Kit for TDM topics
• • •
Public Page 3 of 12
Disclaimer
This document contains description of the OpenMinTeD project findings, work and products. Certain
parts of it might be under partner Intellectual Property Right (IPR) rules so, prior to using its content
please contact the consortium head for approval.
In case you believe that this document harms in any way IPR held by you as a person or as a
representative of an entity, please do notify us immediately.
The authors of this document have taken any available measure in order for its content to be accurate,
consistent and lawful. However, neither the project consortium as a whole nor the individual partners
that implicitly or explicitly participated in the creation and publication of this document hold any sort
of responsibility that might occur as a result of using its content.
This publication has been produced with the assistance of the European Union. The content of this
publication is the sole responsibility of the OpenMinTeD consortium and can in no way be taken to
reflect the views of the European Union.
The European Union is established in accordance with the Treaty
on European Union (Maastricht). There are currently 28 Member
States of the Union. It is based on the European Communities
and the member states cooperation in the fields of Common
Foreign and Security Policy and Justice and Home Affairs. The five
main institutions of the European Union are the European
Parliament, the Council of Ministers, the European Commission,
the Court of Justice and the Court of Auditors.
(http://europa.eu/)
OpenMinTeD is a project funded by the European Union (Grant Agreement No 654021).
Support Kit for TDM topics
• • •
Public Page 4 of 12
1. Introduction
Project Background
OpenMinTeD aspires to enable the creation of an infrastructure that fosters and facilitates the use of
text and data mining (TDM) technologies in the scientific publications world and beyond, for both
application domain users and text-mining experts.
OpenMinTeD will make existing mining tools and platforms easily findable, by providing a clear
overview of these services in a registry. The services will also be interoperable through a standards-
based interoperability layer. OpenMinTeD works with use cases from different scientific areas (life
sciences, agriculture, social sciences and scholarly communications).
The project brings together the different stakeholders, content providers and scientific communities,
text mining and infrastructure builders, legal experts, data and computing centers, industrial players
and SMEs.
Through its infrastructural foresight activities, OpenMinTeD’s vision is to make a virtuous cycle
operational, in which primary content is accessible through standardised programmatic interfaces and
access rules:
1. by well-documented and easily discoverable text mining services and workflows which process,
analyse and annotate text to
2. identify patterns and extract new meaningful actionable knowledge, which will be used for
3. structuring, indexing and searching content, and,
4. act as a new knowledge resource useful for drawing new relations between content items and
firing a new mining cycle.
Project Goal
The goal of the project is to establish an open and sustainable TDM platform and infrastructure where
researchers can collaboratively create, discover, share and re-use knowledge from a wide range of
text-based scientific related sources in a seamless way to advance research, promote interdisciplinary
open science, and ultimately support evidence based decision making.
Introduction to WP3: “Support and Training”
On the OpenMinTeD Knowledge Base (OMTD-KB), concrete technical and legal support will be
provided to researchers, content and service providers. Training material will be provided on text and
data mining in general, as well as on specific ways of using the services on the OpenMinteD platform.
OpenMinTeD also developed a ticketing system to answer questions about these topics one-on-one.
The Knowledge Base will be hosted on the currently existing online training platform of the FOSTER
Open Science training project, in which both the Open University and LIBER are project partners.
Support Kit for TDM topics
• • •
Public Page 5 of 12
WP3: “Support and Training” addresses the need for delivering supporting services to the various
stakeholders that will enable the adoption of the infrastructure and will empower its sustainability. The
services are twofold:
1. Services that aim to raise stakeholders’ technical skills on the platform.
2. Services that aim to support those stakeholders into the adoption of the infrastructure, at the
technical, organizational, legal access and operational level.
The support and training activities of OpenMinTeD will follow two phases:
Phase 1: Preparatory phase (January 2015 –July 2017): From the start of the project until the release
of the specifications, guidelines and the first platform release. Focused on preparatory activities.
Phase 2: Adoption phase (July 2017 – May 2018): From the release to the platform onwards. Focused
on the adoption of the OpenMinTeD infrastructure.
As part of Task 3.2. “Support services”, WP3 supported stakeholders through the creation of a
Knowledge Base and expertise directory on TDM issues. This Knowledge Base contains information on
technical issues around machine access to digital publications, and information about legal barriers to
TDM. This expertise directory is available on: https://github.com/openminted/omtd-publisher-
connector-harvester
Support Kit for TDM topics
• • •
Public Page 6 of 12
2. Deliverable content
In order to make the online Knowledge Base a successful platform for finding relevant information
about text and data mining, the OpenMinTeD partners have already been gathering information in an
earlier stage of the project. The content in the Knowledge Base consists of technical and legal guides
and resources about text and data mining in general, and will contain tutorials on how to use the tools
and services as listed on the OpenMinTeD platform. This deliverable (3.4) is a living folder that contains
the overview of the content for the Knowledge Base, and serves as a continuation of the deliverables
3.1, 3.2 and 3.3. A large bulk of resources has already been uploaded to the Knowledge Base and also
resources on interoperability have been added. The new version of the support kit, deliverable 3.4,
moves towards a more specific and categorized set of ‘building blocks’. Project partners, for example
the ones involved in the use cases, can use this set in the following ways:
▪ They can use the building blocks that are part of this support kit for their own tutorials and
courses that they are planning to develop during the last phase of the project.
▪ They can add resources to the kit themselves, for example if they know a good hands-on
courses on a certain topic, they can add it to the OMTD-KB or there.
▪ They can add a separate document to collect the resources for their own tutorial (the
‘Resources for introductory library course’ is the first document that follows this approach).
The support kit in this way serves as a living folder that can be used to develop more courses and
tutorials for the Knowledge Base.
So far, the folder contains an overview of:
1. The preparatory work done for deliverable 3.1-3.3, including:
▪ An overview of ideas for training topics related to text and data mining and events.
▪ The different materials that are or will be featured on the Knowledge Base.
▪ A set of links related to various TDM topics.
▪ A set of legal barriers to text and data mining, as identified by the FutureTDM project.
▪ A number of links to documents about legal issues surrounding text and data mining.
▪ Snapshots of the training calendar that is maintained on Redmine, including:
A set of links to informative blogposts related to the OpenMinTed use cases
A set of links to OpenMinTeD webinars
A set of links to the presentations delivered for the OpenMinTeD project
A set of short videos created for the OpenMinTeD project
A set of FAQs related to legal aspects of TDM and OpenMinTeD.
2. D3.4: a reorganized version of the folder D3.1-3.3 aimed at supporting the partners in developing
their own courses and tutorials. It includes the same resources as D3.1-3.3, but on top of that:
▪ A new categorization of the resources, aimed at better helping the partners
Support Kit for TDM topics
• • •
Public Page 7 of 12
▪ Short descriptions of the resources and how they can be used.
▪ The link of the resources in the Knowledge Base (OMTD-KB) or the upload status
▪ The final resources resulting from the FutureTDM project, including practitioner guidelines and
policy recommendations.
▪ Links to a set of short video-clips that explain different concepts and areas in TDM in 30
seconds.
The resources that have not been uploaded yet, will be uploaded to the Knowledge Base shortly.
Hands-on training material (tutorials, guidelines etc. for applying the use cases and using the
OpenMinTeD platform) will be developed in the final phase of the project. Every use case can use the
living folder for resources and inspiration, and has a working document where they can collect the
resources relevant for the tutorial to be developed.
The link to the Knowledge Base is: https://www.fosteropenscience.eu/
The dedicated link to the OpenMinTeD project is: https://www.fosteropenscience.eu/openminted
The link to the Google Drive folder is:
https://drive.google.com/drive/folders/0BweN4_o0UigpMzFhWmdIcURpMlk
Snapshots of the different parts of the folder are included in the Appendix to give an impression.
Support Kit for TDM topics
• • •
Public Page 8 of 12
Appendix
Overview
The overview of the support kit, deliverable 3.4. The different categories reflect different kinds/levels
of building blocks. The different use cases or partners who plan to make a separate course or tutorial,
can add a page to this folder and use it as their working space.
Support Kit for TDM topics
• • •
Public Page 9 of 12
0. Background: TDM stories
1. General introduction: resources to introduce TDM
Support Kit for TDM topics
• • •
Public Page 10 of 12
2. Barriers, legal and policy aspects
3. TDM concepts and areas
Support Kit for TDM topics
• • •
Public Page 11 of 12
4. Hands-on courses and guidelines on TDM
5. TDM tools, services and repositories
Support Kit for TDM topics
• • •
Public Page 12 of 12
Working folder: resources for use case tutorials