the idea

24
The idea

Upload: wilma

Post on 17-Jan-2016

25 views

Category:

Documents


0 download

DESCRIPTION

The idea. Project data. DRIVER is a (small) FP6 STREP http://cordis.europa.eu/fp6/instr_strp.htm in http://cordis.europa.eu/ist/ with overall budget of M€ 2,507 and EC contribution of M€ 1,835 starting at 01/06/2006 for a period of 18 months - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: The idea

The idea

Page 2: The idea

Project data

• DRIVER is a (small) FP6 STREP http://cordis.europa.eu/fp6/instr_strp.htm in http://cordis.europa.eu/ist/

• with overall budget of M€ 2,507 and EC contribution of M€ 1,835

• starting at 01/06/2006 for a period of 18 months • Web: http://www.driver-repository.eu/

Page 3: The idea

Open access testbed

“DRIVER sets out to build the testbed for a future knowledge infrastructure of the European Research Area […] DRIVER will deliver the content resources, i.e. any form of scientific output, including scientific/technical reports, working papers, pre-prints, articles and original research data.”

Page 4: The idea

“The knowledge infrastructure testbed, delivered by DRIVER, will be based on nationally organized digital repository infrastructures […] The successful DARE network in the Netherlands, recently presented to the public by the project partner SURF, will serve as model to DRIVER.”

National agents

Page 5: The idea

“DRIVER with its testbed will not build a specific digital repository system with pre-defined services, based on a specific technology and serving dedicated communities. The testbed will in its inception focus on the infrastructure aspect, i.e., open, clearly defined interfaces to the content network, which allow any qualified service-provider to build services on top of it.”

Two layer concept (1/2)

Page 6: The idea

…..

Society

Research

Education

HarvestersVirtual Learning Environments (Blackboard, HIVE, etc)Course Ware, Readers, ...

InstitutionalRepository

TUDelft

Minho

Soton

CNRS

MIT

Subject repositories, refereed portals, databases, collaboratories,(Open Access) journals, ...

Institutional windows, professional journals, personal Web sites, national windows, Google ...

ArXiv

Two layer concept (2/2)

....

Page 7: The idea

Building

Focussed Studies Raising Awareness

Pan-European DR Infrastructure via FP7

ISTI-CNR

SURF UniNott

Infrastructure Middleware

Development/Implementation

UniBie / UniGoe

ContentOrganisation and Provision

blocks

Page 8: The idea

DRIVER Partners

Greece, UoA National and Capodistrian University of Athens, Administrative Co-ordination

Germany, SUB Niedersächsische Staats- und Universitätsbibliothek Göttingen, Germany

Scientifc-Technical Co-ordination

France, CNRS* Centre pour la Communication Scientifique Directe / CNRS DIS, France

Poland, ICM Interdiscipl. Centre for Math. and Comp. Mod. /Univers. of Warsaw, Poland

Italy, CNR* Istituto di Scienza e Tecnologie dell'Informazione "A. FAEDO" / CNR, Italy

The Netherlands, SURF* SURF Foundation, The Netherlands UK, UKOLN UK Office for Library and Information Networking

, University of BathBelgium, UGhent University Library Ghent, BelgiumUK, UniNott University of Nottingham, United Kingdom Germany, UB-Bi Universitätsbibliothek Bielefeld, Germany

* national organizations

Page 9: The idea

WP1 - Project Management 7%

WP2 - Strategic Organisation and Content Provision 13%

WP3 - DRIVER Design and Technical Co-ordination

WP4 - DRIVER Testbed Implementation

WP5 - DRIVER Testbed Testing and Validation WP6 - Service Activity

55%

WP7 - Focussed Studies 12%

WP8 - Awareness-raising and Advocacy programme 15%

testbed

DRIVER Work Packages

Page 10: The idea

Incremental approach:

1. test bed• spring 2007 with experimental group of 51 repositories* from: BE

(Ghent), DE (DINI), FR (CNRS), NL (DARE), UK (SHERPA)• International meeting planned for 6. February 2007• draft guidelines of content providers• systematic test bed testing

2. test bed extended• to be designed on the basis of experience with 1.• presumably summer 2007• include e.g. PT, DK, ES, GR, FI, IT, NO, PL, SE … & individuals

3. organisational structure• to be designed on the basis of experience with 1. & 2.

* See appended slides

Content providers

Page 11: The idea

Requirements for eligibility still under debate. E.g.• DINI certificate• OpenDOAR criteria• DARE guidelines

Settled on 6 February 2007New partners invited on 7 February 2007

Quality of data layer is crucial for quality of services

Content providers

Page 12: The idea

Software architecture

Page 13: The idea

Aggregator

Quality instrument for harvesting:• checks OAI protocol compliance • list of repositories and their characteristics

(e.g. base url, supported formats, platform, sets, log of harvests, country, …)

• normalizes harvested result by e.g. quality check of metadata, filtering on rights level (open access, licenced, embargoed), link to full text checking, prepares indexing, …

DRIVER will use SAHARA (= Open source DARE harvester/aggregator)

Page 14: The idea

Metadata guidelines

DC, plus rules for e.g. • document identifier • author identifier • provenance• date stamp (refresh vs incremental harvesting,

deletes)• classification scheme (DEWEY?)

Settled on 6 February 2007

Page 15: The idea

Issues

• which index/search engine: Lucene and/or Fast?• mapping e.g. MARC, IEEE LOM, METS, … to DC• authentication only for profiling and

recommendation/RSS• no sets but filters for document type, status,

language, … (quality of metadata is critical)• automated classification e.g. based on DEWEY, for

discipline based services • resource harvesting (not supported by OAI PMH,

DARE has developed XML container) for e.g.– long term preservation – full text indexing

Page 16: The idea

• Inventory of EU repositories

• Studies of related issues

• Investigation of technical standards

DRIVER Studies

Page 17: The idea

• Inventory of EU repositories

Deliverable: A complete inventory of the present state of Digital Repositories in all 25 countries of the European Union. A Digital Repository in this study is defined as (1) containing research output (2) institutional or thematic and (3) OIA compliant. Preliminary results:

www.pleiade.nl/wikiPlease complete!

DRIVER Studies

Page 18: The idea

• Studies of related issues– Business models, Key Perspectives/Alma Swan– IPR, SURF/Wilma Mossink– Long term preservation, KB/Barbara Sierman– Data curation, KNAW-DANS/Rene van Horik– IR population, Tilburg University/Vanessa Proudman

Workshop in March to discuss draft (-> WP8)

Deliverable: A DRIVER’s guide for IR managers

DRIVER Studies

Page 19: The idea

• Investigation of technical standards– metadata (formats, qualifiers)– content description (document types, versions)– data modeling – complex documents – identifier (documents, authors, institutes)

DRIVER Studies

Page 20: The idea

Awareness-raising and Advocacy programme• Raise awareness of:

– Open Access - repositories - DRIVER• Support work to establish:

– repositories - regional support network - repository networks

• Provide information on:– advocacy materials - advocacy strategies -

advice & resources• Active outreach• Efficient response• Comprehensive information

DRIVER Advocacy

Page 21: The idea

testbed repositories first batch (1/2)

SHERPA http://www.sherpa.ac.uk/repositories/index.html

– University of Birmingham - EPrints Service– University of Bristol - Bristol Rep. of Schl. Eprints (ROSE) – British Library - EPrints – University of Cambridge - DSpace @ Cambridge – University of Durham - Durham E-Print Repository – University of Edinburgh -

Edinburgh Research Archive (ERA) – University of Glasgow - Glasgow ePrints Service – London LEAP Consortium

• Birkbeck College - Birkbeck ePrints • Imperial College - Imperial Eprints • Kings College - King's ePrints • LSE - LSE Research Online • Royal Holloway - Royal Holloway Research Online • SOAS - SOAS Eprints • UCL - UCL Eprints

– University of Newcastle upon Tyne - E-Print Pilot – University of Nottingham - Nottingham ePrints – University of Oxford - Oxford Eprints – White Rose Partnership - White Rose Consortium e

• University of Leeds • University of Sheffield

DARE http://www.darenet.nl/nl/page/language.view/repositories

– Amsterdam University Press – Erasmus Universiteit Rotterdam – Fontys Hogescholen – KNAW – NWO – Open Universiteit Nederland – Radboud Universiteit Nijmegen – Rijksuniversiteit Groningen – Technische Universiteit Delft – Technische Universiteit Eindhoven – Universiteit Leiden – Universiteit Maastricht – Universiteit Twente – Universiteit Utrecht – Universiteit van Amsterdam – Universiteit van Tilburg – Vrije Universiteit Amsterdam

Content providers

Page 22: The idea

testbed repositories first batch (2/2)

DINI http://www.dini.de/dini/zertifikat/zertifiziert.php

– Universität Kassel KOBRA - Kasseler Online Bibliothek, Repository und Archiv– Universität Bielefeld BieSOn - Bielefelder Server für Online-Publikationen– Technische Universität Hamburg-Harburg TUBdok– Staats- und Universitätsbibliothek Bremen - Dokumentser– Universität Ulm Volltextserver der Universität Ulm (VTS)– Universität Göttingen Webdoc-Server– Hochschule für bildende Künste Hamburgask 23– Universität Heidelberg HeiDok– SLUB Dresden / TU Dresden HSSS Hochschulschriftenserver– Universitätsbibliothek Mannheim MADOC: Mannheim Electronic DOCument

Server der Universitätsbibliothek Mannheim– Universität Mainz ArchiMeD - Hochschulpublikations-Server der Universität Mainz– Universitätsbibliothek Hohenheim OPUS Hohenheim– PsyDok - Volltextserver der Virtuellen Fachbibliothek Psychologie– SciDok - Der Wissenschafts-Server der Universität des Saarlandes– Universität Stuttgart OPUS - Online Publikationsverbund der Universität Stuttgart– Universitätsbibliothek Tübingen TOBIAS-lib– Humboldt-Universität zu Berlin EDOC-Server– Universitaet Duisburg-Essen, Campus Duisburg Duisburger Elektronische Texte (

DuetT)– Technische Universität Chemnitz MONARCH (Multimedia ONline ARchiv CHemnitz)

GhentCNRS

– TEL– HAL– EduTice– SIC– IJN– MEMSIC

Content providers

Page 23: The idea

DRIVER Partners Administrative Co-ordination

UoA National and Capodistrian University of Athens, GreeceMike Hatzolopoulos, Yannis Ioannidis, Natalia Manola, Vassili Stoumpos, Antonis Lebessis

Scientifc-Technical Co-ordination

SUB Niedersächsische Staats- und Universitätsbibliothek Göttingen, GermanyNorbert Lossau, Heike Neuroth, Wolfram Horstmann

CNRS* Centre pour la Communication Scientifique Directe / CNRS DIS, France Francis Andre, Muriel Foullonneau

ICM Interdiscipl. Centre for Math. and Comp. Mod. /Univers. of Warsaw, PolandWojtek Sylvetsrzak, Jaroslaw Wypychowksi

CNR* Istituto di Scienza e Tecnologie dell'Informazione "A. FAEDO" / CNR, ItalyDonatella Castelli, Pasquale Pagano, Paolo Manghi

SURF* SURF Foundation, The Netherlands Leo Waaijers, Martin Feijen, Maurits van der Graaf, Vanessa Proudman, Alma Swan …

UKoln UK Office for Library and Information Networking, University of BathLiz Lyon, Rachel Heery

UGhent University Library Ghent, BelgiumSylvia van Peteghem, Karen van Godtsenhoven, Patrick Hochstenbach

UniNott University of Nottingham, United Kingdom Stephen Pinfield, Bill Hubbard, Gareth Johnson, Mary Robinson

UB-Bi Universitätsbibliothek Bielefeld, GermanyMichael Höppner, Friedrich Summann, Marek Imialek

* national organizations

Page 24: The idea

Work packages

WP1 - Project Management (UoA)

This work package aims at co-coordinating all project activities, and ensuring conformance with the project annex and quality plan.

7%

WP2 - Strategic Organisation and Content Provision (SUB)

Oversees and coordinates the activities between and across the various work packages to ensure and stimulate the incremental development of the DRIVER testbed.

13 %

WP3 - DRIVER Design and Technical Co-ordination (CNR)

Defines the functional and architectural specifications of the DRIVER testbed. Also plans and coordinates the development of the DRIVER testbed components; optimizes the different phases of the development iterations; and synchronizes the production of coherent releases.

WP4 - DRIVER Testbed Implementation (UoA)

Produces the detailed specification of the services that will provide the functionality of the DRIVER testbed and implements the service components of the DRIVER testbed.

WP5 - DRIVER Testbed Testing and Validation (ICM)

Defines functionality and efficiency tests based on work from WP3 and WP4.

55%

WP6 - Service Activity (CNR)

Designs the appropriate system deployment workflow and monitors quality for DRIVER service operations.

WP7 - Focussed Studies (SURF)

Conducts a set of strategic and coordinated studies on DR’s and DR-related topics to facilitate the iterative development of DRIVER and help develop the roadmap for EU-wide expansion of the future knowledge infrastructure as depicted in the DRIVER vision.

12%

WP8 - Awareness-raising and Advocacy programme (UniNott)

Creates an informed and active environment for repository infrastructure development in EU countries, through the provision of energizing activities, focused information and contextualized support.

15%

testbed