cerif: past, present and future 1 an overview anne asserson, uib (no) keith g jeffery, clrc (uk)...

42
CERIF: Past, Present and Future 1 CERIF: Past, Present and Future An Overview Anne Asserson, UiB (NO) Keith G Jeffery, CLRC (UK) Andrei Lopatenko, MU (UK) Kassel, 29 – 31 August 2002 This presentations had a lot of pictures that unfortunately and regrettable had to be removed because of the size of the file. I was unable to transmit it to the CRIS202 conference server. The in situ presentation was done on local PC. AA

Upload: damon-manning

Post on 26-Dec-2015

216 views

Category:

Documents


1 download

TRANSCRIPT

CERIF: Past, Present and Future 1

CERIF: Past, Present and Future An Overview

Anne Asserson, UiB (NO)

Keith G Jeffery, CLRC (UK)

Andrei Lopatenko, MU (UK)

Kassel, 29 – 31 August 2002

This presentations had a lot of pictures that unfortunately and regrettable had to be removed because of the size of the file. I was unable to transmit it to the CRIS202 conference server. The in situ presentation was done on local PC. AA

CERIF: Past, Present and Future 2

• CERIF: Past, Present and Future: An OverviewAnne Asserson, Andrei Lopatenko, Keith Jeffery

• CERIF - Information Retrieval of Research Information in a Distributed Heterogeneous EnvironmentAndrei Lopatenko, Keith Jeffery, Anne Asserson

• Comparative Study of Metadata for Scientific Information:The place of CERIF in CRISs and Scientific RepositoriesKeith Jeffery, Andrei Lopatenko, Anne Asserson,

CERIF: Past, Present and Future 3

Past• CERIF91• Experience and Problems• Requirements• CRISs and CERIF91

Present• CERIF 2000 • The Three Data Models• CERIF Data Model• The Advantages• Use of CERIF2000 today

Future • CERIF 2002• Revisions to CERIF2000• Status • Issues• Custodians

CERIF: Past, Present and Future 4

CERIF91

• Exchange of data on R&D in Europe• Workshop 1987 in Brussels• Recommendation publised in the

Official Journal of the European Communities,1991

CERIF: Past, Present and Future 5

CERIF 1991 experience & problems• Problem that CRISs had a single-entry focus

– Projects (national research councils)– Persons (BEST uk, COS us, ….) – Organisations (LABO fr, …..)

• Simple record format – Project was an entity with: Persons, organisations,

and other information represented as attributes

• Problems with repeated groups and relationship

• Research classifications scheme recommended 1991, not updated since 1988

CERIF: Past, Present and Future 6

Requirements • cover projects, persons, organisations • entities, not more attributes• lengths & types & language, character

set• repeating groups (logical)• flexibility - relationships (conceptual)• need better data quality • need for consistent coding (semantic). • no supporting systems for CERIF 1991

existing

CERIF: Past, Present and Future 7

CRISs and the state of CERIF 1991• CERIF 1991 needed updating

– to handle problems from experience of use

• CRIS becoming more important – noticeable both in EC and national governments

• Standard needed for ERGO (European Research Gateways Online) pilot initiative – A single gateway to national databases of research

projects via a central catalogue launched 1999– 20-30 countries submitted data, 90 000 records

CERIF: Past, Present and Future 8

Past• CERIF91• Experience and Problems• Requirements• CRISs and CERIF91

Present• CERIF 2000 • The Three Data Models• CERIF Data Model• The Advantages• Use of CERIF2000 today

Future • CERIF 2002• Revisions to CERIF2000• Status • Issues• Custodians

CERIF: Past, Present and Future 9

CERIF 2000• Working Group set up 1997 and Co-

ordinated by DG XIII-D4, European Comission

• CERIF 2000 Guidelines, Final report of the CERIF Revision Working Group, 1999

• Common format for development of new CRISs

• Exchange of data from records in existing and future multiple different CRIS

• Implies the need for a common format for metadata describing records in existing and future multiple different CRIS

CERIF: Past, Present and Future 10

The Three Data models

• Metadata format should be proper subset of Exchange format

• Exchange format should be proper subset of common superset of all known CRIS (and estimates of future CRIS)

Task: • Develop standard datamodel for Common

superset• first, then derive exchange and metadata subsets

CERIF: Past, Present and Future 11

CERIF: Past, Present and Future 12

CERIF2000 Data model

– Extended relational model– Linking relations with attributes (roles and

time stamp)– 3 base entities Person, Organisation,

Project– 12 secondary base entities (linked to base

entities)– 36 Look up tables (to ensure data quality)– 39 Link tables (flexibility)– all text fields have multiple language fields

– Maximum representativity with minimum complexity

CERIF: Past, Present and Future 13

’CERIF2000 in a nutshell’

by

Keith G Jeffery, 2001

CERIF: Past, Present and Future 14

’CERIF2000 in a nutshell’ by Keith G Jeffery, 2001

PROJECT

ORGUNIT

Skills

CV

GeneralFacility

ParticularEquipment

ContactResults

PublicationResultsPatentResultsProduct

Service

FundingProgramme

Event

ClassificationPrize/Award

PERSON

CERIF: Past, Present and Future 15

PROJECT

ORGUNITPERSON

’CERIF2000 in a nutshell’ by Keith G Jeffery, 2001 Three Primary Entities

Concepts:(1) entities that reflect main ‘views of entry’ into CRISs(2) entities with no direct functional dependency on each other(3) entities that can refer to themselves (recursion)(4) entities linked in pairs by ‘linking relations’(5) ‘linking relations’ represent temporally-bound roles(6) ‘linking relations’ have primary key of each entity, role, date/time start, date/time end and any other constraints

CERIF: Past, Present and Future 16

PROJECT

ORGUNITPERSON

’CERIF2000 in a nutshell’ by Keith G Jeffery, 2001 Linking Relations

As an Example: PERSON-ORGUNITConcepts:(1) May have many instances of the relationship for each instance of PERSON and ORGUNIT due to role and temporal bounding (2) Role: the purpose of the relationship e.g. employee | head | ….(3) Temporal: the use of <Start Date/Time> and <End Date/Time> defines the duration of this relationship

Analagous for PROJECT_ORGUNIT and PERSON_PROJECT

Person-Orgunit

CERIF: Past, Present and Future 17

PROJECT

ORGUNITPERSON

’CERIF2000 in a nutshell’ by Keith G Jeffery, 2001 ORGUNIT

Concepts:(1) ORGUNIT may have an organisationally subordinate relationship to another ORGUNIT e.g. a Group within a Department(2) ORGUNIT may have a symbiotic relationship to another ORGUNIT e.g. two Groups that have a cooperation agreement(3) ORGUNIT may have a financial relationship to another ORGUNIT e.g. customer - contractor

Orgunit-Orgunit

CERIF: Past, Present and Future 18

PROJECT

ORGUNITPERSON

’CERIF2000 in a nutshell’ by Keith G Jeffery, 2001 PROJECT

Concepts:(1) PROJECT may have an organisationally subordinate relationship to another PROJECT e.g. a sub-Project (2) PROJECT may have a symbiotic relationship to another PROJECT e.g. two Projects that cooperate by agreement(3) PROJECT may have a temporal relationship to another PROJECT e.g. one project follows on from another

CERIF: Past, Present and Future 19

PROJECT

ORGUNITPERSON

’CERIF2000 in a nutshell’ by Keith G Jeffery, 2001 PERSON

Concepts:(1) PERSON may have a socially subordinate relationship to another PERSON e.g. a child of a parent (2) PERSON may have a symbiotic relationship to another PERSON e.g. two researchers that cooperate by agreement(3) PERSON may have a temporal relationship to PERSON

e.g. a lecturer (dates) becomes a reader (dates)

CERIF: Past, Present and Future 20

PROJECT

ORGUNITPERSON

FundingProgramme

’CERIF2000 in a nutshell’ by Keith G Jeffery, 2001 FUNDING PROGRAMME

Concepts:

(1) Funding Programme is related to (a) ORGUNIT and / or (b) PROJECT

(2) A Person is only funded via (a) ORGUNIT and / or (b) PROJECT

(3) any other entities are only funded via (a) ORGUNIT and / or (b) PROJECT

CERIF: Past, Present and Future 21

PROJECT

ORGUNITPERSON

Contact

’CERIF2000 in a nutshell’ by Keith G Jeffery, 2001 Example: CONTACT

Concepts:(1) all contacts in one place - no replication, no update problems(2) >1 contact dependent on role e.g. private address|work address(3) the PROJECT contact is usually the project leader: a PERSON(4) the ORGUNIT contact is usually the head: a PERSON(5) but may have a generic address

e.g. project URI | Orgunit email ([email protected])Analagous for Publication, Product, Patent, Event, Prize/Award....

CERIF: Past, Present and Future 22

PROJECT

ORGUNITPERSON

Result_Publication

’CERIF2000 in a nutshell’ by Keith G Jeffery, 2001 RESULT_PUBLICATION

Concepts:(1) temporally-bound role linking relations(2) >1 linking relation : Result_Publication and other entities(3) PERSON role may be author, co-author, editor, reviewer….(4) ORGUNIT role may be publisher, IPR or copyright owner..(5) PROJECT role may be the source of the idea

CERIF: Past, Present and Future 23

’CERIF2000 in a nutshell’ by Keith G Jeffery, 2001 RESULT_PUBLICATION

PROJECT

ORGUNITPERSON

Result_Publication

Can Express:Person A (DT1 - DT2) (is author of) Publication XOrgunit O (DT1 - DT2) (is owner of IPR in) Publication XPerson A (DT1 - DT2) (is employee of ) Orgunit OPerson A (DT1 - DT2) (is project leader of) Project PPerson A (DT1-DT2) (is member of) Orgunit MPerson A (DT1-DT2) (is member of) Orgunit NOrgunit M (DT1-DT2) (is part of) Orgunit OOrgunit N (DT1-DT2) (is part of) Orgunit O

CERIF: Past, Present and Future 24

PROJECT

ORGUNITPERSON

Skills

CV

GeneralFacility

ParticularEquipment

Contact

ResultsPublication

ResultsPatent

ResultsProduct

Service

FundingProgramme

Event

PERSON Links ’CERIF2000 in a nutshell’ by Keith G Jeffery, 2001

Prize/Award

CERIF: Past, Present and Future 25

PROJECT

ORGUNITPERSON

Skills

CV

GeneralFacility

ParticularEquipment

Contact

ResultsPublication

ResultsPatent

ResultsProduct

Service

FundingProgramme

Event

PROJECT Links ’CERIF2000 in a nutshell’ by Keith G Jeffery, 2001

Prize/Award

CERIF: Past, Present and Future 26

PROJECT

ORGUNITPERSON

Skills

CV

GeneralFacility

ParticularEquipment

Contact

ResultsPublication

ResultsPatent

ResultsProduct

Service

FundingProgramme

Event

ORGUNIT Links ’CERIF2000 in a nutshell’ by Keith G Jeffery, 2001

Prize/Award

CERIF: Past, Present and Future 27

PROJECT

ORGUNITPERSON

Skills

CV

GeneralFacility

ParticularEquipment

Contact

ResultsPublication

ResultsPatent

ResultsProduct

Service

FundingProgramme

Event

Classification

Classification ’CERIF2000 in a nutshell’ by Keith G Jeffery, 2001

Prize/Award

CERIF: Past, Present and Future 28

PROJECT

ORGUNITPERSON

Skills

CV

GeneralFacility

ParticularEquipment

Contact

ResultsPublication

ResultsPatent

ResultsProduct

Service

FundingProgramme

Event

Classification

The Whole Thing ’CERIF2000 in a nutshell’ by Keith G Jeffery, 2001

Prize/Award

CERIF: Past, Present and Future 29

End

CERIF: Past, Present and Future 30

CERIF2000 A Template for a new CRIS

• CRIS can be implemented using subset or superset of full CERIF model:– for projects– for people– for organisations– for publications, patents , products– for services– for facilities, particular equipment

• with role-based relationships

CERIF: Past, Present and Future 31

The Advantages

• Neutral Architecture• Data Model can be implemented:

– relational – object-oriented – information retrieval (including WWW)

• Process model can be implemented– DBMS / query; distributed; – html web / harvesting / IR-query;– advanced knowledge-based technology

CERIF: Past, Present and Future 32

The use of CERIF2000 today• ICERIS (IC) access to information on Icelandic research

projects and R&D results • AURIS-MM (AU) Provides access to Austrian University

Research, extended with Mulitimedia• SICRIS (SL) Access to University research in Slovenia• SRIS (GB) Scottish Research Information Systems,

public research in Scotland• Fdok (NO) University of Bergen, results• CRIS-MER (EC) Research info on Migration and Ethnic

Relations (planned)• Corporate model, CRLC (UK) • METIS (previously OZIS) currently used by the majority

of the Dutch Universities

CERIF: Past, Present and Future 33

Past• CERIF91• Experience and Problems• Requirements• CRISs and CERIF91

Present• CERIF 2000 • The Three Data Models• CERIF Data Model• The Advantages• Use of CERIF2000 today

Future • CERIF 2002• Revisions to CERIF2000• Status • Issues• Custodians

CERIF: Past, Present and Future 34

CERIF2002

• Revisions to CERIF2000 standard

• Status of CERIF2002• Issues• Custodians of the model

CERIF: Past, Present and Future 35

Precision and formalisation

• CERIF2000 corrected, minor errors in the spreadsheet schema

• Inconsistencies in the EC-provided schema driven from the Entity-relationship diagrams

CERIF: Past, Present and Future 36

Some development directions

• Need to document problems / local solutions

• That means additional entities & attributes– grey literature: CERIF2000 provides for a

link to any bibliographic database; may need more information within CERIF

– ‘application groups’ e.g. Additional finance information for R&D budgetary control

– More….

CERIF: Past, Present and Future 37

• Figure from Keith G Jeffery: AN ARCHITECTURE FOR GREY LITERATURE IN A R&D CONTEXT, 1998

Publications

CERIF: Past, Present and Future 38

Status of CERIF 2002

• CERIF91 CERIF2000– Recommendation to member states in 1991– EC project coordinated changes of the model

• CERIF2000 CERIF2002– for implementation (correction to the

datamodel)– backward compatible– And developments required

• CERIF2002 ????

CERIF: Past, Present and Future 39

CERIF 2002 Issues

– Proposed changes/extensions has required few changes in the model.

– What is the basic model? Is there a part of the model that should always be consistent?

– Could that be the Meta data model?– Or is the minimum to keep the 3 basic

entities and their relations?– Should we have CERIF certified databases?– ????

CERIF: Past, Present and Future 40

CERIF2000 Data Model Base entities CERIF 2000 Guidelines, Final report of the CERIF Revision Working Group, co-ordinated by DG XIII-D4, European Comission, 1999

CERIF: Past, Present and Future 41

Custodians of the model

• Requires some organisation– EC handed responsibility to euroCRIS– euroCRIS set up CERIF Task Group

CERIF: Past, Present and Future 42

CERIF2000 is

– A flexible •Always developing compatibly

– and sound •Formally defined and validated

– data model

All CRISs should use it !