implementing rioxx

27
Paul Walk Head of Technology Strategy and Planning, EDINA [email protected] @paulwalk Implementing RIOXX

Upload: paul-walk

Post on 16-Feb-2017

729 views

Category:

Education


0 download

TRANSCRIPT

Page 1: Implementing RIOXX

Paul Walk Head of Technology Strategy and Planning, EDINA [email protected] @paulwalk

Implementing RIOXX

Page 2: Implementing RIOXX

…let us know a bit more about RIOXX so that we can discuss how it might be applied in the context of DSpace repositories.

Page 3: Implementing RIOXX

contents

• the need for RIOXX

• RIOXX in detail (some specifics)

• support for RIOXX implementation (in DSpace)

Page 4: Implementing RIOXX

the need for RIOXX

Page 5: Implementing RIOXX

why build RIOXX?

• new policies from RCUK and HEFCE mandate that any journal article funded by research grants be made publicly accessible in a repository

• these policies require that universities make metadata about such papers easily discoverable

• the available metadata formats were inadequate • OAI-DC was not rich enough • OpenAIRE was better but demanded project IDs be encoded in particular

syntax not compatible with project IDs from UK Research Councils • OpenAIRE syntax

• info:eu-repo/grantAgreement/Funder/FundingProgram/ProjectID/[Jurisdiction]/[ProjectName]/[ProjectAcronym]

• RCUK syntax: • OpaqueProjectID/version

Page 6: Implementing RIOXX

particular concerns

• how to represent the funder

• how to represent the project/grant

• implementing recommendations from the V4OA process: • controlled vocabularies for rioxxterms:version, rioxxterms:apc • use of NISO’s Open Access Metadata and Indicators (license_ref and

free_to_read)

• how to represent the persistent identifier of the item described

• provisions of identifier(s) pointing to related dataset(s)

• how to represent the rights of use of the item described

Page 7: Implementing RIOXX

principles

• purpose driven • designed to meet a singe, focussed use-case • solve one problem well, avoid ‘feature creep’

• focussed on implementation • has to be relatively easy to implement • ‘shallow’ structure • the simplest thing that can possibly work

• open development • public consultation • tested openly

• rapid development • short iterations

Page 8: Implementing RIOXX

RIOXX in detail (some specifics)

Page 9: Implementing RIOXX

• an application profile using properties from 4 namespaces: • 11 properties from Dublin Core (dc and dcterms) • 2 properties from NISO Open Access Metadata and Indicators • 8 from a new namespace - ‘rioxxterms’

• constraints imposed through several controlled vocabularies

• it has one purpose: to provide a mechanism to help institutional repositories in the UK comply with the RCUK policy on open access. • it is not designed to provide general interoperability!!

• born at UKOLN, developed by EDINA and Chygrove Ltd., supported by Research Councils UK (RCUK) & Higher Education Funding Councils of England (HEFCE) and funded (initially) by Jisc

• Version 2.0 released in January 2015

Page 10: Implementing RIOXX

components of RIOXX

• a metadata ‘application profile’ • technical documentation • an XSD schema to facilitate metadata validation • mapping to OpenAIRE 3

• a set of guidelines for systems implementation (with a focus on institutional repositories)

• implementation monitoring and testing framework

• a supporting website (http://www.rioxx.net)

• (+ independent development of software plugins etc. to support RIOXX implementation)

Page 11: Implementing RIOXX

some specific properties

• dc:identifier

• dc:relation & rioxxterms:version_of_record

• dcterms:dateAccepted

• rioxxterms:author & rioxxterms:contributor

• rioxxterms:project

• license_ref

Page 12: Implementing RIOXX

dc:identifier

• identifies the open access item being described by the RIOXX metadata record. • regardless of where it is located

• recommended to identify the resource itself, not a ‘splash page’ • this will not always be possible or desirable

• whatever it identifies, it MUST be an HTTP URI

• Example:

<dc:identifier>http://oro.open.ac.uk/2/1/LIBARTVICEprints.pdf

</dc:identifier>

Page 13: Implementing RIOXX

dc:relation & rioxxterms:version_of_record

• rioxxterms:version_of_record • an HTTP URI which is a persistent identifier for the published version of the

resource • will often (normally?) be a DOI

• dc:relation • optional property containing an HTTP URI identifying related resources

(e.g. research data sets, software source code etc.)

Page 14: Implementing RIOXX

dcterms:dateAccepted

• this MUST be provided

• is more precise than other possible dated events - such as ‘published’

Page 15: Implementing RIOXX

rioxxterms:author & rioxxterms:contributor

• both of these accept an optional ‘ID’ attribute • this MUST be an HTTP URI • use of ORCID is strongly recommended

• all authors should be represented as individual rioxxterms:author properties

• the ‘first named author’ can be indicated with another optional attribute called, er…, ‘first-named-author’

• rioxxterms:contributor is for other parties that are not authors but are credited with contributing in some way to the publication

• Example:

<rioxxterms:author id="http://orcid.org/0000-0002-1395-3092"> Lawson, Gerald</rioxxterms:author>

Page 16: Implementing RIOXX

rioxxterms:project

• this expresses funder and project_id in one, slightly more complex, property

• the use of global IDs, e.g. International Standard Name Identifier (ISNI) for funding organisations is recommended

• Example:

<rioxxterms:project funder_name="Engineering and Physical Sciences Research Council" funder_id="http://isni.org/isni/0000000403948681"> EP/K023195/1</rioxxterms:project>

Page 17: Implementing RIOXX

license_ref

• adopted from NISO’s Open Access Metadata and Indicators

• takes an HTTP URI and a start date

• the URI should identify a license • there is work under way to create a ‘white list’ of acceptable licenses

• embargoes can be expressed this way, with a license identified to ‘take effect’ at some (possibly) future date

• Example:

<ali:license_ref start_date=“2015-02-17”>http://creativecommons.org/licenses/by/4.0

</ali:license_ref>

Page 18: Implementing RIOXX

support for RIOXX implementation

(in DSpace)

Page 19: Implementing RIOXX

supporting website: www.rioxx.net

Page 20: Implementing RIOXX

DSpace patch

• developed by @mire

• specified and funded by Jisc

• https://www.jisc.ac.uk/repository-technical-support

• no online supporting information (as far as I can tell) but an email address to contact for info

Page 21: Implementing RIOXX

open, community support

• engagement from software suppliers

• community feedback

• good practice starting to be identified and discussed here

Page 22: Implementing RIOXX

continuous testing

Page 23: Implementing RIOXX

continuous testing - reporting

Page 24: Implementing RIOXX

variable compliance

• ePrints exposes records as ‘RIOXX’ when they are not even valid at a basic level

• the DSpace patch from @mire will, allegedly, not do this.

• “In the DSpace patch, items are only visible in the RIOXX OAI-PMH endpoint when they provide the mandatory fields. More details: • https://github.com/atmire/RIOXX/#RIOXX-test-OAI-PMH-endpoint”

Page 25: Implementing RIOXX

OpenAIRE Mapping

Page 26: Implementing RIOXX

39 repositories 4 aggregation systems

…in less than one year!

Page 27: Implementing RIOXX

Paul Walk Head of Technology Strategy and Planning, EDINA [email protected] @paulwalk

thanks for listening!

the RIOXX metadata application profile is maintained & supported by EDINA:

http://www.rioxx.net