putting open access into practice

48
VTT, Espoo (Finland) 11-12 May 2015 YEAR Annual Conference 2015: Open Science in Horizon 2020 This work is licensed under a Creative Commons Attribution 4.0 International License . Dr. Nancy Pontika Connecting Repositories (CORE) Knowledge Media Institute Open University Twitter: @oacore Putting Open Access into Practice

Upload: nancy-pontika

Post on 03-Aug-2015

244 views

Category:

Education


0 download

TRANSCRIPT

VTT, Espoo (Finland) 11-12 May 2015

YEAR Annual Conference 2015: Open Science in Horizon 2020

This work is licensed under a Creative Commons Attribution 4.0 International License.

Dr. Nancy PontikaConnecting Repositories (CORE)

Knowledge Media InstituteOpen UniversityTwitter: @oacore

Putting Open Access into Practice

YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015Young European

Associated Researchers

Outline

* Introduction to Open Access (OA)* The need for aggregating open access content* The CORE system

YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015Young European

Associated Researchers

Outline

* Introduction to Open Access (OA)* The need for aggregating open access content* The CORE system

YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015Young European

Associated Researchers

Research Cycle

Research Idea

Receives funding

Research Conduction

Publication

YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015Young European

Associated Researchers

Who does research?

YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015Young European

Associated Researchers

How much does it cost to access it?

YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015Young European

Associated Researchers

Who has access?

YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015Young European

Associated Researchers

What is Open Access (OA)?

By “open access” to [peer-reviewed research literature], we mean its free availability on the public internet, permitting any users to read, download, copy, distribute, print, search, or link to the full texts of these articles, crawl them for indexing, pass them as data to software, or use them for any other lawful purpose, without financial, legal, or technical barriers other than those inseparable from gaining access to the internet itself. [BOAI, 2002]

YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015Young European

Associated Researchers

OA Routes : open access repositories

* Do NOT perform peer-review

* Pre-prints, post-prints, final version

* Standardized: OAI-PMH compatible

RepositoriesGreen Route

Institutional Subject

Open Research Online (ORO)

arXiv.org

YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015Young European

Associated Researchers

OA Routes : open access journals

Gold route

* Open Access Journals offer peer-reviewed research, pure open access journals. Sometimes they charge an Article Processing Charge (APC), sometimes they do not.

* Subscription based journals that offer an open route- hybrid journals- always charge Article Processing Charges (APCs)

YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015Young European

Associated Researchers

Creative Commons Licenses

YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015Young European

Associated Researchers

OA Growth

YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015Young European

Associated Researchers

Growth of items in OA repositories

YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015Young European

Associated Researchers

Records stored across all OARs

164,259,752 records across 2,531 repositories as estimated by OpenDOAR

[December, 2013 -http://www.opendoar.org/]

YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015Young European

Associated Researchers

HEFCE OA policy

Higher market share for OA content imminent!

YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015Young European

Associated Researchers

Access + reuse = Open Access

http://www.phdcomics.com/comics.php?f=1533

YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015Young European

Associated Researchers

Outline

* Introduction to Open Access (OA)* The need for aggregating open access content* The CORE system

YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015Young European

Associated Researchers

COAR: About harvesting and aggregations

“Each individual repository is of limited value for research: the real power of Open Access lies in the possibility of connecting and tying together repositories, which is why we need interoperability. In order to create a seamless layer of content through connected repositories from around the world, Open Access relies on interoperability, the ability for systems to communicate with each other and pass information back and forth in a usable format. Interoperability allows us to exploit today's computational power so that we can aggregate, data mine, create new tools and services, and generate new knowledge from repository content.’’

[COAR manifesto]

YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015Young European

Associated Researchers

What is an aggregation?

Data providers

Aggregator

3. Standardised communication

1. Data standardisation

2. Data collection, harmonisation & enrichment

Users

YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015Young European

Associated Researchers

What is an aggregation?

Aggregators are intermediaries between providers and users that collect resources from many sources and add value by improving access to them.

Physical world: * libraries* book stores * museums * art galleries*supermarkets

Digital world: * digital libraries (e.g. PubMed) * collections (e.g. The European Library) * search engines (e.g. Google cache) * newspaper aggregators (e.g. Google News) * online retailers (e.g. Amazon)* travel aggregators (e.g. Kayak) * insurance aggregators (GoCompare)* aggregators of research papers (e.g. CORE)

YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015Young European

Associated Researchers

Successful online aggregators and value

* Reduce the time accessing information

* Standardise and harmonise content from many providers

* Enrich content with new information

* Provide harmonised access to users

* Enable the discovery of new information

YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015Young European

Associated Researchers

Few aggregators provide unrestricted access to data

Data providers

Aggregator

ServicesHuman user

Machine user

Typically little or no support

YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015Young European

Associated Researchers

Outline

* Introduction to Open Access (OA)* The need for aggregating open access content* The CORE system

YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015Young European

Associated Researchers

UK need for aggregation

Bringing the UK’s open access research outputs together:• Feasibility study

commissioned by Jisc, published June 2014

• Referred to as “Open Mirror”

http://repository.jisc.ac.uk/5570/1/JISC_REPORT_open_mirror_090514_FINAL_WEB.pdf

YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015Young European

Associated Researchers

UK need for aggregations - conclusion

Jisc should support CORE and seek international support for it.

“CORE should focus on [a]ggregating materials from UK IRs and from publishers and subject repositories of outputs with UK-based authors to ensure that UK resources are well represented in CORE”l

YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015Young European

Associated Researchers

CORE's mission

Aggregate all open access content distributed across different systems worldwide, enrich this content and provide access to it through a set of services …

YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015Young European

Associated Researchers

Genesis of the CORE project family Oct '10 First steps of CORE

Feb '11 (6M) CORE – Jisc

Nov '11 (9M) ServiceCORE

Jan '12 (28M) DiggiCORE, Jisc – ESRC - NWO

Feb '13 (36M) Europeana Cloud

Apr '13 (4M) CORE HEIF - HEFCE

Jul '13 CORE selected UK national aggregator

Feb '14 (4M) UK Aggregation

Jul '14 (3M) UK Aggregation 2

Summer '15 More to come...

YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015Young European

Associated Researchers

The aggregation process

YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015Young European

Associated Researchers

Processing pipeline

* Metadata download, extraction and cleaning* Full-text harvesting* Text-extraction* Language detection* Extraction of citation references from text* Detection of citation reference targets* Identification of related content* Detection of duplicate items* Parsing of author names* Indexing

YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015Young European

Associated Researchers

CORE supports a 3 level access architecture

* Programmable (raw) data access- As downloadable files or through API

* Transaction information access- Explore content released through the use of a web portal and its search

* Analytical information access - Access to statistical information at the collection level through the use of tables or charts.

http://www.dlib.org/dlib/november12/knoth/11knoth.html

YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015Young European

Associated Researchers

CORE supports a 3 level access architecture

* Programmable (raw) data access. - Developers, DLs, DL researchers, companies

* Transaction information access. - Researchers, students, life-long learners …

* Analytical information access.- Funders, government, bussiness intelligence …

http://www.dlib.org/dlib/november12/knoth/11knoth.html

YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015Young European

Associated Researchers

CORE supports a 3 level access architecture

* Programmable (raw) data access. - Apps: CORE API, CORE Data Dumps

* Transaction information access. - Apps: CORE Portal, CORE Mobile, CORE (recommendation) Plugin

* Analytical information access. - Apps: Repository Analytics, CORE Policy Compliance Analytics, Repositories Dashboard (implementation phase)

http://www.dlib.org/dlib/november12/knoth/11knoth.html

YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015Young European

Associated Researchers

Exposing the aggregated content

YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015Young European

Associated Researchers

CORE applications (1)CORE Portal – Allows searching and navigating scientific publications aggregated from Open Access repositories

YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015Young European

Associated Researchers

CORE applications (2)

CORE Mobile – Allows searching and navigating scientific publications aggregated from Open Access repositories

YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015Young European

Associated Researchers

CORE applications (3)

YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015Young European

Associated Researchers

CORE applications (4a)

CORE Plugin – A plugin to system that recommendations for related items.

YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015Young European

Associated Researchers

CORE applications (4b)

CORE Plugin – A plugin to system that recommendations for related items.

YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015Young European

Associated Researchers

Build on top of CORE API...CORE Plugin – A cross-repository recommendation system integrated into OJS.

YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015Young European

Associated Researchers

CORE Applications (5)Repository Analytics – is an analytical tool supporting providers of open access content (in particular repository managers)

YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015Young European

Associated Researchers

CORE Applications (6)Repository Dashboard (under development) – Tool to support the implementation and monitoring of the UK HEFCE OA policy.

YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015Young European

Associated Researchers

The definition of OA for post-2014 REF

Consultation on OA in the post-2014 Research Excellence Framework, paragraph 25 says that:

- Accessible through a UK HEI repository (immediately upon acceptance or publication). - Made available as the final peer-reviewed text (full-text) after a (reasonable) embargo period specified by the publisher.- Harvestable using automated tools. - In a machine readable form to allow text-mining- Unambiguously identifiable in the institutional repository, including items available through a link to another website.

YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015Young European

Associated Researchers

The developed tool

YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015Young European

Associated Researchers

CORE statistics – current state (1)

* Content: 24M+ records, 670+ repositories, 1.8M+ full-texts * The world’s largest full-text open access dataset and still growing* The UK national aggregator (part of Repositories Shared Services project - Jisc)* Full-text aggregator (not just metadata)* Placed among Top 10 search engines for research that go beyond Google [Jisc, 2013]* Listed among Top 100 Thesis and Dissertation Resources* Part of Jisc’s Repositories Shared Services Project* Exploring a partnership of Jisc and OU to deliver CORE service

YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015Young European

Associated Researchers

CORE statistics – current state (2)

Used by many researchers and organisations, including:

* the European Library* UNESCO* ResearchResearch.com* Open Access Button* OARR project (Nottingham University, Cottage Labs)* HyberLink (Los Alamos National Laboratory, University of Edinburgh)* Georgetown University researchers* Bauhaus University Weimar researchers* The repository community (OARR, RIOXX, IRUS-UK)

YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015Young European

Associated Researchers

Conclusion

* Open Access outputs available online on the rise* OA infrastructure, (repositories, aggregators) must enable efficient use* CORE provides single access point to this knowledge and enables its mining* Opportunities for innovative applications and research* There are challenges making aggregators hard to operate and maintain* OA infrastructure should be available for the benefit of all and should not be owned by the publishing lobby

YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015Young European

Associated Researchers

Have some fun with Open Access

* Open Access Explained!http://www.phdcomics.com/comics.php?f=1533

* Open Access quiz http://nile.lub.lu.se/loDownload/68/quiz_08.htm

YEAR Annual Conference 2015: Open Science in Horizon 2020 - Espoo (FI) – 11-12 May 2015Young European

Associated Researchers

Thank you!

Dr. Nancy PontikaEmail: [email protected]

Twitter: @nancypontika