researcher identifiers and national federated search portal for japanese institutional repositories

20
Researcher Identifiers and National Federated Search Portal for Japanese Institutional Repositories Kei Kurakawa 1 , Hideaki Takeda 1 , Ryo Shiozaki 1 , Shun Morimoto 2 , and Hideki Uchijima 2 1: National Institute of Informatics, Japan 2: Kanazawa University Library, Japan OR2012, The 7th international conference on Open Repositories 09 - 13/Jul/2012, the University of Edinburgh, UK RF3: Pecha Kucha – National Infrastructures, 11/Jul/2012: 11:00am – 12:30pm

Upload: kei-kurakawa

Post on 15-Jun-2015

695 views

Category:

Education


0 download

DESCRIPTION

OR2012, The 7th international conference on Open Repositories09 - 13/Jul/2012, the University of Edinburgh, UKRF3: Pecha Kucha – National Infrastructures, 11/Jul/2012: 11:00am – 12:30pm

TRANSCRIPT

Page 1: Researcher Identifiers and National Federated Search Portal for Japanese Institutional Repositories

Researcher Identifiers and National Federated Search Portal

for Japanese Institutional Repositories

Kei Kurakawa1, Hideaki Takeda1, Ryo Shiozaki1, Shun Morimoto2, and Hideki Uchijima2

1: National Institute of Informatics, Japan

2: Kanazawa University Library, Japan

OR2012, The 7th international conference on Open Repositories09 - 13/Jul/2012, the University of Edinburgh, UKRF3: Pecha Kucha – National Infrastructures, 11/Jul/2012: 11:00am – 12:30pm

Page 2: Researcher Identifiers and National Federated Search Portal for Japanese Institutional Repositories

Name disambiguation

• Name ambiguity problem is caused by– the same family name and the same first name, maiden name

usage, pen name,– character variants in Kanji (for Japanese), – and a variety of name formats for each journals

• To assign identifier for researcher is an answer.• If this problem is solved, it would be available to

– exactly compile a list of research achievement,– exactly assess contributions of the researcher,– exactly trace development paths by the researcher, – and exchange exact information of the researcher in various

scholarly communication scenes.

2

Page 3: Researcher Identifiers and National Federated Search Portal for Japanese Institutional Repositories

Assigning identifiers for authors of repository items

• DAI (Digital Author Identifier), the SURF foundation, the Netherlands

• Names Project, mimas, University of Manchester, UK

• Networking Names, OCLC• ORCID, ORCID Inc.• Researcher Name Resolver, NII, Japan

3

Page 4: Researcher Identifiers and National Federated Search Portal for Japanese Institutional Repositories

Objective of the Researcher Name Resolver (RNR)• Basic function as giving

– Identifiers of Japanese researcher on the Web– Web resource links for researcher

• Implementations– Researcher URI– Identifier management– Researcher pages– Web services

• Applications– Name authority for external systems– Researcher resource link resolver

4

http://rns.nii.ac.jp

Page 5: Researcher Identifiers and National Federated Search Portal for Japanese Institutional Repositories

Researcher identifier scheme and management

• Identifier is of the form,– 13-digit number– namely, Researcher Name Resolver ID.

– http://rns.nii.ac.jp/nr/xxxxxxxxxxxxx• Identifier management

– Identifiers of a researcher are grouped and aliased to a representative of them

5

Page 6: Researcher Identifiers and National Federated Search Portal for Japanese Institutional Repositories

Prior registration

• As priming a pump effect,– Registrants of the database of Grants-in-Aid for Scientific

Research in Japan, KAKEN are registered in RNR– 200,882 researchers are registered at 2012-07-06

• Extracted data attributes– Names (Japanese Kanji character, Japanese phonetic

character, English character)– Affiliated institution (last), department, job title– Grants-in-Aid for Scientific Research Researcher Number– (Research fields of Grants-in-Aid for Scientific Research)– (Research keywords of Grants-in-Aid for Scientific

Research)6

Page 7: Researcher Identifiers and National Federated Search Portal for Japanese Institutional Repositories

Web resource links for researcher

7

Campus Directories Planned

Planned

Academic information resource portal and link system of Japan

The biggest researcher directory of Japan

Database of Grants-in-Aidfor Scientific Research

The biggest article searchpublished in Japan

Direct links to external systems

Page 8: Researcher Identifiers and National Federated Search Portal for Japanese Institutional Repositories

Batch processing approach

• To make links to external systems, the RNR– loads profile data,

• ResearcherID based researcher profiles including a list of external identifiers

– uses identifier mapping rules,• RNR identifier is implicitly reuse of KAKEN identifier• CiNii identifier is implicitly reuse of KAKEN identifier

– and, loads external identifier mapping lists• J-GLOBAL identifier to KAKEN identifier• ReaD&Researchmap identifier to KAKEN identifier

8

Page 9: Researcher Identifiers and National Federated Search Portal for Japanese Institutional Repositories

Crawling approach

• To build direct links to campus directories, the RNR – crawled campus directories,– built a list of researcher pages with names,– and identify the researchers over RNR registrants

• To identify researchers, the RNR checks institution and whether or not– Kanji name is the same,– institution is the same,– and, the name is unique in the institution.

• In fact,– 226 university campus directories were crawled.– 102,949 researcher pages were indexed.– RNR has 23,252 links to those campus directories.– It is high precision rate, but low recall rate.– For improvement, machine learning is a promising approach. 9

Page 10: Researcher Identifiers and National Federated Search Portal for Japanese Institutional Repositories

Researcher page

10

Researcher name (Kanji, Katakana, English)

Researcher basic information

Direct links

Search links

Research keywords ofGrants-in-aid for Scientific Research

Research fields of Grants-in-aid for Scientific Research

URI of this document

Researcher URI

Researcher Name Resolver ID

Page 12: Researcher Identifiers and National Federated Search Portal for Japanese Institutional Repositories

Application for JAIRO

• JAIRO is the national federated search portal.• RNR is as being name authority for JAIRO.

• What repository managers do for the work is – to embed an “id” attribute in the “creator” field of

metadata JAIRO (IRDB) harvests, then1. put a RNR URI in “id” attribute value field, or

2. put anyURI for researcher in “id” attribute value field, and upload researcher profiles with external ids table on RNR

12

Page 13: Researcher Identifiers and National Federated Search Portal for Japanese Institutional Repositories

Name disambiguation framework

13

Page 14: Researcher Identifiers and National Federated Search Portal for Japanese Institutional Repositories

<?xml version="1.0" encoding="UTF-8" ?><OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/ http://www.openarchives.org/OAI/2.0/OAI-PMH.xsd"> <responseDate>2011-05-26T13:34:09Z</responseDate> <request metadataPrefix="junii2" verb="GetRecord" identifier="oai:ir.lib.shizuoka.ac.jp:10297/5644">http://ir.lib.shizuoka.ac.jp/dspace-oai/request</request> <GetRecord> <record> <header> <identifier>oai:ir.lib.shizuoka.ac.jp:10297/5644</identifier> <datestamp>2011-05-22T08:02:22Z</datestamp> <setSpec>hdl_10297_24</setSpec> </header> <metadata> <junii2 xmlns="http://irdb.nii.ac.jp/oai" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://irdb.nii.ac.jp/oai http://irdb.nii.ac.jp/oai/junii2.xsd"> <title>Orientation-dependent epitaxial growth of GaAs by current-controlled liquid phase epitaxy</title> <creator>Mouleeswaran, D.</creator> <creator id=“http://rns.nii.ac.jp/nr/1000001133354”>Koyama, T.</creator> <creator id=“http://rns.nii.ac.jp/nr/1000048520242”>Hayakawa, Yasuhiro</creator> <NDC>459</NDC> <description>The orientation dependence of the selective epitaxial growth of Gallium Arsenide (GaAs) has been investigated to achieve a thick epitaxial layer for application to X-ray detectors. Selective epitaxial growth was carried out on patterned GaAs with [0 1 1], [0 1 2], [0 1 0], [0 1 −2], [0 1 −1] and their equivalent seed orientations by current-controlled liquid phase epitaxy (CCLPE). SiO2 was used as a mask layer to fabricate the various seed orientations on the Si-doped GaAs (1 0 0) substrate and various growth periods and current densities were considered. Solute transport in the solution was enhanced by the electromigration of solute by an applied DC electric current, which caused an incremental growth in vertical and lateral directions in all orientations. The highest vertical thickness of 268 μm in the [0 1 −1] orientation and the largest lateral growth of 318 μm in the [0 1 2] orientation were achieved at 7.5 A cm−2 current density for 6 h. The seed aligned in the [0 1 2] orientation was favorable for high lateral growth of GaAs. The [0 1 1], [0 1 0] and [0 1 −2] seed orientations were suitable for application in a GaAs X-ray detector.</description> <publisher>Elsevier</publisher> <NIItype>Journal Article</NIItype> <format>application/pdf</format> <URI>http://hdl.handle.net/10297/5644</URI> <fullTextURL>http://ir.lib.shizuoka.ac.jp/bitstream/10297/5644/1/110520001.pdf</fullTextURL> <jtitle>Journal of Crystal Growth</jtitle> <issn>00220248</issn> <NCID>AA00696341</NCID> <volume>321</volume> <issue>1</issue> <spage>85</spage> <epage>90</epage> <dateofissued>2011-04-15</dateofissued> <language>eng</language> <doi>info:doi/10.1016/j.jcrysgro.2011.02.026</doi> <rights>Copyright © 2011 Elsevier B.V. All rights reserved.</rights> <textversion>author</textversion> </junii2> </metadata> </record> </GetRecord></OAI-PMH>

Harvested metadata embedded with Researcher URI (anyURI)

14

junii2 extension

<creator id=“http://rns.nii.ac.jp/nr/1000001133354”>Koyama, T.</creator><creator id=“http://rns.nii.ac.jp/nr/1000048520242”>Hayakawa, Yasuhiro</creator>

Page 15: Researcher Identifiers and National Federated Search Portal for Japanese Institutional Repositories

How author identifier changes UI?

• JAIRO is an example for exploring scholarly knowledge.• We assume that there are two modes of knowledge

exploration, i.e.– basically, we use keywords for search, and– sometimes, we use author name for search.

• When we use author id for search,– we expect to get a exact list of articles of the researcher– therefore, we could trace a research development path of

the researcher.• New UI ought to be designed, depending on search

conditions with or without author identifier.

15

Page 16: Researcher Identifiers and National Federated Search Portal for Japanese Institutional Repositories

Search by author

16

Suggest authors with ID

Page 17: Researcher Identifiers and National Federated Search Portal for Japanese Institutional Repositories

Search results (grouping)

17

Grouping Selection

Item type grouping

Institution grouping

Page 18: Researcher Identifiers and National Federated Search Portal for Japanese Institutional Repositories

Privacy and data protection

• On dealing with researcher profiles and identifiers, we should take care of privacy laws.

• Related acts in Japan– Act on the Protection of Personal Information (effect from 2005)

• Act on the Protection of Personal Information Held by Administrative Organs• Act on the Protection of Personal Information Held by Independent

Administrative Agencies, etc.• Regulations on the Protection of Personal Information

• Phase of practice– Academic research (feasibility study)– Business

• Current status in Japan– It is not enough to have discussions on privacy and data protection of

researcher profiles for repositories.– License is not explicit on the researcher profiles for repositories.

18

Exclusion from application

Page 19: Researcher Identifiers and National Federated Search Portal for Japanese Institutional Repositories

Future work

• To make linkage between RNR and external global services, e.g. ORCID, Microsoft Academic Search

• To improve precision/recall rate of linkage• To state privacy and data protection matter• To widely promote our activity to put

author identifiers in bibliographic metadata

19

Page 20: Researcher Identifiers and National Federated Search Portal for Japanese Institutional Repositories

Summary

• Identifier for researcher is required to solve name ambiguity problem.

• Researcher Name Resolver deals with identifiers for researchers in Japan.– It manages researcher identifiers.– It shows researcher profile on researcher pages.– It gives links on researcher pages to external systems– It provides relevant web services.

• When RNR functions as name authority for JAIRO, it provides a search by author identifier.

20