China’s Scientific Data Sharing Initiatives and Future Perspective

Prof. Peng, Jie ([email protected])
Dr. Liu, Runda ([email protected])

5 March 2012, Paris, Delivering data to science
Institute of Scientific & Technical Information of China

Agenda

1. The Progress of Scientific Data Sharing Projects in China

2. Our Work in Scientific Data Sharing

3. Conclusion & Outlook

1. The Progress of SDSP in China

• The sharing of scientific data is important.

• There are two types of sharing: sharing activities among resource-holding bodies, and sharing services between resource-holding bodies and users.

• Data fall into three categories: public-domain data, data with "grey" characteristics (greyData), and commercial data.

Scientific Data Sharing

• In 1982, the Chinese Academy of Sciences (CAS) proposed the “Scientific Database and Information System” project.

• In 1988, together with related agencies and research institutes, CAS built the World Data Center China centers and formed the China Committee of CODATA.

• In 2001, the Ministry of Science and Technology (MOST) conducted a series of investigations and released a series of reports. In the same year, a meteorological data sharing pilot project was launched.

China Scientific Data Sharing Project

• In 2003, the Ministry of Finance started allocating special funding for MOST to construct the China Scientific Infrastructure, of which the Scientific Data Sharing Projects (SDSP) are a part.

• Under the SDSP framework, comprehensive scientific data sharing activities were started. In the first stage, 24 government agencies were involved in building 8 platforms in fields such as agriculture, earth system science, and population and health.

China Scientific Data Sharing Project (Cont.)

[Figure: Development stages of SDSP, 2001 to 2011. Timeline showing an experiment stage followed by a comprehensive construction stage, with activities labelled: preliminary research; law and regulation; national scientific data center pilots; national scientific data sharing network pilots; spread of national data centers; spread of the scientific data sharing network; optimization and service; operation and service.]

• SDSP was developed under a comprehensive plan at the national level (regulations and a management structure, 263 standards and criteria).

• Public-good data from different government sectors were put into a common sharing framework (10 data centers or service networks, 100 branches and nodes).

The whole view of SDSP

• SDSP makes all these data accessible to all interested users at an affordable cost, or free if possible (3,000 databases for basic research and public welfare, 200 institutions, 140 TB of data).

• SDSP forms a multi-tiered, cross-agency, cross-location, cross-discipline distributed scientific data sharing system that bridges the gap between public-good data-holding agencies and institutes and their users (2009 statistics: 170,000 registered users, 62 million visits, 430 TB downloaded).

http://www.sciencedata.cn

The whole view of SDSP (Cont.)

More results

• An open mindset has formed in the scientific data field:
– Regional SDSPs were set up.
– Many joint scientific data exchange programs are being sought, such as the HKH program with ICSITC.
– China actively participates in CODATA.
– China also takes part in the WDC (now the World Data System). All WDCs in China have taken part in SDSP since 2002.

2. Work Relating to Scientific Data Sharing in ISTIC

Main duty

• The Institute of Scientific and Technical Information of China (ISTIC) is the only research institute affiliated with the Ministry of Science & Technology of China (MOST) that conducts S&T information research and services.

• We collect S&T literature as well as other types of S&T information.

• The Resource Sharing and Promotion Center (RSPC) conducts research and practice including, but not limited to:
– S&T resource management theory studies (since 2006: investigations, regulations, etc.)
– S&T information resource sharing technical solutions

2.1 Scientific Data DOI in China

DOI Registration Agency in China

• ISTIC, in conjunction with WANGFANG Data Group, became China’s only DOI® Registration Agency in March 2007.

• The agency focuses on developing a Chinese-language platform and gateway for DOI name use, and is trying to attract metadata registration by building the related infrastructure.

• The project started with Chinese journal articles and scientific data, and is expanding to books and theses.

The Progress of Scientific Data DOI in China

• DOI name coding regulation

• Metadata description standards

• Construction of a Chinese-language service platform for scientific data, providing DOI resolution and retrieval services

• Linking between data and journal articles

• Building a service alliance; registration of 15K natural S&T resources, plus others
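
As a rough illustration of what DOI resolution involves, the Python sketch below splits a DOI name into its prefix and suffix and builds a resolver URL. It relies only on the general DOI convention (a prefix beginning with "10.", a slash, then a suffix, resolved through the public proxy at https://doi.org/). The function names and the sample DOI names are hypothetical and are not taken from the ISTIC platform or its coding regulation.

```python
from urllib.parse import quote


def split_doi(doi: str) -> tuple[str, str]:
    """Split a DOI name into (prefix, suffix) at the first slash."""
    prefix, sep, suffix = doi.strip().partition("/")
    # A DOI name needs a "10." prefix, a slash, and a non-empty suffix.
    if not sep or not prefix.startswith("10.") or not suffix:
        raise ValueError(f"not a DOI name: {doi!r}")
    return prefix, suffix


def resolver_url(doi: str, resolver: str = "https://doi.org/") -> str:
    """Build a resolver URL, percent-encoding characters unsafe in a URL path."""
    prefix, suffix = split_doi(doi)
    return resolver + quote(f"{prefix}/{suffix}", safe="/")


# Hypothetical DOI name, used only to show the call shape.
print(split_doi("10.1234/dataset.001"))
print(resolver_url("10.1234/dataset.001"))
```

Only the first slash separates prefix from suffix, so a suffix may itself contain slashes.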

2.2 Scientific Data Classification and Navigation System

Scientific Data Classification and navigation system

• A platform for information about scientific data resources on the internet (metadata).

• The system accelerates DOI registration and the application of scientific data.

• It effectively classifies scientific data resources distributed across the internet.

• It helps raise the standard of scientific data resources in China and provides fast navigation and linking.

Multi-facet Keyword and Classification Connection mechanism

• Organizes the scientific data resource catalogue:
– A dynamic multi-facet classification and keyword connection indexing method
– A ranking scheme based on the weights of classification and keyword connections
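
The slides do not give the ranking formula, so the following Python sketch shows only one plausible reading: each catalogue entry carries facet classifications and keywords, and a query is scored as a weighted sum of facet matches and keyword matches. The `Resource` type, the default weights, and the sample entries are all assumptions made for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Resource:
    title: str
    facets: dict[str, str]  # facet name -> category within that facet
    keywords: set[str] = field(default_factory=set)


def score(resource, facet_query, keyword_query, w_facet=2.0, w_keyword=1.0):
    """Weighted sum of matching facets and matching keywords (assumed weights)."""
    facet_hits = sum(
        1 for name, category in facet_query.items()
        if resource.facets.get(name) == category
    )
    keyword_hits = len(resource.keywords & keyword_query)
    return w_facet * facet_hits + w_keyword * keyword_hits


def rank(resources, facet_query, keyword_query, **weights):
    """Return resources with a positive score, highest score first."""
    scored = [(score(r, facet_query, keyword_query, **weights), r) for r in resources]
    scored.sort(key=lambda pair: pair[0], reverse=True)  # stable for ties
    return [r for s, r in scored if s > 0]


# Made-up catalogue entries, for illustration only.
catalogue = [
    Resource("Rainfall records", {"discipline": "meteorology"}, {"rainfall", "survey"}),
    Resource("Soil survey", {"discipline": "agriculture"}, {"soil", "survey"}),
    Resource("Census tables", {"discipline": "population"}, {"census"}),
]
results = rank(catalogue, {"discipline": "agriculture"}, {"survey"})
print([r.title for r in results])
```

Weighting facet matches above keyword matches is a design choice in this sketch, not something stated in the source; swapping the weights changes which kind of evidence dominates the ordering.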


3. Conclusion & Outlook

Conclusion

• SDSP is a government effort to promote the sharing of scientific data: big budget, new drive?

• A theoretical foundation is important: scientific data sharing is the transfer of certain rights in scientific data.

• Technical endeavors: DOI registration, linking, and classification will help the management of scientific data resources.

Future focus

• Data publishing (a long way to go)
– Data center view
– Publisher view
– Funding agency view

• How should the results of data sharing infrastructures be evaluated? It is important to build a third-party evaluation mechanism.
– Evaluation of data resource construction
– Evaluation of portal information architecture
– Evaluation of database functions
