the selection, appraisal and retention of social science data in the uk k. schürer uk data archive...

21
The selection, appraisal and retention of social science data in the UK K. Schürer UK Data Archive and Economic and Social Data Service CODATA/ERPANET Archiving Workshop, Lisbon 15-17 December 2003

Upload: felicia-dorsey

Post on 01-Jan-2016

217 views

Category:

Documents


3 download

TRANSCRIPT

The selection, appraisal and retention of social science

data in the UK

K. SchürerUK Data Archive and Economic and

Social Data Service

CODATA/ERPANET Archiving Workshop, Lisbon15-17 December 2003

Question 1

What constitutes social science data?

Types of data

• Similar to US– Quant and Quali– Surveys– Censuses– Administrative data

• Also increasing amounts of ‘non-survey’ type data– Images– Sound– Video– Mixed media

Who produces social science data?• Again, similar to US

– Government agencies– HE/FE sector– Private sector

• Within HE/FE not just ESRC funded– MRC, NERC, AHRB, Wellcome, Leverhulme

• Increasing number of large digitisation projects– JISC, NOF

• Increasing tendency for government agencies to contract out survey work to private sector (NatCen)

• University sector tend not to get Government contracts• Devolution• Local Government

Who owns data?

• Again similar to US• Nearly all Government data technically

owned by the Queen – Crown Copyright• HE/FE sector generated data, host

institution

But host of IPR, data copyright and consent issues.

Question 2: who archives the data?

Number of different centres

• UKDA/ESDS• The National Archive (former PRO)

– NDAD

• NERC data centres, AHDS• EDINA• Other ‘repositories’

– BL, BBC, D-Space

UKDA history & overview

• Archive established in 1968 (as ‘Data Bank’)• Funded by (then) SSRC to provide a service to UK HE

sector • Initial focus on government survey data • New distributed service established 1 Jan. 2003

– Economic and Social Data Service (ESDS)

• Mixed data types and formats– Specialist Qualidata unit and History Data Service

• Still predominately funded to provide service for HE/FE sectors– ESRC, JISC, University of Essex– Project funding (EC, JISC, MRC, AHRB, etc.)

Data In

Figure 3. New Acquisitions by Source

020406080

100120140160180

1998/1999 1999/2000 2000/2001 2001/2002

National Government Academic

Data Archive Other

Data OutFigure 6. Users, orders and datasets

0

2000

4000

6000

8000

10000

12000

14000

16000

1997-98

1998-99

1999-00

2000-01

2001-02

users orders datasets

Top titles….Rank Title

1 Quarterly Labour Force Surveys

2 General Household Survey

3 Family Expenditure Survey

4 British Household Panel Survey

5 British Social Attitudes Survey

6 Eurobarometer Survey Series

7 1970 British Cohort Study (BCS70)

8 Health Survey for England

9 Workplace Employee Relations Survey

10 1981 Census

11 Family Resources Survey

12 British Election Studies

13 Youth Cohort Study of England and Wales

14 National Child Development Study and BCS70

15 National Child Development Study

16 Northern Ireland Family Expenditure Survey

17 Road Accident Data

18 Continuous Household Survey

19 British Crime Survey

20 ONS Omnibus Surveys

Question 3: How are data selected and appraised?

The National Archives• The state's own administrative and policy processes

– formulation of policy and management of public– resources by the core executive– management of the economy– external relations and defence policy– administration of justice and the maintenance of security– formulation and delivery of social policies– cultural policy

• Interaction of the state with its citizens and its impact on the physical environment – the economic, social and demographic condition of the UK, as

documented by the state's dealings with individuals, communities and organisations outside its own formal boundaries

– impact of the state on the physical environment

UKDA/ESDS• Ensure they fall within the ‘Scope of Collections’ • Assess their content, long-term value and the level

of potential interest in their re-use. Factors influencing this evaluation include: – the geographic and/or temporal scope is significant; – the subject coverage of the data is broad and may be

of interest across the social science disciplines; – the data are not available in any other form; – accession into the Archive makes the resource more

accessible; – a dataset adds to or is made more valuable by existing

holdings (eg part of an existing series); – a dataset fills a gap in the existing holdings; – there is research and/or teaching activity in the subject

area covered by the data; – data for which longevity and access would otherwise

be threatened.

• Determine if they may be viably managed, preserved and distributed to potential secondary users. Factors influencing this evaluation include: – the data are of a type with which the Archive has

expertise or may easily obtain expertise or expert advice;

– the data format can be converted to suitable dissemination and preservation formats;

– the level and quality of documentation reaches an appropriate standard to enable a secondary analyst to make informed use of the data.

• Determine if there is another archive more appropriate for preservation and dissemination of these data.

• The criteria for rejection are as follows: – there are problems with the sample size, the data

are very localised or are of peripheral interest for social scientists;

– the documentation is insufficient to enable understanding and further analysis of the data;

– the data would be more effectively handled by another archive or data centre;

– there are insurmountable legal obstacles e.g. copyright clearance from all parties has not been achieved;

– the depositor wishes to impose too stringent access conditions or wishes to place an indefinite embargo on use.

Question 4: Prospects for the future?

• Legal issues– Move toward greater openness and

transparency – Freedom of Information Act, 2000– Yet greater concerns with confidentiality– Data Protection Act, 1994

• Technical issues– Web makes data more accessible– Everyone can have their own ‘archive’– Challenge of e-Science– Digital Curation Centre