time use survey coding and processing time use data

30
Time Use Survey Coding and Processing Time Use Data

Upload: laurel-mcdowell

Post on 01-Jan-2016

222 views

Category:

Documents


2 download

TRANSCRIPT

Page 1: Time Use Survey Coding and Processing Time Use Data

Time Use Survey

Coding and Processing Time Use Data

Page 2: Time Use Survey Coding and Processing Time Use Data

What Needed To Be Coded And Processed

Everything except the diary data came back electronically on interviewer laptops

So, we only had to- capture the paper diaries, and

- code the responses for activity, who for, who with, travel, non-profit-organisation, ethnicity, industry and occupation

Page 3: Time Use Survey Coding and Processing Time Use Data

Time Use Survey

The Diary Processing Team

Page 4: Time Use Survey Coding and Processing Time Use Data

First

- here is

a picture

of our

diary

processing

team

Page 5: Time Use Survey Coding and Processing Time Use Data

The Diary Processing Team

Was in existence for 12 months,

and

There were 3 roles in this team- Team Leader- Quality Assurer (2 people)- Data Processor (about 13 people)

Page 6: Time Use Survey Coding and Processing Time Use Data

The Data Processor Role

When capturing a diary for the first time, the data processor

- marked-up the paper diary into “episodes”, then

- entered each diary into the system as a list of episodes

Page 7: Time Use Survey Coding and Processing Time Use Data

The Quality Assurer Role involved

answering queries from data processors, and maintaining a written record of answers to queries

recording potential new entries for the activity codefile

reviewing each double-captured diary and either- selecting the correct capture to be saved, or- creating a hybrid diary by either combining correct episodes from either the first or second diary or creating new episodes

maintaining a written record of each data processor’s common errors, and feeding those back to the data processor weekly, and

meeting regularly with Team Leader and subject matter experts to- agree on new rules for diary processing- agree on new entries for the codefile, and - agree on changes to double-capture rates for each data processor

Page 8: Time Use Survey Coding and Processing Time Use Data

The Team Leader Role included

Daily Tasks- answering queries from the team- allocating households for diary capture- selecting households for the double capture of diaries- flagging bad diaries as non-response

Weekly Processes- loading an updated codefile to the processing system- viewing the monitoring reports to see how team is progressing- adding new rules for diary processing and ensuring all the team knew all the rules- adding new synonyms to the codefile and ensuring all the team knew l about all the new synonyms

Ongoing Processes- adding new users to the system, or changing user details- selecting the double-capture rate for each data processor

Page 9: Time Use Survey Coding and Processing Time Use Data

The Data Processor Role

Capturing The DiaryAnd

Coding The Activity

Page 10: Time Use Survey Coding and Processing Time Use Data

The Data Processors- Marked-up The Paper Diary Into “Episodes”,

and then- Entered Those Episodes Into The System

We defined an episode as any period in time where

- all activities- the who for,- the location and- the who with

were the same

Page 11: Time Use Survey Coding and Processing Time Use Data
Page 12: Time Use Survey Coding and Processing Time Use Data
Page 13: Time Use Survey Coding and Processing Time Use Data

The Data Processor Role

More Detail On Capturing Episodes

Page 14: Time Use Survey Coding and Processing Time Use Data

Activity coding:rules for entering the text string…

The text of the activity was entered in simple present tense, and in the singular. For example:- • Drove to shop = Drive to shop• Dress Children = Dress Child

AND

• A person’s name (say John) – was changed to the relationship with the respondent

• Spelling mistakes were corrected.

Page 15: Time Use Survey Coding and Processing Time Use Data

Activity coding:

it is about finding the right “synonym”

The data processor enters a text string for the activity, and then the system displays a list of possible classification synonyms that could match that text string.

We defined a ‘synonym’ as a probable survey response and lists of synonyms are stored with the classification category to which they belong. Synonym lists are used for creating codefiles for processing survey responses.

Page 16: Time Use Survey Coding and Processing Time Use Data

Activity coding: primary and secondary activities

Activities that were entered in the first column of the diary had to be entered as the first (or primary) activity

If there were multiple activities in the second column it was not important what order they were entered unless one was an “available for care” activity

Some episodes in the paper diary had simultaneous activities that could not be done at the same time:-for example, having a shower and getting dressed

- if this happened then the activity was split into separate activities of equal time

Page 17: Time Use Survey Coding and Processing Time Use Data

Activity coding: “available for care”

It is a result of the diary question‘were you responsible for anyone who could not be left alone’

It is marked by interviewers in the left-hand column of the diary

It was coded as the activity of either‘available for childcare’ or ‘available for care of an adult’

We also coded with this the appropriate ‘who for’ category from what column was marked in the diary

It had to be coded as a secondary activity.

Page 18: Time Use Survey Coding and Processing Time Use Data

Activity coding: “childcare”

Almost all activities related to children come into the childcare category• The activity code should start with ‘32’

It includedtalking, playing and transporting children

Children are defined as aged 0-13 for our Time Use survey

Page 19: Time Use Survey Coding and Processing Time Use Data

Activity coding: “travel”

Each major group in the classification has an associated travel category

The category depends on where the respondent is travelling to

For example,

drive to work is

‘travel associated with labour force activity’ 21811

Page 20: Time Use Survey Coding and Processing Time Use Data

Activity Coding: Internet Usage Activities

Any activity on the internet must have ‘internet’ in the synonym

Examples on internet usage synonyms:

• Internet banking 33215• Check emails on internet 42311• Watch YouTube over internet 44111

Page 21: Time Use Survey Coding and Processing Time Use Data

Coding “who for”

Is only used when the activity code starts with a 3 – it is for committed time

The default code is ‘own household (including self) nfd’

The default code will often include activities such as household shopping, cleaning, cooking and laundry

The rule was to use the default code unless the interviewer had written a code next to the activity, or the activity was childcare

Page 22: Time Use Survey Coding and Processing Time Use Data

If The “Who For” Is ANot-For-Profit Organisation

Thenot-for-profit organisationwas entered as it is written in the diary

Page 23: Time Use Survey Coding and Processing Time Use Data

Coding “Who With”

The Who with Categories Were:-

Alone

Family I live with

Family I don’t live with

Other people I know

People I don’t know

Page 24: Time Use Survey Coding and Processing Time Use Data

The Data Processor Role

The Double Entry Of Diaries

Page 25: Time Use Survey Coding and Processing Time Use Data

The Double Entry ProcessWhen entering a diary for the second time, the data processor

- checked the existing mark-up of the paper diary into “episodes”, then

- entered the diary into the system in the same way as for the first entry

All the diaries for any new data processor in the team were double-captured,

then, as the data processor’s accuracy rate improved, the double-capture rate was dropped, first to 25%, then to 10%.

Page 26: Time Use Survey Coding and Processing Time Use Data

Coding and Processing Time Use Data

Getting Quality Data

Page 27: Time Use Survey Coding and Processing Time Use Data

It Is Essential To Start WithA Good Quality Codefile For

Coding The Activity ClassificationThe diary coding team needs to start with the best codefile that can be put together.

If the diary processing team starts with a poor quality codefile they may just force responses into incorrect codes, and the result will be poor quality data

For the Statistics NZ 2009-10 survey, the first version of the activity codefile was put together by combining:-1. the codefile from the Statistics NZ 1998-99 Time Use Survey, and2. the updates to that codefile recorded during the March 2009 Field Test, and3. the codefile from the latest Australian Time Use Survey

Many entries in the 3 codefiles listed above needed to be rewritten to fit in with the our new classification of activity, and this took a lot of time. But we did start diary coding with a very good list of all the activities that New Zealanders might record in their diaries

Page 28: Time Use Survey Coding and Processing Time Use Data

Make Sure There Is A Good ProcessFor Agreeing On Updates,

And Frequent Updates Of The Activity Codefile

Any codefile of activity quickly becomes out-of-date when a large team are using it.

And once the codefile becomes out-of-date the data processors may just force responses into incorrect codes in order to get through their work, and the result will be poor quality data.

Page 29: Time Use Survey Coding and Processing Time Use Data

Start With A Good Process For Agreeing On The Rules ForDealing With All The Situations That Will Be Recorded In The Diaries,

And Update Those Rules Often

The data processors always had many queries about how to capture various combinations of activity, and “who for”, and “who with”, that they were finding in diaries

Capturing diary data is not very intuitive – there needs to be a lot of rules to cover the situations that will be found, and

It is impossible to record all the rules before the diary capture starts – there is no way to know in advance many of the tricky situations that will be found

So, it is worth putting the effort into building a very good process for maintaining the rules in order to get quality data

Page 30: Time Use Survey Coding and Processing Time Use Data

Thank you

非常感谢