
Moving to a Centralized Database for Surveys in Blaise at NASS Roger Schou National Agricultural Statistics Service IBUC XII Riga, Latvia

Upload: brittany-dunn

Post on 31-Mar-2015


Page 1:

Moving to a Centralized Database for Surveys in Blaise at NASS

Roger Schou
National Agricultural Statistics Service

IBUC XII
Riga, Latvia

Page 2:

NASS – 46 Field Offices

Page 3:

NASS – 6 Data Collection Centers

Virginia

Kentucky

Montana

Wyoming

Oklahoma

Arkansas

Page 4:

Distributed Environment

• HQ-developed instrument typically distributed to 43 field offices

• 43 Blaise datasets

• Physical transfer of forms from Data Collection Centers to Client States

• Survey management done in individual field offices

Page 5:

Centralization Idea

• Blaise 4.8.1 – Storing data in generic tables in a central database

• Enable the Blaise data collection processes to be centralized

• Splash of an idea to centralize one or two applications (CASIC and Survey Management System)

Page 6:

Centralization Idea

• Current Status of Development:
– Too many stovepipe applications
– Development vacuums
– Lack of sharing code
  • Duplication of effort
– Duplication of functionality
– Management has the “I want it now” mentality

Page 7:

Centralization Idea

• Tidal wave - Centralization on an enterprise level
– Blaise
– Survey Management System
– ELMO
– Sample Design
– EDR/EDC
– PRISM
– Analysis/Summary

Page 8:

Database Decision

• Choosing a database
– Cost
– Support
– Commonality

• NASS databases
– Sybase
– Redbrick
– FoxPro
– Oracle
– MySQL

Blaise ?

Page 9:

Generic BOI Files

• Blaise creates 8 tables
– Blaise_Case
– Blaise_Data
– Blaise_Dictionary
– Blaise_Form
– Blaise_ID
– Blaise_Key
– Blaise_Open
– Blaise_Remark

Page 10:

Generic BOI Files

• All surveys will be stored in a fixed, pre-defined structure

• Translation tool to extract the data to our PRISM database (Sybase) will work for all Blaise data

Blaise data → Translation Tool → PRISM
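To picture why a fixed, pre-defined structure lets a single translation tool serve every survey, here is a minimal Python sketch. The row layout (survey, case id, field name, value), the sample rows, and the `translate` helper are all assumptions made for illustration; they are not the actual column layout of the Blaise generic tables or the NASS tool.

```python
from collections import defaultdict

# ASSUMED generic layout: one row per field value, identical for every survey.
# These rows are made-up illustration data, not real survey content.
GENERIC_ROWS = [
    ("crops", "001", "acres_planted", "120"),
    ("crops", "001", "acres_harvested", "115"),
    ("hogs", "017", "total_head", "450"),
]


def translate(rows):
    """Pivot generic name/value rows into one record per (survey, case)."""
    records = defaultdict(dict)
    for survey, case_id, field_name, value in rows:
        records[(survey, case_id)][field_name] = value
    return dict(records)


if __name__ == "__main__":
    for key, record in translate(GENERIC_ROWS).items():
        print(key, record)
```

Because the pivot never refers to a survey-specific field name, the same code handles both hypothetical surveys without change.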

Page 11:

Versioning

• We plan on using Blaise versioning

• Management requires that the original reported data be preserved
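The requirement to preserve original reported data can be pictured as an append-only history. The sketch below is conceptual only: it does not show how Blaise versioning is implemented, and the class and method names are hypothetical.

```python
class VersionedField:
    """Append-only value history; version 0 is the original reported value."""

    def __init__(self, reported_value):
        self._versions = [reported_value]

    def update(self, new_value):
        # Edits are appended, so the reported value is never overwritten.
        self._versions.append(new_value)

    @property
    def original(self):
        return self._versions[0]

    @property
    def current(self):
        return self._versions[-1]

    def history(self):
        return tuple(self._versions)


if __name__ == "__main__":
    acres = VersionedField(100)
    acres.update(110)  # e.g. a later edit by a statistician
    print(acres.original, acres.current, acres.history())
```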

Page 12:

Folder Structures

• At the outset we have to plan on allowing all field offices to run CATI

• Currently planning to continue to use the Blaise call scheduler

• The .BTR, .BOI, .BMI, .BDM, and .BXI files will be stored in unique folders for each field office (instrument and externals)
– It would be nice to store the instrument files (and externals) in one common folder*

\hqapps\casic\surveyname\
\data\casic\surveyname\fo\06\
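The folder layout on this slide can be sketched in Python as follows. This assumes the path line holds two separate paths (a common survey folder under \hqapps and a per-field-office folder under \data) and that "06" is a two-digit field office code; both readings are assumptions, and the function names are hypothetical.

```python
from pathlib import PureWindowsPath

# ASSUMED roots, read off the slide's example paths.
APPS_ROOT = PureWindowsPath(r"\hqapps\casic")
DATA_ROOT = PureWindowsPath(r"\data\casic")


def common_folder(survey: str) -> PureWindowsPath:
    """Common per-survey folder, e.g. \\hqapps\\casic\\surveyname."""
    return APPS_ROOT / survey


def field_office_folder(survey: str, fo_code: int) -> PureWindowsPath:
    """Unique folder for one field office, e.g. ...\\fo\\06."""
    return DATA_ROOT / survey / "fo" / f"{fo_code:02d}"


if __name__ == "__main__":
    print(common_folder("surveyname"))
    print(field_office_folder("surveyname", 6))
```

`PureWindowsPath` only manipulates path strings, so the sketch runs on any platform without touching a file system.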

Page 13:

User Access Rights

• Who can view and/or update the data?

• Distributed – Physical Boundaries

• Centralized – User Access Controls

• Roles
– HQ Statistician
– Field Office Statistician
– Supervisor Interviewer
– Interviewer
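When physical boundaries no longer separate the data, a role check has to do that job. The sketch below uses the four roles named above, but the policy itself (HQ Statisticians may reach every office, all other roles only their own) is a placeholder assumption, not the NASS policy; `User`, `SEES_ALL_OFFICES`, and `can_access` are hypothetical names.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class Role(Enum):
    HQ_STATISTICIAN = "HQ Statistician"
    FO_STATISTICIAN = "Field Office Statistician"
    SUPERVISOR_INTERVIEWER = "Supervisor Interviewer"
    INTERVIEWER = "Interviewer"


# ASSUMED policy for illustration: which roles may reach every office's data.
SEES_ALL_OFFICES = {Role.HQ_STATISTICIAN}


@dataclass(frozen=True)
class User:
    name: str
    role: Role
    field_office: Optional[str] = None  # None for headquarters staff


def can_access(user: User, case_office: str) -> bool:
    """Return True if the user may view or update a case owned by case_office."""
    if user.role in SEES_ALL_OFFICES:
        return True
    return user.field_office == case_office


if __name__ == "__main__":
    hq = User("hq_user", Role.HQ_STATISTICIAN)
    fo = User("fo_user", Role.FO_STATISTICIAN, field_office="06")
    print(can_access(hq, "06"), can_access(fo, "06"), can_access(fo, "12"))
```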

Page 14:

User Access Rights

• A policy has been written and approved by management

• A team has been assembled to discuss the details and prepare for implementation

Page 15:

Additional CASIC Tables

• Information Tables
– Survey Information Table
– FIPS Allocation Table
– User Information Table

• Sample Master Table

• Error Limits Table

• Previously Reported Data Table

Page 16:

Initializing the Database

• Three Steps:
– Survey Setup (HQ)
  • Folder creation
– Initialize Preparation (FO)
  • Name & address files validated
– Initialize (CRON / HQ)
  • Populate the database

• Easily monitored by Survey Administrators
• Problems easier to isolate
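The three steps above can be sketched as separate functions, which is also what makes problems easier to isolate: each step can be run and checked on its own. Everything concrete here is an assumption for illustration: the folder layout, the validation rule, the `sample_master` columns, and the use of SQLite as a stand-in database.

```python
import sqlite3
import tempfile
from pathlib import Path


def survey_setup(root, survey, offices):
    """Step 1 (HQ): create one folder per field office."""
    folders = {}
    for fo in offices:
        folder = Path(root) / survey / "fo" / fo
        folder.mkdir(parents=True, exist_ok=True)
        folders[fo] = folder
    return folders


def initialize_preparation(records):
    """Step 2 (FO): split name and address records into valid and rejected."""
    valid, rejected = [], []
    for rec in records:
        complete = all(rec.get(k, "").strip() for k in ("id", "name", "address"))
        (valid if complete else rejected).append(rec)
    return valid, rejected


def initialize(conn, survey, fo, records):
    """Step 3 (cron / HQ): populate the database with validated records."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS sample_master "
        "(survey TEXT, fo TEXT, case_id TEXT, name TEXT, address TEXT)"
    )
    conn.executemany(
        "INSERT INTO sample_master VALUES (?, ?, ?, ?, ?)",
        [(survey, fo, r["id"], r["name"], r["address"]) for r in records],
    )
    conn.commit()
    return len(records)


def run_demo():
    """Run all three steps on made-up data; return (folders, loaded, rejected)."""
    records = [
        {"id": "001", "name": "Example Farm", "address": "1 Example Road"},
        {"id": "002", "name": "", "address": "2 Example Road"},  # fails validation
    ]
    with tempfile.TemporaryDirectory() as tmp:
        folders = survey_setup(tmp, "surveyname", ["06", "12"])
        valid, rejected = initialize_preparation(records)
        conn = sqlite3.connect(":memory:")
        loaded = initialize(conn, "surveyname", "06", valid)
        conn.close()
        return len(folders), loaded, len(rejected)


if __name__ == "__main__":
    print(run_demo())
```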

Page 17:

Testing Plans

• Scheduler Testing
– Make sure that we did not introduce issues with the Blaise scheduler across field offices

• Load Testing
– Must support a maximum of 600 users simultaneously

• Process Testing
– What are the locking issues, if any, between processes
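A load test for the 600-user target could be driven by a small harness like the one below. It is a sketch under stated assumptions: `simulated_session` is a stand-in that only sleeps, where a real test would exercise the central database, and the harness and its result keys are hypothetical, not a NASS or Blaise tool.

```python
import time
from concurrent.futures import ThreadPoolExecutor


def simulated_session(user_id):
    """Stand-in for one user's session; returns elapsed seconds."""
    start = time.perf_counter()
    time.sleep(0.01)  # placeholder for real database reads and writes
    return time.perf_counter() - start


def run_load_test(session, users=600):
    """Run `users` sessions concurrently and summarize failures and latency."""
    failures = 0
    latencies = []
    with ThreadPoolExecutor(max_workers=users) as pool:
        futures = [pool.submit(session, uid) for uid in range(users)]
        for future in futures:
            try:
                latencies.append(future.result())
            except Exception:
                # Any exception raised by a session counts as a failed user.
                failures += 1
    return {
        "users": users,
        "failures": failures,
        "max_latency": max(latencies, default=0.0),
    }


if __name__ == "__main__":
    print(run_load_test(simulated_session, users=600))
```

Passing the session as a callable keeps the harness independent of what is being tested, so the same driver could later wrap scheduler or locking scenarios.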

Page 18:

Benefits

• Real-time reporting of survey progress

• Eliminating physical movement of forms

• Bringing survey processes together
– Seamless communication between systems
– Eliminate “stove-pipe” applications

• Sharing of code

Page 19:

Challenges

• Implementing user access rights
• Dealing with Blaise datamodel changes
• Coordinating
– Interviewers across the country
  • Routing cases across call centers
– Multiple modes of data collection
– Other processes being centralized

• Bridging the gap between applications during transition

Page 20:

Conclusion

• Exciting time of development

• Positive attitude towards centralization
– Anticipating the benefits to be gained

• Management is realizing the large scope
– Stepping back and realizing the development process (less of “I want it now”)

• Communication is open and working well

• Many processes evolving together

Page 21:

Questions