The Demographic Data Base (DDB), Umeå University, Sweden. Professor Anders Brändström
DESCRIPTION
The Demographic Data Base (DDB), Umeå University, Sweden. Professor Anders Brändström, Associate Professor Sören Edvinsson. Centre for Population Studies. What is DDB…? A national and an international research infrastructure.
TRANSCRIPT
The Demographic Data Base (DDB)
Umeå University, Sweden
Professor Anders Brändström
Associate Professor Sören Edvinsson
Centre for Population Studies
What is DDB…?
A national and an international research infrastructure
The databases Popum and Tabellverket are constructed from 18th and 19th century parish registers
The longitudinal database Popum is one of the largest historical population databases
Very rich and detailed (information per individual)
The population database ”Popum 3”
Schema diagram: Popum version 3, 2006-08-25
LVPART
Keys: MPOSTNR (FK1, I1), KPOSTNR (FK2, I2)
Columns: PREFIX, MPNR, MNOTYP, MNONR, KPNR, KNOTYP, KNONR, NOFRS
HLNVANM
Keys: POSTNR (PK), ANMNR (PK), ANMDEL (PK)
Columns: PREFIX, PNR (I1), NONR, ANMKOD, ANM, NOTYP, NOFRS
ANM
Keys: POSTNR (PK, FK1), ANMNR (PK), ANMDEL (PK)
Columns: PREFIX, PNR (I1), NONR, ANMKOD, ANM, NOKAL, NOTYP, NOFRS
BETANM (UND)
Keys: POSTNR (PK)
Columns: PNR, NONR, ANM, EANM, NOTYP
REL
Keys: RPNR1 (PK, FK1), RPNR2 (PK, FK2, I1)
Columns: PREFIX, RTYP, RBES, NOFRS
PERSON
Keys: PNR (PK)
Columns: PREFIX, KON, NMNF1, NMNF2, NMNE, NMNPM, FODDAT (I1), FODFRS (I6), FODHFRS (I2), DOPDAT, AB, FB, ANTBB, ANTIB, ANTP, DF, DODDAT (I3), DODFRS, DODHFRS, DODORS, BEGDAT, FRSBOBTYP, FRSBOBDAT (I4), FRSBOSTYP, FRSBOSDAT (I5), ANTHL, ANTFD, ANTFL, ANTLV, ANTDB, NOFRS
FLYTT
Keys: PNR (PK), LOPNR (PK)
Columns: PREFIX, REGKOD, NOFRS, FLTKL, FRNFRS, FRNORT, FRNDAT, TILFRS, TILORT, TILDAT, FLTREGNR, FLTTYP, ANTFLTM, ANTFLTK, FLTGRP, FLTHM, HMPNR, RELHM, FRNPOSTNRHF, TILPOSTNRHF, POSTNRFL
LV
Keys: POSTNR (PK, FK1)
Columns: PREFIX, PNR (I1), NONR, KON, CIV, ADEL, FODDAT (I3), FODFRS, FODFRSTXT, LYSDAT (I4), LYSFRS, LYSFRSTXT, LYSORDNR, LYSREGNR, VIGALD, VIGDAT (I2), VIGFRS, VIGFRSTXT, VIGORDNR, VIGREGNR, HEMFRS, HEMFRSTXT, HLVOL, HLSID, HLSIDTYP, NOTYP, NOFRS
YRKETXT
Keys: POSTNR (PK, FK1), YRKEAGA (PK)
Columns: PREFIX, PNR (I1), NONR, YRKE, NOKAL, NOTYP, NOFRS
HLBETU
Keys: POSTNR (PK)
Columns: PREFIX, PNR (I1), NONR, BETYG, NOTYP, NOFRS
NAMN
Keys: POSTNR (PK), NAMNAGA (PK), NAMNKAT (PK), NAMNNR (PK)
Columns: PREFIX, PNR (I1), NOFRS, NAMN, TEMPUS, SD
FL
Keys: POSTNR (PK, FK1)
Columns: PREFIX, PNR (I1), NONR, KON, CIV, AB, FB, VACCKOD, VACCAR, VACCTXT, FODDAT, FODFRS, FODFRSTXT, FLTTYP, FLTDAT (I2), FLTFRS (I3), FLTFRSTXT, FLTPOS, REGNR, ANTFLTM, ANTFLTK, ANTFLTTOT, HLVOL, HLSID, HLSIDTYP, OVRFLY, NOTYP, NOFRS
BOSTTXT
Keys: POSTNR (PK, FK1)
Columns: PREFIX, PNR (I1), NONR, BST, NOKAL, NOTYP, NOFRS
HFNVANM (UND)
Keys: POSTNR (PK)
Columns: PNR, NONR, ANM, EANM, NOTYP
BETYG (UND)
Keys: POSTNR (PK), GENKOL (PK), FKOL (PK), BETRAD (PK)
Columns: PNR, NONR, KOLB, KOLE, BETFORMTYP, BETNAMN, BETSYS, KODGRUPP, BETYG, BETKOD, AR, NOTYP
HLREL
Keys: POSTNR (PK), RELHLTYP (PK), RELHLTYPNR (PK)
Columns: PREFIX, PNR (I1), NONR (FK1), RELTYPKOD, RELPOSTNR, NOTYP, NOFRS
BOENDE
Keys: PNR (PK), BONR (PK)
Columns: PREFIX, BOFRS, SUBORT, BOBTYP, BOBDAT, BOSTYP, BOSDAT, BOBPOSTNR, BOSPOSTNR, BOBDATKST, BOSDATKST
HLCIV
Keys: POSTNR (PK), CIVHLNR (PK), CIVHLTYP (PK)
Columns: PREFIX, PNR (I1), NONR, CIVHLDAT (I2), CIVHLDATKST, UPLORS, VIGHLNR, NOTYP, NOFRS
DB
Keys: POSTNR (PK, FK1)
Columns: PREFIX, PNR (I1), NONR, KON, AB, FB, CIV, ADEL, VACCKOD, VACCAR, VACCTXT, DF, FODDAT, FODFRS, FODFRSTXT, DODDAT (I2), DODALDAR, DODALDMAN, DODALDDAG, DODALDVECK, DODALDTIM, DODALDMIN, DODFRS, DODFRSTXT, DODORS, DODORSUTL, BEGDAT (I3), BEGFRS, BEGFRSTXT, BEGFOR, HEMFRS, HEMFRSTXT, DODREGNR, BEGREGNR, HLVOL, HLSID, HLSIDTYP, NOTYP, NOFRS
HLRELFEL (UND)
Keys: POSTNR (PK), RELHLTYP (PK), RELHLTYPNR (PK)
Columns: PREFIX, PNR (I1), NONR, RELTYPKOD, RELSID, RELSIDTYP, RELRAD, RELNRRAD, NOTYP, NOFRS
FDPENNING
Keys: POSTNR (PK)
Columns: PREFIX, PNR (I1), NONR, RIKSKY, SKILKY, RUNSKY, KRONKY, ORENKY, MARKKY, PENKY, DALKY, RIKSFA, SKILFA, RUNSFA, KRONFA, ORENFA, MARKFA, PENFA, DALFA, NOTYP, NOFRS
NAMNTXT
Keys: POSTNR (PK, FK1), NAMNAGA (PK)
Columns: PREFIX, PNR (I1), NONR, FNAMN, ENAMN, NOKAL, NOTYP, NOFRS
LANK
Keys: POSTNR (PK)
Columns: PREFIX, NOFRS, PNR (I1), NOKAL, NOTYP, NONR, VOL (I2), SID (I2), SIDTYP, RAD, NRRAD, NOBTYP, NOBDAT, NOSTYP, NOSDAT
HLBET
Keys: POSTNR (PK), BETNR (PK)
Columns: PREFIX, PNR (I1), NONR, BETTYP, BETSYS, BETYG, NOTYP, NOFRS
FADDRAR
Keys: POSTNR (PK)
Columns: PREFIX, PNR (I1), NONR, FADDRAR, NOTYP, NOFRS
HLHFANM
Keys: POSTNR (PK), ANMNR (PK), ANMDEL (PK)
Columns: PREFIX, PNR (I1), NONR, ANMKOD, ANM, NOTYP, NOFRS
BOORT
Keys: PNR (PK), BONR (PK)
Columns: PREFIX, BOFRS, ORT, BOBTYP, BOBDAT, BOSTYP, BOSDAT, BOBPOSTNR, BOSPOSTNR, BOBDATKST, BOSDATKST
HFNV (UND)
Keys: POSTNR (PK), AR (PK), FKOL (PK), FRAD (PK)
Columns: PNR, NONR, HFH, NTA, NV1DAT, ADM, KONF, NTV, NOTYP
FD
Keys: POSTNR (PK, FK1)
Columns: PREFIX, PNR (I1), NONR, KON, AB, FB, ADEL, VACCKOD, VACCAR, VACCTXT, DOD, BM, PARITET, FODDAT (I2), FODFRS, FODFRSTXT, REGNR, DOPDAT (I3), DOPFRS, DOPFRSTXT, DOPFRT, DOPKONF, DOPKONFDAT, DOPKONFFRS, DOPKONFNMN, DOPANM, KTABS, KTABSDAT, KVARTER, EKFRH, FARFODDAT, FARALD, FARCIV, FARHFRS, FARHFRSTXT, FARRT, MORFODDAT, MORALD, MORALD2, MORCIV, MORHFRS, MORHFRSTXT, MORRT, VIGDAT, AKTLGD, ATIDAT, ATTDAT, ATTFRS, ATTYP, ATTAVS, HLVOL, HLSID, HLSIDTYP, NOTYP, NOFRS
VIGSEL
Keys: PNR (PK), VIGNR (PK)
Columns: PREFIX, PPNR, VIGDAT, UPLDAT, UPLTYP, NOKAL, POSTNR, NOFRS
OSKATT
Columns: NOFRS, VOL, SID, SIDTYP, RAD, NRRAD, F_RAD, F_NRRAD, S_RAD, S_NRRAD, NOT_FORM, SID_INNEH, TYP_INNEH, HEMNR, HEMNR_OSTR, BRUKDEL, BRUKDEL_OSTR, SKATTETYP, ROTE, BY_KVART (I1), BY_KVART_K, BY_KVART_LIK, GARD, GARD_K, GARD_LIK, YRKE, YRKE_LIK, MTLH, MTLT, MTLN, MALH, MALT, MALN, ANM
HLBETANM
Keys: POSTNR (PK), ANMNR (PK), ANMDEL (PK)
Columns: PREFIX, PNR (I1), NONR, ANMKOD, ANM, NOTYP, NOFRS
HL
Keys: POSTNR (PK, FK2)
Columns: PREFIX, PNR (I3), NONR, KON, AB, FB, CIV, ADEL, VACCKOD, VACCAR, VACCTXT, FODDAT (I1), FODFRS (I2), FODFRSTXT, INTYP, INDAT, INDATKST, INREGNR, FRNVOL (I7), FRNVOLKST, FRNSID, FRNSIDKST, FRNSIDTYP, FRNFRS (I6), FRNFRSTXT, FRNFRSKST, FRNBST, FRNBSTKST, UTTYP, UTDAT, UTDATKST, UTREGNR, TILVOL (I5), TILVOLKST, TILSID, TILSIDKST, TILSIDTYP, TILFRS (I4), TILFRSTXT, TILFRSKST, TILBST, TILBSTKST, FAMNR, FAMST, HF, HF0, HL1AR, NV, NV0, NV1DAT, ATER, NOTYP, NOFRS
YRKE
Keys: POSTNR (PK), YRKEAGA (PK), TYKODNR (PK)
Columns: PREFIX, TYFD, TYREL, TYSOC, TYNHU, TYNUG, TYKOD, KODIND, NOFRS, PNR, NONR, NOTYP, NOKAL
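To make the key structure of the diagram concrete, here is a minimal sketch that rebuilds two of its tables, PERSON and REL, in an in-memory SQLite database. Only the table names, column names and key markers come from the diagram. The SQL types, the assumption that RPNR1 and RPNR2 reference PERSON.PNR, the reading of FODDAT and DODDAT as birth and death dates, the RTYP value and the sample rows are all hypothetical and chosen for illustration only.

```python
import sqlite3

# Hypothetical SQLite rendering of PERSON and REL from the diagram.
# Column names and key markers follow the diagram; types, foreign-key
# targets and all sample values are assumptions.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE PERSON (
    PNR    INTEGER PRIMARY KEY,   -- PK PNR
    KON    TEXT,
    FODDAT TEXT,                  -- assumed: date of birth
    DODDAT TEXT                   -- assumed: date of death
);
CREATE TABLE REL (
    RPNR1 INTEGER REFERENCES PERSON(PNR),  -- PK,FK1 (target assumed)
    RPNR2 INTEGER REFERENCES PERSON(PNR),  -- PK,FK2,I1 (target assumed)
    RTYP  TEXT,
    PRIMARY KEY (RPNR1, RPNR2)
);
CREATE INDEX REL_I1 ON REL (RPNR2);
""")

# Invented sample rows, not taken from Popum.
conn.executemany(
    "INSERT INTO PERSON (PNR, KON, FODDAT, DODDAT) VALUES (?, ?, ?, ?)",
    [(1, "M", "1830-02-11", "1899-05-01"), (2, "K", "1855-07-30", None)],
)
conn.execute("INSERT INTO REL (RPNR1, RPNR2, RTYP) VALUES (1, 2, 'child')")


def related_persons(connection, pnr):
    """Return (PNR, RTYP) for every person related to the given PNR."""
    return connection.execute(
        "SELECT p.PNR, r.RTYP FROM REL r "
        "JOIN PERSON p ON p.PNR = r.RPNR2 "
        "WHERE r.RPNR1 = ? ORDER BY p.PNR",
        (pnr,),
    ).fetchall()


print(related_persons(conn, 1))
```

The composite primary key on REL mirrors the two PK markers in the diagram, and the extra index on RPNR2 mirrors the I1 marker, so lookups work from either side of a relation.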
Organization
Board DDB
Director
Umeå: R & D, Administration, Systems development
Jörn: Digitalisation, Genealogies
Haparanda: Digitalisation, Systems development
Karesuando: Digitalisation
Centre for Population Studies: Research
Faculty of Social Sciences
Parish registers
Birth and baptism
Death and burial
Catechetical examination
Marriage
Migration
Nils Pehrsson
Figure 1. Catechetical register page from Tuna parish 1865-1874.
All presences at catechetical examinations and Holy Communion
Name, occupation and other individual information on each member of a family / household
In-migration: when, where from, certificate number
Out-migration: when, where to, certificate number
Reading & other marks
Exact date and place of birth
Notes on conduct and other circumstances
Place of residence
Marital status
Smallpox
Death
Varying quality!
Two main structures
Period source (catechetical registers)
Event sources
Birth and baptism books
Banns and marriage books
Migration books
Death and burial books
Events
Events can be indicated both in event and period sources
Births
Marriages
Migrations
Deaths
And so on
Status
Status can be indicated both in period and event sources
Marital status (explicit or implicit)
Occupations
Residence
Family position
And so on
Constructing the database
Automatic linkage
91% Correct
4% Errors
5% Unlinked
Information from different sources
Linkage
Linking individuals
Linking families
Standardized names
Well-tested algorithms
DDB 2006-02-01
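The linkage steps listed above (standardized names, then linking individuals across sources) can be illustrated with a small sketch. This is not DDB's algorithm, which the slides do not describe. It is a simple deterministic match on a standardized name plus date of birth, and the spelling rules, field names, record identifiers and birth dates are all invented for illustration.

```python
import unicodedata
from collections import defaultdict


def standardize_name(name):
    """Lower-case, strip diacritics and collapse a few spelling variants.

    The replacement rules are illustrative assumptions, not DDB's rules.
    """
    decomposed = unicodedata.normalize("NFKD", name.lower())
    plain = "".join(c for c in decomposed if not unicodedata.combining(c))
    for old, new in (("hr", "r"), ("ss", "s"), ("ph", "f"), ("w", "v")):
        plain = plain.replace(old, new)
    return " ".join(plain.split())


def link_records(source_a, source_b):
    """Link records that share a standardized name and birth date.

    A record is linked only when exactly one candidate exists in the
    other source; everything else is reported as unlinked.
    """
    index = defaultdict(list)
    for rec in source_b:
        key = (standardize_name(rec["name"]), rec["birth"])
        index[key].append(rec["id"])

    links, unlinked = [], []
    for rec in source_a:
        key = (standardize_name(rec["name"]), rec["birth"])
        candidates = index.get(key, [])
        if len(candidates) == 1:
            links.append((rec["id"], candidates[0]))
        else:
            unlinked.append(rec["id"])
    return links, unlinked


# Hypothetical records: ids and dates are invented.
birth_book = [
    {"id": "FD-1", "name": "Nils Pehrsson", "birth": "1830-02-11"},
    {"id": "FD-2", "name": "Anna Olsdotter", "birth": "1841-09-03"},
]
catechetical_register = [
    {"id": "HL-7", "name": "Nils Persson", "birth": "1830-02-11"},
]

links, unlinked = link_records(birth_book, catechetical_register)
print(links, unlinked)
```

Refusing to link when several candidates match trades a higher unlinked share for fewer wrong links, which is the same kind of trade-off the correct, error and unlinked percentages describe.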
A database which re-constructs the source
Basic demands
Possible to reconstruct the original source
Possible to find all basic information about a person over time
Possible to find all information valid for a person at a certain point in time
The quality of the original sources defines the upper limit of the data quality
It should be easy to extend the database structure
Continuously maintained and documented
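The demand to find all information valid for a person at a certain point in time amounts to an interval lookup. The sketch below shows one way to do it, assuming residence spells shaped like rows of the BOORT table. Reading BOBDAT as the start date and BOSDAT as the end date of a spell is an assumption, and the place names and dates are invented.

```python
from datetime import date

# Hypothetical residence spells; column names follow BOORT, values are invented.
# Assumption: BOBDAT is the start of a spell, BOSDAT its end (None = still open).
residence_spells = [
    {"PNR": 1, "ORT": "Village A", "BOBDAT": date(1865, 1, 1), "BOSDAT": date(1869, 10, 14)},
    {"PNR": 1, "ORT": "Village B", "BOBDAT": date(1869, 10, 15), "BOSDAT": None},
    {"PNR": 2, "ORT": "Village A", "BOBDAT": date(1866, 3, 2), "BOSDAT": None},
]


def valid_at(spells, pnr, when):
    """Return the spells of one person that cover the given date."""
    return [
        s for s in spells
        if s["PNR"] == pnr
        and s["BOBDAT"] <= when
        and (s["BOSDAT"] is None or when <= s["BOSDAT"])
    ]


current = valid_at(residence_spells, 1, date(1870, 1, 1))
print([s["ORT"] for s in current])
```

The same start and end comparison applies to any other status stored as a dated spell, such as civil status or occupation.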
Overview
Input: source preparation & design of entry formats for true reproduction of various sources; entry of individual source records from all source types; monitoring of quality; administration of data flow
Refinement: encoding and standardization (names!); structuring databases; computerized + computer-supported record linkage
Output: creating datasets according to researcher specifications; presenting data on the web in various forms, including relations & genealogy; administration of research service and web subscriptions
POPUM 3 Structure
Copy
Digitisation
POPUM Source
Update
Compilation
POPUM User
Retrieval
Key Variables
X = common occurrence; Z = sporadic occurrence; Y = sporadic and unreliable
Sources (table columns): Catechetical register; Migration register; Birth and baptism; Banns and marriages; Deaths and burials
Variables (marks listed in source order; empty cells are not shown):
Date of birth: X Z X Z Z
Place of birth: X Z X Z Z
Baptism: X X
Legitimacy: X X
Assistance at birth: X
Churching: Z X
Publication of banns: X
Marriage: X X
Guardian: X
Burial: X
Cause of death: X
Migration: X X
Village: X X X X X
Household membership: Y
Family status: X X
Civil status: X X X X
Occupation/trade: X X X X
Vaccination: X Z Z
Remarks: X X X X X
Penance: X
Confirmation: X Z
Communion: X Z
Catechetical meetings: X Z
Reading, Catechism, Understanding: X Z
Data problems
Missing
Redundant
Inconsistent
Level of precision
Non-standardized sources
Mainly event oriented
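Three of the problems listed above (missing, redundant and inconsistent data) can be made concrete with a small check over birth-date notations gathered from several sources. The sketch below is only an illustration under stated assumptions: the tuple layout, the source labels, the category names and all dates are hypothetical and do not reflect DDB's own quality routines.

```python
from collections import defaultdict


def audit_birth_dates(notations):
    """Classify the birth-date evidence for each person.

    notations: iterable of (person_id, source, birth_date or None).
    Returns a dict person_id -> 'missing', 'inconsistent', 'redundant' or 'single'.
    """
    by_person = defaultdict(list)
    for person_id, _source, birth_date in notations:
        by_person[person_id].append(birth_date)

    report = {}
    for person_id, dates in by_person.items():
        stated = [d for d in dates if d is not None]
        if not stated:
            report[person_id] = "missing"        # no source gives a date
        elif len(set(stated)) > 1:
            report[person_id] = "inconsistent"   # sources disagree
        elif len(stated) > 1:
            report[person_id] = "redundant"      # same date stated repeatedly
        else:
            report[person_id] = "single"         # exactly one notation
    return report


# Invented notations for illustration.
notations = [
    (1, "birth and baptism book", "1830-02-11"),
    (1, "catechetical register", "1830-02-11"),
    (2, "birth and baptism book", "1841-09-03"),
    (2, "death and burial book", "1841-09-08"),
    (3, "migration register", None),
    (4, "birth and baptism book", "1850-01-01"),
]

print(audit_birth_dates(notations))
```

Redundant notations are harmless when they agree, and they are also what makes inconsistencies detectable in the first place.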
Popum 1700-1900
75 parishes
5 million individual notations
1 million individuals
>300 variables
-11 generations
Tabellverket 1749-1859
2 500 parishes
¼ million forms
¼ billion details on events and conditions
Access to the database
Specially constructed files
Open to the public via Internet
Indiko
Tabellverket online
Research service 1994 - 2007
160 researchers - 35 from abroad
40 guest researchers from more than 15 countries
25 larger file deliveries per year
35 doctoral dissertations
800 scientific publications
DDB in research
CPS Research Programmes
Historical Demography Programme (HD), 2005-2010, 22 mSek / 2,343 mEuro
Ageing and Living Conditions Programme (ALC), 2006-2017, 94 mSek / 10,088 mEuro
Arts and Humanities
Historical Studies
Culture and Media
CeSam
Samfak
BETULA-project
Social and Economic Geography
Statistics
Economics
Medfak
Epidemiology (EPIPH)
Clinical Microbiology - Virology and Infectious Diseases
Influenza Pandemics Programme (IP), 2007-2009, 6,0 mSek / 638, tEuro
Centre for Population Studies
Transfer of Demographic Patterns between Generations - Fertility and Infant Mortality, RJF 2006-2009, 2,5 mSek / 266,2 tEuro
DDB/CBS
Statistics
Consequences of Colonisation - The Epidemiological Transition in Sápmi 1750-1900, FAS 2006-2007, 2,4 mSek / 255,5 tEuro
DDB/CBS
CeSam
A genealogical database 1720-2008
Historical Database POPUM3
Multi-Gen (SCB)
Extended POPUM
Timeline axis: 1700, 1800, 1900, 2000
WWW.DDB.UMU.SE