Use of the administrative data in Population CensusesLibourne, 4 - 7 June 2007
Kaija Ruotsalainen, Statistics Finland
7.6.2007 B 2
Content
1 Administrative data and registers as sources of statistics Advantages and disadvantages of register-based statistics
production
2 Use of register data in statistics compilation Ways of utilising registers Defining the families Definition the main type of activity
3 Data quality4 New statistics made possible by the use of registers
7.6.2007 B 3
Advantages of register-based statistics production
Total data can be produced annually Lower costs (1980 Finnish Census cost EUR 34 million, 2000 Census
EUR 840 000. Because of the identifying codes it is possible to compile the flow-
statistics Possibility to produce more data (for instance multiple activities) Possibility to link with other material No subjective opinions => All the cases are handled in the same way No additional burden to respondents Fewer personal problems Confidentiality advantages
7.6.2007 B 4
Disadvantages of register-based statistics production
Dependence on data suppliers An amendment in legislation or other administrative change can cause
changes to the data content of the register Changes in data suppliers’ information system can bring problems, at
least delays from the normal production rate The coverage of registers may be defective for some data, although the
register itself would contain all the units Consistency problems may arise when linking information from different
registers All the information that previously used to ask on census forms not
exist in the register
7.6.2007 B 5
Use of register data in statistics compilation
Register data are utilised primarily in two different ways:
1) Obtain the information directly from the register e.g. person’s demographic data (age, gender, marital status, citizenship) or income data
2) Another way is to form new variables with the so-called register estimation method
7.6.2007 B 6
The register estimation
Bases on different information in several registers The aim is to estimate for each statistical unit the value of
the target variable as close to the statistical concept and definition as possible
This is done by using all the existing data available and a set of decision rules to estimate the value of the statistical variable
7.6.2007 B 7
The register estimation
The sources: partial coverage overlapping different degrees of quality supplement each others
The definition takes place in steps The sequence of the steps is based on three principles:
international definitions quality of administrative sources other reasons
7.6.2007 B 8
Examples of using register data in statistics compilation
defining the families defining the person’s main type of activity
7.6.2007 B 9
Definition of family (1)
A family consists of a married, cohabiting or registered couple and their children living together; or either of the parents and his or her children living together; or a married, cohabiting or registered couple without children.
Persons living in the household-dwelling unit who are not members of the nuclear family are not included in the family population, even if they are related, unless they form their own family. Brothers and sisters or cousins living together are not a family and do not belong to the family population. The same applies to people who live alone.
7.6.2007 B 10
Definition of family (2)
Families living in residential homes are included in the family population. In contrast, persons who live in institutions are not included.
A family can consist of no more than two successive generations. If the household-dwelling unit comprises more than two generations, the family is formed starting from the youngest generation. This means, for example, that a mother-in-law or father-in-law living with their child's family will not be included in the family population unless they live together with their spouse, in which case the old couple form their own
family.
7.6.2007 B 11
Definition of child
In the family statistics children comprise the following persons living with their parents:
biological children adopted children and biological children and adopted children of one of the spouses
Foster children and children in the care of the family are not classified
as children.
7.6.2007 B 12
Computerised forming of families (1)
• Population is sorted by domicile code (DC). DC (23 digits) identifies a dwelling
004 001 0001 0001 1 001 A001 A Person 1004 001 0001 0001 1 001 A001 A Person 2004 001 0001 0001 1 001 A001 A Person 3004 001 0001 0001 1 001 A001 A Person 4004 001 0001 0001 1 001 A001 A Person 5004 001 0001 0001 1 001 A001 A Person 6
004 002 0001 0001 1 002 A001 A Person 1004 002 0001 0001 1 002 A001 A Person 2004 002 0001 0001 1 002 A001 A Person 3
004 002 0001 0001 1 002 A002 A Person 1
etc.
992 111 0111 0001 1 019 A020 A Person 1
Household 1
Household 2
Household 3
Household N
Mun
icip
ality
Build
ing
Entra
nce
& flat
Flat
div
ider
Rea
l est
ate
Che
ck c
ode
Village
Block
7.6.2007 B 13
Computerised forming of families (2)
The computer program inserts into a table the inhabitants of one household-dwelling unit.
The record of each person includes the personal identification codes (PIN) of his or her parents, married spouse and all born and adopted children.
The computer program forms families by comparing the PINs of the persons living permanently in the same household-dwelling unit.
7.6.2007 B 14
Computerised forming of families (3)
Household 1:004 001 0001 0001 1 001 A001 AGrandfather004 001 0001 0001 1 001 A001 AGrandmother004 001 0001 0001 1 001 A001 ASon 1004 001 0001 0001 1 001 A001 ASon 2004 001 0001 0001 1 001 A001 AHis wife004 001 0001 0001 1 001 A001 ATheir child
Household 2:004 002 0001 0001 1 002 A001 AHusband004 002 0001 0001 1 002 A001 AWife004 002 0001 0001 1 002 A001 AHer mother
Family 2
Family 1
Family 1
Not a family member
7.6.2007 B 15
Computerised forming of families (4)
After forming a possible married family the program identifies as a cohabiting couple a man and woman living together in the same dwelling if:
- both are aged 18 or over- their age difference is less than 16 years- they are not siblings
The program also identifies reconstituted families by comparing the PINs of the children and parents of each person in the same dwelling.
7.6.2007 B 16
Definition of the main type of activity
Labour forceemployedunemployed
Persons outside the labour force0-14-year oldsstudentsconscriptspensionersother persons outside the labour force
7.6.2007 B 17
Defining the main type of activity (1)
A cross-sectional data for the last week of the year The validity of the data in the last week of the year is not
totally certain for all data Nearly 30 different registers are used The classified population diminishes after each category
until all the persons have been classified to some group
7.6.2007 B 18
Defining the main type of activity (2)
The decision rules were formed by utilising the parallel statistics of the 1985 Census for testing the decision in different ways and in different order
The aim was to come up with such decision rules that 1) the numbers of persons in different groups would be as close as possible to the data in the questionnaire-based census 2) the proportion of those belonging to the same group would be as high as possible
7.6.2007 B 19
Defining the main type of activity (cont.)
STEP 1 All persons under fifteen year of
age
0-14-year-olds
=>
• 17.5 % of the population (in 2004)
7.6.2007 B 20
Defining the main type of activity (cont.)
STEP 2 All persons over 74 years
7.3 % of the population (in 2004) Pensioners
=>
7.6.2007 B 21
Defining the main type of activity (cont.)
STEP 3 All those who according to the
Ministry of Labour's register on job applicants were unemployed the last weekday of the year
5.7 % of the population (in 2004)Unemployed
=>
7.6.2007 B 22
Defining the main type of activity (cont.)
STEP 4 All those who were on their
military service or non-military service during the last week of the year
0.3 % of the population (in 2004) Conscripts
=>
7.6.2007 B 23
Defining the main type of activity (cont.)
STEPS 5 - 7 Persons who have only a valid self-
employment pension’s insurance during the last week of the year (4.1 % )
Persons who have both a valid self-employment pension’s insurance and employment pension insurance (employee relationship) and whose entrepreneurial income is larger than earned income (0.2 %)
Persons who according to registers do not have a valid employment relationship but whose entrepreneurial income is higher than EUR 810 and larger than earned income and the person is not retired (0.2 %)
Entrepreneurs
=>
7.6.2007 B 24
Defining the main type of activity (cont.)
STEPS 8 - 9 Persons who have a valid
employment pension insurance and who have earned income (37.8 %)
Persons who do not have employment relationship data but who are employed with labour policy measures (0.1 %)
Employees
=>
7.6.2007 B 25
Defining the main type of activity (cont.)
STEPS 10 - 13 Persons who according to the
student register study full-time in the autumn term (6.0 %)
Those 15-year-olds who were not classified into any other group at earlier stages (1.1 %)
Persons in labour market training in the last week of the year (0.3 %)
Persons who have received study aid either in the autumn term or both in the spring and autumn terms (0.2 % ) Students
=>
7.6.2007 B 26
Defining the main type of activity (cont.)
STEPS 14 - 15 Persons who at the turn of the year
receive old-age, disability, unemployment or special farmers’ pension (14.8 % )
All those whose pension is higher than their income from work and above EUR 580 and the pension is not a survivor's pension or part-time pension (0.2 %)
=>
Pensioners
7.6.2007 B 27
Defining the main type of activity (cont.)
STEP 16 Persons whose earned income is
higher than entrepreneurial income and over EUR 5600 per year and who were not classified into any other group at earlier stages
0,8 % of the population (in 2004)
=>
Employees
7.6.2007 B 28
Defining the main type of activity (cont.)
STEP 17 Persons who do not meet any of
the above criteria
3.4 % of the population (in 2004)
=>
Other persons outside the labour force
7.6.2007 B 29
Data quality
Municipal pilot study based on 1980 Population Census Register-based statistics in connection with 1985 census Evaluation study of the 1990 census Continuous quality assessment
Labour force survey as reference materialTwo purposes:
monitoring of the level of the results monitoring of the extent to which the methods produce data
classified in the same manner
7.6.2007 B 30
Monitoring of the extent to which the methods produce data classified in the same manner
to identify errors in data processing to identify situations requiring a change in decision rules to check the level of results
7.6.2007 B 31
Main type of activity of the population according to the registers and questionnaire in 1985 census (percentages)
Register QuestionnaireTotal Em-
ployedUnem-ployed
Stu-dents
Pensio-ners
Others
Total 100,0 57,5 3,5 9,4 23,7 5,9Employed 100,0 93,7 0,6 2,1 1,0 2,6Unemployed 100,0 17,7 70,3 1,3 2,6 8,1Students 100,0 3,5 1,1 90,7 0,2 4,5Pensioners 100,0 2,4 0,2 0,2 94,5 2,9Others 100,0 24,6 9,7 5,1 3,3 57,2
7.6.2007 B 32
Main type of activity of the population according to the register based census and evaluation study 1990 (percentages)
Evaluation study (questionnaire)Total Em-
ployedUnem-ployed
Stu-dents
Pensio-ners
Others
Total 100,0 61,4 4,0 9,7 20,4 4,5Employed 100,0 93,2 1,0 2,4 1,5 1,9Unemployed 100,0 24,5 68,9 2,2 1,3 3,2Students 100,0 10,3 1,6 85,8 0,6 1,6Pensioners 100,0 3,2 0,3 0,3 94,8 1,4Others 100,0 20,5 10,4 9,4 4,8 54,9
Census (register)
7.6.2007 B 33
Main type of activity by questionnaire and by register 1985
Employed Unemployed Students Pensioners Others0
500
1000
1500
2000
2500
3000Thousands
Questionnaire -85 Register -85 Evaluation study -85
7.6.2007 B 34
Main type of activity by questionnaire and by register 1990
Employed Unemployed Students Pensioners Others0
500
1000
1500
2000
2500
3000Thousands
Evaluation study -90 Register -90 LFS
7.6.2007 B 35
Employed according to the Labour Force Survey and Register-based Employment Statistics 1989-2003
19891990 1995 2000 2003
Year
0
500
1 000
1 500
2 000
2 500
3 000Thousands
LFS RES
7.6.2007 B 36
Persons according to the Register-based Employment Statistics (RES) and Labour Force Survey (LFS) on December 2002 (persons)
Total Em-ployed
Unem-ployed
Stu-dents
Pensio-ners
Cons-cripts
Others Non response
Total 12 308 6 194 489 1 097 2 136 55 474 1 863Employed 7 079 5 809 39 103 51 2 78 997Unemployed 886 118 345 15 33 1 186 188Students 1 302 92 66 953 12 3 33 143Pensioners 2 460 95 4 3 2 000 - 10 348Conscripts 52 - 1 - - 47 - 4Others 529 80 34 23 40 2 167 183
LFSRES
7.6.2007 B 37
Persons according to the Register-based Employment Statistics (RES) and Labour Force Survey (LFS) on December 2002 (percentages)
Total Em-ployed
Unem-ployed
Stu-dents
Pensio-ners
Cons-cripts
Others Non response
Total 100,0 50,3 4,0 8,9 17,4 0,4 3,9 15,1Employed 100,0 82,1 0,6 1,5 0,7 0,0 1,1 14,1Unemployed 100,0 13,3 38,9 1,7 3,7 0,1 21,0 21,2Students 100,0 7,1 5,1 73,2 0,9 0,2 2,5 11,0Pensioners 100,0 3,9 0,2 0,1 81,3 .. 0,4 14,1Conscripts 100,0 - 1,9 - - 90,4 - 7,7Others 100,0 15,1 6,4 4,3 7,6 0,4 31,6 34,6
LFSRES
7.6.2007 B 38
Unemployed according to the LFS and the RES 1989-2003
1990 1995 2000 2003
Year
0
100
200
300
400
500
600Thousands
LFS RES
7.6.2007 B 39
Unemployed according to the RES and LFS in December 2002
Unemployed in RES 886 Unemployed in LFS 489
Unemployed in both statistics:
345
Of which in LFS:
Employed 118Students 15Pensioners 33Others 187Non response 188
Of which in RES:
Employed 39Students 66Pensioners 4Others 35
Target population: LFS sample in December 2002, 12 497 persons
7.6.2007 B 40
Use of interview survey data with data from administrative sources (1)
The role of register data sampling primary data collection editing and imputing weighting quality control
Basic principles informed consent ethics
7.6.2007 B 41
Use of interview survey data with data from administrative sources (2)
long traditions especially demographic data and income data
Why? need for data of high quality request for cost effectiveness risk of growing non-response
7.6.2007 B 42
New statistics made possible by the use of registers
For instance: Flow statistics
employment flowsstudent flowsflows between branches of industry etc.
Placement statistics Longitudinal researches
7.6.2007 B 43
Qualification of father
% of children at university
Qualification of father 2= below upper secondary educationUpper secondary qualification 3 = lower level 4 = upper levelTertiary education 5= vocational college qualification 6= lower university degree 7= higher university degree 8= doctorate-level degree
University students aged 20-24 according to the level of educational attainment of father in 1995
7.6.2007 B 44
Progress of studies in higher university programmes (Master's degree) of those who entered education in 1995 by end-2003, %
Source: Education Statistics, Employment Statistics, Statistics Finland
Higher university degree
(Master)56%
Other degree3%
Lower university degree (Bachelor)
9%
No degree32%
2
6
Studied in other education
Did not study, employed
In other activity
Studied in university21
2Did not study, unemployed1
7.6.2007 B 45
Source: Education Statistics, Employment Statistics, Statistics Finland
21,6 21,7 21,1 20,2 20,5
33,5 34,7 35,4 37
45,949,5 51 50,2 50,8 52,7
57,6 57,8 55,9 55,7 56,9
2000 2001 2002 2003 20040
10
20
30
40
50
60
70%
Upper secondary general education
Vocational education (a change in coverage 2003->2004)
Polytechnic education
University education
Proportion of employed students during studies by sector of education 2000, 2001, 2002, 2003 and 2004, %
7.6.2007 B 46
Employment rate of persons graduated of healthcare in the first stage of tertiary education (nurses, midwives, laboratorian etc.) in 1989-2003 by year of graduation
1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003
year
0
20
40
60
80
100employment rate, %
Year of graduation1989 1991 1993 1995
7.6.2007 B 47
History of Employment1950,1970,75,80,85,87-2004
Unem-ployed
Pensioners
Students
Housework
Children
Industry
Level of ed.Field of ed.
Deaths and the history of different activities
Deaths1971-2005by causes of death
Immigr.Births
Occupation Socio/econIncomes
Completed educ.
Family background
RegionIncome level
7.6.2007 B 48
UNEMPLOYED
50 897
33 15833 117
6 774
83 896
95 791
14 159
Employed2 283 913
Employed2 245 780
2 035 543
47 635
5 87
3
STUDENTS
PENSIONERS
EMIGRANTSDied persons IMMIGRANTS
To where?
31.12.
200331.12.
2004
OTHERS(Homework, conscripts)
Flows between different activity groups: Employeed 2003-2004
7.6.2007 B 49
Persons left their job by the next year activity
49988
45389
73095
168269
191467
172721
106308
113034
112220
83696
88110
91871
80675
91957
106274
46500
55174
55977
59436
41404
41375
31485
37949
34190
41190
32951
45143
47981
48632
55499
64084
62420
67030
58261
52143
47991
36904
37725
33654
38627
29484
41103
41261
44175
45646
41455
49497
49647
45138
41910
51522
38441
37964
34456
26935
26761
32931
32987
39616
36066
1988
1989
1990
1991
1992
1993
1994
1995
1996
1997
1998
1999
2000
2001
2002
0 50000 100000 150000 200000 250000 300000
Unemployed
Student
Other
Pensioner
60425
58362
44193
46483
83714
93407
148460
126951
114268
127326
110348
101296
103534
84030
111383
93647
92833
87062
66071
51143
42832
61293
71133
79011
80534
108970
94721
94578
91439
92528
72073
73514
66005
50603
33538
34111
36377
37926
40131
41353
49588
46089
51829
49951
51718
9550
8323
7024
4812
3812
6550
5111
5163
5302
6612
9007
6582
7631
5720
6131
1988
1989
1990
1991
1992
1993
1994
1995
1996
1997
1998
1999
2000
2001
2002
0 50000 100000 150000 200000 250000 300000
Unemployed
Students
Others
Pernioners
New employed persons by the previous year activity
7.6.2007 B 50
More information
http://www.stat.fi/censusbyregisters