36
First Workshop on European Reference Grids (EuroGrid 2003): Ispra, Italy 27th-29th October 2003 Aggregation and disaggregation of statistics Statistics Denmark Erik Sommer

First Workshop on European Reference Grids (EuroGrid 2003): Ispra, Italy 27th-29th October 2003 Aggregation and disaggregation of statistics Statistics

Embed Size (px)

Citation preview

First Workshop on European Reference Grids (EuroGrid 2003):

Ispra, Italy 27th-29th October 2003

Aggregation and disaggregation of statistics

Statistics DenmarkErik Sommer

Ispra, Italy October 27-29, 2003.2

Presentation - Content

• Building a national square grid

• Disclosure of statistical data

• Supply and demand

• Selling statistical data

• Partnerships

• Integration with other countries

• Future work

Ispra, Italy October 27-29, 2003.3

GRIDdata - CLUSTERdata

København, Danmark

Ispra, Italy October 27-29, 2003.4

Danish National Grid – 5 grid sizes

Size Name/Net Grid ID Colour

100 km Overview KN100kmDK | Grey

10 km Place name KN10kmDK | Green

1 km Basic KN1kmDK | Red

250 m 250 meter KN250mDK | Orange

100 m Hectare KN100mDK | Blue

Ispra, Italy October 27-29, 2003.5

100m_62237_5749100m_62237_5749

Aarhus Kongreshus, Amaliegade 23, 8000 C.

Aarhus Kongreshus, Amaliegade 23, 8000 C.

1 km- og 100 m-grid1 km- og 100 m-grid

Ispra, Italy October 27-29, 2003.6

Grid ID

Formel:Prefix+’_’+Str(Div(N/f))+’_’+Str(Div(E/f))

Formel:Prefix+’_’+Str(Div(N/f))+’_’+Str(Div(E/f))

Northing: 6.223.600 mEasting: 574.900 mNorthing: 6.223.600 mEasting: 574.900 m

100m_62236_5749100m_62236_5749

Ispra, Italy October 27-29, 2003.7

Specification – National Square Grid

• UTM projection, zone 32

• Datum used is EUREF 89

• All cell names must refer to the original UTM32\EUREF89 projection to ensure standardized names.

Ispra, Italy October 27-29, 2003.8

Building adresse/street entrance

Ispra, Italy October 27-29, 2003.9

The key to griddata – building address/street entrance

Address IDMunicipality-code

Street-code

House-number Zipcode Kn100Dk Kn1kmDk

1810012001_ 181 0012 001_ 2840 100m_61907_7175 1km_6190_717

1810012001A 181 0012 001A 2840 100m_61907_7175 1km_6190_717

1810012003_ 181 0012 003_ 2840 100m_61907_7175 1km_6190_717

1810012010_ 181 0012 010_ 2840 100m_61907_7176 1km_6190_717

1810012011_ 181 0012 011_ 2840 100m_61908_7175 1km_6190_717

1810012015_ 181 0012 015_ 2840 100m_61908_7175 1km_6190_717

1810032003_ 181 0032 003_ 2840 100m_61909_7175 1km_6190_717

Ispra, Italy October 27-29, 2003.10

Placement of addresses – calculated addresses

Formel:Prefix+’_’+Str(Div(N/f))+’_’+Str(Div(E/f))

Formel:Prefix+’_’+Str(Div(N/f))+’_’+Str(Div(E/f))

Northing: 6.223.600 mEasting: 574.900 mNorthing: 6.223.600 mEasting: 574.900 m

100m_62236_5749100m_62236_5749

Ispra, Italy October 27-29, 2003.11

Person – Home – Dwelling - Workplace

Ispra, Italy October 27-29, 2003.12

Placement of addresses – corrected addresses

Ispra, Italy October 27-29, 2003.13

STATISTICAL INFORMATIONSYSTEM

Personid:

Person number

Workplaceor education

(daytime)

Home Dwelling

(night-time)Address

Health

Taxes

LabourmarketEducation

Social

etc

CPRPerson

BBRBuilding

CVRBusines

Questionaire

Inter-view

Parcel

Ispra, Italy October 27-29, 2003.14

Guidelines for discloure of data

Number of households (clusters)

Data from Statistics Denmark

1-19 households No data 20-49 households Key Figures 50-99 households Few intervals 100-149 households More intervals 150+ households Statistical

datasystem

Ispra, Italy October 27-29, 2003.15

Distribution of Households 100 x 100 meter – 1. January 2003, Denmark

Group Householdsintervals No. Cells % celler No. Households % Households0 0 107581 01 1-19 385544 95,64% 1328931 54,55%2 20-49 10625 2,64% 319342 13,11%3 50-99 4126 1,02% 289163 11,87%4 100-149 1451 0,36% 175537 7,21%5 150-399 1287 0,32% 273017 11,21%6 400+ 89 0,02% 50051 2,05%

Total cells households 403122 100,00% 2436041 100,00%

Not placed in grid cells 30652 1,24%Grid cells 510703 2436041 98,76%Denmark 1.1.2003 2466693 100,00%

100x100 meterKMS BAK version1.5

Ispra, Italy October 27-29, 2003.16

Distribution of Households 1x1 km – 1. January 2003

Group Householdsinterval No. Cells % cells No. Households % Households0 0 2753 01 1-19 32576 78,24% 189537 7,73%2 20-49 3664 8,80% 111817 4,56%3 50-99 1691 4,06% 120421 4,91%4 100-149 810 1,95% 98931 4,03%5 150-399 1490 3,58% 370208 15,10%6 400+ 1407 3,38% 1561396 63,67%

Total cells households 41638 100,00% 2452310 100,00%

Not placed in grid cells 14383 0,58%Grid cells 44391 2452310 99,42%Denmark 1.1.2003 2466693 100,00%

1x1 kmKMS BAK version1.5

Ispra, Italy October 27-29, 2003.17

Building Block: Number of householdsGRID id Municipality Households Population61901_7126 207 1 661902_7126 207 3 861903_7126 207 1 661904_7126 207 3 761905_7126 207 3 861909_7126 207 5 1061910_7126 207 3 661911_7126 207 2 561912_7126 207 2 761915_7126 207 5 1361916_7126 207 2 861917_7126 207 19 3861901_7127 207 9 3061902_7127 207 7 19

Ispra, Italy October 27-29, 2003.18

Options when clustering gridcells

• Proximity – districts, tradearea required to be a neighbour

• Optimizing – finding gridcells with equal value without necessarily being a neighbour

Ispra, Italy October 27-29, 2003.19

Finding gridcells with equal value without necessarily being a neighbour.

Optimizing:

Ispra, Italy October 27-29, 2003.20

Input clustering cells

CelleIDAntal_ husstande

Antal_ personer cl20id cl50id cl100id cl150id

7201_61721 142 262 384157 384157 384157 3841577201_61722 9 31 379388 394127 379158 3789457201_61723 7 24 381037 394127 379629 3786937201_61724 73 138 384160 384160 384160 3841607201_61725 32 59 384161 384161 383468 3829737201_61726 41 41 384162 384162 384162 3837167201_61728 3 7 379180 394127 377836 3778367201_61729 14 41 384164 394127 379632 3786977201_61730 12 25 382726 394127 378492 3784927201_61731 12 40 380092 394127 378265 3782657201_61732 13 34 383481 394127 382453 378922

Ispra, Italy October 27-29, 2003.21

OUTPUT: Dataset for ClustersKom. Clusterid I alt 0-149999 kr150-249999 kr 250-349999 kr 350-499999 kr 500-699999 kr over 700000 kr

101 0 436 111 77 85 77 53 33101 1 193 15 18 21 39 61 39101 2 183 54 59 27 25 13 5101 3 244 12 31 24 45 83 49101 4 115 7 11 13 26 34 24101 5 166 23 64 40 21 15 3101 6 107 40 31 20 11 4 1101 7 226 59 55 43 39 23 7101 8 137 12 25 24 24 28 24101 9 171 29 30 33 32 30 17101 10 408 198 94 51 40 21 4101 11 202 11 19 24 34 70 44101 12 196 28 29 32 42 40 25101 13 161 15 31 16 30 48 21101 14 192 32 33 24 33 54 16101 15 174 11 17 20 46 42 38101 16 189 35 26 35 39 40 14101 17 118 47 25 24 9 9 4

Ispra, Italy October 27-29, 2003.22

Selling statistical data

• National level

• Consumermarket

• Public sector

• Reserarch

• Selling directly or through partners

Ispra, Italy October 27-29, 2003.23

Segmentation New customers

Ispra, Italy October 27-29, 2003.24

Ownership dwelling > 95%

Ispra, Italy October 27-29, 2003.25

Householdincome 450.000+ > 65%

Ispra, Italy October 27-29, 2003.26

Households with children > 60%

Ispra, Italy October 27-29, 2003.27

Ownership > 95% Householdincome 450.000+ > 65% Households with children > 60%

Ispra, Italy October 27-29, 2003.28

Calculated addresses www.kms.dk

Ispra, Italy October 27-29, 2003.29

Calculated addresses www.kms.dk

Ispra, Italy October 27-29, 2003.30

Access to addresses in Denmark

Calculated address coordinateswww.kms.dkImproved access to public data

From 1. January 2003 a number of public data can be used freely. State, counties and municiapalities have made an agreement giving access to different registers including Building Register Data (BBR) and X,Y-address coordinates

www.ebst.dk/ejendom or www.ois.dk

Ispra, Italy October 27-29, 2003.31

Conzoom online – www.conzoom.net

Ispra, Italy October 27-29, 2003.32

Nordic maps and griddata (Conzoom online)

Ispra, Italy October 27-29, 2003.33

Nordic griddata online (Conzoom)

Ispra, Italy October 27-29, 2003.34

Householdstype – singles with no children

Ispra, Italy October 27-29, 2003.35

Future work

• Access to addresses

• Access to statistical data

• Disclosure policy

• Dissemination

• Integration (grid, data, pricing and promotion)

Ispra, Italy October 27-29, 2003.36

Access Statistical data

Thank you for your attention

For further information please contact:

Erik Sommer:

www.dst.dk/kvadratnet

[email protected]

phone +45 3917 3582