23
The Evolution of Banff in the Context of Modernization Darren Gray Statistics Canada UNECE Workshop on Statistical Data Editing Neuchâtel, Switzerland, 18-20 September 2018

The Evolution of Banff in the Context of Modernization€¦ · Presentation Outline 4. Banff Research and Development Plan 5. 6 Internal methodology Process flow using Banff Generalized

  • Upload
    others

  • View
    2

  • Download
    0

Embed Size (px)

Citation preview

Page 1: The Evolution of Banff in the Context of Modernization€¦ · Presentation Outline 4. Banff Research and Development Plan 5. 6 Internal methodology Process flow using Banff Generalized

The Evolution of Banff in the

Context of Modernization

Darren Gray

Statistics Canada

UNECE Workshop on Statistical Data Editing

Neuchâtel, Switzerland, 18-20 September 2018

Page 2: The Evolution of Banff in the Context of Modernization€¦ · Presentation Outline 4. Banff Research and Development Plan 5. 6 Internal methodology Process flow using Banff Generalized

• Generalized edit and imputation (data editing) system

developed and maintained by Statistics Canada

• Primary tool for business surveys at Statistics Canada

• External users include other Canadian agencies, other

National Statistical Institutes (NSIs) and some private

organizations

What is Banff?

2

Page 3: The Evolution of Banff in the Context of Modernization€¦ · Presentation Outline 4. Banff Research and Development Plan 5. 6 Internal methodology Process flow using Banff Generalized

• Statistics Canada is moving beyond a survey-first approach with new methods and integrating data from a variety of existing sources

• Existing methods must be reviewed for suitability, while new methods are tested and developed for use

• How do we maintain the “generalized” aspect of Banff (focusing on methods with broad application) while facilitating the integration of emerging methods in an efficient manner?

Modernization

3

Page 4: The Evolution of Banff in the Context of Modernization€¦ · Presentation Outline 4. Banff Research and Development Plan 5. 6 Internal methodology Process flow using Banff Generalized

• Overview of the Banff research & development plan

• Description of Banff as a process flow tool

• Project details

Presentation Outline

4

Page 5: The Evolution of Banff in the Context of Modernization€¦ · Presentation Outline 4. Banff Research and Development Plan 5. 6 Internal methodology Process flow using Banff Generalized

Banff Research and

Development Plan

5

Page 6: The Evolution of Banff in the Context of Modernization€¦ · Presentation Outline 4. Banff Research and Development Plan 5. 6 Internal methodology Process flow using Banff Generalized

6

Internal methodology

Process flow using Banff

Generalized assessment

and evaluation tools

Assess and

compare new

methods

Discrete /

categorical variables

Non-linear

constraints

Simplify / improve

user control

Current needs Future needs

Measure impact of

data editing on

statistical outputs

Research and development plan

Integration of

external methods

into Banff

framework

Page 7: The Evolution of Banff in the Context of Modernization€¦ · Presentation Outline 4. Banff Research and Development Plan 5. 6 Internal methodology Process flow using Banff Generalized

7

Internal methodology

Process flow using Banff

Generalized assessment

and evaluation tools

Assess and

compare new

methods

Discrete /

categorical variables

Non-linear

constraints

Simplify / improve

user control

Current needs Future needs

Measure impact of

data editing on

statistical outputs

Integration of

external methods

into Banff

framework

Research and development plan

Page 8: The Evolution of Banff in the Context of Modernization€¦ · Presentation Outline 4. Banff Research and Development Plan 5. 6 Internal methodology Process flow using Banff Generalized

8

Process flow using Banff

Integration of external

methods into Banff

framework

Simplify / improve

user control

1) Standardization of

Banff procedures

Framework for assessing

and “wrapping” external

methods

2) Improvements to

Banff processor

Users control integration

of external method into a

process flow

3) Introduction of

record-level statuses

Allows for a wider variety

of external methods

Address data

management “holes”

User control and

flexibility

Simplifies certain

procedures

Research and development plan

Page 9: The Evolution of Banff in the Context of Modernization€¦ · Presentation Outline 4. Banff Research and Development Plan 5. 6 Internal methodology Process flow using Banff Generalized

Process flow using Banff

9

Page 10: The Evolution of Banff in the Context of Modernization€¦ · Presentation Outline 4. Banff Research and Development Plan 5. 6 Internal methodology Process flow using Banff Generalized

• Banff consists of nine SAS procedures, performing

various edit and imputation related functions

• With one exception, these procedures fit naturally within

the Generic Statistical Data Editing Model (GSDEM)

developed by the UNECE

Description of Banff as a modular,

automated data editing tool

10

Process flow using Banff

Page 11: The Evolution of Banff in the Context of Modernization€¦ · Presentation Outline 4. Banff Research and Development Plan 5. 6 Internal methodology Process flow using Banff Generalized

11

Selection

Outlier

ErrorLoc

Review

EditStats VerifyEdits

Amendment

Deterministic

DonorImputation

Estimator

MassImputation

Prorate

GSDEM Data Editing Function

Classifications

Process flow using Banff

Page 12: The Evolution of Banff in the Context of Modernization€¦ · Presentation Outline 4. Banff Research and Development Plan 5. 6 Internal methodology Process flow using Banff Generalized

12

Module interaction

• Banff passes information from one procedure to another using

status flags

• Selection procedures identify values requiring further treatment

FieldID

ID

FieldID

ID

Process flow using Banff

Page 13: The Evolution of Banff in the Context of Modernization€¦ · Presentation Outline 4. Banff Research and Development Plan 5. 6 Internal methodology Process flow using Banff Generalized

13

Module interaction

• Banff passes information from one procedure to another using

status flags

• Selection procedures identify values requiring further treatment

• Amendment procedures make changes to the data

FieldID

ID

FieldID

ID

FieldID

ID

Process flow using Banff

Page 14: The Evolution of Banff in the Context of Modernization€¦ · Presentation Outline 4. Banff Research and Development Plan 5. 6 Internal methodology Process flow using Banff Generalized

14

Module interaction

• Banff passes information from one procedure to another using

status flags

• Selection procedures identify values requiring further treatment

• Amendment procedures make changes to the data

• All-in-one procedures do both

FieldID

ID

FieldID

ID

FieldID

ID

Process flow using Banff

Page 15: The Evolution of Banff in the Context of Modernization€¦ · Presentation Outline 4. Banff Research and Development Plan 5. 6 Internal methodology Process flow using Banff Generalized

Project details

15

Page 16: The Evolution of Banff in the Context of Modernization€¦ · Presentation Outline 4. Banff Research and Development Plan 5. 6 Internal methodology Process flow using Banff Generalized

• Objective: develop formal criteria defining a modular,

automated function within the Banff data editing

framework

• Elements:

• Function classification

• Standardized inputs and outputs

• Reserved status flags

1) Standardizing the Banff procedures

16

Standardizing Banff procedures

Page 17: The Evolution of Banff in the Context of Modernization€¦ · Presentation Outline 4. Banff Research and Development Plan 5. 6 Internal methodology Process flow using Banff Generalized

• Most procedures already fit this criteria, but we have

identified some gaps

• Key change: requiring that procedures output updated

statistical data and status files

17

Standardizing Banff procedures

Page 18: The Evolution of Banff in the Context of Modernization€¦ · Presentation Outline 4. Banff Research and Development Plan 5. 6 Internal methodology Process flow using Banff Generalized

• Banff processor: Metadata-driven tool designed to

facilitate the use of Banff procedures in a sequential data

editing process flow

• Advantages:

• Data management between steps is handled behind the scenes

• Overall process flow and procedure parameters are all stored in

a single, editable table

• Drawback: limited control of process flow details

2) Improving the Banff processor

18

Improving the Banff processor

Page 19: The Evolution of Banff in the Context of Modernization€¦ · Presentation Outline 4. Banff Research and Development Plan 5. 6 Internal methodology Process flow using Banff Generalized

• Process flows often need

to restrict how procedures

interact

• Example: some

amendment procedures

are designed to fix specific

errors

Linking procedures together

19

Improving the Banff processor

Selection

Outlier

ErrorLoc

Amendment

Deterministic

DonorImputation

Estimator

Page 20: The Evolution of Banff in the Context of Modernization€¦ · Presentation Outline 4. Banff Research and Development Plan 5. 6 Internal methodology Process flow using Banff Generalized

Linking procedures together

20

Improving the Banff processor

• Some restrictions are built into the Banff processor

• e.g., ErrorLoc and DonorImputation linked by “edit groups”

• Proposal: give control to users by implementing status

filters

Page 21: The Evolution of Banff in the Context of Modernization€¦ · Presentation Outline 4. Banff Research and Development Plan 5. 6 Internal methodology Process flow using Banff Generalized

• Status flags are currently associated with specific values,

identified by <ID, FieldID>

• Certain information can only be captured at the record

level, but is required for subsequent procedures

• Propose introducing record-level status fields

3) Record-level statuses

21

Record-level statuses

FieldID

ID

FieldID

ID

FieldID

ID

Page 22: The Evolution of Banff in the Context of Modernization€¦ · Presentation Outline 4. Banff Research and Development Plan 5. 6 Internal methodology Process flow using Banff Generalized

• Standardizing the Banff modules is a key component of

our plan to integrate external methods within the Banff

framework

• Presents opportunity for open-source collaboration

Remarks

22

Page 23: The Evolution of Banff in the Context of Modernization€¦ · Presentation Outline 4. Banff Research and Development Plan 5. 6 Internal methodology Process flow using Banff Generalized

THANK YOU!

MERCI!

For more information

please contact,

[email protected]

#StatCan100