Statistical principles of design and analysis for accuracy ... · Boston University Slideshow Title...

P. Olofsson and S.V. Stehman

Fifth GFOI Regional Technical Workshop: Accuracy Assessment

January 25-29 2016, Santa Rosa, Philippines

Statistical principles of design and analysis for

accuracy assessment and area estimation

Boston University Slideshow Title Goes Here

“Why is this necessary?”

A very common result of remote sensing is an

estimate of an amount, or area (e.g. change map)

The strength of remote sensing is that it often allows

wall-to-wall coverage of the area under consideration

The weakness of remote sensing is that results are

never perfect (e.g., a classified map contains error)

If the map has errors, then mapped areas of the map

categories are incorrect and need adjustment for

errors, and uncertainty in area quantified

Fifth GFOI Regional Technical Workshop

4/28/2016

Not new stuff! We’re making use of

Cochran (1977) and Särndal et al. (1992);

Card (1982) used these techniques to

improve map estimates

4/28/2016 Accuracy Assessment

4/28/2016

Some terminology

We identify classification errors in a map by designing

and implementing an accuracy assessment

Sample the map (i.e. the population) and collect

reference observations for sample: best assessment

of true class at a given location

Reference data: information used to obtain the

reference class

By comparing the map and reference labels –

compute map accuracy: the degree to which the map

corresponds to the reference condition

4/28/2016

More terminology

Statistical inference: collect information about

population (map) by analyzing sample of population

A conclusion of the inference is an estimate that best

approximates a particular population parameter

A confidence interval of the estimate is the interval

that contain the true parameter value with a probability

of the confidence level if the sampling is repeated

many times

Bias is the difference between an estimator’s

expected value and the true parameter value

4/28/2016

Components of Accuracy Assessment

Sampling design: Decide which elements of the

population (map) to visit

Response design: Determine the true class or

“reference condition” at a location

Analysis: Organize and summarize data to make

inference (accuracy, area) about the population (map)

Where will we observe the true condition?

What is the true (reference) condition?

And how will we use the data?

4/28/2016

Sampling design

Many options but we want:

A probability sample

Practical to implement given response design

Yields estimates with small standard errors

Low cost

Spatially well distributed across country

Can allow for change in sample size

Unbiased estimator of variance (not approximate)

Random

selection

By measuring height of people in

sample we can make inference of

height of population. We get an

estimate of population height (ℎ ).

Population, height (h) unknown

Random sample,

height measured

Simple random sample

We get a more precise estimate of the

of the height and we can get the

height of each stratum in the

population (ℎ 𝑔, ℎ 𝑏, ℎ 𝑤).

Population, height (h) unknown

Stratified random sample

Stratified random

selection; 3 strata

Stratified random

sample, height

measured

4/28/2016

Simple

Systematic

2-Stage

Clustered

Simple

Random

Stratified

Random

Basic Sampling Design Questions

Strata – group assessment units into strata?

Clusters – sample assessment units individually or in

groups?

Selection protocol – select the sample of assessment

units by a systematic or a simple random protocol?

Basic Sampling Design Questions

Decisions depend objective of analysis and design

criteria

Clusters – use for costs but complex variance

estimation, larger variance and spatial correlation

Strata – use for objectives and precision

Selection protocol – SYS more difficult to combine with

stratification by map class; SRS unappealing

distribution

Response Design

The method use to decide reference condition

Reference condition decided without knowing map class of the location

Four main features of response design

Information to decide reference condition

Spatial unit of the assessment

Assign reference condition

How do define if map class and reference condition agree

Spatial Unit of Assessment

Partition region of interest into non-overlapping

spatial units

Common choices are

Block of pixels (e.g. 3x3, 5x5)

Segment

Sampling design depends on choice of spatial

Sources of Information to Determine

Reference Condition

Google Earth

Landsat

RapidEye

Ground visit (e.g., National Forest Inventory)

Decision will impact sampling design options (e.g.,

should we use clusters or not)

Analysis

Accuracy Define parameters that describe accuracy

Estimate parameters from sample

Estimate standard errors

Area Reference condition is basis of estimate

Increase precision by incorporating map information into area

estimator

Estimate area and standard errors

Error matrix

Reference

Forest loss No loss Map prop.

Forest loss p11 p12 p1+

No loss p21 p22 p2+

Ref. prop. p+1 p+2 1

4/28/2016

Reference

No loss p21 p22 p2+

Bias-adjusted estimator: add omission

error and subtract commission error

4/28/2016

𝑝 +1 = 𝑝1+ + (𝑝 21 − 𝑝 12)

Reference

No loss p21 p22 p2+

Stratified/post-stratified estimator

4/28/2016

𝑝 +1 = 𝑝 𝑖𝑗𝑞𝑖=1 = 𝑝 11+ 𝑝 21

Area estimators

Bias-adjusted estimator

Unbiased for any sample size

Known as a “difference” estimator in sampling texts

More efficient if map class is continuous

Stratified/Post-stratified

Unbiased (but problem if no units from a post-stratum)

Allows use of all map classes as post-strata

More efficient if map classes is categorical

4/28/2016

Reference

No loss p21 p22 p2+

Measures of accuracy

4/28/2016

𝑂 = 𝑝 𝑗𝑗𝑞𝑖=1 = 𝑝 11+ 𝑝 22

𝑈𝑖 = 𝑝 𝑖𝑖 ÷ 𝑝 𝑖+ = 𝑝 11 ÷ 𝑝 1+

𝑃𝑗 = 𝑝 𝑗𝑗 ÷ 𝑝 𝑗+ = 𝑝 11 ÷ 𝑝 +1

Conclusions

All maps have errors – can’t count pixels – inference

of area from reference sample necessary

Sample design depends on assessment objectives

and practicalities

We use the map to increase precision in area

estimates – we are not validating the map

Area estimation typically of first priority – accuracy

secondary

4/28/2016

Statistical principles of design and analysis for accuracy ... · Boston University Slideshow Title...

Documents

HSA Slideshow

Plastic Slideshow

Update on CEOS Support to GFOI S. Briggs, S. Ward ESA, GFOI Office SIT Workshop Agenda Item #9 CARB-3/4/5 CEOS SIT Technical Workshop EUMETSAT, Darmstadt,

Bookmobile Slideshow

THE GLOBAL FOREST OBSERVATIONS INITIATIVE – Update on ...€¦ · Phase 2 Ø GFOI Phase 2 sarted in 2017, following external review of ﬁrst 5 years of GFOI (2012-2016) Ø GFOI

Curiosity slideshow

National Forest Monitoring System México GFOI Capacity Building Summit Sept 2014

Slideshow ballet

Slideshow Tutorial - cs.utah.edu · About Slideshow Slideshow is a library for creating slide presentations • A Slideshow presentation is a PLT Scheme program • Instead of a WYSIWYG

PowerPoint slideshow Cooking Healthy for the Holidays Slideshow

CEOS GFOI-FCT data strategy v1-1ceos.org/document_management/Ad_Hoc_Teams/SDCG_for_GFOI/... · 2015-01-26 · The CEOS Data Strategy for GFOI/FCT - 2 satellite, periodic ground, and

CEOS Strategy for Space Data Coverage and Continuity in Support of GFOI/FCT

Disney slideshow

Blue and Orange Fashion Sale Flyer - Home - MASS …...Logo on slideshow Logo on slideshow Logo on slideshow Logo on slideshow Logo on slideshow Logo on website Logo displayed as intro

China Slideshow

Erstellen einer Slideshow Slideshow Style-Tags · Erstellen einer Slideshow Style-Tags DVD Slideshow GUI (0.9.4.0) Slideshow Übersicht über verwendbare Style-Tags zur Gestaltung

16.01.2016 Global Forest Observations Initiative Simon Eggleston GFOI SDCG 5, Rome, Italy, 25 Feb 2014,

CV Slideshow

Create photo slideshow with photo slideshow software

9.0 Slideshow - help.infoboard.dkhelp.infoboard.dk › helpfiles › slideshow › uk › help.pdf · 9.0 Slideshow 2 Slideshows: create new slideshows or edit in an existing slideshow