
James Brown

James.D.Brown@noaa.gov

An introduction to verifying probability forecasts

RFC Verification Workshop

Goals for today

1. Introduction to methods
• What methods are available?
• How do they reveal (or not) particular errors?
• Lecture now, and hands-on training later

2. Introduction to prototype software
• Ensemble Verification System (EVS)
• Part of a larger experimental project (XEFS)
• Lecture now, and hands-on training later

Goals for today (continued)

3. To establish user requirements
• EVS is at a very early (prototype) stage
• The pool of methods may expand or contract
• Need some input on verification products
• AND to address the pre-workshop questions…

Pre-workshop questions

• How is ensemble verification done?
• Is it the same for short- and long-term ensembles?
• What tools exist, and are they operational?
• Which metrics for which situations?
• Are there simple metrics for end users?
• How best to manage the workload?
• What data need to be archived, and how?

Contents for next hour

1. Background and status
2. Overview of EVS
3. Metrics available in EVS
4. First look at the user interface (GUI)


1. Background and status

A verification strategy?

A first look at operational needs
• Two classes of verification identified

1. High time sensitivity ('prognostic')
• e.g. how reliable is my live flood forecast?
• …where should I hedge my bets?

2. Less time sensitive ('diagnostic')
• e.g. which forecasts do less well, and why?

Prognostic example

[Figure: temperature (°C) against forecast lead day, comparing a live forecast (L) with matching historical forecasts (H), where the historical observations satisfy μH = μL ± 1.0 °C.]

Diagnostic example

[Figure: probability of warning correctly ('hits') against probability of warning incorrectly ('false alarms'), both from 0 to 1.0, for a flood warning issued when P >= 0.9, compared with climatology and a single-valued forecast.]
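For reference, a minimal sketch (not part of EVS) of how one point on such a plot might be computed from paired probability forecasts and 0/1 observations. It assumes a warning is issued when the forecast probability reaches a chosen threshold, and takes "warning incorrectly" to mean false alarms relative to observed non-events; the function and variable names are illustrative.

```python
import numpy as np

def hit_and_false_alarm_rates(forecast_probs, observed_events, warning_threshold=0.9):
    """Hit rate and false-alarm rate for warnings issued when the forecast
    probability of the event (e.g. flooding) reaches warning_threshold.

    forecast_probs: forecast probabilities of the event, one per occasion
    observed_events: 1 if the event occurred on that occasion, else 0
    """
    probs = np.asarray(forecast_probs, dtype=float)
    occurred = np.asarray(observed_events, dtype=bool)
    warned = probs >= warning_threshold

    hits = np.sum(warned & occurred)                 # warned, event occurred
    misses = np.sum(~warned & occurred)              # event occurred, no warning
    false_alarms = np.sum(warned & ~occurred)        # warned, no event
    correct_negatives = np.sum(~warned & ~occurred)  # no warning, no event

    hit_rate = hits / (hits + misses) if (hits + misses) else float("nan")
    false_alarm_rate = (false_alarms / (false_alarms + correct_negatives)
                        if (false_alarms + correct_negatives) else float("nan"))
    return float(hit_rate), float(false_alarm_rate)
```

Sweeping warning_threshold from 0 to 1 traces out the full curve for the ensemble forecast.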

Motivation for EVS

Motivation for EVS (and XEFS)
• Demand: forecasters and their customers
• Demand for useable verification products
• …and the limitations of existing software

History
• Ensemble Verification Program (EVP)
• Comprised (too) many parts and lacked flexibility
• Prototype EVS begun in May 2007 for XEFS…

Position in XEFS

[Diagram: EVS within the XEFS architecture. Components shown include IFP (with forecaster MODs), OFS flow data, atmospheric forcing data (precip., temp., etc.), the Ensemble Pre-Processor (EPP3) with the EPP User Interface, the Ensemble Streamflow Prediction Subsystem (ESP2), the Ensemble Post-Processor (EnsPost), hydro-meteorological ensembles of precipitation, temperature and streamflow (raw and post-processed flow ensembles), the Hydrologic Ensemble Hindcaster, the HMOS Ensemble Processor, the Ensemble Product Generation Subsystem (EPG) with the Ensemble Viewer and Ensemble User Interface (producing ensemble/probability products), and the Ensemble Verification Subsystem (EVS), which produces the ensemble verification products.]


2. Overview of EVS

Scope of EVS

Diagnostic verification
• For diagnostic purposes (less time-sensitive)
• Prognostic verification is built into the forecasting systems

Diagnostic questions include…
• Are the ensembles reliable?
• Prob[flood] = 0.9: does flooding occur 9 times out of 10?
• Are forecaster MODs working well?
• What are the major sources of uncertainty?

Design goals of EVS

Verification of continuous time-series
• Temperature, precipitation, streamflow, etc.
• More than one forecast point, but not spatial products

All types of forecast times
• Any lead time (e.g. 1 day to 2 years or longer)
• Any forecast resolution (e.g. hourly, daily)
• Pairing of forecasts and observations (in different time zones)
• Ability to aggregate across forecast points

Design goals of EVS (continued)

Flexibility to target data of interest
• Subset based on forecasts and observations (see the sketch below)
• Two types of condition: 1) time; 2) variable value
• e.g. forecasts where the ensemble mean < 0 °C
• e.g. maximum observed flow in a 90-day window

Ability to pool/aggregate forecast points
• Number of observations can be limiting
• Sometimes appropriate to pool points
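As an illustration of this kind of conditional subsetting (not the EVS configuration syntax), a minimal sketch assuming each pair holds a valid time, an observed value, and an array of ensemble members; the function and argument names are hypothetical.

```python
import numpy as np

def subset_pairs(pairs, months=None, ensemble_mean_below=None):
    """Filter (valid_time, observed, members) pairs by a time condition and a
    variable-value condition.

    months: keep only pairs whose valid month is in this set, e.g. {12, 1, 2}
    ensemble_mean_below: keep only pairs whose ensemble mean is below this
        value, e.g. 0.0 for 'forecasts where the ensemble mean < 0 °C'
    """
    kept = []
    for valid_time, observed, members in pairs:
        if months is not None and valid_time.month not in months:
            continue
        if (ensemble_mean_below is not None
                and np.mean(members) >= ensemble_mean_below):
            continue
        kept.append((valid_time, observed, members))
    return kept
```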

Design goals of EVS (continued)

Carefully selected metrics
• Different levels of detail on errors
• Some are more complex than others, but…
• Use cases and online documentation will assist

To be 'user-friendly'
• Many factors determine this…
• GUI, I/O, execution speed, batch modes

Example of workflow

"How biased are my winter flows > flood level at dam A?"

Archiving requirements

Coordinated across XEFS:

The forecasts
• Streamflow: ESP binary files (.CS)
• Temperature and precipitation: OHD datacard files

The observations
• OHD datacard files

Unlikely to be a database in the near future


3. Metrics available

Types of metrics

Many ways to test a probability forecast
1. Tests of a single-valued property (e.g. the ensemble mean)
2. Tests of the broader forecast distribution
• Both may involve reference forecasts ("skill"); see the sketch below

Caveats in testing probabilities
• Observed probabilities require many events
• Big assumption 1: we can 'pool' events
• Big assumption 2: the observations are 'good'
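To make "skill" concrete, a minimal sketch of a generic skill score against a reference forecast such as climatology, assuming a negatively oriented score whose perfect value is zero (as for the Brier score and CRPS shown later); this is the usual definition, not an EVS-specific one.

```python
def skill_score(score, reference_score):
    """Skill relative to a reference forecast, for a negatively oriented score
    with a perfect value of 0 (e.g. Brier score, CRPS).

    Returns 1 for a perfect forecast, 0 when no better than the reference,
    and a negative value when worse than the reference.
    """
    return 1.0 - score / reference_score
```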

Problem of continuous forecasts

Discrete/categorical forecasts
• Many metrics rely on discrete forecasts
• e.g. will it rain? {yes/no} (rain > 0.01)
• e.g. will it flood? {yes/no} (stage > flood level)

What about continuous forecasts?
• An infinite number of possible events
• Use arbitrary event thresholds (i.e. 'bins')?
• Typically, yes (and the choice will affect results); see the sketch below
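For example, a continuous ensemble forecast can be reduced to a discrete event probability for one chosen threshold. A minimal sketch; the strict '>' convention and the names are illustrative, not the EVS definition.

```python
import numpy as np

def event_probability(members, threshold):
    """Probability of the event 'value > threshold', estimated as the fraction
    of ensemble members exceeding the threshold.

    members: ensemble member values (e.g. river stage)
    threshold: event threshold (e.g. flood stage)
    """
    members = np.asarray(members, dtype=float)
    return float(np.mean(members > threshold))

# e.g. event_probability([12.1, 13.4, 11.8, 14.0], threshold=13.0) -> 0.5
```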

Metrics in EVS

Detail varies with the verification question
• e.g. inspection of 'blown' forecasts (detailed)
• e.g. average reliability of flood forecasts (less detail)
• e.g. rapid screening of forecasts (still less detail)

All are included to some degree in EVS…

Most detailed (box plot)

[Figure: box plots of ensemble forecast errors against time (0 to 20 days since start time). Each box summarizes the 'errors' for one forecast relative to the observation: greatest positive error, the 90th, 80th, 50th, 20th and 10th percentiles, and greatest negative error.]

Most detailed (box plot), continued

[Figure: the same box plots of ensemble forecast errors, but ordered by observed value (increasing size) rather than by time.]
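A minimal sketch of the quantities behind these box plots: the error percentiles of one ensemble forecast relative to its observation. The percentile set follows the figure; the sign convention (member minus observation) and the function name are assumptions.

```python
import numpy as np

def error_summary(members, observed):
    """Summarize the 'errors' of one ensemble forecast (member minus
    observation) with the extremes and selected percentiles."""
    errors = np.asarray(members, dtype=float) - float(observed)
    return {
        "greatest_negative": float(np.min(errors)),
        "p10": float(np.percentile(errors, 10)),
        "p20": float(np.percentile(errors, 20)),
        "p50": float(np.percentile(errors, 50)),
        "p80": float(np.percentile(errors, 80)),
        "p90": float(np.percentile(errors, 90)),
        "greatest_positive": float(np.max(errors)),
    }
```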

Less detail (reliability diagram)

[Figure: reliability diagram of observed probability given the forecast against forecast probability (probability of flooding), both from 0 to 1.0; departure from the diagonal indicates "forecast bias".]

"On occasions when flooding is forecast with probability 0.5, it should occur 50% of the time."
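A minimal sketch of how the points on such a reliability diagram can be estimated: bin the forecast probabilities and compute the observed relative frequency in each bin. The bin count and names are illustrative, not EVS defaults.

```python
import numpy as np

def reliability_points(forecast_probs, observed_events, n_bins=10):
    """Return (mean forecast probability, observed relative frequency, count)
    for each forecast-probability bin that contains at least one forecast."""
    probs = np.asarray(forecast_probs, dtype=float)
    occurred = np.asarray(observed_events, dtype=float)  # 1 = event, 0 = no event
    edges = np.linspace(0.0, 1.0, n_bins + 1)
    points = []
    for i, (lo, hi) in enumerate(zip(edges[:-1], edges[1:])):
        last = i == n_bins - 1
        in_bin = (probs >= lo) & ((probs <= hi) if last else (probs < hi))
        if np.any(in_bin):
            points.append((float(probs[in_bin].mean()),
                           float(occurred[in_bin].mean()),
                           int(in_bin.sum())))
    return points
```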

Less detail (cumulative Talagrand)

[Figure: cumulative probability against the position of the observation in the forecast distribution (0 to 100); departure from the diagonal indicates "forecast bias".]

"If river stage <= X is forecast with probability 0.5, it should be observed 50% of the time."
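A minimal sketch of the quantity on the horizontal axis: the position of the observation within one forecast distribution, here estimated from the ensemble members (the empirical cumulative distribution of these positions over many forecasts gives the diagram). The names and the '<=' convention are assumptions.

```python
import numpy as np

def observation_position(members, observed):
    """Position of the observation in the forecast distribution, as a
    percentage (0-100): the fraction of ensemble members at or below the
    observed value."""
    members = np.asarray(members, dtype=float)
    return 100.0 * float(np.mean(members <= float(observed)))
```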

Least detailed (a score)

[Figure: river stage (0.0 to 2.0) against time (0 to 30 days), showing flood stage and five numbered forecast/observation pairs.]

Brier score = 1/5 × {(0.8 − 1.0)² + (0.1 − 1.0)² + (0.0 − 0.0)² + (0.95 − 1.0)² + (1.0 − 1.0)²}

Each term compares the forecast probability of exceeding flood stage with the observed outcome (1 = flood occurred, 0 = it did not).
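The same calculation as a minimal Python sketch (not the EVS code); the probabilities and 0/1 outcomes below are those implied by the worked example above.

```python
import numpy as np

def brier_score(forecast_probs, observed_events):
    """Mean squared difference between the forecast probability of the event
    (e.g. exceeding flood stage) and the observed outcome (1 or 0)."""
    probs = np.asarray(forecast_probs, dtype=float)
    outcomes = np.asarray(observed_events, dtype=float)
    return float(np.mean((probs - outcomes) ** 2))

# Worked example from the figure:
# brier_score([0.8, 0.1, 0.0, 0.95, 1.0], [1, 1, 0, 1, 1]) = 0.1705
```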

Least detailed (a score)

[Figure: cumulative probability (0.0 to 1.0) against precipitation amount (0 to 30) for a single forecast, with the observation marked; A and B are the regions between the forecast distribution and the observation on either side of it.]

CRPS = A² + B²

Then average across multiple forecasts: small scores are better.
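A minimal sketch of the CRPS for one ensemble forecast, using the standard ensemble estimator E|X - y| - 0.5 E|X - X'|; this is a common computational form, not necessarily how EVS evaluates the areas A and B above.

```python
import numpy as np

def crps_ensemble(members, observed):
    """Continuous Ranked Probability Score for one ensemble forecast, via the
    ensemble estimator E|X - y| - 0.5 * E|X - X'|.  Average the result over
    many forecasts; smaller is better."""
    x = np.asarray(members, dtype=float)
    y = float(observed)
    term_obs = np.mean(np.abs(x - y))                            # spread around the observation
    term_pairs = 0.5 * np.mean(np.abs(x[:, None] - x[None, :]))  # ensemble spread
    return float(term_obs - term_pairs)
```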


4. First look at the GUI

Rest of today

Two-hour lab sessions with EVS
• Start with synthetic data (with simple errors)
• Then move on to a couple of real cases

Verification plans and feedback
• Real-time ('prognostic') verification
• Screening verification outputs
• Developments in EVS
• Feedback: discussion and survey
