13
www.eia.gov U.S. Energy Information Administration Independent Statistics & Analysis Adding Relational Metadata to Statistical Data APIs April 27, 2015 | Washington, D.C. Mark Elbert Director, Office of Web Management [email protected]

Adding Relational Metadata to Statistical Data APIs€¦ · April 27, 2015 2 web Application Programming Interface Data Browser Electricity (Apr 2012) Coal (Feb 2014) deep data dives

  • Upload
    others

  • View
    2

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Adding Relational Metadata to Statistical Data APIs€¦ · April 27, 2015 2 web Application Programming Interface Data Browser Electricity (Apr 2012) Coal (Feb 2014) deep data dives

www.eia.gov U.S. Energy Information Administration Independent Statistics & Analysis

Adding Relational Metadata to Statistical

Data APIs

April 27, 2015 | Washington, D.C.

Mark Elbert

Director, Office of Web Management

[email protected]

Page 2: Adding Relational Metadata to Statistical Data APIs€¦ · April 27, 2015 2 web Application Programming Interface Data Browser Electricity (Apr 2012) Coal (Feb 2014) deep data dives

Evolution of EIA Data Publishing

Mark Elbert, Washington, D.C.,

April 27, 2015 2

web

Application Programming Interface

Data Browser Electricity (Apr 2012)

Coal (Feb 2014)

Electricity (Apr 2012)

State Energy Data System (Sept 2013)

Petroleum and Natural Gas (Oct 2013)

Annual Energy Outlook (May 2014)

STEO (Sept 2015)

Crude Oil Imports (Nov 2015)

Short Term Energy Outlook (May 2014)

Crude Oil Imports (Nov 2015)

People wanting

quick summaries

People wanting

deep data dives

Systems Applications &

websites

Bulk Data File System (Jan 2014)

Coal (Feb 2014)

Program office databases create products to post to the web

Page 3: Adding Relational Metadata to Statistical Data APIs€¦ · April 27, 2015 2 web Application Programming Interface Data Browser Electricity (Apr 2012) Coal (Feb 2014) deep data dives

Leveraging EIA’s data API

Mark Elbert, Washington, D.C.,

April 27, 2015 3

Chart widget:

Fetches data and create charts from series

IDs. Created Winter Heating Fuels page with

408 interactive charts in record time.

Excel add-in:

Game changer for analysts who live in

Excel.

Page 4: Adding Relational Metadata to Statistical Data APIs€¦ · April 27, 2015 2 web Application Programming Interface Data Browser Electricity (Apr 2012) Coal (Feb 2014) deep data dives

Mark Elbert, Washington, D.C.,

April 27, 2015 4

geoset

time series

relation

Page 5: Adding Relational Metadata to Statistical Data APIs€¦ · April 27, 2015 2 web Application Programming Interface Data Browser Electricity (Apr 2012) Coal (Feb 2014) deep data dives

Population

hypercube

example:

Mark Elbert, Washington, D.C.,

April 27, 2015 5

prime vertex

secondary vertex

tertiary vertex

lowest level

aggregate

male

female

nativ

e A

meric

an

age

race

sex

white

bla

ck

Asia

n

oth

er

multi ra

cia

l

Page 6: Adding Relational Metadata to Statistical Data APIs€¦ · April 27, 2015 2 web Application Programming Interface Data Browser Electricity (Apr 2012) Coal (Feb 2014) deep data dives

Population hypercube example (cont.):

Mark Elbert, Washington, D.C.,

April 27, 2015 6

1 prime vertex

13 secondary vertices

52 tertiary vertices

60 lowest level

aggregate

126 geo-sets (= 𝑙𝑒𝑛𝑔𝑡ℎ 𝑜𝑓 𝑑𝑖𝑚𝑒𝑛𝑠𝑖𝑜𝑛𝑖 )

Each geo-set (cell) may contains thousands of data series each. The

use of data series and geo-sets structures simplifies the task of

provided relational metadata for the hyper-cube.

Page 7: Adding Relational Metadata to Statistical Data APIs€¦ · April 27, 2015 2 web Application Programming Interface Data Browser Electricity (Apr 2012) Coal (Feb 2014) deep data dives

1D relationships and visualization:

Mark Elbert, Washington, D.C.,

April 27, 2015 7

nativ

e A

meric

an

age

race

sex

white

bla

ck

Asia

n

oth

er

multi ra

cia

l

Page 8: Adding Relational Metadata to Statistical Data APIs€¦ · April 27, 2015 2 web Application Programming Interface Data Browser Electricity (Apr 2012) Coal (Feb 2014) deep data dives

2D relationship and visualization:

Mark Elbert, Washington, D.C.,

April 27, 2015 8

male

female

age

sex

Page 9: Adding Relational Metadata to Statistical Data APIs€¦ · April 27, 2015 2 web Application Programming Interface Data Browser Electricity (Apr 2012) Coal (Feb 2014) deep data dives

nativ

e A

meric

an

age

race

sex

white

bla

ck

Asia

n

oth

er

multi ra

cia

l

Non-prime vertices have relationships too:

Mark Elbert, Washington, D.C.,

April 27, 2015 9

Page 10: Adding Relational Metadata to Statistical Data APIs€¦ · April 27, 2015 2 web Application Programming Interface Data Browser Electricity (Apr 2012) Coal (Feb 2014) deep data dives

It can get a little complicated…

Mark Elbert, Washington, D.C.,

April 27, 2015 10

Dimensions can be

hierarchical, creating

cubes within cubes…

Capturing this information allows government

data APIs to drive data visualizations with

dynamic drill-downs and data context

…but

Page 11: Adding Relational Metadata to Statistical Data APIs€¦ · April 27, 2015 2 web Application Programming Interface Data Browser Electricity (Apr 2012) Coal (Feb 2014) deep data dives

EIA data API visualization library

Mark Elbert, Washington, D.C.,

April 27, 2015 11

• Open source JavaScript library to be released on

github.com/eiagov

• Pre-release examples

– SEDS Fossil Fuel relation:

• Chart

• Map

• Relation

• Map-relation interactive

– Electricity by fuel

– Map supported

• US states

• US electricity regions

• PADD and sub-PADD

• world

Page 12: Adding Relational Metadata to Statistical Data APIs€¦ · April 27, 2015 2 web Application Programming Interface Data Browser Electricity (Apr 2012) Coal (Feb 2014) deep data dives

Effort and Release Dates

Mark Elbert, Washington, D.C., April

27, 2015 12

• Surprisingly simple to add to an existing API of time-series

– Less than 4 developer weeks in total

• Very few additional records needed (for EIA’s data)

– less than 1 geoset record for every 50 time series

– Less than 1 relation for every 100 time series

• Creates a very close relation between statistical provider and

consumer

• Expected launch date: May 2015

Page 13: Adding Relational Metadata to Statistical Data APIs€¦ · April 27, 2015 2 web Application Programming Interface Data Browser Electricity (Apr 2012) Coal (Feb 2014) deep data dives

Office of Communications

April 10, 2015 13

Questions?