26
TWC-SWQP: A Semantically- Enabled Provenance-Aware Water Quality Portal Ping Wang, Jin Guang Zheng, Linyun Fu, Evan W. Patton, Timothy Lebo, Li Ding, Joanne S. Luciano, Deborah L. McGuinness Tetherless World Constellation RPI

TWC-SWQP: A Semantically-Enabled Provenance-Aware Water Quality Portal

Embed Size (px)

DESCRIPTION

TWC-SWQP: A Semantically-Enabled Provenance-Aware Water Quality Portal. Ping Wang, Jin Guang Zheng, Linyun Fu, Evan W. Patton, Timothy Lebo, Li Ding, Joanne S. Luciano, Deborah L. McGuinness Tetherless World Constellation RPI. Outline. Introduction Data Sources Semantic Web Approach - PowerPoint PPT Presentation

Citation preview

Page 1: TWC-SWQP: A Semantically-Enabled Provenance-Aware Water Quality Portal

TWC-SWQP: A Semantically-Enabled Provenance-Aware Water

Quality Portal

Ping Wang, Jin Guang Zheng, Linyun Fu, Evan W. Patton, Timothy Lebo, Li Ding,

Joanne S. Luciano, Deborah L. McGuinnessTetherless World Constellation

RPI

Page 2: TWC-SWQP: A Semantically-Enabled Provenance-Aware Water Quality Portal

Outline

• Introduction

• Data Sources

• Semantic Web Approach

• Future Work

Page 3: TWC-SWQP: A Semantically-Enabled Provenance-Aware Water Quality Portal

Outline

• Introduction

• Data Sources

• Semantic Web Approach

• Future Work

Page 4: TWC-SWQP: A Semantically-Enabled Provenance-Aware Water Quality Portal

SWQP Overview

Page 5: TWC-SWQP: A Semantically-Enabled Provenance-Aware Water Quality Portal

Apply CA Regulation

Page 6: TWC-SWQP: A Semantically-Enabled Provenance-Aware Water Quality Portal

Retrieval by Characteristic

Page 7: TWC-SWQP: A Semantically-Enabled Provenance-Aware Water Quality Portal

Detailed polluting facility

Page 8: TWC-SWQP: A Semantically-Enabled Provenance-Aware Water Quality Portal

Provenance of water data

Page 9: TWC-SWQP: A Semantically-Enabled Provenance-Aware Water Quality Portal

Provenance of regulations

Page 10: TWC-SWQP: A Semantically-Enabled Provenance-Aware Water Quality Portal

Measurement Visualization

Page 11: TWC-SWQP: A Semantically-Enabled Provenance-Aware Water Quality Portal

Outline

• Introduction

• Data Sources

• Semantic Web Approach

• Future Work

Page 12: TWC-SWQP: A Semantically-Enabled Provenance-Aware Water Quality Portal

Data Sources

Data Type Data Source

Water Quality Data EPA Enforcement & Compliance History Online (ECHO) Database

USGS National Water Information System (NWIS) Water-Quality Web Services

Water Quality Regulation

EPA (National Water Regulation)

California Code of Regulations

Massachusetts Department of Environmental Protection

New York Department of Health

State of Rhode Island Department of Environmental Management

Page 13: TWC-SWQP: A Semantically-Enabled Provenance-Aware Water Quality Portal

Outline

• Introduction

• Data Sources

• Semantic Web Approach

• Future Work

Page 14: TWC-SWQP: A Semantically-Enabled Provenance-Aware Water Quality Portal

Domain Knowledge Modeling

• Core ontology design1

1 http://purl.org/twc/ontology/swqp/core

Page 15: TWC-SWQP: A Semantically-Enabled Provenance-Aware Water Quality Portal

Domain Knowledge Modeling

• Regulation ontology design2

2e.g., http://purl.org/twc/ontology/swqp/region/ny and http://purl.org/twc/ontology/swqp/region/ri; others are listed at http://purl.org/twc/ontology/swqp/region/

Page 16: TWC-SWQP: A Semantically-Enabled Provenance-Aware Water Quality Portal

Reasoning Domain Data with Regulations

• Combining the water measurement data, the core and regulation ontologies, a reasoner can decide if a water body is polluted using OWL2 classification.

Benefits

The core ontology is small: 18 classes, 4 object properties, and 10 data properties.

The ontology component can be easily extended to incorporate more regulations

Flexible querying and reasoning: the user can select the regulation to apply

Page 17: TWC-SWQP: A Semantically-Enabled Provenance-Aware Water Quality Portal

Data Integration

• We used the open source tool csv2rdf4lod3,4.– Linking ontological terms– Aligning instance references– Converting complex objects

C1_VALUE C1_UNIT C2_VALUE C2_UNIT

34.07 MPN/100ML 53.83 MPN/100ML

3 Lebo, T., Williams, G.T., 2010. Converting governmental datasets into linked data. Proceedings of the 6th International Conference on Semantic Systems, I-SEMANTICS ’10, pp. 38:1–38:3.4 http://purl.org/twc/id/software/csv2rdf4lod

Page 18: TWC-SWQP: A Semantically-Enabled Provenance-Aware Water Quality Portal

Provenance Support

• Provenance Capture

• Provenance Usage– Data Source Widget– Data Trace Visualization

Page 19: TWC-SWQP: A Semantically-Enabled Provenance-Aware Water Quality Portal

Water Data Provenance Capture

Integration State Provenance Script

Retrieval source URL, modification time,inference engine, inference rule,involved actor

purl.sh

Adjust antecedent data, modification timeinference engine, inference rule,involved actor

punzip.shjustify.sh

Convert antecedent data, invocation time, inference engine, interpretation rule

convert*.sh (conversion trigger)

Publish URL of published dump file, publish time, involved actor

publish.sh

Page 20: TWC-SWQP: A Semantically-Enabled Provenance-Aware Water Quality Portal

Water Regulation Provenance Capture

See complete table at http://tw.rpi.edu/web/project/TWC-SWQP/compare_five_regulation

Page 21: TWC-SWQP: A Semantically-Enabled Provenance-Aware Water Quality Portal

Water Regulation Provenance Capture

See complete table at http://tw.rpi.edu/web/project/TWC-SWQP/compare_five_regulation

Page 22: TWC-SWQP: A Semantically-Enabled Provenance-Aware Water Quality Portal

Data Source Widget

Input URL of SPARQL endpoint and (optional) list of its named graphs, and name of the SimpleNamedGraphSourceGraph instance

Output SimpleNamedGraphSourceGraph instance filled with simple descriptions of the source organizations responsible for the data

Process Walk a big provenance graph for each named graph and abstracts it into one triple: <data_1> dct:source <source_1>

Page 23: TWC-SWQP: A Semantically-Enabled Provenance-Aware Water Quality Portal

Data Source Widget

• Usage

• Presentation of the data sources on the interface

• Source based data retrieval

Page 24: TWC-SWQP: A Semantically-Enabled Provenance-Aware Water Quality Portal

Provenance Visualization

Page 25: TWC-SWQP: A Semantically-Enabled Provenance-Aware Water Quality Portal

Future Work

• Convert data and encode the regulations for the remaining states

• Linking to Health Domain

• Utilize data from other sources, e.g. weather and flood forecasts

• Apply this architecture to other applications, e.g. the Clean Air Status and Trends demo5

5 http://logd.tw.rpi.edu/demo/clean_air_status_and_trends_-_ozone

Page 26: TWC-SWQP: A Semantically-Enabled Provenance-Aware Water Quality Portal

• Thank you!