32
CSIG 08 Cyberinfrastructure Summer Institute for Geoscientists August 11-15, 2008 San Diego 1

CSIG 08 Cyberinfrastructure Summer Institute for Geoscientists August 11-15, 2008 San Diego 1

Embed Size (px)

Citation preview

Page 1: CSIG 08 Cyberinfrastructure Summer Institute for Geoscientists August 11-15, 2008 San Diego 1

CSIG 08Cyberinfrastructure Summer

Institute for Geoscientists

August 11-15, 2008

San Diego

1

Page 2: CSIG 08 Cyberinfrastructure Summer Institute for Geoscientists August 11-15, 2008 San Diego 1

WELCOME !

2

Page 3: CSIG 08 Cyberinfrastructure Summer Institute for Geoscientists August 11-15, 2008 San Diego 1

AcknowledgementsCSIG’08 Speakers/Instructors

• Margaret Smeekens• Prof. Randy Keller, University

of Oklahoma• Dr. Don Middleton, NCAR• Dr. Peter Fox, NCAR• Dr. Deborah Kilb, SIO/UCSD

• Stuart Weir, UNAVCO• Prof. Falko Kuester, UCSD• Dr. David Nadeau,

SDSC/UCSD• John Moreland, SDSC/UCSD

• Prof. Ann Gates, UT El Paso• Dr. Sriram Krishnan, SDSC• Dr. Kai Lin, SDSC• Ashraf Memon, SDSC• Dr. Ilya Zaslavsky, SDSC• Tom Whitenack, SDSC

• Dr. Phil Maechlin, SCEC/USC• Prof. Ramon Arrowsmith, ASU• Chris Crosby, SDSC• Ilkay Altintas, SDSC

3

Page 4: CSIG 08 Cyberinfrastructure Summer Institute for Geoscientists August 11-15, 2008 San Diego 1

AcknowledgementsGEON Team

• Margaret Banton

• Sandeep Chandra

• Chris Crosby

• Kai Lin

• Ashraf Memon

• John Moreland

• David Nadeau

• Viswanath Nandigam

• Choonhan Youn

• Randy Keller

• Brad Wallet

• Ramon Arrowsmith

• Charles Meertens

• Ann Gates

4

Page 5: CSIG 08 Cyberinfrastructure Summer Institute for Geoscientists August 11-15, 2008 San Diego 1

5

Acknowledgements• National Science Foundation

• CSIG has been funded each year as a supplement to GEON, since 2004(has already been funded for 2009)

Page 6: CSIG 08 Cyberinfrastructure Summer Institute for Geoscientists August 11-15, 2008 San Diego 1

6

Schedule• Monday – Introduction, Scientific Challenges, Examples of

Cyberinfrastructure from other projects/sciences

• Tuesday – Visualization Frameworks (and data issues)

• Wednesday – Service-Oriented Architecture and Web Services

• Thursday – GEON LiDAR Workflow (GLW)

• Friday – Related CI Resources, Tools, and Technologies

Page 7: CSIG 08 Cyberinfrastructure Summer Institute for Geoscientists August 11-15, 2008 San Diego 1

7

LOGISTICS

• Webcasting and video archives

Page 8: CSIG 08 Cyberinfrastructure Summer Institute for Geoscientists August 11-15, 2008 San Diego 1

8

INTRODUCTIONS

Page 9: CSIG 08 Cyberinfrastructure Summer Institute for Geoscientists August 11-15, 2008 San Diego 1

9

Enabling Discoveries in the Earth Sciences Through GEON

Chaitan Baru

SDSC/UCSD

Page 10: CSIG 08 Cyberinfrastructure Summer Institute for Geoscientists August 11-15, 2008 San Diego 1

A Vision for Geoinformatics

From the

NSF Workshop on Envisioning a National Geoinformatics System for the United States

Denver, March 2007

“…a future in which someone can sit at a terminal and have easy access to vast stores of data of almost any kind, with the easy ability to visualize, analyze and model those data.”

10

Page 11: CSIG 08 Cyberinfrastructure Summer Institute for Geoscientists August 11-15, 2008 San Diego 1

An Example: The GEON Portalportal.geongrid.org

• GEON Search

• Workbench

• Dynamic map services, map integration

11

Page 12: CSIG 08 Cyberinfrastructure Summer Institute for Geoscientists August 11-15, 2008 San Diego 1

Geologic Map IntegrationGeologic Map Integration

+/- a few hundred million years

Page 13: CSIG 08 Cyberinfrastructure Summer Institute for Geoscientists August 11-15, 2008 San Diego 1

GEON Portal

• Paleo database integration

• LiDAR data access and data processing

• SYNSEIS: Online access to computational modeling system

• Gravity and Magnetic database for US

13

Page 14: CSIG 08 Cyberinfrastructure Summer Institute for Geoscientists August 11-15, 2008 San Diego 1

Other Data Portal Examples

• EarthScope Data Portal

• Hydrologic Information System (HIS)– Data Access System for Hydrology (DASH)

14

Page 15: CSIG 08 Cyberinfrastructure Summer Institute for Geoscientists August 11-15, 2008 San Diego 1

Integrated Cyberinfrastructure System Source: Dr. Deborah Crawford, Chair, NSF CI Working Committee

15

Hardware

Middleware Services

DevelopmentTools & Libraries

Application Domains• Geosciences, Engineering, Environmental Sciences, Physics, Astronomy, Archaeology, Neurosciences, Biomedicine, …

Domain-specific Cybertools (software)

Domain-specific Cybertools (software)

Shared Cybertools (software)

Shared Cybertools (software)

Distributed Resources (computation, storage, communication, etc.)

Distributed Resources (computation, storage, communication, etc.)

Ed

uca

tion a

nd

Tra

inin

g

Dis

covery

& In

novati

on

Page 16: CSIG 08 Cyberinfrastructure Summer Institute for Geoscientists August 11-15, 2008 San Diego 1

16

Cyberinfrastructure• From NSF’s Cyberinfrastructure Vision for 21st

Century Discovery, www.nsf.gov/od/oci/ci-v7.pdf, July 20, 2006

“The comprehensive infrastructure needed to capitalize on dramatic advances in information technology has been termed cyberinfrastructure. Cyberinfrastructure integrates hardware for computing, data and networks, digitally-enabled sensors, observatories and experimental facilities, and an interoperable suite of software and middleware services and tools. Investments in interdisciplinary teams and cyberinfrastructure professionals with expertise in algorithm development, system operations, and applications development are also essential to exploit the full power of cyberinfrastructure to create, disseminate, and preserve scientific data, information, and knowledge…”

• pp40 of the report:“In 1999, the PITAC released the seminal report ITR-Investing in our Future, prompting new and complementary NSF investments in CI projects, such as the Grid Physics Network (GriPhyN) and international Virtual Data Grid Laboratory (iVDGL) and the Geosciences Network, known as GEON.”

Page 17: CSIG 08 Cyberinfrastructure Summer Institute for Geoscientists August 11-15, 2008 San Diego 1

Friendly Work-Facilitating Portals

Authentication - Authorization – Auditing - Resource Discovery - Workflows - Visualization - Analysis

17

Community Cyberinfrastructure Projects

Middleware Services

DevelopmentTools & Libraries

Distributed Computing, Instruments and Data Resources

Bio

med

ical

In

form

atic

s (B

IRN

)

Hig

h E

neg

y P

hys

ics

(Gri

Ph

yN)

Geo

scie

nce

s (G

EO

N)

Eco

log

ical

Ob

serv

ato

ries

(N

EO

N)

Ear

thq

uak

e E

ng

inee

rin

g (

NE

ES

)

Oce

an O

bse

rvin

g (

OR

ION

)

Hardware

Source: Prof. Mark Ellisman, UC San Diego

Shared Tools ScienceDomains

Shared Tools ScienceDomains

Your Specific Tools

& User Apps.

Your Specific Tools

& User Apps.

Page 18: CSIG 08 Cyberinfrastructure Summer Institute for Geoscientists August 11-15, 2008 San Diego 1

Portal-based Science EnvironmentsSupport for resource sharing and collaborations

Page 19: CSIG 08 Cyberinfrastructure Summer Institute for Geoscientists August 11-15, 2008 San Diego 1

Virtual Organizations

• Multiple participants

• Distributed sites

• Participants are from different “administrative domains”

• Policies, rules, systems of the VO may be different than those of the participating organizations

19

Page 20: CSIG 08 Cyberinfrastructure Summer Institute for Geoscientists August 11-15, 2008 San Diego 1

GEON Background

• Initiated in 2002 as a 5-year NSF ITR (IT Research) project

• Collaboration among 12 PI institutions and number of other organizations

• Distributed network of GEON “nodes”– Provides a standardized software platform– Provides a machine outside the local environment (for hosting data,

software tools, and applications for remote access– Can be centrally administered

• Funded now under the NSF Earth Sciences (EAR) Geoinformatics program

20

Page 21: CSIG 08 Cyberinfrastructure Summer Institute for Geoscientists August 11-15, 2008 San Diego 1

21

GeoinformaticsFrom David Lambert, NSF EAR/GEO

Presentation at GEON Annual Meeting, 2005

Page 22: CSIG 08 Cyberinfrastructure Summer Institute for Geoscientists August 11-15, 2008 San Diego 1

Other Geoinformatics Efforts• OneGeology.org

– International initiative of geological surveys to create dynamic geological map data available via the web.

• USGS initiative– Presentation by Dr. Linda

Gundersen, at Geoinformatics 2007, San Diego.

22

Page 23: CSIG 08 Cyberinfrastructure Summer Institute for Geoscientists August 11-15, 2008 San Diego 1

USGS Role in USGS Role in GeoinformaticsGeoinformatics

Fundamental: develop, maintain, make accessible:Fundamental: develop, maintain, make accessible: Long-term national and regional geologic, Long-term national and regional geologic,

hydrologic, biologic, and geographic databaseshydrologic, biologic, and geographic databases Earth and planetary imagery Earth and planetary imagery Open-source models of the complex natural Open-source models of the complex natural

systems and human interaction with that systemsystems and human interaction with that system Physical collections of earth materials, biologic Physical collections of earth materials, biologic

materials, reference standards, geophysical materials, reference standards, geophysical recordings, paper records.recordings, paper records.

National geologic, biologic, hydrologic, and National geologic, biologic, hydrologic, and geographic monitoring systems geographic monitoring systems

Standards of practice for the geologic, hydrologic, Standards of practice for the geologic, hydrologic, biologic, and geographic sciencesbiologic, and geographic sciences

Source: Presentation by Dr. Linda Gundersen, USGS, at Geoinformatics 2007, San Diego, CA.

Page 24: CSIG 08 Cyberinfrastructure Summer Institute for Geoscientists August 11-15, 2008 San Diego 1

USGS Role in USGS Role in GeoinfomaticsGeoinfomatics

All activities: Data creation, modeling, All activities: Data creation, modeling, monitoring, collections, standards etc. monitoring, collections, standards etc. Must be done in cooperation and Must be done in cooperation and collaboration with the public and collaboration with the public and governmental, academic, and private governmental, academic, and private sector partners and stakeholders.sector partners and stakeholders.

A critical USGS role: A critical USGS role: facilitate bringing communities facilitate bringing communities

together!together!Source: Presentation by Dr. Linda Gundersen, USGS, at Geoinformatics

2007, San Diego, CA.

Page 25: CSIG 08 Cyberinfrastructure Summer Institute for Geoscientists August 11-15, 2008 San Diego 1

Data Collections versus Data Collections versus Communities of PracticeCommunities of Practice

Geoinformatics must evolve beyond the Geoinformatics must evolve beyond the accumulation of data, models, and standards accumulation of data, models, and standards to become the framework for a to become the framework for a community community of practiceof practice in the natural sciences. in the natural sciences.

Etienne Wegner and Jean Lave coined the Etienne Wegner and Jean Lave coined the term and developed the learning theory of term and developed the learning theory of communities of practice – that we learn not communities of practice – that we learn not only as individuals but as communities. By only as individuals but as communities. By engaging in communities of practice we engaging in communities of practice we increase our capacity and innovation as well increase our capacity and innovation as well as leverage our support for areas of interest. as leverage our support for areas of interest.

Source: Presentation by Dr. Linda Gundersen, USGS, at Geoinformatics 2007, San Diego, CA.

Page 26: CSIG 08 Cyberinfrastructure Summer Institute for Geoscientists August 11-15, 2008 San Diego 1

Creativity, Learning, and Creativity, Learning, and InnovationInnovation

A community of practice is not merely a A community of practice is not merely a community with a common interest. But community with a common interest. But are practitioners who are practitioners who share experiences share experiences and learn from each otherand learn from each other. They develop a . They develop a shared repertoireshared repertoire of resources: experiences, of resources: experiences, stories, tools, vocabularies, ways of stories, tools, vocabularies, ways of addressing recurring problems. This takes addressing recurring problems. This takes time and sustained interactiontime and sustained interaction. Standards . Standards of practice and reference materials will of practice and reference materials will grow out of this. grow out of this. But the critical benefits But the critical benefits include: creating and sustaining include: creating and sustaining knowledge, leveraging of resources, and knowledge, leveraging of resources, and rapid learning and innovation.rapid learning and innovation.

Source: Presentation by Dr. Linda Gundersen, USGS, at Geoinformatics 2007, San Diego, CA.

Page 27: CSIG 08 Cyberinfrastructure Summer Institute for Geoscientists August 11-15, 2008 San Diego 1

1000’s of National and 1000’s of National and Regional DatabasesRegional Databases

The National Map – topographic, The National Map – topographic, elevation, orthoimagery, elevation, orthoimagery, transportation hydrography etc.transportation hydrography etc.

Geospatial One Stop-portalGeospatial One Stop-portal MRDATA – Mineral Resources and MRDATA – Mineral Resources and

Related DataRelated Data The National Geologic Map Database The National Geologic Map Database

stnadardized community collection of stnadardized community collection of geologic mappinggeologic mapping

National Water Information System - National Water Information System - NWISWebNWISWeb

National Geochemical Survey National Geochemical Survey Database (PLUTO, NURE)Database (PLUTO, NURE)

National Geophysical Database National Geophysical Database (aeromag, gravity, aerorad)(aeromag, gravity, aerorad)

Earthquake CatalogsEarthquake Catalogs North American Breeding Bird SurveyNorth American Breeding Bird Survey National Vegetation/speciation mapsNational Vegetation/speciation maps National Oil and Gas AssessmentNational Oil and Gas Assessment National Coal Quality InventoryNational Coal Quality Inventory

Source: Presentation by Dr. Linda Gundersen, USGS, at Geoinformatics 2007, San Diego, CA.

Page 28: CSIG 08 Cyberinfrastructure Summer Institute for Geoscientists August 11-15, 2008 San Diego 1

Geoscience Information Network

28

Page 29: CSIG 08 Cyberinfrastructure Summer Institute for Geoscientists August 11-15, 2008 San Diego 1

A Use Case for GEON

29

• A user request of the form:

“For a given region (i.e. lat/long extent, plus depth), return a 3D structural model with accompanying physical parameters of density, seismic velocities, geochemistry, and geologic ages, using a cell size of 10km”

Page 30: CSIG 08 Cyberinfrastructure Summer Institute for Geoscientists August 11-15, 2008 San Diego 1

Interoperability

• Data Interoperability– Ability to discover, access, integrate data sets– “Third-party”, heterogeneous, remote data

• Software Interoperability– “Software as a service”– Ability to discover services– Ability to link data with services, and

“orchestrate” services

30

Page 31: CSIG 08 Cyberinfrastructure Summer Institute for Geoscientists August 11-15, 2008 San Diego 1

Data interoperability onion

31

Social NetworksSemantics

SyntaxSystems

Social NetworksSemantics

SyntaxSystems

• System Interop– Approaches: e.g., ODBC, JDBC, Java, Web services, …

– Purview of: Computer Science

• Syntactic– Approaches: Schema standards

– Purview of: Standards organizations, domain science

repositories, data archives

• Semantic– Approaches: Controlled vocabularies, thesaurii, domain ontologies

– Purview of: Domain scientists

• Social Networks– Approaches: recommendation systems

– Purview of: social networking software (CS and domain science, data driven)

Page 32: CSIG 08 Cyberinfrastructure Summer Institute for Geoscientists August 11-15, 2008 San Diego 1

Software interoperability onion

32

Social NetworksSemantics

SyntaxSystems

• System Interop– Approaches: e.g., REST, Web services

• Syntactic– Approaches: e.g., SOAP, WSDL

• Semantic– Approaches: Controlled vocabularies, thesaurii, domain ontologies

– Purview of: Domain scientists

• Social Networks– Approaches: recommendation systems

– Purview of: social networking software

• Service orchestration via worflow systems