University of Virginia ORNL NeSSI Workshop

Computational Infrastructures for Science

Marty Humphrey
Assistant Professor
Computer Science Department
University of Virginia

NeSSI Workshop
October 13, 2003

“Traditional” Computational Science

• SP3, O2K, Linux clusters, etc.
• PBS, LSF, LoadLeveler, etc.
• Archival storage
• MPI
• Viz
• SSH, SCP

Grid Definition (Foster and Kesselman)

• “Coordinates resources that are not subject to centralized control….”

• “Using standard, open, general-purpose protocols and interfaces…”

• “To deliver non-trivial qualities of service.”

Grid Computing

[Figure: a Grid “Operating System” layer spanning Host/OS 1,1, Host/OS 2,1, and Host/OS 3,1]

Grid User Wish-List

• Who cares where it is?
  – It must always be available when I need it
• Make it secure
  – no one can steal my data
  – no one can pretend to be me
  – don’t tell me who I will/can trust
• Choose secure, fast, cheap resources
• Give me reasonable quality of service
• Don’t make me manually move/copy stuff around
• Don’t make me learn a new OS
  – Allow me to run my existing apps
• I don’t want errors
  – If errors occur, tell me in plain English how I can avoid them next time
• Allow me to more easily collaborate

Darnit, make my life easier!

Example: Transparent Remote Execution

• User initiates “run”
• User/Grid SW selects site/resource
• Grid SW copies binaries (if necessary)
• Grid SW copies/moves input files
• Grid SW starts job(s)
• Grid SW monitors progress
• Grid SW copies output files

Forms the basis of parameter-space studies and Monte Carlo runs
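
The steps above can be sketched as a small in-memory simulation. This is not Grid software: a "site" is just a Python object whose `files` dictionary stands in for its disk, "copying" is a dictionary update, and the "binary" is a Python function. Every name here (`Site`, `select_site`, `run_remote`, `square`, `sweep`) is hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional


@dataclass
class Site:
    """Hypothetical stand-in for a remote resource; `files` models its disk."""
    name: str
    load: float                                   # fraction of the site that is busy
    files: Dict[str, str] = field(default_factory=dict)
    binaries: Dict[str, Callable] = field(default_factory=dict)


def select_site(sites: List[Site]) -> Site:
    # Site selection policy is an assumption: pick the least-loaded site.
    return min(sites, key=lambda s: s.load)


def run_remote(binary_name: str, binary: Callable, inputs: Dict[str, str],
               sites: List[Site], log: Optional[List[str]] = None) -> Dict[str, str]:
    """Run `binary` on a selected site and return its output files."""
    log = log if log is not None else []
    site = select_site(sites)                     # select site/resource
    log.append(f"selected {site.name}")
    if binary_name not in site.binaries:          # copy binaries (if necessary)
        site.binaries[binary_name] = binary
        log.append("copied binary")
    site.files.update(inputs)                     # copy/move input files
    log.append(f"copied {len(inputs)} input file(s)")
    log.append("started job")                     # start job
    outputs = site.binaries[binary_name](site.files)
    log.append("job finished")                    # monitoring, reduced to one event
    log.append(f"copied {len(outputs)} output file(s)")
    return dict(outputs)                          # copy output files back


def square(files: Dict[str, str]) -> Dict[str, str]:
    # Toy "application": reads x.txt, writes y.txt.
    x = float(files["x.txt"])
    return {"y.txt": str(x * x)}


def sweep(values, sites: List[Site]) -> Dict[float, float]:
    """Parameter-space study: one remote run per parameter value."""
    results = {}
    for v in values:
        out = run_remote("square", square, {"x.txt": str(v)}, sites)
        results[v] = float(out["y.txt"])
    return results
```

A parameter sweep is then just a loop over `run_remote`, which is why transparent remote execution serves as the building block for parameter-space and Monte Carlo studies. In this sketch the binary is staged only on the first run at a given site.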

Web Interface for CHARMM and Amber

Status of CHARMM or Amber Run

Grid Focus: Virtual Organizations

• Logical grouping of resources and users
• Support community-specific discovery
• Specialized “views”
• Dynamic collaborations of individuals and institutions
• Policy negotiation and enforcement will be key issues looking forward
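
One way to picture a Virtual Organization is as a data structure that groups members and contributed resources and filters discovery through a policy. The sketch below is purely illustrative: the classes `Resource` and `VirtualOrganization`, the `discover` method, and the policy representation (resource name mapped to the set of members allowed to use it) are all assumptions, not the design of any Grid toolkit.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Set


@dataclass
class Resource:
    name: str
    owner: str      # institution that contributes the resource
    kind: str       # e.g. "cluster" or "storage"


@dataclass
class VirtualOrganization:
    """Hypothetical VO: a logical grouping of users and resources plus a policy."""
    name: str
    members: Set[str] = field(default_factory=set)
    resources: List[Resource] = field(default_factory=list)
    # Assumed policy shape: resource name -> members allowed to use it.
    policy: Dict[str, Set[str]] = field(default_factory=dict)

    def discover(self, user: str, kind: Optional[str] = None) -> List[Resource]:
        """Return the user's "view": resources the policy lets this member see."""
        if user not in self.members:
            return []
        return [r for r in self.resources
                if user in self.policy.get(r.name, set())
                and (kind is None or r.kind == kind)]
```

Two members of the same VO can therefore see different resource lists, which is one simple reading of "specialized views" combined with policy enforcement.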

Grid Landscape Today: Globus

• Grid Resource Allocation and Management (GRAM)
  – Gatekeeper, Jobmanager (RSL “schedulerspeak”)
• Grid Security Infrastructure (GSI)
• Metacomputing Directory Service (MDS) (via OpenLDAP)
  – Grid Index Information Service (GIIS)
  – Grid Resource Information Service (GRIS)
• GridFTP
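
To give a feel for the RSL "schedulerspeak" mentioned above: a job request is written as a conjunction of `(attribute=value)` relations prefixed with `&`. The helper below is hypothetical (it is not part of Globus) and only formats a string. The attribute names used in the usage note (`executable`, `arguments`, `count`) are common in GRAM job descriptions, but which attributes a given installation accepts, and the finer points of quoting, should be checked against the RSL specification for the Globus version in use.

```python
def to_rsl(job: dict) -> str:
    """Format a dict as an RSL-style conjunction: &(attr=value)(attr=value)...

    Illustrative only: quoting here is a simplified assumption, not a full
    implementation of the RSL grammar.
    """
    def quote(value) -> str:
        text = str(value)
        # Quote values that contain spaces or RSL punctuation; an embedded
        # double quote is written as two double quotes.
        if any(ch in text for ch in ' ()=&|+"'):
            return '"' + text.replace('"', '""') + '"'
        return text

    parts = []
    for attr, value in job.items():
        values = value if isinstance(value, (list, tuple)) else [value]
        parts.append(f"({attr}=" + " ".join(quote(v) for v in values) + ")")
    return "&" + "".join(parts)
```

For example, `to_rsl({"executable": "/bin/hostname", "count": 4})` produces the string `&(executable=/bin/hostname)(count=4)`.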

Grid Landscape Today: Globus (cont.)

• “Add-ons”:
  – MPICH-G2
  – Replica Catalog and Management
  – Community Authorization Service (CAS)
  – Condor-G
  – etc.

• Basis of many large-scale Grids…

Selected Major Grid Projects (Oct 2001)

Name | URL & Sponsors | Focus
Access Grid | www.mcs.anl.gov/FL/accessgrid; DOE, NSF | Create & deploy group collaboration systems using commodity technologies
BlueGrid | IBM | Grid testbed linking IBM laboratories
DISCOM | www.cs.sandia.gov/discom; DOE Defense Programs | Create operational Grid providing access to resources at three U.S. DOE weapons laboratories
DOE Science Grid | sciencegrid.org; DOE Office of Science | Create operational Grid providing access to resources & applications at U.S. DOE science laboratories & partner universities
Earth System Grid (ESG) | earthsystemgrid.org; DOE Office of Science | Delivery and analysis of large climate model datasets for the climate research community
European Union (EU) DataGrid | eu-datagrid.org; European Union | Create & apply an operational grid for applications in high energy physics, environmental science, bioinformatics

Selected Major Grid Projects (cont.)

Name | URL/Sponsor | Focus
EuroGrid, Grid Interoperability (GRIP) | eurogrid.org; European Union | Create tech for remote access to supercomp resources & simulation codes; in GRIP, integrate with Globus Toolkit™
Fusion Collaboratory | fusiongrid.org; DOE Off. Science | Create a national computational collaboratory for fusion research
Globus Project™ | globus.org; DARPA, DOE, NSF, NASA, Msoft | Research on Grid technologies; development and support of Globus Toolkit™; application and deployment
GridLab | gridlab.org; European Union | Grid technologies and applications
GridPP | gridpp.ac.uk; U.K. eScience | Create & apply an operational grid within the U.K. for particle physics research
Grid Research Integration Dev. & Support Center | grids-center.org; NSF | Integration, deployment, support of the NSF Middleware Infrastructure for research & education

Selected Major Grid Projects (cont.)

Name | URL/Sponsor | Focus
Grid Application Dev. Software | hipersoft.rice.edu/grads; NSF | Research into program development technologies for Grid applications
Grid Physics Network | griphyn.org; NSF | Technology R&D for data analysis in physics expts: ATLAS, CMS, LIGO, SDSS
Information Power Grid | ipg.nasa.gov; NASA | Create and apply a production Grid for aerosciences and other NASA missions
International Virtual Data Grid Laboratory | ivdgl.org; NSF | Create international Data Grid to enable large-scale experimentation on Grid technologies & applications
Network for Earthquake Eng. Simulation Grid | neesgrid.org; NSF | Create and apply a production Grid for earthquake engineering
Particle Physics Data Grid | ppdg.net; DOE Science | Create and apply production Grids for data analysis in high energy and nuclear physics experiments

Selected Major Grid Projects (cont.)

Name | URL/Sponsor | Focus
TeraGrid | teragrid.org; NSF | U.S. science infrastructure linking four major resource sites at 40 Gb/s
UK Grid Support Center | grid-support.ac.uk; U.K. eScience | Support center for Grid projects within the U.K.
Unicore | BMBFT | Technologies for remote access to supercomputers

PetaScale Virtual-Data Grids

[Figure: PetaScale Virtual-Data Grid architecture, at a scale of ~1 Petaflop and ~100 Petabytes. Labels in the figure:]

• Users: Production Team, Individual Investigator, Workgroups
• Interactive User Tools
• Virtual Data Tools
• Request Planning & Scheduling Tools
• Request Execution & Management Tools
• Resource Management Services
• Security and Policy Services
• Other Grid Services
• Transforms
• Raw data source
• Distributed resources (code, storage, CPUs, networks)

Slide courtesy of Paul Avery

Data Grid Architecture

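[Figure: Data Grid architecture. Labels in the figure:]

• Components: Application, Planner, Executor, Catalog Services, Info Services, Policy/Security, Monitoring, Repl. Mgmt., Reliable Transfer Service, Compute Resource, Storage Resource
• Two arrows labeled “DAG”
• Technologies named alongside the components: DAGMAN, Kangaroo; GRAM; GridFTP, GRAM, SRM; GSI, CAS; MDS; MCAT, GriPhyN catalogs; GDMP; Globus

Slide courtesy of Ian Foster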
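
The figure passes work between components as a DAG (directed acyclic graph) of jobs and names DAGMan among the technologies. As a minimal sketch of what executing such a DAG involves, the function below runs each job only after all of its prerequisites have finished. It uses Python's standard `graphlib` module (Python 3.9 or later), not DAGMan; the function name `execute_dag` and the job names in the usage note are hypothetical.

```python
from graphlib import TopologicalSorter


def execute_dag(dependencies, run):
    """Run every job after all of its prerequisites; return the run order.

    `dependencies` maps each job to the set of jobs it depends on.
    `run` is called once per job. Jobs run sequentially here; a real
    executor would dispatch ready jobs to resources concurrently.
    """
    order = []
    sorter = TopologicalSorter(dependencies)
    sorter.prepare()                        # raises CycleError if the graph has a cycle
    while sorter.is_active():
        for job in sorter.get_ready():      # jobs whose prerequisites are all done
            run(job)
            order.append(job)
            sorter.done(job)                # unlocks jobs that depend on this one
    return order
```

For instance, with `{"analyze": {"stage_in"}, "stage_out": {"analyze"}}`, the job `stage_in` runs first, then `analyze`, then `stage_out`.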

US-iVDGL Data Grid

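[Figure: map of US-iVDGL Data Grid sites, with a legend distinguishing Tier1, Tier2, and Tier3 sites. Labels in the figure:]

• Sites: UF, Wisconsin, Fermilab, BNL, Indiana, Boston U, SKC, Brownsville, Hampton, PSU, J. Hopkins, Caltech, FIU, FSU, Arlington, Michigan, LBL, Oklahoma, Argonne, Vanderbilt, UCSD/SDSC, NCSA
• Partners? EU, CERN, Brazil, Australia, Korea, Japan

Slide courtesy of Paul Avery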

Data Grids for High Energy Physics

[Figure: tiered data grid for high energy physics. Image courtesy Harvey Newman, Caltech. Labels in the figure:]

• Components: Online System; Offline Processor Farm, ~20 TIPS; CERN Computer Centre; FermiLab, ~4 TIPS; France, Italy, and Germany Regional Centres; Tier2 Centres, ~1 TIPS each; Caltech, ~1 TIPS; Institutes, ~0.25 TIPS; Physics data cache; Physicist workstations
• Tier labels: Tier 0, Tier 1, Tier 2, Tier 4
• Link rates: ~PBytes/sec; ~100 MBytes/sec; ~622 Mbits/sec; ~622 Mbits/sec or Air Freight (deprecated); ~1 MBytes/sec

• There is a “bunch crossing” every 25 nsecs.
• There are 100 “triggers” per second
• Each triggered event is ~1 MByte in size
• Physicists work on analysis “channels”.
• Each institute will have ~10 physicists working on one or more channels; data for these channels should be cached by the institute server
• 1 TIPS is approximately 25,000 SpecInt95 equivalents
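
The numbers on this slide fit together with a little arithmetic. The sketch below uses only the figures quoted above; the variable and function names are made up for illustration, and the link comparison ignores protocol overhead.

```python
# Figures quoted on the slide
BUNCH_SPACING_S = 25e-9        # one bunch crossing every 25 ns
TRIGGERS_PER_S = 100           # triggered events per second
EVENT_SIZE_MB = 1.0            # ~1 MByte per triggered event
SPECINT95_PER_TIPS = 25_000    # 1 TIPS is about 25,000 SpecInt95 equivalents

# About 40 million bunch crossings per second.
crossing_rate_hz = 1 / BUNCH_SPACING_S

# Fraction of crossings that survive the trigger: about 2.5 in a million.
kept_fraction = TRIGGERS_PER_S / crossing_rate_hz

# Triggered data rate: 100 events/s x 1 MByte = 100 MBytes/s,
# matching the ~100 MBytes/sec label in the figure.
raw_rate_mb_s = TRIGGERS_PER_S * EVENT_SIZE_MB

# A 622 Mbit/s link expressed in MBytes/s (8 bits per byte): 77.75 MBytes/s,
# i.e. less than the full triggered data rate.
link_mb_s = 622 / 8


def tips_to_specint95(tips: float) -> float:
    """Convert TIPS to SpecInt95 equivalents using the slide's ratio."""
    return tips * SPECINT95_PER_TIPS
```

By the same ratio, the ~20 TIPS offline farm corresponds to roughly 500,000 SpecInt95 equivalents, and a ~0.25 TIPS institute to roughly 6,250.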

Global Grid Forum (GGF)

• Grid standards
• Best practices
• Broad academic, national lab, and industry involvement
• Areas: Applications and programming environments, architecture, data, information systems and performance, Peer-to-Peer, Scheduling and Resource Management, Security
• GGF9 was last week in Chicago

Many Excellent DOE Grid and Middleware Projects

• Reliable and Secure Group Communication
• Commodity Grid Kits (CoGKits)
• Middleware for Science Portals
• Scientific Annotation Middleware (SAM)
• Storage Resource Management for Data Grid Applications
• Common Component Architecture (CCA)
• Scalable Software Initiative

Next-Generation Grids

• Web Services
  – “Semantically encapsulate discrete functionality”
  – Loosely coupled, reusable components
  – XML, SOAP, WSDL, UDDI, etc.
  – Broad industrial support: Microsoft, IBM, Sun, BEA, etc.
• Open Grid Services Architecture (OGSA)
  – Combine Grids (Globus, Legion) with Web Services
  – GT3: Java, AXIS, J2EE, etc.
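
To make "XML, SOAP" concrete, the sketch below builds the kind of message a Web Service client sends: a SOAP 1.1 envelope whose body carries one operation element with its parameters. It uses only Python's standard `xml.etree.ElementTree`. The service namespace `urn:example:jobs`, the function name `make_request`, and the operation and parameter names in the usage note are hypothetical; a real service would define them in its WSDL.

```python
import xml.etree.ElementTree as ET

SOAP_NS = "http://schemas.xmlsoap.org/soap/envelope/"   # SOAP 1.1 envelope namespace
APP_NS = "urn:example:jobs"                             # hypothetical service namespace


def make_request(operation: str, params: dict) -> bytes:
    """Build a SOAP 1.1 envelope whose Body holds a single operation element."""
    ET.register_namespace("soap", SOAP_NS)              # use a readable prefix on output
    envelope = ET.Element(f"{{{SOAP_NS}}}Envelope")
    body = ET.SubElement(envelope, f"{{{SOAP_NS}}}Body")
    call = ET.SubElement(body, f"{{{APP_NS}}}{operation}")
    for name, value in params.items():
        # Each parameter becomes a child element of the operation.
        ET.SubElement(call, f"{{{APP_NS}}}{name}").text = str(value)
    return ET.tostring(envelope, encoding="utf-8")
```

Calling `make_request("submitJob", {"executable": "/bin/date", "count": 1})` returns the serialized envelope as bytes, ready to be sent to a service endpoint by whatever transport the service uses.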

OGSI.NET

• University of Virginia hosting environment for Grid Services based on Microsoft Web Services approach
  – Focus: Grid security (e.g., explicit trust management)
  – Focus: Grid programming models
  – Focus: Connection between UNIX and Win*

Biomolecular VO based on OGSI.NET

Grid Challenges: “UK E-Science Gap Analysis”

(Fox and Walker, Jun 30 2003)

• Security: VPNs/Firewalls, fine-grain access control
• Workflow (“orchestration”) specs and engines
• Fault tolerance
• Grid adaptability (e.g., real-time support)
• Ease of use
• Grid federations

Future Directions

• Grid has come a long way
• Merging of Grid and Web Services shows promise
• Many difficult issues remain
  – Manageable security
  – Integration with legacy applications/tools
• Challenge for SNS: Identify and meet requirements not being met by current Grid technologies