GriPhyN EAC Meeting (Apr. 12, 2001)

Opening and Overview
GriPhyN External Advisory Meeting
Marina del Rey, April 12, 2001

Paul Avery
University of Florida
http://www.phys.ufl.edu/~avery/
avery@phys.ufl.edu

Who We Are
  U Florida
  U Chicago
  Boston U
  Caltech
  U Wisconsin, Madison
  USC/ISI
  Harvard
  Indiana
  Johns Hopkins
  Northwestern
  Stanford
  U Illinois at Chicago
  U Penn
  U Texas, Brownsville
  U Wisconsin, Milwaukee
  UC Berkeley
  UC San Diego
  San Diego Supercomputer Center
  Lawrence Berkeley Lab
  Argonne
  Fermilab
  Brookhaven

GriPhyN = App. Science + CS + Grids

GriPhyN = Grid Physics Network
  US-CMS: High Energy Physics
  US-ATLAS: High Energy Physics
  LIGO/LSC: Gravity wave research
  SDSS: Sloan Digital Sky Survey
  Strong partnership with computer scientists

Design and implement production-scale grids
  Investigation of “Virtual Data” concept (fig)
  Integration into 4 major science experiments
  Develop common infrastructure, tools and services
  Builds on existing foundations: Globus tools

Multi-year project
  Grid R&D
  Development, deployment of “Tier 2” hardware, personnel (fig)
  Education & outreach

GriPhyN Data Grid Challenge

“Global scientific communities, served by networks with bandwidths varying by orders of magnitude, need to perform computationally demanding analyses of geographically distributed datasets that will grow by at least 3 orders of magnitude over the next decade, from the 100 Terabyte to the 100 Petabyte scale.”
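The growth figures in the challenge statement can be checked with a few lines of arithmetic. The sketch below assumes decimal units (1 Terabyte = 10^12 bytes) and a span of exactly ten years; neither is stated on the slide. It confirms that 100 Terabytes to 100 Petabytes is three orders of magnitude and shows the average yearly growth that this would imply.

```python
import math

# Assumption: decimal units; the slide does not say which convention is meant.
TERABYTE = 10**12
PETABYTE = 10**15

start_bytes = 100 * TERABYTE
end_bytes = 100 * PETABYTE
years = 10  # "over the next decade", taken as exactly ten years

# Total growth and its size in orders of magnitude (powers of ten).
growth_factor = end_bytes / start_bytes
orders_of_magnitude = math.log10(growth_factor)

# Constant yearly factor that compounds to the total growth.
annual_factor = growth_factor ** (1 / years)

print(f"total growth: {growth_factor:.0f}x")
print(f"orders of magnitude: {orders_of_magnitude:.1f}")
print(f"implied average growth: {annual_factor:.2f}x per year")
```

Under these assumptions the data volume would roughly double every year for a decade.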

Data Grid Hierarchy

(Figure: tree diagram with Tier 0 (CERN) at the top, branching to Tier 1, then T2 sites, then Tier 3 and Tier 4 nodes)

Tier0: CERN
Tier1: National Lab
Tier2: Regional Center at University
Tier3: University workgroup
Tier4: Workstation

GriPhyN:
  R&D
  Tier2 centers
  Unify all IT resources
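One way to picture the tier hierarchy is as a tree in which every site hangs under a site of a shallower tier. The following is a minimal sketch of that idea, not a description of any GriPhyN software: the `Site` class, the `count_by_tier` helper, and all site names other than CERN are hypothetical, and the example tree is far smaller than the one in the figure.

```python
from __future__ import annotations

from collections import Counter
from dataclasses import dataclass, field

# Tier roles as listed on the slide.
TIER_ROLES = {
    0: "CERN",
    1: "National Lab",
    2: "Regional Center at University",
    3: "University workgroup",
    4: "Workstation",
}


@dataclass
class Site:
    """One node in the hierarchy (hypothetical illustration class)."""
    name: str
    tier: int
    children: list[Site] = field(default_factory=list)

    def add(self, child: Site) -> Site:
        # A child must sit at a deeper tier than its parent.
        if child.tier <= self.tier:
            raise ValueError(
                f"{child.name} (tier {child.tier}) cannot hang under tier {self.tier}"
            )
        self.children.append(child)
        return child


def count_by_tier(root: Site) -> dict[int, int]:
    """Walk the tree depth-first and count the sites at each tier."""
    counts: Counter[int] = Counter()
    stack = [root]
    while stack:
        site = stack.pop()
        counts[site.tier] += 1
        stack.extend(site.children)
    return dict(counts)


# Hypothetical miniature grid: one lab, two regional centers.
cern = Site("CERN", 0)
lab = cern.add(Site("national-lab", 1))
for i in range(2):
    center = lab.add(Site(f"regional-center-{i}", 2))
    group = center.add(Site(f"workgroup-{i}", 3))
    group.add(Site(f"workstation-{i}", 4))

for tier, n in sorted(count_by_tier(cern).items()):
    print(f"Tier {tier} ({TIER_ROLES[tier]}): {n} site(s)")
```

The check in `add` only requires a deeper tier rather than exactly the next one, since the figure does not make clear whether every site attaches to the tier immediately above it.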

LHC Global Grid Hierarchy

(Figure: LHC computing tiers connected by network links, from the experiment down to workstations)

Experiment
Online System
Tier 0 +1: CERN Computer Center, > 20 TIPS
Tier 1: USA Center, France Center, Italy Center, UK Center
Tier 2: Tier2 Centers
Tier 3: Institutes, ~0.25 TIPS
Tier 4: Workstations, other portals
Physics data cache

Link labels in the figure: ~PBytes/sec, ~100 MBytes/sec, 2.5 Gbits/sec (twice), ~622 Mbits/sec, 100 - 1000 Mbits/sec

Bunch crossing per 25 nsecs
100 triggers per second
Event is ~1 MByte in size

Physicists work on analysis “channels”.
Each institute has ~10 physicists working on one or more channels
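The rates quoted with the figure fit together arithmetically: 100 triggers per second at about 1 MByte per event gives about 100 MBytes/sec, which matches one of the link labels. The sketch below works this through and also estimates how long an hour of recorded data would take to cross a link. It assumes decimal units and an ideal link with no protocol overhead; the `transfer_time_s` helper is hypothetical.

```python
# Figures taken from the slide.
BUNCH_SPACING_S = 25e-9        # one bunch crossing every 25 ns
TRIGGERS_PER_S = 100           # events kept per second
EVENT_SIZE_BYTES = 1_000_000   # ~1 MByte (decimal units assumed)

# Crossing rate is the inverse of the spacing: about 40 million per second.
crossing_rate_hz = 1 / BUNCH_SPACING_S

# Recorded data rate: 100 events/s x 1 MByte/event = 100 MBytes/sec.
recorded_bytes_per_s = TRIGGERS_PER_S * EVENT_SIZE_BYTES

# Crossings that occur for every event that is kept.
crossings_per_trigger = crossing_rate_hz / TRIGGERS_PER_S


def transfer_time_s(n_bytes: float, link_bits_per_s: float) -> float:
    """Ideal transfer time, ignoring protocol overhead and contention."""
    return n_bytes * 8 / link_bits_per_s


one_hour_bytes = recorded_bytes_per_s * 3600

print(f"crossing rate: {crossing_rate_hz / 1e6:.0f} MHz")
print(f"recorded rate: {recorded_bytes_per_s / 1e6:.0f} MBytes/sec")
print(f"crossings per kept event: {crossings_per_trigger:,.0f}")
for label, bps in [("2.5 Gbits/sec", 2.5e9), ("622 Mbits/sec", 622e6)]:
    minutes = transfer_time_s(one_hour_bytes, bps) / 60
    print(f"one hour of data over {label}: {minutes:.0f} min")
```

Real transfers would be slower than these ideal numbers, so the sketch is a lower bound on transfer time rather than a prediction.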

GriPhyN I Funded (R&D)

NSF results announced Sep. 13, 2000
  $11.9M from NSF Information Technology Research Program
  $1.4M in matching from universities
  Largest of all ITR awards

Scope of ITR funding
  Major costs for people, esp. students, postdocs
  2/3 CS + 1/3 application science
  Industry partnerships needed to realize scope
    Still being pursued

Education and outreach
  Reach non-traditional students and other constituencies
  University partnerships
  Grids “natural” for integrating intellectual resources from all locations
  E/O led by UT Brownsville (Romano, Campanelli)

GriPhyN Management Needs

GriPhyN is a complex project
  17 universities, SDSC, 3 labs, >40 active participants
  4 physics experiments providing frontier challenges

GriPhyN I funded primarily as an IT research project
  2/3 CS + 1/3 physics

Need to balance and coordinate
  Research creativity with project goals and deliverables
  GriPhyN schedule with 4 experiment schedules
  GriPhyN design and architecture with that of other projects whose work will be used by LHC or other experiments
    PPDG, EU DataGrid
  GriPhyN deliverables with those of other datagrid projects

GriPhyN Management Organization

Project Leadership
  Project Directors: Paul Avery, Ian Foster
  Project Coordinator (active search)

Advisory Committees
  Project Coordination Group (weekly meetings)
  Collaboration Board (not met yet)
  External Advisory Board (1-2 times per year)

Coordinators
  Industrial Programs
  Outreach/Education
  System Integration

NSF Review Committee

(Figure: organization chart; the boxes are listed below)

Project Directors: Paul Avery, Ian Foster
External Advisory Board
NSF Review Committee
Collaboration Board (Chair: Paul Avery)
Project Coordination Group
Project Coordinator
System Integration
Outreach/Education: Joseph Romano
Industrial Programs: Alex Szalay
Other Grid Projects
Major Physics Experiments
Internet2
DOE Science
NSF PACIs

Technical Coordination Committee (Chair: J. Bunn)
  Networks: H. Newman + T. DeFanti
  Databases: A. Szalay + M. Franklin
  Visualization: T. DeFanti
  Digital Libraries: R. Moore
  Grids: C. Kesselman
  Collaborative Systems: P. Galvez + R. Stevens

VD Toolkit Development (Coord.: M. Livny)
  Requirements Definition & Scheduling (Miron Livny)
  Integration & Testing (Carl Kesselman?)
  Documentation & Support (TBD)

CS Research (Coord.: I. Foster)
  Execution Management (Miron Livny)
  Performance Analysis (Valerie Taylor)
  Request Planning & Scheduling (Carl Kesselman)
  Virtual Data (Reagan Moore)

Applications (Coord.: H. Newman)
  ATLAS (Rob Gardner)
  CMS (Harvey Newman)
  LSC (LIGO) (Bruce Allen)
  SDSS (Alexander Szalay)

GriPhyN Management Organization

Technical Organization
  Computer Science Research
  Virtual Data Toolkit Development
  Application (Physics experiment) Projects

Liaison with Experiments
  Reps on Project Coordination Group
  Subgroups in Application Projects organization
  Directors have direct contact with experiment computing leaders

Liaison with Other Datagrid Projects
  Common participants with PPDG
  Cross committee memberships with EU Datagrid
  Datagrid Coordination meetings
    First was March 4 in Amsterdam
    Next June 23 in Rome

A Common Infrastructure Opportunity

Particle Physics Data Grid (US, DOE)
  Data Grid applications for HENP
  Funded 2000, 2001
  http://www.ppdg.net/

GriPhyN (US, NSF)
  Petascale Virtual-Data Grids
  Funded 9/2000 – 9/2005
  http://www.griphyn.org/

European Data Grid (EU)
  Data Grid technologies, EU deployment
  Funded 1/2001 – 1/2004
  http://www.eu-datagrid.org/

HEP in common
Focus: infrastructure development & deployment
International scope

Data Grid Project Collaboration

GriPhyN + PPDG + EU-DataGrid + national efforts
  France, Italy, UK, Japan

Have agreed to collaborate, develop joint infrastructure
  Initial meeting March 4 in Amsterdam to discuss issues
  Future meetings in June, July

Preparing management document
  Joint management, technical boards + steering committee
  Coordination of people, resources
  An expectation that this will lead to real work

Collaborative projects
  Grid middleware
  Integration into applications
  Grid testbed: iVDGL
  Network testbed: T3 = Transatlantic Terabit Testbed

iVDGL

International Virtual-Data Grid Laboratory
  A place to conduct Data Grid tests at scale
  A concrete manifestation of world-wide grid activity
  A continuing activity that will drive Grid awareness
  A basis for further funding

Scale of effort
  For national, international scale Data Grid tests, operations
  Computationally and data intensive computing
  Fast networks

Who
  Initially US-UK-EU
  Other world regions later
  Discussions w/ Russia, Japan, China, Pakistan, India, South America

Status of Data Grid Projects

GriPhyN
  $12M funded by NSF/ITR 2000 program (5 year R&D)
  2001 supplemental funds requested for initial deployments
  Submitting 5-year proposal ($15M) to NSF to deploy iVDGL

Particle Physics Data Grid
  Funded in 1999, 2000 by DOE ($1.2M per year)
  Submitting 3-year proposal ($12M) to DOE Office of Science

EU DataGrid
  €10M funded by EU (3 years, 2001 – 2004)
  Submitting proposal in April for additional funds

GridPP in UK
  Submitted proposal April 3 ($30M)

Japan, others?

GriPhyN Activities Since Sept. 2000

All-hands meeting Oct. 2-3, 2000
Architecture meeting Dec. 20
Smaller meetings between CS-experiments
Preparation of requirements documents by experiments
Architecture document(s)
  Included in architecture definition for EU DataGrid
Mar. 4 meeting to discuss collaboration of Grid projects
All-hands meeting April 9, 2001
Hiring still proceeding (2/3 finished)
Submitting new proposal Apr. 25, 2001

Discussion Points

Maintaining the right balance between research and development

Maintaining focus vs. accepting broader scope
  E.g., international collaboration
  E.g., GriPhyN in the large (GriPhyN II)
  E.g., Terascale

Creating a national cyberinfrastructure
  What is our appropriate role?

Discussion Points

Outreach to other disciplines
  Biology, NEES, …

Outreach to other constituencies
  Small universities, K-12, public, international, …

Virtual data toolkit
  Inclusive or focused?
  Resource issue, again

Achieving critical mass of resources to deliver on the complete promise