The LHC Computing Grid Project: Preparing for LHC Data Analysis. NorduGrid Workshop, Stockholm, 11 November 2002. Les Robertson, IT Division, CERN. [email protected] www.cern.ch/lcg

LCG

The LHC Computing Grid Project

Preparing for LHC Data Analysis

NorduGrid Workshop

Stockholm, 11 November 2002

Les Robertson

IT Division, CERN

[email protected]

www.cern.ch/lcg

LCG Project Goals

applications - tools, frameworks, environment, persistency

computing system → global grid service
  – cluster → automated fabric
  – collaborating computer centres → grid
  – CERN-centric analysis → global analysis environment

central role of data challenges

Goal – Prepare and deploy the LHC computing environment for the analysis and management of the data coming from the detectors

This is not another grid technology project –

it is a grid deployment project

LCG Background

Recommendations of the LHC Computing Review, CERN-LHCC-2001-004 – 20 February 2001
  – Common solutions and support for applications
  – Estimates of total requirements and costs
  – Distributed computing environment using Grid technology
  – Data recording and reconstruction at CERN
  – Analysis in Regional Centres and CERN (CERN only a small fraction of the analysis capacity)
  – Simulation in Regional Centres

Launch Committee – CERN/2379/Rev Council 20 September 200
  – organisation integrating and coordinating work done by experiments, regional centres, grid projects
  – separating requirements setting (SC2) from implementation (PEB)
  – reviewed by
      – technical & scientific: LHC Committee (LHCC)
      – resources: Computing Resource Review Board (with representatives of all funding agencies)

LCG Project Resources

Six components:

National funding for resources at Regional Centres – funded as resources for LHC experiments

Grid projects – suppliers and maintainers of middleware, funded by EU, NSF, DoE, national and regional investments

CERN personnel and materials from the base budget – IT and EP

Special contributions by member and observer states of people and materials at CERN during Phase I of the project

Resources from other institutes
  – signing up for common applications projects
  – providing infrastructure services for operating the grid, supporting users, maintaining systems software, ..

CERN openlab industrial contributions

LCG

CERN will provide the data reconstruction & recording service (Tier 0), but only a small part of the analysis capacity (Tier 1)

Summary of Computing Capacity Required for all LHC Experiments in 2008

                            CERN     CERN     CERN     Other    Total    CERN as %   Total        CERN as % of
                            Tier 0   Tier 1   Total    Tier 1   Tier 1   of Tier 1   Tier 0 + 1   total Tier 0 + 1
Processing (K SI2000)       12,000   8,000    20,000   49,000   57,000   14%         69,000       29%
Disk (PetaBytes)            1.1      1.0      2.1      8.7      9.7      10%         10.8         20%
Magnetic tape (PetaBytes)   12.3     1.2      13.5     20.3     21.6     6%          33.9         40%

current planning for capacity at principal Regional Centres + CERN
  – 2002: 650 KSI2000, <1% of capacity required in 2008
  – 2005: 6,600 KSI2000, <10% of 2008 capacity
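
The derived columns of the capacity table above follow from three inputs per row: CERN Tier 0, CERN Tier 1 and other Tier 1 capacity. A minimal Python sketch of that arithmetic, using the processing figures from the slide; the function and variable names are illustrative and not part of the original talk:

```python
def capacity_summary(cern_tier0, cern_tier1, other_tier1):
    """Derive the totals and CERN shares for one row of the table."""
    cern_total = cern_tier0 + cern_tier1
    total_tier1 = cern_tier1 + other_tier1
    total_tier0_1 = cern_tier0 + total_tier1
    return {
        "cern_total": cern_total,
        "total_tier1": total_tier1,
        "cern_pct_of_tier1": round(100 * cern_tier1 / total_tier1),
        "total_tier0_1": total_tier0_1,
        "cern_pct_of_total": round(100 * cern_total / total_tier0_1),
    }


# Processing row, in K SI2000, as given on the slide.
processing = capacity_summary(cern_tier0=12_000, cern_tier1=8_000, other_tier1=49_000)
print(processing)

# Planned capacity as a fraction of the 2008 processing requirement.
for year, planned in {2002: 650, 2005: 6_600}.items():
    share = 100 * planned / processing["total_tier0_1"]
    print(f"{year}: {planned} K SI2000 = {share:.1f}% of the 2008 requirement")
```

The processing row reproduces the 14% and 29% shown on the slide. The disk and tape rows can differ by a percentage point when recomputed this way, presumably because the slide's inputs are already rounded.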

LCG Project Organisation

Four work areas –

Applications

Grid deployment

Grid Technology

Fabrics (management & technology of large computing clusters)

LCG Applications Area

Base support for the development process, infrastructure, tools, libraries

Frameworks for simulation and analysis

Object persistency and data management

Projects common to several experiments
  – everything that is not an experiment-specific component is a potential candidate for a common project
  – long term advantages in use of resources, support, maintenance

LCG Applications Work Packages

Software process and infrastructure (SPI) – Alberto Aimar

Persistency framework (POOL) – Dirk Duellmann

Math libraries – Fred James

Core tools and services – Pere Mato

Physics interfaces (Launching)

Detector description (Requirements agreed)

Event generators (Requirements agreed)

Simulation (Requirements stage)

Analysis tools, distributed analysis (next priority)

LCG Domain Decomposition

[Diagram: software domains and example products. Domain labels: EventGeneration, DetectorSimulation, Reconstruction, Calibration, Algorithms, Engine, Geometry, Event Model, Persistency, StoreMgr, FileCatalog, Core Services, Dictionary, Whiteboard, PluginMgr, Scheduler, Monitor, GridServices, InteractiveServices, Modeler, GUI, Analysis, EvtGen, Fitter, NTuple, Scripting, Foundation and Utility Libraries. Product labels: ROOT, GEANT4, FLUKA, DataGrid, Python, Qt, MySQL, ...]

Products mentioned are examples; not a comprehensive list

slide by Torre Wenaus

LCG Simulation

First set of formal requirements for LCG for MC generators (October) and simulation (November)

There is a need for support for both GEANT4 and FLUKA

GEANT4
  – independent collaboration, including HEP institutes, LHC and other experiments, other sciences
  – significant CERN and LHC-related resources
  – MoU being re-discussed now

Proposal to create an HEP User Requirements Committee chaired by an LHC physicist

need to ensure long-term support

LCG Grid Deployment

Planning, building, commissioning, operating a stable, reliable, manageable Grid for:
  – Data Challenges
  – Distributed Production
  – Distributed Analysis

Integrating services from many Regional Centres around the world

Permanent service – on which Data Challenges are scheduled

LCG Grid Deployment

Current status

Experiments are doing their event production using distributed resources with a variety of solutions:
  – classic distributed production: send jobs to specific sites, simple bookkeeping
  – some use of Globus, and some of the HEP Grid tools
  – vertically integrated solutions (ALIEN)

LCG

Data Challenges in 2002

Spring02: CPU Resources

[Pie chart: share of CPU resources by site. Wisconsin 18%, INFN 18%, IN2P3 10%, RAL 6%, UCSD 3%, UFL 5%, HIP 1%, Caltech 4%, Moscow 10%, Bristol 3%, FNAL 8%, CERN 15%, IC 6%]

Most resources not at CERN (CERN not even biggest single resource)

6 million events, ~20 sites

grid tools used at 11 sites

LCG Grid Deployment

The hard problem for distributed computing is data analysis – ESD and AOD
  – chaotic workload
  – unpredictable data access patterns

this is the problem that the LCG has to solve and this is where Grid technology should really help

After two years of grid developments we are just at the beginning of grid services

LCG Deploying the LHC Grid

A priority now is to move from testbeds to a SERVICE

We need to learn how to OPERATE a Grid

Service Quality is the Key to Acceptance of Grids

Reliable OPERATION will be the factor that limits the practical size of Grids

LCG Centres taking part in LCG-1

around the world, around the clock

LCG Centres taking part in LCG-1

Tier 1 Centres: FZK Karlsruhe, CNAF Bologna, Rutherford Appleton Lab (UK), IN2P3 Lyon, University of Tokyo, Fermilab, Brookhaven National Lab

Other Centres: GSI, Moscow State University, NIKHEF Amsterdam, Academia Sinica (Taipei), NorduGrid, Caltech, University of Florida, Ohio Supercomputing Centre, Tata Institute (India), Torino, Milano, Legnaro, ……

Estimates of Processing Capacity Planned in Regional Centres that are taking part in Phase 1

Processing capacity (K-SI2000)    2002    2003    2004    2005
CERN                               200     380     730   1,440
Other Tier 1 Centres               450   1,290   2,730   5,240
Other Regional Centres             610   1,620   2,200   2,220
Total                            1,260   3,290   5,660   8,900
CERN as % of total                 16%     12%     13%     16%
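
The "Total" and "CERN as % of total" rows of the planned-capacity table can be recomputed from the three capacity rows. A short Python sketch under that assumption; the data structure and names are illustrative only, and the figures are copied from the slide:

```python
YEARS = [2002, 2003, 2004, 2005]

# Planned processing capacity in K-SI2000, as given on the slide.
planned = {
    "CERN": [200, 380, 730, 1440],
    "Other Tier 1 Centres": [450, 1290, 2730, 5240],
    "Other Regional Centres": [610, 1620, 2200, 2220],
}

# Sum the three rows column by column to get the yearly totals.
totals = [sum(column) for column in zip(*planned.values())]

# CERN share of each yearly total, rounded to a whole percent.
cern_share = [round(100 * c / t) for c, t in zip(planned["CERN"], totals)]

for year, total, share in zip(YEARS, totals, cern_share):
    print(f"{year}: total {total:,} K-SI2000, CERN {share}%")
```

Recomputed this way, the totals and percentages agree with the values quoted on the slide for all four years.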

LCG Grid Deployment - The Strategy

Get a basic grid service into production so that we know what works, what doesn’t, what the priorities are

And evolve from there to the full LHC service

Decide on a common set of middleware to be used for the first LCG grid service – LCG-1

target
  – LCG-1 in operation mid-2003
  – LCG-1 in full service by end of 2003

this will be conservative – stability before functionality – and will not satisfy all of the stated requirements

but must be sufficient for the data challenges scheduled in 2004

LCG-1 as a service for LHC experiments

Mid-2003
  – 5 to 8 of the larger regional centres
  – available as one of the services used for simulation campaigns

2H03
  – add more capacity at operational regional centres
  – add more regional centres
  – initiate operations centre, user support infrastructure

Early 2004
  – principal service for LHC physics data challenges

LCG Grid Technology in LCG

LCG expects to obtain Grid Technology from projects (well-)funded by national and regional e-science initiatives, and (later) from industry

the LCG project will concentrate on deploying a global grid service

LCG Strategy

Press for complementary middleware developments (world wide)

parallel developments are a problem until the standards emerge

Reduce unnecessary divergence and complexity

Ensure at least one complete and supported solution for LHC

Understand & resolve middleware support issues

As a last resort – develop a plan to meet requirements not taken on by other projects

LCG Grid Technology Status

A base set of requirements has been defined (HEPCAL)
  – 43 use cases
  – currently funded projects say they will satisfy ~2/3 of these in 2003

Good experience of experiments working with Grid projects in Europe and the United States

Practical results from testbeds used for physics simulation campaigns

Everyone builds on the Globus toolkit, which is undergoing a radical re-design

GLUE initiative – working on integration of the (European) DataGrid project and the (US) Virtual Data Toolkit – VDT

Reliability, Scalability are of major concern

Long term maintenance of tools developed by R&D projects is unclear - quality and commitment

Evolution needed from Research & Development to Engineering

LCG Grid Technology – Next Steps

leverage the massive investments being made
  – proposals being prepared for EU 6th Framework Programme, NSF ITR funding round, national science infrastructure funding

priority: a basic set of reliable, maintainable middleware
  – hardening / re-engineering of current prototypes
  – including all the functions that must be common and at least one example of other essential functions

complementary (or at least coordinated) developments of higher-level functionality by different projects, experiments

prepare for major architectural changes before things mature - do not become too attached to current implementations

LCG

Proposal for integrated project to harden/re-engineer basic middleware

[Diagram: project structure across DESIGN, DEVELOPMENT and PRODUCTION. Box labels: System Design Group; Modelling; Middleware; System Tools; User Support Tools; End User Tools/Portals; Software Standards & Process Management; Integration, Testing & Certification; Product Delivery]

• industrial management – s/w engineering focus

• few development centres – funded from different sources

• design group - representative of all m/w projects

LCG Challenges

Integrating all of the players towards a common goal
  – Regional Centres, Grid Projects, physics institutes, CERN

Influencing external providers to satisfy LCG requirements – GEANT4, ROOT, .. Grid Projects

Grid technology – the funding is excellent, but it is not so obvious that it will provide solid infrastructure suitable for LHC

LCG will be a global grid service – Europe, Asia, America
  – national ambitions, different agendas make it very hard to navigate – major contrast with WWW!

We are only at the beginning of understanding Grid operation

Grids imply operation and management by the community
  – a federation not empires

LCG The Work Plan

Get a production Grid into operation

Deliver a service to LHC experiments

Understand & fix the real problems