View
217
Download
1
Tags:
Embed Size (px)
Citation preview
les robertson - cern-it 1last update: 18/04/23 20:01
LCG
The LHC Computing Grid Project
Preparing for LHC Data Analysis
NorduGrid Workshop
Stockholm, 11 November 2002
Les Robertson
IT Division, CERN
www.cern.ch/lcg
les robertson - cern-it-2last update 18/04/23 20:01
LCG Project Goals
applications - tools, frameworks, environment, persistency
computing system global grid service cluster automated fabric collaborating computer centres grid CERN-centric analysis global analysis environment
central role of data challenges
Goal – Prepare and deploy the LHC computing environmentfor the analysis and management of the data coming from the detectors
This is not another grid technology project –
it is a grid deployment project
les robertson - cern-it-3last update 18/04/23 20:01
LCG Background Recommendations of the LHC Computing Review
CERN-LHCC-2001-004 – 20 February 2001 Common solutions and support for applications Estimates of total requirements and costs Distributed computing environment using Grid technology
Data recording and reconstruction at CERN Analysis in Regional Centres and CERN
(CERN only a small fraction of the analysis capacity) Simulation in Regional Centres
Launch Committee – CERN/2379/Rev Council 20 September 200
organisation integrating and coordinating work done by experiments,
regional centres, grid projects separating requirements setting (SC2) from implementation
(PEB) reviewed by - technical & scientific – LHC Committee (LHCC)
- resources - Computing Resource Review Board (with representatives of all funding agencies)
les robertson - cern-it-4last update 18/04/23 20:01
LCG Project Resources
National funding for resources at Regional Centres – funded as resources for LHC experiments
Grid projects – suppliers and maintainers of middleware funded by EU, NSF, DoE, national and regional investments
CERN personnel and materials from the base budget – IT and EP
Special contributions by member and observer states of people and materials at CERN during Phase I of the project
Resources from other institutes signing up for common applications projects providing infrastructure services for operating the grid,
supporting users, maintaining systems software, .. CERN openlab industrial contributions
Six components
les robertson - cern-it-5last update 18/04/23 20:01
LCG
CERN will provide the data reconstruction & recording service (Tier 0)-- but only a small part of the analysis capacity (Tier 1)
Other Total CERN as Total CERN asTier 0 Tier 1 Total Tier 1 Tier 1 % of Tier 1 Tier 0 + 1 % of total
Tier 0 + 1
Processing (K SI2000) 12,000 8,000 20,000 49,000 57,000 14% 69,000 29%Disk (PetaBytes) 1.1 1.0 2.1 8.7 9.7 10% 10.8 20%Magnetic tape (PetaBytes) 12.3 1.2 13.5 20.3 21.6 6% 33.9 40%
-------------- CERN --------------
Summary of Computing Capacity Required for all LHC Experiments in 2008
current planning for capacity at principal Regional Centres + CERN 2002: 650 KSI2000 <1% of capacity required n 2008 2005: 6,600 KSI2000 < 10% of 2008 capacity
les robertson - cern-it-6last update 18/04/23 20:01
LCG LCG Project Organisation
Four work areas –
Applications
Grid deployment
Grid Technology
Fabrics (management & technology of large computing clusters)
les robertson - cern-it-7last update 18/04/23 20:01
LCG Applications Area
Base support for the development process, infrastructure, tools, libraries
Frameworks for simulation and analysis Object persistency and data management Projects common to several experiments
everything that is not an experiment-specific component is a potential candidate for a common project
long term advantages in use of resources, support, maintenance
les robertson - cern-it-8last update 18/04/23 20:01
LCG Applications Work Packages
Software process and infrastructure (SPI) – Alberto Aimar
Persistency framework (POOL) – Dirk Duellmann
Math libraries – Fred James Core tools and services - Pere Mato Physics interfaces (Launching) Detector description (Requirements agreed) Event generators (Requirements agreed) Simulation (Requirements stage) Analysis tools, distributed analysis (next priority)
les robertson - cern-it-9last update 18/04/23 20:01
LCG
EventGeneration
Core Services
Dictionary
Whiteboard
Foundation and Utility Libraries
DetectorSimulation
Engine
Persistency
StoreMgr
Reconstruction
Algorithms
Geometry Event Model
GridServices
I nteractiveServices
Modeler
GUIAnalysis
EvtGen
Calibration
Scheduler
Fitter
PluginMgr
Monitor
NTuple
Scripting
FileCatalog
ROOT GEANT4 DataGrid Python Qt
Monitor
. . .MySQLFLUKA
EventGeneration
Core Services
Dictionary
Whiteboard
Foundation and Utility Libraries
DetectorSimulation
Engine
Persistency
StoreMgr
Reconstruction
Algorithms
Geometry Event Model
GridServices
I nteractiveServices
Modeler
GUIAnalysis
EvtGen
Calibration
Scheduler
Fitter
PluginMgr
Monitor
NTuple
Scripting
FileCatalog
ROOT GEANT4 DataGrid Python Qt
Monitor
. . .MySQLFLUKA
Domain Decomposition
Products mentioned are examples; not a comprehensive list
slide by Torre Wenaus
les robertson - cern-it-10last update 18/04/23 20:01
LCG Simulation
First set of formal requirements for LCG for MC generators (October) and simulation (November)
There is a need for support for both GEANT 4 and FLUKA
GEANT4 independent collaboration, including HEP institutes, LHC
and other experiments, other sciences significant CERN and LHC-related related resources MoU being re-discussed now Proposal to create an HEP User Requirements Committee
chaired by an LHC physicist need to ensure long-term support
les robertson - cern-it-11last update 18/04/23 20:01
LCG Grid Deployment
Planning, building, commissioning, operating - -
a stable, reliable, manageable Grid for - -
Data Challenges Distributed Production Distributed Analysis
Integrating services from many Regional Centres around the world
Permanent service – on which Data Challenges are scheduled
les robertson - cern-it-12last update 18/04/23 20:01
LCG
Current status
Experiments are doing their event production using distributed resources with a variety of solutions classic distributed production
– send jobs to specific sites, simple bookkeeping
some use of Globus, and some of the HEP Grid tools
vertically integrated solutions (ALIEN)
Grid Deployment
les robertson - cern-it-13last update 18/04/23 20:01
LCG
Data Challenges in 2002
Wisconsin 18%
INFN 18%
IN2P3 10%
RAL 6%UCSD 3%
UFL 5%
HIP 1%
Caltech 4%Moscow
10%
Bristol 3%
FNAL 8%
CERN 15%
IC 6%
Wisconsin 18%
INFN 18%
IN2P3 10%
RAL 6%UCSD 3%
UFL 5%
HIP 1%
Caltech 4%Moscow
10%
Bristol 3%
FNAL 8%
CERN 15%
IC 6%
Most Resources not at CERN (CERN not even biggest Single Resource)
Spring02: CPU Resources
6 million events~20 sites
les robertson - cern-it-15 [email protected]
grid tools used at 11 sites
les robertson - cern-it-16last update 18/04/23 20:01
LCG Grid Deployment
The hard problem for distributed computing is data analysis – ESD and AOD
chaotic workload unpredictable data access patterns
this is the problem that the LCG has to solve and this is where Grid technology should really help
After two years of grid developmentswe are just at the beginning of grid services
les robertson - cern-it-17last update 18/04/23 20:01
LCG Deploying the LHC Grid
A priority now is to move
from testbeds to a SERVICE
We need to learn how to OPERATE a Grid
Service Quality is the Key to Acceptance of Grids
Reliable OPERATION will be the factor that
limits the practical size of Grids
les robertson - cern-it-18last update 18/04/23 20:01
LCG Centres taking part in LCG-1
around the world around the clock
les robertson - cern-it-19last update 18/04/23 20:01
LCG Centres taking part in LCG-1Tier 1 Centres FZK Karlsruhe, CNAF Bologna, Rutherford Appleton Lab (UK),
IN2P3 Lyon, University of Tokyo, Fermilab, Brookhaven National Lab
Other Centres GSI, Moscow State University, NIKHEF Amsterdam, Academica
Sinica (Taipei), NorduGrid, Caltech, University of Florida, Ohio Supercomputing Centre, Tata Institute (India), Torino, Milano, Legnaro, ……
year 2002 2003 2004 2005
Processing capacity (K-SI2000)CERN 200 380 730 1,440Other Tier 1 Centres 450 1290 2730 5240Other Regional Centres 610 1620 2200 2220Total 1,260 3,290 5,660 8,900CERN as % of total 16% 12% 13% 16%
Estimates of Processing Capacity Plannedin Regional Centres that are taking part in Phase 1
les robertson - cern-it-20last update 18/04/23 20:01
LCG Grid Deployment - The Strategy
Get a basic grid service into production so that we know what works, what doesn’t, what the priorities are
And evolve from there to the full LHC service
Decide on a common set of middleware to be used for the first LCG grid service – LCG-1
target - LCG-1 in operation mid-2003 - LCG-1 in full service by end of 2003
this will be conservative – stability before functionalityand will not satisfy all of the stated requirements
but must be sufficient for the data challenges scheduled in 2004
les robertson - cern-it-21last update 18/04/23 20:01
LCGLCG-1 as a service for LHC
experiments
Mid-2003 5~8 of the larger regional centres available as one of the services used for simulation
campaigns
2H03 add more capacity at operational regional centres add more regional centres initiate operations centre, user support infrastructure
Early 2004 principal service for LHC physics data challenges
les robertson - cern-it-22last update 18/04/23 20:01
LCG Grid Technology in LCG
LCG expects to obtain Grid Technology from projects (well-)funded by national and regional e-science initiatives -- and (later) from industry
the LCG project will concentrate ondeploying a global grid service
les robertson - cern-it-23last update 18/04/23 20:01
LCG Strategy
Press for complementary middleware developments (world wide)
parallel developments are a problem until the standards emerge
Reduce unnecessary divergence and complexity Ensure at least one complete and supported
solution for LHC Understand & resolve middleware support
issues
As a last resort – develop a plan to meet requirements not taken on by other projects
les robertson - cern-it-24last update 18/04/23 20:01
LCG Grid Technology Status
A base set of requirements has been defined (HEPCAL) 43 use cases currently funded projects say they will satisfy ~2/3 of these in 2003
Good experience of experiments working with Grid projects in Europe and the United States
Practical results from testbeds used for physics simulation campaigns
Everyone builds on the Globus toolkit-- which is undergoing a radical re-design
GLUE initiative – working on integration of the (European) DataGrid project and the (US) Virtual Data Toolkit – VDT
Reliability, Scalability are of major concern Long term maintenance of tools developed by R&D projects is
unclear - quality and commitment
Evolution needed from Research & Development Engineering
les robertson - cern-it-25last update 18/04/23 20:01
LCG Grid Technology – Next Steps
leverage the massive investments being made proposals being prepared for EU 6th Framework Programme, NSF
ITR funding round, national science infrastructure funding
priority: a basic set of reliable, maintainable middleware hardening / re-engineering of current prototypes including all the functions that must be common and at least one example of other essential functions
complementary (or at least coordinated) developments of higher-level functionality by different projects, experiments
prepare for major architectural changes before things mature - do not become too attached to current
implementations
les robertson - cern-it-26last update 18/04/23 20:01
LCG
Proposal for integrated project to harden/re-engineer basic
middleware
System DesignGroup
Modelling
User SupportTools
System Tools Middleware
DESIGN
PRODUCTION
DEVELOPMENT
SoftwareStandards &
ProcessManagement
Integration,Testing &
Certification
ProductDelivery
End UserTools/Portals
• industrial management – s/w engineering focus
• few development centres – funded from different sources
• design group - representative of all m/w projects
les robertson - cern-it-27last update 18/04/23 20:01
LCG Challenges
Integrating all of the players towards a common goal Regional Centres, Grid Projects, physics institutes, CERN
Influencing external providers to satisfy LCG requirements – GEANT4, ROOT, .. Grid Projects
Grid technology – the funding is excellent but not so obvious that it will provide solid infrastructure suitable for
LHC LCG will be a global grid service – Europe, Asia, America national ambitions, different agendas make it very hard to navigate –
major contrast with WWW!
We are only at the beginning of understanding Grid operation
Grids imply operation and management by the communitya federation not empires
les robertson - cern-it-28last update 18/04/23 20:01
LCG The Work Plan
Get a production Grid into operation
Deliver a service to LHC experiments
Understand & fix the real problems