30 November 2005 ScotGrid Phase 2 Procurement Tony Doyle - University of Glasgow
ScotGrid Procurement… a future news item
• 25 June 2006: ScotGrid's 4th birthday
• June 25th marks the fourth anniversary of ScotGrid deployment. After four years, the system has provided more than 2 million CPU hours and 200,000 completed jobs.
• The system is fully integrated with the LCG Grid, the basic Grid system for the EU Enabling Grids for E-Science in Europe (EGEE) project, and the Grid for UK Particle Physics (GridPP) testbed as part of the National Grid Service.
• Locally, the user service supports 12 Groups (ATLAS, BaBar, Bioinformatics, CDF, Device Modelling, Grid Data Management, Information Retrieval, LHCb, Medipix, MICE, UKQCD and ZEUS) with approximately 100 individual users.
• Many external Grid Virtual Organisations are also being supported.
• The hardware is currently being upgraded and will be available for users in Q3 2006….
Grid Overview
Aim: by 2008 (full year’s data taking)
- CPU ~100 MSI2k (100,000 CPUs)
- Storage ~80 PB
- Involving >100 institutes worldwide
- Build on complex middleware being developed in advanced Grid technology projects, both in Europe (gLite) and in the USA (VDT)
1. Prototype went live in September 2003 in 12 countries
2. Extensively tested by the LHC experiments in September 2004
3. 197 sites, 13,797 CPUs, 5PB storage in September 2005
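As a rough illustration of the gap between the September 2005 deployment and the 2008 aim, the figures above can be compared directly. This is only a sketch: the names are illustrative, and raw CPU counts are assumed to be comparable even though the aim is really stated in MSI2k.

```python
# Figures quoted on the slide: 2008 aim versus September 2005 status.
aim_2008 = {"cpus": 100_000, "storage_pb": 80}
status_sep_2005 = {"cpus": 13_797, "storage_pb": 5}

def fraction_of_aim(status, aim):
    """Return, per resource, the deployed fraction of the 2008 aim."""
    return {key: status[key] / aim[key] for key in aim}

progress = fraction_of_aim(status_sep_2005, aim_2008)
for resource, fraction in progress.items():
    print(f"{resource}: {fraction:.1%} of the 2008 aim")
```

On these numbers, roughly 14% of the CPU count and about 6% of the storage aimed for in 2008 were in place by September 2005.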
What is GridPP?
1. Large UK-wide Project (20 Institutes, £33m over six years [Sep 2001 - Aug 2007])
2. A team that built a working prototype Grid of significant scale
   UK (worldwide):
   > 2,000 (10,000) CPUs
   > 1,000 (5,000) TB of storage
   > 1,000 (6,000) simultaneous jobs
3. A complex project where 88% of the milestones were completed in the first phase
[Figure: GridPP project map, status date 1-Jan-04. GridPP Goal: to develop and deploy a large scale science Grid in the UK for the use of the Particle Physics community. Top-level elements: CERN, DataGrid, Applications, Infrastructure, Interoperability, Dissemination, Resources. Legend: metric OK, metric not OK, task complete, task overdue, task due within 60 days, task not due soon, not active, no task or metric.]
What is ScotGrid?
1. Glasgow/Edinburgh/Durham (Dundee) Project
2. Compute-intensive jobs performed at Glasgow and Durham
3. Data-intensive jobs performed at Edinburgh
4. First applications using the Grid. Meeting real requirements of Grid applications: currently Particle Physics, Bioinformatics, Computing Science, Electrical Engineering
5. Glasgow leading R&D in Grid Data Management and Security
6. Edinburgh leading co-ordination and R&D in Networking and Storage
7. Managed as part of a worldwide Grid infrastructure through GridPP and EGEE
What are the Grid challenges?
Data Management, Security and Sharing
1. Software process
2. Software efficiency
3. Deployment planning
4. Link centres
5. Share data
6. Manage data
7. Install software
8. Analyse data
9. Accounting
10. Policies
Tier Structure
Tier 0: CERN computer centre (online system, offline farm)
Tier 1: National centres (RAL, UK; France; Italy; Germany; USA)
Tier 2: Regional groups (ScotGrid, NorthGrid, SouthGrid, London)
Tier 3: Institutes (Glasgow, Edinburgh, Durham, Dundee)
Tier 4: Workstations
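The tier structure above can be sketched as a small lookup table. This is an illustrative layout only: the dictionary shape and the `tier_of` helper are assumptions, and the Tier 3 entries are just the ScotGrid institutes named on the slide.

```python
# Tier hierarchy as listed on the slide; the dict layout itself is an assumption.
TIERS = {
    0: ("CERN computer centre", ["CERN"]),
    1: ("National centres", ["RAL (UK)", "France", "Italy", "Germany", "USA"]),
    2: ("Regional groups", ["ScotGrid", "NorthGrid", "SouthGrid", "London"]),
    3: ("Institutes", ["Glasgow", "Edinburgh", "Durham", "Dundee"]),
    4: ("Workstations", []),
}

def tier_of(site):
    """Return the tier number that lists `site`, or None if it is not listed."""
    for level, (_role, sites) in TIERS.items():
        if site in sites:
            return level
    return None

print(tier_of("ScotGrid"))
print(tier_of("Glasgow"))
```

Keeping the role name next to each site list makes it easy to answer both "which tier is this site in?" and "what does that tier do?" from one structure.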
Timescales
• Service Challenges – UK deployment plans
• End point: April ’07
• Context: first real (cosmics) data ’05
15 Baseline Services for a functional Grid
Middleware Requirement
Storage Element
Basic File Transfer
Reliable File Transfer
Catalogue Services
Data Management tools
Compute Element
Workload Management
VO Agents
VO Membership Services
DataBase Services
Posix-like I/O
Application Software Installation Tools
Job Monitoring
Reliable Messaging
Information System
Scottish (and National) Grid Service will rely upon gLite components
This middleware builds upon VDT (Globus and Condor) and meets the requirements of all the basic scientific use cases:
1. Green (amber) areas are (almost) agreed as part of the shared generic middleware stack by each of the application areas
2. Red are areas where generic middleware competes with application-specific software.
www.glite.org
Middleware Requirement | Offerings (OMII; VDT/GT; gLite; Other) | Comment
Storage Element | Yes; SRM via dCache, DPM or CASTOR | LCG includes Storage Resource Management capability
Basic File Transfer | Yes; GridFTP; Yes | LCG includes GridFTP
Reliable File Transfer | RFT; File Transfer Service | FTS is built on top of GridFTP
Catalogue Services | RLS; LCG File Catalogue, gLite FireMan | Central catalogues adequate, high throughput needed
Data Management tools | OMII Data Service (upload / download); File Placement Service | gLite File Placement Service under development
Compute Element | OMII Job Service; Gatekeeper; Yes | gLite uses Globus 2 with mods
Workload Management | Manual resource allocation & job submission; Condor-G; Resource Broker | RB builds on Globus, Condor-G
VO Agents | | Perform localised activities on behalf of VO
VO Membership Services | Tools for account management (no GridMapFile equivalent); CAS; VOMS | CAS does not provide all the needed functionality
DataBase Services | MySQL, PostgreSQL, ORACLE client | Off-the-shelf offerings are adequate
Posix-like I/O | GFAL, gLite I/O; Xrootd |
Application Software Installation Tools | Yes | Tools already exist in LCG-2 e.g. PACMAN
Job Monitoring | Monalisa, Netlogger; Logging service, R-GMA |
Reliable Messaging | | Tools such as Jabber are used by experiments
Information System | MDS (GLUE); Yes; BDII | LCG based on BDII and GLUE schema
Middleware Re-engineering
• A series of gLite releases has been produced (1.1, 1.2, 1.3, and 1.4)
  – Driven by application and deployment needs
  – Focus on defect fixing
• gLite deployed on a Pre-Production Service and made available for application use
  – Independent evaluation by NGS
  – gLite components also available via VDT (US)
• gLite components deployed on the infrastructure
  – More scheduled by the end of the year
• Emphasis is now on release of gLite 1.5
  – Will continue…
  – EGEE phase 2 starts in April 2006
www.glite.org
Metrics and Quality Assurance
Target | Current status | Q2 2006 target values
Number of users | ~1000 | ≥ 3000
Number of sites | 120 | 50
Number of CPUs | ~12000 | 9500 at month 15
Number of disciplines | 6 | ≥ 5
Multinational | 24 | ≥ 15 countries
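A quick way to see which targets have already been reached is to compare the two columns. The sketch below assumes that the "~" values can be treated as point estimates and that every target is a minimum; the names are illustrative.

```python
# (current status, Q2 2006 target) pairs as read from the table above.
# Values marked "~" on the slide are treated as point estimates (an assumption).
metrics = {
    "users": (1000, 3000),
    "sites": (120, 50),
    "cpus": (12000, 9500),
    "disciplines": (6, 5),
    "countries": (24, 15),
}

def targets_met(table):
    """Map each metric to True when the current value reaches its target."""
    return {name: current >= target for name, (current, target) in table.items()}

status = targets_met(metrics)
for name, met in status.items():
    print(f"{name}: {'met' if met else 'not yet met'}")
```

Under these assumptions every target except the number of users has already been met.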
Wider Application Support
• More than 20 applications from 7 domains
  – High Energy Physics
    • 4 LHC experiments (ALICE, ATLAS, CMS, LHCb)
    • BaBar, CDF, DØ, ZEUS
  – Biomedicine
    • Bioinformatics (Drug Discovery, GPS@, Xmipp_MLrefine, etc.)
    • Medical imaging (GATE, CDSS, gPTM3D, SiMRI 3D, etc.)
  – Earth Sciences
    • Earth Observation, Solid Earth Physics, Hydrology, Climate
  – Computational Chemistry
  – Astronomy
    • MAGIC
    • Planck
  – Geo-Physics
    • EGEODE
  – Financial Simulation
    • E-GRID
• Another 8 applications from 4 domains are in evaluation stage
Example: “UK contributes to EGEE's battle with malaria”
BioMed: Successes/Day 1107; Success % 77%
WISDOM (Wide In Silico Docking On Malaria)
The first biomedical data challenge for drug discovery, which ran on the EGEE grid production service from 11 July 2005 until 19 August 2005.
GridPP resources in the UK contributed ~100,000 kSI2k-hours from 9 sites
[Figure: Number of biomedical jobs processed by country]
[Figure: Normalised CPU hours contributed to the biomedical VO for UK sites, July-August 2005]
Accounting
Summary
• ScotGrid is part of an evolving National and International Grid
• Deployment, middleware and application support status provide a context for Grid procurements here and elsewhere
• External developments and timelines place constraints on the hardware being procured for ScotGrid Phase 2