USCMS Grid Infrastructure Troubleshooting Shaowen Wang USCMS & GROW OSG Operations & Support Centers Workshop May 16, 2006

Page 1

USCMS Grid Infrastructure Troubleshooting

Shaowen Wang
USCMS & GROW
OSG Operations & Support Centers Workshop
May 16, 2006

Page 2

Collaborative Troubleshooting

A newly established USCMS Grid troubleshooting team
– Ransom Briggs
– Yan Liu
– Anand Padmanabhan
– Eric Shook
– Shaowen Wang

The Tier-1 + 7 Tier-2s

Page 3

Troubleshooting HelpDesk

Interface to users
– [email protected] (at list.uiowa.edu)

Functions
– Triaging
– Solving USCMS Grid infrastructure problems
    – Interfacing with OSG and LCG GGUS
– Directing CMS application-level problems to experts at the Tier-1 and Tier-2s when appropriate
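The triage function listed on this slide can be pictured as a small routing rule. The following is only a sketch under stated assumptions: the `Ticket` fields, the `layer` labels, the `external` flag, and the destination strings are hypothetical and are not taken from the slides or from any real HelpDesk system.

```python
from dataclasses import dataclass

# Hypothetical destination labels, named after the roles on the slide.
INFRA_TEAM = "USCMS Grid troubleshooting team"
OSG_OR_GGUS = "OSG / LCG GGUS"
SITE_EXPERTS = "Tier-1 / Tier-2 experts"


@dataclass
class Ticket:
    summary: str
    layer: str              # assumed labels: "infrastructure" or "application"
    external: bool = False  # assumption: True if the cause lies outside USCMS sites


def triage(ticket: Ticket) -> str:
    """Return a destination for a ticket (illustrative rule only)."""
    if ticket.layer == "application":
        # CMS application-level problems go to site experts.
        return SITE_EXPERTS
    if ticket.external:
        # Infrastructure problems beyond USCMS are escalated.
        return OSG_OR_GGUS
    return INFRA_TEAM
```

For example, `triage(Ticket("analysis job crashes", "application"))` would return the site-experts label, while an infrastructure ticket with `external=False` stays with the troubleshooting team.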

Page 4

FNAL Remedy

Page 5

Grid-Related CMS Application Tools

CRAB
– LCG
    – LCG Resource Broker
– OSG
    – Condor-G
SRM/dCache
PhEDEx (Physics Experiment Data Export)
– CMS data placement and file transfer system
Developing components
– gLite
– Condor Glidein

Page 6

Toward Achieving Proactive Responses

Currently
– Understanding the complexity of troubleshooting Grids
– Reconciling monitoring services
    – MonALISA
    – Condor
    – Other monitoring and information services

Future
– To diagnose with the support of OSG information and accounting services
– To help establish troubleshooting flowchart and automatic alert mechanisms
– To leverage our troubleshooting experience for OSG use
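One way to picture "reconciling monitoring services" together with an automatic alert is to compare the status each source reports for a site and flag any disagreement. This is a minimal sketch, not the team's method: the `reconcile` function, the source names used as dictionary keys, and the status strings are all hypothetical, and no real MonALISA or Condor API is called.

```python
from typing import Dict, List


def reconcile(reports: Dict[str, Dict[str, str]]) -> List[str]:
    """Compare per-site statuses across monitoring sources.

    `reports` maps a source name to a {site: status} dictionary.
    Returns one alert string per site whose sources disagree;
    a site absent from a source is treated as status "missing".
    """
    # Collect every site mentioned by any source.
    sites = sorted({site for statuses in reports.values() for site in statuses})
    alerts = []
    for site in sites:
        seen = {src: statuses.get(site, "missing") for src, statuses in reports.items()}
        if len(set(seen.values())) > 1:
            detail = ", ".join(f"{src}={st}" for src, st in sorted(seen.items()))
            alerts.append(f"{site}: sources disagree ({detail})")
    return alerts
```

A usage sketch with made-up data: if one source reports `siteB` as `"up"` and another reports it as `"down"`, `reconcile` returns a single alert naming `siteB`; sites on which all sources agree produce no alert.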

Page 7

Troubleshooting Workflow

The secret is to follow the path.

Page 8

Page 9

Page 10

Thanks!

Questions and comments?