16
Status & Plan of the Xrootd Federation Wei Yang 1 3/19/12 US ATLAS Computing Facility Meeting at 2012 OSG AHM, University of Nebraska, Lincoln

Status & Plan of the Xrootd Federation

  • Upload
    casper

  • View
    47

  • Download
    0

Embed Size (px)

DESCRIPTION

Status & Plan of the Xrootd Federation. Wei Yang. Again, why do we do this. A data federation allows accessing data at other locations Opens many possibilities and use cases (need to understand/exploit pros and cons) See Rob Gardner’s slides at SW&C meeting last week: - PowerPoint PPT Presentation

Citation preview

Page 1: Status & Plan  of the  Xrootd  Federation

US ATLAS Computing Facility Meeting at 2012 OSG AHM, University of Nebraska, Lincoln

1

Status & Plan of the Xrootd Federation

Wei Yang

3/19/12

Page 2: Status & Plan  of the  Xrootd  Federation

2

Again, why do we do this

A data federation allows accessing data at other locations• Opens many possibilities and use cases (need to

understand/exploit pros and cons)• See Rob Gardner’s slides at SW&C meeting last week: https://indico.cern.ch/getFile.py/access?contribId=40&sessionId=9&resId=0&materialId=slides&confId=169695

Page 3: Status & Plan  of the  Xrootd  Federation

3

Where are we in R&D & Deployment

• Identified site architectures• Deployed a prototype federation with basic

monitoring in the US• Global name space issue addressed • X509 implementing, deployed at SLAC• Studying WAN reading performance using prun and

TTreeCache• Git repository to share configurations, knowledge• Bi-weekly working group meetings

Page 4: Status & Plan  of the  Xrootd  Federation

4

Site ArchitecturesSix possible site architectures https://www.usatlas.bnl.gov/twiki/pub/Admins/MinutesFedXrootdFeb8/FAX-site-diagram.pptx

Work with backend storage systems: dCache, Xrootd, Posix

Xrootd proxy based architecture works for all sites• Deployed at AGLT2, BNL, UTA, OU, SLAC

Overlapping xrootd on dCache• Deployed at UC

Xrootd with libXrdDcap.so (deprecated)• Deployed at UIUC

Working on direct access to dCache xrootd door• Need N2N via authorization plugin

Page 5: Status & Plan  of the  Xrootd  Federation

5

Deployment, Dashboardhttp://uct3-xrdp.uchicago.edu:8080/rsv/

Page 6: Status & Plan  of the  Xrootd  Federation

6

Deployment, CMS-like monitoring

Page 7: Status & Plan  of the  Xrootd  Federation

7

R&D, Technical Stuffs

Global Name Space is addressed• Global File Name to Storage File Name (N2N) translation ready• N2N via a local LFC introduces small latency (3ms?)• dq2-client 1.0 supports GFN

X509 work in progress• Implemented and working. Fixing a security hole

Prun jobs run successfully against the federation• Using prun -–pfnList to supply a list of input files in GFN

Git repository at BNL

Page 8: Status & Plan  of the  Xrootd  Federation

8

Next StepsContinue Deployment• More sites with Xrootd 3.1.1, X509, etc. • Can we do this in a month or two?

Latency study and TTreeCache• Probably need to get a sense of worst cases first• Talk to experts

Panda Analysis, for example:• Have pilot replacing local missing files with GFN• Take advantage of regional redirectors• Local caching• Or can we help ourselves? Note:

lsm-get [-t token] [--size N] [--checksum csum] [--guid guid] source destination

Page 9: Status & Plan  of the  Xrootd  Federation

9

Next Steps, cont’d

Global redirector evaluation• Reliability and performance• explore the concept of regional redirector

Monitoring • Leaning toward DB-less detailed stream monitoring• CMS-like monitoring: Know what is happening, needs aggregation and

history• T3mon

Bug fixes• Extreme copy mode in Xrdcp• Patch X509 security issue

Page 10: Status & Plan  of the  Xrootd  Federation

10

Backup Slides

Page 11: Status & Plan  of the  Xrootd  Federation

Redirectorxrootd proxy mgr

Xrd proxydata server

N2NXrd proxydata server

N2N

Global Redirector

Client

Backend Xrootd

storage

0. Join the Federation

1. Open (GFN)2. Open (GFN)

3. Open/Read (GFN)

4. Read (PFN) Fetched data

data

Export Xrootd storage via Xrootd Proxy cluster

GFN: global file namePFN: physical (storage) file nameN2N: translation GFN to PFNcms.dfs directive: see

cms.dfs : all data servers are equal

http://www.xrootd.org/doc/prod/cms_config.htm#_Toc295744046

Page 12: Status & Plan  of the  Xrootd  Federation

RedirectorXrootd mgr

Regluar Xrddata server

N2NRegular Xrddata server

N2N

Global Redirector

Client

Backend Posix

storage

0. Join the Federation

1. Open (GFN)2. Open (GFN)

3. Open/Read (GFN)

4. Read (PFN) Fetched data

data

Export Posix storage via regular Xrootd cluster

GFN: global file namePFN: physical (storage) file nameN2N: translation GFN to PFNPosix file system: GPFN, Luster, XrootdFS

cms.dfs : all data servers are equal

Posix File system Posix File system

Page 13: Status & Plan  of the  Xrootd  Federation

RedirectorXrootd mgr

Regular Xrddata server with

N2NRegular Xrddata server with

N2N

Global Redirector

Client

Backend dCache

dCap doors

0. Join the Federation

1. Open (GFN)2. Open (GFN)

3. Open/Read (GFN)

4. Read (PFN) Fetched data

data

Export dCache dCap doors via regular Xrootd cluster (deprecated)

GFN: global file namePFN: physical (storage) file nameN2N: translation GFN to PFNdCap plugin: a Xrootd OFS plug-in moudel using dCap protocol

cms.dfs : all data servers are equal

dCap plug-in dCap plug-in

Page 14: Status & Plan  of the  Xrootd  Federation

Redirectorxrootd proxy mgr

Xrd proxydata server

N2NXrd proxydata server

N2N

Global Redirector

Client

Backend dCache

Xrootd doors

0. Join the Federation

1. Open (GFN)2. Open (GFN)

3. Open/Read (GFN)

4. Read (PFN) Fetched data

data

Export dCache Xrootd doors via Xrootd Proxy cluster, 1

GFN: global file namePFN: physical (storage) file nameN2N: translation GFN to PFN

cms.dfs : all data servers are equal

Page 15: Status & Plan  of the  Xrootd  Federation

Regular xrootddata server withXrootd.redirect …

Global Redirector

Client

0. Join the Federation

1. Open (GFN)2. Open (GFN)

data

Export dCache Xrootd doors via Xrootd Proxy cluster, 2

GFN: global file namePFN: physical (storage) file nameN2N: translation GFN to PFNFor Xrootd.redirect directive, see:

dCache pool node with Auth plug-in for N2N

dCache pool node with Auth plug-in for N2N

dCache xrootd doorAuth plug-in for N2N

3. Open (GFN)

4. Open/Read (GFN)

http://www.xrootd.org/doc/prod/xrd_config.htm#_Toc305602916

Page 16: Status & Plan  of the  Xrootd  Federation

RedirectorXrootd mgr

Global Redirector

Client

0. Join the Federation

1. Open (GFN)2. Open (GFN)

3. Open/Read (GFN)

data

Overlapping Xrootd cluster on top of dCache

GFN: global file namePFN: physical (storage) file nameN2N: translation GFN to PFNGFN name space : native xrootd name space layout according to GFN, with symlinks pointing to actual file on the same node(some CMS sites do this)

dCache pool node

Native Xrootd data serverGFN name space (symlinks) or N2N

dCache pool node

Native Xrootd data serverGFN name space (symlinks) or N2N