18
Dr Matt Pritchard, Campus network engineering workshop 19/10/2016 Enabling efficient movement of data into & out of a high-performance analysis environment

Enabling efficient movement of data into & out of a high-performance analysis environment

  • Upload
    jisc

  • View
    283

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Enabling efficient movement of data into & out of a high-performance analysis environment

Dr Matt Pritchard, Campus network engineering workshop

19/10/2016 Enabling efficient movement of data into & out of a high-performance analysis environment

Page 2: Enabling efficient movement of data into & out of a high-performance analysis environment

JASMIN Data Transfer Zone

Enabling efficient movement of data into & out of a high-performance analysis environment

JISC Workshop:Campus Network Engineering for Data Intensive Science19 October 2016

Dr Matt PritchardCentre for Environmental Data AnalysisSTFC / RAL Space

Page 3: Enabling efficient movement of data into & out of a high-performance analysis environment

Outline

• What is JASMIN & who is it for?• Example science use cases & challenges posed• JASMIN “Data Transfer Zone”• Community Organisation• Changing workflows

Page 4: Enabling efficient movement of data into & out of a high-performance analysis environment

Context: JASMIN & CEDA

Page 5: Enabling efficient movement of data into & out of a high-performance analysis environment

JASMIN infrastructure

• Part data store, part supercomputer, part private cloud…

Page 6: Enabling efficient movement of data into & out of a high-performance analysis environment

Science use cases

Testing high resolution climate models

Earthquake monitoring

Whole-mission satellite data processing: greenhouse gases

Air pollution: model-observation comparison

Sea surface temperature from space

Climate data dissemination

Page 7: Enabling efficient movement of data into & out of a high-performance analysis environment

JASMIN data growth

Growth of data in the CEDA archive:• Step change in volume with ingestion of

CMIP5• Step change in no of files with ERA40,

EISCAT datasets

• NCEO & NCAS are Atmos / Earth Obs components of CEDA archive

• CMIP & Sentinel data shown separately• Major growth: users exploiting Group

Workspaces

Page 8: Enabling efficient movement of data into & out of a high-performance analysis environment

ArchiveGroup Workspaces

gws1 gws2 gws3

NERC Managed Cloud Analysis Compute

NERC Unmanaged Cloud

login bastion transfer

sciscisci Cluster

ArchiveTape

NERC EOS Cloud

ESA CCI SST

ElasticTape

Functional View

Page 9: Enabling efficient movement of data into & out of a high-performance analysis environment

JASMIN Internal Network

• Core network designed for optimum performance & flexibility for data analysis– Zero contention, non-blocking– Equal-cost multi-path routing (ECMP)– 3 Terabit/sec internal throughput– Over 1,100 10GbE ports– 12 x 40GbE switches (more than RAL site!)– Compute nodes interchangeable between

virtualization & batch processing

JC2-LSW1 JC2-LSW1 JC2-LSW1JC2-LSW1 JC2-LSW1 JC2-LSW1 JC2-LSW1 JC2-LSW1 JC2-LSW1JC2-LSW1 JC2-LSW1 JC2-LSW1

48 * 16 = 768 10GbE Non-blocking16 x 12 x 40GbE = 192 40GbE ports

S1036 = 32 x 40GbE

JC2-LSW1JC2-LSW1

JC2-SP1 JC2-SP1 JC2-SP1 JC2-SP1 JC2-SP1 JC2-SP1

16 x MSX1024B-1BFS48x10GBE + 12 40 GbE

16 x 12 40GbE = 192 Ports / 32 = 6Total 192 40 GbE Cable

Page 10: Enabling efficient movement of data into & out of a high-performance analysis environment

Moving big data

1. DON’T move it!– Co-locate the analysis environment with the data (move the compute)

• Archive data (~5 PB parallel file system mounted RO)• Group Workspaces (~10 PB, same storage technology, mounted RW)• Process the data in-place

2. Coordinate as a community– Reduce N x N transfers– Replicate data to a few key sites

• “Institutional” data transfers– Assign workspace managers

• Get users to talk to each other!

3. End-user data transfers

Page 11: Enabling efficient movement of data into & out of a high-performance analysis environment

Community coordination: ESGF

Page 12: Enabling efficient movement of data into & out of a high-performance analysis environment

Data transfer improvement

Page 13: Enabling efficient movement of data into & out of a high-performance analysis environment

ICNWG progress

• Created “Science DMZ” at core sites– Blueprint

• http://fasterdata.es.net/science-dmz/• Network zone outside site firewall• As close to site border routers as possible• Dedicated suite of high-performance equipment• Include only specific science data transfer services• Exclude all other traffic (general internet usage)• Monitoring/diagnostic tools : perfSONAR

– ICNWG Implementation• Sites benefit from ESnet “guru” expertise & coordination (Eli Dart)

Page 14: Enabling efficient movement of data into & out of a high-performance analysis environment

JASMIN Data Transfer ZoneDTZ services:

• CEDA archive FTP (RO)• FTP server

• Xfer servers (RW)• Scp, Rsync• Bbcp• Ssh-based GridFTP

• GridFTP servers (RW)• Cert-based GridFTP• Globus endpoint

• ESGF Data Node (RO)• Cert-based GridFTP• Globus endpoint

Page 15: Enabling efficient movement of data into & out of a high-performance analysis environment
Page 16: Enabling efficient movement of data into & out of a high-performance analysis environment
Page 17: Enabling efficient movement of data into & out of a high-performance analysis environment

Changing workflows

Data “gravity”

Page 18: Enabling efficient movement of data into & out of a high-performance analysis environment

Further information

• JASMIN– http://www.jasmin.ac.uk

• Centre for Environmental Data Analysis– http://www.ceda.ac.uk

• STFC Scientific Computing Department– http://www.stfc.ac.uk/SCD/

• JASMIN paperLawrence, B.N. , V.L. Bennett, J. Churchill, M. Juckes, P. Kershaw, S. Pascoe, S. Pepler, M. Pritchard, and A. Stephens. Storing and manipulating environmental big data with JASMIN. Proceedings of IEEE Big Data 2013, p68-75, doi:10.1109/BigData.2013.6691556