Upload
tim-bell
View
1.776
Download
0
Embed Size (px)
Citation preview
09/05/2014 LinuxTag 2014 3
CERN was founded 1954: 12 European States
“Science for Peace”
Today: 21 Member States
Member States: Austria, Belgium, Bulgaria, the Czech Republic, Denmark, Finland, France, Germany, Greece, Hungary, Israel, Italy, the Netherlands, Norway, Poland, Portugal, Slovakia, Spain, Sweden, Switzerland and the United Kingdom
Candidate for Accession: Romania
Associate Members in Pre-Stage to Membership: Serbia
Applicant States for Membership or Associate Membership:Brazil, Cyprus (awaiting ratification), Pakistan, Russia, Slovenia, Turkey, Ukraine
Observers to Council: India, Japan, Russia, Turkey, United States of America; European Commission and UNESCO
~ 2,300 staff
~ 1,000 other paid personnel
> 11,000 users
Budget (2013) ~1,000 MCHF
09/05/2014 LinuxTag 2014 4
09/05/2014 LinuxTag 2014 5
09/05/2014 LinuxTag 2014 6
Collisions
09/05/2014 LinuxTag 2014 7
A Big Data Challenge
09/05/2014 LinuxTag 2014 8
In 2014,
• 100PB archive with additional 35PB/year
• 10,000 servers
• 75,000 disk drives
• 45,000 tapes
In 2015,
• Run 2 of LHC expected to double data rates
• But many limits and limitations…
The CERN Meyrin Data Centre
09/05/2014 LinuxTag 2014 9
09/05/2014 LinuxTag 2014 10
09/05/2014
Bamboo
Koji, Mock
AIMS/PXE
Foreman
Yum repo
Pulp
Puppet-DB
mcollective, yum
JIRA
Lemon /
Hadoop /
LogStash /
Kibana
git
OpenStack
Nova
Hardware
database
Puppet
Active Directory /
LDAP
LinuxTag 2014 11
Status
• Multi-data centre cloud in production since July 2013 (Geneva and Budapest)
• Currently running OpenStack Havana• KVM and Hyper-V deployed
• All configured automatically with Puppet
• 65,000 cores in CERN IT Private Cloud
• 3PB Ceph pool available for volumes, images and other physics storage
09/05/2014 LinuxTag 2014 12
09/05/2014 LinuxTag 2014 13
Microsoft Active
Directory
CERN DB
on Demand
CERN Network
Database
Account mgmt
system
Horizon
Keystone
Glance
NetworkCompute
Scheduler
Cinder
Nova
Block Storage
Ceph & NetAppCERN
Accounting
Ceilometer
Monitoring - Flume, Elastic
Search, Kibana
14
HDFS
Flume
gatewayelasticsearch Kibana
OpenStack infrastructure
compute-nodescontrollers
compute-nodes
Scaling Architecture Overview
15
Child Cell
Geneva, Switzerland
Child Cell
Budapest, HungaryTop Cell - controllers
Geneva, Switzerland
Load Balancer
Geneva, Switzerland
controllers
Architecture Components
16
rabb
itmq
- Keystone
- Nova api
- Nova conductor
- Nova scheduler
- Nova network
- Nova cells
- Glance api
- Ceilometer agent-central
- Ceilometer collector
Controller
- Flume
- Nova compute
- Ceilometer agent-compute
Compute node
- Flume
- HDFS
- Elastic Search
- Kibana
- MySQL
- MongoDB
- Glance api
- Glance registry
- Keystone
- Nova api
- Nova consoleauth
- Nova novncproxy
- Nova cells
- Horizon
- Ceilometer api
- Cinder api
- Cinder volume
- Cinder scheduler
rabb
itmq
Controller
Top Cell Children Cells
- Stacktach
- Ceph
- Flume
Some Caution on Cells• Single cell limits around 1,000 hypervisors
• Can be adapted using Bluehost alternative approach
with MySQL replication
• Significant function gap being worked on
• Flavors, Availability zones, Scheduling, Ceilometer
need workarounds
• Tested in the OpenStack gate
• Not blocking so local QA environment needed
09/05/2014 LinuxTag 2014 17
Scheduling at Scale• CERN users want more sophisticated scheduling:
• Processor architecture
• Private network subnets
• Varying memory/core/disk ratios
• Hardware with more redundancy
• Servers should be used fully
• Tetris-like problem to find the matches
• Packing is more difficult the nearer to 100% used
• Cells scheduler is rather simple currently
• Try Cell X, if not match, try Cell Y…
09/05/2014 LinuxTag 2014 18
Upgrade Strategy• Surely “OpenStack can‟t be upgraded”
• Our Essex, Folsom and Grizzly clouds were „tear-down‟
migrations
• Puppet managed VMs are typical Cattle cases – re-create
• User VMs snapshot, download image and upload to new instance
• One month window to migrate
• Users of production services expect more
• Physicists accept not creating/changing VMs for a short period
• Running VMs must not be affected
09/05/2014 LinuxTag 2014 19
Phased Migration• Migrated by Component
• Choose an approach (online with load balancer, offline)
• Spin up „teststack‟ instance with production software
• Clone production databases to test environment
• Run through upgrade process
• Validate existing functions, Puppet configuration and monitoring
• Order by complexity and need• Ceilometer, Glance, Keystone
• Cinder, Client CLIs, Horizon
• Nova
09/05/2014 LinuxTag 2014 20
Upgrade Experience• No significant outage of the cloud
• During upgrade window, creation not possible
• Small incidents (see blog for details)
• Puppet can be enthusiastic! - we told it to be
• Community response has been great
• Bugs fixed and points are in Juno design summit
• Rolling upgrades in Icehouse will make it easier
09/05/2014 LinuxTag 2014 21
OpenStack Federation• OpenStack clouds in many high energy physics sites
• 2 more clouds at CERN in experiment areas (>20K cores each)
• Many collaborating sites adopting OpenStack
• Rackspace collaboration in Openlab
• Aim for seamless cloud resources (CERN, sites, public)
• All code to be included as open source in core OpenStack
• Federation building blocks (authentication, images, compute)
• Authentication included in Icehouse
• More to come…
09/05/2014 LinuxTag 2014 22
Next Steps• Scaling to >100,000 cores by 2015
• Around 100 hypervisors per week with fixed staff
• Deploying and configurimg the latest features
• Kerberos / X.509 certificate authentication
• Delegated quota management
• Orchestration
• Database as a Service
• Cells scaling and scheduling
• Federation
09/05/2014 LinuxTag 2014 23
Summary• OpenStack at CERN is in production for thousands of
physicists to analyse the results of the LHC
• Rapid innovation around OpenStack gives new function
at an incredible rate
• Upgrades already done at scale and are approaching
transparent in future
• Collaboration around vibrant open source communities
has delivered production quality services
09/05/2014 LinuxTag 2014 24
Questions ?
09/05/2014 LinuxTag 2014 25
• Details at http://openstack-in-
production.blogspot.fr
• CERN User guide at
http://information-
technology.web.cern.ch/boo
k/cern-private-cloud-user-
guide
• Previous presentations at
http://information-
technology.web.cern.ch/boo
k/cern-private-cloud-user-
guide/openstack-information
09/05/2014 LinuxTag 2014 26
Service Models
09/05/2014 LinuxTag 2014 27
• Pets are given names like pussinboots.cern.ch
• They are unique, lovingly hand raised and cared for
• When they get ill, you nurse them back to health
• Cattle are given numbers like vm0042.cern.ch
• They are almost identical to other cattle
• When they get ill, you get another one
09/05/2014 LinuxTag 2014 28
09/05/2014 LinuxTag 2014 29
http://www.eucalyptus.com/blog/2013/04/02/cy13-q1-community-analysis-%E2%80%94-openstack-vs-opennebula-vs-eucalyptus-vs-
cloudstack
09/05/2014 LinuxTag 2014 30
09/05/2014 LinuxTag 2014 31
Tier-1 (11 centres):•Permanent storage•Re-processing•Analysis
Tier-0 (CERN):•Data recording•Initial data reconstruction•Data distribution
Tier-2 (~200 centres):• Simulation• End-user analysis
• Data is recorded at CERN and Tier-1s and analysed in the Worldwide LHC
Computing Grid
• In a normal day, the grid provides 100,000 CPU days executing over 2 million jobs
09/05/2014 LinuxTag 2014 32
Training for Newcomers
09/05/2014 LinuxTag 2014 33
Buy the book rather than guru mentoring
What are the Origins of Mass ?
09/05/2014 LinuxTag 2014 34
Matter/Anti Matter Symmetric?
09/05/2014 LinuxTag 2014 35
Where is 95% of the Universe?
09/05/2014 LinuxTag 2014 36
New Data Centre in Budapest
09/05/2014 LinuxTag 2014 37
Monitoring - Kibana
38
Monitoring - Kibana
39
Metering at Scale• Ceilometer provides metering functions for
OpenStack
• Requires careful configuration for cells
09/05/2014 LinuxTag 2014 40
I/O at Scale• Most hypervisors are recycled servers
• Most are 2 SATA disks 1-2 TBs
• Some SSD but limited capacity
• IOPS limited with local storage
• Some guest tuning e.g. Linux scheduler
• General approach to use remote storage
• Ceph storage
• Network protocols such as webdav
09/05/2014 LinuxTag 2014 41