Upload
others
View
1
Download
0
Embed Size (px)
Citation preview
Name of presentation Month 2008
DCIM Deployment at University of Queensland
Don Caruana, Team Leader Data Centre Operations, UQ
Name of presentation Month 2008
UQ’s Data Centre Environment
ITS (Data Centre team of three) manages 280 racks in 9 data centres
• 5 at St Lucia
• 1 at CCR (Herston)
• 1 at Gatton
• 1 by agreement on behalf of TRI (PA Hospital)
• 1 commercial facility at Polaris, Springfield
DC Team accountable for Data Centre power, cooling, security, fire protection, rack deployment and layout except for Polaris facility.
Data Centre team on-call outside normal business hours
Power cost approx. $600,000 per annum
Name of presentation Month 2008
DC Management in a University
What I have learnt in past 15 years.
1) Dedicated ITS FM Team accountable for Data Centre operational management
• Daily operations• Monitoring – DCIM tool• Strong links to network and infrastructure team, monitor installs,
storage installs, FM works.
2) Relationship with Facilities Management department • Build and maintain strong working relationships with key FM
personnel. Ensure they understand the mission critical nature of the Data Centre/s
3) Business Continuity in DC Context• Risk management• Risk Identification• Business Impact Analysis• Risk rectification cost• Residual risk
Name of presentation Month 2008
DC Management in a University
4) Capacity management • Load v Capacity management (Enabler -> DCIM)• Regular reports to management
5) Cost of service• Annual budget – what if insufficient – power, cool, network,
contract maintenance.• Regular reports to management (Enabler -> DCIM)
Name of presentation Month 2008
DCIM tool selection
Avantax, Fujitsu, iTracs ((iiSolutions), OpenDCIM, Trellis (Emerson), Struxuware (APC/Schneider)
Deciding factors:• Capable of monitoring 1000+ devices
• Air Conditioners • DC room temperature and humidity• Rack temperature• Water leaks under floor• Environmental monitors / netbotz• Rack power rails (PDUs) and auto transfer switches (ATSs)• UPS• Power meters and power monitors• Phase Failure relays• Generators• Cameras
• Cost considering• Significant proportion of devices in UQ DCs were APC
Name of presentation Month 2008
Struxuware / Data Centre Expert deployment
DCE purchased December 2012, 90% complete 2014, 100% 2015
Noted that for every dollar spent on the tool (including licences) another three were spent on smart devices - PDUs, power meters and monitors, wireless temp sensors, leak ropes, smart cards for CRACs, netbotz…..
Starting Point – Create the rules
• Tree Structure• Passwords on devices• Naming conventions• Thresholds and reporting• Device totals – my tracking progress tool
Name of presentation Month 2008
Tree Structure and passwords
Data Centre• Air con• Environmentals
o Netbotzo Rack temps (= virtual sensors
via PDU)o Room temps
Data Centre Plant room
o Leak sensors• Power
o PDU, ATSo UPPSo Generatoro Power Meterso Power Monitors o MSB - Phase Failure relays
Naming conventionsDevice Name format Label (Examples)ATS datacentre-rack#-ats ccr-b5-ats
tri-c8-atsPower Rail datacentre-rack#-pr# ccr-b7-pr1
ccr-b7-pr2dc2-d4-pr2dc2-d4-pr3
UPS datacentre-ups# dc3-ups1datacentre-room#-ups# tri-8004-ups1
tri-8004-ups3tri-8004-ups4
Power Meters datacentre-db-name-pm dc2-db-2e1-pmdatacentre-msb-metername-pm dc3-msb-db3e2-pm
dc3-msb-ac_unit_3-pmdc2-msb-db2e1-pm
Generator location-generator prn-genAir Con datacentre-ac# tri-ac1
gpn2-ac2location-room#-ac# gpn-110-ac1
prn-104-ac1tri-8002-ac1
Netbotz datacentre-rack#-netbotz dc3-b9-netbotzTemp Sensor - Rack datacentre-rack#-temp# dc3-b2-temp1
tri-j9-temp1Temp Sensor – Room datacentre-room#-temp# dc3-room-temp1
dc3-room-temp3Temp Sensor – Plant Room location-room#-temp# tri-8002-temp1
prn-104-temp1gpn-110-temp1
Leak Detection datacentre-leak# tri-leak1gpn1-leak1
Name of presentation Month 2008
Thresholds and reporting vv Struxuware Threshold Setting and NotificationDevice Warning
Threshold Critical
Threshold Warning
NotificationCritical
Notification
PowerPDU 16 amp 3-phase or single-phase - Load per phase 7 amp 8 amp Email Email
PDU 32 amp 3-phase power rails on 32amp CB 14 amps 16 amps Email Email
PDU - loss of power to a phase, or loss of comms to PDU n/a Email and SMS
ATS 16 amp 10 amp 12 amp Email Email
UPS - General event/fault n/a Email and SMS
UPS 200kva Current per phase - 80% of capacity (DC1, DC2, DC3, CCR) 194 amp n/a Email and SMS n/a
UPS temp 300C 400C Email and SMS Email and SMS
Generator - Run state n/a Email and SMS
TRI power load per rack 5kw 6.4kw Email Email
Prentice generator General Alarm Control Switch Position
Email Email/SMS
Phase Fail Relay Open/Closed Email/SMS
Cooling
Rack inlet temp sensors 280C 320C Email Email and SMS
Room temp sensors 320C 350C Email and SMS Email and SMS
CRAC units - Return Air 320C 350C Email and SMS Email and SMS
Humidity -%RH <20 or >80 <15 or >85 Email Email
All Devices
Loss of connectivity to a device n/a Email and SMS
Name of presentation Month 2008
Device totals – Tracking progress
Example from Prentice Data Centre 2
DEVICE TOTALS DCE (Struxuware)
Device total
Devices in DCE
To be added in
DCE
Cannot be
added in DCE
Current Licence count
Final Licence count
DC2 - 33 racks A1>8 B1>9 C1>8 D1>8 (C1 passive) - single & 3 phase AUDITED 18/1/17
Air con 3 3 3 3
Environmentals
Netbotz - 550 1 1 1 1Rack temp sensors (virtual) 31 31 0 0DC temp/hum sensors 2 2 0 0Leak detection sensor 1 1 0 0Security cameras 6
Power
ATS 6 6 6 6Power Meter 7 7 7 7Power Monitor 4 4 4 4Power Rails 60 58 2 58 60Power Rails - dumb in 1 rack 4 4
UPS 1 1 1 1
Phase Fail relay 2 2 0 0
Name of presentation Month 2008
Device totals – All DCs
DCE cannot monitor every device
• 57 cameras motion capture -> axis camera suite
• 1 generator at CCR -> CCR BMS
• 640 batteries at TRI DC -> Power Shield app
• 2 generators at TRI DC -> TRI BMS
Name of presentation Month 2008
Load and Capacity reports
0
200
400
600
800
1000
1200
Jul-1
2
Sep-
12
Nov
-12
Jan-
13
Mar
-13
May
-13
Jul-1
3
Sep-
13
Nov
-13
Jan-
14
Mar
-14
May
-14
Jul-1
4
Sep-
14
Nov
-14
Jan-
15
Mar
-15
May
-15
Jul-1
5
Sep-
15
Nov
-15
Jan-
16
Mar
-16
May
-16
Jul-1
6
Sep-
16
Nov
-16
Jan-
17
Mar
-17
kilo
wat
ts
Prentice DC1, DC2, DC3Generator capacity v Total Prentice load
Generator Capacity (Shared) Total Prentice Power Load (kw)
Name of presentation Month 2008
Load and Capacity reports
0
100
200
300
400
500
600
700
Jul-1
2
Sep-
12
Nov
-12
Jan-
13
Mar
-13
May
-13
Jul-1
3
Sep-
13
Nov
-13
Jan-
14
Mar
-14
May
-14
Jul-1
4
Sep-
14
Nov
-14
Jan-
15
Mar
-15
May
-15
Jul-1
5
Sep-
15
Nov
-15
Jan-
16
Mar
-16
May
-16
Jul-1
6
Sep-
16
Nov
-16
Jan-
17
Mar
-17
Kilo
wat
ts
Prentice DC2MSB/Feeder capacity v Total load
MSB/Feeder Capacity (kw) Total Power Load (kw)
Name of presentation Month 2008
Load and Capacity reports
0
20
40
60
80
100
120
140
160
Jul-1
2
Sep-
12
Nov
-12
Jan-
13
Mar
-13
May
-13
Jul-1
3
Sep-
13
Nov
-13
Jan-
14
Mar
-14
May
-14
Jul-1
4
Sep-
14
Nov
-14
Jan-
15
Mar
-15
May
-15
Jul-1
5
Sep-
15
Nov
-15
Jan-
16
Mar
-16
May
-16
Jul-1
6
Sep-
16
Nov
-16
Jan-
17
Mar
-17
kilo
wat
ts
Prentice DC2UPS capacity v ICT load
UPS Capacity (kw) ICT Load (kw)
Name of presentation Month 2008
Load and Capacity reports
0
50
100
150
200
250
Jul-1
2
Sep-
12
Nov
-12
Jan-
13
Mar
-13
May
-13
Jul-1
3
Sep-
13
Nov
-13
Jan-
14
Mar
-14
May
-14
Jul-1
4
Sep-
14
Nov
-14
Jan-
15
Mar
-15
May
-15
Jul-1
5
Sep-
15
Nov
-15
Jan-
16
Mar
-16
May
-16
Jul-1
6
Sep-
16
Nov
-16
Jan-
17
Mar
-17
Kilo
wat
ts
Prentice DC2Cooling capacity v Heat load
Cooling Capacity N+1 (kw) Cooling Capacity (kw) Heat load
Name of presentation Month 2008
Load and Capacity reports
1.00
1.20
1.40
1.60
1.80
2.00
2.20
2.40
2.60
DC1 DC2 DC3 GPN CCR
Annual PUE per DC
2013 2014 2015 2016 2017
Name of presentation Month 2008
Load and Capacity reports
$40,000
$45,000
$50,000
$55,000
$60,000
$65,000
Jul-1
2
Sep-
12
Nov
-12
Jan-
13
Mar
-13
May
-13
Jul-1
3
Sep-
13
Nov
-13
Jan-
14
Mar
-14
May
-14
Jul-1
4
Sep-
14
Nov
-14
Jan-
15
Mar
-15
May
-15
Jul-1
5
Sep-
15
Nov
-15
Jan-
16
Mar
-16
May
-16
Jul-1
6
Sep-
16
Nov
-16
Jan-
17
Mar
-17
Total Monthly Cost - All ITS DCs
Total Cost - All DCs
Name of presentation Month 2008
Load and Capacity reports
0.0
2.0
4.0
6.0
8.0
10.0
12.0
14.0
16.0
18.0
20.0
DC1 DC2 DC3 GPN CCR GAT
kilo
wat
ts
Average Power per Rack
Ave load per rack (kw) Max ave load per rack (kw)
Name of presentation Month 2008
Load and Capacity reports
20.00
40.00
60.00
80.00
100.00
120.00
140.00
160.00
180.00
200.00
Mar
-14
Apr-
14M
ay-1
4Ju
n-14
Jul-1
4Au
g-14
Sep-
14O
ct-1
4N
ov-1
4De
c-14
Jan-
15Fe
b-15
Mar
-15
Apr-
15M
ay-1
5Ju
n-15
Jul-1
5Au
g-15
Sep-
15O
ct-1
5N
ov-1
5De
c-15
Jan-
16Fe
b-16
Mar
-16
Apr-
16M
ay-1
6Ju
n-16
Jul-1
6Au
g-16
Sep-
16O
ct-1
6N
ov-1
6De
c-16
Jan-
17Fe
b-17
Kilo
wat
ts
Polaris DC ICT Load
ICT load (KW) Min Load threshold
Name of presentation Month 2008
What does DCE deliver
Confidence – when alerts column is blank every device is within thresholds, that’s confidence when 24x7 service delivery is demanded.
Confidence – management receive monthly load and capacity data – accept business cases more readily when they trust the operation
Pro-active response rather than reactive to daily operational issues