Name of presentation Month 2008 DCIM Deployment at University of Queensland Don Caruana, Team Leader Data Centre Operations, UQ

DCIM Deployment at University of Queensland - QUDIT


DCIM Deployment at University of Queensland

Don Caruana, Team Leader Data Centre Operations, UQ


UQ’s Data Centre Environment

ITS (Data Centre team of three) manages 280 racks in 9 data centres

• 5 at St Lucia

• 1 at CCR (Herston)

• 1 at Gatton

• 1 by agreement on behalf of TRI (PA Hospital)

• 1 commercial facility at Polaris, Springfield

DC Team accountable for Data Centre power, cooling, security, fire protection, rack deployment and layout except for Polaris facility.

Data Centre team on-call outside normal business hours

Power cost approx. $600,000 per annum


DC Management in a University

What I have learnt in the past 15 years.

1) Dedicated ITS FM Team accountable for Data Centre operational management

• Daily operations
• Monitoring – DCIM tool
• Strong links to network and infrastructure team, monitor installs, storage installs, FM works.

2) Relationship with Facilities Management department

• Build and maintain strong working relationships with key FM personnel. Ensure they understand the mission critical nature of the Data Centre/s

3) Business Continuity in DC Context

• Risk management
• Risk Identification
• Business Impact Analysis
• Risk rectification cost
• Residual risk


DC Management in a University

4) Capacity management

• Load v Capacity management (Enabler -> DCIM)
• Regular reports to management

5) Cost of service

• Annual budget – what if insufficient – power, cooling, network, contract maintenance.
• Regular reports to management (Enabler -> DCIM)


DCIM tool selection

Avantax, Fujitsu, iTracs (iiSolutions), OpenDCIM, Trellis (Emerson), StruxureWare (APC/Schneider)

Deciding factors:

• Capable of monitoring 1000+ devices
  o Air conditioners
  o DC room temperature and humidity
  o Rack temperature
  o Water leaks under floor
  o Environmental monitors / Netbotz
  o Rack power rails (PDUs) and auto transfer switches (ATSs)
  o UPS
  o Power meters and power monitors
  o Phase failure relays
  o Generators
  o Cameras
• Cost considerations
• Significant proportion of devices in UQ DCs were APC


StruxureWare / Data Centre Expert deployment

DCE purchased December 2012, 90% complete 2014, 100% 2015

Noted that for every dollar spent on the tool (including licences), another three were spent on smart devices: PDUs, power meters and monitors, wireless temperature sensors, leak ropes, smart cards for CRACs, Netbotz and so on.

Starting Point – Create the rules

• Tree Structure
• Passwords on devices
• Naming conventions
• Thresholds and reporting
• Device totals – my tracking progress tool


Tree Structure and passwords

Data Centre

• Air con
• Environmentals
  o Netbotz
  o Rack temps (= virtual sensors via PDU)
  o Room temps
  o Leak sensors
• Power
  o PDU, ATS
  o UPS
  o Generator
  o Power Meters
  o Power Monitors
  o MSB - Phase Failure relays

Data Centre Plant room
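A device tree like this maps naturally onto a nested structure. The sketch below is one reading of the slide, not an export from DCE: the grouping of leak sensors under Environmentals and the empty "Data Centre Plant room" node are assumptions, and `DEVICE_TREE` and `paths` are illustrative names.

```python
# One possible reading of the slide's tree. The grouping is an assumption;
# the slide does not list children for the plant room node.
DEVICE_TREE = {
    "Data Centre": {
        "Air con": [],
        "Environmentals": ["Netbotz", "Rack temps", "Room temps", "Leak sensors"],
        "Power": [
            "PDU, ATS",
            "UPS",
            "Generator",
            "Power Meters",
            "Power Monitors",
            "MSB - Phase Failure relays",
        ],
    },
    "Data Centre Plant room": {},
}


def paths(tree, prefix=()):
    """Yield every branch of the tree as a tuple of node names."""
    for name, children in tree.items():
        here = prefix + (name,)
        if not children:
            # Node with nothing listed beneath it.
            yield here
        elif isinstance(children, dict):
            yield from paths(children, here)
        else:
            for leaf in children:
                yield here + (leaf,)


for branch in paths(DEVICE_TREE):
    print(" / ".join(branch))
```

Fixing the tree before adding devices means every device has exactly one path, which keeps reports and alert routing predictable.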


Naming conventions

Device | Name format | Label (examples)
ATS | datacentre-rack#-ats | ccr-b5-ats, tri-c8-ats
Power Rail | datacentre-rack#-pr# | ccr-b7-pr1, ccr-b7-pr2, dc2-d4-pr2, dc2-d4-pr3
UPS | datacentre-ups# | dc3-ups1
UPS | datacentre-room#-ups# | tri-8004-ups1, tri-8004-ups3, tri-8004-ups4
Power Meters | datacentre-db-name-pm | dc2-db-2e1-pm
Power Meters | datacentre-msb-metername-pm | dc3-msb-db3e2-pm, dc3-msb-ac_unit_3-pm, dc2-msb-db2e1-pm
Generator | location-generator | prn-gen
Air Con | datacentre-ac# | tri-ac1, gpn2-ac2
Air Con | location-room#-ac# | gpn-110-ac1, prn-104-ac1, tri-8002-ac1
Netbotz | datacentre-rack#-netbotz | dc3-b9-netbotz
Temp Sensor - Rack | datacentre-rack#-temp# | dc3-b2-temp1, tri-j9-temp1
Temp Sensor - Room | datacentre-room#-temp# | dc3-room-temp1, dc3-room-temp3
Temp Sensor - Plant Room | location-room#-temp# | tri-8002-temp1, prn-104-temp1, gpn-110-temp1
Leak Detection | datacentre-leak# | tri-leak1, gpn1-leak1
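A naming convention is only useful if labels are checked against it. The following is a minimal sketch of such a check, assuming the formats can be expressed as regular expressions. The character classes (for example, a rack being one letter followed by digits) are inferred from the example labels and are not a published UQ rule; `PATTERNS` and `classify` are hypothetical names.

```python
import re

# Building blocks inferred from the example labels (assumptions).
SITE = r"[a-z]+\d*"   # ccr, tri, dc2, gpn1, prn
RACK = r"[a-z]\d+"    # b5, c8, j9
ROOM = r"\d+"         # 8004, 110

PATTERNS = {
    "ats": rf"{SITE}-{RACK}-ats",
    "power_rail": rf"{SITE}-{RACK}-pr\d+",
    "ups": rf"{SITE}-(?:{ROOM}-)?ups\d+",
    "power_meter": rf"{SITE}-(?:db|msb)-[a-z0-9_]+-pm",
    "generator": rf"{SITE}-gen",
    "air_con": rf"{SITE}-(?:{ROOM}-)?ac\d+",
    "netbotz": rf"{SITE}-{RACK}-netbotz",
    "rack_temp": rf"{SITE}-{RACK}-temp\d+",
    "room_temp": rf"{SITE}-(?:room|{ROOM})-temp\d+",
    "leak": rf"{SITE}-leak\d+",
}


def classify(label):
    """Return the device type whose pattern matches the label, else None."""
    for device, pattern in PATTERNS.items():
        if re.fullmatch(pattern, label):
            return device
    return None


for label in ["ccr-b5-ats", "tri-8004-ups3", "dc3-room-temp1", "CCR_B5_ATS"]:
    print(label, "->", classify(label))
```

Running a check like this over an exported device list would flag labels that drift from the convention before they reach reports.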


Thresholds and reporting

StruxureWare Threshold Setting and Notification

Device | Warning Threshold | Critical Threshold | Warning Notification | Critical Notification

Power

PDU 16 amp 3-phase or single-phase - load per phase | 7 amp | 8 amp | Email | Email
PDU 32 amp 3-phase power rails on 32 amp CB | 14 amp | 16 amp | Email | Email
PDU - loss of power to a phase, or loss of comms to PDU | n/a | Email and SMS
ATS 16 amp | 10 amp | 12 amp | Email | Email
UPS - general event/fault | n/a | Email and SMS
UPS 200 kVA current per phase - 80% of capacity (DC1, DC2, DC3, CCR) | 194 amp | n/a | Email and SMS | n/a
UPS temp | 30°C | 40°C | Email and SMS | Email and SMS
Generator - run state | n/a | Email and SMS
TRI power load per rack | 5 kW | 6.4 kW | Email | Email
Prentice generator | General Alarm | Control Switch Position | Email | Email/SMS
Phase Fail Relay | Open/Closed | Email/SMS

Cooling

Rack inlet temp sensors | 28°C | 32°C | Email | Email and SMS
Room temp sensors | 32°C | 35°C | Email and SMS | Email and SMS
CRAC units - return air | 32°C | 35°C | Email and SMS | Email and SMS
Humidity - %RH | <20 or >80 | <15 or >85 | Email | Email

All Devices

Loss of connectivity to a device | n/a | Email and SMS
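The two-level scheme (a warning threshold and a critical threshold, each with its own notification route) can be sketched as a small lookup. This is an illustration only, not how DCE implements alerting. The numeric values are taken from the slide, reading its "280C" style figures as 28 °C. The dictionary keys, the `Threshold` class and the choice of "greater than or equal" as the trigger are assumptions.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Threshold:
    warning: float
    critical: float
    warning_notify: str
    critical_notify: str


# Values from the threshold table; key names are illustrative.
THRESHOLDS = {
    "pdu_16a_phase_amps": Threshold(7, 8, "Email", "Email"),
    "pdu_32a_phase_amps": Threshold(14, 16, "Email", "Email"),
    "ats_16a_amps": Threshold(10, 12, "Email", "Email"),
    "rack_inlet_temp_c": Threshold(28, 32, "Email", "Email and SMS"),
    "room_temp_c": Threshold(32, 35, "Email and SMS", "Email and SMS"),
}


def evaluate(kind, reading):
    """Return (severity, notification) for a reading, or ("ok", None).

    Assumes a threshold is breached when the reading reaches it; the slide
    does not say whether the comparison is inclusive.
    """
    t = THRESHOLDS[kind]
    if reading >= t.critical:
        return "critical", t.critical_notify
    if reading >= t.warning:
        return "warning", t.warning_notify
    return "ok", None


print(evaluate("rack_inlet_temp_c", 29))
print(evaluate("rack_inlet_temp_c", 33))
print(evaluate("pdu_16a_phase_amps", 6.5))
```

Checking the critical level first ensures a reading above both thresholds is reported once, at the higher severity.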


Device totals – Tracking progress

Example from Prentice Data Centre 2

DEVICE TOTALS DCE (StruxureWare)

DC2 - 33 racks A1>8 B1>9 C1>8 D1>8 (C1 passive) - single & 3 phase. AUDITED 18/1/17

Device | Device total | Devices in DCE | To be added in DCE | Cannot be added in DCE | Current Licence count | Final Licence count

Air con | 3 | 3 | - | - | 3 | 3

Environmentals

Netbotz - 550 | 1 | 1 | - | - | 1 | 1
Rack temp sensors (virtual) | 31 | 31 | - | - | 0 | 0
DC temp/hum sensors | 2 | 2 | - | - | 0 | 0
Leak detection sensor | 1 | 1 | - | - | 0 | 0
Security cameras | 6 | - | - | - | - | -

Power

ATS | 6 | 6 | - | - | 6 | 6
Power Meter | 7 | 7 | - | - | 7 | 7
Power Monitor | 4 | 4 | - | - | 4 | 4
Power Rails | 60 | 58 | 2 | - | 58 | 60
Power Rails - dumb in 1 rack | 4 | - | - | 4 | - | -
UPS | 1 | 1 | - | - | 1 | 1
Phase Fail relay | 2 | 2 | - | - | 0 | 0
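The device totals sheet doubles as a progress tracker: comparing "device total" with "devices in DCE" shows what is still outstanding. A minimal sketch of that calculation follows, using only the DC2 rows that list both figures. The variable names are illustrative and the code is not part of DCE.

```python
# (device, device total, devices in DCE) for DC2 rows that give both figures.
ROWS = [
    ("Air con", 3, 3),
    ("Netbotz - 550", 1, 1),
    ("Rack temp sensors (virtual)", 31, 31),
    ("DC temp/hum sensors", 2, 2),
    ("Leak detection sensor", 1, 1),
    ("ATS", 6, 6),
    ("Power Meter", 7, 7),
    ("Power Monitor", 4, 4),
    ("Power Rails", 60, 58),
    ("UPS", 1, 1),
    ("Phase Fail relay", 2, 2),
]

total = sum(row[1] for row in ROWS)
in_dce = sum(row[2] for row in ROWS)

# Devices that exist on the floor but are not yet monitored.
outstanding = [(name, tot - done) for name, tot, done in ROWS if tot > done]

print(f"{in_dce} of {total} devices in DCE ({100 * in_dce / total:.1f}%)")
print("Still to add:", outstanding)
```

For these rows the only gap is the two power rails still to be added, which matches the "to be added in DCE" column.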


Device totals – All DCs

DCE cannot monitor every device

• 57 cameras motion capture -> axis camera suite

• 1 generator at CCR -> CCR BMS

• 640 batteries at TRI DC -> Power Shield app

• 2 generators at TRI DC -> TRI BMS


Load and Capacity reports

[Chart: Prentice DC1, DC2, DC3 - Generator capacity v Total Prentice load. Y-axis: kilowatts, 0 to 1200. X-axis: Jul-12 to Mar-17. Series: Generator Capacity (Shared); Total Prentice Power Load (kW).]
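A load v capacity report reduces to one ratio per month: load divided by capacity. The sketch below flags months where that ratio passes a planning limit. The 80% default echoes the "80% of capacity" figure used for the UPS current threshold, but applying it here is an assumption, and the sample loads and the 600 kW capacity are hypothetical values, not readings from the chart.

```python
def utilisation(load_kw, capacity_kw):
    """Fraction of capacity in use."""
    if capacity_kw <= 0:
        raise ValueError("capacity must be positive")
    return load_kw / capacity_kw


def months_over_limit(monthly_load_kw, capacity_kw, limit=0.8):
    """Return the months whose load exceeds limit * capacity."""
    return [
        month
        for month, load_kw in monthly_load_kw.items()
        if utilisation(load_kw, capacity_kw) > limit
    ]


# Hypothetical figures for illustration only.
sample = {"Jan-17": 410.0, "Feb-17": 455.0, "Mar-17": 490.0}
print(months_over_limit(sample, capacity_kw=600.0))
```

The same two functions apply to each pairing in the reports: generator, MSB/feeder, UPS and cooling capacity against their respective loads.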


Load and Capacity reports

[Chart: Prentice DC2 - MSB/Feeder capacity v Total load. Y-axis: kilowatts, 0 to 700. X-axis: Jul-12 to Mar-17. Series: MSB/Feeder Capacity (kW); Total Power Load (kW).]


Load and Capacity reports

[Chart: Prentice DC2 - UPS capacity v ICT load. Y-axis: kilowatts, 0 to 160. X-axis: Jul-12 to Mar-17. Series: UPS Capacity (kW); ICT Load (kW).]


Load and Capacity reports

[Chart: Prentice DC2 - Cooling capacity v Heat load. Y-axis: kilowatts, 0 to 250. X-axis: Jul-12 to Mar-17. Series: Cooling Capacity N+1 (kW); Cooling Capacity (kW); Heat load.]


Load and Capacity reports

[Chart: Annual PUE per DC. Y-axis: PUE, 1.00 to 2.60. X-axis: DC1, DC2, DC3, GPN, CCR. Series: 2013, 2014, 2015, 2016, 2017.]
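PUE (power usage effectiveness) is total facility energy divided by the energy delivered to IT equipment, so a value of 1.0 would mean no overhead for cooling or power distribution. A minimal sketch of the calculation, with hypothetical annual meter readings rather than UQ figures:

```python
def pue(total_facility_kwh, it_equipment_kwh):
    """Power usage effectiveness: total facility energy / IT equipment energy."""
    if it_equipment_kwh <= 0:
        raise ValueError("IT equipment energy must be positive")
    return total_facility_kwh / it_equipment_kwh


# Hypothetical annual readings for one data centre (kWh).
total_kwh = 1_500_000
it_kwh = 1_000_000
print(f"PUE = {pue(total_kwh, it_kwh):.2f}")
```

Both inputs need to cover the same period, which is why the power meters on the MSB and the UPS or PDU readings have to be captured consistently.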


Load and Capacity reports

[Chart: Total Monthly Cost - All ITS DCs. Y-axis: $40,000 to $65,000. X-axis: Jul-12 to Mar-17. Series: Total Cost - All DCs.]


Load and Capacity reports

[Chart: Average Power per Rack. Y-axis: kilowatts, 0.0 to 20.0. X-axis: DC1, DC2, DC3, GPN, CCR, GAT. Series: Ave load per rack (kW); Max ave load per rack (kW).]


Load and Capacity reports

[Chart: Polaris DC ICT Load. Y-axis: kilowatts, 20 to 200. X-axis: Mar-14 to Feb-17. Series: ICT load (kW); Min Load threshold.]


What does DCE deliver

Confidence – when the alerts column is blank, every device is within thresholds. That matters when 24x7 service delivery is demanded.

Confidence – management receives monthly load and capacity data and accepts business cases more readily when it trusts the operation.

Proactive rather than reactive response to daily operational issues.