30
ch 10 Monitoring NCNU CSIE 97321012 林林林 Stella

Ch 10 Monitoring NCNU CSIE 97321012 林似真 Stella. NCNU CSIE Stella2010/6/82 ganglia

Embed Size (px)

Citation preview

Page 1: Ch 10 Monitoring NCNU CSIE 97321012 林似真 Stella. NCNU CSIE Stella2010/6/82 ganglia

ch 10 Monitoring

NCNU CSIE 97321012 林似真 Stella

Page 2: Ch 10 Monitoring NCNU CSIE 97321012 林似真 Stella. NCNU CSIE Stella2010/6/82 ganglia

NCNU CSIE Stella2010/6/8 2

ganglia

Page 3: Ch 10 Monitoring NCNU CSIE 97321012 林似真 Stella. NCNU CSIE Stella2010/6/82 ganglia

NCNU CSIE Stella2010/6/8 3

[ gæ gl ] Kˈ ŋ ɪəK

Page 4: Ch 10 Monitoring NCNU CSIE 97321012 林似真 Stella. NCNU CSIE Stella2010/6/82 ganglia

NCNU CSIE Stella2010/6/8 4

Page 5: Ch 10 Monitoring NCNU CSIE 97321012 林似真 Stella. NCNU CSIE Stella2010/6/82 ganglia

NCNU CSIE Stella2010/6/8 5

outline● WHY?● HOW?● WHAT?

● Ganglia● Distributed Monitorin

g System● Ganglia Architecture

– gmond– gmetad– web front-end

● Monitor Intersection● Ganglia Data Flow● Distribution and

Install● Example

● Reference● Q&A

Page 6: Ch 10 Monitoring NCNU CSIE 97321012 林似真 Stella. NCNU CSIE Stella2010/6/82 ganglia

NCNU CSIE Stella2010/6/8 6

WHY?

Page 7: Ch 10 Monitoring NCNU CSIE 97321012 林似真 Stella. NCNU CSIE Stella2010/6/82 ganglia

NCNU CSIE Stella2010/6/8 7

Why to Monitor?

● 系統狀態の監控● 解決系統の問題● 系統或應用程式の調校● 系統升級時の依據

Page 8: Ch 10 Monitoring NCNU CSIE 97321012 林似真 Stella. NCNU CSIE Stella2010/6/82 ganglia

NCNU CSIE Stella2010/6/8 8

HOW?

Page 9: Ch 10 Monitoring NCNU CSIE 97321012 林似真 Stella. NCNU CSIE Stella2010/6/82 ganglia

NCNU CSIE Stella2010/6/8 9

How to Monitor?

● 單一主機の監控● 分散式監控系統

Page 10: Ch 10 Monitoring NCNU CSIE 97321012 林似真 Stella. NCNU CSIE Stella2010/6/82 ganglia

NCNU CSIE Stella2010/6/8 10

How to Monitor?

● 單一主機の監控● commands : top, free, uptime, ps● Log file● Shell scripts● MRTG● ...etc. 分散式監控系統

Page 11: Ch 10 Monitoring NCNU CSIE 97321012 林似真 Stella. NCNU CSIE Stella2010/6/82 ganglia

NCNU CSIE Stella2010/6/8 11

How to Monitor?

● 分散式監控系統● Network utilization● CPU utilization● Hardware● System load● Disk utilization● ...etc.

Page 12: Ch 10 Monitoring NCNU CSIE 97321012 林似真 Stella. NCNU CSIE Stella2010/6/82 ganglia

NCNU CSIE Stella2010/6/8 12

Ganglia?

● Nagios ←→ Ganglia● graph display → easy to see

● show host metrics● visualize trends

Page 13: Ch 10 Monitoring NCNU CSIE 97321012 林似真 Stella. NCNU CSIE Stella2010/6/82 ganglia

NCNU CSIE Stella2010/6/8 13

WHAT?

Page 14: Ch 10 Monitoring NCNU CSIE 97321012 林似真 Stella. NCNU CSIE Stella2010/6/82 ganglia

NCNU CSIE Stella2010/6/8 14

WHAT

IS

GANGLIA?

Page 15: Ch 10 Monitoring NCNU CSIE 97321012 林似真 Stella. NCNU CSIE Stella2010/6/82 ganglia

NCNU CSIE Stella2010/6/8 15

Ganglia - Wiki

Page 16: Ch 10 Monitoring NCNU CSIE 97321012 林似真 Stella. NCNU CSIE Stella2010/6/82 ganglia

NCNU CSIE Stella2010/6/8 16

Ganglia

● distributed monitoring system● monitor tool for clusters and grids● view live or historical statistics

● CPU load averages● network utilization

● long-term trends● graph display

Page 17: Ch 10 Monitoring NCNU CSIE 97321012 林似真 Stella. NCNU CSIE Stella2010/6/82 ganglia

NCNU CSIE Stella2010/6/8 17

WHAT

IS

Distributed System

?

Page 18: Ch 10 Monitoring NCNU CSIE 97321012 林似真 Stella. NCNU CSIE Stella2010/6/82 ganglia

NCNU CSIE Stella2010/6/8 18

Distributed Monitoring System

● Components : 1.daemons that reside on cluster nodes

2.server that collects clusters state information from nodes

3.GUI-based front-end, which provides system activity visulization

→ gmond

→ gmetad

→ web front-end

as for Ganglia components :

Page 19: Ch 10 Monitoring NCNU CSIE 97321012 林似真 Stella. NCNU CSIE Stella2010/6/82 ganglia

NCNU CSIE Stella2010/6/8 19

Ganglia Architecture

Page 20: Ch 10 Monitoring NCNU CSIE 97321012 林似真 Stella. NCNU CSIE Stella2010/6/82 ganglia

NCNU CSIE Stella2010/6/8 20

gmond

● Ganglia monitoring daemon● Multithreaded ● Responsibilities :

1.Monitor changes in host states

2.Announce relevant changes

3.Listen to the state of all the other ganglia nodes via a unicast or multicast channel

4.Answer requests for an XML description of the cluster state.

Page 21: Ch 10 Monitoring NCNU CSIE 97321012 林似真 Stella. NCNU CSIE Stella2010/6/82 ganglia

NCNU CSIE Stella2010/6/8 21

gmetad

● Ganglia meta daemon● Aggregate data, information● steps :

1.Poll gmond, clusters, or other gmetad

2.Parse the collected XML

3.Save numeric metrics in RRD(round-robin database)

4.Export aggregated XML over TCP to clients

Page 22: Ch 10 Monitoring NCNU CSIE 97321012 林似真 Stella. NCNU CSIE Stella2010/6/82 ganglia

NCNU CSIE Stella2010/6/8 22

web front-end

● web interface written in PHP● Graph generated by gmetad displays histor

ical information● highly dynamic

Page 23: Ch 10 Monitoring NCNU CSIE 97321012 林似真 Stella. NCNU CSIE Stella2010/6/82 ganglia

NCNU CSIE Stella2010/6/8 23

monitor interaction

● Gmond runs on each cluster node; gmetad can fail over between nodes.

Page 24: Ch 10 Monitoring NCNU CSIE 97321012 林似真 Stella. NCNU CSIE Stella2010/6/82 ganglia

NCNU CSIE Stella2010/6/8 24

Page 25: Ch 10 Monitoring NCNU CSIE 97321012 林似真 Stella. NCNU CSIE Stella2010/6/82 ganglia

NCNU CSIE Stella2010/6/8 25

● Distribution and Install

Page 26: Ch 10 Monitoring NCNU CSIE 97321012 林似真 Stella. NCNU CSIE Stella2010/6/82 ganglia

NCNU CSIE Stella2010/6/8 26

example

● official website demo ● wikimedia demo

Page 27: Ch 10 Monitoring NCNU CSIE 97321012 林似真 Stella. NCNU CSIE Stella2010/6/82 ganglia

NCNU CSIE Stella2010/6/8 27

Refernce

Page 29: Ch 10 Monitoring NCNU CSIE 97321012 林似真 Stella. NCNU CSIE Stella2010/6/82 ganglia

NCNU CSIE Stella2010/6/8 29

Q & A

Page 30: Ch 10 Monitoring NCNU CSIE 97321012 林似真 Stella. NCNU CSIE Stella2010/6/82 ganglia

NCNU CSIE Stella2010/6/8 30

Fin.Now You Can Rest Easy (In Peace ?).