Monitoring Kubernetes with OMD Labs Edition - FOSDEM · Monitoring Kubernetes with OMD Labs Edition...

Monitoring Kubernetes with OMD Labs Edition and Prometheus

Michael Kraus - FOSDEM 2017

About me

About meMichael Kraus

Doing monitoring for 12 years,

mainly with plain old Nagios,

open-source only.

Senior Monitoring Consultant

@ ConSol.

Background

WhyKubernetes in a

classical enterprise

Implementation of Kubernetes

PoC at $customer:

We have …

● already running some

monitoring instances there.

● but no idea about

monitoring Kubernetes.

WithEnter Prometheus

Natural choice for kubernetes monitoring:

● Integrated service

discovery

● Labels are retained

between Kubernetes and

Prometheus

HowWhere to start

There are excellent tutorials

and blog posts available as a

starting point, for example by

● coreos.com/blog/

( Fabian Reinartz )

● robustperception.io/blog/

( Brian Brazil )

● … many examples on

GitHub

Implementation

ImplementationPrometheus

kubernetes_sd

● kubernetes_sd_configs

- role: endpoints

- role: node

- role: pod

prometheus-kubernetes.yml

from prometheus/examples.

ImplementationPrometheus

kubernetes_sd

Metrics:

● apiserver_*

● container_cpu_*

● container_fs_*

● deployment_*

● etcd_*

● kubelet_*

● ...

Implementationnode_exporter

Prometheus exporter for

hardware and OS metrics

exposed by the kernel.

● DaemonSet

● prometheus.io/scrape:

'true'

Implementationnode_exporter

Metrics:

● node_cpu

● node_disk_*

● node_filesystem_*

● node_netstat_*

● node_vmstat_*

● ...

Implementationkube-state-metrics

“... focused … on the health of

the various objects inside, such

as deployments, nodes and

pods.”

● prometheus.io/scrape:

'true'

Implementationkube-state-metrics

Metrics:

● kube_deployment_*

● kube_node_*

● kube_pod_*

● kube_resource_quota

● ...

ImplementationDemo environment

Based on minikube:github.com/kubernetes/minikube

Sample config:github.com/m-kraus/kubernetes-monitoring

ImplementationWhat else?

What we also need:

● persistent storage

● Alertmanager

● Grafana

● Pushgateway

● ...

But we have that already

Classical monitoring

OMD Labs Edition

Monitoring in one package.

● completely open-source

● based on Nagios / Icinga

● bundles “best practices” of

many years of experience

● no root required

"Musterlösung" at $customer for monitoring projects:

OMD Labs Edition

Nagios Icinga1 Icinga2 Shinken

Naemon Thruk Mod-Gearman

LMD NagVis PNP4Nagios

Apache MySQL InfluxDB Nagflux

Prometheus Dokuwiki Grafana FreeTDS JMX4Perl

check_webinject check_logfiles

Jolokia check_mysql_health

coshsh check_mssql_health rrdcache check_nsc_web check_curly check_nwc_health check_multi

check_oracle_health Ansible

OMD sites and commads

omd create <MYSITE>

omd cp <PROD> <STAGE>

omd update <STAGE>

omd version

omd create <MYSITE>omd cp <PROD> <STAGE>

OMD Labs Edition

https://labs.consol.de/omd/

ImplementationConnecting OMD

Why not scrape Kubernetes directly from OMD:

● hard to access pods inside

Kubernetes

● hard to access API from

outside Kubernetes

● API secured via TLS and

token only (easily) available

from a serviceaccount

ImplementationConnecting OMD

Getting the metrics from Kubernetes to OMD:

● federation

- job_name: 'kube_federation'

metrics_path: '/federate'

honor_labels: true

params:

'match[]':

-'{job=~"^kubernetes.+"}'

Issues

IssuesFederation

“... Not quite the purpose of

federation.”

Brian Brazilwww.robustperception.io/

federation-what-is-it-good-for/

● Let’s try it anyway ...

IssuesSecuring

"Accessing metrics without

authentication is ok for a PoC,

but not allowed in

production..."

internal audit

● How to secure (federated)

Prometheus?

IssuesIntegration

"Should Nagios, Alertmanager

or both notify?"

“Do we need to define our

checks and alerts both, in

Nagios and Prometheus?”

● How to route alerts

● How to ease or centralize

configuration

IssuesLong-term storage

“How can we store (some) of

our graphs for a longer period

of time?”

● InfluxDB ?

IssuesCoverage

“Our kubernetes cluster died.

We had no monitoring until it

up again...”

operations team

● external monitoring of

crucial components

○ machine health

○ important services

○ important API queries

Thanks for watching

Monitoring Kubernetes with OMD Labs Edition - FOSDEM · Monitoring Kubernetes with OMD Labs Edition...

Documents

SigDigger - FOSDEM

Made fosdem v2

Kubernetes Patterns Kubernetes Patterns

FD.io VPP & Ligato Use Cases - FOSDEM · 2019-04-15 · FD.io VPP & Ligato Use Cases Contiv-VPP CNI plugin for Kubernetes IPSEC VPN gateway •Project at Linux Foundation ... Makes

Infrastructure Design for Kubernetes · Kubernetes 1.9 Kubernetes 1.10 Kubernetes 1.11 Kubernetes 1.12 December 2017 March 2018 June 2018 September 2018 Kubernetes 1.13 December 2018

Aptoide - FOSDEM

Ceph storage with Rook - FOSDEM · Cloud-Native Storage Orchestrator Extends Kubernetes with custom types and controllers Automates deployment, bootstrapping, configuration, provisioning,

FOSDEM 2014

Organización y Compensación - OMD HR Academy - OMD HR

Pharo3 at Fosdem

Deploying PostgreSQL on Kubernetes - FOSDEM...What this is not I. A demo of me fiddling with terminals and window tiling techniques on the screen II. Me typing in Kubernetes commands

Aptpbo fosdem

Instructions for use OMD Technology MR-16 Underwater & Wet … · OMD Technology Instructions for Use of OMD MR-16 4 6. TECHNICAL SPECIFICATIONS OPERATING SPECIFICATIONS OMD MR-16

kubernetes @ chefkoch.de - Kubernetes Meetup Cologne

Institucional OMD

OMD (17.10.)

SP0709 DIBG-OMD-0630:OMD-OM DIBG

FOSDEM 2006

Ukrep OMD v letu 2019 - arsktrp.gov.si · Novosti ali spremembe v 2019 glede OMD Nova razmejitev OMD Sprememba PRP že vložena Sprememba Uredbe EK, KOPOP, OMD v teku (spremeni se

Presentation Opsi Fosdem