Brussels / 1 & 2 February 2025


From Error to Alert using FOSS-Tools


At Jointech, we needed a scalable, multi-tenant monitoring stack, so we built a production-ready system based on Grafana Mimir and Loki on Kubernetes. In this presentation, we share best practices for setting up an LGTMA (Loki, Grafana, Tempo, Mimir, Alloy) stack using open-source tools. Mimir handles multi-tenant metric collection, Loki handles log aggregation, Alloy serves as a vendor-neutral metrics collector, and Grafana is the universal visualization tool. We provision and manage the underlying infrastructure with OpenTofu, Ansible, and Flux.

By integrating Grafana OnCall with Zammad, we have built an efficient and scalable on-call solution. It lets our small team of Site Reliability Engineers (SREs) rest easy while maintaining best-in-class uptime for our customers.

Speakers

Claudi Grimm