From Error to Alert using FOSS-Tools
- Track: Monitoring and Observability
- Room: UD2.120 (Chavanne)
- Day: Sunday
- Start: 16:30
- End: 17:00
- Video only: ud2120
- Chat: Join the conversation!
At Jointech, we identified the need for a scalable, multi-tenant capable monitoring stack, which led us to establish a production-ready system utilizing Grafana Mimir and Loki on Kubernetes. In this presentation, we will share best practices for setting up an LGTMA (Loki, Grafana, Tempo, Mimir, Alloy) stack using open-source tools. Mimir will be employed for multi-tenant metric collection, Loki for log aggregation, Alloy as a vendor-neutral metrics collector, and Grafana as the universal visualization tool. To ensure that our infrastructure is optimal, we use tools such as OpenTofu, Ansible, and Flux.
By integrating Grafana OnCall with Zammad, we have developed an efficient and scalable on-call solution. This enables our small team of Site Reliability Engineers (SREs) to rest assured while maintaining best-in-class uptime for our customers.
Speakers
Claudi Grimm |