SRE/Observability: Difference between revisions
Revision as of 13:59, 3 January 2020
The starting point for observability resources at Wikimedia SRE.
Alerts
- icinga.w.o/alerts: central monitoring and alerting platform. See also Icinga.
- Alerting infrastructure roadmap PDF
Logs
- Kibana (a.k.a. Logstash): central logging platform. See also Logstash.
- Logging infrastructure design document PDF
Metrics
- grafana.w.o: central observability platform. See also Grafana.
- Prometheus, recommended and supported metrics toolkit
- Graphite, supported but deprecated time series framework
- StatsD, supported but deprecated metrics aggregation