SRE/Observability: Difference between revisions
< SRE
Content deleted Content added
m Jobo moved page Observability to SRE/Observability: consistency with all SRE |
No edit summary |
||
Line 1: | Line 1: | ||
{{Observability/Navigation}} |
{{Observability/Navigation}} |
||
<div class="mw-collapsible innercollapsed"><div class="mw-collapsible-toggle toccolours" style="float:none;text-align:left;font-size: 1.2em;background:#efefef;border:0px solid #9c3434;border-top:10px solid #ffffff;border-bottom:5px solid #9c3434">'''SRE Observability'''<div class="floatright">▼</div></div> <div class="mw-collapsible-content"> |
|||
<div style="border:4px solid #FFFFFF;background:#F7F7F7;padding: 10px"> |
|||
{|class="sortable" |
|||
! |
|||
[[SRE/Observability|SRE Observability]] - Monitoring and Logging (Prometheus/Grafana and ElasticSearch, plus some Kafka). |
|||
The Observability team, or "o11y" for short, works across SRE and Technology to provide teams with tools, platforms and insights into how systems and services are performing. It leverages technologies such as Grafana, Kibana/Logstash, Prometheus, AlertManager and more. |
|||
|- |
|||
|} |
|||
</div></div></div> |
|||
The starting point for observability resources at Wikimedia SRE. |
The starting point for observability resources at Wikimedia SRE. |
Revision as of 13:50, 21 June 2021
OKR's
Intake Standards
SRE Observability
▼
SRE Observability - Monitoring and Logging (Prometheus/Grafana and ElasticSearch, plus some Kafka). The Observability team, or "o11y" for short, works across SRE and Technology to provide teams with tools, platforms and insights into how systems and services are performing. It leverages technologies such as Grafana, Kibana/Logstash, Prometheus, AlertManager and more. |
---|
The starting point for observability resources at Wikimedia SRE.
Alerts
- icinga.w.o/alerts: central monitoring and alerting platform. See also Icinga.
- Alerting infrastructure roadmap PDF
Logs
- Kibana (a.k.a. logstash): central logging platform. See also Logstash.
- Logging infrastructure design document PDF
Metrics
- grafana.w.o: central observability platform. See also Grafana.
- Prometheus, recommended and supported metrics toolkit
- Graphite, supported but deprecated time series framework
- Statsd, supported but deprecated metrics aggregation
- Observability/Dashboard_guidelines, ideas towards better dashboards