Movatterモバイル変換


[0]ホーム

URL:


Jump to content
Wikitech
Search

Portal:Toolforge/Admin/Runbooks/BuildsApiUpMetricUnknown

From Wikitech
<Portal:Toolforge |Admin |Runbooks

Toolforge Admin

[edit]

This happens when prometheus has no data from k8s on the builds-api pod.

The procedures in this runbook requireadmin permissions to complete.

Error / Incident

This usually comes in the form of analert in alertmanager.

There you will get which project (tools, toolsbeta, ...) is the one it's failing for.

Debugging

This is a tricky one and it will be related to the way we gather metrics on tools/toolsbeta.

Note that this is not directly related to themetricsinfra monitoring project, but toolforge's own setup.

You can start by going to the project's prometheus page and trying to get the stats there, example for tools:

https://tools-prometheus.wmflabs.org/tools/graph?g0.expr=up%7Bjob%3D%22builds-api%22%7D&g0.tab=1&g0.stacked=0&g0.show_exemplars=0&g0.range_input=1h

Common issues

Add new issues here when you encounter them!

Prometheus k8s cert expired

If build-api seems up, you can check if the certificates that prometheus uses to connect to k8s have expired (there should have been another alert though)Portal:Toolforge/Admin/Runbooks/PrometheusK8sCertExpirySoon.

Related information

Old incidents

Add any incident tasks here!

Retrieved from "https://wikitech.wikimedia.org/w/index.php?title=Portal:Toolforge/Admin/Runbooks/BuildsApiUpMetricUnknown&oldid=2304238"
Categories:

[8]ページ先頭

©2009-2025 Movatter.jp