- Notifications
You must be signed in to change notification settings - Fork928
Open
Description
Related to#16482
Why
Customers need proactive alerts when users experience unexpected workspace disconnects (network blips, agent crashes). Current Prometheus metrics (coderd_agentstats_connection_count
,coderd_agents_connections{status="disconnected"}
) only expose agent ↔ coderd state and cannot distinguish intentional session closures from unexpected client↔agent drops.
Proposal
- Emit Prometheus metrics that increment on:
- graceful (user-initiated) disconnects
- ungraceful (timeout/crash/network) disconnects
- Document new metrics in
docs/admin/integrations/prometheus
with example alert rules. - Ensure metrics are exposed per workspace and, if feasible, per user for targeted alerting.