You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/production-deployment/cloud/service-health.mdx
+5-3Lines changed: 5 additions & 3 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -63,7 +63,9 @@ avg_over_time((
63
63
64
64
## Detecting Activity and Workflow Failures
65
65
66
-
The metrics `temporal_cloud_v1_activity_fail_count` and `temporal_cloud_v1_workflow_failed_count` together provide failure detection for Temporal applications. These metrics work in tandem to give you both granular component-level visibility and high-level workflow health insights.
66
+
The metrics `temporal_activity_execution_failed` and `temporal_cloud_v1_workflow_failed_count` together provide failure detection for Temporal applications. These metrics work in tandem to give you both granular component-level visibility and high-level workflow health insights.
67
+
68
+
Note that `temporal_activity_execution_failed` is an SDK metric that must be collected from the Worker.
67
69
68
70
### Activity failure cascade
69
71
@@ -84,7 +86,7 @@ Generally Temporal recommends that Workflows should be designed to always succee
84
86
Monitor the ratio of workflow failures to activity failures:
0 commit comments