Hi All,
I am trying to figure out the best way to communicate the SLA (i.e. not theocratically from SRE definitions) percentage number based on following criteria’s:
-
Percentage successful rate of specific requests? For example:
GET api/walletbalance/{$accountId}
Here is my NRQL
SELECT percentage(count(*), WHERE httpResponseCode like ‘2%’) FROM Transaction WHERE appName = ‘service-name’ FACET hourOf(timestamp) -
Evaluate downtime
- All requests failed for a period of time. For example: above mentioned request failed for 5 mins
then it should be equated as 5 mins downtime.
SELECT percentage(count(*),WHERE httpResponseCode like ‘5%’) FROM Transaction WHERE appName = ‘service-name’ FACET monthOf(timestamp)
- All requests failed for a period of time. For example: above mentioned request failed for 5 mins
So, if my SLA is 99.95% how do I translate these results in to that? Any recommendations is much appreciated. Thanks