Your data. Anywhere you go.

New Relic for iOS or Android


Download on the App Store    Android App on Google play


New Relic Insights App for iOS


Download on the App Store


Learn more

Close icon

Definitions of all the metrics in the Insights


#1

Hi Team,

I am new to this tool and for the application performance monitoring. I would like to know where i can find the documentation for all the metrics shows up in the INFRASTRUCTURE such as CPU % , Disk % , CPU % of process, Total utilization %, etc so that i can have those metrics in the insight dashboards.

Thanks


#2

Hi, @ramesh.ravichandran: You may find details on all standard attribute types in the product documentation.


#3

Thanks Phil. that’s really helpful :smile:


#4

Just some friendly advise. Be cautious to understand each of the metrics and what they provide. Many users at my company created metrics based on CPU and did not understand CPU is an aggregation of several items and dose not truly represent the actual CPU measurement. The way that I communicate with team I work with is this:

Insights should be used to show trending over time because sampling is not 100%. Troubleshooting should be done using APM.


#5

Hi, @reopelle.scott: Ramesh specifically asked about Infrastructure data. Has it been your experience that Infrastructure does not show actual CPU usage? It is my understanding that the data comes directly from the underlying operating system. You (and Ramesh) may find this post helpful:


#6

Thanks Phil. My idea is to create some dashboards in Insight and utilizing the charts provided in the infrastructure. This will help me to show how the Tableau server installed on the 7 hosts (which are AWS instance) performs and create some alerts to notify the group in case of any slowness on the server.


#7

FYI, if you like a particular chart in Infrastructure and you want to see the NRQL query used to create it, you can click the ellipsis menu in the upper-right corner and select View query:

infrastructure-chart-view-query


#8

Thanks much for the info. I have already played around with that :slight_smile:


#9

That’s 100% correct @philweber. I had to do a lot of reading in the attribute dictionary to understand the data is not always what it seems to be.

cpuPercent is an aggregate metric defined as

Total CPU utilization as a percentage. This is not an actual recorded value; it is an alias that combines percentage data from cpuSystemPercent, cpuUserPercent, cpuIoWaitPercent and cpuStealPercent.

cpuPercent can be helpful for trends but if you use it for alerting, you will chase your tail and generate a lot of pager fatigue!

To measure cpu strain I have had success using loadAverageFiveMinute. A measurement approaching 40 in my environment tells me the CPU is not able to keep up and login times, jobs, system tasks like patch scans, A/V scans, etc… begin to suffer or don’t run entirely.

This metric is defined as:

Over the last 5 minutes, the average number of system processes, threads, or tasks that are waiting and ready for CPU time.

Again though, you have to understand that this can be impacted by thread counts and other application configurations. Apps that generate tons of threads can quickly exhaust the constraints what a CPU can handle at any one time. I had to work with some teams to identify their issues, tweaking how many threads were running at any one time, even though the threads were tiny and the CPU appears relatively available. The CPU had to send “Waits” because it couldn’t keep up.