Mark Rogers wrote:
if loadavg (1min) > 4 for 2 cycles then alert if loadavg (5min) > 2 for 2 cycles then alert
On other words, if the 1 or 5 min load averages exceed the level shown for two or more consecutive tests (which take place every 2 mins) I'll get an alert.
My question is: what would be sensible load averages to set the thresholds at, or are they essentially meaningless and I should ditch them? The server has a AMD Athlon(tm) II Neo N36L Dual-Core Processor so as I understand it a load average of anything up to 2 means the CPU is under-utilised anyway?
I think my values for your situation are 10 (1min) and 4 (5min), but I might be misreading my configuration. As I understood it, Brett on IRC suggested warning at numcores+1 and alerting at numcores*2 for the 1min.
They're not the most meaningful of data, but there's few cases where you want them high, so I'd set some sort of alarms, but don't take it as the only measure.
Hope that helps,