Command-Line Parameters
Refer to sdb-report send-alert for more information on the available command-line arguments.
Alerts Reference and Default Thresholds
Check + Description | Warn default | Fail default | Configurable? |
---|---|---|---|
leavesNotOnline Offline leaf nodes |
Fail if >=1 Leaf Offline | Yes | |
offlineAggregators Offline aggregator nodes |
Fail if >= 1 Aggregator offline | Yes | |
explainRebalancePartitionsChecker Identifies if partitions are not balanced across the cluster |
Any output | Yes Configure to switch this to Warning |
|
orphanDatabases Identifies if any orphan databases are found. Orphan databases should be examined and dropped |
Any output | Yes Configure to switch this to Failure |
|
pendingDatabases Identifies databases that are in a pending state. Pending databases are not available for read/write queries. |
Any output | Yes Configure to switch this to Warning |
|
unrecoverableDatabases Identifies databases that are unrecoverable. |
Any output | Yes Configure to switch to Warning |
|
userDatabaseRedundancy Determines if a database is redundant |
Any output | Yes Configure to Switch to Warning |
|
clusterMemoryUsage Checks free memory against total available |
Less than 15% of the memory available | Less than 10% memory available | Yes |
userDatabaseRedundancy |
High availability not enabled (not configurable) | Master Partition missing its replica partition | Yes |
secondaryDatabases Checks for the presence of secondary replicating databases |
Any output | Yes Configure to switch to Warning |
System Checks Thresholds
Check + Description | Warn default | Fail default | Configurable? |
---|---|---|---|
cpuIdle Checks the percentage of CPU idle time |
25.0% | 5.0% | Yes |
diskLatencyRead Determines the average time taken by the device to complete read requests |
10 ms | 25 ms | Yes |
diskLatencyWrite Determines the average time taken by the device to complete write requests |
10 ms | 25 ms | Yes |
diskUsage Checks free disk space and identifies if the disk is approaching its capacity limits |
70% | 80% | Yes |
diskInodesUsage Checks free disk inodes |
70% | 85% | Yes |
majorPageFaults Identifies the number of major page faults generated by the system per second |
10 majftl/s | 20 majflt/s | Yes |
swapUsage Checks the percentage of swap space used |
5% | 10% | Yes |
memoryCommitted Determines the percentage of memory required for a given workload |
70% | 90% | Yes |