Alerts let you set up customizable alert policies for anything that you can monitor. Receive notifications for fluctuations in key performance metrics as data streams in from all of our products, including APM, infrastructure, and Kubernetes.
Alerts can be configured to notify you when certain conditions are met, such as when a metric value exceeds a certain threshold or when a log event occurs that matches a specific pattern. In addition, alerts can be configured to send notifications to Slack or Gmail, making it easier to stay informed about critical events.
- Navigate to “Alerts” from the left menu
- Go to the “Rules” tab and click on “Create new”
Select Monitor type
Select the Monitor type corresponding to the kind of alert you want to receive,
- Metrics: to get alerts on service check and query alert
- APM: query alert or trace-analytics.
- Logs: To get alerts for specific log
- Kubernetes: Get alert on kubernetes
Configure your monitor by defining the threshold and frequency etc. based on your selected monitor type in previous step, you can configure your alert rule.
- Threshold: it will help you to decide on what value you want to get alert and host will help you to get alert only on specific host
- Frequency: There will be 2 types of frequency. Single and multiple
- If you select single then it will send you only one alert on decided time and if you select multiple then it will give multiple alerts you can also decide the number of alerts when you select multiple alerts
- Time window: This will define how much time you want to alert, you can define your custom time as well
Select a threshold from the predefined list, you can choose only one threshold at a time. Threshold can help you to define the rule for your alerts.Here’s the details of some of the Thresholds.
- CPU % :- CPU % is a metric that measures the percentage of time that a CPU (Central Processing Unit) is busy processing data
- CPU I/O Wait % :- CPU I/O wait % is a metric that measures the percentage of time that a CPU is idle while waiting for input/output (I/O) operations to complete
- Disk write utilization MB :- Disk write utilization MB is a metric that measures the amount of data written to a disk over a period of time, typically in megabytes (MB).
- Disk read utilization MB :- Disk read utilization MB is a metric that measures the amount of data read from a disk over a period of time, typically in megabytes (MB).
- Load average in one minute :- Load average in one minute is a metric that measures the average number of processes that are running or waiting to run on a system over the last one minute. It is a measure of the workload on the system’s CPU.
- Memory used % :- Memory used % is a metric that measures the percentage of the total available physical memory that is currently being used by the system.
Set alert condition
You can specify the threshold value at which you wish to notify, as well as other levels such as Warning, and you can also construct custom thresholds.
Notify your team
Here you can define the name of the alert, tags with the alert’s state, and where you wish to be notified, such as slack or Gmail.
To receive alerts on slack, you must have integrated slack with Middleware.
We have created the default alerts for metrics, logs, and K8s so you don’t need to create manual alerts and get notified automatically for these alerts. This feature comes with some default rules that you can turn on or off and customize based on your needs. These default alerts will only fire up to 5 times when the condition matching the rule is met.
Maintenance Window allows users to specify a time frame during which alerts will not be fired. If the user sets a maintenance window, the alert will be suppressed during that time period. This feature enables users to perform routine maintenance and updates without being constantly interrupted by alert notifications.
Setup maintenance window
Go to “Maintenance” tab from alerts.
Click on the “Schedule” button to begin creating a new maintenance window.
Enter a name for the maintenance window. ex: “System Updates” or “Database Maintenance.”