Enhancement: Number of retries criteria #1620

Closed
opened 2026-04-05 16:30:08 +02:00 by MrUnknownDE · 0 comments
Owner

Originally created by @KoalaG on 9/13/2023

Is your feature request related to a problem? Please describe.
When you have a monitor that polls very frequently (say every minute), network issues, or server priorities may prevent the ping from happening succesfully, causing false positive incidents to be created.

Describe the solution you'd like
It would be great if there was a retry option, either implemented in the following ways:

  • On a failed polling, retry up to X times. e.g. if a ping fails on a probe, imediately follow it with another ping before reporting back to the probe-api.
  • As part of the Criteria, choose if any or all probes need to report the monitor as offline. This could be as simple as a checkbox and a number field to a) select the probes, and b) choose how many probes need to report offline for the criteria to be met. This would create an excellent benefit where you could mark a service as degraded if not reachable from only one probe, and still have a criteria to mark as offline if not reachable from all probes.
  • Probe fails X times in a row. e.g. If you have a monitor with an interval of every minute, then you may want the criteria to only match if the probe fails 5 times in a row.
  • Delay incident creation by X minutes. e.g Very similar to X times in a row, however, could be used differently. This solution could try testing the monitor again in X minutes, then, and then open an incident if still reporting as offline. This would have the benefit of trying again (but not straight away like the first suggestion), allowing a monitor to have a longer interval (say 30 minutes) but still get an incident created within a couple of minutes if the retry time is set to a low value (such as 2 minutes).

Describe alternatives you've considered
I haven't been able to find any alternative to achieve this.

Additional context
As a new user of OneUpTime, I recieved a few hundred emails over night from a ping monitor which was monitoring a DNS server. The server was never actually down, it's just the ping packets didn't come through due to network load at the time of attempt.

*Originally created by @KoalaG on 9/13/2023* **Is your feature request related to a problem? Please describe.** When you have a monitor that polls very frequently (say every minute), network issues, or server priorities may prevent the ping from happening succesfully, causing false positive incidents to be created. **Describe the solution you'd like** It would be great if there was a retry option, either implemented in the following ways: - On a failed polling, retry up to X times. e.g. if a ping fails on a probe, imediately follow it with another ping before reporting back to the probe-api. - As part of the Criteria, choose if any or all probes need to report the monitor as offline. This could be as simple as a checkbox and a number field to a) select the probes, and b) choose how many probes need to report offline for the criteria to be met. This would create an excellent benefit where you could mark a service as degraded if not reachable from only one probe, and still have a criteria to mark as offline if not reachable from all probes. - Probe fails X times in a row. e.g. If you have a monitor with an interval of every minute, then you may want the criteria to only match if the probe fails 5 times in a row. - Delay incident creation by X minutes. e.g Very similar to X times in a row, however, could be used differently. This solution could try testing the monitor again in X minutes, then, and then open an incident if still reporting as offline. This would have the benefit of trying again (but not straight away like the first suggestion), allowing a monitor to have a longer interval (say 30 minutes) but still get an incident created within a couple of minutes if the retry time is set to a low value (such as 2 minutes). **Describe alternatives you've considered** I haven't been able to find any alternative to achieve this. **Additional context** As a new user of OneUpTime, I recieved a few hundred emails over night from a ping monitor which was monitoring a DNS server. The server was never actually down, it's just the ping packets didn't come through due to network load at the time of attempt.
MrUnknownDE added the enhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancementenhancement labels 2026-04-05 16:30:12 +02:00
Sign in to join this conversation.
No Label enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement enhancement
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: github/oneuptime#1620