More precise (sensible, sensitive) server monitoring score

I am wondering whether this retry logic still makes sense.
With the new multi-location monitoring system in place, servers get very good scores in general. We may want more precise measurement, even knowing about a single packet loss and not smoothing the offset value. So I suggest changing the line:

cfg.Samples = 3

to be:

cfg.Samples = 1

in monitor/client/monitor/monitor.go (main branch of the ntppool/monitor repository on GitHub)


servers get very good scores in general

Always a good reminder that NA/EU networking is pretty good, but outside of that, fluctuations will happen.

Maybe remove retries once more global PoPs exist.


Rather, there are fluctuations as soon as packets hop onto an undersea cable, even between the EU and NA.

The reason being that different probes may go through wildly different, long latency routes at different times.


I agree with @NTPman that more precise monitoring would be better. Under the assumption that the monitors represent the pool clients, any packet loss should be taken into account. Even if the cause of the packet drops is outside a server operator’s influence, it still potentially impacts clients.

To account for sporadic packet drops and the resiliency of NTP clients against such drops, the point penalty for a network timeout could be decreased if the decision is made not to retry unanswered queries.

Which leads to the more general question: how harshly should the monitoring punish packet drops?

  • After how many consecutive unanswered queries should a server be considered offline and dropped from the pool? (currently: 6, in 2 bursts of 3 packets each)
  • How many packet drops should be allowed on average until a server is not considered reliable enough for the pool? (currently: up to 2/3 of packets can be lost without consequence…)

Let me revive this old topic (the issue is painful to me).

My server has a score of 20 from 95 monitors, and a score below 20 from only 17 monitors.

A score of 20 from a particular monitor means there was no timeout over many, many monitoring runs.

I do not think Internet quality has improved so much since the introduction of the new monitoring system that NTP packet loss has become such a rare event.

I think the current version of the monitoring system hides valuable data.

In one particular run from one monitor to one server, multiple probes are sent (3 at the moment), and if any probe succeeds, the full set of samples is considered a success.

What data is hidden, or lost? The distinction between two monitors: one where all three probes always succeed, and another where only some of the three probes succeed for a given NTP server. Both monitors give a score of 20 in the long run, but that is unfair.
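A minimal sketch of that semantics in Go (the names are hypothetical, not the actual ntppool/monitor code):

package main

import "fmt"

// batchOK mirrors today's rule: a run counts as a success
// if any one of its probes got a reply.
func batchOK(replies []bool) bool {
    for _, ok := range replies {
        if ok {
            return true
        }
    }
    return false
}

func main() {
    perfect := []bool{true, true, true} // no loss
    lossy := []bool{true, false, false} // two of three replies lost
    fmt.Println(batchOK(perfect), batchOK(lossy)) // both print: true
}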

Up to this point it may sound theoretical, so let’s take an example from the real world.

I selected an NTP server that is reachable, but not perfectly, from my test monitor (frlys1-355n9ds) in the beta system: 111.198.57.33.

tumbleweed:~ # tcpdump -nn -r ntp1.pcap | grep -E '^(06:5|07:0).*111.198.57.33'
reading from file ntp1.pcap, link-type EN10MB (Ethernet), snapshot length 262144
06:51:20.083038 IP 192.168.1.2.53697 > 111.198.57.33.123: NTPv4, Client, length 48
06:51:20.223770 IP 111.198.57.33.123 > 192.168.1.2.53697: NTPv4, Server, length 48
06:51:22.224914 IP 192.168.1.2.37661 > 111.198.57.33.123: NTPv4, Client, length 48
06:51:22.483003 IP 111.198.57.33.123 > 192.168.1.2.37661: NTPv4, Server, length 48
06:51:24.484301 IP 192.168.1.2.60743 > 111.198.57.33.123: NTPv4, Client, length 48
06:51:24.730062 IP 111.198.57.33.123 > 192.168.1.2.60743: NTPv4, Server, length 48
06:55:34.676930 IP 192.168.1.2.38994 > 111.198.57.33.123: NTPv4, Client, length 48
06:55:34.886060 IP 111.198.57.33.123 > 192.168.1.2.38994: NTPv4, Server, length 48
06:55:36.887033 IP 192.168.1.2.34582 > 111.198.57.33.123: NTPv4, Client, length 48
06:55:37.044173 IP 111.198.57.33.123 > 192.168.1.2.34582: NTPv4, Server, length 48
06:55:39.045276 IP 192.168.1.2.43220 > 111.198.57.33.123: NTPv4, Client, length 48
06:55:39.240575 IP 111.198.57.33.123 > 192.168.1.2.43220: NTPv4, Server, length 48
06:59:42.921494 IP 192.168.1.2.51422 > 111.198.57.33.123: NTPv4, Client, length 48
06:59:43.068721 IP 111.198.57.33.123 > 192.168.1.2.51422: NTPv4, Server, length 48
06:59:45.069648 IP 192.168.1.2.45790 > 111.198.57.33.123: NTPv4, Client, length 48
06:59:50.072415 IP 192.168.1.2.56603 > 111.198.57.33.123: NTPv4, Client, length 48
06:59:50.219855 IP 111.198.57.33.123 > 192.168.1.2.56603: NTPv4, Server, length 48
07:04:14.799439 IP 192.168.1.2.51640 > 111.198.57.33.123: NTPv4, Client, length 48
07:04:14.950563 IP 111.198.57.33.123 > 192.168.1.2.51640: NTPv4, Server, length 48
07:04:16.951960 IP 192.168.1.2.54686 > 111.198.57.33.123: NTPv4, Client, length 48
07:04:17.165959 IP 111.198.57.33.123 > 192.168.1.2.54686: NTPv4, Server, length 48
07:04:19.167400 IP 192.168.1.2.38934 > 111.198.57.33.123: NTPv4, Client, length 48
07:04:19.383559 IP 111.198.57.33.123 > 192.168.1.2.38934: NTPv4, Server, length 48
07:08:21.774295 IP 192.168.1.2.55438 > 111.198.57.33.123: NTPv4, Client, length 48
07:08:21.931311 IP 111.198.57.33.123 > 192.168.1.2.55438: NTPv4, Server, length 48
07:08:23.931737 IP 192.168.1.2.40330 > 111.198.57.33.123: NTPv4, Client, length 48
07:08:24.082025 IP 111.198.57.33.123 > 192.168.1.2.40330: NTPv4, Server, length 48
07:08:26.082575 IP 192.168.1.2.33998 > 111.198.57.33.123: NTPv4, Client, length 48
07:08:26.294138 IP 111.198.57.33.123 > 192.168.1.2.33998: NTPv4, Server, length 48
tumbleweed:~ # 

and

tumbleweed:~ # curl -s 'https://web.beta.grundclock.com/scores/111.198.57.33/log?limit=200&monitor=frlys1-355n9ds' | grep -E ' (06:5|07:0)'
1765696106,2025-12-14 07:08:26,0.006772629,1,19.999845505,128,frlys1-355n9ds,150.288,,
1765695859,2025-12-14 07:04:19,-0.002198036,1,19.999837875,128,frlys1-355n9ds,151.12,,
1765695590,2025-12-14 06:59:50,-0.000376309,1,19.999828339,128,frlys1-355n9ds,147.315,,
1765695339,2025-12-14 06:55:39,0.003695062,1,19.999820709,128,frlys1-355n9ds,157.169,,
1765695085,2025-12-14 06:51:25,0.002075839,1,19.999811172,128,frlys1-355n9ds,140.879,,
tumbleweed:~ # 

The sample at 06:59:50 is considered good. However, in the packet capture you can see that the reply to the second (middle) probe is lost. (The default packet spacing is two seconds, plus three seconds waiting for the reply packet, which accounts for the 5-second spacing to the next probe: 06:59:50.072415 - 06:59:45.069648 = 5 seconds + 0.003 sec processing time.)

The score of the NTP server 111.198.57.33 is 20 from the monitor frlys1-355n9ds in the beta system, when it shouldn’t be.

I suggest the following change in the monitoring code: make the number of probes a run-time configurable parameter. The code should run properly both when this parameter equals three (as today) and when it equals one.

Then, as a next step, the production monitors would use the parameter value 3 (not affecting production), and the beta monitors would use the value 1 (to gain experience in the beta/test environment).
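A minimal sketch of what such a run-time parameter could look like, assuming a hypothetical MONITOR_SAMPLES environment variable (the actual configuration mechanism in ntppool/monitor may well differ):

package main

import (
    "fmt"
    "os"
    "strconv"
)

// sampleCount returns the number of probes per batch: 3 by default
// (today's behavior), overridable to 1 for the beta experiment.
// MONITOR_SAMPLES is a hypothetical variable name.
func sampleCount() int {
    if v := os.Getenv("MONITOR_SAMPLES"); v != "" {
        if n, err := strconv.Atoi(v); err == nil && n >= 1 {
            return n
        }
    }
    return 3
}

func main() {
    fmt.Println("probes per batch:", sampleCount())
}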

I think the parameter can already be configured at run time, i.e., it does not require changes to the client binaries.

But you’re still ignoring my concern from the other topic: by changing the number of queries to one, we would not be able to detect whether a server applies overly aggressive rate limiting. For example, chrony on Rocky Linux 9 with the iburst option seems to send five queries at a two-second interval at startup. Besides, our great leader Ask said: “One reason we do multiple queries is to detect servers with overly aggressive rate limits.”

I’d still rather not touch the current scoring algorithm or the number of queries sent in a batch. But if more data is desired, the monitors could report to the pool monitor management server the number of queries sent, the number of good responses received, and the number of error/timeout responses. This would be only for collecting more data, without touching the scoring at all yet.
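A rough sketch of the extra per-batch statistics this would mean reporting (the field names are made up for illustration; the real reporting protocol would differ):

package main

import "fmt"

// BatchReport carries per-batch counters alongside the existing
// result, without feeding into the scoring yet.
type BatchReport struct {
    Sent     int // queries sent in the batch
    Good     int // valid responses received
    Timeouts int // queries that got no answer
    Errors   int // KoD, malformed, or otherwise bad responses
}

func main() {
    // One lost reply out of three, as in the capture above.
    r := BatchReport{Sent: 3, Good: 2, Timeouts: 1}
    fmt.Printf("%+v\n", r)
}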


No, we don’t.
That would mean the monitors that are not in charge of your scoring have to work harder.
Why?

The monitors that score your server give enough data.

I do not want to see my monitor being overloaded just because of this.

An active monitor needs to examine whether you are a good ticker or not.
You cannot make it more precise this way, as the ping time is corrected for.

Making it ‘ping’ more won’t help. I fail to see your point.

The only way to make it more precise is by making the offset figure more strict.
But will this make it more accurate or just drop more servers?

Making scoring more sensible, yes, that may help: make the rms-freq-error more strict for scoring.

Will this help? Maybe.

Question is, how much are servers off-time at all?

Less work: one third the number of packets sent relative to today’s situation.


How? Currently the non-active monitors send about a third of the packets that the active monitors do, as you can see when you check the graphs.

However, how will this improve accuracy?

I’m trying to understand your point of view.

Polling more won’t change accuracy. I do not see how.

Three queries are sent, and if only one reply is received, the sample is considered perfect (if there is no KoD packet). I fail to see how aggressive rate limiting translates into a decrease in score.

That does not change.

We will see the difference between very good monitor-server connectivity and medium-quality monitor-server connectivity (such as some sporadically occurring packet loss). Today the second case is reported as a very good (no packet loss) situation.

Monitors are checked…

Why do you think the monitors are wrong/off?

Packet-loss is an internet problem…not a monitor problem.

I fail to see your issue.

I have to admit that I forgot the system was changed to behave that way some months ago. Previously, all the sent requests were expected to be replied to.

But still, how did you think you would detect rate-limiting issues with only one query? I’d think it would require several queries sent at two-second intervals or so.

What do you think of my proposal to keep the current number of queries in a batch, but report them independently to the pool management server? I’d think it’d be the best of both worlds, i.e., it would detect rate-limiting issues AND it would detect single requests getting dropped.

Hmm, strictly speaking, it means that the server meets the quality criteria currently encoded in the monitors and scoring system to the highest degree, but not necessarily that it is perfect. The NTP protocol is somewhat immune to some disturbance, and as the monitoring system’s role is to assess each server based on its suitability to serve time to clients, that is what the system reflects. A sanity check, if you will, e.g., that there isn’t so much packet loss that it would impair getting time, or too large an offset, or some other anomaly. And it is certainly not a beauty contest or a competition over who will get the best scores, though I concede that that probably plays a role in motivating server operators to add servers to the pool, and to maintain them in good condition.

I don’t see how that data would enhance the purpose of the monitoring system, which is to assess the suitability of a server to serve time and be included in the pool.

I understand the point, and think it is important to address this perceived unfairness. But I don’t think making each individual probe count is the way to go. From a Global North perspective, that may work very well. But as the pool still aspires to be a truly global project, conditions in the Global South, where infrastructure is in fact more expensive and less reliable, also need to be considered. I fear that would kick many a server out of the pool, aggravating an already bad situation in arguably the largest part of the world (by population). Just two or three lost packets would kick a server out of the pool.

Usually, I am all for just trying things out, which gives better answers than debating theoretically. But to be fair in that context, it would need a level playing field, and good representation from each place that the pool aspires to serve. And I see that even less with the beta system than with the production system.

Not related to a server, but as an analogy to highlight the problem: some of my monitors, particularly in Asia, more or less frequently take themselves out of the monitoring and thus don’t contribute to the scoring of local and regional servers, despite their time being fine compared to good local reference clocks. Why? Because they are being assessed against reference servers far away, sometimes halfway around the world, with one saying it is +20 ms off, the other -20 ms, and only one can be true. So already this introduces a serious bias into the whole system.

Though I recognize that differentiating a bit more would be considered fairer, so how about scaling the steps that the scores can take: e.g., subtract not the full 5 points when one out of three packets gets lost, but only about half a point. Or add less than a full point. See the sketch below.
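One possible shape of such scaled steps, as a sketch only (the 5-point figure is the timeout penalty mentioned above; the half-point-per-loss scaling is just an example):

package main

import "fmt"

// penalty returns the points to subtract for a batch: the full
// timeout penalty only when nothing answered, otherwise a small
// amount per lost packet.
func penalty(lost, sent int) float64 {
    if lost >= sent {
        return 5.0 // complete timeout: full penalty as today
    }
    return 0.5 * float64(lost) // e.g. half a point per lost packet
}

func main() {
    fmt.Println(penalty(0, 3), penalty(1, 3), penalty(3, 3)) // 0 0.5 5
}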

PS: I am currently seeing at least one of my monitors sending four probe packets, not three.


Preload the server with an arbitrary number of packets, spaced 2 seconds apart, and take only the last packet into consideration for scoring.
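A sketch of that idea, with ntpQuery as a hypothetical stand-in for the monitor’s real query routine:

package main

import (
    "fmt"
    "time"
)

// ntpQuery stands in for sending one client-mode NTP packet and
// waiting up to a few seconds for a valid reply.
func ntpQuery(server string) bool {
    // ... a real implementation would do the network I/O ...
    return true
}

// probeWithPreload sends warm-up queries spaced 2 seconds apart to
// provoke any rate limiting, then scores only the final query.
func probeWithPreload(server string, warmup int) bool {
    for i := 0; i < warmup; i++ {
        _ = ntpQuery(server) // replies intentionally ignored
        time.Sleep(2 * time.Second)
    }
    return ntpQuery(server) // only this reply counts for the score
}

func main() {
    fmt.Println(probeWithPreload("111.198.57.33", 2))
}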

In fact, a single packet loss is already reflected: the score decreased from 19.982591629 to 19.959260941. But that decrease is hardly visible.

Yes, but why is this a problem?
Nobody can score 20 at 100%.

As some routes are simply bad.

That is why we changed from 1 monitor to 50 or so monitors.

And only the monitors that score you well (presuming your server is ticking OK 24/7) will be selected to monitor you for scoring.

Those determine your score. All the rest don’t matter anymore.

Beware that NTP traffic is UDP, which is often lost; it’s a send-and-forget protocol, so packet loss isn’t uncommon. Typically, when routes are congested, the UDP packets get dropped.

In short, scores will never be perfect across all monitors. But in the past we had only 1 monitor, and if your packets were dropped too much, your system was taken out of the pool and you got emails, even when your server was perfectly fine.

The new system prevents this from happening, and you ONLY drop out of the pool if ALL monitors score you badly.

The current system keeps your server in the pool, regardless if packets are lost…that is the point of the multi-monitor system. The old system did take too many systems out because it had just 1 monitor and a few dropped packets got you removed and emailed.

The monitors have no other purpose; if you score 20, you are fine. It doesn’t matter how many monitors are active for you, it could be just 1 or 50; what matters is that they report your system to be fine.

Otherwise you get taken out of the pool. Forget the monitors that score badly, they do not matter.

Since you replied, let me come back to the topic. I realized that if one reply packet out of the three sent in a batch is lost, the score decreases a bit. If this happens continuously in every batch, typically due to misconfigured rate limiting, the score will slowly dive under 10.

On the graph a score of 20 is shown for most of the monitors. However, some are typically under 20 and above 19.95 (rounded up to 20 for display); one can check this in the log.

I made a measurement on one of my servers.

On the graph 99 monitors show score 20, and 13 monitors are displayed under score 20.

From the log, 71 monitors have a score of 20, 28 monitors’ scores are less than 20 and greater than or equal to 19.95, and 13 monitors are under 19.95.


Again, why is this a problem…see mine…

It’s the same, but these monitors don’t matter for my scoring.

The active ones do…

The testing ones do not.

So what is the problem? I really fail to see what the issue is.

We all have candidates that score badly, so what? Those scores matter nothing, just that they have a different score. But those scores mean NOTHING.

So I ask again, what are you after? Or what is the problem? I really do not understand the issue.

Nobody scores 20 at ALL servers, nobody. And besides, only active monitors score you; the rest are NOT selected and thus don’t score you to be in the pool.

I do not get your problem.

The quality of the Internet is not so good that so many monitors would have a score of 20. But my problem is (somewhat) solved, since many monitors’ scores aren’t really 20, just rounded up to 20 for display. On a single packet loss the score drops only a tiny bit. I would like a somewhat bigger decrement, to be able to visually differentiate from the monitors where there is really no packet loss. But that is not the way it is implemented now.

Yes, another thread. I am trying to stay organized.

I suggest having a target score based on the ratio of lost packets in a batch.
That target would be reached asymptotically if the packet loss ratio stays the same.

Number of reply packets received / number of monitoring packets sent in a batch => TargetScore

For example, for the batch size of three:

0/3 => -100
1/3 => -20
2/3 => 0 (must be lower than 10 for those NTP servers having too aggressive rate limiting)
3/3 => 20

Of course, the target score value could be influenced by other parameters, like a high-offset reply, a KoD packet received, a zero-size NTP reply packet received, and so on.

For practical purposes, I suggest treating those conditions as packet loss. In the case of a high offset, record the worst offset value in the log, not the best or the average.

Evaluation of the new score for a monitor after a test batch:
NewScore = (PreviousScore - TargetScore) / 2 + TargetScore
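A small simulation of this rule (the target table and the update formula are from the proposal above; the names and structure are just illustration):

package main

import "fmt"

// targetScore maps replies received out of a 3-packet batch to the
// proposed asymptotic target.
func targetScore(received int) float64 {
    switch received {
    case 0:
        return -100
    case 1:
        return -20
    case 2:
        return 0
    default:
        return 20
    }
}

// step applies NewScore = (PreviousScore - TargetScore)/2 + TargetScore.
func step(prev float64, received int) float64 {
    t := targetScore(received)
    return (prev-t)/2 + t
}

func main() {
    // One lost packet per batch (2 of 3 received): the score halves
    // toward the target 0, i.e. 20 -> 10 -> 5 -> 2.5 ...
    score := 20.0
    for batch := 1; batch <= 4; batch++ {
        score = step(score, 2)
        fmt.Printf("batch %d: %.3f\n", batch, score)
    }
}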

A server starting with a score of 20, after two batches each with one packet loss (2/3 received, so the target is 0), ends up at score 5 (20 → 10 → 5). That looks very aggressive, and it is. However, in the case of a correctly working server there will always be a handful of monitors where the score is bigger than 10, even 20. That will make the overall score bigger than 10, as we wish. Let’s have a simulation with one of my production servers.

The result of the change:

Before the change: 99 monitors on score 20, 13 monitors below 20.

After the change: 71 monitors on score 20, 41 monitors below 20.

I suggest trying this in the test pool.