May I invite you to review especially the first item in the âAdditional Notesâ section on the âHow to use the poolâ page, as well as the Terms of Service, especially section 3, subsection (c).
The pool has a monitoring system in place to weed out bad servers, and is currently working on improving said monitoring system. But there is just so much that can be done on such a scale. At the end of the day, the user needs to evaluate whether the pool is suitable for their needs, and either have mechanisms in place to deal with potential issues. Or use sources that are more suitable to their needs. E.g., use a few hand-picked, reputable public servers. Or even run their own NTP upstream server(s), with a wide range as to what that could look like (from just using a self-operated server, or set of servers, as âbufferâ to shield downstream clients from the external world, to running oneâs own stratum 1 server, in turn spanning a wide range of potential options as to what that could look like).
E.g., when some instances of the pool reporting bad time leads to widespread outage on your side, maybe the pool is not the right time source for your systems.
Or, your infrastructure is not up to to your requirements. You write that you have âtypical ntp infra (which would have picked up the best time from across all reporting ntp instances and hosts)â. But the issues you report seem to contradict that statement.
As discussed above, picking up âthe best timeâ is but one part of what needs to be done for robust timekeeping. Another part is to properly vet the time received from upstream sources. E.g., to check whether it is consistent and makes sense. And refuse to update the time if it is not. Certainly when the risk of bad time is widespread outage.
And while it is not completely out of the picture that multiple servers report bad time simultaneously, and similarly bad time so as to have consensus, a sensible NTP client would simply refuse to make clock corrections as big as you describe even when received from multiple sources. Unless the bad time is consistently served over some period of time, and not a one-off, or just over a short period of time.
So Iâd invite you to reassess whether your current infrastructure as it is, and it getting time from the pool, is suitable to your needs, and to adjust accordingly if not. This thread contains some suggestions for what that could look like, and there is more elsewhere in this forum. Or ask for advice.