So I noticed that I get twice if not three times as much traffic as a Stratum 2 server and below and the traffic just keeps growing, which is to be expected. But, once I go back to my Stratum 1 server the traffic drastically drops.
@daniel.quick the only thing the system does with the stratum value is to record it and mark the server ābadā if the stratum is an unusual value (0 or higher than 6 if I remember right).
I canāt think of or have heard of why a client would behave differently in the two scenarios.
Did you see the pattern consistently? Maybe it just coincided with when your server (randomly) rotated through a popular DNS resolver? How did it match up with when you changed the ānet speedā of the server?
It request level coincided when the monitoring station discovered the server went from Stratum 1 to Stratum 2. The Stratum 2 server is a Windows 10 server running Domain Time II syncād to a local Symmetricom SyncServer S350 PTP Grandmaster. According to the live running stats the time is syncād to approximately +/- 0.000(+/-0-200). When the Stratum 1 is answering queries for NTP, Daytime, and Time itās getting ~12 queries a second. Number jumps quite a bit when itās turn for a heavier load comes around, but when the Windows 10 PC (Strat 2) starts answering it slowly builds to over hundreds if I dare say thousands of queries a second after a day running. When I switch the port forward back to the Stratum 1 it continues to handle the load, but once the monitoring station detects itās gone back to Strat 1 it almost instantaneously goes back to a drastically lower number of queries.
Sure. 68.97.68.79. Right now itās running the Strat 2 Windows 10 server. Iām also participating in the NTP Beta project so thereās also the other two monitoring stations reporting.
Donāt know if this is relevant. When server is stratum 2 it usually advertises root dispersion=0.
Running of stratum 2 it advertises root dispersion in 0.21 - 0.55.
I suspect the problem is with the poll value. What NTP implementation is running on the stratum 2 server and how it is configured? If the server responded with a small poll, some clients like ntpd would take that as a suggestion to increase their polling rate. The LeoNTP units had (or maybe still have) this problem too.
Root Dispersion is the total amount of uncertainty from the server to the S0 source. Since a S1 server is directly connected to a S0 source, it is usually 0. But each stratum below would figure out the delay and it gets compounded to the root delay, then ntp uses this value when deciding which servers are āthe bestā to use for itās time selection.
Iām using a program called Domain Time II from a company called Greyware. Further digging revealed this page, which may answer more of your questions. The answer to client queries was not immediately apparent to me on this single page.
Correction: āWhen server is stratum 2 it usually advertises root dispersion=0.
Running of stratum 1 it advertises root dispersion in 0.21 - 0.55 msec.ā
Sorry for typo. The point was that stratum 2 root dispersion is lower than stratum 1. Thatās not what I would normally expect.
Could it be because the machineās time is being synchronized via PTP which can in theory provide up to pico-second synchronization? my machines current offset is -0.0000018 Āµs. Since NTP measures in milliseconds Iām not sure of the value that that would be interpreted/translated into and served to NTP server-client queries.
It seems the poll value the server responds is hardcoded to 6. That might be ok in a local network, but doesnāt work well on Internet. All those ntpd clients having the default maxpoll of 10 would stay at 6, increasing their traffic by a factor of 16. If you see a 2-3x overall increase, it suggest that normally those clients would make about 10% of your traffic.
Thanks Iāll shift back to the stratum 1 server when I get home tonight. Iāll contact the company to see if this is a configurable parameter elsewhere in the file system.
@mlichvar is right. Here it is how does it look like with both of the two server:
remote refid st t when poll reach delay offset jitter
==============================================================================
-68.97.68.79 192.168.69.197 2 u 27 64 377 138.375 1.513 5.967
-68.97.68.79 .GPS. 1 u 46 1024 377 139.085 1.082 1.376
Look at the poll value. With the stratum two server it never went above 64 sec (2^6). With your stratum one server it went up to 1024 sec (2^10). That clearly explains the traffic quantity difference.
IIRC, they actually originally used a fixed poll of 0 ā probably under a theory of āletās set unnecessary fields to 0ā ā which led to ntpd clients using their minpoll setting.