Hello,
First post here but been lurking a while. Hope you can help.
Saw weird stuff today as below:
These logs from beta. But same seen on production monitors.
1583683084,"2020-03-08 15:58:04",-0.000316214,1,-21.5,23,Amsterdam,0,
1583683084,"2020-03-08 15:58:04",-0.000316214,1,-59.7,,,0,
1583682723,"2020-03-08 15:52:03",0.005422785,-5,-26.3,22,"Los Angeles, CA",,"bad stratum 0 (referenceID: 0xa077d8c5, �w��)"
1583682723,"2020-03-08 15:52:03",0.005422785,-5,-63.9,,,,"bad stratum 0 (referenceID: 0xa077d8c5, �w��)"
1583682476,"2020-03-08 15:47:56",0.001815355,-5,-29.3,20,"Newark, NJ, US",,"bad stratum 0 (referenceID: 0xa077d8c5, �w��)"
1583682476,"2020-03-08 15:47:56",0.001815355,-5,-62,,,,"bad stratum 0 (referenceID: 0xa077d8c5, �w��)"
1583682191,"2020-03-08 15:43:11",0.002304153,-5,-23.7,23,Amsterdam,,"bad stratum 0 (referenceID: 0xa077d8c5, �w��)"
1583682191,"2020-03-08 15:43:11",0.002304153,-5,-60,,,,"bad stratum 0 (referenceID: 0xa077d8c5, �w��)"
1583681943,"2020-03-08 15:39:03",0.005632071,-5,-22.4,22,"Los Angeles, CA",,"bad stratum 0 (referenceID: 0xa077d8c5, �w��)"
1583681943,"2020-03-08 15:39:03",0.005632071,-5,-57.9,,,,"bad stratum 0 (referenceID: 0xa077d8c5, �w��)"
1583681639,"2020-03-08 15:33:59",0.001871698,-5,-25.6,20,"Newark, NJ, US",,"bad stratum 0 (referenceID: 0xa077d8c5, �w��)"
1583681639,"2020-03-08 15:33:59",0.001871698,-5,-55.6,,,,"bad stratum 0 (referenceID: 0xa077d8c5, �w��)"
1583681317,"2020-03-08 15:28:37",0.001873217,-5,-19.7,23,Amsterdam,,"bad stratum 0 (referenceID: 0xa077d8c5, �w��)"
1583681317,"2020-03-08 15:28:37",0.001873217,-5,-53.3,,,,"bad stratum 0 (referenceID: 0xa077d8c5, �w��)"
1583681054,"2020-03-08 15:24:14",0.006278902,-5,-18.3,22,"Los Angeles, CA",,"bad stratum 0 (referenceID: 0xa077d8c5, �w��)"
1583681054,"2020-03-08 15:24:14",0.006278902,-5,-50.8,,,,"bad stratum 0 (referenceID: 0xa077d8c5, �w��)"
1583680809,"2020-03-08 15:20:09",0.002927684,-4,-21.7,20,"Newark, NJ, US",,"bad stratum 15"
1583680809,"2020-03-08 15:20:09",0.002927684,-4,-48.3,,,,"bad stratum 15"
1583680513,"2020-03-08 15:15:13",0.00423773,-4,-15.5,23,Amsterdam,,"bad stratum 15"
1583680513,"2020-03-08 15:15:13",0.00423773,-4,-46.6,,,,"bad stratum 15"
1583680217,"2020-03-08 15:10:17",0.007069186,-4,-14,22,"Los Angeles, CA",,"bad stratum 13"
1583680217,"2020-03-08 15:10:17",0.007069186,-4,-44.8,,,,"bad stratum 13"
1583679969,"2020-03-08 15:06:09",0.003438599,1,-18.6,20,"Newark, NJ, US",,"bad stratum 7"
1583679969,"2020-03-08 15:06:09",0.003438599,1,-43,,,,"bad stratum 7"
1583679627,"2020-03-08 15:00:27",-0.011037552,1,-12.1,23,Amsterdam,0,
1583679627,"2020-03-08 15:00:27",-0.011037552,1,-46.3,,,0,
1583679354,"2020-03-08 14:55:54",0.000636382,-5,-10.5,22,"Los Angeles, CA",,",?Lb"
1583679354,"2020-03-08 14:55:54",0.000636382,-5,-49.8,,,,",?Lb"
1583679106,"2020-03-08 14:51:46",-0.003117401,-5,-20.6,20,"Newark, NJ, US",,",?Lb"
1583679106,"2020-03-08 14:51:46",-0.003117401,-5,-47.1,,,,",?Lb"
1583678827,"2020-03-08 14:47:07",0,-5,-13.8,23,Amsterdam,,"i/o timeout"
1583678827,"2020-03-08 14:47:07",0,-5,-44.4,,,,"i/o timeout"
1583678467,"2020-03-08 14:41:07",0.001463921,-5,-5.8,22,"Los Angeles, CA",,",?Lb"
1583678467,"2020-03-08 14:41:07",0.001463921,-5,-41.4,,,,",?Lb"
1583678134,"2020-03-08 14:35:34",-0.002037602,-5,-16.5,20,"Newark, NJ, US",,",?Lb"
1583678134,"2020-03-08 14:35:34",-0.002037602,-5,-38.4,,,,",?Lb"
1583677890,"2020-03-08 14:31:30",-0.001901663,-5,-9.2,23,Amsterdam,,",?Lb"
1583677890,"2020-03-08 14:31:30",-0.001901663,-5,-35.1,,,,",?Lb"
1583677643,"2020-03-08 14:27:23",0.001858051,-5,-0.9,22,"Los Angeles, CA",,",?Lb"
1583677643,"2020-03-08 14:27:23",0.001858051,-5,-31.7,,,,",?Lb"
1583677380,"2020-03-08 14:23:00",-0.001472616,-5,-12.1,20,"Newark, NJ, US",,",?Lb"
1583677380,"2020-03-08 14:23:00",-0.001472616,-5,-28.1,,,,",?Lb"
1583677115,"2020-03-08 14:18:35",-0.00104997,-5,-4.4,23,Amsterdam,,",?Lb"
1583677115,"2020-03-08 14:18:35",-0.00104997,-5,-24.3,,,,",?Lb"
1583676869,"2020-03-08 14:14:29",0.002604457,-5,4.4,22,"Los Angeles, CA",,",?Lb"
1583676869,"2020-03-08 14:14:29",0.002604457,-5,-20.3,,,,",?Lb"
1583676578,"2020-03-08 14:09:38",-0.014846272,-5,-7.4,20,"Newark, NJ, US",,",?Lb"
My Setup:
Stratum 1
3 x Stratum 1 servers (hidden masters) running on raspberry pi 4 with adafruit ultimate gps hat and chronodot rtc, running ntpd, each stratum1 has redundant power supplies with ups. GPS signal is good with ten sats seen and a 3d fix. I do NOT use gpsd, I use driver 127.127.20.0, combining both pps and gps.
ntpq -pn -c ass normally shows very low offsets as below example:
user@host: ntpq -pn -c ass
remote refid st t when poll reach delay offset jitter
o127.127.20.0 .GPS0. 0 l 5 8 377 0.000 0.000 0.002
Stratum 2
3 x Stratum 2 servers (published in pool) running chrony.
Concept is that the 3xstratum2 are connected to the three stratum 1 servers and then peered (with keys) to each other. Stratum 1 are not peered with each other.
I am in Kenya, and attempting to improve ntp pool coverage in my neighbourhood. The Stratum 2 are connected via IP Transit to the submarine cable and the internet exchange points here. I use my own Public AS number with fixed public IPv6 and IPv4 addresses.
Problem
Today all three of my stratum 2 servers disappeared off the NTP Pool with the above logs.
Two Stratum 1 servers were fine. One of them started to throw large offsets of like 1 second. Dont know why. Sat fix was fine throughout. Restarting ntpd on that server restored it, though its taking time to settle as usual.
So, why did the Stratum2 servers not carryon and trust the remaining 2 stratum 1 servers?
What do the errors in the log mean, specifically the bad stratum ones.
Thanks for any insights you all can offer.
Salaams,
Alex
Additional info:
Stratum1 ntp.conf
driftfile /var/lib/ntp/ntp.drift
leapfile /usr/share/zoneinfo/leap-seconds.list
statistics loopstats peerstats clockstats
filegen loopstats file loopstats type day enable
filegen peerstats file peerstats type day enable
filegen clockstats file clockstats type day enable
server 127.127.20.0 mode 88 minpoll 3 iburst
fudge 127.127.20.0 stratum 0 flag1 1 flag2 0 flag3 1 flag4 0 time1 0.100 time2 0.0 refid GPS0
restrict -4 default kod notrap nomodify nopeer noquery limited
restrict -6 default kod notrap nomodify nopeer noquery limited
restrict 127.0.0.1
restrict ::1
restrict source notrap nomodify noquery
Stratum2 Chrony.conf
server ntp-s1-0.icolo.io iburst
server ntp-s1-1.icolo.io iburst
server ntp-s1-2.icolo.io iburst
peer ntp1.icolo.io key 1
peer ntp2.icolo.io key 2
keyfile /etc/chrony/chrony.keys
driftfile /var/lib/chrony/chrony.drift
logdir /var/log/chrony
maxupdateskew 100.0
rtcsync
makestep 1 3
allow
bindcmdaddress 127.0.0.1
bindcmdaddress ::1
Topology
Stratum 1 build