So we DO have a problem with monitoring, right?

#1

Greetings,

Reading thru a number of posts on the apparent problems with the monitoring system, I gather that something is actually going on and that it may be why my server keeps getting kicked out of the pool as soon as it reaches 10 or 11. I had for once the opportunity to see it “live” today and as far as I can tell by using my beginner’s knowledge and some tools like iftop, there was no outrageous traffic at the time it dipped below 10 (about 5 kb/s on port 123, my limit is set at 512kb/s). Mrulist does not show any particular agressive client. I confirmed that the config is as per the recommendations of the Pool Project web site and nothing as changed on that server for the past 2 years. I am located in Canada, near Montréal. The server is a small embedded device running OpenWrt on my network DMZ. Sooooo…

1- Can anyone confirm that what is been happeneing for the past few months is “normal” given the situation with the monitoring system?

2- Any recommendations as to what I could do on my side to further debug the problem?

3- Is there anything I could do to help?

Thank you for your time and I hope to be able to keep my little part of the internet on time! :wink:

Best Regards,

GM

0 Likes

#2

What’s the IP(s)? What is the monitoring system complaining about?

If your server’s score is only going down when the score is above 10, it sounds like it’s failing under the traffic load – often because of an overloaded connection tracking firewall or NAT.

If the score is shaky no matter what it is, it could be a problem with the monitoring system, or anything else.

0 Likes

#3

Sorry about that!

IP: 66.130.142.231. From the csv, I/O Timeout error when at/or above 10 and then going down. I agree with you that it does look like traffic load but I have had my speed set at 512kb since the beginning but I have been having problems only in the past few months. Any hints or suggestion on how I could prove that it is indeed traffic? Like I mentionned, I had iftop running when it got to 10 about 1 hour ago and the max traffic I saw was around 4-5kb/s. As far as the firewall is concerned, no change on that front for the last couple of years.

Again, any suggestions welcomed and thank you for your time!

0 Likes

#4

There is definitely some monitoring problem!
I have the same problem to/from Sweden, worked flawlessly for +2years, now out of the pool for the last 2-3 months.
Mine goes down even before reaching 10 most of the times.
I did my own 5 min external monitoring from another linux machine at home, cronjob:

*/5 * * * * /usr/sbin/ntpdate -q >>/tmp/ntptest.log 2>>/tmp/ntptest.err

I get about 0,03% errors from my own, from ntppool monitoring around 30%…
So, I think my server is fine.

0 Likes

#5

After several years always scoring 20 I have had suffered regular dropouts every week over the last year. I was out of the pool most of the weekend.
By the time i joined the beta only LA was working, it shows similar dropouts. I’m in London,UK 87.75.224.75 using LeoNTP. My logs show background traffic continues normally and local link is not overloaded.

0 Likes