Modem/Routers slowdown on heavy traffic

avij · January 12, 2026, 2:59pm

I went through this path quite a few years ago until I figured out that not tracking the connections at all is the ultimate solution.

However, yes, decreasing that value will probably ease the pain somewhat. As for NTP I’d say something like 3 seconds would suffice, but bear in mind that this timeout is not only for NTP, it’s for all UDP protocols. In particular, DNS is primarily UDP so you may want to use a timeout value that is long enough for all DNS queries.

YMMV, but:

$ time dig @192.0.2.33 example.com
;; communications error to 192.0.2.33#53: timed out
;; communications error to 192.0.2.33#53: timed out
;; communications error to 192.0.2.33#53: timed out

; <<>> DiG 9.18.33 <<>> @192.0.2.33 example.com
; (1 server found)
;; global options: +cmd
;; no servers could be reached

real 0m15.047s

so in this case I might pick something like 16 seconds. If you have or use other services that are primarily UDP you may want to take those into account as well.
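
For what it's worth, the 15 seconds above matches dig's documented defaults of 3 tries with a 5 second timeout each. A small sketch of that arithmetic follows; the 1 second margin is only an assumption to arrive at the 16 seconds mentioned, and the variable names are made up for illustration.

```shell
# dig's default retry behaviour (per its man page): 3 tries, 5 s each.
tries=3
seconds_per_try=5

# Worst case before the client gives up on a server that never answers.
worst_case=$((tries * seconds_per_try))

# Hypothetical safety margin on top of that (an assumption, not a rule).
margin=1
udp_timeout=$((worst_case + margin))

echo "client gives up after ${worst_case}s; candidate UDP timeout: ${udp_timeout}s"
```

Other resolvers may retry differently, so it is probably worth timing the clients you actually use. With dig the two values can be changed with the +tries= and +time= options.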

Bas · January 12, 2026, 3:11pm

Well, I also got this error many times because the tables were filling up and the DNS servers on the net could not be reached (see earlier in the topic). DNSmasq partly solved this by caching.

And yes, I’m aware it impacts other stuff like VoIP as well.

On the MikroTik forum they speak about 15 seconds, some even 8.

I’m testing at 30s now, and I have set the pool to 1.5Mbit, let’s see how it goes.

avij · January 12, 2026, 3:23pm

The example was for a non-existent DNS server. 192.0.2.x is reserved for examples and I wanted to test how long it took for the DNS client to recognize that the server isn’t going to reply.

As a side remark – if you want to reduce the number of DNS queries going through your router, use only one DNSmasq instance to make the DNS cache hit ratio better (unlikely to really matter, but hey, anything that helps). I think you mentioned earlier that you had two DNSmasq instances running.

Some statements for the nitpickers:

I said earlier that the “best way to fix NAT problems is to have no NAT at all”. There may be other ways to “fix” NAT including shortening the timeouts, but I would not call that the best solution. NAT is fine if you can make it work. If not, get rid of NAT.

Stateless NAT (if the router supports it) would work fine for NTP server purposes, but ntppool-agent (monitor software) does require keeping some state because it uses random high source ports. The router might have room for these sessions, though.

Bas · January 12, 2026, 3:44pm

Can’t do that, I have only 1 IPv4 address.

Anyway, previously I saw 955 sessions while in the pool; now I’m down to around 150 all the time, and the pool speed is set higher than before.

For testing it is at 1.5Mbit, where last time, before I took it out of the active pool, it was set at 512Kbit.

If it keeps this low, I can move the DNS back into the router, where I like it to be.
DNSmasq was just installed to reduce UDP table-overloads.

The lower timeout seems to do the job…fingers crossed, it’s just 1 hour atm.

Update: Running for a while now, set pool to 12Mbit…serving 10811 clients in less than 30 minutes.
But then I noticed the sessions:

Total 1.89 M / 12.65 M / Auto 246.04 K / 37.50 M / Auto 582 / 3854

Never seen that before, however, I notice NO slowdowns, it’s handled perfectly.

I have to keep testing, but so far so good. In the time it took to write this, the client count went up to 11682.

Bas · January 12, 2026, 7:50pm

This is funny, I’m serving almost 19K clients…

I just changed the UDP-timeout, and I set the MAX speed to 3Gbit…I only have 100/35mbit…

Sessions, way higher than I have ever seen…

Speed SET…yeah it’s testing…3 Gbit

Funny is, since changing UDP-timeout, it doesn’t seem to affect my own internet-use.

I’m pretty sure this is not a good setting…but hey, so far it ‘handles’ 3Gbit setting!!!

microchip8 · January 13, 2026, 11:57am

@Bas

How much do you ask for the MikroTik? I might consider it, though my own is still working great.

(I’ve lowered timeout and timeout_stream on the router to 10 and 15 respectively, been running like that with no issues)
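
For anyone on a Linux-based router with shell access: timeout and timeout_stream look like the kernel's nf_conntrack_udp_timeout and nf_conntrack_udp_timeout_stream settings, though that mapping is an assumption, since the post doesn't say what the router runs. A read-only sketch to see the current values and how full the table is (the function name is made up):

```shell
# Print the UDP timeouts and table usage that the Linux conntrack module
# exposes under /proc. Read-only; prints "(not available)" for a value
# that cannot be read, e.g. when the module is not loaded.
show_conntrack_settings() {
    for key in nf_conntrack_udp_timeout nf_conntrack_udp_timeout_stream \
               nf_conntrack_count nf_conntrack_max; do
        file="/proc/sys/net/netfilter/$key"
        if [ -r "$file" ]; then
            printf '%s = %s\n' "$key" "$(cat "$file")"
        else
            printf '%s = (not available)\n' "$key"
        fi
    done
}

show_conntrack_settings
```

On such a system the values can be changed as root with sysctl -w, for example net.netfilter.nf_conntrack_udp_timeout=10. Vendor firmware may overwrite this or expose it under a different name.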

Bas · January 13, 2026, 1:41pm

It’s about 1 month old, make me an offer.

As for traffic, it’s working…I have no problems with the DrayTek anymore…after setting the normal 12mbit setting, my traffic is ok.

I also had contact with DrayTek support and they confirm that their safe UDP setting of 180sec is giving problems with users like me that have many UDP connections.
But for most ‘normal’ users it’s a good setting.

Hopefully they document it better in the manual.

I hope this is the solution. We will know soon: typically the problems start within a few days, often within hours. So far, no issues.

john1 · January 13, 2026, 2:21pm

While talking to DrayTek, you should ask them whether they can add an option to disable keeping state for port forwarding (reverse NAT) sessions. There is no reason to keep state. All the information needed to rewrite the packets in both directions is already in what you fill in to set up port forwarding.

Bas · January 13, 2026, 2:52pm

I don’t think they are going to change the firmware. When I asked for an option to set the timeout in the GUI, they replied that it’s not going to change. But I can alter it in the CLI. No biggy.
In the Fritzbox you can’t even alter it at all.

mirolm · January 14, 2026, 11:13am

What you can do is ask your ISP to put their device in bridge mode. This way it becomes a media converter and your router will handle everything.

Bas · January 14, 2026, 7:15pm

Latest update…I have 30mbit upload…but set 3Gbit for the pool…

The router was the problem..UDP-timeout..

Running 25K sessions, 23K clients. No slowdowns.

Beware, I just have 100/30Mbit at home…UDP-time-out is causing routers to fail. Pretty sure.

No slowdowns…

Massive packets/sec.

Is it solved…maybe.

avij · January 15, 2026, 10:36am

As for the DNS queries, according to your server status page the script does not use the -n option. Maybe set CHRONY_ALLOW_DNS_LOOKUP="no" in the script.

Generally speaking, I’d argue that it’d be better if the default was “no” and the script user would need to explicitly set it to “yes” if needed.

Bas · January 15, 2026, 12:01pm

True, it doesn’t, as Chronyd doesn’t resolve on its own.
Chronyc only does it for the peers I have. That is not an issue, it just looks better.

avij · January 17, 2026, 6:32pm

Why not? The time limit is not the maximum length of the session, but the time the session is idle before it gets cleared. The VoIP connection state will stay active in the router as long as there’s a call going on. I suggested 16 seconds earlier (due to DNS timeouts). Maybe try that value and see if everything still works.
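
To make the idle-versus-maximum distinction concrete, here is a toy model. It is not how any router implements it: entry_state is a made-up helper and the packet times are invented whole seconds. The entry only expires if the gap between two consecutive packets reaches the timeout.

```shell
# entry_state TIMEOUT T0 T1 T2 ...
# Prints "kept" if no gap between consecutive packets reaches TIMEOUT
# seconds, otherwise "expired". Times are whole seconds, ascending.
entry_state() {
    timeout=$1
    shift
    last=$1
    shift
    for t in "$@"; do
        if [ $((t - last)) -ge "$timeout" ]; then
            echo "expired"
            return 0
        fi
        last=$t
    done
    echo "kept"
}

# A 30 s flow with a packet every 5 s stays tracked with a 10 s timeout ...
entry_state 10 0 5 10 15 20 25 30     # prints "kept"
# ... while a flow that goes quiet for 12 s loses its entry.
entry_state 10 0 5 17                 # prints "expired"
```

A call in progress sends packets far more often than every few seconds, so by this logic it should keep refreshing its own entry. What could still break is a registration or keepalive that is sent less often than the timeout.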

Bas · January 17, 2026, 6:40pm

I changed it to 10 seconds, let’s see what happens.

Look what happens…I set 3Gbit-pool-speed again…

No issues…the problem is the router firmware, keeping port 123 entries far too long in the NAT tables.

gunnar · January 20, 2026, 8:56am

Sorry for the late answer, seems like I have botched the notification settings on my end.

Is Germany underserved? Certainly not. But my two instances of ntpd-rs and chrony combined get about 300 queries per second with a peak up to 500 qps every 15 minutes (SNTP clients I guess). My router doesn’t break a major sweat though, even with flow logging and Suricata IDS enabled. Sure, I have also lowered the UDP timeout to 10 seconds, since most clients exchange just 2 packets (one query, one reply).

A pool monitor is also running, every server has one IPv6 in the pool and the incoming IPv4 address is NATed to the ntpd-rs instance currently.

I guess you just had bad luck with the choice of routers you tried. I bought the Ubiquiti one for the wife acceptance factor, but it’s charming that it is nice-looking hardware running Debian inside, with an ssh shell if you want. But I did not make any changes at the shell level to host ntpd at home; the UI had all the knobs needed.

Bas · January 20, 2026, 9:20am

Exactly, you did the same

gunnar · January 20, 2026, 12:11pm

Yeah, but just because the UI doesn’t expose the notrack option for the DNAT rule. So sure I lowered the timeout for UDP globally. But there are enough entries still in the connection tracking table of the kernel to kill your devices I guess.

root@Cloud-Gateway-Max:~# conntrack -L | grep "dport=123" | wc -l
conntrack v1.4.6 (conntrack-tools): 3754 flow entries have been shown.
3315

So of all tracked connections currently (I also host my webserver, ssh and VPN) there are over 3k incoming UDP connections open with destination port 123 and a UDP timeout of 10 seconds. And there’s still memory free in the router for plenty more. nf_conntrack_max is 131072 by default on this device
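
As a rough cross-check of that count: if every query creates its own entry and lives for one timeout period, the steady-state table size is about query rate times timeout. That is a simplification (repeat queries from the same address and port refresh an entry rather than add one), and estimate_entries is just an illustrative helper. The rates are the 300 and 500 qps quoted earlier.

```shell
# estimate_entries QPS TIMEOUT_SECONDS
# Rough steady-state table size if every query creates its own entry.
estimate_entries() {
    echo $(( $1 * $2 ))
}

estimate_entries 300 10     # average rate, 10 s timeout
estimate_entries 500 10     # quarter-hourly peak, 10 s timeout
estimate_entries 300 180    # same average rate with a 180 s timeout
```

The observed 3315 sits between the first two estimates. At a 180 second timeout the same traffic would need room for tens of thousands of entries, which still fits under nf_conntrack_max = 131072 here but would presumably overflow a much smaller table.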

I played around with setting the iptables rule myself with the notrack option in the shell, but for my current volume it wasn’t worth it. The UI deletes all rules and repopulates them from the database on every change, so handwritten rules get wiped frequently, and I saw no real reduction in CPU or memory usage on the router.

Bas · January 20, 2026, 1:09pm

DrayTek has CLI rules and those are not overwritten by the UI.
They stick.

You can configure the modem/router (however you use it) via the UI but also via the CLI, and the most advanced options are in the CLI.

I didn’t know about that, I just noticed that the NAT tables were maxed out and caused all sorts of trouble.

But memory or CPU cycles were never an issue, that was the strange thing about it.

gunnar · January 20, 2026, 1:48pm

Yeah, I didn’t look into it further after a major firmware/software bump. Earlier versions had a well-documented way of incorporating user changes, but Ubiquiti started to dumb the device down a bit.

Maybe there is another way now to save user changes, but I am afraid that it breaks a year from now. It’s not only a router: it also manages the three WiFi access points, switches and cameras at home, and it has an SSD and plays the role of network video recorder. I got old enough to stop playing around with such gear; I appreciate that everything “just works”.

I don’t want to sweat during updates right now and check whether everything from my internet access, WiFi, internal VLANs, VPN connections, home security and my hosted services is still working after a firmware update. So, as for you, lower UDP timeouts are the easiest option right now. But with 3 GB of RAM and, I think, a 4-core 1.5 GHz CPU, that slim white box does everything I need it to, including hosting two pool servers and a monitor at home without slowing down or killing DNS, while still serving a few hundred NTP clients per second (500 Mbps setting in the pool, Germany).
