Too many open files at 1000 req/sec
Zsolt Ero
zsolt.ero at gmail.com
Sun Aug 10 14:28:20 UTC 2025
Hello Maxim, and thank you for your detailed answer.

First, about `multi_accept`: I can confirm that it indeed distributes
requests very unevenly.
Luckily I have two servers, each handling 50% of the requests, so I could
experiment by turning it off on one server and restarting the nginx
service on both.
With multi_accept on:
for pid in $(pgrep -f "nginx: worker"); do
    echo "PID $pid: $(lsof -p $pid | wc -l) open files"
done
PID 1761825: 66989 open files
PID 1761827: 8766 open files
PID 1761828: 962 open files
PID 1761830: 184 open files
PID 1761832: 46 open files
PID 1761833: 81 open files
PID 1761834: 47 open files
PID 1761835: 40 open files
PID 1761836: 45 open files
PID 1761837: 44 open files
PID 1761838: 40 open files
PID 1761839: 40 open files
With multi_accept off:
PID 1600658: 11137 open files
PID 1600659: 10988 open files
PID 1600660: 10974 open files
PID 1600661: 11116 open files
PID 1600662: 10937 open files
PID 1600663: 10891 open files
PID 1600664: 10934 open files
PID 1600665: 10944 open files
This is from an everyday, low-load situation, not a CDN "Purge Cache" or
similar flood.

Based on this, multi_accept on clearly makes no sense; I wonder why it's
recommended in so many guides. Back when I put my config together, I went
through blog posts, optimisation guides and StackOverflow answers, and
multi_accept on was in many of them.
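For the record, the fix on my side is simply deleting that line, since off
is the default anyway. A minimal sketch of the resulting events block
(worker_connections 40000 is my current value, as quoted below):

    events {
        worker_connections 40000;
        # multi_accept on;  <- removed; with the default (off), a worker
        # accepts one new connection per event loop iteration
    }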
2. Thanks, so I'll be rewriting it as:
error_page 404 = @empty_tile;
log_not_found off;
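In context, a minimal sketch of what I'm planning (the /tiles/ location
name and the Cache-Control value here are illustrative, not my actual
config):

    location /tiles/ {
        try_files $uri =404;
        error_page 404 = @empty_tile;
        log_not_found off;
    }

    location @empty_tile {
        # a missing tile becomes an explicit, cacheable empty 200;
        # real server errors are not remapped and still surface as 5xx
        add_header Cache-Control "public, max-age=86400";
        return 200 "";
    }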
3. What I didn't say is that the files are served from a read-only mounted
disk image (btrfs loop,ro 0 0). I guess modern Linux kernel-level caching
should be quite optimised for this scenario, shouldn't it? I believe
open_file_cache in my situation introduces a huge complexity surface with
possibly no upside, so I'll definitely be turning it off altogether.
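Concretely, that means dropping this whole group from the http block (all
four directives are in my current config, per questions 3 and 4 below;
values elided here):

    # open_file_cache          max=... inactive=...;   <- removed
    # open_file_cache_valid    ...;                    <- removed
    # open_file_cache_min_uses ...;                    <- removed
    # open_file_cache_errors   on;                     <- removed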
4. Now for the limits/connections, I feel this is somewhat deeper water.
Currently (at normal load) I have this on the multi_accept off server:
ss -Htnp state established '( sport = :80 or sport = :443 )' \
    | awk 'match($0, /pid=([0-9]+)/, m) { c[m[1]]++ }
           END { for (p in c) printf "nginx worker pid %s: %d ESTAB\n", p, c[p] }' \
    | sort -k6,6nr
nginx worker pid 1600658: 203 ESTAB
nginx worker pid 1600659: 213 ESTAB
nginx worker pid 1600660: 211 ESTAB
nginx worker pid 1600661: 201 ESTAB
nginx worker pid 1600662: 220 ESTAB
nginx worker pid 1600663: 214 ESTAB
nginx worker pid 1600664: 213 ESTAB
nginx worker pid 1600665: 212 ESTAB
and this on the multi_accept on one:
nginx worker pid 1761825: 1388 ESTAB
nginx worker pid 1761827: 114 ESTAB
nginx worker pid 1761828: 6 ESTAB
Isn't the default http2 limit of 128 streams a bit too high? What do you
think about:
worker_connections 8192;
http2_max_concurrent_streams 32;
=> 8192 * (32+1) = 270,336 < 300k
Also, what do you think about adding http2_idle_timeout 30s?
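Putting it together, this is the sketch I have in mind; the
worker_rlimit_nofile value is my assumption, chosen to match the 300k
limit used in the calculation above:

    worker_rlimit_nofile 300000;

    events {
        worker_connections 8192;
    }

    http {
        # 8192 connections/worker * (32 streams + 1 socket) = 270,336 fds
        http2_max_concurrent_streams 32;
    }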
Best regards,
Zsolt
On 10. Aug 2025 at 12:34:55, Maxim Dounin <mdounin at mdounin.ru> wrote:
> Hello!
>
> On Sat, Aug 09, 2025 at 09:57:40AM -0700, Zsolt Ero wrote:
>
> > I'm seeking advice on the most robust way to configure Nginx for a
> > specific scenario that led to a caching issue.
> >
> > I run a free vector tile map service (https://openfreemap.org/). The
> > server's primary job is to serve a massive number of small (~70 kB),
> > pre-gzipped PBF files.
> >
> > To optimize for ocean areas, tiles that don't exist on disk should be
> > served as a 200 OK with an empty body. These are then rendered as
> > empty space on the map.
> >
> > Recently, the server experienced an extremely high load: 100k req/sec
> > on Cloudflare, and 1k req/sec on my two Hetzner servers. During this
> > peak, Nginx started serving some *existing* tiles as empty bodies.
> > Because these responses included cache-friendly headers (expires 10y),
> > the CDN cached the incorrect empty responses, effectively making parts
> > of the map disappear until a manual cache purge was performed.
> >
> > My goal is to prevent this from happening again. A temporary server
> > overload should result in a server error (e.g., 5xx), not incorrect
> > content that gets permanently cached.
> >
> > [...]
> >
> > Full generated config is uploaded here:
> > https://github.com/hyperknot/openfreemap/blob/main/docs/assets/nginx.conf
> >
> > Questions
> >
> > 1. I think multi_accept + open_file_cache > worker_rlimit_nofile is
> > causing the whole trouble by not distributing the requests across
> > workers, and then reaching the limit. Can you confirm if this is the
> > correct take?
>
> The root cause is definitely open_file_cache configured
> with maximum number of cached files higher than allowed by the
> number of open files resource limit.
>
> Using multi_accept makes this easier to hit by making request
> distribution between worker processes worse than it could be.
>
> Overall, I would recommend the following:
>
> - Remove multi_accept, it's not needed unless you have very high
> connection rates (and with small connection rates it'll waste
> resources). Even assuming 1k r/s translates to 1k connections per
> second, using multi_accept is unlikely to be beneficial.
>
> - Remove open_file_cache. It is only beneficial if opening files
> requires significant resources, and this is unlikely for local
> files on Unix systems. On the other hand, it is very likely to
> introduce various issues, either by itself due to bugs (e.g., I've
> recently fixed several open_file_cache bugs related to caching
> files with directio enabled), or by exposing and magnifying other
> issues, such as non-atomic file updates or misconfigurations like
> this one.
>
> > 2. How should I handle the "missing file should be empty response,
> > server error should be 5xx" scenario? I've asked 5 LLMs and each gave
> > different answers, which I'm including below. I'd like to ask your
> > expert opinion, and not trust LLMs in this regard.
> >
> > *o3*
> >
> > error_page 404 = @empty_tile;
>
> That's what I would recommend as well.
>
> You may also want to use "log_not_found off;" to avoid excessive
> logging.
>
> Also, it may make sense to actually rethink how empty tiles are
> stored. With the "no file means empty tile" approach you are still
> risking the same issue even with "error_page 404" if files will be
> lost somehow - such as due to disk issues, during incomplete
> synchronization, or whatever.
>
> [...]
>
> > 3. *open_file_cache Tuning:* My current open_file_cache settings are
> > clearly too aggressive and caused the problem. For a workload of
> > millions of tiny, static files, what would be considered a good
> > configuration for max, inactive, and min_uses?
>
> I don't think that open_file_cache would be beneficial for your
> use case. Rather, it may make sense to tune OS namei(9) cache
> (dentry cache on Linux; not sure there are any settings other than
> vm.vfs_cache_pressure) to the number of files. On the other hand,
> given the 1k r/s request rate, most systems should be good enough
> without any tuning.
>
> > 4. *open_file_cache_errors:* Should this be on or off? My intent for
> > having it on was to cache the "not found" status for ocean tiles to
> > reduce disk checks. I want to cache file-not-found scenarios, but not
> > server errors. What is the correct usage in this context?
>
> The "open_file_cache_errors" directive currently caches all file
> system errors, and doesn't make any distinction between what
> exactly went wrong - either the file or directory cannot be found,
> or there is a permissions error, or something else. If you want
> to make sure that no unexpected errors will be cached, consider
> keeping it off.
>
> On the other hand, it may make sense to explicitly exclude EMFILE,
> ENFILE, and maybe ENOMEM from caching. I'll take a look.
>
> Note though, that as suggested above, my recommendation would be
> to avoid using "open_file_cache" at all.
>
> > 5. *Limits:* What values would you recommend for values like
> > worker_rlimit_nofile and worker_connections? Should I raise
> > LimitNOFILESoft?
>
> In general, "worker_connections" should be set depending on the
> expected load (and "worker_processes"). Total number of
> connections nginx will be able to handle is worker_processes *
> worker_connections.
>
> Given you use "worker_connections 40000;" and at least 5 worker
> processes, your server is already able to handle more than 200k
> connections, and it is likely more than enough. Looking into
> stub_status numbers (and/or system connection stats) might give
> you an idea of whether you need more connections. Note that using
> many worker connections might require OS tuning (but it looks like
> you've already set fs.file-max to an arbitrarily high value).
>
> And the RLIMIT_NOFILE limit should be set to a value needed for
> your worker processes. It doesn't matter how you set it,
> either in the system (such as with LimitNOFILESoft in systemd) or with
> worker_rlimit_nofile in nginx itself (assuming it's under the hard
> limit set in the system).
>
> The basic idea is that worker processes shouldn't hit the
> RLIMIT_NOFILE limit, but should hit worker_connections limit
> instead. This way workers will be able to reuse least recently
> used connections to free some resources, and will be able to
> actively avoid accepting new connections to let other worker
> processes do this.
>
> Given each connection uses at least one file for the socket, and
> can use many (for the file it returns, for upstream connections,
> for various temporary files, subrequests, streams in HTTP/2, and
> so on), it is usually a good idea to keep RLIMIT_NOFILE several
> times higher than worker_connections.
>
> Since you have HTTP/2 enabled with the default
> max_concurrent_streams (128), and no proxying or subrequests, a
> reasonable limit would be worker_connections * (128 + 1) or so,
> that's about 5 mln open files (or you could consider reducing
> max_concurrent_streams, or worker_connections, or both). And of
> course you'll have to add some for various files not related to
> connections, such as logs and open_file_cache if you'll decide to
> keep it.
>
> > Finally, since this is the freenginx list: does freenginx offer
> > anything over stock nginx which would help me in this use case? Even
> > just a monitoring page with FD values would help.
>
> I don't think there is a significant difference in this particular
> use case. While freenginx provides various fixes and
> improvements, including fixes in open_file_cache, they won't make
> a difference here - the root cause of the issue you've hit is
> fragile configuration combined with too low resource limits.
>
> --
> Maxim Dounin
> http://mdounin.ru/
>