Tweaks and some docs

author Otto Moerbeek <otto.moerbeek@open-xchange.com>

Wed, 30 Aug 2023 14:32:55 +0000 (16:32 +0200)

committer Otto Moerbeek <otto.moerbeek@open-xchange.com>

Wed, 13 Sep 2023 11:20:54 +0000 (13:20 +0200)
diff --git a/pdns/recursordist/docs/performance.rst b/pdns/recursordist/docs/performance.rst

index 4f5f6e76d15eeac4c8603f8e016f4f2f06bb3726..2176d7d960fb640c62f172f57efc17f450c400f5 100644
--- a/pdns/recursordist/docs/performance.rst
+++ b/pdns/recursordist/docs/performance.rst
@@ -80,10 +80,14 @@ MTasker and MThreads
  PowerDNS Recursor uses a cooperative multitasking in userspace called ``MTasker``, based either on ``boost::context`` if available, or on ``System V ucontexts`` otherwise. For maximum performance, please make sure that your system supports ``boost::context``, as the alternative has been known to be quite slower.
  
  The maximum number of simultaneous MTasker threads, called ``MThreads``, can be tuned via :ref:`setting-max-mthreads`, as the default value of 2048 might not be enough for large-scale installations.
+This number limits the number of mthreads *per physical (Posix) thread*.
+The threads that create mthreads are the distributor and worker threads.
  
  When a ``MThread`` is started, a new stack is dynamically allocated for it on the heap. The size of that stack can be configured via the :ref:`setting-stack-size` parameter, whose default value is 200 kB which should be enough in most cases.
  
-To reduce the cost of allocating a new stack for every query, the recursor can cache a small amount of stacks to make sure that the allocation stays cheap. This can be configured via the :ref:`setting-stack-cache-size` setting. The only trade-off of enabling this cache is a slightly increased memory consumption, at worst equals to the number of stacks specified by :ref:`setting-stack-cache-size` multiplied by the size of one stack, itself specified via :ref:`setting-stack-size`.
+To reduce the cost of allocating a new stack for every query, the recursor can cache a small amount of stacks to make sure that the allocation stays cheap. This can be configured via the :ref:`setting-stack-cache-size` setting.
+This limit is per physical (Posix) thread.
+The only trade-off of enabling this cache is a slightly increased memory consumption, at worst equals to the number of stacks specified by :ref:`setting-stack-cache-size` multiplied by the size of one stack, itself specified via :ref:`setting-stack-size`.
  
  Performance tips
  ----------------
@@ -177,8 +181,13 @@ Each of the queries processed will consume an mthread until processing is done.
  A response to a query is sent immediately when it becomes available; the response can be sent before other responses to queries that were received earlier by the Recursor.
  This is the Out-of-Order feature which greatly enhances performance, as a single slow query does not prevent other queries to be processed.
  
+Before version 5.0.0, TCP queries are processed by either the distributor thread(s) if :ref:`setting-pdns-distributes-queries` is true, or by worker threads if :ref:`setting-pdns-distributes-queries` is false.
+Starting with version 5.0.0, :program:`Recursor` has dedicated thread(s) processing TCP queries.
+
  The maximum number of mthreads consumed by TCP queries is :ref:`setting-max-tcp-clients` times :ref:`setting-max-concurrent-requests-per-tcp-connection`.
-This number should be (much) lower than :ref:`setting-max-mthreads`, to also allow UDP queries to be handled as these also consume mthreads.
+If :ref:`setting-pdns-distributes-queries` is true, this number should be (much) lower than :ref:`setting-max-mthreads`, to also allow UDP queries to be handled as these also consume mthreads.
+Note that :ref:`setting-max-mthreads` is a per Posix thread setting.
+This means that the global maximum number of mthreads is (#distributor threads + #worker threads) * max-mthreads.
  
  If you expect few clients, you can increase :ref:`setting-max-concurrent-requests-per-tcp-connection`, to allow more concurrency per TCP connection.
  If you expect many clients and you have increased :ref:`setting-max-tcp-clients`, reduce :ref:`setting-max-concurrent-requests-per-tcp-connection` number to prevent mthread starvation or increase the maximum number of mthreads.
@@ -188,6 +197,7 @@ To see the current number of mthreads in use consult the :ref:`stat-concurrent-q
  If a query could not be handled due to mthread shortage, the :ref:`stat-over-capacity-drops` metric is increased.
  
  As an example, if you have typically 200 TCP clients, and the default maximum number of mthreads of 2048, a good number of concurrent requests per TCP connection would be 5. Assuming a worst case packet cache hit ratio, if all 200 TCP clients fill their connections with queries, about half (5 * 200) of the mthreads would be used by incoming TCP queries, leaving the other half for incoming UDP queries.
+Note that starting with version 5.0.0, TCP queries are processed by dedicated TCP thread(s), so the sharing of mthreads between UDP and TCP queries no longer applies.
  
  The total number of incoming TCP connections is limited by :ref:`setting-max-tcp-clients`.
  There is also a per client address limit: :ref:`setting-max-tcp-per-client` to limit the impact of a single client.
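
The mthread arithmetic described in the performance.rst hunks above can be sketched roughly as follows. This is only an illustration of the two formulas in the documentation text; the struct and function names are hypothetical and do not exist in the Recursor source.

```cpp
#include <cstdint>

// Hypothetical container for the three settings the documentation refers to.
// The field names are illustrative and are not Recursor identifiers.
struct MThreadSettings
{
  std::uint64_t maxMThreads;          // max-mthreads (applies per Posix thread)
  std::uint64_t maxTCPClients;        // max-tcp-clients
  std::uint64_t maxConcurrentPerConn; // max-concurrent-requests-per-tcp-connection
};

// Worst-case number of mthreads consumed by TCP queries:
// max-tcp-clients times max-concurrent-requests-per-tcp-connection.
constexpr std::uint64_t maxTCPMThreads(const MThreadSettings& settings)
{
  return settings.maxTCPClients * settings.maxConcurrentPerConn;
}

// Global upper bound on mthreads as stated in the documentation:
// (#distributor threads + #worker threads) * max-mthreads.
constexpr std::uint64_t globalMaxMThreads(const MThreadSettings& settings,
                                          std::uint64_t distributorThreads,
                                          std::uint64_t workerThreads)
{
  return (distributorThreads + workerThreads) * settings.maxMThreads;
}
```

With the values from the documentation example (200 TCP clients, 5 concurrent requests per connection, max-mthreads of 2048), `maxTCPMThreads` yields 1000, which is the "about half" of 2048 mentioned in the text.
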
diff --git a/pdns/recursordist/rec-main.cc b/pdns/recursordist/rec-main.cc

index 533a23ddcd7095f07377bc4407cdaf5779f3a19a..59ab27e7ed67f323fb0fb57ac3d76278e2980d41 100644
--- a/pdns/recursordist/rec-main.cc
+++ b/pdns/recursordist/rec-main.cc
@@ -910,8 +910,8 @@ static void checkLinuxIPv6Limits([[maybe_unused]] Logr::log_t log)
  static void checkOrFixFDS(Logr::log_t log)
  {
    unsigned int availFDs = getFilenumLimit();
-  unsigned int wantFDs = g_maxMThreads * RecThreadInfo::numWorkers() + 25; // even healthier margin then before
-  wantFDs += RecThreadInfo::numWorkers() * TCPOutConnectionManager::s_maxIdlePerThread;
+  unsigned int wantFDs = g_maxMThreads * (RecThreadInfo::numWorkers() + RecThreadInfo::numTCPWorkers()) + 25; // even healthier margin than before
+  wantFDs += (RecThreadInfo::numWorkers() + RecThreadInfo::numTCPWorkers()) * TCPOutConnectionManager::s_maxIdlePerThread;
  
    if (wantFDs > availFDs) {
      unsigned int hardlimit = getFilenumLimit(true);
@@ -921,7 +921,7 @@ static void checkOrFixFDS(Logr::log_t log)
             log->info(Logr::Warning, "Raised soft limit on number of filedescriptors to match max-mthreads and threads settings", "limit", Logging::Loggable(wantFDs)));
      }
      else {
-      auto newval = (hardlimit - 25 - TCPOutConnectionManager::s_maxIdlePerThread) / RecThreadInfo::numWorkers();
+      auto newval = (hardlimit - 25 - TCPOutConnectionManager::s_maxIdlePerThread) / (RecThreadInfo::numWorkers() + RecThreadInfo::numTCPWorkers());
        SLOG(g_log << Logger::Warning << "Insufficient number of filedescriptors available for max-mthreads*threads setting! (" << hardlimit << " < " << wantFDs << "), reducing max-mthreads to " << newval << endl,
             log->info(Logr::Warning, "Insufficient number of filedescriptors available for max-mthreads*threads setting! Reducing max-mthreads", "hardlimit", Logging::Loggable(hardlimit), "want", Logging::Loggable(wantFDs), "max-mthreads", Logging::Loggable(newval)));
        g_maxMThreads = newval;
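
To make the effect of the rec-main.cc change easier to follow, the two expressions from the new side of the hunks can be isolated as pure functions. This is a sketch under stated assumptions: the function and parameter names are hypothetical, the value of `TCPOutConnectionManager::s_maxIdlePerThread` is passed in rather than assumed, and 64-bit arithmetic is used here where the original uses `unsigned int`.

```cpp
#include <cstdint>

// Number of file descriptors wanted, mirroring the "+" lines in checkOrFixFDS():
// max-mthreads per thread for all worker and TCP worker threads, a margin of 25,
// plus the idle outgoing TCP connections each of those threads may keep.
constexpr std::uint64_t wantedFDs(std::uint64_t maxMThreads,
                                  std::uint64_t workers,
                                  std::uint64_t tcpWorkers,
                                  std::uint64_t maxIdlePerThread)
{
  return maxMThreads * (workers + tcpWorkers) + 25
    + (workers + tcpWorkers) * maxIdlePerThread;
}

// Reduced max-mthreads value used when the hard limit is too low, mirroring the
// "+" line that computes newval. Assumes hardLimit >= 25 + maxIdlePerThread and
// workers + tcpWorkers > 0; this sketch does not guard against either case.
constexpr std::uint64_t reducedMaxMThreads(std::uint64_t hardLimit,
                                           std::uint64_t workers,
                                           std::uint64_t tcpWorkers,
                                           std::uint64_t maxIdlePerThread)
{
  return (hardLimit - 25 - maxIdlePerThread) / (workers + tcpWorkers);
}
```

For instance, with max-mthreads of 2048, two workers, one TCP worker and an illustrative idle limit of 10, `wantedFDs` gives 6199; before this commit the TCP worker would not have been counted.
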
diff --git a/pdns/recursordist/rec-tcp.cc b/pdns/recursordist/rec-tcp.cc

index 19ea2327f72dea0cd1a7b78ac1f4b7b376ac115c..3e0cb05a263abf6aec43464c9a29a629a3e146c4 100644
--- a/pdns/recursordist/rec-tcp.cc
+++ b/pdns/recursordist/rec-tcp.cc
@@ -52,6 +52,10 @@
  //
  // The drawback mentioned in https://github.com/PowerDNS/pdns/issues/8394 are not longer true, so an
  // alternative approach would be to introduce dedicated TCP worker thread(s).
+//
+// And this approach was implemented in https://github.com/PowerDNS/pdns/pull/13195. The distributor
+// and worker thread(s) now no longer process TCP queries.
+
  
  size_t g_tcpMaxQueriesPerConn;
  unsigned int g_maxTCPPerClient;