[Fix] html: prevent buffer overflow in entity decoding

author Vsevolod Stakhov <vsevolod@rspamd.com>

Wed, 20 May 2026 10:39:51 +0000 (11:39 +0100)

committer Vsevolod Stakhov <vsevolod@rspamd.com>

Wed, 20 May 2026 10:39:51 +0000 (11:39 +0100)
diff --git a/src/libserver/html/html_entities.cxx b/src/libserver/html/html_entities.cxx

index d7c709f2da11ba3c32de8f54c59c166da14ecd1b..5e18cf7a304b6d345234fb5d8bf1ddd98ec4695d 100644
--- a/src/libserver/html/html_entities.cxx
+++ b/src/libserver/html/html_entities.cxx
@@ -2260,8 +2260,17 @@ decode_html_entitles_inplace(char *s, std::size_t len, bool norm_spaces)
  
                 auto replace_entity = [&]() -> void {
                         auto l = strlen(entity_def->replacement);
-                       memcpy(t, entity_def->replacement, l);
-                       t += l;
+                       /*
+                        * The decoder works in place, so the replacement may only be
+                        * written while it fits the remaining buffer. Some short entity
+                        * names expand to longer multi-codepoint replacements, which
+                        * would otherwise overflow when the entity sits at the very end
+                        * of the buffer. Drop such a truncated entity instead.
+                        */
+                       if (end - t >= (decltype(end - t)) l) {
+                               memcpy(t, entity_def->replacement, l);
+                               t += l;
+                       }
                 };
  
                 if (entity_def) {
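
Note: the guard added in this hunk can be illustrated in isolation. The following is a minimal sketch, not code from rspamd: `write_if_fits` is a hypothetical helper, and it assumes the write cursor never passes the end of the buffer. It copies a replacement only when it fits in the remaining space and otherwise leaves the buffer and the cursor untouched, which mirrors the "drop the truncated entity" behaviour described in the comment above.

```cpp
#include <cassert>
#include <cstddef>
#include <cstring>

// Hypothetical helper (not part of rspamd): copy `replacement` to the write
// cursor `t` only if it fits before `end`; return the updated cursor.
inline char *write_if_fits(char *t, const char *end, const char *replacement)
{
	// Assumption of this sketch: the cursor is never past the buffer end.
	assert(t <= end);

	const std::size_t l = std::strlen(replacement);
	// Safe to convert: the difference is non-negative given the assert above.
	const std::size_t room = static_cast<std::size_t>(end - t);

	if (l <= room) {
		std::memcpy(t, replacement, l);
		t += l;
	}
	// Otherwise the replacement is dropped and the cursor stays where it was.
	return t;
}
```

With a 4-byte buffer and the cursor at offset 2, a 3-byte replacement is skipped and a 2-byte one is written. The sketch compares sizes as unsigned values after checking `t <= end`; the commit instead casts `l` to the signed difference type, which gives the same result under that precondition.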