Doc: improve user docs and code comments about EXISTS(SELECT * ...).

author Tom Lane <tgl@sss.pgh.pa.us>

Fri, 27 Feb 2026 20:20:16 +0000 (15:20 -0500)

committer Tom Lane <tgl@sss.pgh.pa.us>

Fri, 27 Feb 2026 20:20:16 +0000 (15:20 -0500)
diff --git a/doc/src/sgml/func/func-subquery.sgml b/doc/src/sgml/func/func-subquery.sgml
index a9f2b12e48c66a1c14f0cfce6f78d99cd9d3373b..f954f3bf1339ea1d4656cd558cce4ef9db42cdd9 100644
--- a/doc/src/sgml/func/func-subquery.sgml
+++ b/doc/src/sgml/func/func-subquery.sgml
@@ -70,8 +70,14 @@ EXISTS (<replaceable>subquery</replaceable>)
     and not on the contents of those rows, the output list of the
     subquery is normally unimportant.  A common coding convention is
     to write all <literal>EXISTS</literal> tests in the form
-   <literal>EXISTS(SELECT 1 WHERE ...)</literal>.  There are exceptions to
-   this rule however, such as subqueries that use <token>INTERSECT</token>.
+   <literal>EXISTS(SELECT * FROM ... WHERE ...)</literal>; another common
+   convention is to write <literal>EXISTS(SELECT 1 FROM ... WHERE
+   ...)</literal> or some other dummy constant.  These conventions are
+   actually equivalent in <productname>PostgreSQL</productname>, which
+   will optimize away evaluation of the subquery's output list altogether
+   when it cannot affect the number of rows returned.  (An example
+   that cannot be optimized away is an output list containing a
+   set-returning function, since the function might return zero rows.)
    </para>
  
    <para>
diff --git a/src/backend/optimizer/plan/subselect.c b/src/backend/optimizer/plan/subselect.c
index e6bc7023562cb0b5ae31ded903bc57a31a37c8ff..d7f3cedf3d58660b17df142345f01ddcc715f8b2 100644
--- a/src/backend/optimizer/plan/subselect.c
+++ b/src/backend/optimizer/plan/subselect.c
@@ -1643,7 +1643,13 @@ convert_EXISTS_sublink_to_join(PlannerInfo *root, SubLink *sublink,
   * Note: by suppressing the targetlist we could cause an observable behavioral
   * change, namely that any errors that might occur in evaluating the tlist
   * won't occur, nor will other side-effects of volatile functions.  This seems
- * unlikely to bother anyone in practice.
+ * unlikely to bother anyone in practice.  Note that any column privileges are
+ * still checked even if the reference is removed here.
+ *
+ * The SQL standard specifies that a SELECT * immediately inside EXISTS
+ * expands to not all columns but an arbitrary literal.  That is kind of the
+ * same idea, but our optimization goes further in that it throws away the
+ * entire targetlist, and not only if it was written as *.
   *
   * Returns true if was able to discard the targetlist, else false.
   */
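
The claim in the documentation hunk, that the output list of an EXISTS subquery does not affect the result, can be illustrated with a small runnable sketch. It assumes SQLite (through Python's standard-library sqlite3 module) as a stand-in so that no server is needed. It therefore shows only that the two spellings return the same answers, and says nothing about how PostgreSQL's planner discards the targetlist. The tables `customers` and `orders` and the helper `exists_results` are hypothetical names invented for this illustration.

```python
import sqlite3


def exists_results(select_list: str) -> list[tuple[int, int]]:
    """Run one EXISTS test per customer, using the given subquery output list.

    Hypothetical helper: SQLite stands in for PostgreSQL here, so this only
    demonstrates result equivalence, not any planner behavior.
    """
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE customers (id INTEGER PRIMARY KEY);
        CREATE TABLE orders (customer_id INTEGER, amount INTEGER);
        INSERT INTO customers VALUES (1), (2), (3);
        INSERT INTO orders VALUES (1, 10), (1, 20), (3, 5);
        """
    )
    # Only the subquery's output list varies; the FROM and WHERE clauses,
    # which decide whether any row exists, stay the same.
    query = f"""
        SELECT c.id,
               EXISTS (SELECT {select_list} FROM orders o
                       WHERE o.customer_id = c.id)
        FROM customers c
        ORDER BY c.id
    """
    rows = conn.execute(query).fetchall()
    conn.close()
    return rows


star = exists_results("*")  # EXISTS(SELECT * FROM ... WHERE ...)
one = exists_results("1")   # EXISTS(SELECT 1 FROM ... WHERE ...)
print(star)
print(one)
```

Both calls report the same flag for every customer, because only the presence of a matching row matters. The set-returning-function exception mentioned in the patch is specific to PostgreSQL and is not reproduced by this sketch.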