migration guide

author Mike Bayer <mike_mp@zzzcomputing.com>

Sun, 13 Oct 2013 21:16:36 +0000 (17:16 -0400)

committer Mike Bayer <mike_mp@zzzcomputing.com>

Sun, 13 Oct 2013 21:16:36 +0000 (17:16 -0400)
author Mike Bayer <mike_mp@zzzcomputing.com>
Sun, 13 Oct 2013 21:16:36 +0000 (17:16 -0400)
committer Mike Bayer <mike_mp@zzzcomputing.com>
Sun, 13 Oct 2013 21:16:36 +0000 (17:16 -0400)
diff --git a/doc/build/changelog/changelog_08.rst b/doc/build/changelog/changelog_08.rst

index b9fb941a81c23b621059223e81397d5c3b17aab0..38da8b9163d803d8c6907610c8b1f4492f012bd8 100644 (file)
--- a/doc/build/changelog/changelog_08.rst
+++ b/doc/build/changelog/changelog_08.rst
@@ -29,6 +29,10 @@
          None in 0.9 (e.g. automatic by default).   Thanks to Alexander Koval
          for help with this.
  
+        .. seealso::
+
+            :ref:`change_2836`
+
      .. change::
          :tags: bug, mysql
          :tickets: 2515
diff --git a/doc/build/changelog/migration_09.rst b/doc/build/changelog/migration_09.rst

index da01dc5c84fbb4dd333d3aaa0f3489911d0d2db5..969e2b5dad685f2ae1e11c76e13643cff1905300 100644 (file)
--- a/doc/build/changelog/migration_09.rst
+++ b/doc/build/changelog/migration_09.rst
@@ -802,6 +802,47 @@ and SQL expression values inline with the INSERT or UPDATE.  The feature takes
  place automatically when the target backend and :class:`.Table` supports
  "implicit returning".
  
+.. _change_2836:
+
+Subquery Eager Loading will apply DISTINCT to the innermost SELECT for some queries
+------------------------------------------------------------------------------------
+
+In an effort to reduce the number of duplicate rows that can be generated
+by subquery eager loading when a many-to-one relationship is involved, a
+DISTINCT keyword will be applied to the innermost SELECT when the join is
+targeting columns that do not comprise the primary key, as in when loading
+along a many to one.
+
+That is, when subquery loading on a many-to-one from A->B::
+
+    SELECT b.id AS b_id, b.name AS b_name, anon_1.b_id AS a_b_id
+    FROM (SELECT DISTINCT a_b_id FROM a) AS anon_1
+    JOIN b ON b.id = anon_1.a_b_id
+
+Since ``a.b_id`` is a non-distinct foreign key, DISTINCT is applied so that
+redundant ``a.b_id`` are eliminated.  The behavior can be turned on or off
+unconditionally for a particular :func:`.relationship` using the flag
+``distinct_target_key``, setting the value to ``True`` for unconditionally
+on, ``False`` for unconditionally off, and ``None`` for the feature to take
+effect when the target SELECT is against columns that do not comprise a full
+primary key.  In 0.9, ``None`` is the default.
+
+The option is also backported to 0.8 where the ``distinct_target_key``
+option defaults to ``False``.
+
+While the feature here is designed to help performance by eliminating
+duplicate rows, the ``DISTINCT`` keyword in SQL itself can have a negative
+performance impact.  If columns in the SELECT are not indexed, ``DISTINCT``
+will likely perform an ``ORDER BY`` on the rowset which can be expensive.
+By keeping the feature limited just to foreign keys which are hopefully
+indexed in any case, it's expected that the new defaults are reasonable.
+
+The feature also does not eliminate every possible dupe-row scenario; if
+a many-to-one is present elsewhere in the chain of joins, dupe rows may still
+be present.
+
+:ticket:`2836`
+
  
  .. _migration_1068:
author	Mike Bayer <mike_mp@zzzcomputing.com>
	Sun, 13 Oct 2013 21:16:36 +0000 (17:16 -0400)
committer	Mike Bayer <mike_mp@zzzcomputing.com>
	Sun, 13 Oct 2013 21:16:36 +0000 (17:16 -0400)
doc/build/changelog/changelog_08.rst		patch \| blob \| blame \| history
doc/build/changelog/migration_09.rst		patch \| blob \| blame \| history