4.9-stable patches

author Greg Kroah-Hartman <gregkh@linuxfoundation.org>

Thu, 10 Jan 2019 14:16:49 +0000 (15:16 +0100)

committer Greg Kroah-Hartman <gregkh@linuxfoundation.org>

Thu, 10 Jan 2019 14:16:49 +0000 (15:16 +0100)
author Greg Kroah-Hartman <gregkh@linuxfoundation.org>
Thu, 10 Jan 2019 14:16:49 +0000 (15:16 +0100)
committer Greg Kroah-Hartman <gregkh@linuxfoundation.org>
Thu, 10 Jan 2019 14:16:49 +0000 (15:16 +0100)
diff --git a/queue-4.9/libceph-fix-ceph_feature_cephx_v2-check-in-calc_signature.patch b/queue-4.9/libceph-fix-ceph_feature_cephx_v2-check-in-calc_signature.patch

new file mode 100644 (file)

index 0000000..b548bc7
--- /dev/null
+++ b/queue-4.9/libceph-fix-ceph_feature_cephx_v2-check-in-calc_signature.patch
@@ -0,0 +1,32 @@
+From idryomov@gmail.com  Thu Jan 10 15:03:58 2019
+From: Ilya Dryomov <idryomov@gmail.com>
+Date: Wed,  9 Jan 2019 15:17:09 +0100
+Subject: libceph: fix CEPH_FEATURE_CEPHX_V2 check in calc_signature()
+To: Ben Hutchings <ben.hutchings@codethink.co.uk>, Greg Kroah-Hartman <gregkh@linuxfoundation.org>
+Cc: stable@vger.kernel.org
+Message-ID: <20190109141709.4921-1-idryomov@gmail.com>
+
+From: Ilya Dryomov <idryomov@gmail.com>
+
+Upstream commit cc255c76c70f ("libceph: implement CEPHX_V2 calculation
+mode") was adjusted incorrectly: CEPH_FEATURE_CEPHX_V2 if condition got
+inverted, thus breaking 4.9.144 and later kernels for all setups that
+use cephx.
+
+Cc: Ben Hutchings <ben.hutchings@codethink.co.uk>
+Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
+---
+ net/ceph/auth_x.c |    2 +-
+ 1 file changed, 1 insertion(+), 1 deletion(-)
+
+--- a/net/ceph/auth_x.c
++++ b/net/ceph/auth_x.c
+@@ -804,7 +804,7 @@ static int calc_signature(struct ceph_x_
+       void *enc_buf = au->enc_buf;
+       int ret;
+ 
+-      if (msg->con->peer_features & CEPH_FEATURE_CEPHX_V2) {
++      if (!(msg->con->peer_features & CEPH_FEATURE_CEPHX_V2)) {
+               struct {
+                       __le32 len;
+                       __le32 header_crc;
diff --git a/queue-4.9/scsi-zfcp-fix-posting-too-many-status-read-buffers-leading-to-adapter-shutdown.patch b/queue-4.9/scsi-zfcp-fix-posting-too-many-status-read-buffers-leading-to-adapter-shutdown.patch

new file mode 100644 (file)

index 0000000..3b27f1f
--- /dev/null
+++ b/queue-4.9/scsi-zfcp-fix-posting-too-many-status-read-buffers-leading-to-adapter-shutdown.patch
@@ -0,0 +1,92 @@
+From 60a161b7e5b2a252ff0d4c622266a7d8da1120ce Mon Sep 17 00:00:00 2001
+From: Steffen Maier <maier@linux.ibm.com>
+Date: Thu, 6 Dec 2018 17:31:20 +0100
+Subject: scsi: zfcp: fix posting too many status read buffers leading to adapter shutdown
+
+From: Steffen Maier <maier@linux.ibm.com>
+
+commit 60a161b7e5b2a252ff0d4c622266a7d8da1120ce upstream.
+
+Suppose adapter (open) recovery is between opened QDIO queues and before
+(the end of) initial posting of status read buffers (SRBs). This time
+window can be seconds long due to FSF_PROT_HOST_CONNECTION_INITIALIZING
+causing by design looping with exponential increase sleeps in the function
+performing exchange config data during recovery
+[zfcp_erp_adapter_strat_fsf_xconf()]. Recovery triggered by local link up.
+
+Suppose an event occurs for which the FCP channel would send an unsolicited
+notification to zfcp by means of a previously posted SRB.  We saw it with
+local cable pull (link down) in multi-initiator zoning with multiple
+NPIV-enabled subchannels of the same shared FCP channel.
+
+As soon as zfcp_erp_adapter_strategy_open_fsf() starts posting the initial
+status read buffers from within the adapter's ERP thread, the channel does
+send an unsolicited notification.
+
+Since v2.6.27 commit d26ab06ede83 ("[SCSI] zfcp: receiving an unsolicted
+status can lead to I/O stall"), zfcp_fsf_status_read_handler() schedules
+adapter->stat_work to re-fill the just consumed SRB from a work item.
+
+Now the ERP thread and the work item post SRBs in parallel.  Both contexts
+call the helper function zfcp_status_read_refill().  The tracking of
+missing (to be posted / re-filled) SRBs is not thread-safe due to separate
+atomic_read() and atomic_dec(), in order to depend on posting
+success. Hence, both contexts can see
+atomic_read(&adapter->stat_miss) == 1. One of the two contexts posts
+one too many SRB. Zfcp gets QDIO_ERROR_SLSB_STATE on the output queue
+(trace tag "qdireq1") leading to zfcp_erp_adapter_shutdown() in
+zfcp_qdio_handler_error().
+
+An obvious and seemingly clean fix would be to schedule stat_work from the
+ERP thread and wait for it to finish. This would serialize all SRB
+re-fills. However, we already have another work item wait on the ERP
+thread: adapter->scan_work runs zfcp_fc_scan_ports() which calls
+zfcp_fc_eval_gpn_ft(). The latter calls zfcp_erp_wait() to wait for all the
+open port recoveries during zfcp auto port scan, but in fact it waits for
+any pending recovery including an adapter recovery. This approach leads to
+a deadlock.  [see also v3.19 commit 18f87a67e6d6 ("zfcp: auto port scan
+resiliency"); v2.6.37 commit d3e1088d6873
+("[SCSI] zfcp: No ERP escalation on gpn_ft eval");
+v2.6.28 commit fca55b6fb587
+("[SCSI] zfcp: fix deadlock between wq triggered port scan and ERP")
+fixing v2.6.27 commit c57a39a45a76
+("[SCSI] zfcp: wait until adapter is finished with ERP during auto-port");
+v2.6.27 commit cc8c282963bd
+("[SCSI] zfcp: Automatically attach remote ports")]
+
+Instead make the accounting of missing SRBs atomic for parallel execution
+in both the ERP thread and adapter->stat_work.
+
+Signed-off-by: Steffen Maier <maier@linux.ibm.com>
+Fixes: d26ab06ede83 ("[SCSI] zfcp: receiving an unsolicted status can lead to I/O stall")
+Cc: <stable@vger.kernel.org> #2.6.27+
+Reviewed-by: Jens Remus <jremus@linux.ibm.com>
+Signed-off-by: Martin K. Petersen <martin.petersen@oracle.com>
+Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
+
+---
+ drivers/s390/scsi/zfcp_aux.c |    6 +++---
+ 1 file changed, 3 insertions(+), 3 deletions(-)
+
+--- a/drivers/s390/scsi/zfcp_aux.c
++++ b/drivers/s390/scsi/zfcp_aux.c
+@@ -275,16 +275,16 @@ static void zfcp_free_low_mem_buffers(st
+  */
+ int zfcp_status_read_refill(struct zfcp_adapter *adapter)
+ {
+-      while (atomic_read(&adapter->stat_miss) > 0)
++      while (atomic_add_unless(&adapter->stat_miss, -1, 0))
+               if (zfcp_fsf_status_read(adapter->qdio)) {
++                      atomic_inc(&adapter->stat_miss); /* undo add -1 */
+                       if (atomic_read(&adapter->stat_miss) >=
+                           adapter->stat_read_buf_num) {
+                               zfcp_erp_adapter_reopen(adapter, 0, "axsref1");
+                               return 1;
+                       }
+                       break;
+-              } else
+-                      atomic_dec(&adapter->stat_miss);
++              }
+       return 0;
+ }
+ 
diff --git a/queue-4.9/series b/queue-4.9/series

index 8a07124b989e36517a47e3ae3ae97748b5462d46..eccc045f362c0771e86407cf35784bd810281129 100644 (file)
--- a/queue-4.9/series
+++ b/queue-4.9/series
@@ -30,3 +30,5 @@ lan78xx-resolve-issue-with-changing-mac-address.patch
  vxge-ensure-data0-is-initialized-in-when-fetching-fi.patch
  net-netxen-fix-a-missing-check-and-an-uninitialized-.patch
  serial-sunsu-fix-refcount-leak.patch
+scsi-zfcp-fix-posting-too-many-status-read-buffers-leading-to-adapter-shutdown.patch
+libceph-fix-ceph_feature_cephx_v2-check-in-calc_signature.patch
author	Greg Kroah-Hartman <gregkh@linuxfoundation.org>
	Thu, 10 Jan 2019 14:16:49 +0000 (15:16 +0100)
committer	Greg Kroah-Hartman <gregkh@linuxfoundation.org>
	Thu, 10 Jan 2019 14:16:49 +0000 (15:16 +0100)
queue-4.9/libceph-fix-ceph_feature_cephx_v2-check-in-calc_signature.patch	[new file with mode: 0644]	patch \| blob
queue-4.9/scsi-zfcp-fix-posting-too-many-status-read-buffers-leading-to-adapter-shutdown.patch	[new file with mode: 0644]	patch \| blob
queue-4.9/series		patch \| blob \| blame \| history