--- /dev/null
+From 735639b1c4e4b80643f5eee690211d68f9086260 Mon Sep 17 00:00:00 2001
+From: Sasha Levin <sashal@kernel.org>
+Date: Fri, 5 Mar 2021 10:54:57 +0530
+Subject: arm64/mm: Fix pfn_valid() for ZONE_DEVICE based memory
+
+From: Anshuman Khandual <anshuman.khandual@arm.com>
+
+[ Upstream commit eeb0753ba27b26f609e61f9950b14f1b934fe429 ]
+
+pfn_valid() validates a pfn, but what it basically checks is whether a
+valid struct page backs that pfn. It should therefore always return true
+for memory ranges backed with a struct page mapping. But currently
+pfn_valid() fails for all ZONE_DEVICE based memory types even though they
+have struct page mappings.
+
+pfn_valid() asserts that there is a memblock entry for a given pfn without
+the MEMBLOCK_NOMAP flag set. The problem with ZONE_DEVICE based memory is
+that it does not have memblock entries. Hence memblock_is_map_memory() will
+invariably fail via memblock_search() for a ZONE_DEVICE based address,
+which in turn makes pfn_valid() fail, and that is wrong.
+memblock_is_map_memory() needs to be skipped for such memory ranges. As
+ZONE_DEVICE memory gets hotplugged into the system via memremap_pages()
+called from a driver, its memory sections will not have SECTION_IS_EARLY
+set.
+
+Normal hotplug memory will never have MEMBLOCK_NOMAP set in its memblock
+regions, because the MEMBLOCK_NOMAP flag was specifically designed for,
+and is only set on, firmware reserved memory regions.
+memblock_is_map_memory() can just be skipped, as it is always going to
+return true there, which is an optimization for normal hotplug memory.
+Like ZONE_DEVICE based memory, normal hotplugged memory will not have
+SECTION_IS_EARLY set for its sections either.
+
+Skipping memblock_is_map_memory() for all non-early memory sections fixes
+the pfn_valid() problem for ZONE_DEVICE based memory and also improves its
+performance for normal hotplug memory.
+
+Cc: Catalin Marinas <catalin.marinas@arm.com>
+Cc: Will Deacon <will@kernel.org>
+Cc: Ard Biesheuvel <ardb@kernel.org>
+Cc: Robin Murphy <robin.murphy@arm.com>
+Cc: linux-arm-kernel@lists.infradead.org
+Cc: linux-kernel@vger.kernel.org
+Acked-by: David Hildenbrand <david@redhat.com>
+Fixes: 73b20c84d42d ("arm64: mm: implement pte_devmap support")
+Signed-off-by: Anshuman Khandual <anshuman.khandual@arm.com>
+Acked-by: Catalin Marinas <catalin.marinas@arm.com>
+Link: https://lore.kernel.org/r/1614921898-4099-2-git-send-email-anshuman.khandual@arm.com
+Signed-off-by: Will Deacon <will@kernel.org>
+Signed-off-by: Sasha Levin <sashal@kernel.org>
+---
+ arch/arm64/mm/init.c | 12 ++++++++++++
+ 1 file changed, 12 insertions(+)
+
+diff --git a/arch/arm64/mm/init.c b/arch/arm64/mm/init.c
+index b913844ab740..916e0547fdcc 100644
+--- a/arch/arm64/mm/init.c
++++ b/arch/arm64/mm/init.c
+@@ -218,6 +218,18 @@ int pfn_valid(unsigned long pfn)
+
+ if (!valid_section(__pfn_to_section(pfn)))
+ return 0;
++
++ /*
++ * ZONE_DEVICE memory does not have the memblock entries.
++ * memblock_is_map_memory() check for ZONE_DEVICE based
++ * addresses will always fail. Even the normal hotplugged
++ * memory will never have MEMBLOCK_NOMAP flag set in their
++ * memblock entries. Skip memblock search for all non early
++ * memory sections covering all of hotplug memory including
++ * both normal and ZONE_DEVICE based.
++ */
++ if (!early_section(__pfn_to_section(pfn)))
++ return pfn_section_valid(__pfn_to_section(pfn), pfn);
+ #endif
+ return memblock_is_map_memory(addr);
+ }
+--
+2.30.1
+
--- /dev/null
+From 11c594563e388e924b6cd862538c804017bfe298 Mon Sep 17 00:00:00 2001
+From: Sasha Levin <sashal@kernel.org>
+Date: Wed, 10 Mar 2021 18:15:11 +0100
+Subject: arm64: mm: use a 48-bit ID map when possible on 52-bit VA builds
+
+From: Ard Biesheuvel <ardb@kernel.org>
+
+[ Upstream commit 7ba8f2b2d652cd8d8a2ab61f4be66973e70f9f88 ]
+
+52-bit VA kernels can run on hardware that is only 48-bit capable, but
+configure the ID map as 52-bit by default. This was not a problem until
+recently, because the special T0SZ value for a 52-bit VA space was never
+programmed into the TCR register anyway, and because a 52-bit ID map
+happens to use the same number of translation levels as a 48-bit one.
+
+This behavior was changed by commit 1401bef703a4 ("arm64: mm: Always update
+TCR_EL1 from __cpu_set_tcr_t0sz()"), which causes the unsupported T0SZ
+value for a 52-bit VA to be programmed into TCR_EL1. While some hardware
+simply ignores this, Mark reports that Amberwing systems choke on this,
+resulting in a broken boot. But even before that commit, the unsupported
+idmap_t0sz value was exposed to KVM and used to program TCR_EL2 incorrectly
+as well.
+
+Given that we already have to deal with address spaces being either 48-bit
+or 52-bit in size, the cleanest approach seems to be to simply default to
+a 48-bit VA ID map, and only switch to a 52-bit one if the placement of the
+kernel in DRAM requires it. This is guaranteed not to happen unless the
+system is actually 52-bit VA capable.
+
+Fixes: 90ec95cda91a ("arm64: mm: Introduce VA_BITS_MIN")
+Reported-by: Mark Salter <msalter@redhat.com>
+Link: http://lore.kernel.org/r/20210310003216.410037-1-msalter@redhat.com
+Signed-off-by: Ard Biesheuvel <ardb@kernel.org>
+Link: https://lore.kernel.org/r/20210310171515.416643-2-ardb@kernel.org
+Signed-off-by: Will Deacon <will@kernel.org>
+Signed-off-by: Sasha Levin <sashal@kernel.org>
+---
+ arch/arm64/include/asm/mmu_context.h | 5 +----
+ arch/arm64/kernel/head.S | 2 +-
+ arch/arm64/mm/mmu.c | 2 +-
+ 3 files changed, 3 insertions(+), 6 deletions(-)
+
+diff --git a/arch/arm64/include/asm/mmu_context.h b/arch/arm64/include/asm/mmu_context.h
+index 0672236e1aea..4e2ba9477845 100644
+--- a/arch/arm64/include/asm/mmu_context.h
++++ b/arch/arm64/include/asm/mmu_context.h
+@@ -65,10 +65,7 @@ extern u64 idmap_ptrs_per_pgd;
+
+ static inline bool __cpu_uses_extended_idmap(void)
+ {
+- if (IS_ENABLED(CONFIG_ARM64_VA_BITS_52))
+- return false;
+-
+- return unlikely(idmap_t0sz != TCR_T0SZ(VA_BITS));
++ return unlikely(idmap_t0sz != TCR_T0SZ(vabits_actual));
+ }
+
+ /*
+diff --git a/arch/arm64/kernel/head.S b/arch/arm64/kernel/head.S
+index e7550a5289fe..78cdd6b24172 100644
+--- a/arch/arm64/kernel/head.S
++++ b/arch/arm64/kernel/head.S
+@@ -334,7 +334,7 @@ SYM_FUNC_START_LOCAL(__create_page_tables)
+ */
+ adrp x5, __idmap_text_end
+ clz x5, x5
+- cmp x5, TCR_T0SZ(VA_BITS) // default T0SZ small enough?
++ cmp x5, TCR_T0SZ(VA_BITS_MIN) // default T0SZ small enough?
+ b.ge 1f // .. then skip VA range extension
+
+ adr_l x6, idmap_t0sz
+diff --git a/arch/arm64/mm/mmu.c b/arch/arm64/mm/mmu.c
+index f0125bb09fa3..6aabf1eced31 100644
+--- a/arch/arm64/mm/mmu.c
++++ b/arch/arm64/mm/mmu.c
+@@ -40,7 +40,7 @@
+ #define NO_BLOCK_MAPPINGS BIT(0)
+ #define NO_CONT_MAPPINGS BIT(1)
+
+-u64 idmap_t0sz = TCR_T0SZ(VA_BITS);
++u64 idmap_t0sz = TCR_T0SZ(VA_BITS_MIN);
+ u64 idmap_ptrs_per_pgd = PTRS_PER_PGD;
+
+ u64 __section(".mmuoff.data.write") vabits_actual;
+--
+2.30.1
+
--- /dev/null
+From fe8f1931deb79091b43eb72207c142c19ec1ed15 Mon Sep 17 00:00:00 2001
+From: Sasha Levin <sashal@kernel.org>
+Date: Tue, 9 Mar 2021 19:30:17 -0800
+Subject: block: rsxx: fix error return code of rsxx_pci_probe()
+
+From: Jia-Ju Bai <baijiaju1990@gmail.com>
+
+[ Upstream commit df66617bfe87487190a60783d26175b65d2502ce ]
+
+When create_singlethread_workqueue() returns NULL for card->event_wq,
+rsxx_pci_probe() returns without assigning an error code.
+
+To fix this bug, assign -ENOMEM to st in this case.
+
+Fixes: 8722ff8cdbfa ("block: IBM RamSan 70/80 device driver")
+Reported-by: TOTE Robot <oslab@tsinghua.edu.cn>
+Signed-off-by: Jia-Ju Bai <baijiaju1990@gmail.com>
+Link: https://lore.kernel.org/r/20210310033017.4023-1-baijiaju1990@gmail.com
+Signed-off-by: Jens Axboe <axboe@kernel.dk>
+Signed-off-by: Sasha Levin <sashal@kernel.org>
+---
+ drivers/block/rsxx/core.c | 1 +
+ 1 file changed, 1 insertion(+)
+
+diff --git a/drivers/block/rsxx/core.c b/drivers/block/rsxx/core.c
+index 5ac1881396af..227e1be4c6f9 100644
+--- a/drivers/block/rsxx/core.c
++++ b/drivers/block/rsxx/core.c
+@@ -871,6 +871,7 @@ static int rsxx_pci_probe(struct pci_dev *dev,
+ card->event_wq = create_singlethread_workqueue(DRIVER_NAME"_event");
+ if (!card->event_wq) {
+ dev_err(CARD_TO_DEV(card), "Failed card event setup.\n");
++ st = -ENOMEM;
+ goto failed_event_handler;
+ }
+
+--
+2.30.1
+
--- /dev/null
+From 30570a0f5f976a75cc6f7ae00fbb447084a50bb7 Mon Sep 17 00:00:00 2001
+From: Sasha Levin <sashal@kernel.org>
+Date: Mon, 1 Mar 2021 14:10:53 +0800
+Subject: configfs: fix a use-after-free in __configfs_open_file
+
+From: Daiyue Zhang <zhangdaiyue1@huawei.com>
+
+[ Upstream commit 14fbbc8297728e880070f7b077b3301a8c698ef9 ]
+
+Commit b0841eefd969 ("configfs: provide exclusion between IO and removals")
+uses ->frag_dead to mark the fragment state, so it no longer bothers with
+an extra refcount on the config_item when opening a file. The
+configfs_get_config_item() call was removed from __configfs_open_file(),
+but the matching config_item_put() was not. So the refcount on the
+config_item loses its balance, causing use-after-free issues on occasions
+like this:
+
+Test:
+1. Mount configfs on /config with read-only items:
+drwxrwx--- 289 root root 0 2021-04-01 11:55 /config
+drwxr-xr-x 2 root root 0 2021-04-01 11:54 /config/a
+--w--w--w- 1 root root 4096 2021-04-01 11:53 /config/a/1.txt
+......
+
+2. Then run:
+for file in /config
+do
+echo $file
+grep -R 'key' $file
+done
+
+3. __configfs_open_file will be called in parallel, the first one
+got called will do:
+if (file->f_mode & FMODE_READ) {
+ if (!(inode->i_mode & S_IRUGO))
+ goto out_put_module;
+ config_item_put(buffer->item);
+ kref_put()
+ package_details_release()
+ kfree()
+
+the other one will run into use-after-free issues like this:
+BUG: KASAN: use-after-free in __configfs_open_file+0x1bc/0x3b0
+Read of size 8 at addr fffffff155f02480 by task grep/13096
+CPU: 0 PID: 13096 Comm: grep VIP: 00 Tainted: G W 4.14.116-kasan #1
+TGID: 13096 Comm: grep
+Call trace:
+dump_stack+0x118/0x160
+kasan_report+0x22c/0x294
+__asan_load8+0x80/0x88
+__configfs_open_file+0x1bc/0x3b0
+configfs_open_file+0x28/0x34
+do_dentry_open+0x2cc/0x5c0
+vfs_open+0x80/0xe0
+path_openat+0xd8c/0x2988
+do_filp_open+0x1c4/0x2fc
+do_sys_open+0x23c/0x404
+SyS_openat+0x38/0x48
+
+Allocated by task 2138:
+kasan_kmalloc+0xe0/0x1ac
+kmem_cache_alloc_trace+0x334/0x394
+packages_make_item+0x4c/0x180
+configfs_mkdir+0x358/0x740
+vfs_mkdir2+0x1bc/0x2e8
+SyS_mkdirat+0x154/0x23c
+el0_svc_naked+0x34/0x38
+
+Freed by task 13096:
+kasan_slab_free+0xb8/0x194
+kfree+0x13c/0x910
+package_details_release+0x524/0x56c
+kref_put+0xc4/0x104
+config_item_put+0x24/0x34
+__configfs_open_file+0x35c/0x3b0
+configfs_open_file+0x28/0x34
+do_dentry_open+0x2cc/0x5c0
+vfs_open+0x80/0xe0
+path_openat+0xd8c/0x2988
+do_filp_open+0x1c4/0x2fc
+do_sys_open+0x23c/0x404
+SyS_openat+0x38/0x48
+el0_svc_naked+0x34/0x38
+
+To fix this issue, remove the config_item_put() call in
+__configfs_open_file() to rebalance the refcount of the config_item.
+
+Fixes: b0841eefd969 ("configfs: provide exclusion between IO and removals")
+Signed-off-by: Daiyue Zhang <zhangdaiyue1@huawei.com>
+Signed-off-by: Yi Chen <chenyi77@huawei.com>
+Signed-off-by: Ge Qiu <qiuge@huawei.com>
+Reviewed-by: Chao Yu <yuchao0@huawei.com>
+Acked-by: Al Viro <viro@zeniv.linux.org.uk>
+Signed-off-by: Christoph Hellwig <hch@lst.de>
+Signed-off-by: Sasha Levin <sashal@kernel.org>
+---
+ fs/configfs/file.c | 6 ++----
+ 1 file changed, 2 insertions(+), 4 deletions(-)
+
+diff --git a/fs/configfs/file.c b/fs/configfs/file.c
+index 1f0270229d7b..da8351d1e455 100644
+--- a/fs/configfs/file.c
++++ b/fs/configfs/file.c
+@@ -378,7 +378,7 @@ static int __configfs_open_file(struct inode *inode, struct file *file, int type
+
+ attr = to_attr(dentry);
+ if (!attr)
+- goto out_put_item;
++ goto out_free_buffer;
+
+ if (type & CONFIGFS_ITEM_BIN_ATTR) {
+ buffer->bin_attr = to_bin_attr(dentry);
+@@ -391,7 +391,7 @@ static int __configfs_open_file(struct inode *inode, struct file *file, int type
+ /* Grab the module reference for this attribute if we have one */
+ error = -ENODEV;
+ if (!try_module_get(buffer->owner))
+- goto out_put_item;
++ goto out_free_buffer;
+
+ error = -EACCES;
+ if (!buffer->item->ci_type)
+@@ -435,8 +435,6 @@ static int __configfs_open_file(struct inode *inode, struct file *file, int type
+
+ out_put_module:
+ module_put(buffer->owner);
+-out_put_item:
+- config_item_put(buffer->item);
+ out_free_buffer:
+ up_read(&frag->frag_sem);
+ kfree(buffer);
+--
+2.30.1
+
--- /dev/null
+From 47f807108df287e41368def9e117568d891e4cf9 Mon Sep 17 00:00:00 2001
+From: Sasha Levin <sashal@kernel.org>
+Date: Sun, 28 Feb 2021 09:33:19 +0800
+Subject: cpufreq: qcom-hw: fix dereferencing freed memory 'data'
+
+From: Shawn Guo <shawn.guo@linaro.org>
+
+[ Upstream commit 02fc409540303801994d076fcdb7064bd634dbf3 ]
+
+Commit 67fc209b527d ("cpufreq: qcom-hw: drop devm_xxx() calls from
+init/exit hooks") introduced an issue of dereferencing freed memory:
+the error path kfree()s 'data' and then passes data->base to iounmap().
+Fix it by unmapping the saved 'base' pointer instead.
+
+Fixes: 67fc209b527d ("cpufreq: qcom-hw: drop devm_xxx() calls from init/exit hooks")
+Reported-by: kernel test robot <lkp@intel.com>
+Reported-by: Dan Carpenter <dan.carpenter@oracle.com>
+Signed-off-by: Shawn Guo <shawn.guo@linaro.org>
+Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org>
+Signed-off-by: Sasha Levin <sashal@kernel.org>
+---
+ drivers/cpufreq/qcom-cpufreq-hw.c | 2 +-
+ 1 file changed, 1 insertion(+), 1 deletion(-)
+
+diff --git a/drivers/cpufreq/qcom-cpufreq-hw.c b/drivers/cpufreq/qcom-cpufreq-hw.c
+index 2726e77c9e5a..5cdd20e38771 100644
+--- a/drivers/cpufreq/qcom-cpufreq-hw.c
++++ b/drivers/cpufreq/qcom-cpufreq-hw.c
+@@ -368,7 +368,7 @@ static int qcom_cpufreq_hw_cpu_init(struct cpufreq_policy *policy)
+ error:
+ kfree(data);
+ unmap_base:
+- iounmap(data->base);
++ iounmap(base);
+ release_region:
+ release_mem_region(res->start, resource_size(res));
+ return ret;
+--
+2.30.1
+
--- /dev/null
+From 8adef8d5d56afef6a424c72fca2da25364ebd5fe Mon Sep 17 00:00:00 2001
+From: Sasha Levin <sashal@kernel.org>
+Date: Thu, 4 Mar 2021 10:04:23 +0000
+Subject: cpufreq: qcom-hw: Fix return value check in
+ qcom_cpufreq_hw_cpu_init()
+
+From: Wei Yongjun <weiyongjun1@huawei.com>
+
+[ Upstream commit 536eb97abeba857126ad055de5923fa592acef25 ]
+
+In case of error, ioremap() returns a NULL pointer, not ERR_PTR().
+The IS_ERR() test in the return value check should therefore be
+replaced with a NULL test.
+
+Fixes: 67fc209b527d ("cpufreq: qcom-hw: drop devm_xxx() calls from init/exit hooks")
+Reported-by: Hulk Robot <hulkci@huawei.com>
+Signed-off-by: Wei Yongjun <weiyongjun1@huawei.com>
+Acked-by: Shawn Guo <shawn.guo@linaro.org>
+Signed-off-by: Viresh Kumar <viresh.kumar@linaro.org>
+Signed-off-by: Sasha Levin <sashal@kernel.org>
+---
+ drivers/cpufreq/qcom-cpufreq-hw.c | 4 ++--
+ 1 file changed, 2 insertions(+), 2 deletions(-)
+
+diff --git a/drivers/cpufreq/qcom-cpufreq-hw.c b/drivers/cpufreq/qcom-cpufreq-hw.c
+index 5cdd20e38771..6de07556665b 100644
+--- a/drivers/cpufreq/qcom-cpufreq-hw.c
++++ b/drivers/cpufreq/qcom-cpufreq-hw.c
+@@ -317,9 +317,9 @@ static int qcom_cpufreq_hw_cpu_init(struct cpufreq_policy *policy)
+ }
+
+ base = ioremap(res->start, resource_size(res));
+- if (IS_ERR(base)) {
++ if (!base) {
+ dev_err(dev, "failed to map resource %pR\n", res);
+- ret = PTR_ERR(base);
++ ret = -ENOMEM;
+ goto release_region;
+ }
+
+--
+2.30.1
+
--- /dev/null
+From cbc5f24de297c9aaf0bda598819d4f9cc4e5f16e Mon Sep 17 00:00:00 2001
+From: Sasha Levin <sashal@kernel.org>
+Date: Tue, 23 Feb 2021 17:02:40 +0100
+Subject: hrtimer: Update softirq_expires_next correctly after
+ __hrtimer_get_next_event()
+
+From: Anna-Maria Behnsen <anna-maria@linutronix.de>
+
+[ Upstream commit 46eb1701c046cc18c032fa68f3c8ccbf24483ee4 ]
+
+hrtimer_force_reprogram() and hrtimer_interrupt() invoke
+__hrtimer_get_next_event() to find the earliest expiry time of hrtimer
+bases. __hrtimer_get_next_event() does not update
+cpu_base::[softirq_]_expires_next to preserve reprogramming logic. That
+needs to be done at the callsites.
+
+hrtimer_force_reprogram() updates cpu_base::softirq_expires_next only when
+the first expiring timer is a softirq timer and the soft interrupt is not
+activated. That's wrong, because cpu_base::softirq_expires_next is left
+stale when the first expiring timer of all bases is a timer which expires
+in hard interrupt context. hrtimer_interrupt() never updates
+cpu_base::softirq_expires_next, which is wrong too.
+
+That becomes a problem when clock_settime() sets CLOCK_REALTIME forward and
+the first soft expiring timer is in the CLOCK_REALTIME_SOFT base. Setting
+CLOCK_REALTIME forward moves the clock MONOTONIC based expiry time of that
+timer before the stale cpu_base::softirq_expires_next.
+
+cpu_base::softirq_expires_next is cached to make the check for raising the
+soft interrupt fast. In the above case the soft interrupt won't be raised
+until clock monotonic reaches the stale cpu_base::softirq_expires_next
+value. That's incorrect, but what's worse is that if the softirq timer
+becomes the first expiring timer of all clock bases after the hard expiry
+timer has been handled the reprogramming of the clockevent from
+hrtimer_interrupt() will result in an interrupt storm. That happens because
+the reprogramming does not use cpu_base::softirq_expires_next, it uses
+__hrtimer_get_next_event() which returns the actual expiry time. Once clock
+MONOTONIC reaches cpu_base::softirq_expires_next the soft interrupt is
+raised and the storm subsides.
+
+Change the logic in hrtimer_force_reprogram() to evaluate the soft and hard
+bases separately, update softirq_expires_next and handle the case when a
+soft expiring timer is the first of all bases by comparing the expiry times
+and updating the required cpu base fields. Split this functionality into a
+separate function to be able to use it in hrtimer_interrupt() as well
+without copy paste.
+
+Fixes: 5da70160462e ("hrtimer: Implement support for softirq based hrtimers")
+Reported-by: Mikael Beckius <mikael.beckius@windriver.com>
+Suggested-by: Thomas Gleixner <tglx@linutronix.de>
+Tested-by: Mikael Beckius <mikael.beckius@windriver.com>
+Signed-off-by: Anna-Maria Behnsen <anna-maria@linutronix.de>
+Signed-off-by: Thomas Gleixner <tglx@linutronix.de>
+Signed-off-by: Ingo Molnar <mingo@kernel.org>
+Link: https://lore.kernel.org/r/20210223160240.27518-1-anna-maria@linutronix.de
+Signed-off-by: Sasha Levin <sashal@kernel.org>
+---
+ kernel/time/hrtimer.c | 60 ++++++++++++++++++++++++++++---------------
+ 1 file changed, 39 insertions(+), 21 deletions(-)
+
+diff --git a/kernel/time/hrtimer.c b/kernel/time/hrtimer.c
+index 387b4bef7dd1..4416f5d72c11 100644
+--- a/kernel/time/hrtimer.c
++++ b/kernel/time/hrtimer.c
+@@ -546,8 +546,11 @@ static ktime_t __hrtimer_next_event_base(struct hrtimer_cpu_base *cpu_base,
+ }
+
+ /*
+- * Recomputes cpu_base::*next_timer and returns the earliest expires_next but
+- * does not set cpu_base::*expires_next, that is done by hrtimer_reprogram.
++ * Recomputes cpu_base::*next_timer and returns the earliest expires_next
++ * but does not set cpu_base::*expires_next, that is done by
++ * hrtimer[_force]_reprogram and hrtimer_interrupt only. When updating
++ * cpu_base::*expires_next right away, reprogramming logic would no longer
++ * work.
+ *
+ * When a softirq is pending, we can ignore the HRTIMER_ACTIVE_SOFT bases,
+ * those timers will get run whenever the softirq gets handled, at the end of
+@@ -588,6 +591,37 @@ __hrtimer_get_next_event(struct hrtimer_cpu_base *cpu_base, unsigned int active_
+ return expires_next;
+ }
+
++static ktime_t hrtimer_update_next_event(struct hrtimer_cpu_base *cpu_base)
++{
++ ktime_t expires_next, soft = KTIME_MAX;
++
++ /*
++ * If the soft interrupt has already been activated, ignore the
++ * soft bases. They will be handled in the already raised soft
++ * interrupt.
++ */
++ if (!cpu_base->softirq_activated) {
++ soft = __hrtimer_get_next_event(cpu_base, HRTIMER_ACTIVE_SOFT);
++ /*
++ * Update the soft expiry time. clock_settime() might have
++ * affected it.
++ */
++ cpu_base->softirq_expires_next = soft;
++ }
++
++ expires_next = __hrtimer_get_next_event(cpu_base, HRTIMER_ACTIVE_HARD);
++ /*
++ * If a softirq timer is expiring first, update cpu_base->next_timer
++ * and program the hardware with the soft expiry time.
++ */
++ if (expires_next > soft) {
++ cpu_base->next_timer = cpu_base->softirq_next_timer;
++ expires_next = soft;
++ }
++
++ return expires_next;
++}
++
+ static inline ktime_t hrtimer_update_base(struct hrtimer_cpu_base *base)
+ {
+ ktime_t *offs_real = &base->clock_base[HRTIMER_BASE_REALTIME].offset;
+@@ -628,23 +662,7 @@ hrtimer_force_reprogram(struct hrtimer_cpu_base *cpu_base, int skip_equal)
+ {
+ ktime_t expires_next;
+
+- /*
+- * Find the current next expiration time.
+- */
+- expires_next = __hrtimer_get_next_event(cpu_base, HRTIMER_ACTIVE_ALL);
+-
+- if (cpu_base->next_timer && cpu_base->next_timer->is_soft) {
+- /*
+- * When the softirq is activated, hrtimer has to be
+- * programmed with the first hard hrtimer because soft
+- * timer interrupt could occur too late.
+- */
+- if (cpu_base->softirq_activated)
+- expires_next = __hrtimer_get_next_event(cpu_base,
+- HRTIMER_ACTIVE_HARD);
+- else
+- cpu_base->softirq_expires_next = expires_next;
+- }
++ expires_next = hrtimer_update_next_event(cpu_base);
+
+ if (skip_equal && expires_next == cpu_base->expires_next)
+ return;
+@@ -1644,8 +1662,8 @@ void hrtimer_interrupt(struct clock_event_device *dev)
+
+ __hrtimer_run_queues(cpu_base, now, flags, HRTIMER_ACTIVE_HARD);
+
+- /* Reevaluate the clock bases for the next expiry */
+- expires_next = __hrtimer_get_next_event(cpu_base, HRTIMER_ACTIVE_ALL);
++ /* Reevaluate the clock bases for the [soft] next expiry */
++ expires_next = hrtimer_update_next_event(cpu_base);
+ /*
+ * Store the new expiry value so the migration code can verify
+ * against it.
+--
+2.30.1
+
--- /dev/null
+From 3962aab7f25e05c682634e9e1472817f443c374c Mon Sep 17 00:00:00 2001
+From: Sasha Levin <sashal@kernel.org>
+Date: Fri, 12 Mar 2021 21:08:03 -0800
+Subject: include/linux/sched/mm.h: use rcu_dereference in in_vfork()
+
+From: Matthew Wilcox (Oracle) <willy@infradead.org>
+
+[ Upstream commit 149fc787353f65b7e72e05e7b75d34863266c3e2 ]
+
+Fix a sparse warning by using rcu_dereference(). Technically this is a
+bug and a sufficiently aggressive compiler could reload the `real_parent'
+pointer outside the protection of the rcu lock (and access freed memory),
+but I think it's pretty unlikely to happen.
+
+Link: https://lkml.kernel.org/r/20210221194207.1351703-1-willy@infradead.org
+Fixes: b18dc5f291c0 ("mm, oom: skip vforked tasks from being selected")
+Signed-off-by: Matthew Wilcox (Oracle) <willy@infradead.org>
+Reviewed-by: Miaohe Lin <linmiaohe@huawei.com>
+Acked-by: Michal Hocko <mhocko@suse.com>
+Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
+Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
+Signed-off-by: Sasha Levin <sashal@kernel.org>
+---
+ include/linux/sched/mm.h | 3 ++-
+ 1 file changed, 2 insertions(+), 1 deletion(-)
+
+diff --git a/include/linux/sched/mm.h b/include/linux/sched/mm.h
+index d5ece7a9a403..dc1f4dcd9a82 100644
+--- a/include/linux/sched/mm.h
++++ b/include/linux/sched/mm.h
+@@ -140,7 +140,8 @@ static inline bool in_vfork(struct task_struct *tsk)
+ * another oom-unkillable task does this it should blame itself.
+ */
+ rcu_read_lock();
+- ret = tsk->vfork_done && tsk->real_parent->mm == tsk->mm;
++ ret = tsk->vfork_done &&
++ rcu_dereference(tsk->real_parent)->mm == tsk->mm;
+ rcu_read_unlock();
+
+ return ret;
+--
+2.30.1
+
--- /dev/null
+From 95947a3f8159e2a84e7623acd7e9d2dfac28f7ca Mon Sep 17 00:00:00 2001
+From: Sasha Levin <sashal@kernel.org>
+Date: Sun, 7 Mar 2021 19:11:02 -0800
+Subject: net: bonding: fix error return code of bond_neigh_init()
+
+From: Jia-Ju Bai <baijiaju1990@gmail.com>
+
+[ Upstream commit 2055a99da8a253a357bdfd359b3338ef3375a26c ]
+
+When slave is NULL or slave_ops->ndo_neigh_setup is NULL,
+bond_neigh_init() returns without assigning an error code.
+To fix this bug, assign -EINVAL to ret in these cases.
+
+Fixes: 9e99bfefdbce ("bonding: fix bond_neigh_init()")
+Reported-by: TOTE Robot <oslab@tsinghua.edu.cn>
+Signed-off-by: Jia-Ju Bai <baijiaju1990@gmail.com>
+Signed-off-by: David S. Miller <davem@davemloft.net>
+Signed-off-by: Sasha Levin <sashal@kernel.org>
+---
+ drivers/net/bonding/bond_main.c | 8 ++++++--
+ 1 file changed, 6 insertions(+), 2 deletions(-)
+
+diff --git a/drivers/net/bonding/bond_main.c b/drivers/net/bonding/bond_main.c
+index 47afc5938c26..6d5a39af1097 100644
+--- a/drivers/net/bonding/bond_main.c
++++ b/drivers/net/bonding/bond_main.c
+@@ -3918,11 +3918,15 @@ static int bond_neigh_init(struct neighbour *n)
+
+ rcu_read_lock();
+ slave = bond_first_slave_rcu(bond);
+- if (!slave)
++ if (!slave) {
++ ret = -EINVAL;
+ goto out;
++ }
+ slave_ops = slave->dev->netdev_ops;
+- if (!slave_ops->ndo_neigh_setup)
++ if (!slave_ops->ndo_neigh_setup) {
++ ret = -EINVAL;
+ goto out;
++ }
+
+ /* TODO: find another way [1] to implement this.
+ * Passing a zeroed structure is fragile,
+--
+2.30.1
+
--- /dev/null
+From 24eb827794b84a32682c4d9d383ce19becbe3c9f Mon Sep 17 00:00:00 2001
+From: Sasha Levin <sashal@kernel.org>
+Date: Mon, 8 Mar 2021 14:42:52 -0500
+Subject: NFS: Don't gratuitously clear the inode cache when lookup failed
+
+From: Trond Myklebust <trond.myklebust@hammerspace.com>
+
+[ Upstream commit 47397915ede0192235474b145ebcd81b37b03624 ]
+
+The fact that the lookup revalidation failed does not mean that the
+inode contents have changed.
+
+Fixes: 5ceb9d7fdaaf ("NFS: Refactor nfs_lookup_revalidate()")
+Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>
+Signed-off-by: Anna Schumaker <Anna.Schumaker@Netapp.com>
+Signed-off-by: Sasha Levin <sashal@kernel.org>
+---
+ fs/nfs/dir.c | 20 ++++++++------------
+ 1 file changed, 8 insertions(+), 12 deletions(-)
+
+diff --git a/fs/nfs/dir.c b/fs/nfs/dir.c
+index c96ae135b80f..c837675cd395 100644
+--- a/fs/nfs/dir.c
++++ b/fs/nfs/dir.c
+@@ -1245,18 +1245,14 @@ nfs_lookup_revalidate_done(struct inode *dir, struct dentry *dentry,
+ __func__, dentry);
+ return 1;
+ case 0:
+- if (inode && S_ISDIR(inode->i_mode)) {
+- /* Purge readdir caches. */
+- nfs_zap_caches(inode);
+- /*
+- * We can't d_drop the root of a disconnected tree:
+- * its d_hash is on the s_anon list and d_drop() would hide
+- * it from shrink_dcache_for_unmount(), leading to busy
+- * inodes on unmount and further oopses.
+- */
+- if (IS_ROOT(dentry))
+- return 1;
+- }
++ /*
++ * We can't d_drop the root of a disconnected tree:
++ * its d_hash is on the s_anon list and d_drop() would hide
++ * it from shrink_dcache_for_unmount(), leading to busy
++ * inodes on unmount and further oopses.
++ */
++ if (inode && IS_ROOT(dentry))
++ return 1;
+ dfprintk(LOOKUPCACHE, "NFS: %s(%pd2) is invalid\n",
+ __func__, dentry);
+ return 0;
+--
+2.30.1
+
--- /dev/null
+From a83ff25071c529bc82b7f416093a186795dbe8d3 Mon Sep 17 00:00:00 2001
+From: Sasha Levin <sashal@kernel.org>
+Date: Mon, 8 Mar 2021 14:42:51 -0500
+Subject: NFS: Don't revalidate the directory permissions on a lookup failure
+
+From: Trond Myklebust <trond.myklebust@hammerspace.com>
+
+[ Upstream commit 82e7ca1334ab16e2e04fafded1cab9dfcdc11b40 ]
+
+There should be no reason to expect the directory permissions to change
+just because the directory contents changed or a negative lookup timed
+out. So let's avoid doing a full call to nfs_mark_for_revalidate() in
+that case.
+Furthermore, if this is a negative dentry, and we haven't actually done
+a new lookup, then we have no reason yet to believe the directory has
+changed at all. So let's remove the gratuitous directory inode
+invalidation altogether when called from
+nfs_lookup_revalidate_negative().
+
+Reported-by: Geert Jansen <gerardu@amazon.com>
+Fixes: 5ceb9d7fdaaf ("NFS: Refactor nfs_lookup_revalidate()")
+Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>
+Signed-off-by: Anna Schumaker <Anna.Schumaker@Netapp.com>
+Signed-off-by: Sasha Levin <sashal@kernel.org>
+---
+ fs/nfs/dir.c | 20 +++++++++++++++++---
+ 1 file changed, 17 insertions(+), 3 deletions(-)
+
+diff --git a/fs/nfs/dir.c b/fs/nfs/dir.c
+index 4e011adaf967..c96ae135b80f 100644
+--- a/fs/nfs/dir.c
++++ b/fs/nfs/dir.c
+@@ -1202,6 +1202,15 @@ int nfs_lookup_verify_inode(struct inode *inode, unsigned int flags)
+ goto out;
+ }
+
++static void nfs_mark_dir_for_revalidate(struct inode *inode)
++{
++ struct nfs_inode *nfsi = NFS_I(inode);
++
++ spin_lock(&inode->i_lock);
++ nfsi->cache_validity |= NFS_INO_REVAL_PAGECACHE;
++ spin_unlock(&inode->i_lock);
++}
++
+ /*
+ * We judge how long we want to trust negative
+ * dentries by looking at the parent inode mtime.
+@@ -1236,7 +1245,6 @@ nfs_lookup_revalidate_done(struct inode *dir, struct dentry *dentry,
+ __func__, dentry);
+ return 1;
+ case 0:
+- nfs_mark_for_revalidate(dir);
+ if (inode && S_ISDIR(inode->i_mode)) {
+ /* Purge readdir caches. */
+ nfs_zap_caches(inode);
+@@ -1326,6 +1334,13 @@ nfs_lookup_revalidate_dentry(struct inode *dir, struct dentry *dentry,
+ nfs_free_fattr(fattr);
+ nfs_free_fhandle(fhandle);
+ nfs4_label_free(label);
++
++ /*
++ * If the lookup failed despite the dentry change attribute being
++ * a match, then we should revalidate the directory cache.
++ */
++ if (!ret && nfs_verify_change_attribute(dir, dentry->d_time))
++ nfs_mark_dir_for_revalidate(dir);
+ return nfs_lookup_revalidate_done(dir, dentry, inode, ret);
+ }
+
+@@ -1368,7 +1383,7 @@ nfs_do_lookup_revalidate(struct inode *dir, struct dentry *dentry,
+ error = nfs_lookup_verify_inode(inode, flags);
+ if (error) {
+ if (error == -ESTALE)
+- nfs_zap_caches(dir);
++ nfs_mark_dir_for_revalidate(dir);
+ goto out_bad;
+ }
+ nfs_advise_use_readdirplus(dir);
+@@ -1865,7 +1880,6 @@ nfs_add_or_obtain(struct dentry *dentry, struct nfs_fh *fhandle,
+ dput(parent);
+ return d;
+ out_error:
+- nfs_mark_for_revalidate(dir);
+ d = ERR_PTR(error);
+ goto out;
+ }
+--
+2.30.1
+
--- /dev/null
+From 9479d557c85808ee541905e148f24c0d02d57189 Mon Sep 17 00:00:00 2001
+From: Sasha Levin <sashal@kernel.org>
+Date: Fri, 15 Jan 2021 18:43:56 +0100
+Subject: NFSv4.2: fix return value of _nfs4_get_security_label()
+
+From: Ondrej Mosnacek <omosnace@redhat.com>
+
+[ Upstream commit 53cb245454df5b13d7063162afd7a785aed6ebf2 ]
+
+An xattr 'get' handler is expected to return the length of the value on
+success, yet _nfs4_get_security_label() (and consequently also
+nfs4_xattr_get_nfs4_label(), which is used as an xattr handler) returns
+just 0 on success.
+
+Fix this by returning label.len instead, which contains the length of
+the result.
+
+Fixes: aa9c2669626c ("NFS: Client implementation of Labeled-NFS")
+Signed-off-by: Ondrej Mosnacek <omosnace@redhat.com>
+Reviewed-by: James Morris <jamorris@linux.microsoft.com>
+Reviewed-by: Paul Moore <paul@paul-moore.com>
+Signed-off-by: Anna Schumaker <Anna.Schumaker@Netapp.com>
+Signed-off-by: Sasha Levin <sashal@kernel.org>
+---
+ fs/nfs/nfs4proc.c | 2 +-
+ 1 file changed, 1 insertion(+), 1 deletion(-)
+
+diff --git a/fs/nfs/nfs4proc.c b/fs/nfs/nfs4proc.c
+index a811d42ffbd1..ba2dfba4854b 100644
+--- a/fs/nfs/nfs4proc.c
++++ b/fs/nfs/nfs4proc.c
+@@ -5967,7 +5967,7 @@ static int _nfs4_get_security_label(struct inode *inode, void *buf,
+ return ret;
+ if (!(fattr.valid & NFS_ATTR_FATTR_V4_SECURITY_LABEL))
+ return -ENOENT;
+- return 0;
++ return label.len;
+ }
+
+ static int nfs4_get_security_label(struct inode *inode, void *buf,
+--
+2.30.1
+
--- /dev/null
+From 8cc78d346f80d8e37d4f3f1941e44f9f573622c5 Mon Sep 17 00:00:00 2001
+From: Sasha Levin <sashal@kernel.org>
+Date: Mon, 8 Mar 2021 16:51:26 -0800
+Subject: nvme-fc: fix racing controller reset and create association
+
+From: James Smart <jsmart2021@gmail.com>
+
+[ Upstream commit f20ef34d71abc1fc56b322aaa251f90f94320140 ]
+
+A recent patch to prevent calling __nvme_fc_abort_outstanding_ios in
+interrupt context introduced a possible race condition. A controller
+reset results in errored io completions, which schedule error
+work. The change of error work to a work element allows it to fire
+after the ctrl state transition to NVME_CTRL_CONNECTING, causing
+any outstanding io (used to initialize the controller) to fail and
+causing problems for connect_work.
+
+Add a state check to only schedule error work if not in the RESETTING
+state.
+
+Fixes: 19fce0470f05 ("nvme-fc: avoid calling _nvme_fc_abort_outstanding_ios from interrupt context")
+Signed-off-by: Nigel Kirkland <nkirkland2304@gmail.com>
+Signed-off-by: James Smart <jsmart2021@gmail.com>
+Signed-off-by: Christoph Hellwig <hch@lst.de>
+Signed-off-by: Sasha Levin <sashal@kernel.org>
+---
+ drivers/nvme/host/fc.c | 2 +-
+ 1 file changed, 1 insertion(+), 1 deletion(-)
+
+diff --git a/drivers/nvme/host/fc.c b/drivers/nvme/host/fc.c
+index 5ead217ac2bc..fab068c8ba02 100644
+--- a/drivers/nvme/host/fc.c
++++ b/drivers/nvme/host/fc.c
+@@ -2055,7 +2055,7 @@ nvme_fc_fcpio_done(struct nvmefc_fcp_req *req)
+ nvme_fc_complete_rq(rq);
+
+ check_error:
+- if (terminate_assoc)
++ if (terminate_assoc && ctrl->ctrl.state != NVME_CTRL_RESETTING)
+ queue_work(nvme_reset_wq, &ctrl->ioerr_work);
+ }
+
+--
+2.30.1
+
--- /dev/null
+From 3a6074af72cdb6e50be7d432bb793bf1edc2ca49 Mon Sep 17 00:00:00 2001
+From: Sasha Levin <sashal@kernel.org>
+Date: Mon, 30 Nov 2020 11:38:40 -0800
+Subject: perf/core: Flush PMU internal buffers for per-CPU events
+
+From: Kan Liang <kan.liang@linux.intel.com>
+
+[ Upstream commit a5398bffc01fe044848c5024e5e867e407f239b8 ]
+
+Sometimes the PMU internal buffers have to be flushed for per-CPU events
+during a context switch, e.g., large PEBS. Otherwise, the perf tool may
+report samples in locations that do not belong to the process in which
+the samples are processed, because PEBS does not tag samples with PID/TID.
+
+The current code only flushes the buffers for a per-task event. It
+doesn't check per-CPU events.
+
+Add a new event state flag, PERF_ATTACH_SCHED_CB, to indicate that the
+PMU internal buffers have to be flushed for this event during a context
+switch.
+
+Add sched_cb_entry and perf_sched_cb_usages back to track the PMU/cpuctx
+that needs to be flushed.
+
+This patch only needs to invoke sched_task() for per-CPU events; the
+per-task events are already handled in perf_event_context_sched_in/out.
+
+Fixes: 9c964efa4330 ("perf/x86/intel: Drain the PEBS buffer during context switches")
+Reported-by: Gabriel Marin <gmx@google.com>
+Originally-by: Namhyung Kim <namhyung@kernel.org>
+Signed-off-by: Kan Liang <kan.liang@linux.intel.com>
+Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org>
+Signed-off-by: Ingo Molnar <mingo@kernel.org>
+Link: https://lkml.kernel.org/r/20201130193842.10569-1-kan.liang@linux.intel.com
+Signed-off-by: Sasha Levin <sashal@kernel.org>
+---
+ include/linux/perf_event.h | 2 ++
+ kernel/events/core.c | 42 ++++++++++++++++++++++++++++++++++----
+ 2 files changed, 40 insertions(+), 4 deletions(-)
+
+diff --git a/include/linux/perf_event.h b/include/linux/perf_event.h
+index 96450f6fb1de..22ce0604b448 100644
+--- a/include/linux/perf_event.h
++++ b/include/linux/perf_event.h
+@@ -606,6 +606,7 @@ struct swevent_hlist {
+ #define PERF_ATTACH_TASK 0x04
+ #define PERF_ATTACH_TASK_DATA 0x08
+ #define PERF_ATTACH_ITRACE 0x10
++#define PERF_ATTACH_SCHED_CB 0x20
+
+ struct perf_cgroup;
+ struct perf_buffer;
+@@ -872,6 +873,7 @@ struct perf_cpu_context {
+ struct list_head cgrp_cpuctx_entry;
+ #endif
+
++ struct list_head sched_cb_entry;
+ int sched_cb_usage;
+
+ int online;
+diff --git a/kernel/events/core.c b/kernel/events/core.c
+index c3ba29d058b7..4af161b3f322 100644
+--- a/kernel/events/core.c
++++ b/kernel/events/core.c
+@@ -383,6 +383,7 @@ static DEFINE_MUTEX(perf_sched_mutex);
+ static atomic_t perf_sched_count;
+
+ static DEFINE_PER_CPU(atomic_t, perf_cgroup_events);
++static DEFINE_PER_CPU(int, perf_sched_cb_usages);
+ static DEFINE_PER_CPU(struct pmu_event_list, pmu_sb_events);
+
+ static atomic_t nr_mmap_events __read_mostly;
+@@ -3466,11 +3467,16 @@ static void perf_event_context_sched_out(struct task_struct *task, int ctxn,
+ }
+ }
+
++static DEFINE_PER_CPU(struct list_head, sched_cb_list);
++
+ void perf_sched_cb_dec(struct pmu *pmu)
+ {
+ struct perf_cpu_context *cpuctx = this_cpu_ptr(pmu->pmu_cpu_context);
+
+- --cpuctx->sched_cb_usage;
++ this_cpu_dec(perf_sched_cb_usages);
++
++ if (!--cpuctx->sched_cb_usage)
++ list_del(&cpuctx->sched_cb_entry);
+ }
+
+
+@@ -3478,7 +3484,10 @@ void perf_sched_cb_inc(struct pmu *pmu)
+ {
+ struct perf_cpu_context *cpuctx = this_cpu_ptr(pmu->pmu_cpu_context);
+
+- cpuctx->sched_cb_usage++;
++ if (!cpuctx->sched_cb_usage++)
++ list_add(&cpuctx->sched_cb_entry, this_cpu_ptr(&sched_cb_list));
++
++ this_cpu_inc(perf_sched_cb_usages);
+ }
+
+ /*
+@@ -3507,6 +3516,24 @@ static void __perf_pmu_sched_task(struct perf_cpu_context *cpuctx, bool sched_in
+ perf_ctx_unlock(cpuctx, cpuctx->task_ctx);
+ }
+
++static void perf_pmu_sched_task(struct task_struct *prev,
++ struct task_struct *next,
++ bool sched_in)
++{
++ struct perf_cpu_context *cpuctx;
++
++ if (prev == next)
++ return;
++
++ list_for_each_entry(cpuctx, this_cpu_ptr(&sched_cb_list), sched_cb_entry) {
++ /* will be handled in perf_event_context_sched_in/out */
++ if (cpuctx->task_ctx)
++ continue;
++
++ __perf_pmu_sched_task(cpuctx, sched_in);
++ }
++}
++
+ static void perf_event_switch(struct task_struct *task,
+ struct task_struct *next_prev, bool sched_in);
+
+@@ -3529,6 +3556,9 @@ void __perf_event_task_sched_out(struct task_struct *task,
+ {
+ int ctxn;
+
++ if (__this_cpu_read(perf_sched_cb_usages))
++ perf_pmu_sched_task(task, next, false);
++
+ if (atomic_read(&nr_switch_events))
+ perf_event_switch(task, next, false);
+
+@@ -3837,6 +3867,9 @@ void __perf_event_task_sched_in(struct task_struct *prev,
+
+ if (atomic_read(&nr_switch_events))
+ perf_event_switch(task, prev, true);
++
++ if (__this_cpu_read(perf_sched_cb_usages))
++ perf_pmu_sched_task(prev, task, true);
+ }
+
+ static u64 perf_calculate_period(struct perf_event *event, u64 nsec, u64 count)
+@@ -4661,7 +4694,7 @@ static void unaccount_event(struct perf_event *event)
+ if (event->parent)
+ return;
+
+- if (event->attach_state & PERF_ATTACH_TASK)
++ if (event->attach_state & (PERF_ATTACH_TASK | PERF_ATTACH_SCHED_CB))
+ dec = true;
+ if (event->attr.mmap || event->attr.mmap_data)
+ atomic_dec(&nr_mmap_events);
+@@ -11056,7 +11089,7 @@ static void account_event(struct perf_event *event)
+ if (event->parent)
+ return;
+
+- if (event->attach_state & PERF_ATTACH_TASK)
++ if (event->attach_state & (PERF_ATTACH_TASK | PERF_ATTACH_SCHED_CB))
+ inc = true;
+ if (event->attr.mmap || event->attr.mmap_data)
+ atomic_inc(&nr_mmap_events);
+@@ -12848,6 +12881,7 @@ static void __init perf_event_init_all_cpus(void)
+ #ifdef CONFIG_CGROUP_PERF
+ INIT_LIST_HEAD(&per_cpu(cgrp_cpuctx_list, cpu));
+ #endif
++ INIT_LIST_HEAD(&per_cpu(sched_cb_list, cpu));
+ }
+ }
+
+--
+2.30.1
+
--- /dev/null
+From 10e4206f0bc3487e1813723f8b76e0ffe92517b3 Mon Sep 17 00:00:00 2001
+From: Sasha Levin <sashal@kernel.org>
+Date: Mon, 30 Nov 2020 11:38:41 -0800
+Subject: perf/x86/intel: Set PERF_ATTACH_SCHED_CB for large PEBS and LBR
+
+From: Kan Liang <kan.liang@linux.intel.com>
+
+[ Upstream commit afbef30149587ad46f4780b1e0cc5e219745ce90 ]
+
+Supplying a PID/TID for large PEBS requires flushing the PEBS buffer
+on a context switch.
+
+For normal LBRs, a context switch can flip the address space, and LBR
+entries are not tagged with an identifier, so we need to wipe the LBR
+even for per-cpu events.
+
+For LBR callstack, saving/restoring the stack is required during a
+context switch.
+
+Set PERF_ATTACH_SCHED_CB for the event with large PEBS & LBR.
+
+Fixes: 9c964efa4330 ("perf/x86/intel: Drain the PEBS buffer during context switches")
+Signed-off-by: Kan Liang <kan.liang@linux.intel.com>
+Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org>
+Signed-off-by: Ingo Molnar <mingo@kernel.org>
+Link: https://lkml.kernel.org/r/20201130193842.10569-2-kan.liang@linux.intel.com
+Signed-off-by: Sasha Levin <sashal@kernel.org>
+---
+ arch/x86/events/intel/core.c | 5 ++++-
+ 1 file changed, 4 insertions(+), 1 deletion(-)
+
+diff --git a/arch/x86/events/intel/core.c b/arch/x86/events/intel/core.c
+index aaa7bffdb20f..4b05c876f9f6 100644
+--- a/arch/x86/events/intel/core.c
++++ b/arch/x86/events/intel/core.c
+@@ -3565,8 +3565,10 @@ static int intel_pmu_hw_config(struct perf_event *event)
+ if (!(event->attr.freq || (event->attr.wakeup_events && !event->attr.watermark))) {
+ event->hw.flags |= PERF_X86_EVENT_AUTO_RELOAD;
+ if (!(event->attr.sample_type &
+- ~intel_pmu_large_pebs_flags(event)))
++ ~intel_pmu_large_pebs_flags(event))) {
+ event->hw.flags |= PERF_X86_EVENT_LARGE_PEBS;
++ event->attach_state |= PERF_ATTACH_SCHED_CB;
++ }
+ }
+ if (x86_pmu.pebs_aliases)
+ x86_pmu.pebs_aliases(event);
+@@ -3579,6 +3581,7 @@ static int intel_pmu_hw_config(struct perf_event *event)
+ ret = intel_pmu_setup_lbr_filter(event);
+ if (ret)
+ return ret;
++ event->attach_state |= PERF_ATTACH_SCHED_CB;
+
+ /*
+ * BTS is set up earlier in this path, so don't account twice
+--
+2.30.1
+
--- /dev/null
+From 74a2645f6895b8518ef5d0e56253530779f3aeeb Mon Sep 17 00:00:00 2001
+From: Sasha Levin <sashal@kernel.org>
+Date: Thu, 25 Feb 2021 14:09:59 +1100
+Subject: powerpc/64s/exception: Clean up a missed SRR specifier
+
+From: Daniel Axtens <dja@axtens.net>
+
+[ Upstream commit c080a173301ffc62cb6c76308c803c7fee05517a ]
+
+Nick's patch cleaning up the SRR specifiers in exception-64s.S missed
+a single instance of EXC_HV_OR_STD. Clean that up.
+
+Caught by clang's integrated assembler.
+
+Fixes: 3f7fbd97d07d ("powerpc/64s/exception: Clean up SRR specifiers")
+Signed-off-by: Daniel Axtens <dja@axtens.net>
+Acked-by: Nicholas Piggin <npiggin@gmail.com>
+Signed-off-by: Michael Ellerman <mpe@ellerman.id.au>
+Link: https://lore.kernel.org/r/20210225031006.1204774-2-dja@axtens.net
+Signed-off-by: Sasha Levin <sashal@kernel.org>
+---
+ arch/powerpc/kernel/exceptions-64s.S | 2 +-
+ 1 file changed, 1 insertion(+), 1 deletion(-)
+
+diff --git a/arch/powerpc/kernel/exceptions-64s.S b/arch/powerpc/kernel/exceptions-64s.S
+index 3cde2fbd74fc..9d3b468bd2d7 100644
+--- a/arch/powerpc/kernel/exceptions-64s.S
++++ b/arch/powerpc/kernel/exceptions-64s.S
+@@ -470,7 +470,7 @@ DEFINE_FIXED_SYMBOL(\name\()_common_real)
+
+ ld r10,PACAKMSR(r13) /* get MSR value for kernel */
+ /* MSR[RI] is clear iff using SRR regs */
+- .if IHSRR == EXC_HV_OR_STD
++ .if IHSRR_IF_HVMODE
+ BEGIN_FTR_SECTION
+ xori r10,r10,MSR_RI
+ END_FTR_SECTION_IFCLR(CPU_FTR_HVMODE)
+--
+2.30.1
+
--- /dev/null
+From c82294f7036c82b48b3fe96460ae73a437474eb2 Mon Sep 17 00:00:00 2001
+From: Sasha Levin <sashal@kernel.org>
+Date: Sun, 14 Mar 2021 23:51:14 +0300
+Subject: prctl: fix PR_SET_MM_AUXV kernel stack leak
+
+From: Alexey Dobriyan <adobriyan@gmail.com>
+
+[ Upstream commit c995f12ad8842dbf5cfed113fb52cdd083f5afd1 ]
+
+Doing a
+
+ prctl(PR_SET_MM, PR_SET_MM_AUXV, addr, 1);
+
+will copy 1 byte from userspace to a (quite big) on-stack array
+and then stash everything into mm->saved_auxv.
+An AT_NULL terminator will be inserted at the very end.
+
+The /proc/*/auxv handler will find that AT_NULL terminator
+and copy the original stack contents to userspace.
+
+This devious scheme requires CAP_SYS_RESOURCE.
+
+Signed-off-by: Alexey Dobriyan <adobriyan@gmail.com>
+Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
+Signed-off-by: Sasha Levin <sashal@kernel.org>
+---
+ kernel/sys.c | 2 +-
+ 1 file changed, 1 insertion(+), 1 deletion(-)
+
+diff --git a/kernel/sys.c b/kernel/sys.c
+index a730c03ee607..2603280b97be 100644
+--- a/kernel/sys.c
++++ b/kernel/sys.c
+@@ -2079,7 +2079,7 @@ static int prctl_set_auxv(struct mm_struct *mm, unsigned long addr,
+ * up to the caller to provide sane values here, otherwise userspace
+ * tools which use this vector might be unhappy.
+ */
+- unsigned long user_auxv[AT_VECTOR_SIZE];
++ unsigned long user_auxv[AT_VECTOR_SIZE] = {};
+
+ if (len > sizeof(user_auxv))
+ return -EINVAL;
+--
+2.30.1
+
--- /dev/null
+From 9b34e83dc772bf459ac22ba62e4d80b5d8b9db78 Mon Sep 17 00:00:00 2001
+From: Sasha Levin <sashal@kernel.org>
+Date: Tue, 9 Mar 2021 15:21:18 +0100
+Subject: seqlock,lockdep: Fix seqcount_latch_init()
+
+From: Peter Zijlstra <peterz@infradead.org>
+
+[ Upstream commit 4817a52b306136c8b2b2271d8770401441e4cf79 ]
+
+seqcount_init() must be a macro in order to preserve the static
+variable that is used for the lockdep key. Don't then wrap it in an
+inline function, which destroys that.
+
+Luckily there aren't many users of this function, but fix it before it
+becomes a problem.
+
+Fixes: 80793c3471d9 ("seqlock: Introduce seqcount_latch_t")
+Reported-by: Eric Dumazet <eric.dumazet@gmail.com>
+Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org>
+Link: https://lkml.kernel.org/r/YEeFEbNUVkZaXDp4@hirez.programming.kicks-ass.net
+Signed-off-by: Sasha Levin <sashal@kernel.org>
+---
+ include/linux/seqlock.h | 5 +----
+ 1 file changed, 1 insertion(+), 4 deletions(-)
+
+diff --git a/include/linux/seqlock.h b/include/linux/seqlock.h
+index cbfc78b92b65..1ac20d75b061 100644
+--- a/include/linux/seqlock.h
++++ b/include/linux/seqlock.h
+@@ -659,10 +659,7 @@ typedef struct {
+ * seqcount_latch_init() - runtime initializer for seqcount_latch_t
+ * @s: Pointer to the seqcount_latch_t instance
+ */
+-static inline void seqcount_latch_init(seqcount_latch_t *s)
+-{
+- seqcount_init(&s->seqcount);
+-}
++#define seqcount_latch_init(s) seqcount_init(&(s)->seqcount)
+
+ /**
+ * raw_read_seqcount_latch() - pick even/odd latch data copy
+--
+2.30.1
+
staging-comedi-me4000-fix-endian-problem-for-ai-command-data.patch
staging-comedi-pcl711-fix-endian-problem-for-ai-command-data.patch
staging-comedi-pcl818-fix-endian-problem-for-ai-command-data.patch
+sh_eth-fix-trscer-mask-for-r7s72100.patch
+cpufreq-qcom-hw-fix-dereferencing-freed-memory-data.patch
+cpufreq-qcom-hw-fix-return-value-check-in-qcom_cpufr.patch
+arm64-mm-fix-pfn_valid-for-zone_device-based-memory.patch
+net-bonding-fix-error-return-code-of-bond_neigh_init.patch
+sunrpc-set-memalloc_nofs_save-for-sync-tasks.patch
+nfs-don-t-revalidate-the-directory-permissions-on-a-.patch
+nfs-don-t-gratuitously-clear-the-inode-cache-when-lo.patch
+nfsv4.2-fix-return-value-of-_nfs4_get_security_label.patch
+block-rsxx-fix-error-return-code-of-rsxx_pci_probe.patch
+nvme-fc-fix-racing-controller-reset-and-create-assoc.patch
+configfs-fix-a-use-after-free-in-__configfs_open_fil.patch
+arm64-mm-use-a-48-bit-id-map-when-possible-on-52-bit.patch
+perf-core-flush-pmu-internal-buffers-for-per-cpu-eve.patch
+perf-x86-intel-set-perf_attach_sched_cb-for-large-pe.patch
+hrtimer-update-softirq_expires_next-correctly-after-.patch
+powerpc-64s-exception-clean-up-a-missed-srr-specifie.patch
+seqlock-lockdep-fix-seqcount_latch_init.patch
+stop_machine-mark-helpers-__always_inline.patch
+include-linux-sched-mm.h-use-rcu_dereference-in-in_v.patch
+prctl-fix-pr_set_mm_auxv-kernel-stack-leak.patch
--- /dev/null
+From 121cca034871d66b956c830f9f59819ca725a98f Mon Sep 17 00:00:00 2001
+From: Sasha Levin <sashal@kernel.org>
+Date: Sun, 28 Feb 2021 23:26:34 +0300
+Subject: sh_eth: fix TRSCER mask for R7S72100
+
+From: Sergey Shtylyov <s.shtylyov@omprussia.ru>
+
+[ Upstream commit 75be7fb7f978202c4c3a1a713af4485afb2ff5f6 ]
+
+According to the RZ/A1H Group, RZ/A1M Group User's Manual: Hardware,
+Rev. 4.00, the TRSCER register has bit 9 reserved, hence we can't use
+the driver's default TRSCER mask. Add the explicit initializer for
+sh_eth_cpu_data::trscer_err_mask for R7S72100.
+
+Fixes: db893473d313 ("sh_eth: Add support for r7s72100")
+Signed-off-by: Sergey Shtylyov <s.shtylyov@omprussia.ru>
+Signed-off-by: David S. Miller <davem@davemloft.net>
+Signed-off-by: Sasha Levin <sashal@kernel.org>
+---
+ drivers/net/ethernet/renesas/sh_eth.c | 2 ++
+ 1 file changed, 2 insertions(+)
+
+diff --git a/drivers/net/ethernet/renesas/sh_eth.c b/drivers/net/ethernet/renesas/sh_eth.c
+index 50fb372d417c..6d84266c03ca 100644
+--- a/drivers/net/ethernet/renesas/sh_eth.c
++++ b/drivers/net/ethernet/renesas/sh_eth.c
+@@ -560,6 +560,8 @@ static struct sh_eth_cpu_data r7s72100_data = {
+ EESR_TDE,
+ .fdr_value = 0x0000070f,
+
++ .trscer_err_mask = DESC_I_RINT8 | DESC_I_RINT5,
++
+ .no_psr = 1,
+ .apr = 1,
+ .mpr = 1,
+--
+2.30.1
+
--- /dev/null
+From 2c3ef91ea4c088662b41c7276aa841e3457635e9 Mon Sep 17 00:00:00 2001
+From: Sasha Levin <sashal@kernel.org>
+Date: Fri, 12 Mar 2021 21:07:04 -0800
+Subject: stop_machine: mark helpers __always_inline
+
+From: Arnd Bergmann <arnd@arndb.de>
+
+[ Upstream commit cbf78d85079cee662c45749ef4f744d41be85d48 ]
+
+With clang-13, some functions only get partially inlined, with a
+specialized version referring to a global variable. This triggers a
+harmless build-time check for the intel-rng driver:
+
+WARNING: modpost: drivers/char/hw_random/intel-rng.o(.text+0xe): Section mismatch in reference from the function stop_machine() to the function .init.text:intel_rng_hw_init()
+The function stop_machine() references
+the function __init intel_rng_hw_init().
+This is often because stop_machine lacks a __init
+annotation or the annotation of intel_rng_hw_init is wrong.
+
+In this instance, an easy workaround is to force the stop_machine()
+function to be inline, along with related interfaces that do not show the
+same behavior at the moment but theoretically could.
+
+The combination of the two patches listed below triggers the behavior in
+clang-13, but individually these commits are correct.
+
+Link: https://lkml.kernel.org/r/20210225130153.1956990-1-arnd@kernel.org
+Fixes: fe5595c07400 ("stop_machine: Provide stop_machine_cpuslocked()")
+Fixes: ee527cd3a20c ("Use stop_machine_run in the Intel RNG driver")
+Signed-off-by: Arnd Bergmann <arnd@arndb.de>
+Cc: Nathan Chancellor <nathan@kernel.org>
+Cc: Nick Desaulniers <ndesaulniers@google.com>
+Cc: Thomas Gleixner <tglx@linutronix.de>
+Cc: Sebastian Andrzej Siewior <bigeasy@linutronix.de>
+Cc: "Paul E. McKenney" <paulmck@kernel.org>
+Cc: Ingo Molnar <mingo@kernel.org>
+Cc: Prarit Bhargava <prarit@redhat.com>
+Cc: Daniel Bristot de Oliveira <bristot@redhat.com>
+Cc: Peter Zijlstra <peterz@infradead.org>
+Cc: Valentin Schneider <valentin.schneider@arm.com>
+Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
+Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
+Signed-off-by: Sasha Levin <sashal@kernel.org>
+---
+ include/linux/stop_machine.h | 11 ++++++-----
+ 1 file changed, 6 insertions(+), 5 deletions(-)
+
+diff --git a/include/linux/stop_machine.h b/include/linux/stop_machine.h
+index 76d8b09384a7..63ea9aff368f 100644
+--- a/include/linux/stop_machine.h
++++ b/include/linux/stop_machine.h
+@@ -123,7 +123,7 @@ int stop_machine_from_inactive_cpu(cpu_stop_fn_t fn, void *data,
+ const struct cpumask *cpus);
+ #else /* CONFIG_SMP || CONFIG_HOTPLUG_CPU */
+
+-static inline int stop_machine_cpuslocked(cpu_stop_fn_t fn, void *data,
++static __always_inline int stop_machine_cpuslocked(cpu_stop_fn_t fn, void *data,
+ const struct cpumask *cpus)
+ {
+ unsigned long flags;
+@@ -134,14 +134,15 @@ static inline int stop_machine_cpuslocked(cpu_stop_fn_t fn, void *data,
+ return ret;
+ }
+
+-static inline int stop_machine(cpu_stop_fn_t fn, void *data,
+- const struct cpumask *cpus)
++static __always_inline int
++stop_machine(cpu_stop_fn_t fn, void *data, const struct cpumask *cpus)
+ {
+ return stop_machine_cpuslocked(fn, data, cpus);
+ }
+
+-static inline int stop_machine_from_inactive_cpu(cpu_stop_fn_t fn, void *data,
+- const struct cpumask *cpus)
++static __always_inline int
++stop_machine_from_inactive_cpu(cpu_stop_fn_t fn, void *data,
++ const struct cpumask *cpus)
+ {
+ return stop_machine(fn, data, cpus);
+ }
+--
+2.30.1
+
--- /dev/null
+From 5eab4260a84ecafd129394e0592d04a2bb55b149 Mon Sep 17 00:00:00 2001
+From: Sasha Levin <sashal@kernel.org>
+Date: Wed, 3 Mar 2021 08:47:16 -0500
+Subject: SUNRPC: Set memalloc_nofs_save() for sync tasks
+
+From: Benjamin Coddington <bcodding@redhat.com>
+
+[ Upstream commit f0940f4b3284a00f38a5d42e6067c2aaa20e1f2e ]
+
+We could recurse into NFS doing memory reclaim while sending a sync task,
+which might result in a deadlock. Set memalloc_nofs_save for sync task
+execution.
+
+Fixes: a1231fda7e94 ("SUNRPC: Set memalloc_nofs_save() on all rpciod/xprtiod jobs")
+Signed-off-by: Benjamin Coddington <bcodding@redhat.com>
+Signed-off-by: Anna Schumaker <Anna.Schumaker@Netapp.com>
+Signed-off-by: Sasha Levin <sashal@kernel.org>
+---
+ net/sunrpc/sched.c | 5 ++++-
+ 1 file changed, 4 insertions(+), 1 deletion(-)
+
+diff --git a/net/sunrpc/sched.c b/net/sunrpc/sched.c
+index cf702a5f7fe5..39ed0e0afe6d 100644
+--- a/net/sunrpc/sched.c
++++ b/net/sunrpc/sched.c
+@@ -963,8 +963,11 @@ void rpc_execute(struct rpc_task *task)
+
+ rpc_set_active(task);
+ rpc_make_runnable(rpciod_workqueue, task);
+- if (!is_async)
++ if (!is_async) {
++ unsigned int pflags = memalloc_nofs_save();
+ __rpc_execute(task);
++ memalloc_nofs_restore(pflags);
++ }
+ }
+
+ static void rpc_async_schedule(struct work_struct *work)
+--
+2.30.1
+