From: Greg Kroah-Hartman
Date: Tue, 11 Feb 2014 17:36:13 +0000 (-0800)
Subject: 3.4-stable patches
X-Git-Tag: v3.4.80~22
X-Git-Url: http://git.ipfire.org/?a=commitdiff_plain;h=c612bbb4186bb1aae5b213ec7e833e0f9715b270;p=thirdparty%2Fkernel%2Fstable-queue.git

3.4-stable patches

added patches:
	sched-rt-avoid-updating-rt-entry-timeout-twice-within-one.patch
	sched-unthrottle-rt-runqueues-in-__disable_runtime.patch
---

diff --git a/queue-3.4/sched-rt-avoid-updating-rt-entry-timeout-twice-within-one.patch b/queue-3.4/sched-rt-avoid-updating-rt-entry-timeout-twice-within-one.patch
new file mode 100644
index 00000000000..df6afa490e0
--- /dev/null
+++ b/queue-3.4/sched-rt-avoid-updating-rt-entry-timeout-twice-within-one.patch
@@ -0,0 +1,107 @@
+From 57d2aa00dcec67afa52478730f2b524521af14fb Mon Sep 17 00:00:00 2001
+From: Ying Xue
+Date: Tue, 17 Jul 2012 15:03:43 +0800
+Subject: sched/rt: Avoid updating RT entry timeout twice within one
+ tick period
+
+From: Ying Xue
+
+commit 57d2aa00dcec67afa52478730f2b524521af14fb upstream.
+
+The issue below was found in 2.6.34-rt rather than the mainline rt
+kernel, but the issue still exists upstream as well.
+
+So please let me describe how it was noticed on 2.6.34-rt:
+
+On this version, each softirq has its own thread, which means there
+is at least one RT FIFO task per cpu. The priority of these tasks is
+set to 49 by default. If a user launches an RT FIFO task with
+priority lower than the softirq RT tasks' 49, it's possible for two
+RT FIFO tasks to be enqueued on one cpu's runqueue at the same
+moment. By the current strategy of balancing RT tasks, when it comes
+to RT tasks, we really need to put them off to a CPU that they can
+run on as soon as possible. Even if it means a bit of cache line
+flushing, we want RT tasks to be run with the least latency.
+
+When the user RT FIFO task launched earlier is running, the sched
+timer tick of the current cpu fires. In this tick period, the
+timeout value of the user RT task is updated once. Subsequently, we
+try to wake up one softirq RT task on its local cpu. As the priority
+of the current user RT task is lower than that of the softirq RT
+task, the current task is preempted by the higher priority softirq
+RT task. Before preemption, we check whether current can readily
+move to a different cpu. If so, we reschedule to allow the RT push
+logic to try to move current somewhere else. Whenever the woken
+softirq RT task runs, it first tries to migrate the user FIFO RT
+task over to a cpu that is running a task of lesser priority. If the
+migration is done, it sends a reschedule request to the found cpu
+via an IPI interrupt. Once the target cpu responds to the IPI
+interrupt, it picks the migrated user RT task to preempt its current
+task. When the user RT task is running on the new cpu, the sched
+timer tick of that cpu fires, so it ticks the user RT task again.
+This also means the RT task timeout value is updated again. As the
+migration may complete within one tick period, the user RT task
+timeout value may be updated twice within one tick.
+
+If we set a limit on the amount of cpu time for the user RT task
+by setrlimit(RLIMIT_RTTIME), the SIGXCPU signal should be posted
+upon reaching the soft limit.
+
+But exactly when the SIGXCPU signal should be sent depends on the
+RT task timeout value. In fact the timeout mechanism of sending
+the SIGXCPU signal assumes the RT task timeout is increased once
+every tick.
+
+However, currently the timeout value may be added twice per tick,
+which results in the SIGXCPU signal being sent earlier than
+expected.
+
+To solve this issue, we prevent the timeout value from increasing
+twice within one tick by remembering the jiffies value of the last
+timeout update. As long as the RT task's stamp differs from the
+global jiffies value, its timeout is allowed to be updated.
+
+Signed-off-by: Ying Xue
+Signed-off-by: Fan Du
+Reviewed-by: Yong Zhang
+Acked-by: Steven Rostedt
+Cc:
+Link: http://lkml.kernel.org/r/1342508623-2887-1-git-send-email-ying.xue@windriver.com
+Signed-off-by: Ingo Molnar
+[ lizf: backported to 3.4: adjust context ]
+Signed-off-by: Li Zefan
+Signed-off-by: Greg Kroah-Hartman
+
+---
+ include/linux/sched.h |    1 +
+ kernel/sched/rt.c     |    6 +++++-
+ 2 files changed, 6 insertions(+), 1 deletion(-)
+
+--- a/include/linux/sched.h
++++ b/include/linux/sched.h
+@@ -1237,6 +1237,7 @@ struct sched_entity {
+ struct sched_rt_entity {
+ 	struct list_head run_list;
+ 	unsigned long timeout;
++	unsigned long watchdog_stamp;
+ 	unsigned int time_slice;
+ 	int nr_cpus_allowed;
+ 
+--- a/kernel/sched/rt.c
++++ b/kernel/sched/rt.c
+@@ -2002,7 +2002,11 @@ static void watchdog(struct rq *rq, stru
+ 	if (soft != RLIM_INFINITY) {
+ 		unsigned long next;
+ 
+-		p->rt.timeout++;
++		if (p->rt.watchdog_stamp != jiffies) {
++			p->rt.timeout++;
++			p->rt.watchdog_stamp = jiffies;
++		}
++
+ 		next = DIV_ROUND_UP(min(soft, hard), USEC_PER_SEC/HZ);
+ 		if (p->rt.timeout > next)
+ 			p->cputime_expires.sched_exp = p->se.sum_exec_runtime;
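For context, the RLIMIT_RTTIME watchdog path guarded above can be
exercised from userspace. The following is a minimal sketch, not part
of the queued patch, assuming a Linux system with glibc; the 50ms soft
limit and priority 10 are arbitrary illustration values, and SCHED_FIFO
requires CAP_SYS_NICE (e.g. run as root). It spins until the per-tick
accounting of p->rt.timeout crosses the soft limit and SIGXCPU arrives;
on a kernel without the fix, a task migrated mid-tick could be charged
twice per tick and receive the signal early.

#define _GNU_SOURCE		/* RLIMIT_RTTIME is glibc/Linux-specific */
#include <sched.h>
#include <signal.h>
#include <stdio.h>
#include <string.h>
#include <sys/resource.h>

static volatile sig_atomic_t got_xcpu;

static void xcpu_handler(int sig)
{
	(void)sig;
	got_xcpu = 1;
}

int main(void)
{
	struct sigaction sa;
	struct rlimit rl;
	struct sched_param sp;

	memset(&sa, 0, sizeof(sa));
	sa.sa_handler = xcpu_handler;
	sigaction(SIGXCPU, &sa, NULL);

	/* Arbitrary soft limit: 50ms of RT CPU time (values in us). */
	rl.rlim_cur = 50000;
	rl.rlim_max = 100000;
	if (setrlimit(RLIMIT_RTTIME, &rl)) {
		perror("setrlimit");
		return 1;
	}

	/* Arbitrary RT priority; fails without CAP_SYS_NICE. */
	sp.sched_priority = 10;
	if (sched_setscheduler(0, SCHED_FIFO, &sp)) {
		perror("sched_setscheduler");
		return 1;
	}

	/* Busy loop: the kernel watchdog bumps p->rt.timeout once per
	 * tick and posts SIGXCPU when the soft limit is reached. */
	while (!got_xcpu)
		;

	printf("got SIGXCPU after hitting the RLIMIT_RTTIME soft limit\n");
	return 0;
}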
diff --git a/queue-3.4/sched-unthrottle-rt-runqueues-in-__disable_runtime.patch b/queue-3.4/sched-unthrottle-rt-runqueues-in-__disable_runtime.patch
new file mode 100644
index 00000000000..d00b46e3511
--- /dev/null
+++ b/queue-3.4/sched-unthrottle-rt-runqueues-in-__disable_runtime.patch
@@ -0,0 +1,97 @@
+From a4c96ae319b8047f62dedbe1eac79e321c185749 Mon Sep 17 00:00:00 2001
+From: Peter Boonstoppel
+Date: Thu, 9 Aug 2012 15:34:47 -0700
+Subject: sched: Unthrottle rt runqueues in __disable_runtime()
+
+From: Peter Boonstoppel
+
+commit a4c96ae319b8047f62dedbe1eac79e321c185749 upstream.
+
+migrate_tasks() uses _pick_next_task_rt() to get tasks from the
+real-time runqueues to be migrated. When an rt_rq is throttled,
+_pick_next_task_rt() won't return anything, in which case
+migrate_tasks() can't move all threads over and gets stuck in an
+infinite loop.
+
+Instead, unthrottle the rt runqueues before migrating tasks.
+
+Additionally: move unthrottle_offline_cfs_rqs() to rq_offline_fair()
+
+Signed-off-by: Peter Boonstoppel
+Signed-off-by: Peter Zijlstra
+Cc: Paul Turner
+Link: http://lkml.kernel.org/r/5FBF8E85CA34454794F0F7ECBA79798F379D3648B7@HQMAIL04.nvidia.com
+Signed-off-by: Ingo Molnar
+[ lizf: backported to 3.4: adjust context ]
+Signed-off-by: Li Zefan
+Signed-off-by: Greg Kroah-Hartman
+
+---
+ kernel/sched/core.c  |    3 ---
+ kernel/sched/fair.c  |    7 +++++--
+ kernel/sched/rt.c    |    1 +
+ kernel/sched/sched.h |    1 -
+ 4 files changed, 6 insertions(+), 6 deletions(-)
+
+--- a/kernel/sched/core.c
++++ b/kernel/sched/core.c
+@@ -5301,9 +5301,6 @@ static void migrate_tasks(unsigned int d
+ 	 */
+ 	rq->stop = NULL;
+ 
+-	/* Ensure any throttled groups are reachable by pick_next_task */
+-	unthrottle_offline_cfs_rqs(rq);
+-
+ 	for ( ; ; ) {
+ 		/*
+ 		 * There's this thread running, bail when that's the only
+--- a/kernel/sched/fair.c
++++ b/kernel/sched/fair.c
+@@ -2071,7 +2071,7 @@ static void destroy_cfs_bandwidth(struct
+ 	hrtimer_cancel(&cfs_b->slack_timer);
+ }
+ 
+-void unthrottle_offline_cfs_rqs(struct rq *rq)
++static void unthrottle_offline_cfs_rqs(struct rq *rq)
+ {
+ 	struct cfs_rq *cfs_rq;
+ 
+@@ -2125,7 +2125,7 @@ static inline struct cfs_bandwidth *tg_c
+ 	return NULL;
+ }
+ static inline void destroy_cfs_bandwidth(struct cfs_bandwidth *cfs_b) {}
+-void unthrottle_offline_cfs_rqs(struct rq *rq) {}
++static inline void unthrottle_offline_cfs_rqs(struct rq *rq) {}
+ 
+ #endif /* CONFIG_CFS_BANDWIDTH */
+ 
+@@ -5171,6 +5171,9 @@ static void rq_online_fair(struct rq *rq
+ static void rq_offline_fair(struct rq *rq)
+ {
+ 	update_sysctl();
++
++	/* Ensure any throttled groups are reachable by pick_next_task */
++	unthrottle_offline_cfs_rqs(rq);
+ }
+ 
+ #endif /* CONFIG_SMP */
+--- a/kernel/sched/rt.c
++++ b/kernel/sched/rt.c
+@@ -685,6 +685,7 @@ balanced:
+ 		 * runtime - in which case borrowing doesn't make sense.
+ 		 */
+ 		rt_rq->rt_runtime = RUNTIME_INF;
++		rt_rq->rt_throttled = 0;
+ 		raw_spin_unlock(&rt_rq->rt_runtime_lock);
+ 		raw_spin_unlock(&rt_b->rt_runtime_lock);
+ 	}
+--- a/kernel/sched/sched.h
++++ b/kernel/sched/sched.h
+@@ -1138,7 +1138,6 @@ extern void print_rt_stats(struct seq_fi
+ 
+ extern void init_cfs_rq(struct cfs_rq *cfs_rq);
+ extern void init_rt_rq(struct rt_rq *rt_rq, struct rq *rq);
+-extern void unthrottle_offline_cfs_rqs(struct rq *rq);
+ 
+ extern void cfs_bandwidth_usage_inc(void);
+ extern void cfs_bandwidth_usage_dec(void);
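To make the failure mode above concrete, here is a simplified
standalone model, not the kernel code; every *_model name is invented
for illustration. It shows why a drain loop that depends on picking
tasks cannot terminate while the rt runqueue is throttled, and why
clearing rt_throttled in __disable_runtime() (the rt.c hunk above)
lets migrate_tasks() finish.

#include <stdio.h>

struct rq_model {
	int nr_running;		/* runnable tasks on the dying cpu */
	int rt_throttled;	/* rt_rq has exhausted its runtime */
};

/* Mirrors the effect of _pick_next_task_rt() on a throttled rt_rq:
 * the tasks still count as runnable, but none of them is pickable. */
static int pick_next_task_model(struct rq_model *rq)
{
	if (rq->rt_throttled)
		return 0;
	return rq->nr_running > 0;
}

static void migrate_tasks_model(struct rq_model *rq)
{
	while (rq->nr_running) {
		if (!pick_next_task_model(rq))
			continue;	/* pre-fix: spins here forever */
		rq->nr_running--;	/* task moved to another cpu */
	}
}

int main(void)
{
	struct rq_model rq = { .nr_running = 2, .rt_throttled = 1 };

	/* The fix: unthrottle before draining the runqueue. */
	rq.rt_throttled = 0;
	migrate_tasks_model(&rq);
	printf("runqueue drained, nr_running=%d\n", rq.nr_running);
	return 0;
}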
diff --git a/queue-3.4/series b/queue-3.4/series
index e2c1f8fac88..0f40577580f 100644
--- a/queue-3.4/series
+++ b/queue-3.4/series
@@ -23,3 +23,5 @@ pci-enable-ari-if-dev-and-upstream-bridge-support-it-disable-otherwise.patch
 hpfs-deadlock-and-race-in-directory-lseek.patch
 sched-rt-fix-sched_rr-across-cgroups.patch
 sched-rt-fix-isolated-cpus-leaving-root_task_group-indefinitely-throttled.patch
+sched-unthrottle-rt-runqueues-in-__disable_runtime.patch
+sched-rt-avoid-updating-rt-entry-timeout-twice-within-one.patch