x86/speculation/mds: Clear CPU buffers on exit to user Add a static key which controls the invocation of the CPU buffer clear mechanism on exit to user space and add the call into prepare_exit_to_usermode() and do_nmi() right before actually returning. Add documentation which kernel to user space transition this covers and explain why some corner cases are not mitigated. Signed-off-by: Thomas Gleixner <tglx@linutronix.de> Reviewed-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org> Reviewed-by: Borislav Petkov <bp@suse.de> Reviewed-by: Frederic Weisbecker <frederic@kernel.org> Reviewed-by: Jon Masters <jcm@redhat.com> Tested-by: Jon Masters <jcm@redhat.com>

commit: 04dcbdb8057827b043b3c71aa397c4c63e67d086 [log] [tgz]
author: Thomas Gleixner <tglx@linutronix.de> Mon Feb 18 23:42:51 2019 +0100
committer: Thomas Gleixner <tglx@linutronix.de> Wed Mar 06 21:52:13 2019 +0100
tree: ef837807f05eaf4ce6abe5d62b5ef5703931c3e5
parent: 6a9e529272517755904b7afa639f6db59ddb793e [diff] [blame]
diff --git a/Documentation/x86/mds.rst b/Documentation/x86/mds.rst
index 1096738..54d935b 100644
--- a/Documentation/x86/mds.rst
+++ b/Documentation/x86/mds.rst

@@ -97,3 +97,55 @@
 itself are not required because the necessary gadgets to expose the leaked
 data cannot be controlled in a way which allows exploitation from malicious
 user space or VM guests.
+
+Mitigation points
+-----------------
+
+1. Return to user space
+^^^^^^^^^^^^^^^^^^^^^^^
+
+   When transitioning from kernel to user space the CPU buffers are flushed
+   on affected CPUs when the mitigation is not disabled on the kernel
+   command line. The migitation is enabled through the static key
+   mds_user_clear.
+
+   The mitigation is invoked in prepare_exit_to_usermode() which covers
+   most of the kernel to user space transitions. There are a few exceptions
+   which are not invoking prepare_exit_to_usermode() on return to user
+   space. These exceptions use the paranoid exit code.
+
+   - Non Maskable Interrupt (NMI):
+
+     Access to sensible data like keys, credentials in the NMI context is
+     mostly theoretical: The CPU can do prefetching or execute a
+     misspeculated code path and thereby fetching data which might end up
+     leaking through a buffer.
+
+     But for mounting other attacks the kernel stack address of the task is
+     already valuable information. So in full mitigation mode, the NMI is
+     mitigated on the return from do_nmi() to provide almost complete
+     coverage.
+
+   - Double fault (#DF):
+
+     A double fault is usually fatal, but the ESPFIX workaround, which can
+     be triggered from user space through modify_ldt(2) is a recoverable
+     double fault. #DF uses the paranoid exit path, so explicit mitigation
+     in the double fault handler is required.
+
+   - Machine Check Exception (#MC):
+
+     Another corner case is a #MC which hits between the CPU buffer clear
+     invocation and the actual return to user. As this still is in kernel
+     space it takes the paranoid exit path which does not clear the CPU
+     buffers. So the #MC handler repopulates the buffers to some
+     extent. Machine checks are not reliably controllable and the window is
+     extremly small so mitigation would just tick a checkbox that this
+     theoretical corner case is covered. To keep the amount of special
+     cases small, ignore #MC.
+
+   - Debug Exception (#DB):
+
+     This takes the paranoid exit path only when the INT1 breakpoint is in
+     kernel space. #DB on a user space address takes the regular exit path,
+     so no extra mitigation required.
commit	04dcbdb8057827b043b3c71aa397c4c63e67d086	[log] [tgz]
author	Thomas Gleixner <tglx@linutronix.de>	Mon Feb 18 23:42:51 2019 +0100
committer	Thomas Gleixner <tglx@linutronix.de>	Wed Mar 06 21:52:13 2019 +0100
tree	ef837807f05eaf4ce6abe5d62b5ef5703931c3e5
parent	6a9e529272517755904b7afa639f6db59ddb793e [diff] [blame]