c8c1eec578a9ae2dc8f14a1846942a0b7bf29d1d - SHIFTPHONES/android_kernel_shift_sdm845

commit	c8c1eec578a9ae2dc8f14a1846942a0b7bf29d1d
author	Eric Dumazet <edumazet@google.com>	Fri Dec 16 16:29:17 2022 +0000
committer	Greg Kroah-Hartman <gregkh@linuxfoundation.org>	Sat Jan 07 12:07:30 2023 +0100
tree	002191cdd032c8c086d71027dd3b05f1355aa0ca
parent	a4da73a1d2c22239958e5e8117580285231c3e69

net: stream: purge sk_error_queue in sk_stream_kill_queues()

[ Upstream commit e0c8bccd40fc1c19e1d246c39bcf79e357e1ada3 ]

Changheon Lee reported TCP socket leaks, with a nice repro.

It seems we leak TCP sockets with the following sequence:

1) SOF_TIMESTAMPING_TX_ACK is enabled on the socket.

   Each ACK will cook an skb put in error queue, from __skb_tstamp_tx().
   __skb_tstamp_tx() is using skb_clone(), unless
   SOF_TIMESTAMPING_OPT_TSONLY was also requested.

2) If the application is also using MSG_ZEROCOPY, then we put in the
   error queue cloned skbs that had a struct ubuf_info attached to them.

   Whenever a struct ubuf_info is allocated, sock_zerocopy_alloc()
   does a sock_hold().

   As long as the cloned skbs are still in sk_error_queue,
   socket refcount is kept elevated.

3) Application closes the socket, while error queue is not empty.

Since tcp_close() no longer purges the socket error queue,
we might end up with a TCP socket with at least one skb in
error queue keeping the socket alive forever.
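For illustration, the socket configuration from steps 1 and 2 can be sketched
in userspace as below. This is a minimal sketch, not the reporter's repro: the
helper names tx_ack_tstamp_flags() and enable_tx_ack_and_zerocopy() are
hypothetical, and the fallback value for SO_ZEROCOPY is an assumption for
toolchains whose headers predate that option. Only the option setup is shown;
sending with MSG_ZEROCOPY and closing the socket are left out.

```c
#include <linux/net_tstamp.h>   /* SOF_TIMESTAMPING_* flags */
#include <sys/socket.h>         /* setsockopt(), SOL_SOCKET, SO_TIMESTAMPING */

#ifndef SO_ZEROCOPY
#define SO_ZEROCOPY 60          /* assumption: asm-generic value, for old headers */
#endif

/*
 * Build the SO_TIMESTAMPING flag word from step 1.
 * With tsonly == 0 the kernel clones the skb for the error queue;
 * with tsonly != 0 SOF_TIMESTAMPING_OPT_TSONLY is requested as well.
 */
static unsigned int tx_ack_tstamp_flags(int tsonly)
{
    unsigned int flags = SOF_TIMESTAMPING_TX_ACK;

    if (tsonly)
        flags |= SOF_TIMESTAMPING_OPT_TSONLY;
    return flags;
}

/*
 * Apply steps 1 and 2 to an existing TCP socket.
 * Returns 0 on success, -1 on failure (errno is set by setsockopt()).
 */
static int enable_tx_ack_and_zerocopy(int fd, int tsonly)
{
    unsigned int flags = tx_ack_tstamp_flags(tsonly);
    int one = 1;

    if (setsockopt(fd, SOL_SOCKET, SO_TIMESTAMPING, &flags, sizeof(flags)) < 0)
        return -1;
    if (setsockopt(fd, SOL_SOCKET, SO_ZEROCOPY, &one, sizeof(one)) < 0)
        return -1;
    return 0;
}
```

A caller would pass a connected TCP socket and tsonly == 0 to match the
sequence described above.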

This bug can be (ab)used to consume all kernel memory
and freeze the host.

We need to purge the error queue, with proper synchronization
against concurrent writers.
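The refcount problem and the locked purge can be modelled in plain C. This is
a toy userspace sketch, not the kernel change itself: every toy_* name is
hypothetical, a C11 atomic_flag spinlock stands in for the queue lock, and a
counter stands in for the socket refcount. Each queued entry pins its socket,
as a cloned skb carrying a ubuf_info does, so closing without a purge leaves
the count above zero.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stddef.h>
#include <stdlib.h>

struct toy_sock;

/* Toy stand-in for an skb sitting on the error queue. */
struct toy_skb {
    struct toy_skb *next;
    struct toy_sock *owner;     /* socket reference held by this entry */
};

/* Toy stand-in for a socket; not the kernel's struct sock. */
struct toy_sock {
    atomic_int refcnt;
    atomic_flag queue_lock;     /* protects error_queue */
    struct toy_skb *error_queue;
};

static void toy_sock_init(struct toy_sock *sk)
{
    atomic_init(&sk->refcnt, 1);        /* the application's reference */
    atomic_flag_clear(&sk->queue_lock);
    sk->error_queue = NULL;
}

static void toy_lock(struct toy_sock *sk)
{
    while (atomic_flag_test_and_set_explicit(&sk->queue_lock,
                                             memory_order_acquire))
        ;                               /* spin until the lock is free */
}

static void toy_unlock(struct toy_sock *sk)
{
    atomic_flag_clear_explicit(&sk->queue_lock, memory_order_release);
}

/* Writer side: queue an entry that pins the socket. Returns 0 or -1. */
static int toy_queue_err_skb(struct toy_sock *sk)
{
    struct toy_skb *skb = malloc(sizeof(*skb));

    if (!skb)
        return -1;
    atomic_fetch_add(&sk->refcnt, 1);   /* models sock_hold() */
    skb->owner = sk;
    toy_lock(sk);
    skb->next = sk->error_queue;
    sk->error_queue = skb;
    toy_unlock(sk);
    return 0;
}

/* Detach the whole list under the lock, then free it outside the lock. */
static void toy_purge_error_queue(struct toy_sock *sk)
{
    toy_lock(sk);
    struct toy_skb *list = sk->error_queue;
    sk->error_queue = NULL;
    toy_unlock(sk);

    while (list) {
        struct toy_skb *next = list->next;

        assert(list->owner == sk);
        atomic_fetch_sub(&list->owner->refcnt, 1);  /* models sock_put() */
        free(list);
        list = next;
    }
}

/* Drop the application's reference; returns the remaining refcount. */
static int toy_close(struct toy_sock *sk, int purge)
{
    if (purge)
        toy_purge_error_queue(sk);
    return atomic_fetch_sub(&sk->refcnt, 1) - 1;
}
```

In this model, toy_close(sk, 0) with entries still queued returns a non-zero
count, which corresponds to the leaked socket. toy_close(sk, 1) returns 0
once no writer is adding entries.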

Fixes: 24bcbe1cc69f ("net: stream: don't purge sk_error_queue in sk_stream_kill_queues()")
Reported-by: Changheon Lee <darklight2357@icloud.com>
Signed-off-by: Eric Dumazet <edumazet@google.com>
Signed-off-by: David S. Miller <davem@davemloft.net>
Signed-off-by: Sasha Levin <sashal@kernel.org>

net/core/stream.c

1 file changed
