Thomas Gleixner | ec8f24b | 2019-05-19 13:07:45 +0100 | [diff] [blame] | 1 | # SPDX-License-Identifier: GPL-2.0-only |
Christoph Hellwig | 59e0b52 | 2018-07-31 13:39:35 +0200 | [diff] [blame] | 2 | |
| 3 | menu "Memory Management options" |
| 4 | |
Dave Hansen | e1785e8 | 2005-06-23 00:07:49 -0700 | [diff] [blame] | 5 | config SELECT_MEMORY_MODEL |
| 6 | def_bool y |
Kees Cook | a8826ee | 2013-01-16 18:54:17 -0800 | [diff] [blame] | 7 | depends on ARCH_SELECT_MEMORY_MODEL |
Dave Hansen | e1785e8 | 2005-06-23 00:07:49 -0700 | [diff] [blame] | 8 | |
Dave Hansen | 3a9da76 | 2005-06-23 00:07:42 -0700 | [diff] [blame] | 9 | choice |
| 10 | prompt "Memory model" |
Dave Hansen | e1785e8 | 2005-06-23 00:07:49 -0700 | [diff] [blame] | 11 | depends on SELECT_MEMORY_MODEL |
| 12 | default DISCONTIGMEM_MANUAL if ARCH_DISCONTIGMEM_DEFAULT |
Andy Whitcroft | d41dee3 | 2005-06-23 00:07:54 -0700 | [diff] [blame] | 13 | default SPARSEMEM_MANUAL if ARCH_SPARSEMEM_DEFAULT |
Dave Hansen | e1785e8 | 2005-06-23 00:07:49 -0700 | [diff] [blame] | 14 | default FLATMEM_MANUAL |
Mike Rapoport | d66d109 | 2019-05-13 17:23:05 -0700 | [diff] [blame] | 15 | help |
| 16 | This option allows you to change some of the ways that |
| 17 | Linux manages its memory internally. Most users will |
| 18 | only have one option here selected by the architecture |
| 19 | configuration. This is normal. |
Dave Hansen | 3a9da76 | 2005-06-23 00:07:42 -0700 | [diff] [blame] | 20 | |
Dave Hansen | e1785e8 | 2005-06-23 00:07:49 -0700 | [diff] [blame] | 21 | config FLATMEM_MANUAL |
Dave Hansen | 3a9da76 | 2005-06-23 00:07:42 -0700 | [diff] [blame] | 22 | bool "Flat Memory" |
Anton Blanchard | c898ec1 | 2006-01-06 00:12:07 -0800 | [diff] [blame] | 23 | depends on !(ARCH_DISCONTIGMEM_ENABLE || ARCH_SPARSEMEM_ENABLE) || ARCH_FLATMEM_ENABLE |
Dave Hansen | 3a9da76 | 2005-06-23 00:07:42 -0700 | [diff] [blame] | 24 | help |
Mike Rapoport | d66d109 | 2019-05-13 17:23:05 -0700 | [diff] [blame] | 25 | This option is best suited for non-NUMA systems with |
| 26 | flat address space. The FLATMEM is the most efficient |
| 27 | system in terms of performance and resource consumption |
| 28 | and it is the best option for smaller systems. |
Dave Hansen | 3a9da76 | 2005-06-23 00:07:42 -0700 | [diff] [blame] | 29 | |
Mike Rapoport | d66d109 | 2019-05-13 17:23:05 -0700 | [diff] [blame] | 30 | For systems that have holes in their physical address |
| 31 | spaces and for features like NUMA and memory hotplug, |
Randy Dunlap | dd33d29 | 2019-11-30 17:58:26 -0800 | [diff] [blame] | 32 | choose "Sparse Memory". |
Andy Whitcroft | d41dee3 | 2005-06-23 00:07:54 -0700 | [diff] [blame] | 33 | |
| 34 | If unsure, choose this option (Flat Memory) over any other. |
Dave Hansen | 3a9da76 | 2005-06-23 00:07:42 -0700 | [diff] [blame] | 35 | |
Dave Hansen | e1785e8 | 2005-06-23 00:07:49 -0700 | [diff] [blame] | 36 | config DISCONTIGMEM_MANUAL |
Dave Hansen | f3519f9 | 2005-09-16 19:27:54 -0700 | [diff] [blame] | 37 | bool "Discontiguous Memory" |
Dave Hansen | 3a9da76 | 2005-06-23 00:07:42 -0700 | [diff] [blame] | 38 | depends on ARCH_DISCONTIGMEM_ENABLE |
| 39 | help |
Dave Hansen | 785dcd4 | 2005-06-23 00:07:50 -0700 | [diff] [blame] | 40 | This option provides enhanced support for discontiguous |
| 41 | memory systems, over FLATMEM. These systems have holes |
| 42 | in their physical address spaces, and this option provides |
Mike Rapoport | d66d109 | 2019-05-13 17:23:05 -0700 | [diff] [blame] | 43 | more efficient handling of these holes. |
Dave Hansen | 785dcd4 | 2005-06-23 00:07:50 -0700 | [diff] [blame] | 44 | |
Mike Rapoport | d66d109 | 2019-05-13 17:23:05 -0700 | [diff] [blame] | 45 | Although "Discontiguous Memory" is still used by several |
| 46 | architectures, it is considered deprecated in favor of |
| 47 | "Sparse Memory". |
Dave Hansen | 785dcd4 | 2005-06-23 00:07:50 -0700 | [diff] [blame] | 48 | |
Mike Rapoport | d66d109 | 2019-05-13 17:23:05 -0700 | [diff] [blame] | 49 | If unsure, choose "Sparse Memory" over this option. |
Dave Hansen | 3a9da76 | 2005-06-23 00:07:42 -0700 | [diff] [blame] | 50 | |
Andy Whitcroft | d41dee3 | 2005-06-23 00:07:54 -0700 | [diff] [blame] | 51 | config SPARSEMEM_MANUAL |
| 52 | bool "Sparse Memory" |
| 53 | depends on ARCH_SPARSEMEM_ENABLE |
| 54 | help |
| 55 | This will be the only option for some systems, including |
Mike Rapoport | d66d109 | 2019-05-13 17:23:05 -0700 | [diff] [blame] | 56 | memory hot-plug systems. This is normal. |
Andy Whitcroft | d41dee3 | 2005-06-23 00:07:54 -0700 | [diff] [blame] | 57 | |
Mike Rapoport | d66d109 | 2019-05-13 17:23:05 -0700 | [diff] [blame] | 58 | This option provides efficient support for systems with |
| 59 | holes is their physical address space and allows memory |
| 60 | hot-plug and hot-remove. |
Andy Whitcroft | d41dee3 | 2005-06-23 00:07:54 -0700 | [diff] [blame] | 61 | |
Mike Rapoport | d66d109 | 2019-05-13 17:23:05 -0700 | [diff] [blame] | 62 | If unsure, choose "Flat Memory" over this option. |
Andy Whitcroft | d41dee3 | 2005-06-23 00:07:54 -0700 | [diff] [blame] | 63 | |
Dave Hansen | 3a9da76 | 2005-06-23 00:07:42 -0700 | [diff] [blame] | 64 | endchoice |
| 65 | |
Dave Hansen | e1785e8 | 2005-06-23 00:07:49 -0700 | [diff] [blame] | 66 | config DISCONTIGMEM |
| 67 | def_bool y |
| 68 | depends on (!SELECT_MEMORY_MODEL && ARCH_DISCONTIGMEM_ENABLE) || DISCONTIGMEM_MANUAL |
| 69 | |
Andy Whitcroft | d41dee3 | 2005-06-23 00:07:54 -0700 | [diff] [blame] | 70 | config SPARSEMEM |
| 71 | def_bool y |
Russell King | 1a83e17 | 2009-10-26 16:50:12 -0700 | [diff] [blame] | 72 | depends on (!SELECT_MEMORY_MODEL && ARCH_SPARSEMEM_ENABLE) || SPARSEMEM_MANUAL |
Andy Whitcroft | d41dee3 | 2005-06-23 00:07:54 -0700 | [diff] [blame] | 73 | |
Dave Hansen | e1785e8 | 2005-06-23 00:07:49 -0700 | [diff] [blame] | 74 | config FLATMEM |
| 75 | def_bool y |
Andy Whitcroft | d41dee3 | 2005-06-23 00:07:54 -0700 | [diff] [blame] | 76 | depends on (!DISCONTIGMEM && !SPARSEMEM) || FLATMEM_MANUAL |
| 77 | |
| 78 | config FLAT_NODE_MEM_MAP |
| 79 | def_bool y |
| 80 | depends on !SPARSEMEM |
Dave Hansen | e1785e8 | 2005-06-23 00:07:49 -0700 | [diff] [blame] | 81 | |
Dave Hansen | 93b7504 | 2005-06-23 00:07:47 -0700 | [diff] [blame] | 82 | # |
| 83 | # Both the NUMA code and DISCONTIGMEM use arrays of pg_data_t's |
| 84 | # to represent different areas of memory. This variable allows |
| 85 | # those dependencies to exist individually. |
| 86 | # |
| 87 | config NEED_MULTIPLE_NODES |
| 88 | def_bool y |
| 89 | depends on DISCONTIGMEM || NUMA |
Andy Whitcroft | af70536 | 2005-06-23 00:07:53 -0700 | [diff] [blame] | 90 | |
Bob Picco | 802f192 | 2005-09-03 15:54:26 -0700 | [diff] [blame] | 91 | # |
Bob Picco | 3e34726 | 2005-09-03 15:54:28 -0700 | [diff] [blame] | 92 | # SPARSEMEM_EXTREME (which is the default) does some bootmem |
Mike Rapoport | c89ab04 | 2020-08-06 23:24:02 -0700 | [diff] [blame] | 93 | # allocations when sparse_init() is called. If this cannot |
Bob Picco | 3e34726 | 2005-09-03 15:54:28 -0700 | [diff] [blame] | 94 | # be done on your architecture, select this option. However, |
| 95 | # statically allocating the mem_section[] array can potentially |
| 96 | # consume vast quantities of .bss, so be careful. |
| 97 | # |
| 98 | # This option will also potentially produce smaller runtime code |
| 99 | # with gcc 3.4 and later. |
| 100 | # |
| 101 | config SPARSEMEM_STATIC |
Jan Beulich | 9ba1608 | 2008-10-15 22:01:38 -0700 | [diff] [blame] | 102 | bool |
Bob Picco | 3e34726 | 2005-09-03 15:54:28 -0700 | [diff] [blame] | 103 | |
| 104 | # |
Matt LaPlante | 44c0920 | 2006-10-03 22:34:14 +0200 | [diff] [blame] | 105 | # Architecture platforms which require a two level mem_section in SPARSEMEM |
Bob Picco | 802f192 | 2005-09-03 15:54:26 -0700 | [diff] [blame] | 106 | # must select this option. This is usually for architecture platforms with |
| 107 | # an extremely sparse physical address space. |
| 108 | # |
Bob Picco | 3e34726 | 2005-09-03 15:54:28 -0700 | [diff] [blame] | 109 | config SPARSEMEM_EXTREME |
| 110 | def_bool y |
| 111 | depends on SPARSEMEM && !SPARSEMEM_STATIC |
Hugh Dickins | 4c21e2f | 2005-10-29 18:16:40 -0700 | [diff] [blame] | 112 | |
Andy Whitcroft | 29c7111 | 2007-10-16 01:24:14 -0700 | [diff] [blame] | 113 | config SPARSEMEM_VMEMMAP_ENABLE |
Jan Beulich | 9ba1608 | 2008-10-15 22:01:38 -0700 | [diff] [blame] | 114 | bool |
Andy Whitcroft | 29c7111 | 2007-10-16 01:24:14 -0700 | [diff] [blame] | 115 | |
| 116 | config SPARSEMEM_VMEMMAP |
Geoff Levand | a5ee6da | 2007-12-17 16:19:53 -0800 | [diff] [blame] | 117 | bool "Sparse Memory virtual memmap" |
| 118 | depends on SPARSEMEM && SPARSEMEM_VMEMMAP_ENABLE |
| 119 | default y |
| 120 | help |
Krzysztof Kozlowski | 19fa40a | 2019-11-30 17:58:23 -0800 | [diff] [blame] | 121 | SPARSEMEM_VMEMMAP uses a virtually mapped memmap to optimise |
| 122 | pfn_to_page and page_to_pfn operations. This is the most |
| 123 | efficient option when sufficient kernel resources are available. |
Andy Whitcroft | 29c7111 | 2007-10-16 01:24:14 -0700 | [diff] [blame] | 124 | |
Philipp Hachtmann | 70210ed | 2014-01-29 18:16:01 +0100 | [diff] [blame] | 125 | config HAVE_MEMBLOCK_PHYS_MAP |
Christoph Jaeger | 6341e62 | 2014-12-20 15:41:11 -0500 | [diff] [blame] | 126 | bool |
Philipp Hachtmann | 70210ed | 2014-01-29 18:16:01 +0100 | [diff] [blame] | 127 | |
Christoph Hellwig | 67a929e | 2019-07-11 20:57:14 -0700 | [diff] [blame] | 128 | config HAVE_FAST_GUP |
Christoph Hellwig | 050a9ad | 2019-07-11 20:57:21 -0700 | [diff] [blame] | 129 | depends on MMU |
Christoph Jaeger | 6341e62 | 2014-12-20 15:41:11 -0500 | [diff] [blame] | 130 | bool |
Steve Capper | 2667f50 | 2014-10-09 15:29:14 -0700 | [diff] [blame] | 131 | |
David Hildenbrand | 52219ae | 2020-06-04 16:48:38 -0700 | [diff] [blame] | 132 | # Don't discard allocated memory used to track "memory" and "reserved" memblocks |
| 133 | # after early boot, so it can still be used to test for validity of memory. |
| 134 | # Also, memblocks are updated with memory hot(un)plug. |
Mike Rapoport | 350e88b | 2019-05-13 17:22:59 -0700 | [diff] [blame] | 135 | config ARCH_KEEP_MEMBLOCK |
Christoph Jaeger | 6341e62 | 2014-12-20 15:41:11 -0500 | [diff] [blame] | 136 | bool |
Tejun Heo | c378ddd | 2011-07-14 11:46:03 +0200 | [diff] [blame] | 137 | |
Dan Williams | 1e5d8e1 | 2020-02-16 12:01:04 -0800 | [diff] [blame] | 138 | # Keep arch NUMA mapping infrastructure post-init. |
| 139 | config NUMA_KEEP_MEMINFO |
| 140 | bool |
| 141 | |
Minchan Kim | ee6f509 | 2012-07-31 16:43:50 -0700 | [diff] [blame] | 142 | config MEMORY_ISOLATION |
Christoph Jaeger | 6341e62 | 2014-12-20 15:41:11 -0500 | [diff] [blame] | 143 | bool |
Minchan Kim | ee6f509 | 2012-07-31 16:43:50 -0700 | [diff] [blame] | 144 | |
Yasuaki Ishimatsu | 46723bf | 2013-02-22 16:33:00 -0800 | [diff] [blame] | 145 | # |
| 146 | # Only be set on architectures that have completely implemented memory hotplug |
| 147 | # feature. If you are not sure, don't touch it. |
| 148 | # |
| 149 | config HAVE_BOOTMEM_INFO_NODE |
| 150 | def_bool n |
| 151 | |
Dave Hansen | 3947be1 | 2005-10-29 18:16:54 -0700 | [diff] [blame] | 152 | # eventually, we can have this option just 'select SPARSEMEM' |
| 153 | config MEMORY_HOTPLUG |
| 154 | bool "Allow for memory hot-add" |
David Hildenbrand | b30c592 | 2020-10-15 20:08:23 -0700 | [diff] [blame] | 155 | select MEMORY_ISOLATION |
Keith Mannthey | ec69acb | 2006-09-30 23:27:05 -0700 | [diff] [blame] | 156 | depends on SPARSEMEM || X86_64_ACPI_NUMA |
Stephen Rothwell | 40b3136 | 2013-05-21 13:49:35 +1000 | [diff] [blame] | 157 | depends on ARCH_ENABLE_MEMORY_HOTPLUG |
Michal Hocko | b59d02e | 2020-06-04 16:48:51 -0700 | [diff] [blame] | 158 | depends on 64BIT || BROKEN |
Dan Williams | 1e5d8e1 | 2020-02-16 12:01:04 -0800 | [diff] [blame] | 159 | select NUMA_KEEP_MEMINFO if NUMA |
Dave Hansen | 3947be1 | 2005-10-29 18:16:54 -0700 | [diff] [blame] | 160 | |
Keith Mannthey | ec69acb | 2006-09-30 23:27:05 -0700 | [diff] [blame] | 161 | config MEMORY_HOTPLUG_SPARSE |
| 162 | def_bool y |
| 163 | depends on SPARSEMEM && MEMORY_HOTPLUG |
| 164 | |
Vitaly Kuznetsov | 8604d9e | 2016-05-19 17:13:03 -0700 | [diff] [blame] | 165 | config MEMORY_HOTPLUG_DEFAULT_ONLINE |
Krzysztof Kozlowski | 19fa40a | 2019-11-30 17:58:23 -0800 | [diff] [blame] | 166 | bool "Online the newly added memory blocks by default" |
| 167 | depends on MEMORY_HOTPLUG |
| 168 | help |
Vitaly Kuznetsov | 8604d9e | 2016-05-19 17:13:03 -0700 | [diff] [blame] | 169 | This option sets the default policy setting for memory hotplug |
| 170 | onlining policy (/sys/devices/system/memory/auto_online_blocks) which |
| 171 | determines what happens to newly added memory regions. Policy setting |
| 172 | can always be changed at runtime. |
Mauro Carvalho Chehab | cb1aaeb | 2019-06-07 15:54:32 -0300 | [diff] [blame] | 173 | See Documentation/admin-guide/mm/memory-hotplug.rst for more information. |
Vitaly Kuznetsov | 8604d9e | 2016-05-19 17:13:03 -0700 | [diff] [blame] | 174 | |
| 175 | Say Y here if you want all hot-plugged memory blocks to appear in |
| 176 | 'online' state by default. |
| 177 | Say N here if you want the default policy to keep all hot-plugged |
| 178 | memory blocks in 'offline' state. |
| 179 | |
KAMEZAWA Hiroyuki | 0c0e619 | 2007-10-16 01:26:12 -0700 | [diff] [blame] | 180 | config MEMORY_HOTREMOVE |
| 181 | bool "Allow for memory hot remove" |
Nathan Fontenot | f7e3334 | 2013-09-27 10:18:09 -0500 | [diff] [blame] | 182 | select HAVE_BOOTMEM_INFO_NODE if (X86_64 || PPC64) |
KAMEZAWA Hiroyuki | 0c0e619 | 2007-10-16 01:26:12 -0700 | [diff] [blame] | 183 | depends on MEMORY_HOTPLUG && ARCH_ENABLE_MEMORY_HOTREMOVE |
| 184 | depends on MIGRATION |
| 185 | |
Hugh Dickins | 4c21e2f | 2005-10-29 18:16:40 -0700 | [diff] [blame] | 186 | # Heavily threaded applications may benefit from splitting the mm-wide |
| 187 | # page_table_lock, so that faults on different parts of the user address |
| 188 | # space can be handled with less contention: split it at this NR_CPUS. |
| 189 | # Default to 4 for wider testing, though 8 might be more appropriate. |
| 190 | # ARM's adjust_pte (unused if VIPT) depends on mm-wide page_table_lock. |
Hugh Dickins | 7b6ac9d | 2005-11-23 13:37:37 -0800 | [diff] [blame] | 191 | # PA-RISC 7xxx's spinlock_t would enlarge struct page from 32 to 44 bytes. |
Will Deacon | 60bccaa | 2020-05-26 18:33:01 +0100 | [diff] [blame] | 192 | # SPARC32 allocates multiple pte tables within a single page, and therefore |
| 193 | # a per-page lock leads to problems when multiple tables need to be locked |
| 194 | # at the same time (e.g. copy_page_range()). |
Hugh Dickins | a70caa8 | 2009-12-14 17:59:02 -0800 | [diff] [blame] | 195 | # DEBUG_SPINLOCK and DEBUG_LOCK_ALLOC spinlock_t also enlarge struct page. |
Hugh Dickins | 4c21e2f | 2005-10-29 18:16:40 -0700 | [diff] [blame] | 196 | # |
| 197 | config SPLIT_PTLOCK_CPUS |
| 198 | int |
Kirill A. Shutemov | 9164550 | 2014-04-07 15:37:14 -0700 | [diff] [blame] | 199 | default "999999" if !MMU |
Hugh Dickins | a70caa8 | 2009-12-14 17:59:02 -0800 | [diff] [blame] | 200 | default "999999" if ARM && !CPU_CACHE_VIPT |
| 201 | default "999999" if PARISC && !PA20 |
Will Deacon | 60bccaa | 2020-05-26 18:33:01 +0100 | [diff] [blame] | 202 | default "999999" if SPARC32 |
Hugh Dickins | 4c21e2f | 2005-10-29 18:16:40 -0700 | [diff] [blame] | 203 | default "4" |
Christoph Lameter | 7cbe34c | 2006-01-08 01:00:49 -0800 | [diff] [blame] | 204 | |
Kirill A. Shutemov | e009bb3 | 2013-11-14 14:31:07 -0800 | [diff] [blame] | 205 | config ARCH_ENABLE_SPLIT_PMD_PTLOCK |
Christoph Jaeger | 6341e62 | 2014-12-20 15:41:11 -0500 | [diff] [blame] | 206 | bool |
Kirill A. Shutemov | e009bb3 | 2013-11-14 14:31:07 -0800 | [diff] [blame] | 207 | |
Christoph Lameter | 7cbe34c | 2006-01-08 01:00:49 -0800 | [diff] [blame] | 208 | # |
Konstantin Khlebnikov | 09316c0 | 2014-10-09 15:29:32 -0700 | [diff] [blame] | 209 | # support for memory balloon |
| 210 | config MEMORY_BALLOON |
Christoph Jaeger | 6341e62 | 2014-12-20 15:41:11 -0500 | [diff] [blame] | 211 | bool |
Konstantin Khlebnikov | 09316c0 | 2014-10-09 15:29:32 -0700 | [diff] [blame] | 212 | |
| 213 | # |
Rafael Aquini | 18468d9 | 2012-12-11 16:02:38 -0800 | [diff] [blame] | 214 | # support for memory balloon compaction |
| 215 | config BALLOON_COMPACTION |
| 216 | bool "Allow for balloon memory compaction/migration" |
| 217 | def_bool y |
Konstantin Khlebnikov | 09316c0 | 2014-10-09 15:29:32 -0700 | [diff] [blame] | 218 | depends on COMPACTION && MEMORY_BALLOON |
Rafael Aquini | 18468d9 | 2012-12-11 16:02:38 -0800 | [diff] [blame] | 219 | help |
| 220 | Memory fragmentation introduced by ballooning might reduce |
| 221 | significantly the number of 2MB contiguous memory blocks that can be |
| 222 | used within a guest, thus imposing performance penalties associated |
| 223 | with the reduced number of transparent huge pages that could be used |
| 224 | by the guest workload. Allowing the compaction & migration for memory |
| 225 | pages enlisted as being part of memory balloon devices avoids the |
| 226 | scenario aforementioned and helps improving memory defragmentation. |
| 227 | |
| 228 | # |
Mel Gorman | e9e96b3 | 2010-05-24 14:32:21 -0700 | [diff] [blame] | 229 | # support for memory compaction |
| 230 | config COMPACTION |
| 231 | bool "Allow for memory compaction" |
Rik van Riel | 05106e6 | 2012-10-08 16:33:03 -0700 | [diff] [blame] | 232 | def_bool y |
Mel Gorman | e9e96b3 | 2010-05-24 14:32:21 -0700 | [diff] [blame] | 233 | select MIGRATION |
Andrea Arcangeli | 33a9387 | 2011-01-25 15:07:25 -0800 | [diff] [blame] | 234 | depends on MMU |
Mel Gorman | e9e96b3 | 2010-05-24 14:32:21 -0700 | [diff] [blame] | 235 | help |
Krzysztof Kozlowski | 19fa40a | 2019-11-30 17:58:23 -0800 | [diff] [blame] | 236 | Compaction is the only memory management component to form |
| 237 | high order (larger physically contiguous) memory blocks |
| 238 | reliably. The page allocator relies on compaction heavily and |
| 239 | the lack of the feature can lead to unexpected OOM killer |
| 240 | invocations for high order memory requests. You shouldn't |
| 241 | disable this option unless there really is a strong reason for |
| 242 | it and then we would be really interested to hear about that at |
| 243 | linux-mm@kvack.org. |
Mel Gorman | e9e96b3 | 2010-05-24 14:32:21 -0700 | [diff] [blame] | 244 | |
| 245 | # |
Alexander Duyck | 36e66c5 | 2020-04-06 20:04:56 -0700 | [diff] [blame] | 246 | # support for free page reporting |
| 247 | config PAGE_REPORTING |
| 248 | bool "Free page reporting" |
| 249 | def_bool n |
| 250 | help |
| 251 | Free page reporting allows for the incremental acquisition of |
| 252 | free pages from the buddy allocator for the purpose of reporting |
| 253 | those pages to another entity, such as a hypervisor, so that the |
| 254 | memory can be freed within the host for other uses. |
| 255 | |
| 256 | # |
Christoph Lameter | 7cbe34c | 2006-01-08 01:00:49 -0800 | [diff] [blame] | 257 | # support for page migration |
| 258 | # |
| 259 | config MIGRATION |
Christoph Lameter | b20a350 | 2006-03-22 00:09:12 -0800 | [diff] [blame] | 260 | bool "Page migration" |
Christoph Lameter | 6c5240a | 2006-06-23 02:03:37 -0700 | [diff] [blame] | 261 | def_bool y |
Chen Gang | de32a81 | 2013-09-12 15:14:08 -0700 | [diff] [blame] | 262 | depends on (NUMA || ARCH_ENABLE_MEMORY_HOTREMOVE || COMPACTION || CMA) && MMU |
Christoph Lameter | b20a350 | 2006-03-22 00:09:12 -0800 | [diff] [blame] | 263 | help |
| 264 | Allows the migration of the physical location of pages of processes |
Mel Gorman | e9e96b3 | 2010-05-24 14:32:21 -0700 | [diff] [blame] | 265 | while the virtual addresses are not changed. This is useful in |
| 266 | two situations. The first is on NUMA systems to put pages nearer |
| 267 | to the processors accessing. The second is when allocating huge |
| 268 | pages as migration can relocate pages to satisfy a huge page |
| 269 | allocation instead of reclaiming. |
Greg Kroah-Hartman | 6550e07 | 2006-06-12 17:11:31 -0700 | [diff] [blame] | 270 | |
Naoya Horiguchi | c177c81 | 2014-06-04 16:05:35 -0700 | [diff] [blame] | 271 | config ARCH_ENABLE_HUGEPAGE_MIGRATION |
Christoph Jaeger | 6341e62 | 2014-12-20 15:41:11 -0500 | [diff] [blame] | 272 | bool |
Naoya Horiguchi | c177c81 | 2014-06-04 16:05:35 -0700 | [diff] [blame] | 273 | |
Naoya Horiguchi | 9c670ea | 2017-09-08 16:10:53 -0700 | [diff] [blame] | 274 | config ARCH_ENABLE_THP_MIGRATION |
| 275 | bool |
| 276 | |
Alexandre Ghiti | 8df995f | 2019-05-13 17:19:00 -0700 | [diff] [blame] | 277 | config CONTIG_ALLOC |
Krzysztof Kozlowski | 19fa40a | 2019-11-30 17:58:23 -0800 | [diff] [blame] | 278 | def_bool (MEMORY_ISOLATION && COMPACTION) || CMA |
Alexandre Ghiti | 8df995f | 2019-05-13 17:19:00 -0700 | [diff] [blame] | 279 | |
Jeremy Fitzhardinge | 600715d | 2008-09-11 01:31:45 -0700 | [diff] [blame] | 280 | config PHYS_ADDR_T_64BIT |
Christoph Hellwig | d4a451d | 2018-04-03 16:24:20 +0200 | [diff] [blame] | 281 | def_bool 64BIT |
Jeremy Fitzhardinge | 600715d | 2008-09-11 01:31:45 -0700 | [diff] [blame] | 282 | |
Christoph Lameter | 2a7326b | 2007-07-17 04:03:37 -0700 | [diff] [blame] | 283 | config BOUNCE |
Vinayak Menon | 9ca24e2 | 2013-04-29 15:08:55 -0700 | [diff] [blame] | 284 | bool "Enable bounce buffers" |
| 285 | default y |
Christoph Lameter | 2a7326b | 2007-07-17 04:03:37 -0700 | [diff] [blame] | 286 | depends on BLOCK && MMU && (ZONE_DMA || HIGHMEM) |
Vinayak Menon | 9ca24e2 | 2013-04-29 15:08:55 -0700 | [diff] [blame] | 287 | help |
| 288 | Enable bounce buffers for devices that cannot access |
| 289 | the full range of memory available to the CPU. Enabled |
| 290 | by default when ZONE_DMA or HIGHMEM is selected, but you |
| 291 | may say n to override this. |
Christoph Lameter | 2a7326b | 2007-07-17 04:03:37 -0700 | [diff] [blame] | 292 | |
Stephen Rothwell | f057eac | 2007-07-15 23:40:05 -0700 | [diff] [blame] | 293 | config VIRT_TO_BUS |
Stephen Rothwell | 4febd95 | 2013-03-07 15:48:16 +1100 | [diff] [blame] | 294 | bool |
| 295 | help |
| 296 | An architecture should select this if it implements the |
| 297 | deprecated interface virt_to_bus(). All new architectures |
| 298 | should probably not select this. |
| 299 | |
Andrea Arcangeli | cddb8a5 | 2008-07-28 15:46:29 -0700 | [diff] [blame] | 300 | |
| 301 | config MMU_NOTIFIER |
| 302 | bool |
Pranith Kumar | 83fe27e | 2014-12-05 11:24:45 -0500 | [diff] [blame] | 303 | select SRCU |
Jason Gunthorpe | 99cb252 | 2019-11-12 16:22:19 -0400 | [diff] [blame] | 304 | select INTERVAL_TREE |
David Howells | fc4d5c2 | 2009-05-06 16:03:05 -0700 | [diff] [blame] | 305 | |
Hugh Dickins | f8af4da | 2009-09-21 17:01:57 -0700 | [diff] [blame] | 306 | config KSM |
| 307 | bool "Enable KSM for page merging" |
| 308 | depends on MMU |
Timofey Titovets | 59e1a2f4 | 2018-12-28 00:34:05 -0800 | [diff] [blame] | 309 | select XXHASH |
Hugh Dickins | f8af4da | 2009-09-21 17:01:57 -0700 | [diff] [blame] | 310 | help |
| 311 | Enable Kernel Samepage Merging: KSM periodically scans those areas |
| 312 | of an application's address space that an app has advised may be |
| 313 | mergeable. When it finds pages of identical content, it replaces |
Hugh Dickins | d0f209f | 2009-12-14 17:59:34 -0800 | [diff] [blame] | 314 | the many instances by a single page with that content, so |
Hugh Dickins | f8af4da | 2009-09-21 17:01:57 -0700 | [diff] [blame] | 315 | saving memory until one or another app needs to modify the content. |
| 316 | Recommended for use with KVM, or with other duplicative applications. |
Mike Rapoport | ad56b73 | 2018-03-21 21:22:47 +0200 | [diff] [blame] | 317 | See Documentation/vm/ksm.rst for more information: KSM is inactive |
Hugh Dickins | c73602a | 2009-10-07 16:32:22 -0700 | [diff] [blame] | 318 | until a program has madvised that an area is MADV_MERGEABLE, and |
| 319 | root has set /sys/kernel/mm/ksm/run to 1 (if CONFIG_SYSFS is set). |
Hugh Dickins | f8af4da | 2009-09-21 17:01:57 -0700 | [diff] [blame] | 320 | |
Christoph Lameter | e0a94c2 | 2009-06-03 16:04:31 -0400 | [diff] [blame] | 321 | config DEFAULT_MMAP_MIN_ADDR |
Krzysztof Kozlowski | 19fa40a | 2019-11-30 17:58:23 -0800 | [diff] [blame] | 322 | int "Low address space to protect from user allocation" |
David Howells | 6e14154 | 2009-12-15 19:27:45 +0000 | [diff] [blame] | 323 | depends on MMU |
Krzysztof Kozlowski | 19fa40a | 2019-11-30 17:58:23 -0800 | [diff] [blame] | 324 | default 4096 |
| 325 | help |
Christoph Lameter | e0a94c2 | 2009-06-03 16:04:31 -0400 | [diff] [blame] | 326 | This is the portion of low virtual memory which should be protected |
| 327 | from userspace allocation. Keeping a user from writing to low pages |
| 328 | can help reduce the impact of kernel NULL pointer bugs. |
| 329 | |
| 330 | For most ia64, ppc64 and x86 users with lots of address space |
| 331 | a value of 65536 is reasonable and should cause no problems. |
| 332 | On arm and other archs it should not be higher than 32768. |
Eric Paris | 788084a | 2009-07-31 12:54:11 -0400 | [diff] [blame] | 333 | Programs which use vm86 functionality or have some need to map |
| 334 | this low address space will need CAP_SYS_RAWIO or disable this |
| 335 | protection by setting the value to 0. |
Christoph Lameter | e0a94c2 | 2009-06-03 16:04:31 -0400 | [diff] [blame] | 336 | |
| 337 | This value can be changed after boot using the |
| 338 | /proc/sys/vm/mmap_min_addr tunable. |
| 339 | |
Linus Torvalds | d949f36 | 2009-09-26 09:35:07 -0700 | [diff] [blame] | 340 | config ARCH_SUPPORTS_MEMORY_FAILURE |
| 341 | bool |
Christoph Lameter | e0a94c2 | 2009-06-03 16:04:31 -0400 | [diff] [blame] | 342 | |
Andi Kleen | 6a46079 | 2009-09-16 11:50:15 +0200 | [diff] [blame] | 343 | config MEMORY_FAILURE |
| 344 | depends on MMU |
Linus Torvalds | d949f36 | 2009-09-26 09:35:07 -0700 | [diff] [blame] | 345 | depends on ARCH_SUPPORTS_MEMORY_FAILURE |
Andi Kleen | 6a46079 | 2009-09-16 11:50:15 +0200 | [diff] [blame] | 346 | bool "Enable recovery from hardware memory errors" |
Minchan Kim | ee6f509 | 2012-07-31 16:43:50 -0700 | [diff] [blame] | 347 | select MEMORY_ISOLATION |
Xie XiuQi | 97f0b13 | 2015-06-24 16:57:36 -0700 | [diff] [blame] | 348 | select RAS |
Andi Kleen | 6a46079 | 2009-09-16 11:50:15 +0200 | [diff] [blame] | 349 | help |
| 350 | Enables code to recover from some memory failures on systems |
| 351 | with MCA recovery. This allows a system to continue running |
| 352 | even when some of its memory has uncorrected errors. This requires |
| 353 | special hardware support and typically ECC memory. |
| 354 | |
Andi Kleen | cae681f | 2009-09-16 11:50:17 +0200 | [diff] [blame] | 355 | config HWPOISON_INJECT |
Andi Kleen | 413f9ef | 2009-12-16 12:20:00 +0100 | [diff] [blame] | 356 | tristate "HWPoison pages injector" |
Andi Kleen | 27df506 | 2009-12-21 19:56:42 +0100 | [diff] [blame] | 357 | depends on MEMORY_FAILURE && DEBUG_KERNEL && PROC_FS |
Wu Fengguang | 478c5ff | 2009-12-16 12:19:59 +0100 | [diff] [blame] | 358 | select PROC_PAGE_MONITOR |
Andi Kleen | cae681f | 2009-09-16 11:50:17 +0200 | [diff] [blame] | 359 | |
David Howells | fc4d5c2 | 2009-05-06 16:03:05 -0700 | [diff] [blame] | 360 | config NOMMU_INITIAL_TRIM_EXCESS |
| 361 | int "Turn on mmap() excess space trimming before booting" |
| 362 | depends on !MMU |
| 363 | default 1 |
| 364 | help |
| 365 | The NOMMU mmap() frequently needs to allocate large contiguous chunks |
| 366 | of memory on which to store mappings, but it can only ask the system |
| 367 | allocator for chunks in 2^N*PAGE_SIZE amounts - which is frequently |
| 368 | more than it requires. To deal with this, mmap() is able to trim off |
| 369 | the excess and return it to the allocator. |
| 370 | |
| 371 | If trimming is enabled, the excess is trimmed off and returned to the |
| 372 | system allocator, which can cause extra fragmentation, particularly |
| 373 | if there are a lot of transient processes. |
| 374 | |
| 375 | If trimming is disabled, the excess is kept, but not used, which for |
| 376 | long-term mappings means that the space is wasted. |
| 377 | |
| 378 | Trimming can be dynamically controlled through a sysctl option |
| 379 | (/proc/sys/vm/nr_trim_pages) which specifies the minimum number of |
| 380 | excess pages there must be before trimming should occur, or zero if |
| 381 | no trimming is to occur. |
| 382 | |
| 383 | This option specifies the initial value of this option. The default |
| 384 | of 1 says that all excess pages should be trimmed. |
| 385 | |
Stephen Kitt | dd19d29 | 2020-08-12 11:22:30 +0200 | [diff] [blame] | 386 | See Documentation/admin-guide/mm/nommu-mmap.rst for more information. |
Tejun Heo | bbddff0 | 2010-09-03 18:22:48 +0200 | [diff] [blame] | 387 | |
Andrea Arcangeli | 4c76d9d | 2011-01-13 15:46:39 -0800 | [diff] [blame] | 388 | config TRANSPARENT_HUGEPAGE |
Andrea Arcangeli | 13ece88 | 2011-01-13 15:47:07 -0800 | [diff] [blame] | 389 | bool "Transparent Hugepage Support" |
Gerald Schaefer | 1562606 | 2012-10-08 16:30:04 -0700 | [diff] [blame] | 390 | depends on HAVE_ARCH_TRANSPARENT_HUGEPAGE |
Andrea Arcangeli | 5d68924 | 2011-01-13 15:47:07 -0800 | [diff] [blame] | 391 | select COMPACTION |
Matthew Wilcox | 3a08cd5 | 2018-09-22 16:14:30 -0400 | [diff] [blame] | 392 | select XARRAY_MULTI |
Andrea Arcangeli | 4c76d9d | 2011-01-13 15:46:39 -0800 | [diff] [blame] | 393 | help |
| 394 | Transparent Hugepages allows the kernel to use huge pages and |
| 395 | huge tlb transparently to the applications whenever possible. |
| 396 | This feature can improve computing performance to certain |
| 397 | applications by speeding up page faults during memory |
| 398 | allocation, by reducing the number of tlb misses and by speeding |
| 399 | up the pagetable walking. |
| 400 | |
| 401 | If memory constrained on embedded, you may want to say N. |
| 402 | |
Andrea Arcangeli | 13ece88 | 2011-01-13 15:47:07 -0800 | [diff] [blame] | 403 | choice |
| 404 | prompt "Transparent Hugepage Support sysfs defaults" |
| 405 | depends on TRANSPARENT_HUGEPAGE |
| 406 | default TRANSPARENT_HUGEPAGE_ALWAYS |
| 407 | help |
| 408 | Selects the sysfs defaults for Transparent Hugepage Support. |
| 409 | |
| 410 | config TRANSPARENT_HUGEPAGE_ALWAYS |
| 411 | bool "always" |
| 412 | help |
| 413 | Enabling Transparent Hugepage always, can increase the |
| 414 | memory footprint of applications without a guaranteed |
| 415 | benefit but it will work automatically for all applications. |
| 416 | |
| 417 | config TRANSPARENT_HUGEPAGE_MADVISE |
| 418 | bool "madvise" |
| 419 | help |
| 420 | Enabling Transparent Hugepage madvise, will only provide a |
| 421 | performance improvement benefit to the applications using |
| 422 | madvise(MADV_HUGEPAGE) but it won't risk to increase the |
| 423 | memory footprint of applications without a guaranteed |
| 424 | benefit. |
| 425 | endchoice |
| 426 | |
Huang Ying | 38d8b4e | 2017-07-06 15:37:18 -0700 | [diff] [blame] | 427 | config ARCH_WANTS_THP_SWAP |
Krzysztof Kozlowski | 19fa40a | 2019-11-30 17:58:23 -0800 | [diff] [blame] | 428 | def_bool n |
Huang Ying | 38d8b4e | 2017-07-06 15:37:18 -0700 | [diff] [blame] | 429 | |
| 430 | config THP_SWAP |
| 431 | def_bool y |
Huang Ying | 14fef28 | 2018-08-17 15:49:41 -0700 | [diff] [blame] | 432 | depends on TRANSPARENT_HUGEPAGE && ARCH_WANTS_THP_SWAP && SWAP |
Huang Ying | 38d8b4e | 2017-07-06 15:37:18 -0700 | [diff] [blame] | 433 | help |
| 434 | Swap transparent huge pages in one piece, without splitting. |
Huang Ying | 14fef28 | 2018-08-17 15:49:41 -0700 | [diff] [blame] | 435 | XXX: For now, swap cluster backing transparent huge page |
| 436 | will be split after swapout. |
Huang Ying | 38d8b4e | 2017-07-06 15:37:18 -0700 | [diff] [blame] | 437 | |
| 438 | For selection by architectures with reasonable THP sizes. |
| 439 | |
Kirill A. Shutemov | e496cf3 | 2016-07-26 15:26:35 -0700 | [diff] [blame] | 440 | # |
Tejun Heo | bbddff0 | 2010-09-03 18:22:48 +0200 | [diff] [blame] | 441 | # UP and nommu archs use km based percpu allocator |
| 442 | # |
| 443 | config NEED_PER_CPU_KM |
| 444 | depends on !SMP |
| 445 | bool |
| 446 | default y |
Dan Magenheimer | 077b1f8 | 2011-05-26 10:01:36 -0600 | [diff] [blame] | 447 | |
| 448 | config CLEANCACHE |
| 449 | bool "Enable cleancache driver to cache clean pages if tmem is present" |
Dan Magenheimer | 077b1f8 | 2011-05-26 10:01:36 -0600 | [diff] [blame] | 450 | help |
| 451 | Cleancache can be thought of as a page-granularity victim cache |
| 452 | for clean pages that the kernel's pageframe replacement algorithm |
| 453 | (PFRA) would like to keep around, but can't since there isn't enough |
| 454 | memory. So when the PFRA "evicts" a page, it first attempts to use |
Michael Witten | 140a1ef | 2011-06-10 03:57:26 +0000 | [diff] [blame] | 455 | cleancache code to put the data contained in that page into |
Dan Magenheimer | 077b1f8 | 2011-05-26 10:01:36 -0600 | [diff] [blame] | 456 | "transcendent memory", memory that is not directly accessible or |
| 457 | addressable by the kernel and is of unknown and possibly |
| 458 | time-varying size. And when a cleancache-enabled |
| 459 | filesystem wishes to access a page in a file on disk, it first |
| 460 | checks cleancache to see if it already contains it; if it does, |
| 461 | the page is copied into the kernel and a disk access is avoided. |
| 462 | When a transcendent memory driver is available (such as zcache or |
| 463 | Xen transcendent memory), a significant I/O reduction |
| 464 | may be achieved. When none is available, all cleancache calls |
| 465 | are reduced to a single pointer-compare-against-NULL resulting |
| 466 | in a negligible performance hit. |
| 467 | |
| 468 | If unsure, say Y to enable cleancache |
Dan Magenheimer | 27c6aec | 2012-04-09 17:10:34 -0600 | [diff] [blame] | 469 | |
| 470 | config FRONTSWAP |
| 471 | bool "Enable frontswap to cache swap pages if tmem is present" |
| 472 | depends on SWAP |
Dan Magenheimer | 27c6aec | 2012-04-09 17:10:34 -0600 | [diff] [blame] | 473 | help |
| 474 | Frontswap is so named because it can be thought of as the opposite |
| 475 | of a "backing" store for a swap device. The data is stored into |
| 476 | "transcendent memory", memory that is not directly accessible or |
| 477 | addressable by the kernel and is of unknown and possibly |
| 478 | time-varying size. When space in transcendent memory is available, |
| 479 | a significant swap I/O reduction may be achieved. When none is |
| 480 | available, all frontswap calls are reduced to a single pointer- |
| 481 | compare-against-NULL resulting in a negligible performance hit |
| 482 | and swap data is stored as normal on the matching swap device. |
| 483 | |
| 484 | If unsure, say Y to enable frontswap. |
Aneesh Kumar K.V | f825c73 | 2013-07-02 11:15:15 +0530 | [diff] [blame] | 485 | |
| 486 | config CMA |
| 487 | bool "Contiguous Memory Allocator" |
Mike Rapoport | aca52c3 | 2018-10-30 15:07:44 -0700 | [diff] [blame] | 488 | depends on MMU |
Aneesh Kumar K.V | f825c73 | 2013-07-02 11:15:15 +0530 | [diff] [blame] | 489 | select MIGRATION |
| 490 | select MEMORY_ISOLATION |
| 491 | help |
| 492 | This enables the Contiguous Memory Allocator which allows other |
| 493 | subsystems to allocate big physically-contiguous blocks of memory. |
| 494 | CMA reserves a region of memory and allows only movable pages to |
| 495 | be allocated from it. This way, the kernel can use the memory for |
| 496 | pagecache and when a subsystem requests for contiguous area, the |
| 497 | allocated pages are migrated away to serve the contiguous request. |
| 498 | |
| 499 | If unsure, say "n". |
| 500 | |
| 501 | config CMA_DEBUG |
| 502 | bool "CMA debug messages (DEVELOPMENT)" |
| 503 | depends on DEBUG_KERNEL && CMA |
| 504 | help |
| 505 | Turns on debug messages in CMA. This produces KERN_DEBUG |
| 506 | messages for every CMA call as well as various messages while |
| 507 | processing calls such as dma_alloc_from_contiguous(). |
| 508 | This option does not affect warning and error messages. |
Alexander Graf | bf550fc | 2013-08-29 00:41:59 +0200 | [diff] [blame] | 509 | |
Sasha Levin | 28b24c1 | 2015-04-14 15:44:57 -0700 | [diff] [blame] | 510 | config CMA_DEBUGFS |
| 511 | bool "CMA debugfs interface" |
| 512 | depends on CMA && DEBUG_FS |
| 513 | help |
| 514 | Turns on the DebugFS interface for CMA. |
| 515 | |
Minchan Kim | a590359 | 2021-03-24 16:10:53 -0700 | [diff] [blame] | 516 | config CMA_SYSFS |
| 517 | bool "CMA information through sysfs interface" |
| 518 | depends on CMA && SYSFS |
| 519 | help |
| 520 | This option exposes some sysfs attributes to get information |
| 521 | from CMA. |
| 522 | |
Joonsoo Kim | a254129 | 2014-08-06 16:05:25 -0700 | [diff] [blame] | 523 | config CMA_AREAS |
| 524 | int "Maximum count of the CMA areas" |
| 525 | depends on CMA |
Barry Song | b7176c2 | 2020-08-24 11:03:07 +1200 | [diff] [blame] | 526 | default 19 if NUMA |
Joonsoo Kim | a254129 | 2014-08-06 16:05:25 -0700 | [diff] [blame] | 527 | default 7 |
| 528 | help |
| 529 | CMA allows to create CMA areas for particular purpose, mainly, |
| 530 | used as device private area. This parameter sets the maximum |
| 531 | number of CMA area in the system. |
| 532 | |
Barry Song | b7176c2 | 2020-08-24 11:03:07 +1200 | [diff] [blame] | 533 | If unsure, leave the default value "7" in UMA and "19" in NUMA. |
Joonsoo Kim | a254129 | 2014-08-06 16:05:25 -0700 | [diff] [blame] | 534 | |
Dan Streetman | af8d417 | 2014-08-06 16:08:36 -0700 | [diff] [blame] | 535 | config MEM_SOFT_DIRTY |
| 536 | bool "Track memory changes" |
| 537 | depends on CHECKPOINT_RESTORE && HAVE_ARCH_SOFT_DIRTY && PROC_FS |
| 538 | select PROC_PAGE_MONITOR |
Seth Jennings | 4e2e277 | 2013-07-10 16:04:55 -0700 | [diff] [blame] | 539 | help |
Dan Streetman | af8d417 | 2014-08-06 16:08:36 -0700 | [diff] [blame] | 540 | This option enables memory changes tracking by introducing a |
| 541 | soft-dirty bit on pte-s. This bit it set when someone writes |
| 542 | into a page just as regular dirty bit, but unlike the latter |
| 543 | it can be cleared by hands. |
| 544 | |
Mike Rapoport | 1ad1335 | 2018-04-18 11:07:49 +0300 | [diff] [blame] | 545 | See Documentation/admin-guide/mm/soft-dirty.rst for more details. |
Seth Jennings | 4e2e277 | 2013-07-10 16:04:55 -0700 | [diff] [blame] | 546 | |
Seth Jennings | 2b28111 | 2013-07-10 16:05:03 -0700 | [diff] [blame] | 547 | config ZSWAP |
| 548 | bool "Compressed cache for swap pages (EXPERIMENTAL)" |
| 549 | depends on FRONTSWAP && CRYPTO=y |
Dan Streetman | 12d79d6 | 2014-08-06 16:08:40 -0700 | [diff] [blame] | 550 | select ZPOOL |
Seth Jennings | 2b28111 | 2013-07-10 16:05:03 -0700 | [diff] [blame] | 551 | help |
| 552 | A lightweight compressed cache for swap pages. It takes |
| 553 | pages that are in the process of being swapped out and attempts to |
| 554 | compress them into a dynamically allocated RAM-based memory pool. |
| 555 | This can result in a significant I/O reduction on swap device and, |
| 556 | in the case where decompressing from RAM is faster that swap device |
| 557 | reads, can also improve workload performance. |
| 558 | |
| 559 | This is marked experimental because it is a new feature (as of |
| 560 | v3.11) that interacts heavily with memory reclaim. While these |
| 561 | interactions don't cause any known issues on simple memory setups, |
| 562 | they have not be fully explored on the large set of potential |
| 563 | configurations and workloads that exist. |
| 564 | |
Maciej S. Szmigiero | bb8b93b | 2020-04-06 20:08:03 -0700 | [diff] [blame] | 565 | choice |
| 566 | prompt "Compressed cache for swap pages default compressor" |
| 567 | depends on ZSWAP |
| 568 | default ZSWAP_COMPRESSOR_DEFAULT_LZO |
| 569 | help |
| 570 | Selects the default compression algorithm for the compressed cache |
| 571 | for swap pages. |
| 572 | |
| 573 | For an overview what kind of performance can be expected from |
| 574 | a particular compression algorithm please refer to the benchmarks |
| 575 | available at the following LWN page: |
| 576 | https://lwn.net/Articles/751795/ |
| 577 | |
| 578 | If in doubt, select 'LZO'. |
| 579 | |
| 580 | The selection made here can be overridden by using the kernel |
| 581 | command line 'zswap.compressor=' option. |
| 582 | |
| 583 | config ZSWAP_COMPRESSOR_DEFAULT_DEFLATE |
| 584 | bool "Deflate" |
| 585 | select CRYPTO_DEFLATE |
| 586 | help |
| 587 | Use the Deflate algorithm as the default compression algorithm. |
| 588 | |
| 589 | config ZSWAP_COMPRESSOR_DEFAULT_LZO |
| 590 | bool "LZO" |
| 591 | select CRYPTO_LZO |
| 592 | help |
| 593 | Use the LZO algorithm as the default compression algorithm. |
| 594 | |
| 595 | config ZSWAP_COMPRESSOR_DEFAULT_842 |
| 596 | bool "842" |
| 597 | select CRYPTO_842 |
| 598 | help |
| 599 | Use the 842 algorithm as the default compression algorithm. |
| 600 | |
| 601 | config ZSWAP_COMPRESSOR_DEFAULT_LZ4 |
| 602 | bool "LZ4" |
| 603 | select CRYPTO_LZ4 |
| 604 | help |
| 605 | Use the LZ4 algorithm as the default compression algorithm. |
| 606 | |
| 607 | config ZSWAP_COMPRESSOR_DEFAULT_LZ4HC |
| 608 | bool "LZ4HC" |
| 609 | select CRYPTO_LZ4HC |
| 610 | help |
| 611 | Use the LZ4HC algorithm as the default compression algorithm. |
| 612 | |
| 613 | config ZSWAP_COMPRESSOR_DEFAULT_ZSTD |
| 614 | bool "zstd" |
| 615 | select CRYPTO_ZSTD |
| 616 | help |
| 617 | Use the zstd algorithm as the default compression algorithm. |
| 618 | endchoice |
| 619 | |
| 620 | config ZSWAP_COMPRESSOR_DEFAULT |
| 621 | string |
| 622 | depends on ZSWAP |
| 623 | default "deflate" if ZSWAP_COMPRESSOR_DEFAULT_DEFLATE |
| 624 | default "lzo" if ZSWAP_COMPRESSOR_DEFAULT_LZO |
| 625 | default "842" if ZSWAP_COMPRESSOR_DEFAULT_842 |
| 626 | default "lz4" if ZSWAP_COMPRESSOR_DEFAULT_LZ4 |
| 627 | default "lz4hc" if ZSWAP_COMPRESSOR_DEFAULT_LZ4HC |
| 628 | default "zstd" if ZSWAP_COMPRESSOR_DEFAULT_ZSTD |
| 629 | default "" |
| 630 | |
| 631 | choice |
| 632 | prompt "Compressed cache for swap pages default allocator" |
| 633 | depends on ZSWAP |
| 634 | default ZSWAP_ZPOOL_DEFAULT_ZBUD |
| 635 | help |
| 636 | Selects the default allocator for the compressed cache for |
| 637 | swap pages. |
| 638 | The default is 'zbud' for compatibility, however please do |
| 639 | read the description of each of the allocators below before |
| 640 | making a right choice. |
| 641 | |
| 642 | The selection made here can be overridden by using the kernel |
| 643 | command line 'zswap.zpool=' option. |
| 644 | |
| 645 | config ZSWAP_ZPOOL_DEFAULT_ZBUD |
| 646 | bool "zbud" |
| 647 | select ZBUD |
| 648 | help |
| 649 | Use the zbud allocator as the default allocator. |
| 650 | |
| 651 | config ZSWAP_ZPOOL_DEFAULT_Z3FOLD |
| 652 | bool "z3fold" |
| 653 | select Z3FOLD |
| 654 | help |
| 655 | Use the z3fold allocator as the default allocator. |
| 656 | |
| 657 | config ZSWAP_ZPOOL_DEFAULT_ZSMALLOC |
| 658 | bool "zsmalloc" |
| 659 | select ZSMALLOC |
| 660 | help |
| 661 | Use the zsmalloc allocator as the default allocator. |
| 662 | endchoice |
| 663 | |
| 664 | config ZSWAP_ZPOOL_DEFAULT |
| 665 | string |
| 666 | depends on ZSWAP |
| 667 | default "zbud" if ZSWAP_ZPOOL_DEFAULT_ZBUD |
| 668 | default "z3fold" if ZSWAP_ZPOOL_DEFAULT_Z3FOLD |
| 669 | default "zsmalloc" if ZSWAP_ZPOOL_DEFAULT_ZSMALLOC |
| 670 | default "" |
| 671 | |
| 672 | config ZSWAP_DEFAULT_ON |
| 673 | bool "Enable the compressed cache for swap pages by default" |
| 674 | depends on ZSWAP |
| 675 | help |
| 676 | If selected, the compressed cache for swap pages will be enabled |
| 677 | at boot, otherwise it will be disabled. |
| 678 | |
| 679 | The selection made here can be overridden by using the kernel |
| 680 | command line 'zswap.enabled=' option. |
| 681 | |
Dan Streetman | af8d417 | 2014-08-06 16:08:36 -0700 | [diff] [blame] | 682 | config ZPOOL |
| 683 | tristate "Common API for compressed memory storage" |
Pavel Emelyanov | 0f8975e | 2013-07-03 15:01:20 -0700 | [diff] [blame] | 684 | help |
Dan Streetman | af8d417 | 2014-08-06 16:08:36 -0700 | [diff] [blame] | 685 | Compressed memory storage API. This allows using either zbud or |
| 686 | zsmalloc. |
Pavel Emelyanov | 0f8975e | 2013-07-03 15:01:20 -0700 | [diff] [blame] | 687 | |
Dan Streetman | af8d417 | 2014-08-06 16:08:36 -0700 | [diff] [blame] | 688 | config ZBUD |
Vitaly Wool | 9a001fc | 2016-05-20 16:58:30 -0700 | [diff] [blame] | 689 | tristate "Low (Up to 2x) density storage for compressed pages" |
Dan Streetman | af8d417 | 2014-08-06 16:08:36 -0700 | [diff] [blame] | 690 | help |
| 691 | A special purpose allocator for storing compressed pages. |
| 692 | It is designed to store up to two compressed pages per physical |
| 693 | page. While this design limits storage density, it has simple and |
| 694 | deterministic reclaim properties that make it preferable to a higher |
| 695 | density approach when reclaim will be used. |
Minchan Kim | bcf1647 | 2014-01-30 15:45:50 -0800 | [diff] [blame] | 696 | |
Vitaly Wool | 9a001fc | 2016-05-20 16:58:30 -0700 | [diff] [blame] | 697 | config Z3FOLD |
| 698 | tristate "Up to 3x density storage for compressed pages" |
| 699 | depends on ZPOOL |
Vitaly Wool | 9a001fc | 2016-05-20 16:58:30 -0700 | [diff] [blame] | 700 | help |
| 701 | A special purpose allocator for storing compressed pages. |
| 702 | It is designed to store up to three compressed pages per physical |
| 703 | page. It is a ZBUD derivative so the simplicity and determinism are |
| 704 | still there. |
| 705 | |
Minchan Kim | bcf1647 | 2014-01-30 15:45:50 -0800 | [diff] [blame] | 706 | config ZSMALLOC |
Minchan Kim | d867f20 | 2014-06-04 16:11:10 -0700 | [diff] [blame] | 707 | tristate "Memory allocator for compressed pages" |
Minchan Kim | bcf1647 | 2014-01-30 15:45:50 -0800 | [diff] [blame] | 708 | depends on MMU |
Minchan Kim | bcf1647 | 2014-01-30 15:45:50 -0800 | [diff] [blame] | 709 | help |
| 710 | zsmalloc is a slab-based memory allocator designed to store |
| 711 | compressed RAM pages. zsmalloc uses virtual memory mapping |
| 712 | in order to reduce fragmentation. However, this results in a |
| 713 | non-standard allocator interface where a handle, not a pointer, is |
| 714 | returned by an alloc(). This handle must be mapped in order to |
| 715 | access the allocated space. |
| 716 | |
Ganesh Mahendran | 0f050d9 | 2015-02-12 15:00:54 -0800 | [diff] [blame] | 717 | config ZSMALLOC_STAT |
| 718 | bool "Export zsmalloc statistics" |
| 719 | depends on ZSMALLOC |
| 720 | select DEBUG_FS |
| 721 | help |
| 722 | This option enables code in the zsmalloc to collect various |
| 723 | statistics about whats happening in zsmalloc and exports that |
| 724 | information to userspace via debugfs. |
| 725 | If unsure, say N. |
| 726 | |
Mark Salter | 9e5c33d | 2014-04-07 15:39:48 -0700 | [diff] [blame] | 727 | config GENERIC_EARLY_IOREMAP |
| 728 | bool |
Helge Deller | 042d27a | 2014-04-30 23:26:02 +0200 | [diff] [blame] | 729 | |
| 730 | config MAX_STACK_SIZE_MB |
| 731 | int "Maximum user stack size for 32-bit processes (MB)" |
| 732 | default 80 |
Helge Deller | 042d27a | 2014-04-30 23:26:02 +0200 | [diff] [blame] | 733 | range 8 2048 |
| 734 | depends on STACK_GROWSUP && (!64BIT || COMPAT) |
| 735 | help |
| 736 | This is the maximum stack size in Megabytes in the VM layout of 32-bit |
| 737 | user processes when the stack grows upwards (currently only on parisc |
James Hogan | 5f17157 | 2017-10-24 16:52:32 +0100 | [diff] [blame] | 738 | arch). The stack will be located at the highest memory address minus |
| 739 | the given value, unless the RLIMIT_STACK hard limit is changed to a |
| 740 | smaller value in which case that is used. |
Helge Deller | 042d27a | 2014-04-30 23:26:02 +0200 | [diff] [blame] | 741 | |
| 742 | A sane initial value is 80 MB. |
Mel Gorman | 3a80a7f | 2015-06-30 14:57:02 -0700 | [diff] [blame] | 743 | |
Mel Gorman | 3a80a7f | 2015-06-30 14:57:02 -0700 | [diff] [blame] | 744 | config DEFERRED_STRUCT_PAGE_INIT |
Vlastimil Babka | 1ce2210 | 2016-02-05 15:36:21 -0800 | [diff] [blame] | 745 | bool "Defer initialisation of struct pages to kthreads" |
Mike Rapoport | d39f8fb | 2018-08-17 15:47:07 -0700 | [diff] [blame] | 746 | depends on SPARSEMEM |
Pavel Tatashin | ab1e8d8 | 2018-05-18 16:09:13 -0700 | [diff] [blame] | 747 | depends on !NEED_PER_CPU_KM |
Pasha Tatashin | 889c695 | 2018-09-20 12:22:30 -0700 | [diff] [blame] | 748 | depends on 64BIT |
Daniel Jordan | e444314 | 2020-06-03 15:59:51 -0700 | [diff] [blame] | 749 | select PADATA |
Mel Gorman | 3a80a7f | 2015-06-30 14:57:02 -0700 | [diff] [blame] | 750 | help |
| 751 | Ordinarily all struct pages are initialised during early boot in a |
| 752 | single thread. On very large machines this can take a considerable |
| 753 | amount of time. If this option is set, large machines will bring up |
Daniel Jordan | e444314 | 2020-06-03 15:59:51 -0700 | [diff] [blame] | 754 | a subset of memmap at boot and then initialise the rest in parallel. |
| 755 | This has a potential performance impact on tasks running early in the |
Vlastimil Babka | 1ce2210 | 2016-02-05 15:36:21 -0800 | [diff] [blame] | 756 | lifetime of the system until these kthreads finish the |
| 757 | initialisation. |
Dan Williams | 033fbae | 2015-08-09 15:29:06 -0400 | [diff] [blame] | 758 | |
SeongJae Park | 75e13ba | 2021-09-07 19:56:40 -0700 | [diff] [blame] | 759 | config PAGE_IDLE_FLAG |
| 760 | bool |
| 761 | select PAGE_EXTENSION if !64BIT |
| 762 | help |
| 763 | This adds PG_idle and PG_young flags to 'struct page'. PTE Accessed |
| 764 | bit writers can set the state of the bit in the flags so that PTE |
| 765 | Accessed bit readers may avoid disturbance. |
| 766 | |
Vladimir Davydov | 33c3fc7 | 2015-09-09 15:35:45 -0700 | [diff] [blame] | 767 | config IDLE_PAGE_TRACKING |
| 768 | bool "Enable idle page tracking" |
| 769 | depends on SYSFS && MMU |
SeongJae Park | 75e13ba | 2021-09-07 19:56:40 -0700 | [diff] [blame] | 770 | select PAGE_IDLE_FLAG |
Vladimir Davydov | 33c3fc7 | 2015-09-09 15:35:45 -0700 | [diff] [blame] | 771 | help |
| 772 | This feature allows to estimate the amount of user pages that have |
| 773 | not been touched during a given period of time. This information can |
| 774 | be useful to tune memory cgroup limits and/or for job placement |
| 775 | within a compute cluster. |
| 776 | |
Mike Rapoport | 1ad1335 | 2018-04-18 11:07:49 +0300 | [diff] [blame] | 777 | See Documentation/admin-guide/mm/idle_page_tracking.rst for |
| 778 | more details. |
Vladimir Davydov | 33c3fc7 | 2015-09-09 15:35:45 -0700 | [diff] [blame] | 779 | |
Robin Murphy | 1759673 | 2019-07-16 16:30:47 -0700 | [diff] [blame] | 780 | config ARCH_HAS_PTE_DEVMAP |
Oliver O'Halloran | 65f7d04 | 2017-06-28 11:32:31 +1000 | [diff] [blame] | 781 | bool |
| 782 | |
Dan Williams | 033fbae | 2015-08-09 15:29:06 -0400 | [diff] [blame] | 783 | config ZONE_DEVICE |
Jérôme Glisse | 5042db4 | 2017-09-08 16:11:43 -0700 | [diff] [blame] | 784 | bool "Device memory (pmem, HMM, etc...) hotplug support" |
Dan Williams | 033fbae | 2015-08-09 15:29:06 -0400 | [diff] [blame] | 785 | depends on MEMORY_HOTPLUG |
| 786 | depends on MEMORY_HOTREMOVE |
Dan Williams | 99490f1 | 2016-03-17 14:19:58 -0700 | [diff] [blame] | 787 | depends on SPARSEMEM_VMEMMAP |
Robin Murphy | 1759673 | 2019-07-16 16:30:47 -0700 | [diff] [blame] | 788 | depends on ARCH_HAS_PTE_DEVMAP |
Matthew Wilcox | 3a08cd5 | 2018-09-22 16:14:30 -0400 | [diff] [blame] | 789 | select XARRAY_MULTI |
Dan Williams | 033fbae | 2015-08-09 15:29:06 -0400 | [diff] [blame] | 790 | |
| 791 | help |
| 792 | Device memory hotplug support allows for establishing pmem, |
| 793 | or other device driver discovered memory regions, in the |
| 794 | memmap. This allows pfn_to_page() lookups of otherwise |
| 795 | "device-physical" addresses which is needed for using a DAX |
| 796 | mapping in an O_DIRECT operation, among other things. |
| 797 | |
| 798 | If FS_DAX is enabled, then say Y. |
Linus Torvalds | 06a660a | 2015-09-11 16:42:39 -0700 | [diff] [blame] | 799 | |
Dan Williams | e7638488 | 2018-05-16 11:46:08 -0700 | [diff] [blame] | 800 | config DEV_PAGEMAP_OPS |
| 801 | bool |
| 802 | |
Christoph Hellwig | 9c240a7 | 2019-08-06 19:05:52 +0300 | [diff] [blame] | 803 | # |
| 804 | # Helpers to mirror range of the CPU page tables of a process into device page |
| 805 | # tables. |
| 806 | # |
Jérôme Glisse | c0b1240 | 2017-09-08 16:11:27 -0700 | [diff] [blame] | 807 | config HMM_MIRROR |
Christoph Hellwig | 9c240a7 | 2019-08-06 19:05:52 +0300 | [diff] [blame] | 808 | bool |
Christoph Hellwig | f442c28 | 2019-08-06 19:05:51 +0300 | [diff] [blame] | 809 | depends on MMU |
Jérôme Glisse | c0b1240 | 2017-09-08 16:11:27 -0700 | [diff] [blame] | 810 | |
Jérôme Glisse | 5042db4 | 2017-09-08 16:11:43 -0700 | [diff] [blame] | 811 | config DEVICE_PRIVATE |
| 812 | bool "Unaddressable device memory (GPU memory, ...)" |
Christoph Hellwig | 7328d9c | 2019-06-26 14:27:22 +0200 | [diff] [blame] | 813 | depends on ZONE_DEVICE |
Dan Williams | e7638488 | 2018-05-16 11:46:08 -0700 | [diff] [blame] | 814 | select DEV_PAGEMAP_OPS |
Jérôme Glisse | 5042db4 | 2017-09-08 16:11:43 -0700 | [diff] [blame] | 815 | |
| 816 | help |
| 817 | Allows creation of struct pages to represent unaddressable device |
| 818 | memory; i.e., memory that is only accessible from the device (or |
| 819 | group of devices). You likely also want to select HMM_MIRROR. |
| 820 | |
Christoph Hellwig | 3e9a9e2 | 2020-10-17 16:15:10 -0700 | [diff] [blame] | 821 | config VMAP_PFN |
| 822 | bool |
| 823 | |
Jan Kara | 8025e5d | 2015-07-13 11:55:44 -0300 | [diff] [blame] | 824 | config FRAME_VECTOR |
| 825 | bool |
Dave Hansen | 63c17fb | 2016-02-12 13:02:08 -0800 | [diff] [blame] | 826 | |
| 827 | config ARCH_USES_HIGH_VMA_FLAGS |
| 828 | bool |
Dave Hansen | 66d3757 | 2016-02-12 13:02:32 -0800 | [diff] [blame] | 829 | config ARCH_HAS_PKEYS |
| 830 | bool |
Dennis Zhou | 30a5b53 | 2017-06-19 19:28:31 -0400 | [diff] [blame] | 831 | |
| 832 | config PERCPU_STATS |
| 833 | bool "Collect percpu memory statistics" |
Dennis Zhou | 30a5b53 | 2017-06-19 19:28:31 -0400 | [diff] [blame] | 834 | help |
| 835 | This feature collects and exposes statistics via debugfs. The |
| 836 | information includes global and per chunk statistics, which can |
| 837 | be used to help understand percpu memory usage. |
Kirill A. Shutemov | 64c349f | 2017-11-17 15:31:22 -0800 | [diff] [blame] | 838 | |
Laurent Dufour | 1ca86fa | 2018-04-17 16:33:07 +0200 | [diff] [blame] | 839 | config ARCH_SUPPORTS_SPECULATIVE_PAGE_FAULT |
| 840 | def_bool n |
| 841 | |
| 842 | config SPECULATIVE_PAGE_FAULT |
| 843 | bool "Speculative page faults" |
| 844 | default y |
| 845 | depends on ARCH_SUPPORTS_SPECULATIVE_PAGE_FAULT |
| 846 | depends on MMU && SMP |
| 847 | help |
| 848 | Try to handle user space page faults without holding the mmap_sem. |
| 849 | |
| 850 | This should allow better concurrency for massively threaded process |
| 851 | since the page fault handler will not wait for other threads memory |
| 852 | layout change to be done, assuming that this change is done in another |
| 853 | part of the process's memory space. This type of page fault is named |
| 854 | speculative page fault. |
| 855 | |
| 856 | If the speculative page fault fails because of a concurrency is |
| 857 | detected or because underlying PMD or PTE tables are not yet |
| 858 | allocating, it is failing its processing and a classic page fault |
| 859 | is then tried. |
| 860 | |
Kirill A. Shutemov | 64c349f | 2017-11-17 15:31:22 -0800 | [diff] [blame] | 861 | config GUP_BENCHMARK |
Barry Song | 4c6cd03 | 2020-10-13 16:51:51 -0700 | [diff] [blame] | 862 | bool "Enable infrastructure for get_user_pages() and related calls benchmarking" |
Kirill A. Shutemov | 64c349f | 2017-11-17 15:31:22 -0800 | [diff] [blame] | 863 | help |
| 864 | Provides /sys/kernel/debug/gup_benchmark that helps with testing |
Barry Song | 4c6cd03 | 2020-10-13 16:51:51 -0700 | [diff] [blame] | 865 | performance of get_user_pages() and related calls. |
Kirill A. Shutemov | 64c349f | 2017-11-17 15:31:22 -0800 | [diff] [blame] | 866 | |
| 867 | See tools/testing/selftests/vm/gup_benchmark.c |
Laurent Dufour | 3010a5e | 2018-06-07 17:06:08 -0700 | [diff] [blame] | 868 | |
Christoph Hellwig | 39656e8 | 2019-07-11 20:56:49 -0700 | [diff] [blame] | 869 | config GUP_GET_PTE_LOW_HIGH |
| 870 | bool |
| 871 | |
Song Liu | 99cb0db | 2019-09-23 15:38:00 -0700 | [diff] [blame] | 872 | config READ_ONLY_THP_FOR_FS |
| 873 | bool "Read-only THP for filesystems (EXPERIMENTAL)" |
Matthew Wilcox (Oracle) | 396bcc5 | 2020-04-06 20:04:35 -0700 | [diff] [blame] | 874 | depends on TRANSPARENT_HUGEPAGE && SHMEM |
Song Liu | 99cb0db | 2019-09-23 15:38:00 -0700 | [diff] [blame] | 875 | |
| 876 | help |
| 877 | Allow khugepaged to put read-only file-backed pages in THP. |
| 878 | |
| 879 | This is marked experimental because it is a new feature. Write |
| 880 | support of file THPs will be developed in the next few release |
| 881 | cycles. |
| 882 | |
Laurent Dufour | 3010a5e | 2018-06-07 17:06:08 -0700 | [diff] [blame] | 883 | config ARCH_HAS_PTE_SPECIAL |
| 884 | bool |
Christoph Hellwig | 59e0b52 | 2018-07-31 13:39:35 +0200 | [diff] [blame] | 885 | |
Christoph Hellwig | cbd34da | 2019-07-11 20:57:28 -0700 | [diff] [blame] | 886 | # |
| 887 | # Some architectures require a special hugepage directory format that is |
| 888 | # required to support multiple hugepage sizes. For example a4fe3ce76 |
| 889 | # "powerpc/mm: Allow more flexible layouts for hugepage pagetables" |
| 890 | # introduced it on powerpc. This allows for a more flexible hugepage |
| 891 | # pagetable layouts. |
| 892 | # |
| 893 | config ARCH_HAS_HUGEPD |
| 894 | bool |
| 895 | |
Thomas Hellstrom | c5acad8 | 2019-03-19 13:12:30 +0100 | [diff] [blame] | 896 | config MAPPING_DIRTY_HELPERS |
| 897 | bool |
| 898 | |
SeongJae Park | bc19dd9 | 2021-09-07 19:56:28 -0700 | [diff] [blame] | 899 | source "mm/damon/Kconfig" |
| 900 | |
Christoph Hellwig | 59e0b52 | 2018-07-31 13:39:35 +0200 | [diff] [blame] | 901 | endmenu |