Blame - Documentation/sysctl/kernel.txt - SHIFTPHONES/kernel/common

blob: 57baff5bdb806b3b725bdc7859c385fbdc1cac54 [file] [log] [blame]

Linus Torvalds	1da177e	2005-04-16 15:20:36 -0700	[diff] [blame]	1	Documentation for /proc/sys/kernel/* kernel version 2.2.10
				2	(c) 1998, 1999, Rik van Riel <riel@nl.linux.org>
Shen Feng	760df93	2009-04-02 16:57:20 -0700	[diff] [blame]	3	(c) 2009, Shen Feng<shen@cn.fujitsu.com>
Linus Torvalds	1da177e	2005-04-16 15:20:36 -0700	[diff] [blame]	4
				5	For general info and legal blurb, please look in README.
				6
				7	==============================================================
				8
				9	This file contains documentation for the sysctl files in
				10	/proc/sys/kernel/ and is valid for Linux kernel version 2.2.
				11
				12	The files in this directory can be used to tune and monitor
				13	miscellaneous and general things in the operation of the Linux
				14	kernel. Since some of the files _can_ be used to screw up your
				15	system, it is advisable to read both documentation and source
				16	before actually making adjustments.
				17
				18	Currently, these files might (depending on your configuration)
				19	show up in /proc/sys/kernel:
Borislav Petkov	807094c	2011-07-23 10:39:29 -0700	[diff] [blame]	20
Linus Torvalds	1da177e	2005-04-16 15:20:36 -0700	[diff] [blame]	21	- acct
Borislav Petkov	807094c	2011-07-23 10:39:29 -0700	[diff] [blame]	22	- acpi_video_flags
				23	- auto_msgmni
H. Peter Anvin	d75757a	2009-12-11 14:23:44 -0800	[diff] [blame]	24	- bootloader_type [ X86 only ]
				25	- bootloader_version [ X86 only ]
Hans-Joachim Picht	c114728a	2009-09-11 10:28:47 +0200	[diff] [blame]	26	- callhome [ S390 only ]
Dan Ballard	73efc03	2011-10-31 17:11:20 -0700	[diff] [blame]	27	- cap_last_cap
Linus Torvalds	1da177e	2005-04-16 15:20:36 -0700	[diff] [blame]	28	- core_pattern
Neil Horman	a293980	2009-09-23 15:56:56 -0700	[diff] [blame]	29	- core_pipe_limit
Linus Torvalds	1da177e	2005-04-16 15:20:36 -0700	[diff] [blame]	30	- core_uses_pid
				31	- ctrl-alt-del
Dan Rosenberg	eaf06b2	2010-11-11 14:05:18 -0800	[diff] [blame]	32	- dmesg_restrict
Linus Torvalds	1da177e	2005-04-16 15:20:36 -0700	[diff] [blame]	33	- domainname
				34	- hostname
				35	- hotplug
Aaron Tomlin	270750db	2014-01-20 17:34:13 +0000	[diff] [blame]	36	- hung_task_panic
				37	- hung_task_check_count
				38	- hung_task_timeout_secs
				39	- hung_task_warnings
Kees Cook	7984754	2014-01-23 15:55:59 -0800	[diff] [blame]	40	- kexec_load_disabled
Dan Rosenberg	455cd5a	2011-01-12 16:59:41 -0800	[diff] [blame]	41	- kptr_restrict
Chuck Ebbert	0741f4d	2006-12-07 02:14:11 +0100	[diff] [blame]	42	- kstack_depth_to_print [ X86 only ]
Linus Torvalds	1da177e	2005-04-16 15:20:36 -0700	[diff] [blame]	43	- l2cr [ PPC only ]
Michael Opdenacker	ac76cff	2008-02-13 15:03:32 -0800	[diff] [blame]	44	- modprobe ==> Documentation/debugging-modules.txt
Kees Cook	3d43321	2009-04-02 15:49:29 -0700	[diff] [blame]	45	- modules_disabled
Stanislav Kinsbursky	03f5956	2013-01-04 15:34:50 -0800	[diff] [blame]	46	- msg_next_id [ sysv ipc ]
Linus Torvalds	1da177e	2005-04-16 15:20:36 -0700	[diff] [blame]	47	- msgmax
				48	- msgmnb
				49	- msgmni
Shen Feng	760df93	2009-04-02 16:57:20 -0700	[diff] [blame]	50	- nmi_watchdog
Linus Torvalds	1da177e	2005-04-16 15:20:36 -0700	[diff] [blame]	51	- osrelease
				52	- ostype
				53	- overflowgid
				54	- overflowuid
				55	- panic
Borislav Petkov	807094c	2011-07-23 10:39:29 -0700	[diff] [blame]	56	- panic_on_oops
				57	- panic_on_unrecovered_nmi
Mitsuo Hayasaka	55af779	2011-11-29 15:08:36 +0900	[diff] [blame]	58	- panic_on_stackoverflow
Linus Torvalds	1da177e	2005-04-16 15:20:36 -0700	[diff] [blame]	59	- pid_max
				60	- powersave-nap [ PPC only ]
				61	- printk
Borislav Petkov	807094c	2011-07-23 10:39:29 -0700	[diff] [blame]	62	- printk_delay
				63	- printk_ratelimit
				64	- printk_ratelimit_burst
Jiri Kosina	1ec7fd5	2008-02-09 23:24:08 +0100	[diff] [blame]	65	- randomize_va_space
Linus Torvalds	1da177e	2005-04-16 15:20:36 -0700	[diff] [blame]	66	- real-root-dev ==> Documentation/initrd.txt
				67	- reboot-cmd [ SPARC only ]
				68	- rtsig-max
				69	- rtsig-nr
				70	- sem
Stanislav Kinsbursky	03f5956	2013-01-04 15:34:50 -0800	[diff] [blame]	71	- sem_next_id [ sysv ipc ]
Linus Torvalds	1da177e	2005-04-16 15:20:36 -0700	[diff] [blame]	72	- sg-big-buff [ generic SCSI device (sg) ]
Stanislav Kinsbursky	03f5956	2013-01-04 15:34:50 -0800	[diff] [blame]	73	- shm_next_id [ sysv ipc ]
Vasiliy Kulikov	b34a6b1	2011-07-26 16:08:48 -0700	[diff] [blame]	74	- shm_rmid_forced
Linus Torvalds	1da177e	2005-04-16 15:20:36 -0700	[diff] [blame]	75	- shmall
				76	- shmmax [ sysv ipc ]
				77	- shmmni
Aaron Tomlin	ed23587	2014-06-23 13:22:05 -0700	[diff] [blame]	78	- softlockup_all_cpu_backtrace
Linus Torvalds	1da177e	2005-04-16 15:20:36 -0700	[diff] [blame]	79	- stop-a [ SPARC only ]
				80	- sysrq ==> Documentation/sysrq.txt
Kees Cook	f4aacea	2014-06-06 14:37:19 -0700	[diff] [blame]	81	- sysctl_writes_strict
Linus Torvalds	1da177e	2005-04-16 15:20:36 -0700	[diff] [blame]	82	- tainted
				83	- threads-max
Shen Feng	760df93	2009-04-02 16:57:20 -0700	[diff] [blame]	84	- unknown_nmi_panic
Li Zefan	08825c9	2013-05-17 10:31:20 +0800	[diff] [blame]	85	- watchdog_thresh
Linus Torvalds	1da177e	2005-04-16 15:20:36 -0700	[diff] [blame]	86	- version
				87
				88	==============================================================
				89
				90	acct:
				91
				92	highwater lowwater frequency
				93
				94	If BSD-style process accounting is enabled these values control
				95	its behaviour. If free space on filesystem where the log lives
				96	goes below <lowwater>% accounting suspends. If free space gets
				97	above <highwater>% accounting resumes. <Frequency> determines
				98	how often do we check the amount of free space (value is in
				99	seconds). Default:
				100	4 2 30
				101	That is, suspend accounting if there left <= 2% free; resume it
				102	if we got >=4%; consider information about amount of free space
				103	valid for 30 seconds.
				104
				105	==============================================================
				106
Borislav Petkov	807094c	2011-07-23 10:39:29 -0700	[diff] [blame]	107	acpi_video_flags:
				108
				109	flags
				110
				111	See Doc*/kernel/power/video.txt, it allows mode of video boot to be
				112	set during run time.
				113
				114	==============================================================
				115
				116	auto_msgmni:
				117
				118	Enables/Disables automatic recomputing of msgmni upon memory add/remove
				119	or upon ipc namespace creation/removal (see the msgmni description
				120	above). Echoing "1" into this file enables msgmni automatic recomputing.
				121	Echoing "0" turns it off. auto_msgmni default value is 1.
				122
				123
				124	==============================================================
				125
H. Peter Anvin	d75757a	2009-12-11 14:23:44 -0800	[diff] [blame]	126	bootloader_type:
				127
				128	x86 bootloader identification
				129
				130	This gives the bootloader type number as indicated by the bootloader,
				131	shifted left by 4, and OR'd with the low four bits of the bootloader
				132	version. The reason for this encoding is that this used to match the
				133	type_of_loader field in the kernel header; the encoding is kept for
				134	backwards compatibility. That is, if the full bootloader type number
				135	is 0x15 and the full version number is 0x234, this file will contain
				136	the value 340 = 0x154.
				137
				138	See the type_of_loader and ext_loader_type fields in
				139	Documentation/x86/boot.txt for additional information.
				140
				141	==============================================================
				142
				143	bootloader_version:
				144
				145	x86 bootloader version
				146
				147	The complete bootloader version number. In the example above, this
				148	file will contain the value 564 = 0x234.
				149
				150	See the type_of_loader and ext_loader_ver fields in
				151	Documentation/x86/boot.txt for additional information.
				152
				153	==============================================================
				154
Hans-Joachim Picht	c114728a	2009-09-11 10:28:47 +0200	[diff] [blame]	155	callhome:
				156
				157	Controls the kernel's callhome behavior in case of a kernel panic.
				158
				159	The s390 hardware allows an operating system to send a notification
				160	to a service organization (callhome) in case of an operating system panic.
				161
				162	When the value in this file is 0 (which is the default behavior)
				163	nothing happens in case of a kernel panic. If this value is set to "1"
				164	the complete kernel oops message is send to the IBM customer service
				165	organization in case the mainframe the Linux operating system is running
				166	on has a service contract with IBM.
				167
				168	==============================================================
				169
Dan Ballard	73efc03	2011-10-31 17:11:20 -0700	[diff] [blame]	170	cap_last_cap
				171
				172	Highest valid capability of the running kernel. Exports
				173	CAP_LAST_CAP from the kernel.
				174
				175	==============================================================
				176
Linus Torvalds	1da177e	2005-04-16 15:20:36 -0700	[diff] [blame]	177	core_pattern:
				178
				179	core_pattern is used to specify a core dumpfile pattern name.
Matthias Urlichs	cd08104	2006-10-11 01:21:57 -0700	[diff] [blame]	180	. max length 128 characters; default value is "core"
Linus Torvalds	1da177e	2005-04-16 15:20:36 -0700	[diff] [blame]	181	. core_pattern is used as a pattern template for the output filename;
				182	certain string patterns (beginning with '%') are substituted with
				183	their actual values.
				184	. backward compatibility with core_uses_pid:
				185	If core_pattern does not include "%p" (default does not)
				186	and core_uses_pid is set, then .PID will be appended to
				187	the filename.
				188	. corename format specifiers:
				189	%<NUL> '%' is dropped
				190	%% output one '%'
				191	%p pid
Stéphane Graber	65aafb1	2013-09-11 14:24:32 -0700	[diff] [blame]	192	%P global pid (init PID namespace)
Oleg Nesterov	b03023e	2014-10-13 15:53:35 -0700	[diff] [blame]	193	%i tid
				194	%I global tid (init PID namespace)
Linus Torvalds	1da177e	2005-04-16 15:20:36 -0700	[diff] [blame]	195	%u uid
				196	%g gid
Oleg Nesterov	12a2b4b	2012-10-04 17:15:25 -0700	[diff] [blame]	197	%d dump mode, matches PR_SET_DUMPABLE and
				198	/proc/sys/fs/suid_dumpable
Linus Torvalds	1da177e	2005-04-16 15:20:36 -0700	[diff] [blame]	199	%s signal number
				200	%t UNIX time of dump
				201	%h hostname
Jiri Slaby	57cc083	2011-05-26 16:25:46 -0700	[diff] [blame]	202	%e executable filename (may be shortened)
				203	%E executable path
Linus Torvalds	1da177e	2005-04-16 15:20:36 -0700	[diff] [blame]	204	%<OTHER> both are dropped
Matthias Urlichs	cd08104	2006-10-11 01:21:57 -0700	[diff] [blame]	205	. If the first character of the pattern is a '\|', the kernel will treat
				206	the rest of the pattern as a command to run. The core dump will be
				207	written to the standard input of that program instead of to a file.
Linus Torvalds	1da177e	2005-04-16 15:20:36 -0700	[diff] [blame]	208
				209	==============================================================
				210
Neil Horman	a293980	2009-09-23 15:56:56 -0700	[diff] [blame]	211	core_pipe_limit:
				212
Borislav Petkov	807094c	2011-07-23 10:39:29 -0700	[diff] [blame]	213	This sysctl is only applicable when core_pattern is configured to pipe
				214	core files to a user space helper (when the first character of
				215	core_pattern is a '\|', see above). When collecting cores via a pipe
				216	to an application, it is occasionally useful for the collecting
				217	application to gather data about the crashing process from its
				218	/proc/pid directory. In order to do this safely, the kernel must wait
				219	for the collecting process to exit, so as not to remove the crashing
				220	processes proc files prematurely. This in turn creates the
				221	possibility that a misbehaving userspace collecting process can block
				222	the reaping of a crashed process simply by never exiting. This sysctl
				223	defends against that. It defines how many concurrent crashing
				224	processes may be piped to user space applications in parallel. If
				225	this value is exceeded, then those crashing processes above that value
				226	are noted via the kernel log and their cores are skipped. 0 is a
				227	special value, indicating that unlimited processes may be captured in
				228	parallel, but that no waiting will take place (i.e. the collecting
				229	process is not guaranteed access to /proc/<crashing pid>/). This
				230	value defaults to 0.
Neil Horman	a293980	2009-09-23 15:56:56 -0700	[diff] [blame]	231
				232	==============================================================
				233
Linus Torvalds	1da177e	2005-04-16 15:20:36 -0700	[diff] [blame]	234	core_uses_pid:
				235
				236	The default coredump filename is "core". By setting
				237	core_uses_pid to 1, the coredump filename becomes core.PID.
				238	If core_pattern does not include "%p" (default does not)
				239	and core_uses_pid is set, then .PID will be appended to
				240	the filename.
				241
				242	==============================================================
				243
				244	ctrl-alt-del:
				245
				246	When the value in this file is 0, ctrl-alt-del is trapped and
				247	sent to the init(1) program to handle a graceful restart.
				248	When, however, the value is > 0, Linux's reaction to a Vulcan
				249	Nerve Pinch (tm) will be an immediate reboot, without even
				250	syncing its dirty buffers.
				251
				252	Note: when a program (like dosemu) has the keyboard in 'raw'
				253	mode, the ctrl-alt-del is intercepted by the program before it
				254	ever reaches the kernel tty layer, and it's up to the program
				255	to decide what to do with it.
				256
				257	==============================================================
				258
Dan Rosenberg	eaf06b2	2010-11-11 14:05:18 -0800	[diff] [blame]	259	dmesg_restrict:
				260
Borislav Petkov	807094c	2011-07-23 10:39:29 -0700	[diff] [blame]	261	This toggle indicates whether unprivileged users are prevented
				262	from using dmesg(8) to view messages from the kernel's log buffer.
				263	When dmesg_restrict is set to (0) there are no restrictions. When
Serge E. Hallyn	38ef4c2	2010-12-08 15:19:01 +0000	[diff] [blame]	264	dmesg_restrict is set set to (1), users must have CAP_SYSLOG to use
Dan Rosenberg	eaf06b2	2010-11-11 14:05:18 -0800	[diff] [blame]	265	dmesg(8).
				266
Borislav Petkov	807094c	2011-07-23 10:39:29 -0700	[diff] [blame]	267	The kernel config option CONFIG_SECURITY_DMESG_RESTRICT sets the
				268	default value of dmesg_restrict.
Dan Rosenberg	eaf06b2	2010-11-11 14:05:18 -0800	[diff] [blame]	269
				270	==============================================================
				271
Linus Torvalds	1da177e	2005-04-16 15:20:36 -0700	[diff] [blame]	272	domainname & hostname:
				273
				274	These files can be used to set the NIS/YP domainname and the
				275	hostname of your box in exactly the same way as the commands
				276	domainname and hostname, i.e.:
				277	# echo "darkstar" > /proc/sys/kernel/hostname
				278	# echo "mydomain" > /proc/sys/kernel/domainname
				279	has the same effect as
				280	# hostname "darkstar"
				281	# domainname "mydomain"
				282
				283	Note, however, that the classic darkstar.frop.org has the
				284	hostname "darkstar" and DNS (Internet Domain Name Server)
				285	domainname "frop.org", not to be confused with the NIS (Network
				286	Information Service) or YP (Yellow Pages) domainname. These two
				287	domain names are in general different. For a detailed discussion
				288	see the hostname(1) man page.
				289
				290	==============================================================
				291
				292	hotplug:
				293
				294	Path for the hotplug policy agent.
				295	Default value is "/sbin/hotplug".
				296
				297	==============================================================
				298
Aaron Tomlin	270750db	2014-01-20 17:34:13 +0000	[diff] [blame]	299	hung_task_panic:
				300
				301	Controls the kernel's behavior when a hung task is detected.
				302	This file shows up if CONFIG_DETECT_HUNG_TASK is enabled.
				303
				304	0: continue operation. This is the default behavior.
				305
				306	1: panic immediately.
				307
				308	==============================================================
				309
				310	hung_task_check_count:
				311
				312	The upper bound on the number of tasks that are checked.
				313	This file shows up if CONFIG_DETECT_HUNG_TASK is enabled.
				314
				315	==============================================================
				316
				317	hung_task_timeout_secs:
				318
				319	Check interval. When a task in D state did not get scheduled
				320	for more than this value report a warning.
				321	This file shows up if CONFIG_DETECT_HUNG_TASK is enabled.
				322
				323	0: means infinite timeout - no checking done.
Liu Hua	80df284	2014-04-07 15:38:57 -0700	[diff] [blame]	324	Possible values to set are in range {0..LONG_MAX/HZ}.
Aaron Tomlin	270750db	2014-01-20 17:34:13 +0000	[diff] [blame]	325
				326	==============================================================
				327
Aaron Tomlin	70e0ac5	2014-01-27 09:00:57 +0000	[diff] [blame]	328	hung_task_warnings:
Aaron Tomlin	270750db	2014-01-20 17:34:13 +0000	[diff] [blame]	329
				330	The maximum number of warnings to report. During a check interval
Aaron Tomlin	70e0ac5	2014-01-27 09:00:57 +0000	[diff] [blame]	331	if a hung task is detected, this value is decreased by 1.
				332	When this value reaches 0, no more warnings will be reported.
Aaron Tomlin	270750db	2014-01-20 17:34:13 +0000	[diff] [blame]	333	This file shows up if CONFIG_DETECT_HUNG_TASK is enabled.
				334
				335	-1: report an infinite number of warnings.
				336
				337	==============================================================
				338
Kees Cook	7984754	2014-01-23 15:55:59 -0800	[diff] [blame]	339	kexec_load_disabled:
				340
				341	A toggle indicating if the kexec_load syscall has been disabled. This
				342	value defaults to 0 (false: kexec_load enabled), but can be set to 1
				343	(true: kexec_load disabled). Once true, kexec can no longer be used, and
				344	the toggle cannot be set back to false. This allows a kexec image to be
				345	loaded before disabling the syscall, allowing a system to set up (and
				346	later use) an image without it being altered. Generally used together
				347	with the "modules_disabled" sysctl.
				348
				349	==============================================================
				350
Dan Rosenberg	455cd5a	2011-01-12 16:59:41 -0800	[diff] [blame]	351	kptr_restrict:
				352
				353	This toggle indicates whether restrictions are placed on
Ryan Mallon	312b4e2	2013-11-12 15:08:51 -0800	[diff] [blame]	354	exposing kernel addresses via /proc and other interfaces.
				355
				356	When kptr_restrict is set to (0), the default, there are no restrictions.
				357
				358	When kptr_restrict is set to (1), kernel pointers printed using the %pK
				359	format specifier will be replaced with 0's unless the user has CAP_SYSLOG
				360	and effective user and group ids are equal to the real ids. This is
				361	because %pK checks are done at read() time rather than open() time, so
				362	if permissions are elevated between the open() and the read() (e.g via
				363	a setuid binary) then %pK will not leak kernel pointers to unprivileged
				364	users. Note, this is a temporary solution only. The correct long-term
				365	solution is to do the permission checks at open() time. Consider removing
				366	world read permissions from files that use %pK, and using dmesg_restrict
				367	to protect against uses of %pK in dmesg(8) if leaking kernel pointer
				368	values to unprivileged users is a concern.
				369
				370	When kptr_restrict is set to (2), kernel pointers printed using
				371	%pK will be replaced with 0's regardless of privileges.
Dan Rosenberg	455cd5a	2011-01-12 16:59:41 -0800	[diff] [blame]	372
				373	==============================================================
				374
Chuck Ebbert	0741f4d	2006-12-07 02:14:11 +0100	[diff] [blame]	375	kstack_depth_to_print: (X86 only)
				376
				377	Controls the number of words to print when dumping the raw
				378	kernel stack.
				379
				380	==============================================================
				381
Borislav Petkov	807094c	2011-07-23 10:39:29 -0700	[diff] [blame]	382	l2cr: (PPC only)
				383
				384	This flag controls the L2 cache of G3 processor boards. If
				385	0, the cache is disabled. Enabled if nonzero.
				386
				387	==============================================================
				388
Kees Cook	3d43321	2009-04-02 15:49:29 -0700	[diff] [blame]	389	modules_disabled:
				390
				391	A toggle value indicating if modules are allowed to be loaded
				392	in an otherwise modular kernel. This toggle defaults to off
				393	(0), but can be set true (1). Once true, modules can be
				394	neither loaded nor unloaded, and the toggle cannot be set back
Kees Cook	7984754	2014-01-23 15:55:59 -0800	[diff] [blame]	395	to false. Generally used with the "kexec_load_disabled" toggle.
Kees Cook	3d43321	2009-04-02 15:49:29 -0700	[diff] [blame]	396
				397	==============================================================
				398
Stanislav Kinsbursky	03f5956	2013-01-04 15:34:50 -0800	[diff] [blame]	399	msg_next_id, sem_next_id, and shm_next_id:
				400
				401	These three toggles allows to specify desired id for next allocated IPC
				402	object: message, semaphore or shared memory respectively.
				403
				404	By default they are equal to -1, which means generic allocation logic.
				405	Possible values to set are in range {0..INT_MAX}.
				406
				407	Notes:
				408	1) kernel doesn't guarantee, that new object will have desired id. So,
				409	it's up to userspace, how to handle an object with "wrong" id.
				410	2) Toggle with non-default value will be set back to -1 by kernel after
				411	successful IPC object allocation.
				412
				413	==============================================================
				414
Borislav Petkov	807094c	2011-07-23 10:39:29 -0700	[diff] [blame]	415	nmi_watchdog:
				416
				417	Enables/Disables the NMI watchdog on x86 systems. When the value is
				418	non-zero the NMI watchdog is enabled and will continuously test all
				419	online cpus to determine whether or not they are still functioning
				420	properly. Currently, passing "nmi_watchdog=" parameter at boot time is
				421	required for this function to work.
				422
				423	If LAPIC NMI watchdog method is in use (nmi_watchdog=2 kernel
				424	parameter), the NMI watchdog shares registers with oprofile. By
				425	disabling the NMI watchdog, oprofile may have more registers to
				426	utilize.
				427
				428	==============================================================
				429
Mel Gorman	10fc05d	2013-10-07 11:28:40 +0100	[diff] [blame]	430	numa_balancing
				431
				432	Enables/disables automatic page fault based NUMA memory
				433	balancing. Memory is moved automatically to nodes
				434	that access it often.
				435
				436	Enables/disables automatic NUMA memory balancing. On NUMA machines, there
				437	is a performance penalty if remote memory is accessed by a CPU. When this
				438	feature is enabled the kernel samples what task thread is accessing memory
				439	by periodically unmapping pages and later trapping a page fault. At the
				440	time of the page fault, it is determined if the data being accessed should
				441	be migrated to a local memory node.
				442
				443	The unmapping of pages and trapping faults incur additional overhead that
				444	ideally is offset by improved memory locality but there is no universal
				445	guarantee. If the target workload is already bound to NUMA nodes then this
				446	feature should be disabled. Otherwise, if the system overhead from the
				447	feature is too high then the rate the kernel samples for NUMA hinting
				448	faults may be controlled by the numa_balancing_scan_period_min_ms,
Mel Gorman	930aa17	2013-10-07 11:29:37 +0100	[diff] [blame]	449	numa_balancing_scan_delay_ms, numa_balancing_scan_period_max_ms,
Rik van Riel	52bf84a	2014-01-27 17:03:40 -0500	[diff] [blame]	450	numa_balancing_scan_size_mb, and numa_balancing_settle_count sysctls.
Mel Gorman	10fc05d	2013-10-07 11:28:40 +0100	[diff] [blame]	451
				452	==============================================================
				453
				454	numa_balancing_scan_period_min_ms, numa_balancing_scan_delay_ms,
Mel Gorman	930aa17	2013-10-07 11:29:37 +0100	[diff] [blame]	455	numa_balancing_scan_period_max_ms, numa_balancing_scan_size_mb
Mel Gorman	10fc05d	2013-10-07 11:28:40 +0100	[diff] [blame]	456
				457	Automatic NUMA balancing scans tasks address space and unmaps pages to
				458	detect if pages are properly placed or if the data should be migrated to a
				459	memory node local to where the task is running. Every "scan delay" the task
				460	scans the next "scan size" number of pages in its address space. When the
				461	end of the address space is reached the scanner restarts from the beginning.
				462
				463	In combination, the "scan delay" and "scan size" determine the scan rate.
				464	When "scan delay" decreases, the scan rate increases. The scan delay and
				465	hence the scan rate of every task is adaptive and depends on historical
				466	behaviour. If pages are properly placed then the scan delay increases,
				467	otherwise the scan delay decreases. The "scan size" is not adaptive but
				468	the higher the "scan size", the higher the scan rate.
				469
				470	Higher scan rates incur higher system overhead as page faults must be
				471	trapped and potentially data must be migrated. However, the higher the scan
				472	rate, the more quickly a tasks memory is migrated to a local node if the
				473	workload pattern changes and minimises performance impact due to remote
				474	memory accesses. These sysctls control the thresholds for scan delays and
				475	the number of pages scanned.
				476
Mel Gorman	598f0ec	2013-10-07 11:28:55 +0100	[diff] [blame]	477	numa_balancing_scan_period_min_ms is the minimum time in milliseconds to
				478	scan a tasks virtual memory. It effectively controls the maximum scanning
				479	rate for each task.
Mel Gorman	10fc05d	2013-10-07 11:28:40 +0100	[diff] [blame]	480
				481	numa_balancing_scan_delay_ms is the starting "scan delay" used for a task
				482	when it initially forks.
				483
Mel Gorman	598f0ec	2013-10-07 11:28:55 +0100	[diff] [blame]	484	numa_balancing_scan_period_max_ms is the maximum time in milliseconds to
				485	scan a tasks virtual memory. It effectively controls the minimum scanning
				486	rate for each task.
Mel Gorman	10fc05d	2013-10-07 11:28:40 +0100	[diff] [blame]	487
				488	numa_balancing_scan_size_mb is how many megabytes worth of pages are
				489	scanned for a given scan.
				490
Mel Gorman	10fc05d	2013-10-07 11:28:40 +0100	[diff] [blame]	491	==============================================================
				492
Linus Torvalds	1da177e	2005-04-16 15:20:36 -0700	[diff] [blame]	493	osrelease, ostype & version:
				494
				495	# cat osrelease
				496	2.1.88
				497	# cat ostype
				498	Linux
				499	# cat version
				500	#5 Wed Feb 25 21:49:24 MET 1998
				501
				502	The files osrelease and ostype should be clear enough. Version
				503	needs a little more clarification however. The '#5' means that
				504	this is the fifth kernel built from this source base and the
				505	date behind it indicates the time the kernel was built.
				506	The only way to tune these values is to rebuild the kernel :-)
				507
				508	==============================================================
				509
				510	overflowgid & overflowuid:
				511
Borislav Petkov	807094c	2011-07-23 10:39:29 -0700	[diff] [blame]	512	if your architecture did not always support 32-bit UIDs (i.e. arm,
				513	i386, m68k, sh, and sparc32), a fixed UID and GID will be returned to
				514	applications that use the old 16-bit UID/GID system calls, if the
				515	actual UID or GID would exceed 65535.
Linus Torvalds	1da177e	2005-04-16 15:20:36 -0700	[diff] [blame]	516
				517	These sysctls allow you to change the value of the fixed UID and GID.
				518	The default is 65534.
				519
				520	==============================================================
				521
				522	panic:
				523
Borislav Petkov	807094c	2011-07-23 10:39:29 -0700	[diff] [blame]	524	The value in this file represents the number of seconds the kernel
				525	waits before rebooting on a panic. When you use the software watchdog,
				526	the recommended setting is 60.
				527
				528	==============================================================
				529
				530	panic_on_unrecovered_nmi:
				531
				532	The default Linux behaviour on an NMI of either memory or unknown is
				533	to continue operation. For many environments such as scientific
				534	computing it is preferable that the box is taken out and the error
				535	dealt with than an uncorrected parity/ECC error get propagated.
				536
				537	A small number of systems do generate NMI's for bizarre random reasons
				538	such as power management so the default is off. That sysctl works like
				539	the existing panic controls already in that directory.
Linus Torvalds	1da177e	2005-04-16 15:20:36 -0700	[diff] [blame]	540
				541	==============================================================
				542
				543	panic_on_oops:
				544
				545	Controls the kernel's behaviour when an oops or BUG is encountered.
				546
				547	0: try to continue operation
				548
Matt LaPlante	a982ac0	2007-05-09 07:35:06 +0200	[diff] [blame]	549	1: panic immediately. If the `panic' sysctl is also non-zero then the
Maxime Bizon	8b23d04d	2006-08-05 12:14:32 -0700	[diff] [blame]	550	machine will be rebooted.
Linus Torvalds	1da177e	2005-04-16 15:20:36 -0700	[diff] [blame]	551
				552	==============================================================
				553
Mitsuo Hayasaka	55af779	2011-11-29 15:08:36 +0900	[diff] [blame]	554	panic_on_stackoverflow:
				555
				556	Controls the kernel's behavior when detecting the overflows of
				557	kernel, IRQ and exception stacks except a user stack.
				558	This file shows up if CONFIG_DEBUG_STACKOVERFLOW is enabled.
				559
				560	0: try to continue operation.
				561
				562	1: panic immediately.
				563
				564	==============================================================
				565
Dave Hansen	14c63f1	2013-06-21 08:51:36 -0700	[diff] [blame]	566	perf_cpu_time_max_percent:
				567
				568	Hints to the kernel how much CPU time it should be allowed to
				569	use to handle perf sampling events. If the perf subsystem
				570	is informed that its samples are exceeding this limit, it
				571	will drop its sampling frequency to attempt to reduce its CPU
				572	usage.
				573
				574	Some perf sampling happens in NMIs. If these samples
				575	unexpectedly take too long to execute, the NMIs can become
				576	stacked up next to each other so much that nothing else is
				577	allowed to execute.
				578
				579	0: disable the mechanism. Do not monitor or correct perf's
				580	sampling rate no matter how CPU time it takes.
				581
				582	1-100: attempt to throttle perf's sample rate to this
				583	percentage of CPU. Note: the kernel calculates an
				584	"expected" length of each sample event. 100 here means
				585	100% of that expected length. Even if this is set to
				586	100, you may still see sample throttling if this
				587	length is exceeded. Set to 0 if you truly do not care
				588	how much CPU is consumed.
				589
				590	==============================================================
				591
Mitsuo Hayasaka	55af779	2011-11-29 15:08:36 +0900	[diff] [blame]	592
Linus Torvalds	1da177e	2005-04-16 15:20:36 -0700	[diff] [blame]	593	pid_max:
				594
Robert P. J. Day	beb7dd8	2007-05-09 07:14:03 +0200	[diff] [blame]	595	PID allocation wrap value. When the kernel's next PID value
Linus Torvalds	1da177e	2005-04-16 15:20:36 -0700	[diff] [blame]	596	reaches this value, it wraps back to a minimum PID value.
				597	PIDs of value pid_max or larger are not allocated.
				598
				599	==============================================================
				600
Pavel Emelyanov	b8f566b	2012-01-12 17:20:27 -0800	[diff] [blame]	601	ns_last_pid:
				602
				603	The last pid allocated in the current (the one task using this sysctl
				604	lives in) pid namespace. When selecting a pid for a next task on fork
				605	kernel tries to allocate a number starting from this one.
				606
				607	==============================================================
				608
Linus Torvalds	1da177e	2005-04-16 15:20:36 -0700	[diff] [blame]	609	powersave-nap: (PPC only)
				610
				611	If set, Linux-PPC will use the 'nap' mode of powersaving,
				612	otherwise the 'doze' mode will be used.
				613
				614	==============================================================
				615
				616	printk:
				617
				618	The four values in printk denote: console_loglevel,
				619	default_message_loglevel, minimum_console_loglevel and
				620	default_console_loglevel respectively.
				621
				622	These values influence printk() behavior when printing or
				623	logging error messages. See 'man 2 syslog' for more info on
				624	the different loglevels.
				625
				626	- console_loglevel: messages with a higher priority than
				627	this will be printed to the console
Paul Bolle	87889e1	2011-02-06 21:00:41 +0100	[diff] [blame]	628	- default_message_loglevel: messages without an explicit priority
Linus Torvalds	1da177e	2005-04-16 15:20:36 -0700	[diff] [blame]	629	will be printed with this priority
				630	- minimum_console_loglevel: minimum (highest) value to which
				631	console_loglevel can be set
				632	- default_console_loglevel: default value for console_loglevel
				633
				634	==============================================================
				635
Borislav Petkov	807094c	2011-07-23 10:39:29 -0700	[diff] [blame]	636	printk_delay:
				637
				638	Delay each printk message in printk_delay milliseconds
				639
				640	Value from 0 - 10000 is allowed.
				641
				642	==============================================================
				643
Linus Torvalds	1da177e	2005-04-16 15:20:36 -0700	[diff] [blame]	644	printk_ratelimit:
				645
				646	Some warning messages are rate limited. printk_ratelimit specifies
				647	the minimum length of time between these messages (in jiffies), by
				648	default we allow one every 5 seconds.
				649
				650	A value of 0 will disable rate limiting.
				651
				652	==============================================================
				653
				654	printk_ratelimit_burst:
				655
				656	While long term we enforce one message per printk_ratelimit
				657	seconds, we do allow a burst of messages to pass through.
				658	printk_ratelimit_burst specifies the number of messages we can
				659	send before ratelimiting kicks in.
				660
				661	==============================================================
				662
Borislav Petkov	807094c	2011-07-23 10:39:29 -0700	[diff] [blame]	663	randomize_va_space:
Jiri Kosina	1ec7fd5	2008-02-09 23:24:08 +0100	[diff] [blame]	664
				665	This option can be used to select the type of process address
				666	space randomization that is used in the system, for architectures
				667	that support this feature.
				668
Horst Schirmeier	b7f5ab6	2009-07-03 14:20:17 +0200	[diff] [blame]	669	0 - Turn the process address space randomization off. This is the
				670	default for architectures that do not support this feature anyways,
				671	and kernels that are booted with the "norandmaps" parameter.
Jiri Kosina	1ec7fd5	2008-02-09 23:24:08 +0100	[diff] [blame]	672
				673	1 - Make the addresses of mmap base, stack and VDSO page randomized.
				674	This, among other things, implies that shared libraries will be
Horst Schirmeier	b7f5ab6	2009-07-03 14:20:17 +0200	[diff] [blame]	675	loaded to random addresses. Also for PIE-linked binaries, the
				676	location of code start is randomized. This is the default if the
				677	CONFIG_COMPAT_BRK option is enabled.
Jiri Kosina	1ec7fd5	2008-02-09 23:24:08 +0100	[diff] [blame]	678
Horst Schirmeier	b7f5ab6	2009-07-03 14:20:17 +0200	[diff] [blame]	679	2 - Additionally enable heap randomization. This is the default if
				680	CONFIG_COMPAT_BRK is disabled.
				681
				682	There are a few legacy applications out there (such as some ancient
Jiri Kosina	1ec7fd5	2008-02-09 23:24:08 +0100	[diff] [blame]	683	versions of libc.so.5 from 1996) that assume that brk area starts
Horst Schirmeier	b7f5ab6	2009-07-03 14:20:17 +0200	[diff] [blame]	684	just after the end of the code+bss. These applications break when
				685	start of the brk area is randomized. There are however no known
Jiri Kosina	1ec7fd5	2008-02-09 23:24:08 +0100	[diff] [blame]	686	non-legacy applications that would be broken this way, so for most
Horst Schirmeier	b7f5ab6	2009-07-03 14:20:17 +0200	[diff] [blame]	687	systems it is safe to choose full randomization.
				688
				689	Systems with ancient and/or broken binaries should be configured
				690	with CONFIG_COMPAT_BRK enabled, which excludes the heap from process
				691	address space randomization.
Jiri Kosina	1ec7fd5	2008-02-09 23:24:08 +0100	[diff] [blame]	692
				693	==============================================================
				694
Linus Torvalds	1da177e	2005-04-16 15:20:36 -0700	[diff] [blame]	695	reboot-cmd: (Sparc only)
				696
				697	??? This seems to be a way to give an argument to the Sparc
				698	ROM/Flash boot loader. Maybe to tell it what to do after
				699	rebooting. ???
				700
				701	==============================================================
				702
				703	rtsig-max & rtsig-nr:
				704
				705	The file rtsig-max can be used to tune the maximum number
				706	of POSIX realtime (queued) signals that can be outstanding
				707	in the system.
				708
				709	rtsig-nr shows the number of RT signals currently queued.
				710
				711	==============================================================
				712
				713	sg-big-buff:
				714
				715	This file shows the size of the generic SCSI (sg) buffer.
				716	You can't tune it just yet, but you could change it on
				717	compile time by editing include/scsi/sg.h and changing
				718	the value of SG_BIG_BUFF.
				719
				720	There shouldn't be any reason to change this value. If
				721	you can come up with one, you probably know what you
				722	are doing anyway :)
				723
				724	==============================================================
				725
Carlos Alberto Lopez Perez	358e419	2013-01-04 15:35:05 -0800	[diff] [blame]	726	shmall:
				727
				728	This parameter sets the total amount of shared memory pages that
				729	can be used system wide. Hence, SHMALL should always be at least
				730	ceil(shmmax/PAGE_SIZE).
				731
				732	If you are not sure what the default PAGE_SIZE is on your Linux
				733	system, you can run the following command:
				734
				735	# getconf PAGE_SIZE
				736
				737	==============================================================
				738
Borislav Petkov	807094c	2011-07-23 10:39:29 -0700	[diff] [blame]	739	shmmax:
Linus Torvalds	1da177e	2005-04-16 15:20:36 -0700	[diff] [blame]	740
				741	This value can be used to query and set the run time limit
				742	on the maximum shared memory segment size that can be created.
Borislav Petkov	807094c	2011-07-23 10:39:29 -0700	[diff] [blame]	743	Shared memory segments up to 1Gb are now supported in the
Linus Torvalds	1da177e	2005-04-16 15:20:36 -0700	[diff] [blame]	744	kernel. This value defaults to SHMMAX.
				745
				746	==============================================================
				747
Vasiliy Kulikov	b34a6b1	2011-07-26 16:08:48 -0700	[diff] [blame]	748	shm_rmid_forced:
				749
				750	Linux lets you set resource limits, including how much memory one
				751	process can consume, via setrlimit(2). Unfortunately, shared memory
				752	segments are allowed to exist without association with any process, and
				753	thus might not be counted against any resource limits. If enabled,
				754	shared memory segments are automatically destroyed when their attach
				755	count becomes zero after a detach or a process termination. It will
				756	also destroy segments that were created, but never attached to, on exit
				757	from the process. The only use left for IPC_RMID is to immediately
				758	destroy an unattached segment. Of course, this breaks the way things are
				759	defined, so some applications might stop working. Note that this
				760	feature will do you no good unless you also configure your resource
				761	limits (in particular, RLIMIT_AS and RLIMIT_NPROC). Most systems don't
				762	need this.
				763
				764	Note that if you change this from 0 to 1, already created segments
				765	without users and with a dead originative process will be destroyed.
				766
				767	==============================================================
				768
Kees Cook	f4aacea	2014-06-06 14:37:19 -0700	[diff] [blame]	769	sysctl_writes_strict:
				770
				771	Control how file position affects the behavior of updating sysctl values
				772	via the /proc/sys interface:
				773
				774	-1 - Legacy per-write sysctl value handling, with no printk warnings.
				775	Each write syscall must fully contain the sysctl value to be
				776	written, and multiple writes on the same sysctl file descriptor
				777	will rewrite the sysctl value, regardless of file position.
				778	0 - (default) Same behavior as above, but warn about processes that
				779	perform writes to a sysctl file descriptor when the file position
				780	is not 0.
				781	1 - Respect file position when writing sysctl strings. Multiple writes
				782	will append to the sysctl value buffer. Anything past the max length
				783	of the sysctl value buffer will be ignored. Writes to numeric sysctl
				784	entries must always be at file position 0 and the value must be
				785	fully contained in the buffer sent in the write syscall.
				786
				787	==============================================================
				788
Aaron Tomlin	ed23587	2014-06-23 13:22:05 -0700	[diff] [blame]	789	softlockup_all_cpu_backtrace:
				790
				791	This value controls the soft lockup detector thread's behavior
				792	when a soft lockup condition is detected as to whether or not
				793	to gather further debug information. If enabled, each cpu will
				794	be issued an NMI and instructed to capture stack trace.
				795
				796	This feature is only applicable for architectures which support
				797	NMI.
				798
				799	0: do nothing. This is the default behavior.
				800
				801	1: on detection capture more debug information.
				802
				803	==============================================================
				804
Borislav Petkov	807094c	2011-07-23 10:39:29 -0700	[diff] [blame]	805	tainted:
Linus Torvalds	1da177e	2005-04-16 15:20:36 -0700	[diff] [blame]	806
				807	Non-zero if the kernel has been tainted. Numeric values, which
				808	can be ORed together:
				809
Greg Kroah-Hartman	bb20698	2008-10-17 15:01:07 -0700	[diff] [blame]	810	1 - A module with a non-GPL license has been loaded, this
				811	includes modules with no license.
				812	Set by modutils >= 2.4.9 and module-init-tools.
				813	2 - A module was force loaded by insmod -f.
				814	Set by modutils >= 2.4.9 and module-init-tools.
				815	4 - Unsafe SMP processors: SMP with CPUs not designed for SMP.
				816	8 - A module was forcibly unloaded from the system by rmmod -f.
				817	16 - A hardware machine check error occurred on the system.
				818	32 - A bad page was discovered on the system.
				819	64 - The user has asked that the system be marked "tainted". This
				820	could be because they are running software that directly modifies
				821	the hardware, or for other reasons.
				822	128 - The system has died.
				823	256 - The ACPI DSDT has been overridden with one supplied by the user
				824	instead of using the one provided by the hardware.
				825	512 - A kernel warning has occurred.
				826	1024 - A module from drivers/staging was loaded.
Larry Finger	f5fe184	2012-02-06 09:49:50 -0800	[diff] [blame]	827	2048 - The system is working around a severe firmware bug.
				828	4096 - An out-of-tree module has been loaded.
Mathieu Desnoyers	66cc69e	2014-03-13 12:11:30 +1030	[diff] [blame]	829	8192 - An unsigned module has been loaded in a kernel supporting module
				830	signature.
Josh Hunt	69361ee	2014-08-08 14:22:31 -0700	[diff] [blame]	831	16384 - A soft lockup has previously occurred on the system.
Linus Torvalds	1da177e	2005-04-16 15:20:36 -0700	[diff] [blame]	832
Shen Feng	760df93	2009-04-02 16:57:20 -0700	[diff] [blame]	833	==============================================================
				834
Shen Feng	760df93	2009-04-02 16:57:20 -0700	[diff] [blame]	835	unknown_nmi_panic:
				836
Borislav Petkov	807094c	2011-07-23 10:39:29 -0700	[diff] [blame]	837	The value in this file affects behavior of handling NMI. When the
				838	value is non-zero, unknown NMI is trapped and then panic occurs. At
				839	that time, kernel debugging information is displayed on console.
Shen Feng	760df93	2009-04-02 16:57:20 -0700	[diff] [blame]	840
Borislav Petkov	807094c	2011-07-23 10:39:29 -0700	[diff] [blame]	841	NMI switch that most IA32 servers have fires unknown NMI up, for
				842	example. If a system hangs up, try pressing the NMI switch.
Li Zefan	08825c9	2013-05-17 10:31:20 +0800	[diff] [blame]	843
				844	==============================================================
				845
				846	watchdog_thresh:
				847
				848	This value can be used to control the frequency of hrtimer and NMI
				849	events and the soft and hard lockup thresholds. The default threshold
				850	is 10 seconds.
				851
				852	The softlockup threshold is (2 * watchdog_thresh). Setting this
				853	tunable to zero will disable lockup detection altogether.
				854
				855	==============================================================