blob: ebff3c10a07fc124b865f815de5b38d65f38d1fa [file] [log] [blame]
Linus Torvalds1da177e2005-04-16 15:20:36 -07001------------------------------------------------------------------------------
2 T H E /proc F I L E S Y S T E M
3------------------------------------------------------------------------------
4/proc/sys Terrehon Bowden <terrehon@pacbell.net> October 7 1999
5 Bodo Bauer <bb@ricochet.net>
6
72.4.x update Jorge Nerin <comandante@zaralinux.com> November 14 2000
Shen Feng760df932009-04-02 16:57:20 -07008move /proc/sys Shen Feng <shen@cn.fujitsu.com> April 1 2009
Linus Torvalds1da177e2005-04-16 15:20:36 -07009------------------------------------------------------------------------------
10Version 1.3 Kernel version 2.2.12
11 Kernel version 2.4.0-test11-pre4
12------------------------------------------------------------------------------
13
14Table of Contents
15-----------------
16
17 0 Preface
18 0.1 Introduction/Credits
19 0.2 Legal Stuff
20
21 1 Collecting System Information
22 1.1 Process-Specific Subdirectories
23 1.2 Kernel data
24 1.3 IDE devices in /proc/ide
25 1.4 Networking info in /proc/net
26 1.5 SCSI info
27 1.6 Parallel port info in /proc/parport
28 1.7 TTY info in /proc/tty
29 1.8 Miscellaneous kernel statistics in /proc/stat
Shen Feng760df932009-04-02 16:57:20 -070030 1.9 Ext4 file system parameters
Linus Torvalds1da177e2005-04-16 15:20:36 -070031
32 2 Modifying System Parameters
Shen Feng760df932009-04-02 16:57:20 -070033
34 3 Per-Process Parameters
35 3.1 /proc/<pid>/oom_adj - Adjust the oom-killer score
36 3.2 /proc/<pid>/oom_score - Display current oom-killer score
37 3.3 /proc/<pid>/io - Display the IO accounting fields
38 3.4 /proc/<pid>/coredump_filter - Core dump filtering settings
39 3.5 /proc/<pid>/mountinfo - Information about mounts
40
Linus Torvalds1da177e2005-04-16 15:20:36 -070041
42------------------------------------------------------------------------------
43Preface
44------------------------------------------------------------------------------
45
460.1 Introduction/Credits
47------------------------
48
49This documentation is part of a soon (or so we hope) to be released book on
50the SuSE Linux distribution. As there is no complete documentation for the
51/proc file system and we've used many freely available sources to write these
52chapters, it seems only fair to give the work back to the Linux community.
53This work is based on the 2.2.* kernel version and the upcoming 2.4.*. I'm
54afraid it's still far from complete, but we hope it will be useful. As far as
55we know, it is the first 'all-in-one' document about the /proc file system. It
56is focused on the Intel x86 hardware, so if you are looking for PPC, ARM,
57SPARC, AXP, etc., features, you probably won't find what you are looking for.
58It also only covers IPv4 networking, not IPv6 nor other protocols - sorry. But
59additions and patches are welcome and will be added to this document if you
60mail them to Bodo.
61
62We'd like to thank Alan Cox, Rik van Riel, and Alexey Kuznetsov and a lot of
63other people for help compiling this documentation. We'd also like to extend a
64special thank you to Andi Kleen for documentation, which we relied on heavily
65to create this document, as well as the additional information he provided.
66Thanks to everybody else who contributed source or docs to the Linux kernel
67and helped create a great piece of software... :)
68
69If you have any comments, corrections or additions, please don't hesitate to
70contact Bodo Bauer at bb@ricochet.net. We'll be happy to add them to this
71document.
72
73The latest version of this document is available online at
74http://skaro.nightcrawler.com/~bb/Docs/Proc as HTML version.
75
76If the above direction does not works for you, ypu could try the kernel
77mailing list at linux-kernel@vger.kernel.org and/or try to reach me at
78comandante@zaralinux.com.
79
800.2 Legal Stuff
81---------------
82
83We don't guarantee the correctness of this document, and if you come to us
84complaining about how you screwed up your system because of incorrect
85documentation, we won't feel responsible...
86
87------------------------------------------------------------------------------
88CHAPTER 1: COLLECTING SYSTEM INFORMATION
89------------------------------------------------------------------------------
90
91------------------------------------------------------------------------------
92In This Chapter
93------------------------------------------------------------------------------
94* Investigating the properties of the pseudo file system /proc and its
95 ability to provide information on the running Linux system
96* Examining /proc's structure
97* Uncovering various information about the kernel and the processes running
98 on the system
99------------------------------------------------------------------------------
100
101
102The proc file system acts as an interface to internal data structures in the
103kernel. It can be used to obtain information about the system and to change
104certain kernel parameters at runtime (sysctl).
105
106First, we'll take a look at the read-only parts of /proc. In Chapter 2, we
107show you how you can use /proc/sys to change settings.
108
1091.1 Process-Specific Subdirectories
110-----------------------------------
111
112The directory /proc contains (among other things) one subdirectory for each
113process running on the system, which is named after the process ID (PID).
114
115The link self points to the process reading the file system. Each process
116subdirectory has the entries listed in Table 1-1.
117
118
119Table 1-1: Process specific entries in /proc
120..............................................................................
David Rientjesb813e932007-05-06 14:49:24 -0700121 File Content
122 clear_refs Clears page referenced bits shown in smaps output
123 cmdline Command line arguments
124 cpu Current and last cpu in which it was executed (2.4)(smp)
125 cwd Link to the current working directory
126 environ Values of environment variables
127 exe Link to the executable of this process
128 fd Directory, which contains all file descriptors
129 maps Memory maps to executables and library files (2.4)
130 mem Memory held by this process
131 root Link to the root directory of this process
132 stat Process status
133 statm Process memory status information
134 status Process status in human readable form
135 wchan If CONFIG_KALLSYMS is set, a pre-decoded wchan
Ken Chen2ec220e2008-11-10 11:26:08 +0300136 stack Report full stack trace, enable via CONFIG_STACKTRACE
David Rientjesb813e932007-05-06 14:49:24 -0700137 smaps Extension based on maps, the rss size for each mapped file
Linus Torvalds1da177e2005-04-16 15:20:36 -0700138..............................................................................
139
140For example, to get the status information of a process, all you have to do is
141read the file /proc/PID/status:
142
143 >cat /proc/self/status
144 Name: cat
145 State: R (running)
146 Pid: 5452
147 PPid: 743
148 TracerPid: 0 (2.4)
149 Uid: 501 501 501 501
150 Gid: 100 100 100 100
151 Groups: 100 14 16
152 VmSize: 1112 kB
153 VmLck: 0 kB
154 VmRSS: 348 kB
155 VmData: 24 kB
156 VmStk: 12 kB
157 VmExe: 8 kB
158 VmLib: 1044 kB
159 SigPnd: 0000000000000000
160 SigBlk: 0000000000000000
161 SigIgn: 0000000000000000
162 SigCgt: 0000000000000000
163 CapInh: 00000000fffffeff
164 CapPrm: 0000000000000000
165 CapEff: 0000000000000000
166
167
168This shows you nearly the same information you would get if you viewed it with
169the ps command. In fact, ps uses the proc file system to obtain its
170information. The statm file contains more detailed information about the
Kees Cook18d96772007-07-15 23:40:38 -0700171process memory usage. Its seven fields are explained in Table 1-2. The stat
172file contains details information about the process itself. Its fields are
173explained in Table 1-3.
Linus Torvalds1da177e2005-04-16 15:20:36 -0700174
175
176Table 1-2: Contents of the statm files (as of 2.6.8-rc3)
177..............................................................................
178 Field Content
179 size total program size (pages) (same as VmSize in status)
180 resident size of memory portions (pages) (same as VmRSS in status)
181 shared number of pages that are shared (i.e. backed by a file)
182 trs number of pages that are 'code' (not including libs; broken,
183 includes data segment)
184 lrs number of pages of library (always 0 on 2.6)
185 drs number of pages of data/stack (including libs; broken,
186 includes library text)
187 dt number of dirty pages (always 0 on 2.6)
188..............................................................................
189
Kees Cook18d96772007-07-15 23:40:38 -0700190
191Table 1-3: Contents of the stat files (as of 2.6.22-rc3)
192..............................................................................
193 Field Content
194 pid process id
195 tcomm filename of the executable
196 state state (R is running, S is sleeping, D is sleeping in an
197 uninterruptible wait, Z is zombie, T is traced or stopped)
198 ppid process id of the parent process
199 pgrp pgrp of the process
200 sid session id
201 tty_nr tty the process uses
202 tty_pgrp pgrp of the tty
203 flags task flags
204 min_flt number of minor faults
205 cmin_flt number of minor faults with child's
206 maj_flt number of major faults
207 cmaj_flt number of major faults with child's
208 utime user mode jiffies
209 stime kernel mode jiffies
210 cutime user mode jiffies with child's
211 cstime kernel mode jiffies with child's
212 priority priority level
213 nice nice level
214 num_threads number of threads
Leonardo Chiquitto2e01e002008-02-03 16:17:16 +0200215 it_real_value (obsolete, always 0)
Kees Cook18d96772007-07-15 23:40:38 -0700216 start_time time the process started after system boot
217 vsize virtual memory size
218 rss resident set memory size
219 rsslim current limit in bytes on the rss
220 start_code address above which program text can run
221 end_code address below which program text can run
222 start_stack address of the start of the stack
223 esp current value of ESP
224 eip current value of EIP
225 pending bitmap of pending signals (obsolete)
226 blocked bitmap of blocked signals (obsolete)
227 sigign bitmap of ignored signals (obsolete)
228 sigcatch bitmap of catched signals (obsolete)
229 wchan address where process went to sleep
230 0 (place holder)
231 0 (place holder)
232 exit_signal signal to send to parent thread on exit
233 task_cpu which CPU the task is scheduled on
234 rt_priority realtime priority
235 policy scheduling policy (man sched_setscheduler)
236 blkio_ticks time spent waiting for block IO
237..............................................................................
238
239
Linus Torvalds1da177e2005-04-16 15:20:36 -07002401.2 Kernel data
241---------------
242
243Similar to the process entries, the kernel data files give information about
244the running kernel. The files used to obtain this information are contained in
Kees Cook18d96772007-07-15 23:40:38 -0700245/proc and are listed in Table 1-4. Not all of these will be present in your
Linus Torvalds1da177e2005-04-16 15:20:36 -0700246system. It depends on the kernel configuration and the loaded modules, which
247files are there, and which are missing.
248
Kees Cook18d96772007-07-15 23:40:38 -0700249Table 1-4: Kernel info in /proc
Linus Torvalds1da177e2005-04-16 15:20:36 -0700250..............................................................................
251 File Content
252 apm Advanced power management info
253 buddyinfo Kernel memory allocator information (see text) (2.5)
254 bus Directory containing bus specific information
255 cmdline Kernel command line
256 cpuinfo Info about the CPU
257 devices Available devices (block and character)
258 dma Used DMS channels
259 filesystems Supported filesystems
260 driver Various drivers grouped here, currently rtc (2.4)
261 execdomains Execdomains, related to security (2.4)
262 fb Frame Buffer devices (2.4)
263 fs File system parameters, currently nfs/exports (2.4)
264 ide Directory containing info about the IDE subsystem
265 interrupts Interrupt usage
266 iomem Memory map (2.4)
267 ioports I/O port usage
268 irq Masks for irq to cpu affinity (2.4)(smp?)
269 isapnp ISA PnP (Plug&Play) Info (2.4)
270 kcore Kernel core image (can be ELF or A.OUT(deprecated in 2.4))
271 kmsg Kernel messages
272 ksyms Kernel symbol table
273 loadavg Load average of last 1, 5 & 15 minutes
274 locks Kernel locks
275 meminfo Memory info
276 misc Miscellaneous
277 modules List of loaded modules
278 mounts Mounted filesystems
279 net Networking info (see text)
280 partitions Table of partitions known to the system
Randy Dunlap8b607562007-05-09 07:19:14 +0200281 pci Deprecated info of PCI bus (new way -> /proc/bus/pci/,
Linus Torvalds1da177e2005-04-16 15:20:36 -0700282 decoupled by lspci (2.4)
283 rtc Real time clock
284 scsi SCSI info (see text)
285 slabinfo Slab pool info
286 stat Overall statistics
287 swaps Swap space utilization
288 sys See chapter 2
289 sysvipc Info of SysVIPC Resources (msg, sem, shm) (2.4)
290 tty Info of tty drivers
291 uptime System uptime
292 version Kernel version
293 video bttv info of video resources (2.4)
Eric Dumazeta47a1262008-07-23 21:27:38 -0700294 vmallocinfo Show vmalloced areas
Linus Torvalds1da177e2005-04-16 15:20:36 -0700295..............................................................................
296
297You can, for example, check which interrupts are currently in use and what
298they are used for by looking in the file /proc/interrupts:
299
300 > cat /proc/interrupts
301 CPU0
302 0: 8728810 XT-PIC timer
303 1: 895 XT-PIC keyboard
304 2: 0 XT-PIC cascade
305 3: 531695 XT-PIC aha152x
306 4: 2014133 XT-PIC serial
307 5: 44401 XT-PIC pcnet_cs
308 8: 2 XT-PIC rtc
309 11: 8 XT-PIC i82365
310 12: 182918 XT-PIC PS/2 Mouse
311 13: 1 XT-PIC fpu
312 14: 1232265 XT-PIC ide0
313 15: 7 XT-PIC ide1
314 NMI: 0
315
316In 2.4.* a couple of lines where added to this file LOC & ERR (this time is the
317output of a SMP machine):
318
319 > cat /proc/interrupts
320
321 CPU0 CPU1
322 0: 1243498 1214548 IO-APIC-edge timer
323 1: 8949 8958 IO-APIC-edge keyboard
324 2: 0 0 XT-PIC cascade
325 5: 11286 10161 IO-APIC-edge soundblaster
326 8: 1 0 IO-APIC-edge rtc
327 9: 27422 27407 IO-APIC-edge 3c503
328 12: 113645 113873 IO-APIC-edge PS/2 Mouse
329 13: 0 0 XT-PIC fpu
330 14: 22491 24012 IO-APIC-edge ide0
331 15: 2183 2415 IO-APIC-edge ide1
332 17: 30564 30414 IO-APIC-level eth0
333 18: 177 164 IO-APIC-level bttv
334 NMI: 2457961 2457959
335 LOC: 2457882 2457881
336 ERR: 2155
337
338NMI is incremented in this case because every timer interrupt generates a NMI
339(Non Maskable Interrupt) which is used by the NMI Watchdog to detect lockups.
340
341LOC is the local interrupt counter of the internal APIC of every CPU.
342
343ERR is incremented in the case of errors in the IO-APIC bus (the bus that
344connects the CPUs in a SMP system. This means that an error has been detected,
345the IO-APIC automatically retry the transmission, so it should not be a big
346problem, but you should read the SMP-FAQ.
347
Joe Korty38e760a2007-10-17 18:04:40 +0200348In 2.6.2* /proc/interrupts was expanded again. This time the goal was for
349/proc/interrupts to display every IRQ vector in use by the system, not
350just those considered 'most important'. The new vectors are:
351
352 THR -- interrupt raised when a machine check threshold counter
353 (typically counting ECC corrected errors of memory or cache) exceeds
354 a configurable threshold. Only available on some systems.
355
356 TRM -- a thermal event interrupt occurs when a temperature threshold
357 has been exceeded for the CPU. This interrupt may also be generated
358 when the temperature drops back to normal.
359
360 SPU -- a spurious interrupt is some interrupt that was raised then lowered
361 by some IO device before it could be fully processed by the APIC. Hence
362 the APIC sees the interrupt but does not know what device it came from.
363 For this case the APIC will generate the interrupt with a IRQ vector
364 of 0xff. This might also be generated by chipset bugs.
365
366 RES, CAL, TLB -- rescheduling, call and TLB flush interrupts are
367 sent from one CPU to another per the needs of the OS. Typically,
368 their statistics are used by kernel developers and interested users to
Matt LaPlante19f59462009-04-27 15:06:31 +0200369 determine the occurrence of interrupts of the given type.
Joe Korty38e760a2007-10-17 18:04:40 +0200370
371The above IRQ vectors are displayed only when relevent. For example,
372the threshold vector does not exist on x86_64 platforms. Others are
373suppressed when the system is a uniprocessor. As of this writing, only
374i386 and x86_64 platforms support the new IRQ vector displays.
375
376Of some interest is the introduction of the /proc/irq directory to 2.4.
Linus Torvalds1da177e2005-04-16 15:20:36 -0700377It could be used to set IRQ to CPU affinity, this means that you can "hook" an
378IRQ to only one CPU, or to exclude a CPU of handling IRQs. The contents of the
Max Krasnyansky18404752008-05-29 11:02:52 -0700379irq subdir is one subdir for each IRQ, and two files; default_smp_affinity and
380prof_cpu_mask.
Linus Torvalds1da177e2005-04-16 15:20:36 -0700381
382For example
383 > ls /proc/irq/
384 0 10 12 14 16 18 2 4 6 8 prof_cpu_mask
Max Krasnyansky18404752008-05-29 11:02:52 -0700385 1 11 13 15 17 19 3 5 7 9 default_smp_affinity
Linus Torvalds1da177e2005-04-16 15:20:36 -0700386 > ls /proc/irq/0/
387 smp_affinity
388
Max Krasnyansky18404752008-05-29 11:02:52 -0700389smp_affinity is a bitmask, in which you can specify which CPUs can handle the
390IRQ, you can set it by doing:
Linus Torvalds1da177e2005-04-16 15:20:36 -0700391
Max Krasnyansky18404752008-05-29 11:02:52 -0700392 > echo 1 > /proc/irq/10/smp_affinity
393
394This means that only the first CPU will handle the IRQ, but you can also echo
3955 which means that only the first and fourth CPU can handle the IRQ.
396
397The contents of each smp_affinity file is the same by default:
398
399 > cat /proc/irq/0/smp_affinity
Linus Torvalds1da177e2005-04-16 15:20:36 -0700400 ffffffff
401
Max Krasnyansky18404752008-05-29 11:02:52 -0700402The default_smp_affinity mask applies to all non-active IRQs, which are the
403IRQs which have not yet been allocated/activated, and hence which lack a
404/proc/irq/[0-9]* directory.
Linus Torvalds1da177e2005-04-16 15:20:36 -0700405
Max Krasnyansky18404752008-05-29 11:02:52 -0700406prof_cpu_mask specifies which CPUs are to be profiled by the system wide
407profiler. Default value is ffffffff (all cpus).
Linus Torvalds1da177e2005-04-16 15:20:36 -0700408
409The way IRQs are routed is handled by the IO-APIC, and it's Round Robin
410between all the CPUs which are allowed to handle it. As usual the kernel has
411more info than you and does a better job than you, so the defaults are the
412best choice for almost everyone.
413
414There are three more important subdirectories in /proc: net, scsi, and sys.
415The general rule is that the contents, or even the existence of these
416directories, depend on your kernel configuration. If SCSI is not enabled, the
417directory scsi may not exist. The same is true with the net, which is there
418only when networking support is present in the running kernel.
419
420The slabinfo file gives information about memory usage at the slab level.
421Linux uses slab pools for memory management above page level in version 2.2.
422Commonly used objects have their own slab pool (such as network buffers,
423directory cache, and so on).
424
425..............................................................................
426
427> cat /proc/buddyinfo
428
429Node 0, zone DMA 0 4 5 4 4 3 ...
430Node 0, zone Normal 1 0 0 1 101 8 ...
431Node 0, zone HighMem 2 0 0 1 1 0 ...
432
433Memory fragmentation is a problem under some workloads, and buddyinfo is a
434useful tool for helping diagnose these problems. Buddyinfo will give you a
435clue as to how big an area you can safely allocate, or why a previous
436allocation failed.
437
438Each column represents the number of pages of a certain order which are
439available. In this case, there are 0 chunks of 2^0*PAGE_SIZE available in
440ZONE_DMA, 4 chunks of 2^1*PAGE_SIZE in ZONE_DMA, 101 chunks of 2^4*PAGE_SIZE
441available in ZONE_NORMAL, etc...
442
443..............................................................................
444
445meminfo:
446
447Provides information about distribution and utilization of memory. This
448varies by architecture and compile options. The following is from a
44916GB PIII, which has highmem enabled. You may not have all of these fields.
450
451> cat /proc/meminfo
452
453
454MemTotal: 16344972 kB
455MemFree: 13634064 kB
456Buffers: 3656 kB
457Cached: 1195708 kB
458SwapCached: 0 kB
459Active: 891636 kB
460Inactive: 1077224 kB
461HighTotal: 15597528 kB
462HighFree: 13629632 kB
463LowTotal: 747444 kB
464LowFree: 4432 kB
465SwapTotal: 0 kB
466SwapFree: 0 kB
467Dirty: 968 kB
468Writeback: 0 kB
Miklos Szeredib88473f2008-04-30 00:54:39 -0700469AnonPages: 861800 kB
Linus Torvalds1da177e2005-04-16 15:20:36 -0700470Mapped: 280372 kB
Miklos Szeredib88473f2008-04-30 00:54:39 -0700471Slab: 284364 kB
472SReclaimable: 159856 kB
473SUnreclaim: 124508 kB
474PageTables: 24448 kB
475NFS_Unstable: 0 kB
476Bounce: 0 kB
477WritebackTmp: 0 kB
Linus Torvalds1da177e2005-04-16 15:20:36 -0700478CommitLimit: 7669796 kB
479Committed_AS: 100056 kB
Linus Torvalds1da177e2005-04-16 15:20:36 -0700480VmallocTotal: 112216 kB
481VmallocUsed: 428 kB
482VmallocChunk: 111088 kB
483
484 MemTotal: Total usable ram (i.e. physical ram minus a few reserved
485 bits and the kernel binary code)
486 MemFree: The sum of LowFree+HighFree
487 Buffers: Relatively temporary storage for raw disk blocks
488 shouldn't get tremendously large (20MB or so)
489 Cached: in-memory cache for files read from the disk (the
490 pagecache). Doesn't include SwapCached
491 SwapCached: Memory that once was swapped out, is swapped back in but
492 still also is in the swapfile (if memory is needed it
493 doesn't need to be swapped out AGAIN because it is already
494 in the swapfile. This saves I/O)
495 Active: Memory that has been used more recently and usually not
496 reclaimed unless absolutely necessary.
497 Inactive: Memory which has been less recently used. It is more
498 eligible to be reclaimed for other purposes
499 HighTotal:
500 HighFree: Highmem is all memory above ~860MB of physical memory
501 Highmem areas are for use by userspace programs, or
502 for the pagecache. The kernel must use tricks to access
503 this memory, making it slower to access than lowmem.
504 LowTotal:
505 LowFree: Lowmem is memory which can be used for everything that
Matt LaPlante3f6dee92006-10-03 22:45:33 +0200506 highmem can be used for, but it is also available for the
Linus Torvalds1da177e2005-04-16 15:20:36 -0700507 kernel's use for its own data structures. Among many
508 other things, it is where everything from the Slab is
509 allocated. Bad things happen when you're out of lowmem.
510 SwapTotal: total amount of swap space available
511 SwapFree: Memory which has been evicted from RAM, and is temporarily
512 on the disk
513 Dirty: Memory which is waiting to get written back to the disk
514 Writeback: Memory which is actively being written back to the disk
Miklos Szeredib88473f2008-04-30 00:54:39 -0700515 AnonPages: Non-file backed pages mapped into userspace page tables
Linus Torvalds1da177e2005-04-16 15:20:36 -0700516 Mapped: files which have been mmaped, such as libraries
Adrian Bunke82443c2006-01-10 00:20:30 +0100517 Slab: in-kernel data structures cache
Miklos Szeredib88473f2008-04-30 00:54:39 -0700518SReclaimable: Part of Slab, that might be reclaimed, such as caches
519 SUnreclaim: Part of Slab, that cannot be reclaimed on memory pressure
520 PageTables: amount of memory dedicated to the lowest level of page
521 tables.
522NFS_Unstable: NFS pages sent to the server, but not yet committed to stable
523 storage
524 Bounce: Memory used for block device "bounce buffers"
525WritebackTmp: Memory used by FUSE for temporary writeback buffers
Linus Torvalds1da177e2005-04-16 15:20:36 -0700526 CommitLimit: Based on the overcommit ratio ('vm.overcommit_ratio'),
527 this is the total amount of memory currently available to
528 be allocated on the system. This limit is only adhered to
529 if strict overcommit accounting is enabled (mode 2 in
530 'vm.overcommit_memory').
531 The CommitLimit is calculated with the following formula:
532 CommitLimit = ('vm.overcommit_ratio' * Physical RAM) + Swap
533 For example, on a system with 1G of physical RAM and 7G
534 of swap with a `vm.overcommit_ratio` of 30 it would
535 yield a CommitLimit of 7.3G.
536 For more details, see the memory overcommit documentation
537 in vm/overcommit-accounting.
538Committed_AS: The amount of memory presently allocated on the system.
539 The committed memory is a sum of all of the memory which
540 has been allocated by processes, even if it has not been
541 "used" by them as of yet. A process which malloc()'s 1G
542 of memory, but only touches 300M of it will only show up
543 as using 300M of memory even if it has the address space
544 allocated for the entire 1G. This 1G is memory which has
545 been "committed" to by the VM and can be used at any time
546 by the allocating application. With strict overcommit
547 enabled on the system (mode 2 in 'vm.overcommit_memory'),
548 allocations which would exceed the CommitLimit (detailed
549 above) will not be permitted. This is useful if one needs
550 to guarantee that processes will not fail due to lack of
551 memory once that memory has been successfully allocated.
Linus Torvalds1da177e2005-04-16 15:20:36 -0700552VmallocTotal: total size of vmalloc memory area
553 VmallocUsed: amount of vmalloc area which is used
Matt LaPlante19f59462009-04-27 15:06:31 +0200554VmallocChunk: largest contiguous block of vmalloc area which is free
Linus Torvalds1da177e2005-04-16 15:20:36 -0700555
Eric Dumazeta47a1262008-07-23 21:27:38 -0700556..............................................................................
557
558vmallocinfo:
559
560Provides information about vmalloced/vmaped areas. One line per area,
561containing the virtual address range of the area, size in bytes,
562caller information of the creator, and optional information depending
563on the kind of area :
564
565 pages=nr number of pages
566 phys=addr if a physical address was specified
567 ioremap I/O mapping (ioremap() and friends)
568 vmalloc vmalloc() area
569 vmap vmap()ed pages
570 user VM_USERMAP area
571 vpages buffer for pages pointers was vmalloced (huge area)
572 N<node>=nr (Only on NUMA kernels)
573 Number of pages allocated on memory node <node>
574
575> cat /proc/vmallocinfo
5760xffffc20000000000-0xffffc20000201000 2101248 alloc_large_system_hash+0x204 ...
577 /0x2c0 pages=512 vmalloc N0=128 N1=128 N2=128 N3=128
5780xffffc20000201000-0xffffc20000302000 1052672 alloc_large_system_hash+0x204 ...
579 /0x2c0 pages=256 vmalloc N0=64 N1=64 N2=64 N3=64
5800xffffc20000302000-0xffffc20000304000 8192 acpi_tb_verify_table+0x21/0x4f...
581 phys=7fee8000 ioremap
5820xffffc20000304000-0xffffc20000307000 12288 acpi_tb_verify_table+0x21/0x4f...
583 phys=7fee7000 ioremap
5840xffffc2000031d000-0xffffc2000031f000 8192 init_vdso_vars+0x112/0x210
5850xffffc2000031f000-0xffffc2000032b000 49152 cramfs_uncompress_init+0x2e ...
586 /0x80 pages=11 vmalloc N0=3 N1=3 N2=2 N3=3
5870xffffc2000033a000-0xffffc2000033d000 12288 sys_swapon+0x640/0xac0 ...
588 pages=2 vmalloc N1=2
5890xffffc20000347000-0xffffc2000034c000 20480 xt_alloc_table_info+0xfe ...
590 /0x130 [x_tables] pages=4 vmalloc N0=4
5910xffffffffa0000000-0xffffffffa000f000 61440 sys_init_module+0xc27/0x1d00 ...
592 pages=14 vmalloc N2=14
5930xffffffffa000f000-0xffffffffa0014000 20480 sys_init_module+0xc27/0x1d00 ...
594 pages=4 vmalloc N1=4
5950xffffffffa0014000-0xffffffffa0017000 12288 sys_init_module+0xc27/0x1d00 ...
596 pages=2 vmalloc N1=2
5970xffffffffa0017000-0xffffffffa0022000 45056 sys_init_module+0xc27/0x1d00 ...
598 pages=10 vmalloc N0=10
Linus Torvalds1da177e2005-04-16 15:20:36 -0700599
6001.3 IDE devices in /proc/ide
601----------------------------
602
603The subdirectory /proc/ide contains information about all IDE devices of which
604the kernel is aware. There is one subdirectory for each IDE controller, the
605file drivers and a link for each IDE device, pointing to the device directory
606in the controller specific subtree.
607
608The file drivers contains general information about the drivers used for the
609IDE devices:
610
611 > cat /proc/ide/drivers
612 ide-cdrom version 4.53
613 ide-disk version 1.08
614
615More detailed information can be found in the controller specific
616subdirectories. These are named ide0, ide1 and so on. Each of these
Kees Cook18d96772007-07-15 23:40:38 -0700617directories contains the files shown in table 1-5.
Linus Torvalds1da177e2005-04-16 15:20:36 -0700618
619
Kees Cook18d96772007-07-15 23:40:38 -0700620Table 1-5: IDE controller info in /proc/ide/ide?
Linus Torvalds1da177e2005-04-16 15:20:36 -0700621..............................................................................
622 File Content
623 channel IDE channel (0 or 1)
624 config Configuration (only for PCI/IDE bridge)
625 mate Mate name
626 model Type/Chipset of IDE controller
627..............................................................................
628
629Each device connected to a controller has a separate subdirectory in the
Kees Cook18d96772007-07-15 23:40:38 -0700630controllers directory. The files listed in table 1-6 are contained in these
Linus Torvalds1da177e2005-04-16 15:20:36 -0700631directories.
632
633
Kees Cook18d96772007-07-15 23:40:38 -0700634Table 1-6: IDE device information
Linus Torvalds1da177e2005-04-16 15:20:36 -0700635..............................................................................
636 File Content
637 cache The cache
638 capacity Capacity of the medium (in 512Byte blocks)
639 driver driver and version
640 geometry physical and logical geometry
641 identify device identify block
642 media media type
643 model device identifier
644 settings device setup
645 smart_thresholds IDE disk management thresholds
646 smart_values IDE disk management values
647..............................................................................
648
649The most interesting file is settings. This file contains a nice overview of
650the drive parameters:
651
652 # cat /proc/ide/ide0/hda/settings
653 name value min max mode
654 ---- ----- --- --- ----
655 bios_cyl 526 0 65535 rw
656 bios_head 255 0 255 rw
657 bios_sect 63 0 63 rw
658 breada_readahead 4 0 127 rw
659 bswap 0 0 1 r
660 file_readahead 72 0 2097151 rw
661 io_32bit 0 0 3 rw
662 keepsettings 0 0 1 rw
663 max_kb_per_request 122 1 127 rw
664 multcount 0 0 8 rw
665 nice1 1 0 1 rw
666 nowerr 0 0 1 rw
667 pio_mode write-only 0 255 w
668 slow 0 0 1 rw
669 unmaskirq 0 0 1 rw
670 using_dma 0 0 1 rw
671
672
6731.4 Networking info in /proc/net
674--------------------------------
675
676The subdirectory /proc/net follows the usual pattern. Table 1-6 shows the
677additional values you get for IP version 6 if you configure the kernel to
678support this. Table 1-7 lists the files and their meaning.
679
680
681Table 1-6: IPv6 info in /proc/net
682..............................................................................
683 File Content
684 udp6 UDP sockets (IPv6)
685 tcp6 TCP sockets (IPv6)
686 raw6 Raw device statistics (IPv6)
687 igmp6 IP multicast addresses, which this host joined (IPv6)
688 if_inet6 List of IPv6 interface addresses
689 ipv6_route Kernel routing table for IPv6
690 rt6_stats Global IPv6 routing tables statistics
691 sockstat6 Socket statistics (IPv6)
692 snmp6 Snmp data (IPv6)
693..............................................................................
694
695
696Table 1-7: Network info in /proc/net
697..............................................................................
698 File Content
699 arp Kernel ARP table
700 dev network devices with statistics
701 dev_mcast the Layer2 multicast groups a device is listening too
702 (interface index, label, number of references, number of bound
703 addresses).
704 dev_stat network device status
705 ip_fwchains Firewall chain linkage
706 ip_fwnames Firewall chain names
707 ip_masq Directory containing the masquerading tables
708 ip_masquerade Major masquerading table
709 netstat Network statistics
710 raw raw device statistics
711 route Kernel routing table
712 rpc Directory containing rpc info
713 rt_cache Routing cache
714 snmp SNMP data
715 sockstat Socket statistics
716 tcp TCP sockets
717 tr_rif Token ring RIF routing table
718 udp UDP sockets
719 unix UNIX domain sockets
720 wireless Wireless interface data (Wavelan etc)
721 igmp IP multicast addresses, which this host joined
722 psched Global packet scheduler parameters.
723 netlink List of PF_NETLINK sockets
724 ip_mr_vifs List of multicast virtual interfaces
725 ip_mr_cache List of multicast routing cache
726..............................................................................
727
728You can use this information to see which network devices are available in
729your system and how much traffic was routed over those devices:
730
731 > cat /proc/net/dev
732 Inter-|Receive |[...
733 face |bytes packets errs drop fifo frame compressed multicast|[...
734 lo: 908188 5596 0 0 0 0 0 0 [...
735 ppp0:15475140 20721 410 0 0 410 0 0 [...
736 eth0: 614530 7085 0 0 0 0 0 1 [...
737
738 ...] Transmit
739 ...] bytes packets errs drop fifo colls carrier compressed
740 ...] 908188 5596 0 0 0 0 0 0
741 ...] 1375103 17405 0 0 0 0 0 0
742 ...] 1703981 5535 0 0 0 3 0 0
743
744In addition, each Channel Bond interface has it's own directory. For
745example, the bond0 device will have a directory called /proc/net/bond0/.
746It will contain information that is specific to that bond, such as the
747current slaves of the bond, the link status of the slaves, and how
748many times the slaves link has failed.
749
7501.5 SCSI info
751-------------
752
753If you have a SCSI host adapter in your system, you'll find a subdirectory
754named after the driver for this adapter in /proc/scsi. You'll also see a list
755of all recognized SCSI devices in /proc/scsi:
756
757 >cat /proc/scsi/scsi
758 Attached devices:
759 Host: scsi0 Channel: 00 Id: 00 Lun: 00
760 Vendor: IBM Model: DGHS09U Rev: 03E0
761 Type: Direct-Access ANSI SCSI revision: 03
762 Host: scsi0 Channel: 00 Id: 06 Lun: 00
763 Vendor: PIONEER Model: CD-ROM DR-U06S Rev: 1.04
764 Type: CD-ROM ANSI SCSI revision: 02
765
766
767The directory named after the driver has one file for each adapter found in
768the system. These files contain information about the controller, including
769the used IRQ and the IO address range. The amount of information shown is
770dependent on the adapter you use. The example shows the output for an Adaptec
771AHA-2940 SCSI adapter:
772
773 > cat /proc/scsi/aic7xxx/0
774
775 Adaptec AIC7xxx driver version: 5.1.19/3.2.4
776 Compile Options:
777 TCQ Enabled By Default : Disabled
778 AIC7XXX_PROC_STATS : Disabled
779 AIC7XXX_RESET_DELAY : 5
780 Adapter Configuration:
781 SCSI Adapter: Adaptec AHA-294X Ultra SCSI host adapter
782 Ultra Wide Controller
783 PCI MMAPed I/O Base: 0xeb001000
784 Adapter SEEPROM Config: SEEPROM found and used.
785 Adaptec SCSI BIOS: Enabled
786 IRQ: 10
787 SCBs: Active 0, Max Active 2,
788 Allocated 15, HW 16, Page 255
789 Interrupts: 160328
790 BIOS Control Word: 0x18b6
791 Adapter Control Word: 0x005b
792 Extended Translation: Enabled
793 Disconnect Enable Flags: 0xffff
794 Ultra Enable Flags: 0x0001
795 Tag Queue Enable Flags: 0x0000
796 Ordered Queue Tag Flags: 0x0000
797 Default Tag Queue Depth: 8
798 Tagged Queue By Device array for aic7xxx host instance 0:
799 {255,255,255,255,255,255,255,255,255,255,255,255,255,255,255,255}
800 Actual queue depth per device for aic7xxx host instance 0:
801 {1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1}
802 Statistics:
803 (scsi0:0:0:0)
804 Device using Wide/Sync transfers at 40.0 MByte/sec, offset 8
805 Transinfo settings: current(12/8/1/0), goal(12/8/1/0), user(12/15/1/0)
806 Total transfers 160151 (74577 reads and 85574 writes)
807 (scsi0:0:6:0)
808 Device using Narrow/Sync transfers at 5.0 MByte/sec, offset 15
809 Transinfo settings: current(50/15/0/0), goal(50/15/0/0), user(50/15/0/0)
810 Total transfers 0 (0 reads and 0 writes)
811
812
8131.6 Parallel port info in /proc/parport
814---------------------------------------
815
816The directory /proc/parport contains information about the parallel ports of
817your system. It has one subdirectory for each port, named after the port
818number (0,1,2,...).
819
820These directories contain the four files shown in Table 1-8.
821
822
823Table 1-8: Files in /proc/parport
824..............................................................................
825 File Content
826 autoprobe Any IEEE-1284 device ID information that has been acquired.
827 devices list of the device drivers using that port. A + will appear by the
828 name of the device currently using the port (it might not appear
829 against any).
830 hardware Parallel port's base address, IRQ line and DMA channel.
831 irq IRQ that parport is using for that port. This is in a separate
832 file to allow you to alter it by writing a new value in (IRQ
833 number or none).
834..............................................................................
835
8361.7 TTY info in /proc/tty
837-------------------------
838
839Information about the available and actually used tty's can be found in the
840directory /proc/tty.You'll find entries for drivers and line disciplines in
841this directory, as shown in Table 1-9.
842
843
844Table 1-9: Files in /proc/tty
845..............................................................................
846 File Content
847 drivers list of drivers and their usage
848 ldiscs registered line disciplines
849 driver/serial usage statistic and status of single tty lines
850..............................................................................
851
852To see which tty's are currently in use, you can simply look into the file
853/proc/tty/drivers:
854
855 > cat /proc/tty/drivers
856 pty_slave /dev/pts 136 0-255 pty:slave
857 pty_master /dev/ptm 128 0-255 pty:master
858 pty_slave /dev/ttyp 3 0-255 pty:slave
859 pty_master /dev/pty 2 0-255 pty:master
860 serial /dev/cua 5 64-67 serial:callout
861 serial /dev/ttyS 4 64-67 serial
862 /dev/tty0 /dev/tty0 4 0 system:vtmaster
863 /dev/ptmx /dev/ptmx 5 2 system
864 /dev/console /dev/console 5 1 system:console
865 /dev/tty /dev/tty 5 0 system:/dev/tty
866 unknown /dev/tty 4 1-63 console
867
868
8691.8 Miscellaneous kernel statistics in /proc/stat
870-------------------------------------------------
871
872Various pieces of information about kernel activity are available in the
873/proc/stat file. All of the numbers reported in this file are aggregates
874since the system first booted. For a quick look, simply cat the file:
875
876 > cat /proc/stat
Leonardo Chiquittob68f2c3a2007-10-20 03:03:38 +0200877 cpu 2255 34 2290 22625563 6290 127 456 0
878 cpu0 1132 34 1441 11311718 3675 127 438 0
879 cpu1 1123 0 849 11313845 2614 0 18 0
Linus Torvalds1da177e2005-04-16 15:20:36 -0700880 intr 114930548 113199788 3 0 5 263 0 4 [... lots more numbers ...]
881 ctxt 1990473
882 btime 1062191376
883 processes 2915
884 procs_running 1
885 procs_blocked 0
886
887The very first "cpu" line aggregates the numbers in all of the other "cpuN"
888lines. These numbers identify the amount of time the CPU has spent performing
889different kinds of work. Time units are in USER_HZ (typically hundredths of a
890second). The meanings of the columns are as follows, from left to right:
891
892- user: normal processes executing in user mode
893- nice: niced processes executing in user mode
894- system: processes executing in kernel mode
895- idle: twiddling thumbs
896- iowait: waiting for I/O to complete
897- irq: servicing interrupts
898- softirq: servicing softirqs
Leonardo Chiquittob68f2c3a2007-10-20 03:03:38 +0200899- steal: involuntary wait
Linus Torvalds1da177e2005-04-16 15:20:36 -0700900
901The "intr" line gives counts of interrupts serviced since boot time, for each
902of the possible system interrupts. The first column is the total of all
903interrupts serviced; each subsequent column is the total for that particular
904interrupt.
905
906The "ctxt" line gives the total number of context switches across all CPUs.
907
908The "btime" line gives the time at which the system booted, in seconds since
909the Unix epoch.
910
911The "processes" line gives the number of processes and threads created, which
912includes (but is not limited to) those created by calls to the fork() and
913clone() system calls.
914
915The "procs_running" line gives the number of processes currently running on
916CPUs.
917
918The "procs_blocked" line gives the number of processes currently blocked,
919waiting for I/O to complete.
920
Theodore Ts'o37515fa2008-10-09 23:21:54 -0400921
Alex Tomasc9de5602008-01-29 00:19:52 -05009221.9 Ext4 file system parameters
923------------------------------
Alex Tomasc9de5602008-01-29 00:19:52 -0500924
Theodore Ts'o37515fa2008-10-09 23:21:54 -0400925Information about mounted ext4 file systems can be found in
926/proc/fs/ext4. Each mounted filesystem will have a directory in
927/proc/fs/ext4 based on its device name (i.e., /proc/fs/ext4/hdc or
928/proc/fs/ext4/dm-0). The files in each per-device directory are shown
929in Table 1-10, below.
Alex Tomasc9de5602008-01-29 00:19:52 -0500930
Theodore Ts'o37515fa2008-10-09 23:21:54 -0400931Table 1-10: Files in /proc/fs/ext4/<devname>
932..............................................................................
933 File Content
934 mb_groups details of multiblock allocator buddy cache of free blocks
935 mb_history multiblock allocation history
Theodore Ts'o37515fa2008-10-09 23:21:54 -0400936..............................................................................
Alex Tomasc9de5602008-01-29 00:19:52 -0500937
Linus Torvalds1da177e2005-04-16 15:20:36 -0700938
939------------------------------------------------------------------------------
940Summary
941------------------------------------------------------------------------------
942The /proc file system serves information about the running system. It not only
943allows access to process data but also allows you to request the kernel status
944by reading files in the hierarchy.
945
946The directory structure of /proc reflects the types of information and makes
947it easy, if not obvious, where to look for specific data.
948------------------------------------------------------------------------------
949
950------------------------------------------------------------------------------
951CHAPTER 2: MODIFYING SYSTEM PARAMETERS
952------------------------------------------------------------------------------
953
954------------------------------------------------------------------------------
955In This Chapter
956------------------------------------------------------------------------------
957* Modifying kernel parameters by writing into files found in /proc/sys
958* Exploring the files which modify certain parameters
959* Review of the /proc/sys file tree
960------------------------------------------------------------------------------
961
962
963A very interesting part of /proc is the directory /proc/sys. This is not only
964a source of information, it also allows you to change parameters within the
965kernel. Be very careful when attempting this. You can optimize your system,
966but you can also cause it to crash. Never alter kernel parameters on a
967production system. Set up a development machine and test to make sure that
968everything works the way you want it to. You may have no alternative but to
969reboot the machine once an error has been made.
970
971To change a value, simply echo the new value into the file. An example is
972given below in the section on the file system data. You need to be root to do
973this. You can create your own boot script to perform this every time your
974system boots.
975
976The files in /proc/sys can be used to fine tune and monitor miscellaneous and
977general things in the operation of the Linux kernel. Since some of the files
978can inadvertently disrupt your system, it is advisable to read both
979documentation and source before actually making adjustments. In any case, be
980very careful when writing to any of these files. The entries in /proc may
981change slightly between the 2.1.* and the 2.2 kernel, so if there is any doubt
982review the kernel documentation in the directory /usr/src/linux/Documentation.
983This chapter is heavily based on the documentation included in the pre 2.2
984kernels, and became part of it in version 2.2.1 of the Linux kernel.
985
Shen Feng760df932009-04-02 16:57:20 -0700986Please see: Documentation/sysctls/ directory for descriptions of these
Peter W Morrealedb0fb182009-01-15 13:50:42 -0800987entries.
Andrew Morton9d0243b2006-01-08 01:00:39 -0800988
Shen Feng760df932009-04-02 16:57:20 -0700989------------------------------------------------------------------------------
990Summary
991------------------------------------------------------------------------------
992Certain aspects of kernel behavior can be modified at runtime, without the
993need to recompile the kernel, or even to reboot the system. The files in the
994/proc/sys tree can not only be read, but also modified. You can use the echo
995command to write value into these files, thereby changing the default settings
996of the kernel.
997------------------------------------------------------------------------------
Andrew Morton9d0243b2006-01-08 01:00:39 -0800998
Shen Feng760df932009-04-02 16:57:20 -0700999------------------------------------------------------------------------------
1000CHAPTER 3: PER-PROCESS PARAMETERS
1001------------------------------------------------------------------------------
Linus Torvalds1da177e2005-04-16 15:20:36 -07001002
Shen Feng760df932009-04-02 16:57:20 -070010033.1 /proc/<pid>/oom_adj - Adjust the oom-killer score
Jan-Frode Myklebustd7ff0db2006-09-29 01:59:45 -07001004------------------------------------------------------
1005
David Rientjes2ff05b22009-06-16 15:32:56 -07001006This file can be used to adjust the score used to select which processes should
1007be killed in an out-of-memory situation. The oom_adj value is a characteristic
1008of the task's mm, so all threads that share an mm with pid will have the same
1009oom_adj value. A high value will increase the likelihood of this process being
1010killed by the oom-killer. Valid values are in the range -16 to +15 as
1011explained below and a special value of -17, which disables oom-killing
1012altogether for threads sharing pid's mm.
Jan-Frode Myklebustd7ff0db2006-09-29 01:59:45 -07001013
Evgeniy Polyakov9e9e3cb2009-01-29 14:25:09 -08001014The process to be killed in an out-of-memory situation is selected among all others
1015based on its badness score. This value equals the original memory size of the process
1016and is then updated according to its CPU time (utime + stime) and the
1017run time (uptime - start time). The longer it runs the smaller is the score.
1018Badness score is divided by the square root of the CPU time and then by
1019the double square root of the run time.
1020
1021Swapped out tasks are killed first. Half of each child's memory size is added to
1022the parent's score if they do not share the same memory. Thus forking servers
1023are the prime candidates to be killed. Having only one 'hungry' child will make
1024parent less preferable than the child.
1025
David Rientjes2ff05b22009-06-16 15:32:56 -07001026/proc/<pid>/oom_adj cannot be changed for kthreads since they are immune from
1027oom-killing already.
1028
Evgeniy Polyakov9e9e3cb2009-01-29 14:25:09 -08001029/proc/<pid>/oom_score shows process' current badness score.
1030
1031The following heuristics are then applied:
1032 * if the task was reniced, its score doubles
1033 * superuser or direct hardware access tasks (CAP_SYS_ADMIN, CAP_SYS_RESOURCE
1034 or CAP_SYS_RAWIO) have their score divided by 4
1035 * if oom condition happened in one cpuset and checked task does not belong
1036 to it, its score is divided by 8
1037 * the resulting score is multiplied by two to the power of oom_adj, i.e.
1038 points <<= oom_adj when it is positive and
1039 points >>= -(oom_adj) otherwise
1040
1041The task with the highest badness score is then selected and its children
1042are killed, process itself will be killed in an OOM situation when it does
1043not have children or some of them disabled oom like described above.
1044
Shen Feng760df932009-04-02 16:57:20 -070010453.2 /proc/<pid>/oom_score - Display current oom-killer score
Jan-Frode Myklebustd7ff0db2006-09-29 01:59:45 -07001046-------------------------------------------------------------
1047
Jan-Frode Myklebustd7ff0db2006-09-29 01:59:45 -07001048This file can be used to check the current score used by the oom-killer is for
1049any given <pid>. Use it together with /proc/<pid>/oom_adj to tune which
1050process should be killed in an out-of-memory situation.
Linus Torvalds1da177e2005-04-16 15:20:36 -07001051
Roland Kletzingf9c99462007-03-05 00:30:54 -08001052
Shen Feng760df932009-04-02 16:57:20 -070010533.3 /proc/<pid>/io - Display the IO accounting fields
Roland Kletzingf9c99462007-03-05 00:30:54 -08001054-------------------------------------------------------
1055
1056This file contains IO statistics for each running process
1057
1058Example
1059-------
1060
1061test:/tmp # dd if=/dev/zero of=/tmp/test.dat &
1062[1] 3828
1063
1064test:/tmp # cat /proc/3828/io
1065rchar: 323934931
1066wchar: 323929600
1067syscr: 632687
1068syscw: 632675
1069read_bytes: 0
1070write_bytes: 323932160
1071cancelled_write_bytes: 0
1072
1073
1074Description
1075-----------
1076
1077rchar
1078-----
1079
1080I/O counter: chars read
1081The number of bytes which this task has caused to be read from storage. This
1082is simply the sum of bytes which this process passed to read() and pread().
1083It includes things like tty IO and it is unaffected by whether or not actual
1084physical disk IO was required (the read might have been satisfied from
1085pagecache)
1086
1087
1088wchar
1089-----
1090
1091I/O counter: chars written
1092The number of bytes which this task has caused, or shall cause to be written
1093to disk. Similar caveats apply here as with rchar.
1094
1095
1096syscr
1097-----
1098
1099I/O counter: read syscalls
1100Attempt to count the number of read I/O operations, i.e. syscalls like read()
1101and pread().
1102
1103
1104syscw
1105-----
1106
1107I/O counter: write syscalls
1108Attempt to count the number of write I/O operations, i.e. syscalls like
1109write() and pwrite().
1110
1111
1112read_bytes
1113----------
1114
1115I/O counter: bytes read
1116Attempt to count the number of bytes which this process really did cause to
1117be fetched from the storage layer. Done at the submit_bio() level, so it is
1118accurate for block-backed filesystems. <please add status regarding NFS and
1119CIFS at a later time>
1120
1121
1122write_bytes
1123-----------
1124
1125I/O counter: bytes written
1126Attempt to count the number of bytes which this process caused to be sent to
1127the storage layer. This is done at page-dirtying time.
1128
1129
1130cancelled_write_bytes
1131---------------------
1132
1133The big inaccuracy here is truncate. If a process writes 1MB to a file and
1134then deletes the file, it will in fact perform no writeout. But it will have
1135been accounted as having caused 1MB of write.
1136In other words: The number of bytes which this process caused to not happen,
1137by truncating pagecache. A task can cause "negative" IO too. If this task
1138truncates some dirty pagecache, some IO which another task has been accounted
1139for (in it's write_bytes) will not be happening. We _could_ just subtract that
1140from the truncating task's write_bytes, but there is information loss in doing
1141that.
1142
1143
1144Note
1145----
1146
1147At its current implementation state, this is a bit racy on 32-bit machines: if
1148process A reads process B's /proc/pid/io while process B is updating one of
1149those 64-bit counters, process A could see an intermediate result.
1150
1151
1152More information about this can be found within the taskstats documentation in
1153Documentation/accounting.
1154
Shen Feng760df932009-04-02 16:57:20 -070011553.4 /proc/<pid>/coredump_filter - Core dump filtering settings
Kawai, Hidehirobb901102007-07-19 01:48:31 -07001156---------------------------------------------------------------
1157When a process is dumped, all anonymous memory is written to a core file as
1158long as the size of the core file isn't limited. But sometimes we don't want
1159to dump some memory segments, for example, huge shared memory. Conversely,
1160sometimes we want to save file-backed memory segments into a core file, not
1161only the individual files.
1162
1163/proc/<pid>/coredump_filter allows you to customize which memory segments
1164will be dumped when the <pid> process is dumped. coredump_filter is a bitmask
1165of memory types. If a bit of the bitmask is set, memory segments of the
1166corresponding memory type are dumped, otherwise they are not dumped.
1167
KOSAKI Motohiroe575f112008-10-18 20:27:08 -07001168The following 7 memory types are supported:
Kawai, Hidehirobb901102007-07-19 01:48:31 -07001169 - (bit 0) anonymous private memory
1170 - (bit 1) anonymous shared memory
1171 - (bit 2) file-backed private memory
1172 - (bit 3) file-backed shared memory
Hidehiro Kawaib261dfe2008-09-13 02:33:10 -07001173 - (bit 4) ELF header pages in file-backed private memory areas (it is
1174 effective only if the bit 2 is cleared)
KOSAKI Motohiroe575f112008-10-18 20:27:08 -07001175 - (bit 5) hugetlb private memory
1176 - (bit 6) hugetlb shared memory
Kawai, Hidehirobb901102007-07-19 01:48:31 -07001177
1178 Note that MMIO pages such as frame buffer are never dumped and vDSO pages
1179 are always dumped regardless of the bitmask status.
1180
KOSAKI Motohiroe575f112008-10-18 20:27:08 -07001181 Note bit 0-4 doesn't effect any hugetlb memory. hugetlb memory are only
1182 effected by bit 5-6.
1183
1184Default value of coredump_filter is 0x23; this means all anonymous memory
1185segments and hugetlb private memory are dumped.
Kawai, Hidehirobb901102007-07-19 01:48:31 -07001186
1187If you don't want to dump all shared memory segments attached to pid 1234,
KOSAKI Motohiroe575f112008-10-18 20:27:08 -07001188write 0x21 to the process's proc file.
Kawai, Hidehirobb901102007-07-19 01:48:31 -07001189
KOSAKI Motohiroe575f112008-10-18 20:27:08 -07001190 $ echo 0x21 > /proc/1234/coredump_filter
Kawai, Hidehirobb901102007-07-19 01:48:31 -07001191
1192When a new process is created, the process inherits the bitmask status from its
1193parent. It is useful to set up coredump_filter before the program runs.
1194For example:
1195
1196 $ echo 0x7 > /proc/self/coredump_filter
1197 $ ./some_program
1198
Shen Feng760df932009-04-02 16:57:20 -070011993.5 /proc/<pid>/mountinfo - Information about mounts
Ram Pai2d4d4862008-03-27 13:06:25 +01001200--------------------------------------------------------
1201
1202This file contains lines of the form:
1203
120436 35 98:0 /mnt1 /mnt2 rw,noatime master:1 - ext3 /dev/root rw,errors=continue
1205(1)(2)(3) (4) (5) (6) (7) (8) (9) (10) (11)
1206
1207(1) mount ID: unique identifier of the mount (may be reused after umount)
1208(2) parent ID: ID of parent (or of self for the top of the mount tree)
1209(3) major:minor: value of st_dev for files on filesystem
1210(4) root: root of the mount within the filesystem
1211(5) mount point: mount point relative to the process's root
1212(6) mount options: per mount options
1213(7) optional fields: zero or more fields of the form "tag[:value]"
1214(8) separator: marks the end of the optional fields
1215(9) filesystem type: name of filesystem of the form "type[.subtype]"
1216(10) mount source: filesystem specific information or "none"
1217(11) super options: per super block options
1218
1219Parsers should ignore all unrecognised optional fields. Currently the
1220possible optional fields are:
1221
1222shared:X mount is shared in peer group X
1223master:X mount is slave to peer group X
Miklos Szeredi97e7e0f2008-03-27 13:06:26 +01001224propagate_from:X mount is slave and receives propagation from peer group X (*)
Ram Pai2d4d4862008-03-27 13:06:25 +01001225unbindable mount is unbindable
1226
Miklos Szeredi97e7e0f2008-03-27 13:06:26 +01001227(*) X is the closest dominant peer group under the process's root. If
1228X is the immediate master of the mount, or if there's no dominant peer
1229group under the same root, then only the "master:X" field is present
1230and not the "propagate_from:X" field.
1231
Ram Pai2d4d4862008-03-27 13:06:25 +01001232For more information on mount propagation see:
1233
1234 Documentation/filesystems/sharedsubtree.txt
1235