================
Delay accounting
================

Tasks encounter delays in execution when they wait
for some kernel resource to become available, e.g. a
runnable task may wait for a free CPU to run on.

The per-task delay accounting functionality measures
the delays experienced by a task while

a) waiting for a CPU (while being runnable)
b) waiting for completion of synchronous block I/O initiated by the task
c) swapping in pages
d) reclaiming memory
e) thrashing the page cache
f) performing direct compaction

and makes these statistics available to userspace through
the taskstats interface.

Such delays provide feedback for setting a task's CPU priority,
I/O priority and RSS limit values appropriately. Long delays for
an important task could be a trigger for raising its corresponding priority.

The functionality, through its use of the taskstats interface, also provides
delay statistics aggregated for all tasks (or threads) belonging to a
thread group (corresponding to a traditional Unix process). This is a commonly
needed aggregation that is more efficiently done by the kernel.

Userspace utilities, particularly resource management applications, can also
aggregate delay statistics into arbitrary groups. To enable this, delay
statistics of a task are available both during its lifetime and on its
exit, so that monitoring can be continuous and complete.

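The userspace aggregation described above can be sketched as follows. This is
only an illustration: it assumes the per-task counters have already been read
through taskstats into plain dictionaries keyed by field name, and the helper
name ``aggregate_by_group`` and the pid-to-group mapping are hypothetical.

```python
from collections import defaultdict


def aggregate_by_group(samples, group_of):
    """Sum per-task delay counters into caller-defined groups.

    samples:  {pid: {"cpu_delay_total": ns, "blkio_delay_total": ns, ...}}
    group_of: {pid: group_name}; pids without an entry are skipped.
    Both layouts are assumptions made for this sketch.
    """
    totals = defaultdict(lambda: defaultdict(int))
    for pid, counters in samples.items():
        group = group_of.get(pid)
        if group is None:
            continue  # task does not belong to any group of interest
        for name, value in counters.items():
            totals[group][name] += value
    # Convert nested defaultdicts to plain dicts for the caller.
    return {group: dict(fields) for group, fields in totals.items()}
```

A resource manager would call this periodically and once more with the exit
record of each task, so that no delay is lost when a task terminates.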

Interface
---------

Delay accounting uses the taskstats interface, which is described
in detail in a separate document in this directory. Taskstats returns a
generic data structure to userspace corresponding to per-pid and per-tgid
statistics. The delay accounting functionality populates specific fields of
this structure. See

        include/uapi/linux/taskstats.h

for a description of the fields pertaining to delay accounting.
These will generally be in the form of counters returning the cumulative
delay seen for CPU, sync block I/O, swapin, memory reclaim, thrashing page
cache, direct compaction, etc.

Taking the difference of two successive readings of a given
counter (say cpu_delay_total) for a task will give the delay
experienced by the task waiting for the corresponding resource
in that interval.

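As a minimal sketch of that subtraction, assume two readings of the same task
have been decoded into dictionaries keyed by taskstats field name. The helper
name ``interval_delay`` and the dictionary layout are hypothetical; the
counters are taken to be cumulative nanosecond values.

```python
def interval_delay(earlier, later, field="cpu_delay_total"):
    """Return the delay accumulated in `field` between two readings.

    `earlier` and `later` are dicts of cumulative counters for the same
    task (an assumed layout, not a taskstats API).
    """
    delta = later[field] - earlier[field]
    if delta < 0:
        # A cumulative counter should not decrease; a negative difference
        # suggests a counter wrap or readings from different tasks.
        raise ValueError(f"{field} went backwards between readings")
    return delta


first = {"cpu_delay_total": 3_000_000}
second = {"cpu_delay_total": 3_382_277}
print(interval_delay(first, second))  # prints 382277
```
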
When a task exits, records containing the per-task statistics
are sent to userspace without requiring a command. If it is the last exiting
task of a thread group, the per-tgid statistics are also sent. More details
are given in the taskstats interface description.

The getdelays.c userspace utility in the tools/accounting directory allows simple
commands to be run and the corresponding delay statistics to be displayed. It
also serves as an example of using the taskstats interface.

Usage
-----

Compile the kernel with::

        CONFIG_TASK_DELAY_ACCT=y
        CONFIG_TASKSTATS=y

Delay accounting is disabled by default at boot up.
To enable, add::

        delayacct

to the kernel boot options. The rest of the instructions below assume this has
been done. Alternatively, use sysctl kernel.task_delayacct to switch the state
at runtime. Note however that only tasks started after enabling it will have
delayacct information.

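A monitoring tool may want to check the runtime state before trusting the
counters. The sketch below reads the sysctl through its procfs file,
/proc/sys/kernel/task_delayacct; the helper name ``delayacct_enabled`` is
hypothetical, and the file is assumed to contain ``1`` when delay accounting
is enabled.

```python
from pathlib import Path


def delayacct_enabled(path="/proc/sys/kernel/task_delayacct"):
    """Return True/False for the sysctl state, or None if it cannot be read.

    None covers kernels built without delay accounting as well as
    permission problems.
    """
    try:
        return Path(path).read_text().strip() == "1"
    except OSError:
        return None


state = delayacct_enabled()
print({True: "enabled", False: "disabled", None: "unavailable"}[state])
```
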
After the system has booted up, use a utility
similar to getdelays.c to access the delays
seen by a given task or a task group (tgid).
The utility also allows a given command to be
executed and the corresponding delays to be
seen.

General format of the getdelays command::

        getdelays [-dilv] [-t tgid] [-p pid]

Get delays, since system boot, for pid 10::

        # ./getdelays -d -p 10
        (output similar to next case)

Get sum of delays, since system boot, for all pids with tgid 5::

        # ./getdelays -d -t 5
        print delayacct stats ON
        TGID    5


        CPU             count     real total  virtual total    delay total  delay average
                            8        7000000        6872122        3382277        0.423ms
        IO              count    delay total  delay average
                            0              0            0ms
        SWAP            count    delay total  delay average
                            0              0            0ms
        RECLAIM         count    delay total  delay average
                            0              0            0ms
        THRASHING       count    delay total  delay average
                            0              0            0ms
        COMPACT         count    delay total  delay average
                            0              0            0ms

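The "delay average" column in the output above is consistent with dividing
the delay total by the count and converting to milliseconds, assuming the
totals are in nanoseconds. A small sketch of that arithmetic (the helper name
``average_delay_ms`` is hypothetical):

```python
def average_delay_ms(delay_total_ns, count):
    """Average delay per event in milliseconds; 0.0 when nothing was counted."""
    if count == 0:
        return 0.0
    return delay_total_ns / count / 1_000_000


# CPU row from the sample output: 3382277 ns of delay over 8 events.
print(f"{average_delay_ms(3382277, 8):.3f}ms")  # prints 0.423ms
```
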
Get IO accounting for pid 1 (this works only with -p)::

        # ./getdelays -i -p 1
        printing IO accounting
        linuxrc: read=65536, write=0, cancelled_write=0

The above command can be used with -v to get more debug information.