perf-script(1)
==============

NAME
----
perf-script - Read perf.data (created by perf record) and display trace output

SYNOPSIS
--------
[verse]
'perf script' [<options>]
'perf script' [<options>] record <script> [<record-options>] <command>
'perf script' [<options>] report <script> [script-args]
'perf script' [<options>] <script> <required-script-args> [<record-options>] <command>
'perf script' [<options>] <top-script> [script-args]

DESCRIPTION
-----------
This command reads the input file and displays the trace recorded.

There are several variants of perf script:

  'perf script' to see a detailed trace of the workload that was
  recorded.

  You can also run a set of pre-canned scripts that aggregate and
  summarize the raw trace data in various ways (the list of scripts is
  available via 'perf script -l').  The following variants allow you to
  record and run those scripts:

  'perf script record <script> <command>' to record the events required
  for 'perf script report'.  <script> is the name displayed in the
  output of 'perf script --list' i.e. the actual script name minus any
  language extension.  If <command> is not specified, the events are
  recorded using the -a (system-wide) 'perf record' option.

  'perf script report <script> [args]' to run and display the results
  of <script>.  <script> is the name displayed in the output of 'perf
  script --list' i.e. the actual script name minus any language
  extension.  The perf.data output from a previous run of 'perf script
  record <script>' is used and should be present for this command to
  succeed.  [args] refers to the (mainly optional) args expected by
  the script.

  'perf script <script> <required-script-args> <command>' to both
  record the events required for <script> and to run the <script>
  using 'live-mode' i.e. without writing anything to disk.  <script>
  is the name displayed in the output of 'perf script --list' i.e. the
  actual script name minus any language extension.  If <command> is
  not specified, the events are recorded using the -a (system-wide)
  'perf record' option.  If <script> has any required args, they
  should be specified before <command>.  This mode doesn't allow for
  optional script args to be specified; if optional script args are
  desired, they can be specified using separate 'perf script record'
  and 'perf script report' commands, with the stdout of the record step
  piped to the stdin of the report script, using the '-o -' and '-i -'
  options of the corresponding commands.

  'perf script <top-script>' to both record the events required for
  <top-script> and to run the <top-script> using 'live-mode'
  i.e. without writing anything to disk.  <top-script> is the name
  displayed in the output of 'perf script --list' i.e. the actual
  script name minus any language extension; a <top-script> is defined
  as any script name ending with the string 'top'.

  [<record-options>] can be passed to the record steps of 'perf script
  record' and 'live-mode' variants; this isn't possible however for
  <top-script> 'live-mode' or 'perf script report' variants.

  See the 'SEE ALSO' section for links to language-specific
  information on how to write and run your own trace scripts.

OPTIONS
-------
<command>...::
        Any command you can specify in a shell.

-D::
--dump-raw-trace=::
        Display verbose dump of the trace data.

-L::
--Latency=::
        Show latency attributes (irqs/preemption disabled, etc).

-l::
--list=::
        Display a list of available trace scripts.

-s ['lang']::
--script=::
        Process trace data with the given script ([lang]:script[.ext]).
        If the string 'lang' is specified in place of a script name, a
        list of supported languages will be displayed instead.

-g::
--gen-script=::
        Generate perf-script.[ext] starter script for given language,
        using current perf.data.

-a::
        Force system-wide collection.  Scripts run without a <command>
        normally use -a by default, while scripts run with a <command>
        normally don't - this option allows the latter to be run in
        system-wide mode.

-i::
--input=::
        Input file name. (default: perf.data unless stdin is a fifo)

-d::
--debug-mode::
        Do various checks like samples ordering and lost events.

-F::
--fields::
        Comma separated list of fields to print. Options are:
        comm, tid, pid, time, cpu, event, trace, ip, sym, dso, addr, symoff,
        srcline, period, iregs, uregs, brstack, brstacksym, flags, bpf-output, brstackinsn,
        brstackoff, callindent, insn, insnlen, synth, phys_addr, metric, misc.
        Field list can be prepended with the type, trace, sw or hw,
        to indicate to which event type the field list applies.
        e.g., -F sw:comm,tid,time,ip,sym and -F trace:time,cpu,trace

          perf script -F <fields>

        is equivalent to:

          perf script -F trace:<fields> -F sw:<fields> -F hw:<fields>

        i.e., the specified fields apply to all event types if the type string
        is not given.

        In addition to overriding fields, it is also possible to add or remove
        fields from the defaults. For example

          -F -cpu,+insn

        removes the cpu field and adds the insn field. Adding/removing fields
        cannot be mixed with normal overriding.
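
The add/remove rule can be modelled in a few lines of Python. This is only a sketch of the behaviour described above, not perf's implementation; the function name and the default field list are hypothetical and chosen for illustration.

```python
def apply_field_modifiers(defaults, spec):
    """Apply a '+field'/'-field' list such as '-cpu,+insn' to a default list."""
    tokens = [t.strip() for t in spec.split(",") if t.strip()]
    # Adding/removing cannot be mixed with plain overriding, so every
    # token must carry a '+' or '-' prefix.
    if not all(t[0] in "+-" for t in tokens):
        raise ValueError("cannot mix +/- modifiers with a plain field list")
    fields = list(defaults)
    for token in tokens:
        name = token[1:]
        if token[0] == "-":
            fields = [f for f in fields if f != name]
        elif name not in fields:
            fields.append(name)
    return fields


# Hypothetical defaults, used only for this illustration.
print(apply_field_modifiers(["comm", "tid", "cpu", "time", "ip", "sym"], "-cpu,+insn"))
```

In this model a spec such as 'comm,+insn' is rejected, mirroring the restriction that modifiers and plain overrides cannot be combined.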

        The arguments are processed in the order received. A later usage can
        reset a prior request. e.g.:

          -F trace: -F comm,tid,time,ip,sym

        The first -F suppresses trace events (field list is ""), but then the
        second invocation sets the fields to comm,tid,time,ip,sym. In this case a
        warning is given to the user:

          "Overriding previous field request for all events."

        Alternatively, consider the order:

          -F comm,tid,time,ip,sym -F trace:

        The first -F sets the fields for all events and the second -F
        suppresses trace events. The user is given a warning message about
        the override, and the result of the above is that only S/W and H/W
        events are displayed with the given fields.
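
The ordering rule can be sketched as a small Python model. It assumes only the three type prefixes named above (trace, sw, hw), ignores +/- modifiers and field validation, and records which requests triggered an override rather than reproducing perf's warning text. All names are hypothetical.

```python
EVENT_TYPES = ("trace", "sw", "hw")  # the type prefixes named in the text


def resolve_field_requests(requests):
    """Process -F arguments in order; return (fields per type, overriding requests)."""
    fields = {}       # event type -> field list; an empty list suppresses the type
    overrides = []    # requests that replaced an earlier request
    for request in requests:
        prefix, sep, rest = request.partition(":")
        if sep and prefix in EVENT_TYPES:
            targets, spec = (prefix,), rest
        else:
            targets, spec = EVENT_TYPES, request
        if any(t in fields for t in targets):
            overrides.append(request)
        names = [f for f in spec.split(",") if f]
        for t in targets:
            fields[t] = list(names)
    return fields, overrides


print(resolve_field_requests(["trace:", "comm,tid,time,ip,sym"]))
print(resolve_field_requests(["comm,tid,time,ip,sym", "trace:"]))
```

In the first call every type ends up with the full field list; in the second, the trace list is empty, so only S/W and H/W events would be displayed.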

        For the 'wildcard' option if a user selected field is invalid for an
        event type, a message is displayed to the user that the option is
        ignored for that type. For example:

          $ perf script -F comm,tid,trace
          'trace' not valid for hardware events. Ignoring.
          'trace' not valid for software events. Ignoring.

        Alternatively, if the type is given and an invalid field is specified,
        it is an error. For example:

          perf script -v -F sw:comm,tid,trace
          'trace' not valid for software events.

        At this point usage is displayed, and perf-script exits.

        The flags field is synthesized and may have a value when Instruction
        Trace decoding. The flags are "bcrosyiABEx" which stand for branch,
        call, return, conditional, system, asynchronous, interrupt,
        transaction abort, trace begin, trace end, and in transaction,
        respectively. Known combinations of flags are printed more nicely e.g.
        "call" for "bc", "return" for "br", "jcc" for "bo", "jmp" for "b",
        "int" for "bci", "iret" for "bri", "syscall" for "bcs", "sysret" for "brs",
        "async" for "by", "hw int" for "bcyi", "tx abrt" for "bA", "tr strt" for "bB",
        "tr end" for "bE". However the "x" flag will be displayed separately in those
        cases e.g. "jcc (x)" for a conditional branch within a transaction.
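
The letter and combination tables above can be written out as a lookup, shown here as a Python sketch. The tables are copied from the text; the function names and the fallback to the raw letters for combinations not listed above are assumptions made for illustration.

```python
FLAG_NAMES = {
    "b": "branch", "c": "call", "r": "return", "o": "conditional",
    "s": "system", "y": "asynchronous", "i": "interrupt",
    "A": "transaction abort", "B": "trace begin", "E": "trace end",
    "x": "in transaction",
}

KNOWN_COMBINATIONS = {
    "bc": "call", "br": "return", "bo": "jcc", "b": "jmp",
    "bci": "int", "bri": "iret", "bcs": "syscall", "brs": "sysret",
    "by": "async", "bcyi": "hw int", "bA": "tx abrt", "bB": "tr strt",
    "bE": "tr end",
}


def expand_flags(flags):
    """Spell out each flag letter, e.g. 'bcs' -> branch, call, system."""
    return [FLAG_NAMES[letter] for letter in flags]


def describe_flags(flags):
    """Return the short name for a known combination, with '(x)' kept separate."""
    in_transaction = "x" in flags
    base = flags.replace("x", "")
    # Assumption: unknown combinations are shown as their raw letters.
    name = KNOWN_COMBINATIONS.get(base, base)
    return f"{name} (x)" if in_transaction else name


print(describe_flags("bo"), "|", describe_flags("box"), "|", expand_flags("bcs"))
```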

        The callindent field is synthesized and may have a value when
        Instruction Trace decoding. For calls and returns, it will display the
        name of the symbol indented with spaces to reflect the stack depth.

        When doing instruction trace decoding insn and insnlen give the
        instruction bytes and the instruction length of the current
        instruction.

        The synth field is used by synthesized events which may be created when
        Instruction Trace decoding.

        Finally, a user may not set fields to none for all event types.
        i.e., -F "" is not allowed.

        The brstack output includes branch related information with raw addresses using the
        /v/v/v/v/cycles syntax in the following order:
        FROM: branch source instruction
        TO  : branch target instruction
        M/P/-: M=branch target mispredicted or branch direction was mispredicted, P=target predicted or direction predicted, -=not supported
        X/- : X=branch inside a transactional region, -=not in transaction region or not supported
        A/- : A=TSX abort entry, -=not aborted region or not supported
        cycles
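
A brstack entry in the order listed above can be split into its parts as follows. This Python sketch assumes that each entry is a single '/'-separated token and that addresses are hexadecimal; the sample entry and all names are hypothetical.

```python
from typing import NamedTuple


class BranchEntry(NamedTuple):
    from_addr: int      # FROM: branch source instruction
    to_addr: int        # TO: branch target instruction
    prediction: str     # 'M', 'P' or '-'
    in_transaction: str  # 'X' or '-'
    abort: str          # 'A' or '-'
    cycles: int


def parse_brstack_entry(entry):
    """Split one FROM/TO/M|P|-/X|-/A|-/cycles token into named fields."""
    src, dst, prediction, in_tx, abort, cycles = entry.split("/")
    return BranchEntry(int(src, 16), int(dst, 16), prediction, in_tx, abort, int(cycles))


# Hypothetical entry, not taken from real perf output.
print(parse_brstack_entry("0x401000/0x401020/P/-/-/12"))
```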

        The brstacksym is identical to brstack, except that the FROM and TO addresses are printed in a symbolic form if possible.

        When brstackinsn is specified the full assembler sequences of branch sequences for each sample
        are printed. This is the full execution path leading to the sample. This is only supported when the
        sample was recorded with perf record -b or -j any.

        The brstackoff field will print an offset into a specific dso/binary.

        With the metric option perf script can compute metrics for
        sampling periods, similar to perf stat. This requires
        specifying a group with multiple metrics with the :S option
        for perf record. perf will sample on the first event, and
        compute metrics for all the events in the group. Please note
        that the metric computed is averaged over the whole sampling
        period, not just for the sample point.

        For sample events it is possible to display the misc field with the
        -F +misc option. The following letters are displayed for each bit:

          PERF_RECORD_MISC_KERNEL        K
          PERF_RECORD_MISC_USER          U
          PERF_RECORD_MISC_HYPERVISOR    H
          PERF_RECORD_MISC_GUEST_KERNEL  G
          PERF_RECORD_MISC_GUEST_USER    g
          PERF_RECORD_MISC_MMAP_DATA*    M
          PERF_RECORD_MISC_COMM_EXEC     E
          PERF_RECORD_MISC_SWITCH_OUT    S

          $ perf script -F +misc ...
           sched-messaging  1414 K     28690.636582:       4590 cycles ...
           sched-messaging  1407 U     28690.636600:     325620 cycles ...
           sched-messaging  1414 K     28690.636608:      19473 cycles ...
          misc field ___________/

-k::
--vmlinux=<file>::
        vmlinux pathname

--kallsyms=<file>::
        kallsyms pathname

--symfs=<directory>::
        Look for files with symbols relative to this directory.

-G::
--hide-call-graph::
        When printing symbols do not display call chain.

--stop-bt::
        Stop display of callgraph at these symbols

-C::
--cpu::
        Only report samples for the list of CPUs provided. Multiple CPUs can
        be provided as a comma-separated list with no space: 0,1. Ranges of
        CPUs are specified with -: 0-2. Default is to report samples on all
        CPUs.
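
The CPU list syntax accepted by -C can be illustrated with a short Python sketch. It only models the syntax described above (comma-separated entries with optional ranges) and is not perf's parser; the function name is hypothetical.

```python
def parse_cpu_list(spec):
    """Expand a CPU list such as '0,2-4' into a sorted list of CPU numbers."""
    cpus = set()
    for part in spec.split(","):
        if "-" in part:
            first, last = part.split("-")
            cpus.update(range(int(first), int(last) + 1))  # ranges are inclusive
        else:
            cpus.add(int(part))
    return sorted(cpus)


print(parse_cpu_list("0,1"), parse_cpu_list("0-2"), parse_cpu_list("0,2-4"))
```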

-c::
--comms=::
        Only display events for these comms. CSV that understands
        file://filename entries.

--pid=::
        Only show events for given process ID (comma separated list).

--tid=::
        Only show events for given thread ID (comma separated list).

-I::
--show-info::
        Display extended information about the perf.data file. This adds
        information which may be very large and thus may clutter the display.
        It currently includes: cpu and numa topology of the host system.
        It can only be used with the perf script report mode.

--show-kernel-path::
        Try to resolve the path of [kernel.kallsyms]

--show-task-events::
        Display task related events (e.g. FORK, COMM, EXIT).

--show-mmap-events::
        Display mmap related events (e.g. MMAP, MMAP2).

--show-namespace-events::
        Display namespace events i.e. events of type PERF_RECORD_NAMESPACES.

--show-switch-events::
        Display context switch events i.e. events of type PERF_RECORD_SWITCH or
        PERF_RECORD_SWITCH_CPU_WIDE.

--demangle::
        Demangle symbol names to human readable form. It's enabled by default,
        disable with --no-demangle.

--demangle-kernel::
        Demangle kernel symbol names to human readable form (for C++ kernels).

--header::
        Show perf.data header.

--header-only::
        Show only perf.data header.

--itrace::
        Options for decoding instruction tracing data. The options are:

include::itrace.txt[]

        To disable decoding entirely, use --no-itrace.

--full-source-path::
        Show the full path for source files for srcline output.

--max-stack::
        Set the stack depth limit when parsing the callchain, anything
        beyond the specified depth will be ignored. This is a trade-off
        between information loss and faster processing especially for
        workloads that can have a very long callchain stack.
        Note that when using the --itrace option the synthesized callchain size
        will override this value if the synthesized callchain size is bigger.

        Default: 127

--ns::
        Use 9 decimal places when displaying time (i.e. show the nanoseconds)
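
The effect of --ns on the time column can be shown with a small Python sketch. It assumes timestamps are nanosecond counts and that the default display truncates to 6 decimal places, as in the sample output shown for the misc field. Column padding is left out and the function name is hypothetical.

```python
def format_timestamp(nanoseconds, show_ns=False):
    """Format a nanosecond timestamp as seconds with 6 or 9 decimal places."""
    secs, nsecs = divmod(nanoseconds, 1_000_000_000)
    if show_ns:
        return f"{secs}.{nsecs:09d}"
    # Assumption: the default output drops the last three digits (microseconds).
    return f"{secs}.{nsecs // 1000:06d}"


print(format_timestamp(28690636582123))
print(format_timestamp(28690636582123, show_ns=True))
```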

-f::
--force::
        Don't do ownership validation.

--time::
        Only analyze samples within given time window: <start>,<stop>. Times
        have the format seconds.microseconds. If start is not given (i.e., time
        string is ',x.y') then analysis starts at the beginning of the file. If
        stop time is not given (i.e., time string is 'x.y,') then analysis goes
        to end of file.

        Time percentages with multiple time ranges are also supported. The time
        string is 'a%/n,b%/m,...' or 'a%-b%,c%-d%,...'. The maximum number of
        slices is 10.

        For example:
        Select the second 10% time slice
        perf script --time 10%/2

        Select from 0% to 10% time slice
        perf script --time 0%-10%

        Select the first and second 10% time slices
        perf script --time 10%/1,10%/2

        Select from 0% to 10% and 30% to 40% slices
        perf script --time 0%-10%,30%-40%
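
The percent syntax can be translated into absolute time windows as in the Python sketch below. It assumes that 'a%/n' selects the n-th slice of size a%, counted from the start of the recorded interval, which matches the examples above. The function name and the start/end values are hypothetical.

```python
def percent_windows(spec, start, end):
    """Turn 'a%/n,...' or 'a%-b%,...' into (window_start, window_end) pairs."""
    items = spec.split(",")
    if len(items) > 10:
        raise ValueError("the maximum number of slices is 10")
    duration = end - start
    windows = []
    for item in items:
        if "/" in item:
            size_text, index_text = item.split("/")
            size = float(size_text.rstrip("%"))
            low = size * (int(index_text) - 1)   # the n-th slice starts after n-1 slices
            high = size * int(index_text)
        else:
            low_text, high_text = item.split("-")
            low = float(low_text.rstrip("%"))
            high = float(high_text.rstrip("%"))
        windows.append((start + duration * low / 100, start + duration * high / 100))
    return windows


# Hypothetical trace running from t=100s to t=200s.
print(percent_windows("10%/2", 100, 200))
print(percent_windows("0%-10%,30%-40%", 100, 200))
```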

--max-blocks::
        Set the maximum number of program blocks to print with brstackinsn for
        each sample.

--per-event-dump::
        Create per event files with a "perf.data.EVENT.dump" name instead of
        printing to stdout, useful, for instance, for generating flamegraphs.

--inline::
        If a callgraph address belongs to an inlined function, the inline stack
        will be printed. Each entry has function name and file/line. Enabled by
        default, disable with --no-inline.

SEE ALSO
--------
linkperf:perf-record[1], linkperf:perf-script-perl[1],
linkperf:perf-script-python[1]