Blame - tools/perf/Documentation/perf-bench.txt - SHIFTPHONES/mainline/linux

blob: a0529c7fa5ef988e4ab487e4665a478c2862f386 [file] [log] [blame]

Hitoshi Mitake	9fbc04f	2009-11-10 20:50:54 +0900	[diff] [blame]	1	perf-bench(1)
Arnaldo Carvalho de Melo	4778e0e	2010-05-05 11:23:27 -0300	[diff] [blame]	2	=============
Hitoshi Mitake	9fbc04f	2009-11-10 20:50:54 +0900	[diff] [blame]	3
				4	NAME
				5	----
				6	perf-bench - General framework for benchmark suites
				7
				8	SYNOPSIS
				9	--------
				10	[verse]
				11	'perf bench' [<common options>] <subsystem> <suite> [<options>]
				12
				13	DESCRIPTION
				14	-----------
Namhyung Kim	08942f6	2012-06-20 15:08:06 +0900	[diff] [blame]	15	This 'perf bench' command is a general framework for benchmark suites.
Hitoshi Mitake	9fbc04f	2009-11-10 20:50:54 +0900	[diff] [blame]	16
				17	COMMON OPTIONS
				18	--------------
Davidlohr Bueso	b6f0629	2014-06-16 11:14:19 -0700	[diff] [blame]	19	-r::
				20	--repeat=::
				21	Specify amount of times to repeat the run (default 10).
				22
Hitoshi Mitake	9fbc04f	2009-11-10 20:50:54 +0900	[diff] [blame]	23	-f::
				24	--format=::
				25	Specify format style.
Randy Dunlap	854c554	2010-03-31 11:31:00 -0700	[diff] [blame]	26	Current available format styles are:
Hitoshi Mitake	9fbc04f	2009-11-10 20:50:54 +0900	[diff] [blame]	27
				28	'default'::
				29	Default style. This is mainly for human reading.
				30	---------------------
Randy Dunlap	854c554	2010-03-31 11:31:00 -0700	[diff] [blame]	31	% perf bench sched pipe # with no style specified
Hitoshi Mitake	9fbc04f	2009-11-10 20:50:54 +0900	[diff] [blame]	32	(executing 1000000 pipe operations between two tasks)
				33	Total time:5.855 sec
				34	5.855061 usecs/op
				35	170792 ops/sec
				36	---------------------
				37
				38	'simple'::
				39	This simple style is friendly for automated
				40	processing by scripts.
				41	---------------------
				42	% perf bench --format=simple sched pipe # specified simple
				43	5.988
				44	---------------------
				45
				46	SUBSYSTEM
				47	---------
				48
				49	'sched'::
				50	Scheduler and IPC mechanisms.
				51
Davidlohr Bueso	c2a0820	2019-03-08 10:17:47 -0800	[diff] [blame]	52	'syscall'::
				53	System call performance (throughput).
				54
Namhyung Kim	08942f6	2012-06-20 15:08:06 +0900	[diff] [blame]	55	'mem'::
				56	Memory access performance.
				57
Ramkumar Ramachandra	95a2b3c	2014-03-27 19:50:18 -0400	[diff] [blame]	58	'numa'::
				59	NUMA scheduling and MM benchmarks.
				60
				61	'futex'::
				62	Futex stressing benchmarks.
				63
Davidlohr Bueso	121dd9e	2018-11-06 07:22:25 -0800	[diff] [blame]	64	'epoll'::
				65	Eventpoll (epoll) stressing benchmarks.
				66
Ian Rogers	2a4b516	2020-04-02 08:43:53 -0700	[diff] [blame]	67	'internals'::
				68	Benchmark internal perf functionality.
				69
Namhyung Kim	08942f6	2012-06-20 15:08:06 +0900	[diff] [blame]	70	'all'::
				71	All benchmark subsystems.
				72
Hitoshi Mitake	9fbc04f	2009-11-10 20:50:54 +0900	[diff] [blame]	73	SUITES FOR 'sched'
				74	~~~~~~~~~~~~~~~~~~
				75	messaging::
				76	Suite for evaluating performance of scheduler and IPC mechanisms.
				77	Based on hackbench by Rusty Russell.
				78
Namhyung Kim	08942f6	2012-06-20 15:08:06 +0900	[diff] [blame]	79	Options of messaging
				80	^^^^^^^^^^^^^^^^^^^^^^
Hitoshi Mitake	9fbc04f	2009-11-10 20:50:54 +0900	[diff] [blame]	81	-p::
				82	--pipe::
				83	Use pipe() instead of socketpair()
				84
				85	-t::
				86	--thread::
				87	Be multi thread instead of multi process
				88
				89	-g::
				90	--group=::
				91	Specify number of groups
				92
				93	-l::
Ingo Molnar	b0d22e5	2015-10-19 10:04:28 +0200	[diff] [blame]	94	--nr_loops=::
Hitoshi Mitake	9fbc04f	2009-11-10 20:50:54 +0900	[diff] [blame]	95	Specify number of loops
				96
				97	Example of messaging
				98	^^^^^^^^^^^^^^^^^^^^^^
				99
				100	---------------------
				101	% perf bench sched messaging # run with default
				102	options (20 sender and receiver processes per group)
				103	(10 groups == 400 processes run)
				104
				105	Total time:0.308 sec
				106
Randy Dunlap	854c554	2010-03-31 11:31:00 -0700	[diff] [blame]	107	% perf bench sched messaging -t -g 20 # be multi-thread, with 20 groups
Hitoshi Mitake	9fbc04f	2009-11-10 20:50:54 +0900	[diff] [blame]	108	(20 sender and receiver threads per group)
				109	(20 groups == 800 threads run)
				110
				111	Total time:0.582 sec
				112	---------------------
				113
				114	pipe::
				115	Suite for pipe() system call.
				116	Based on pipe-test-1m.c by Ingo Molnar.
				117
				118	Options of pipe
				119	^^^^^^^^^^^^^^^^^
				120	-l::
				121	--loop=::
				122	Specify number of loops.
				123
				124	Example of pipe
				125	^^^^^^^^^^^^^^^^^
				126
				127	---------------------
				128	% perf bench sched pipe
				129	(executing 1000000 pipe operations between two tasks)
				130
				131	Total time:8.091 sec
				132	8.091833 usecs/op
				133	123581 ops/sec
				134
				135	% perf bench sched pipe -l 1000 # loop 1000
				136	(executing 1000 pipe operations between two tasks)
				137
				138	Total time:0.016 sec
				139	16.948000 usecs/op
				140	59004 ops/sec
				141	---------------------
				142
Davidlohr Bueso	c2a0820	2019-03-08 10:17:47 -0800	[diff] [blame]	143	SUITES FOR 'syscall'
				144	~~~~~~~~~~~~~~~~~~
				145	basic::
				146	Suite for evaluating performance of core system call throughput (both usecs/op and ops/sec metrics).
				147	This uses a single thread simply doing getppid(2), which is a simple syscall where the result is not
				148	cached by glibc.
				149
				150
Namhyung Kim	08942f6	2012-06-20 15:08:06 +0900	[diff] [blame]	151	SUITES FOR 'mem'
				152	~~~~~~~~~~~~~~~~
				153	memcpy::
				154	Suite for evaluating performance of simple memory copy in various ways.
				155
				156	Options of memcpy
				157	^^^^^^^^^^^^^^^^^^^
				158	-l::
Ingo Molnar	a69b4f7	2015-10-19 10:04:25 +0200	[diff] [blame]	159	--size::
				160	Specify size of memory to copy (default: 1MB).
Namhyung Kim	08942f6	2012-06-20 15:08:06 +0900	[diff] [blame]	161	Available units are B, KB, MB, GB and TB (case insensitive).
				162
Ingo Molnar	2f211c8	2015-10-19 10:04:29 +0200	[diff] [blame]	163	-f::
				164	--function::
				165	Specify function to copy (default: default).
				166	Available functions are depend on the architecture.
Namhyung Kim	08942f6	2012-06-20 15:08:06 +0900	[diff] [blame]	167	On x86-64, x86-64-unrolled, x86-64-movsq and x86-64-movsb are supported.
				168
Ingo Molnar	b0d22e5	2015-10-19 10:04:28 +0200	[diff] [blame]	169	-l::
				170	--nr_loops::
Namhyung Kim	08942f6	2012-06-20 15:08:06 +0900	[diff] [blame]	171	Repeat memcpy invocation this number of times.
				172
				173	-c::
Ingo Molnar	b14f2d3	2015-10-19 10:04:23 +0200	[diff] [blame]	174	--cycles::
Namhyung Kim	08942f6	2012-06-20 15:08:06 +0900	[diff] [blame]	175	Use perf's cpu-cycles event instead of gettimeofday syscall.
				176
Namhyung Kim	08942f6	2012-06-20 15:08:06 +0900	[diff] [blame]	177	memset::
				178	Suite for evaluating performance of simple memory set in various ways.
				179
				180	Options of memset
				181	^^^^^^^^^^^^^^^^^^^
				182	-l::
Ingo Molnar	a69b4f7	2015-10-19 10:04:25 +0200	[diff] [blame]	183	--size::
				184	Specify size of memory to set (default: 1MB).
Namhyung Kim	08942f6	2012-06-20 15:08:06 +0900	[diff] [blame]	185	Available units are B, KB, MB, GB and TB (case insensitive).
				186
Ingo Molnar	2f211c8	2015-10-19 10:04:29 +0200	[diff] [blame]	187	-f::
				188	--function::
				189	Specify function to set (default: default).
				190	Available functions are depend on the architecture.
Namhyung Kim	08942f6	2012-06-20 15:08:06 +0900	[diff] [blame]	191	On x86-64, x86-64-unrolled, x86-64-stosq and x86-64-stosb are supported.
				192
Ingo Molnar	b0d22e5	2015-10-19 10:04:28 +0200	[diff] [blame]	193	-l::
				194	--nr_loops::
Namhyung Kim	08942f6	2012-06-20 15:08:06 +0900	[diff] [blame]	195	Repeat memset invocation this number of times.
				196
				197	-c::
Ingo Molnar	b14f2d3	2015-10-19 10:04:23 +0200	[diff] [blame]	198	--cycles::
Namhyung Kim	08942f6	2012-06-20 15:08:06 +0900	[diff] [blame]	199	Use perf's cpu-cycles event instead of gettimeofday syscall.
				200
Ramkumar Ramachandra	95a2b3c	2014-03-27 19:50:18 -0400	[diff] [blame]	201	SUITES FOR 'numa'
				202	~~~~~~~~~~~~~~~~~
				203	mem::
				204	Suite for evaluating NUMA workloads.
				205
				206	SUITES FOR 'futex'
				207	~~~~~~~~~~~~~~~~~~
				208	hash::
				209	Suite for evaluating hash tables.
				210
				211	wake::
				212	Suite for evaluating wake calls.
				213
Davidlohr Bueso	d65817b	2015-05-08 11:37:59 -0700	[diff] [blame]	214	wake-parallel::
				215	Suite for evaluating parallel wake calls.
				216
Ramkumar Ramachandra	95a2b3c	2014-03-27 19:50:18 -0400	[diff] [blame]	217	requeue::
				218	Suite for evaluating requeue calls.
				219
Davidlohr Bueso	d2f3f5d	2015-07-07 01:55:53 -0700	[diff] [blame]	220	lock-pi::
				221	Suite for evaluating futex lock_pi calls.
				222
Davidlohr Bueso	121dd9e	2018-11-06 07:22:25 -0800	[diff] [blame]	223	SUITES FOR 'epoll'
				224	~~~~~~~~~~~~~~~~~~
				225	wait::
				226	Suite for evaluating concurrent epoll_wait calls.
Davidlohr Bueso	d2f3f5d	2015-07-07 01:55:53 -0700	[diff] [blame]	227
Davidlohr Bueso	231457e	2018-11-06 07:22:26 -0800	[diff] [blame]	228	ctl::
				229	Suite for evaluating multiple epoll_ctl calls.
				230
Ian Rogers	2a4b516	2020-04-02 08:43:53 -0700	[diff] [blame]	231	SUITES FOR 'internals'
				232	~~~~~~~~~~~~~~~~~~~~~~
				233	synthesize::
				234	Suite for evaluating perf's event synthesis performance.
				235
Hitoshi Mitake	9fbc04f	2009-11-10 20:50:54 +0900	[diff] [blame]	236	SEE ALSO
				237	--------
				238	linkperf:perf[1]