blob: 3bed9fd953360d3ceb92f0a74a17fbad2ddd256a [file] [log] [blame]
Jakub Kicinski9b038082020-04-09 14:21:58 -07001======================================================
Tal Gilboafaf4db002018-03-21 20:33:45 +02002Net DIM - Generic Network Dynamic Interrupt Moderation
3======================================================
4
Jakub Kicinski9b038082020-04-09 14:21:58 -07005:Author: Tal Gilboa <talgi@mellanox.com>
Tal Gilboafaf4db002018-03-21 20:33:45 +02006
Jakub Kicinski9b038082020-04-09 14:21:58 -07007.. contents:: :depth: 2
Tal Gilboafaf4db002018-03-21 20:33:45 +02008
Jakub Kicinski9b038082020-04-09 14:21:58 -07009Assumptions
10===========
Tal Gilboafaf4db002018-03-21 20:33:45 +020011
12This document assumes the reader has basic knowledge in network drivers
13and in general interrupt moderation.
14
15
Jakub Kicinski9b038082020-04-09 14:21:58 -070016Introduction
17============
Tal Gilboafaf4db002018-03-21 20:33:45 +020018
19Dynamic Interrupt Moderation (DIM) (in networking) refers to changing the
20interrupt moderation configuration of a channel in order to optimize packet
21processing. The mechanism includes an algorithm which decides if and how to
22change moderation parameters for a channel, usually by performing an analysis on
23runtime data sampled from the system. Net DIM is such a mechanism. In each
24iteration of the algorithm, it analyses a given sample of the data, compares it
25to the previous sample and if required, it can decide to change some of the
26interrupt moderation configuration fields. The data sample is composed of data
27bandwidth, the number of packets and the number of events. The time between
28samples is also measured. Net DIM compares the current and the previous data and
29returns an adjusted interrupt moderation configuration object. In some cases,
30the algorithm might decide not to change anything. The configuration fields are
31the minimum duration (microseconds) allowed between events and the maximum
32number of wanted packets per event. The Net DIM algorithm ascribes importance to
33increase bandwidth over reducing interrupt rate.
34
35
Jakub Kicinski9b038082020-04-09 14:21:58 -070036Net DIM Algorithm
37=================
Tal Gilboafaf4db002018-03-21 20:33:45 +020038
39Each iteration of the Net DIM algorithm follows these steps:
Jakub Kicinski9b038082020-04-09 14:21:58 -070040
41#. Calculates new data sample.
42#. Compares it to previous sample.
43#. Makes a decision - suggests interrupt moderation configuration fields.
44#. Applies a schedule work function, which applies suggested configuration.
Tal Gilboafaf4db002018-03-21 20:33:45 +020045
46The first two steps are straightforward, both the new and the previous data are
47supplied by the driver registered to Net DIM. The previous data is the new data
48supplied to the previous iteration. The comparison step checks the difference
49between the new and previous data and decides on the result of the last step.
50A step would result as "better" if bandwidth increases and as "worse" if
51bandwidth reduces. If there is no change in bandwidth, the packet rate is
52compared in a similar fashion - increase == "better" and decrease == "worse".
53In case there is no change in the packet rate as well, the interrupt rate is
54compared. Here the algorithm tries to optimize for lower interrupt rate so an
55increase in the interrupt rate is considered "worse" and a decrease is
56considered "better". Step #2 has an optimization for avoiding false results: it
57only considers a difference between samples as valid if it is greater than a
58certain percentage. Also, since Net DIM does not measure anything by itself, it
59assumes the data provided by the driver is valid.
60
61Step #3 decides on the suggested configuration based on the result from step #2
62and the internal state of the algorithm. The states reflect the "direction" of
63the algorithm: is it going left (reducing moderation), right (increasing
64moderation) or standing still. Another optimization is that if a decision
65to stay still is made multiple times, the interval between iterations of the
66algorithm would increase in order to reduce calculation overhead. Also, after
67"parking" on one of the most left or most right decisions, the algorithm may
68decide to verify this decision by taking a step in the other direction. This is
69done in order to avoid getting stuck in a "deep sleep" scenario. Once a
70decision is made, an interrupt moderation configuration is selected from
71the predefined profiles.
72
73The last step is to notify the registered driver that it should apply the
74suggested configuration. This is done by scheduling a work function, defined by
75the Net DIM API and provided by the registered driver.
76
77As you can see, Net DIM itself does not actively interact with the system. It
78would have trouble making the correct decisions if the wrong data is supplied to
79it and it would be useless if the work function would not apply the suggested
80configuration. This does, however, allow the registered driver some room for
81manoeuvre as it may provide partial data or ignore the algorithm suggestion
82under some conditions.
83
84
Jakub Kicinski9b038082020-04-09 14:21:58 -070085Registering a Network Device to DIM
86===================================
Tal Gilboafaf4db002018-03-21 20:33:45 +020087
Jakub Kicinski9b038082020-04-09 14:21:58 -070088Net DIM API exposes the main function net_dim().
89This function is the entry point to the Net
Tal Gilboafaf4db002018-03-21 20:33:45 +020090DIM algorithm and has to be called every time the driver would like to check if
91it should change interrupt moderation parameters. The driver should provide two
Jakub Kicinski9b038082020-04-09 14:21:58 -070092data structures: :c:type:`struct dim <dim>` and
93:c:type:`struct dim_sample <dim_sample>`. :c:type:`struct dim <dim>`
Tal Gilboafaf4db002018-03-21 20:33:45 +020094describes the state of DIM for a specific object (RX queue, TX queue,
95other queues, etc.). This includes the current selected profile, previous data
96samples, the callback function provided by the driver and more.
Jakub Kicinski9b038082020-04-09 14:21:58 -070097:c:type:`struct dim_sample <dim_sample>` describes a data sample,
98which will be compared to the data sample stored in :c:type:`struct dim <dim>`
99in order to decide on the algorithm's next
Tal Gilboafaf4db002018-03-21 20:33:45 +0200100step. The sample should include bytes, packets and interrupts, measured by
101the driver.
102
103In order to use Net DIM from a networking driver, the driver needs to call the
104main net_dim() function. The recommended method is to call net_dim() on each
105interrupt. Since Net DIM has a built-in moderation and it might decide to skip
106iterations under certain conditions, there is no need to moderate the net_dim()
107calls as well. As mentioned above, the driver needs to provide an object of type
Jakub Kicinski9b038082020-04-09 14:21:58 -0700108:c:type:`struct dim <dim>` to the net_dim() function call. It is advised for
109each entity using Net DIM to hold a :c:type:`struct dim <dim>` as part of its
110data structure and use it as the main Net DIM API object.
111The :c:type:`struct dim_sample <dim_sample>` should hold the latest
Tal Gilboafaf4db002018-03-21 20:33:45 +0200112bytes, packets and interrupts count. No need to perform any calculations, just
113include the raw data.
114
115The net_dim() call itself does not return anything. Instead Net DIM relies on
116the driver to provide a callback function, which is called when the algorithm
117decides to make a change in the interrupt moderation parameters. This callback
118will be scheduled and run in a separate thread in order not to add overhead to
119the data flow. After the work is done, Net DIM algorithm needs to be set to
120the proper state in order to move to the next iteration.
121
122
Jakub Kicinski9b038082020-04-09 14:21:58 -0700123Example
124=======
Tal Gilboafaf4db002018-03-21 20:33:45 +0200125
126The following code demonstrates how to register a driver to Net DIM. The actual
127usage is not complete but it should make the outline of the usage clear.
128
Jakub Kicinski9b038082020-04-09 14:21:58 -0700129.. code-block:: c
Tal Gilboafaf4db002018-03-21 20:33:45 +0200130
Jakub Kicinski9b038082020-04-09 14:21:58 -0700131 #include <linux/dim.h>
Tal Gilboafaf4db002018-03-21 20:33:45 +0200132
Jakub Kicinski9b038082020-04-09 14:21:58 -0700133 /* Callback for net DIM to schedule on a decision to change moderation */
134 void my_driver_do_dim_work(struct work_struct *work)
135 {
Jacob Keller2168da42019-10-09 12:18:31 -0700136 /* Get struct dim from struct work_struct */
137 struct dim *dim = container_of(work, struct dim,
138 work);
Tal Gilboafaf4db002018-03-21 20:33:45 +0200139 /* Do interrupt moderation related stuff */
140 ...
141
142 /* Signal net DIM work is done and it should move to next iteration */
Jacob Keller2168da42019-10-09 12:18:31 -0700143 dim->state = DIM_START_MEASURE;
Jakub Kicinski9b038082020-04-09 14:21:58 -0700144 }
Tal Gilboafaf4db002018-03-21 20:33:45 +0200145
Jakub Kicinski9b038082020-04-09 14:21:58 -0700146 /* My driver's interrupt handler */
147 int my_driver_handle_interrupt(struct my_driver_entity *my_entity, ...)
148 {
Tal Gilboafaf4db002018-03-21 20:33:45 +0200149 ...
150 /* A struct to hold current measured data */
Jacob Keller2168da42019-10-09 12:18:31 -0700151 struct dim_sample dim_sample;
Tal Gilboafaf4db002018-03-21 20:33:45 +0200152 ...
153 /* Initiate data sample struct with current data */
Jacob Keller2168da42019-10-09 12:18:31 -0700154 dim_update_sample(my_entity->events,
155 my_entity->packets,
156 my_entity->bytes,
157 &dim_sample);
Tal Gilboafaf4db002018-03-21 20:33:45 +0200158 /* Call net DIM */
159 net_dim(&my_entity->dim, dim_sample);
160 ...
Jakub Kicinski9b038082020-04-09 14:21:58 -0700161 }
Tal Gilboafaf4db002018-03-21 20:33:45 +0200162
Jakub Kicinski9b038082020-04-09 14:21:58 -0700163 /* My entity's initialization function (my_entity was already allocated) */
164 int my_driver_init_my_entity(struct my_driver_entity *my_entity, ...)
165 {
Tal Gilboafaf4db002018-03-21 20:33:45 +0200166 ...
167 /* Initiate struct work_struct with my driver's callback function */
168 INIT_WORK(&my_entity->dim.work, my_driver_do_dim_work);
169 ...
Jakub Kicinski9b038082020-04-09 14:21:58 -0700170 }
Randy Dunlap9d859282020-04-09 14:21:59 -0700171
172Dynamic Interrupt Moderation (DIM) library API
173==============================================
174
175.. kernel-doc:: include/linux/dim.h
176 :internal: