blob: 9b93953f69cf866289fd8e49632860185bda1e6c [file] [log] [blame]
Wim Van Sebroeck43316042011-07-22 18:55:18 +00001The Linux WatchDog Timer Driver Core kernel API.
2===============================================
Fabio Porcedda30482532013-01-08 11:04:10 +01003Last reviewed: 12-Feb-2013
Wim Van Sebroeck43316042011-07-22 18:55:18 +00004
5Wim Van Sebroeck <wim@iguana.be>
6
7Introduction
8------------
9This document does not describe what a WatchDog Timer (WDT) Driver or Device is.
10It also does not describe the API which can be used by user space to communicate
11with a WatchDog Timer. If you want to know this then please read the following
12file: Documentation/watchdog/watchdog-api.txt .
13
14So what does this document describe? It describes the API that can be used by
15WatchDog Timer Drivers that want to use the WatchDog Timer Driver Core
16Framework. This framework provides all interfacing towards user space so that
17the same code does not have to be reproduced each time. This also means that
18a watchdog timer driver then only needs to provide the different routines
19(operations) that control the watchdog timer (WDT).
20
21The API
22-------
23Each watchdog timer driver that wants to use the WatchDog Timer Driver Core
24must #include <linux/watchdog.h> (you would have to do this anyway when
25writing a watchdog device driver). This include file contains following
26register/unregister routines:
27
28extern int watchdog_register_device(struct watchdog_device *);
29extern void watchdog_unregister_device(struct watchdog_device *);
30
31The watchdog_register_device routine registers a watchdog timer device.
32The parameter of this routine is a pointer to a watchdog_device structure.
33This routine returns zero on success and a negative errno code for failure.
34
35The watchdog_unregister_device routine deregisters a registered watchdog timer
36device. The parameter of this routine is the pointer to the registered
37watchdog_device structure.
38
Jean-Baptiste Theouef901742015-06-09 09:55:02 -070039The watchdog subsystem includes an registration deferral mechanism,
40which allows you to register an watchdog as early as you wish during
41the boot process.
42
Wim Van Sebroeck43316042011-07-22 18:55:18 +000043The watchdog device structure looks like this:
44
45struct watchdog_device {
Alan Cox45f5fed2012-05-10 21:48:59 +020046 int id;
Alan Coxd6b469d2012-05-11 12:00:20 +020047 struct device *parent;
Guenter Roeckfaa58472016-01-03 15:11:56 -080048 const struct attribute_group **groups;
Wim Van Sebroeck43316042011-07-22 18:55:18 +000049 const struct watchdog_info *info;
50 const struct watchdog_ops *ops;
Vladimir Zapolskiyff841362016-10-07 15:39:54 +030051 const struct watchdog_governor *gov;
Wim Van Sebroeck2fa03562011-07-22 18:56:38 +000052 unsigned int bootstatus;
Wim Van Sebroeck014d6942011-07-22 18:58:21 +000053 unsigned int timeout;
Wolfram Sangdf044e02016-08-31 14:52:41 +030054 unsigned int pretimeout;
Wim Van Sebroeck3f43f682011-07-22 19:00:16 +000055 unsigned int min_timeout;
56 unsigned int max_timeout;
Guenter Roeck15013ad2016-02-28 13:12:18 -080057 unsigned int min_hw_heartbeat_ms;
Guenter Roeck664a3922016-02-28 13:12:15 -080058 unsigned int max_hw_heartbeat_ms;
Damien Riegele1313192015-11-20 16:54:51 -050059 struct notifier_block reboot_nb;
Damien Riegel2165bf52015-11-16 12:27:59 -050060 struct notifier_block restart_nb;
Wim Van Sebroeck43316042011-07-22 18:55:18 +000061 void *driver_data;
Guenter Roeckb4ffb192015-12-25 16:01:42 -080062 struct watchdog_core_data *wd_data;
Wim Van Sebroeck43316042011-07-22 18:55:18 +000063 unsigned long status;
Jean-Baptiste Theouef901742015-06-09 09:55:02 -070064 struct list_head deferred;
Wim Van Sebroeck43316042011-07-22 18:55:18 +000065};
66
67It contains following fields:
Alan Cox45f5fed2012-05-10 21:48:59 +020068* id: set by watchdog_register_device, id 0 is special. It has both a
69 /dev/watchdog0 cdev (dynamic major, minor 0) as well as the old
70 /dev/watchdog miscdev. The id is set automatically when calling
71 watchdog_register_device.
Alan Coxd6b469d2012-05-11 12:00:20 +020072* parent: set this to the parent device (or NULL) before calling
73 watchdog_register_device.
Guenter Roeckfaa58472016-01-03 15:11:56 -080074* groups: List of sysfs attribute groups to create when creating the watchdog
75 device.
Wim Van Sebroeck43316042011-07-22 18:55:18 +000076* info: a pointer to a watchdog_info structure. This structure gives some
77 additional information about the watchdog timer itself. (Like it's unique name)
78* ops: a pointer to the list of watchdog operations that the watchdog supports.
Vladimir Zapolskiyff841362016-10-07 15:39:54 +030079* gov: a pointer to the assigned watchdog device pretimeout governor or NULL.
Wim Van Sebroeck014d6942011-07-22 18:58:21 +000080* timeout: the watchdog timer's timeout value (in seconds).
Guenter Roeck664a3922016-02-28 13:12:15 -080081 This is the time after which the system will reboot if user space does
82 not send a heartbeat request if WDOG_ACTIVE is set.
Wolfram Sangdf044e02016-08-31 14:52:41 +030083* pretimeout: the watchdog timer's pretimeout value (in seconds).
Wim Van Sebroeck3f43f682011-07-22 19:00:16 +000084* min_timeout: the watchdog timer's minimum timeout value (in seconds).
Guenter Roeck664a3922016-02-28 13:12:15 -080085 If set, the minimum configurable value for 'timeout'.
86* max_timeout: the watchdog timer's maximum timeout value (in seconds),
87 as seen from userspace. If set, the maximum configurable value for
88 'timeout'. Not used if max_hw_heartbeat_ms is non-zero.
Guenter Roeckf9f535c2016-05-27 14:19:06 -070089* min_hw_heartbeat_ms: Hardware limit for minimum time between heartbeats,
90 in milli-seconds. This value is normally 0; it should only be provided
91 if the hardware can not tolerate lower intervals between heartbeats.
Guenter Roeck664a3922016-02-28 13:12:15 -080092* max_hw_heartbeat_ms: Maximum hardware heartbeat, in milli-seconds.
93 If set, the infrastructure will send heartbeats to the watchdog driver
94 if 'timeout' is larger than max_hw_heartbeat_ms, unless WDOG_ACTIVE
95 is set and userspace failed to send a heartbeat for at least 'timeout'
Guenter Roeckd0684c82016-02-28 13:12:17 -080096 seconds. max_hw_heartbeat_ms must be set if a driver does not implement
97 the stop function.
Damien Riegele1313192015-11-20 16:54:51 -050098* reboot_nb: notifier block that is registered for reboot notifications, for
99 internal use only. If the driver calls watchdog_stop_on_reboot, watchdog core
100 will stop the watchdog on such notifications.
Damien Riegel2165bf52015-11-16 12:27:59 -0500101* restart_nb: notifier block that is registered for machine restart, for
102 internal use only. If a watchdog is capable of restarting the machine, it
103 should define ops->restart. Priority can be changed through
104 watchdog_set_restart_priority.
Wim Van Sebroeck2fa03562011-07-22 18:56:38 +0000105* bootstatus: status of the device after booting (reported with watchdog
106 WDIOF_* status bits).
Wim Van Sebroeck43316042011-07-22 18:55:18 +0000107* driver_data: a pointer to the drivers private data of a watchdog device.
Devendra Naga2deca732012-05-14 14:33:37 +0530108 This data should only be accessed via the watchdog_set_drvdata and
Wim Van Sebroeck43316042011-07-22 18:55:18 +0000109 watchdog_get_drvdata routines.
Guenter Roeckb4ffb192015-12-25 16:01:42 -0800110* wd_data: a pointer to watchdog core internal data.
Wim Van Sebroeck43316042011-07-22 18:55:18 +0000111* status: this field contains a number of status bits that give extra
Wim Van Sebroeck234445b2011-07-22 18:57:55 +0000112 information about the status of the device (Like: is the watchdog timer
Guenter Roeckb4ffb192015-12-25 16:01:42 -0800113 running/active, or is the nowayout bit set).
Jean-Baptiste Theouef901742015-06-09 09:55:02 -0700114* deferred: entry in wtd_deferred_reg_list which is used to
115 register early initialized watchdogs.
Wim Van Sebroeck43316042011-07-22 18:55:18 +0000116
117The list of watchdog operations is defined as:
118
119struct watchdog_ops {
120 struct module *owner;
121 /* mandatory operations */
122 int (*start)(struct watchdog_device *);
123 int (*stop)(struct watchdog_device *);
124 /* optional operations */
125 int (*ping)(struct watchdog_device *);
Wim Van Sebroeck2fa03562011-07-22 18:56:38 +0000126 unsigned int (*status)(struct watchdog_device *);
Wim Van Sebroeck014d6942011-07-22 18:58:21 +0000127 int (*set_timeout)(struct watchdog_device *, unsigned int);
Wolfram Sangdf044e02016-08-31 14:52:41 +0300128 int (*set_pretimeout)(struct watchdog_device *, unsigned int);
Viresh Kumarfd7b6732012-03-16 09:14:00 +0100129 unsigned int (*get_timeleft)(struct watchdog_device *);
Damien Riegel2165bf52015-11-16 12:27:59 -0500130 int (*restart)(struct watchdog_device *);
Guenter Roeckb4ffb192015-12-25 16:01:42 -0800131 void (*ref)(struct watchdog_device *) __deprecated;
132 void (*unref)(struct watchdog_device *) __deprecated;
Wim Van Sebroeck78d88fc2011-07-22 18:59:49 +0000133 long (*ioctl)(struct watchdog_device *, unsigned int, unsigned long);
Wim Van Sebroeck43316042011-07-22 18:55:18 +0000134};
135
136It is important that you first define the module owner of the watchdog timer
137driver's operations. This module owner will be used to lock the module when
138the watchdog is active. (This to avoid a system crash when you unload the
139module and /dev/watchdog is still open).
Hans de Goedee907df32012-05-22 11:40:26 +0200140
Wim Van Sebroeck43316042011-07-22 18:55:18 +0000141Some operations are mandatory and some are optional. The mandatory operations
142are:
143* start: this is a pointer to the routine that starts the watchdog timer
144 device.
145 The routine needs a pointer to the watchdog timer device structure as a
146 parameter. It returns zero on success or a negative errno code for failure.
Wim Van Sebroeck43316042011-07-22 18:55:18 +0000147
148Not all watchdog timer hardware supports the same functionality. That's why
149all other routines/operations are optional. They only need to be provided if
150they are supported. These optional routines/operations are:
Guenter Roeckd0684c82016-02-28 13:12:17 -0800151* stop: with this routine the watchdog timer device is being stopped.
152 The routine needs a pointer to the watchdog timer device structure as a
153 parameter. It returns zero on success or a negative errno code for failure.
154 Some watchdog timer hardware can only be started and not be stopped. A
155 driver supporting such hardware does not have to implement the stop routine.
156 If a driver has no stop function, the watchdog core will set WDOG_HW_RUNNING
157 and start calling the driver's keepalive pings function after the watchdog
158 device is closed.
159 If a watchdog driver does not implement the stop function, it must set
160 max_hw_heartbeat_ms.
Wim Van Sebroeck43316042011-07-22 18:55:18 +0000161* ping: this is the routine that sends a keepalive ping to the watchdog timer
162 hardware.
163 The routine needs a pointer to the watchdog timer device structure as a
164 parameter. It returns zero on success or a negative errno code for failure.
165 Most hardware that does not support this as a separate function uses the
166 start function to restart the watchdog timer hardware. And that's also what
167 the watchdog timer driver core does: to send a keepalive ping to the watchdog
168 timer hardware it will either use the ping operation (when available) or the
169 start operation (when the ping operation is not available).
Wim Van Sebroeckc2dc00e2011-07-22 18:57:23 +0000170 (Note: the WDIOC_KEEPALIVE ioctl call will only be active when the
171 WDIOF_KEEPALIVEPING bit has been set in the option field on the watchdog's
172 info structure).
Wim Van Sebroeck2fa03562011-07-22 18:56:38 +0000173* status: this routine checks the status of the watchdog timer device. The
174 status of the device is reported with watchdog WDIOF_* status flags/bits.
Guenter Roeck90b826f2016-07-17 15:04:11 -0700175 WDIOF_MAGICCLOSE and WDIOF_KEEPALIVEPING are reported by the watchdog core;
176 it is not necessary to report those bits from the driver. Also, if no status
177 function is provided by the driver, the watchdog core reports the status bits
178 provided in the bootstatus variable of struct watchdog_device.
Wim Van Sebroeck014d6942011-07-22 18:58:21 +0000179* set_timeout: this routine checks and changes the timeout of the watchdog
180 timer device. It returns 0 on success, -EINVAL for "parameter out of range"
Hans de Goedeb10f7c12011-09-12 11:56:59 +0200181 and -EIO for "could not write value to the watchdog". On success this
182 routine should set the timeout value of the watchdog_device to the
183 achieved timeout value (which may be different from the requested one
Guenter Roeck664a3922016-02-28 13:12:15 -0800184 because the watchdog does not necessarily have a 1 second resolution).
185 Drivers implementing max_hw_heartbeat_ms set the hardware watchdog heartbeat
186 to the minimum of timeout and max_hw_heartbeat_ms. Those drivers set the
187 timeout value of the watchdog_device either to the requested timeout value
188 (if it is larger than max_hw_heartbeat_ms), or to the achieved timeout value.
Wim Van Sebroeck014d6942011-07-22 18:58:21 +0000189 (Note: the WDIOF_SETTIMEOUT needs to be set in the options field of the
190 watchdog's info structure).
Guenter Roeckfb32e9b2016-02-28 13:12:14 -0800191 If the watchdog driver does not have to perform any action but setting the
192 watchdog_device.timeout, this callback can be omitted.
193 If set_timeout is not provided but, WDIOF_SETTIMEOUT is set, the watchdog
194 infrastructure updates the timeout value of the watchdog_device internally
195 to the requested value.
Wolfram Sangdf044e02016-08-31 14:52:41 +0300196 If the pretimeout feature is used (WDIOF_PRETIMEOUT), then set_timeout must
197 also take care of checking if pretimeout is still valid and set up the timer
198 accordingly. This can't be done in the core without races, so it is the
199 duty of the driver.
200* set_pretimeout: this routine checks and changes the pretimeout value of
201 the watchdog. It is optional because not all watchdogs support pretimeout
202 notification. The timeout value is not an absolute time, but the number of
203 seconds before the actual timeout would happen. It returns 0 on success,
204 -EINVAL for "parameter out of range" and -EIO for "could not write value to
205 the watchdog". A value of 0 disables pretimeout notification.
206 (Note: the WDIOF_PRETIMEOUT needs to be set in the options field of the
207 watchdog's info structure).
208 If the watchdog driver does not have to perform any action but setting the
209 watchdog_device.pretimeout, this callback can be omitted. That means if
210 set_pretimeout is not provided but WDIOF_PRETIMEOUT is set, the watchdog
211 infrastructure updates the pretimeout value of the watchdog_device internally
212 to the requested value.
Viresh Kumarfd7b6732012-03-16 09:14:00 +0100213* get_timeleft: this routines returns the time that's left before a reset.
Damien Riegel2165bf52015-11-16 12:27:59 -0500214* restart: this routine restarts the machine. It returns 0 on success or a
215 negative errno code for failure.
Wim Van Sebroeck78d88fc2011-07-22 18:59:49 +0000216* ioctl: if this routine is present then it will be called first before we do
217 our own internal ioctl call handling. This routine should return -ENOIOCTLCMD
218 if a command is not supported. The parameters that are passed to the ioctl
219 call are: watchdog_device, cmd and arg.
Wim Van Sebroeck43316042011-07-22 18:55:18 +0000220
Guenter Roeckb4ffb192015-12-25 16:01:42 -0800221The 'ref' and 'unref' operations are no longer used and deprecated.
222
Wim Van Sebroeck43316042011-07-22 18:55:18 +0000223The status bits should (preferably) be set with the set_bit and clear_bit alike
224bit-operations. The status bits that are defined are:
Wim Van Sebroeck234445b2011-07-22 18:57:55 +0000225* WDOG_ACTIVE: this status bit indicates whether or not a watchdog timer device
Guenter Roeckee142882016-02-28 13:12:16 -0800226 is active or not from user perspective. User space is expected to send
227 heartbeat requests to the driver while this flag is set.
Wim Van Sebroeck7e192b92011-07-22 18:59:17 +0000228* WDOG_NO_WAY_OUT: this bit stores the nowayout setting for the watchdog.
229 If this bit is set then the watchdog timer will not be able to stop.
Guenter Roeckee142882016-02-28 13:12:16 -0800230* WDOG_HW_RUNNING: Set by the watchdog driver if the hardware watchdog is
231 running. The bit must be set if the watchdog timer hardware can not be
232 stopped. The bit may also be set if the watchdog timer is running after
233 booting, before the watchdog device is opened. If set, the watchdog
234 infrastructure will send keepalives to the watchdog hardware while
235 WDOG_ACTIVE is not set.
236 Note: when you register the watchdog timer device with this bit set,
237 then opening /dev/watchdog will skip the start operation but send a keepalive
238 request instead.
Wim Van Sebroeck017cf082011-07-22 18:58:54 +0000239
Wim Van Sebroeckff0b3cd2011-11-29 16:24:16 +0100240 To set the WDOG_NO_WAY_OUT status bit (before registering your watchdog
241 timer device) you can either:
242 * set it statically in your watchdog_device struct with
243 .status = WATCHDOG_NOWAYOUT_INIT_STATUS,
244 (this will set the value the same as CONFIG_WATCHDOG_NOWAYOUT) or
245 * use the following helper function:
246 static inline void watchdog_set_nowayout(struct watchdog_device *wdd, int nowayout)
247
Wim Van Sebroeck7e192b92011-07-22 18:59:17 +0000248Note: The WatchDog Timer Driver Core supports the magic close feature and
249the nowayout feature. To use the magic close feature you must set the
250WDIOF_MAGICCLOSE bit in the options field of the watchdog's info structure.
251The nowayout feature will overrule the magic close feature.
Wim Van Sebroeck43316042011-07-22 18:55:18 +0000252
253To get or set driver specific data the following two helper functions should be
254used:
255
256static inline void watchdog_set_drvdata(struct watchdog_device *wdd, void *data)
257static inline void *watchdog_get_drvdata(struct watchdog_device *wdd)
258
259The watchdog_set_drvdata function allows you to add driver specific data. The
260arguments of this function are the watchdog device where you want to add the
261driver specific data to and a pointer to the data itself.
262
263The watchdog_get_drvdata function allows you to retrieve driver specific data.
264The argument of this function is the watchdog device where you want to retrieve
Masanari Iidae1986522012-02-11 00:09:20 +0900265data from. The function returns the pointer to the driver specific data.
Fabio Porcedda30482532013-01-08 11:04:10 +0100266
267To initialize the timeout field, the following function can be used:
268
269extern int watchdog_init_timeout(struct watchdog_device *wdd,
270 unsigned int timeout_parm, struct device *dev);
271
272The watchdog_init_timeout function allows you to initialize the timeout field
273using the module timeout parameter or by retrieving the timeout-sec property from
274the device tree (if the module timeout parameter is invalid). Best practice is
275to set the default timeout value as timeout value in the watchdog_device and
276then use this function to set the user "preferred" timeout value.
277This routine returns zero on success and a negative errno code for failure.
Damien Riegel2165bf52015-11-16 12:27:59 -0500278
Damien Riegele1313192015-11-20 16:54:51 -0500279To disable the watchdog on reboot, the user must call the following helper:
280
281static inline void watchdog_stop_on_reboot(struct watchdog_device *wdd);
282
Guenter Roeckbb292ac2017-01-25 14:21:10 -0800283To disable the watchdog when unregistering the watchdog, the user must call
284the following helper. Note that this will only stop the watchdog if the
285nowayout flag is not set.
286
287static inline void watchdog_stop_on_unregister(struct watchdog_device *wdd);
288
Damien Riegel2165bf52015-11-16 12:27:59 -0500289To change the priority of the restart handler the following helper should be
290used:
291
292void watchdog_set_restart_priority(struct watchdog_device *wdd, int priority);
293
294User should follow the following guidelines for setting the priority:
295* 0: should be called in last resort, has limited restart capabilities
296* 128: default restart handler, use if no other handler is expected to be
297 available, and/or if restart is sufficient to restart the entire system
298* 255: highest priority, will preempt all other restart handlers
Vladimir Zapolskiyff841362016-10-07 15:39:54 +0300299
300To raise a pretimeout notification, the following function should be used:
301
302void watchdog_notify_pretimeout(struct watchdog_device *wdd)
303
304The function can be called in the interrupt context. If watchdog pretimeout
305governor framework (kbuild CONFIG_WATCHDOG_PRETIMEOUT_GOV symbol) is enabled,
306an action is taken by a preconfigured pretimeout governor preassigned to
307the watchdog device. If watchdog pretimeout governor framework is not
308enabled, watchdog_notify_pretimeout() prints a notification message to
309the kernel log buffer.