blob: 6e353d00cdc687eac454c4c586f7075cb73399eb [file] [log] [blame]
Linus Torvalds1da177e2005-04-16 15:20:36 -07001 CPU frequency and voltage scaling code in the Linux(TM) kernel
2
3
4 L i n u x C P U F r e q
5
6 C P U D r i v e r s
7
8 - information for developers -
9
10
11 Dominik Brodowski <linux@brodo.de>
Viresh Kumar7de962c2017-01-06 11:08:05 +053012 Rafael J. Wysocki <rafael.j.wysocki@intel.com>
13 Viresh Kumar <viresh.kumar@linaro.org>
Linus Torvalds1da177e2005-04-16 15:20:36 -070014
15
16
17 Clock scaling allows you to change the clock speed of the CPUs on the
18 fly. This is a nice method to save battery power, because the lower
19 the clock speed, the less power the CPU consumes.
20
21
22Contents:
23---------
241. What To Do?
251.1 Initialization
261.2 Per-CPU Initialization
271.3 verify
Viresh Kumar9c0ebcf2013-10-25 19:45:48 +0530281.4 target/target_index or setpolicy?
291.5 target/target_index
Linus Torvalds1da177e2005-04-16 15:20:36 -0700301.6 setpolicy
Viresh Kumar1c03a2d2014-06-02 22:49:28 +0530311.7 get_intermediate and target_intermediate
Linus Torvalds1da177e2005-04-16 15:20:36 -0700322. Frequency Table Helpers
33
34
35
361. What To Do?
37==============
38
39So, you just got a brand-new CPU / chipset with datasheets and want to
40add cpufreq support for this CPU / chipset? Great. Here are some hints
41on what is necessary:
42
43
441.1 Initialization
45------------------
46
47First of all, in an __initcall level 7 (module_init()) or later
48function check whether this kernel runs on the right CPU and the right
49chipset. If so, register a struct cpufreq_driver with the CPUfreq core
50using cpufreq_register_driver()
51
52What shall this struct cpufreq_driver contain?
53
Viresh Kumar7de962c2017-01-06 11:08:05 +053054 .name - The name of this driver.
Linus Torvalds1da177e2005-04-16 15:20:36 -070055
Viresh Kumar7de962c2017-01-06 11:08:05 +053056 .init - A pointer to the per-policy initialization function.
Linus Torvalds1da177e2005-04-16 15:20:36 -070057
Viresh Kumar7de962c2017-01-06 11:08:05 +053058 .verify - A pointer to a "verification" function.
Linus Torvalds1da177e2005-04-16 15:20:36 -070059
Viresh Kumar7de962c2017-01-06 11:08:05 +053060 .setpolicy _or_ .fast_switch _or_ .target _or_ .target_index - See
61 below on the differences.
Linus Torvalds1da177e2005-04-16 15:20:36 -070062
63And optionally
64
Viresh Kumar7de962c2017-01-06 11:08:05 +053065 .flags - Hints for the cpufreq core.
Dirk Brandewie367dc4a2014-03-19 08:45:53 -070066
Viresh Kumar7de962c2017-01-06 11:08:05 +053067 .driver_data - cpufreq driver specific data.
Linus Torvalds1da177e2005-04-16 15:20:36 -070068
Viresh Kumar7de962c2017-01-06 11:08:05 +053069 .resolve_freq - Returns the most appropriate frequency for a target
70 frequency. Doesn't change the frequency though.
Linus Torvalds1da177e2005-04-16 15:20:36 -070071
Viresh Kumar7de962c2017-01-06 11:08:05 +053072 .get_intermediate and target_intermediate - Used to switch to stable
73 frequency while changing CPU frequency.
Linus Torvalds1da177e2005-04-16 15:20:36 -070074
Viresh Kumar7de962c2017-01-06 11:08:05 +053075 .get - Returns current frequency of the CPU.
76
77 .bios_limit - Returns HW/BIOS max frequency limitations for the CPU.
78
79 .exit - A pointer to a per-policy cleanup function called during
80 CPU_POST_DEAD phase of cpu hotplug process.
81
82 .stop_cpu - A pointer to a per-policy stop function called during
83 CPU_DOWN_PREPARE phase of cpu hotplug process.
84
85 .suspend - A pointer to a per-policy suspend function which is called
86 with interrupts disabled and _after_ the governor is stopped for the
87 policy.
88
89 .resume - A pointer to a per-policy resume function which is called
90 with interrupts disabled and _before_ the governor is started again.
91
92 .ready - A pointer to a per-policy ready function which is called after
93 the policy is fully initialized.
94
95 .attr - A pointer to a NULL-terminated list of "struct freq_attr" which
96 allow to export values to sysfs.
97
98 .boost_enabled - If set, boost frequencies are enabled.
99
100 .set_boost - A pointer to a per-policy function to enable/disable boost
101 frequencies.
Viresh Kumar1c03a2d2014-06-02 22:49:28 +0530102
Linus Torvalds1da177e2005-04-16 15:20:36 -0700103
1041.2 Per-CPU Initialization
105--------------------------
106
107Whenever a new CPU is registered with the device model, or after the
Viresh Kumar7de962c2017-01-06 11:08:05 +0530108cpufreq driver registers itself, the per-policy initialization function
109cpufreq_driver.init is called if no cpufreq policy existed for the CPU.
110Note that the .init() and .exit() routines are called only once for the
111policy and not for each CPU managed by the policy. It takes a struct
112cpufreq_policy *policy as argument. What to do now?
Linus Torvalds1da177e2005-04-16 15:20:36 -0700113
114If necessary, activate the CPUfreq support on your CPU.
115
116Then, the driver must fill in the following values:
117
118policy->cpuinfo.min_freq _and_
119policy->cpuinfo.max_freq - the minimum and maximum frequency
120 (in kHz) which is supported by
121 this CPU
122policy->cpuinfo.transition_latency the time it takes on this CPU to
Mark Brownbbe237a2009-11-12 16:06:45 +0000123 switch between two frequencies in
124 nanoseconds (if appropriate, else
125 specify CPUFREQ_ETERNAL)
Linus Torvalds1da177e2005-04-16 15:20:36 -0700126
127policy->cur The current operating frequency of
128 this CPU (if appropriate)
129policy->min,
130policy->max,
131policy->policy and, if necessary,
132policy->governor must contain the "default policy" for
133 this CPU. A few moments later,
134 cpufreq_driver.verify and either
135 cpufreq_driver.setpolicy or
Viresh Kumar9c0ebcf2013-10-25 19:45:48 +0530136 cpufreq_driver.target/target_index is called
137 with these values.
Viresh Kumar7de962c2017-01-06 11:08:05 +0530138policy->cpus Update this with the masks of the
139 (online + offline) CPUs that do DVFS
140 along with this CPU (i.e. that share
141 clock/voltage rails with it).
Linus Torvalds1da177e2005-04-16 15:20:36 -0700142
Viresh Kumareb2f50f2013-04-01 12:57:48 +0000143For setting some of these values (cpuinfo.min[max]_freq, policy->min[max]), the
144frequency table helpers might be helpful. See the section 2 for more information
145on them.
Linus Torvalds1da177e2005-04-16 15:20:36 -0700146
147
1481.3 verify
Viresh Kumar7de962c2017-01-06 11:08:05 +0530149----------
Linus Torvalds1da177e2005-04-16 15:20:36 -0700150
151When the user decides a new policy (consisting of
152"policy,governor,min,max") shall be set, this policy must be validated
153so that incompatible values can be corrected. For verifying these
Viresh Kumar7de962c2017-01-06 11:08:05 +0530154values cpufreq_verify_within_limits(struct cpufreq_policy *policy,
155unsigned int min_freq, unsigned int max_freq) function might be helpful.
156See section 2 for details on frequency table helpers.
Linus Torvalds1da177e2005-04-16 15:20:36 -0700157
158You need to make sure that at least one valid frequency (or operating
159range) is within policy->min and policy->max. If necessary, increase
160policy->max first, and only if this is no solution, decrease policy->min.
161
162
Viresh Kumar7de962c2017-01-06 11:08:05 +05301631.4 target or target_index or setpolicy or fast_switch?
164-------------------------------------------------------
Linus Torvalds1da177e2005-04-16 15:20:36 -0700165
166Most cpufreq drivers or even most cpu frequency scaling algorithms
Viresh Kumar7de962c2017-01-06 11:08:05 +0530167only allow the CPU frequency to be set to predefined fixed values. For
168these, you use the ->target(), ->target_index() or ->fast_switch()
169callbacks.
Linus Torvalds1da177e2005-04-16 15:20:36 -0700170
Viresh Kumar7de962c2017-01-06 11:08:05 +0530171Some cpufreq capable processors switch the frequency between certain
172limits on their own. These shall use the ->setpolicy() callback.
Linus Torvalds1da177e2005-04-16 15:20:36 -0700173
174
Viresh Kumar1c03a2d2014-06-02 22:49:28 +05301751.5. target/target_index
Viresh Kumar7de962c2017-01-06 11:08:05 +0530176------------------------
Linus Torvalds1da177e2005-04-16 15:20:36 -0700177
Viresh Kumar9c0ebcf2013-10-25 19:45:48 +0530178The target_index call has two arguments: struct cpufreq_policy *policy,
179and unsigned int index (into the exposed frequency table).
180
181The CPUfreq driver must set the new frequency when called here. The
182actual frequency must be determined by freq_table[index].frequency.
183
Viresh Kumar1c03a2d2014-06-02 22:49:28 +0530184It should always restore to earlier frequency (i.e. policy->restore_freq) in
185case of errors, even if we switched to intermediate frequency earlier.
186
Viresh Kumar9c0ebcf2013-10-25 19:45:48 +0530187Deprecated:
188----------
Linus Torvalds1da177e2005-04-16 15:20:36 -0700189The target call has three arguments: struct cpufreq_policy *policy,
190unsigned int target_frequency, unsigned int relation.
191
192The CPUfreq driver must set the new frequency when called here. The
193actual frequency must be determined using the following rules:
194
195- keep close to "target_freq"
196- policy->min <= new_freq <= policy->max (THIS MUST BE VALID!!!)
197- if relation==CPUFREQ_REL_L, try to select a new_freq higher than or equal
198 target_freq. ("L for lowest, but no lower than")
199- if relation==CPUFREQ_REL_H, try to select a new_freq lower than or equal
200 target_freq. ("H for highest, but no higher than")
201
Chumbalkar Nagananda51555c02009-05-21 23:29:48 +0000202Here again the frequency table helper might assist you - see section 2
Linus Torvalds1da177e2005-04-16 15:20:36 -0700203for details.
204
Viresh Kumar7de962c2017-01-06 11:08:05 +05302051.6. fast_switch
206----------------
Linus Torvalds1da177e2005-04-16 15:20:36 -0700207
Viresh Kumar7de962c2017-01-06 11:08:05 +0530208This function is used for frequency switching from scheduler's context.
209Not all drivers are expected to implement it, as sleeping from within
210this callback isn't allowed. This callback must be highly optimized to
211do switching as fast as possible.
212
213This function has two arguments: struct cpufreq_policy *policy and
214unsigned int target_frequency.
215
216
2171.7 setpolicy
218-------------
Linus Torvalds1da177e2005-04-16 15:20:36 -0700219
220The setpolicy call only takes a struct cpufreq_policy *policy as
221argument. You need to set the lower limit of the in-processor or
222in-chipset dynamic frequency switching to policy->min, the upper limit
223to policy->max, and -if supported- select a performance-oriented
224setting when policy->policy is CPUFREQ_POLICY_PERFORMANCE, and a
225powersaving-oriented setting when CPUFREQ_POLICY_POWERSAVE. Also check
Wanlong Gao25eb6502011-06-13 17:53:53 +0800226the reference implementation in drivers/cpufreq/longrun.c
Linus Torvalds1da177e2005-04-16 15:20:36 -0700227
Viresh Kumar7de962c2017-01-06 11:08:05 +05302281.8 get_intermediate and target_intermediate
Viresh Kumar1c03a2d2014-06-02 22:49:28 +0530229--------------------------------------------
230
231Only for drivers with target_index() and CPUFREQ_ASYNC_NOTIFICATION unset.
232
233get_intermediate should return a stable intermediate frequency platform wants to
sayli karnik54f5d132017-03-09 11:48:21 +0530234switch to, and target_intermediate() should set CPU to that frequency, before
Viresh Kumar1c03a2d2014-06-02 22:49:28 +0530235jumping to the frequency corresponding to 'index'. Core will take care of
236sending notifications and driver doesn't have to handle them in
237target_intermediate() or target_index().
238
239Drivers can return '0' from get_intermediate() in case they don't wish to switch
240to intermediate frequency for some target frequency. In that case core will
241directly call ->target_index().
242
243NOTE: ->target_index() should restore to policy->restore_freq in case of
244failures as core would send notifications for that.
Linus Torvalds1da177e2005-04-16 15:20:36 -0700245
246
2472. Frequency Table Helpers
248==========================
249
250As most cpufreq processors only allow for being set to a few specific
251frequencies, a "frequency table" with some functions might assist in
Viresh Kumar7de962c2017-01-06 11:08:05 +0530252some work of the processor driver. Such a "frequency table" consists of
253an array of struct cpufreq_frequency_table entries, with driver specific
254values in "driver_data", the corresponding frequency in "frequency" and
255flags set. At the end of the table, you need to add a
256cpufreq_frequency_table entry with frequency set to CPUFREQ_TABLE_END.
257And if you want to skip one entry in the table, set the frequency to
258CPUFREQ_ENTRY_INVALID. The entries don't need to be in sorted in any
259particular order, but if they are cpufreq core will do DVFS a bit
260quickly for them as search for best match is faster.
Linus Torvalds1da177e2005-04-16 15:20:36 -0700261
Viresh Kumar2dd0df82018-04-03 15:37:39 +0530262The cpufreq table is verified automatically by the core if the policy contains a
263valid pointer in its policy->freq_table field.
Linus Torvalds1da177e2005-04-16 15:20:36 -0700264
Viresh Kumar7de962c2017-01-06 11:08:05 +0530265cpufreq_frequency_table_verify() assures that at least one valid
266frequency is within policy->min and policy->max, and all other criteria
267are met. This is helpful for the ->verify call.
Linus Torvalds1da177e2005-04-16 15:20:36 -0700268
Viresh Kumar7de962c2017-01-06 11:08:05 +0530269cpufreq_frequency_table_target() is the corresponding frequency table
270helper for the ->target stage. Just pass the values to this function,
271and this function returns the of the frequency table entry which
272contains the frequency the CPU shall be set to.
Stratos Karafotis27e289d2014-04-25 23:15:23 +0300273
274The following macros can be used as iterators over cpufreq_frequency_table:
275
276cpufreq_for_each_entry(pos, table) - iterates over all entries of frequency
277table.
278
Viresh Kumar7de962c2017-01-06 11:08:05 +0530279cpufreq_for_each_valid_entry(pos, table) - iterates over all entries,
Stratos Karafotis27e289d2014-04-25 23:15:23 +0300280excluding CPUFREQ_ENTRY_INVALID frequencies.
281Use arguments "pos" - a cpufreq_frequency_table * as a loop cursor and
282"table" - the cpufreq_frequency_table * you want to iterate over.
283
284For example:
285
286 struct cpufreq_frequency_table *pos, *driver_freq_table;
287
288 cpufreq_for_each_entry(pos, driver_freq_table) {
289 /* Do something with pos */
290 pos->frequency = ...
291 }
Dominik Brodowskiffd81dc2018-01-30 06:42:37 +0100292
293If you need to work with the position of pos within driver_freq_table,
294do not subtract the pointers, as it is quite costly. Instead, use the
295macros cpufreq_for_each_entry_idx() and cpufreq_for_each_valid_entry_idx().