bfq-iosched.txt 25 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173174175176177178179180181182183184185186187188189190191192193194195196197198199200201202203204205206207208209210211212213214215216217218219220221222223224225226227228229230231232233234235236237238239240241242243244245246247248249250251252253254255256257258259260261262263264265266267268269270271272273274275276277278279280281282283284285286287288289290291292293294295296297298299300301302303304305306307308309310311312313314315316317318319320321322323324325326327328329330331332333334335336337338339340341342343344345346347348349350351352353354355356357358359360361362363364365366367368369370371372373374375376377378379380381382383384385386387388389390391392393394395396397398399400401402403404405406407408409410411412413414415416417418419420421422423424425426427428429430431432433434435436437438439440441442443444445446447448449450451452453454455456457458459460461462463464465466467468469470471472473474475476477478479480481482483484485486487488489490491492493494495496497498499500501502503504505506507508509510511512513514515516517518519520521522523524525526527528529530531532533534535536537538539540541542543544545546547548549550551552553554555556557558559560561
  1. BFQ (Budget Fair Queueing)
  2. ==========================
  3. BFQ is a proportional-share I/O scheduler, with some extra
  4. low-latency capabilities. In addition to cgroups support (blkio or io
  5. controllers), BFQ's main features are:
  6. - BFQ guarantees a high system and application responsiveness, and a
  7. low latency for time-sensitive applications, such as audio or video
  8. players;
  9. - BFQ distributes bandwidth, and not just time, among processes or
  10. groups (switching back to time distribution when needed to keep
  11. throughput high).
  12. In its default configuration, BFQ privileges latency over
  13. throughput. So, when needed for achieving a lower latency, BFQ builds
  14. schedules that may lead to a lower throughput. If your main or only
  15. goal, for a given device, is to achieve the maximum-possible
  16. throughput at all times, then do switch off all low-latency heuristics
  17. for that device, by setting low_latency to 0. See Section 3 for
  18. details on how to configure BFQ for the desired tradeoff between
  19. latency and throughput, or on how to maximize throughput.
  20. BFQ has a non-null overhead, which limits the maximum IOPS that a CPU
  21. can process for a device scheduled with BFQ. To give an idea of the
  22. limits on slow or average CPUs, here are, first, the limits of BFQ for
  23. three different CPUs, on, respectively, an average laptop, an old
  24. desktop, and a cheap embedded system, in case full hierarchical
  25. support is enabled (i.e., CONFIG_BFQ_GROUP_IOSCHED is set), but
  26. CONFIG_DEBUG_BLK_CGROUP is not set (Section 4-2):
  27. - Intel i7-4850HQ: 400 KIOPS
  28. - AMD A8-3850: 250 KIOPS
  29. - ARM CortexTM-A53 Octa-core: 80 KIOPS
  30. If CONFIG_DEBUG_BLK_CGROUP is set (and of course full hierarchical
  31. support is enabled), then the sustainable throughput with BFQ
  32. decreases, because all blkio.bfq* statistics are created and updated
  33. (Section 4-2). For BFQ, this leads to the following maximum
  34. sustainable throughputs, on the same systems as above:
  35. - Intel i7-4850HQ: 310 KIOPS
  36. - AMD A8-3850: 200 KIOPS
  37. - ARM CortexTM-A53 Octa-core: 56 KIOPS
  38. BFQ works for multi-queue devices too.
  39. The table of contents follow. Impatients can just jump to Section 3.
  40. CONTENTS
  41. 1. When may BFQ be useful?
  42. 1-1 Personal systems
  43. 1-2 Server systems
  44. 2. How does BFQ work?
  45. 3. What are BFQ's tunables and how to properly configure BFQ?
  46. 4. BFQ group scheduling
  47. 4-1 Service guarantees provided
  48. 4-2 Interface
  49. 1. When may BFQ be useful?
  50. ==========================
  51. BFQ provides the following benefits on personal and server systems.
  52. 1-1 Personal systems
  53. --------------------
  54. Low latency for interactive applications
  55. Regardless of the actual background workload, BFQ guarantees that, for
  56. interactive tasks, the storage device is virtually as responsive as if
  57. it was idle. For example, even if one or more of the following
  58. background workloads are being executed:
  59. - one or more large files are being read, written or copied,
  60. - a tree of source files is being compiled,
  61. - one or more virtual machines are performing I/O,
  62. - a software update is in progress,
  63. - indexing daemons are scanning filesystems and updating their
  64. databases,
  65. starting an application or loading a file from within an application
  66. takes about the same time as if the storage device was idle. As a
  67. comparison, with CFQ, NOOP or DEADLINE, and in the same conditions,
  68. applications experience high latencies, or even become unresponsive
  69. until the background workload terminates (also on SSDs).
  70. Low latency for soft real-time applications
  71. Also soft real-time applications, such as audio and video
  72. players/streamers, enjoy a low latency and a low drop rate, regardless
  73. of the background I/O workload. As a consequence, these applications
  74. do not suffer from almost any glitch due to the background workload.
  75. Higher speed for code-development tasks
  76. If some additional workload happens to be executed in parallel, then
  77. BFQ executes the I/O-related components of typical code-development
  78. tasks (compilation, checkout, merge, ...) much more quickly than CFQ,
  79. NOOP or DEADLINE.
  80. High throughput
  81. On hard disks, BFQ achieves up to 30% higher throughput than CFQ, and
  82. up to 150% higher throughput than DEADLINE and NOOP, with all the
  83. sequential workloads considered in our tests. With random workloads,
  84. and with all the workloads on flash-based devices, BFQ achieves,
  85. instead, about the same throughput as the other schedulers.
  86. Strong fairness, bandwidth and delay guarantees
  87. BFQ distributes the device throughput, and not just the device time,
  88. among I/O-bound applications in proportion their weights, with any
  89. workload and regardless of the device parameters. From these bandwidth
  90. guarantees, it is possible to compute tight per-I/O-request delay
  91. guarantees by a simple formula. If not configured for strict service
  92. guarantees, BFQ switches to time-based resource sharing (only) for
  93. applications that would otherwise cause a throughput loss.
  94. 1-2 Server systems
  95. ------------------
  96. Most benefits for server systems follow from the same service
  97. properties as above. In particular, regardless of whether additional,
  98. possibly heavy workloads are being served, BFQ guarantees:
  99. . audio and video-streaming with zero or very low jitter and drop
  100. rate;
  101. . fast retrieval of WEB pages and embedded objects;
  102. . real-time recording of data in live-dumping applications (e.g.,
  103. packet logging);
  104. . responsiveness in local and remote access to a server.
  105. 2. How does BFQ work?
  106. =====================
  107. BFQ is a proportional-share I/O scheduler, whose general structure,
  108. plus a lot of code, are borrowed from CFQ.
  109. - Each process doing I/O on a device is associated with a weight and a
  110. (bfq_)queue.
  111. - BFQ grants exclusive access to the device, for a while, to one queue
  112. (process) at a time, and implements this service model by
  113. associating every queue with a budget, measured in number of
  114. sectors.
  115. - After a queue is granted access to the device, the budget of the
  116. queue is decremented, on each request dispatch, by the size of the
  117. request.
  118. - The in-service queue is expired, i.e., its service is suspended,
  119. only if one of the following events occurs: 1) the queue finishes
  120. its budget, 2) the queue empties, 3) a "budget timeout" fires.
  121. - The budget timeout prevents processes doing random I/O from
  122. holding the device for too long and dramatically reducing
  123. throughput.
  124. - Actually, as in CFQ, a queue associated with a process issuing
  125. sync requests may not be expired immediately when it empties. In
  126. contrast, BFQ may idle the device for a short time interval,
  127. giving the process the chance to go on being served if it issues
  128. a new request in time. Device idling typically boosts the
  129. throughput on rotational devices and on non-queueing flash-based
  130. devices, if processes do synchronous and sequential I/O. In
  131. addition, under BFQ, device idling is also instrumental in
  132. guaranteeing the desired throughput fraction to processes
  133. issuing sync requests (see the description of the slice_idle
  134. tunable in this document, or [1, 2], for more details).
  135. - With respect to idling for service guarantees, if several
  136. processes are competing for the device at the same time, but
  137. all processes and groups have the same weight, then BFQ
  138. guarantees the expected throughput distribution without ever
  139. idling the device. Throughput is thus as high as possible in
  140. this common scenario.
  141. - On flash-based storage with internal queueing of commands
  142. (typically NCQ), device idling happens to be always detrimental
  143. for throughput. So, with these devices, BFQ performs idling
  144. only when strictly needed for service guarantees, i.e., for
  145. guaranteeing low latency or fairness. In these cases, overall
  146. throughput may be sub-optimal. No solution currently exists to
  147. provide both strong service guarantees and optimal throughput
  148. on devices with internal queueing.
  149. - If low-latency mode is enabled (default configuration), BFQ
  150. executes some special heuristics to detect interactive and soft
  151. real-time applications (e.g., video or audio players/streamers),
  152. and to reduce their latency. The most important action taken to
  153. achieve this goal is to give to the queues associated with these
  154. applications more than their fair share of the device
  155. throughput. For brevity, we call just "weight-raising" the whole
  156. sets of actions taken by BFQ to privilege these queues. In
  157. particular, BFQ provides a milder form of weight-raising for
  158. interactive applications, and a stronger form for soft real-time
  159. applications.
  160. - BFQ automatically deactivates idling for queues born in a burst of
  161. queue creations. In fact, these queues are usually associated with
  162. the processes of applications and services that benefit mostly
  163. from a high throughput. Examples are systemd during boot, or git
  164. grep.
  165. - As CFQ, BFQ merges queues performing interleaved I/O, i.e.,
  166. performing random I/O that becomes mostly sequential if
  167. merged. Differently from CFQ, BFQ achieves this goal with a more
  168. reactive mechanism, called Early Queue Merge (EQM). EQM is so
  169. responsive in detecting interleaved I/O (cooperating processes),
  170. that it enables BFQ to achieve a high throughput, by queue
  171. merging, even for queues for which CFQ needs a different
  172. mechanism, preemption, to get a high throughput. As such EQM is a
  173. unified mechanism to achieve a high throughput with interleaved
  174. I/O.
  175. - Queues are scheduled according to a variant of WF2Q+, named
  176. B-WF2Q+, and implemented using an augmented rb-tree to preserve an
  177. O(log N) overall complexity. See [2] for more details. B-WF2Q+ is
  178. also ready for hierarchical scheduling, details in Section 4.
  179. - B-WF2Q+ guarantees a tight deviation with respect to an ideal,
  180. perfectly fair, and smooth service. In particular, B-WF2Q+
  181. guarantees that each queue receives a fraction of the device
  182. throughput proportional to its weight, even if the throughput
  183. fluctuates, and regardless of: the device parameters, the current
  184. workload and the budgets assigned to the queue.
  185. - The last, budget-independence, property (although probably
  186. counterintuitive in the first place) is definitely beneficial, for
  187. the following reasons:
  188. - First, with any proportional-share scheduler, the maximum
  189. deviation with respect to an ideal service is proportional to
  190. the maximum budget (slice) assigned to queues. As a consequence,
  191. BFQ can keep this deviation tight not only because of the
  192. accurate service of B-WF2Q+, but also because BFQ *does not*
  193. need to assign a larger budget to a queue to let the queue
  194. receive a higher fraction of the device throughput.
  195. - Second, BFQ is free to choose, for every process (queue), the
  196. budget that best fits the needs of the process, or best
  197. leverages the I/O pattern of the process. In particular, BFQ
  198. updates queue budgets with a simple feedback-loop algorithm that
  199. allows a high throughput to be achieved, while still providing
  200. tight latency guarantees to time-sensitive applications. When
  201. the in-service queue expires, this algorithm computes the next
  202. budget of the queue so as to:
  203. - Let large budgets be eventually assigned to the queues
  204. associated with I/O-bound applications performing sequential
  205. I/O: in fact, the longer these applications are served once
  206. got access to the device, the higher the throughput is.
  207. - Let small budgets be eventually assigned to the queues
  208. associated with time-sensitive applications (which typically
  209. perform sporadic and short I/O), because, the smaller the
  210. budget assigned to a queue waiting for service is, the sooner
  211. B-WF2Q+ will serve that queue (Subsec 3.3 in [2]).
  212. - If several processes are competing for the device at the same time,
  213. but all processes and groups have the same weight, then BFQ
  214. guarantees the expected throughput distribution without ever idling
  215. the device. It uses preemption instead. Throughput is then much
  216. higher in this common scenario.
  217. - ioprio classes are served in strict priority order, i.e.,
  218. lower-priority queues are not served as long as there are
  219. higher-priority queues. Among queues in the same class, the
  220. bandwidth is distributed in proportion to the weight of each
  221. queue. A very thin extra bandwidth is however guaranteed to
  222. the Idle class, to prevent it from starving.
  223. 3. What are BFQ's tunables and how to properly configure BFQ?
  224. =============================================================
  225. Most BFQ tunables affect service guarantees (basically latency and
  226. fairness) and throughput. For full details on how to choose the
  227. desired tradeoff between service guarantees and throughput, see the
  228. parameters slice_idle, strict_guarantees and low_latency. For details
  229. on how to maximise throughput, see slice_idle, timeout_sync and
  230. max_budget. The other performance-related parameters have been
  231. inherited from, and have been preserved mostly for compatibility with
  232. CFQ. So far, no performance improvement has been reported after
  233. changing the latter parameters in BFQ.
  234. In particular, the tunables back_seek-max, back_seek_penalty,
  235. fifo_expire_async and fifo_expire_sync below are the same as in
  236. CFQ. Their description is just copied from that for CFQ. Some
  237. considerations in the description of slice_idle are copied from CFQ
  238. too.
  239. per-process ioprio and weight
  240. -----------------------------
  241. Unless the cgroups interface is used (see "4. BFQ group scheduling"),
  242. weights can be assigned to processes only indirectly, through I/O
  243. priorities, and according to the relation:
  244. weight = (IOPRIO_BE_NR - ioprio) * 10.
  245. Beware that, if low-latency is set, then BFQ automatically raises the
  246. weight of the queues associated with interactive and soft real-time
  247. applications. Unset this tunable if you need/want to control weights.
  248. slice_idle
  249. ----------
  250. This parameter specifies how long BFQ should idle for next I/O
  251. request, when certain sync BFQ queues become empty. By default
  252. slice_idle is a non-zero value. Idling has a double purpose: boosting
  253. throughput and making sure that the desired throughput distribution is
  254. respected (see the description of how BFQ works, and, if needed, the
  255. papers referred there).
  256. As for throughput, idling can be very helpful on highly seeky media
  257. like single spindle SATA/SAS disks where we can cut down on overall
  258. number of seeks and see improved throughput.
  259. Setting slice_idle to 0 will remove all the idling on queues and one
  260. should see an overall improved throughput on faster storage devices
  261. like multiple SATA/SAS disks in hardware RAID configuration, as well
  262. as flash-based storage with internal command queueing (and
  263. parallelism).
  264. So depending on storage and workload, it might be useful to set
  265. slice_idle=0. In general for SATA/SAS disks and software RAID of
  266. SATA/SAS disks keeping slice_idle enabled should be useful. For any
  267. configurations where there are multiple spindles behind single LUN
  268. (Host based hardware RAID controller or for storage arrays), or with
  269. flash-based fast storage, setting slice_idle=0 might end up in better
  270. throughput and acceptable latencies.
  271. Idling is however necessary to have service guarantees enforced in
  272. case of differentiated weights or differentiated I/O-request lengths.
  273. To see why, suppose that a given BFQ queue A must get several I/O
  274. requests served for each request served for another queue B. Idling
  275. ensures that, if A makes a new I/O request slightly after becoming
  276. empty, then no request of B is dispatched in the middle, and thus A
  277. does not lose the possibility to get more than one request dispatched
  278. before the next request of B is dispatched. Note that idling
  279. guarantees the desired differentiated treatment of queues only in
  280. terms of I/O-request dispatches. To guarantee that the actual service
  281. order then corresponds to the dispatch order, the strict_guarantees
  282. tunable must be set too.
  283. There is an important flipside for idling: apart from the above cases
  284. where it is beneficial also for throughput, idling can severely impact
  285. throughput. One important case is random workload. Because of this
  286. issue, BFQ tends to avoid idling as much as possible, when it is not
  287. beneficial also for throughput (as detailed in Section 2). As a
  288. consequence of this behavior, and of further issues described for the
  289. strict_guarantees tunable, short-term service guarantees may be
  290. occasionally violated. And, in some cases, these guarantees may be
  291. more important than guaranteeing maximum throughput. For example, in
  292. video playing/streaming, a very low drop rate may be more important
  293. than maximum throughput. In these cases, consider setting the
  294. strict_guarantees parameter.
  295. strict_guarantees
  296. -----------------
  297. If this parameter is set (default: unset), then BFQ
  298. - always performs idling when the in-service queue becomes empty;
  299. - forces the device to serve one I/O request at a time, by dispatching a
  300. new request only if there is no outstanding request.
  301. In the presence of differentiated weights or I/O-request sizes, both
  302. the above conditions are needed to guarantee that every BFQ queue
  303. receives its allotted share of the bandwidth. The first condition is
  304. needed for the reasons explained in the description of the slice_idle
  305. tunable. The second condition is needed because all modern storage
  306. devices reorder internally-queued requests, which may trivially break
  307. the service guarantees enforced by the I/O scheduler.
  308. Setting strict_guarantees may evidently affect throughput.
  309. back_seek_max
  310. -------------
  311. This specifies, given in Kbytes, the maximum "distance" for backward seeking.
  312. The distance is the amount of space from the current head location to the
  313. sectors that are backward in terms of distance.
  314. This parameter allows the scheduler to anticipate requests in the "backward"
  315. direction and consider them as being the "next" if they are within this
  316. distance from the current head location.
  317. back_seek_penalty
  318. -----------------
  319. This parameter is used to compute the cost of backward seeking. If the
  320. backward distance of request is just 1/back_seek_penalty from a "front"
  321. request, then the seeking cost of two requests is considered equivalent.
  322. So scheduler will not bias toward one or the other request (otherwise scheduler
  323. will bias toward front request). Default value of back_seek_penalty is 2.
  324. fifo_expire_async
  325. -----------------
  326. This parameter is used to set the timeout of asynchronous requests. Default
  327. value of this is 248ms.
  328. fifo_expire_sync
  329. ----------------
  330. This parameter is used to set the timeout of synchronous requests. Default
  331. value of this is 124ms. In case to favor synchronous requests over asynchronous
  332. one, this value should be decreased relative to fifo_expire_async.
  333. low_latency
  334. -----------
  335. This parameter is used to enable/disable BFQ's low latency mode. By
  336. default, low latency mode is enabled. If enabled, interactive and soft
  337. real-time applications are privileged and experience a lower latency,
  338. as explained in more detail in the description of how BFQ works.
  339. DISABLE this mode if you need full control on bandwidth
  340. distribution. In fact, if it is enabled, then BFQ automatically
  341. increases the bandwidth share of privileged applications, as the main
  342. means to guarantee a lower latency to them.
  343. In addition, as already highlighted at the beginning of this document,
  344. DISABLE this mode if your only goal is to achieve a high throughput.
  345. In fact, privileging the I/O of some application over the rest may
  346. entail a lower throughput. To achieve the highest-possible throughput
  347. on a non-rotational device, setting slice_idle to 0 may be needed too
  348. (at the cost of giving up any strong guarantee on fairness and low
  349. latency).
  350. timeout_sync
  351. ------------
  352. Maximum amount of device time that can be given to a task (queue) once
  353. it has been selected for service. On devices with costly seeks,
  354. increasing this time usually increases maximum throughput. On the
  355. opposite end, increasing this time coarsens the granularity of the
  356. short-term bandwidth and latency guarantees, especially if the
  357. following parameter is set to zero.
  358. max_budget
  359. ----------
  360. Maximum amount of service, measured in sectors, that can be provided
  361. to a BFQ queue once it is set in service (of course within the limits
  362. of the above timeout). According to what said in the description of
  363. the algorithm, larger values increase the throughput in proportion to
  364. the percentage of sequential I/O requests issued. The price of larger
  365. values is that they coarsen the granularity of short-term bandwidth
  366. and latency guarantees.
  367. The default value is 0, which enables auto-tuning: BFQ sets max_budget
  368. to the maximum number of sectors that can be served during
  369. timeout_sync, according to the estimated peak rate.
  370. For specific devices, some users have occasionally reported to have
  371. reached a higher throughput by setting max_budget explicitly, i.e., by
  372. setting max_budget to a higher value than 0. In particular, they have
  373. set max_budget to higher values than those to which BFQ would have set
  374. it with auto-tuning. An alternative way to achieve this goal is to
  375. just increase the value of timeout_sync, leaving max_budget equal to 0.
  376. weights
  377. -------
  378. Read-only parameter, used to show the weights of the currently active
  379. BFQ queues.
  380. 4. Group scheduling with BFQ
  381. ============================
  382. BFQ supports both cgroups-v1 and cgroups-v2 io controllers, namely
  383. blkio and io. In particular, BFQ supports weight-based proportional
  384. share. To activate cgroups support, set BFQ_GROUP_IOSCHED.
  385. 4-1 Service guarantees provided
  386. -------------------------------
  387. With BFQ, proportional share means true proportional share of the
  388. device bandwidth, according to group weights. For example, a group
  389. with weight 200 gets twice the bandwidth, and not just twice the time,
  390. of a group with weight 100.
  391. BFQ supports hierarchies (group trees) of any depth. Bandwidth is
  392. distributed among groups and processes in the expected way: for each
  393. group, the children of the group share the whole bandwidth of the
  394. group in proportion to their weights. In particular, this implies
  395. that, for each leaf group, every process of the group receives the
  396. same share of the whole group bandwidth, unless the ioprio of the
  397. process is modified.
  398. The resource-sharing guarantee for a group may partially or totally
  399. switch from bandwidth to time, if providing bandwidth guarantees to
  400. the group lowers the throughput too much. This switch occurs on a
  401. per-process basis: if a process of a leaf group causes throughput loss
  402. if served in such a way to receive its share of the bandwidth, then
  403. BFQ switches back to just time-based proportional share for that
  404. process.
  405. 4-2 Interface
  406. -------------
  407. To get proportional sharing of bandwidth with BFQ for a given device,
  408. BFQ must of course be the active scheduler for that device.
  409. Within each group directory, the names of the files associated with
  410. BFQ-specific cgroup parameters and stats begin with the "bfq."
  411. prefix. So, with cgroups-v1 or cgroups-v2, the full prefix for
  412. BFQ-specific files is "blkio.bfq." or "io.bfq." For example, the group
  413. parameter to set the weight of a group with BFQ is blkio.bfq.weight
  414. or io.bfq.weight.
  415. As for cgroups-v1 (blkio controller), the exact set of stat files
  416. created, and kept up-to-date by bfq, depends on whether
  417. CONFIG_DEBUG_BLK_CGROUP is set. If it is set, then bfq creates all
  418. the stat files documented in
  419. Documentation/cgroup-v1/blkio-controller.txt. If, instead,
  420. CONFIG_DEBUG_BLK_CGROUP is not set, then bfq creates only the files
  421. blkio.bfq.io_service_bytes
  422. blkio.bfq.io_service_bytes_recursive
  423. blkio.bfq.io_serviced
  424. blkio.bfq.io_serviced_recursive
  425. The value of CONFIG_DEBUG_BLK_CGROUP greatly influences the maximum
  426. throughput sustainable with bfq, because updating the blkio.bfq.*
  427. stats is rather costly, especially for some of the stats enabled by
  428. CONFIG_DEBUG_BLK_CGROUP.
  429. Parameters to set
  430. -----------------
  431. For each group, there is only the following parameter to set.
  432. weight (namely blkio.bfq.weight or io.bfq-weight): the weight of the
  433. group inside its parent. Available values: 1..10000 (default 100). The
  434. linear mapping between ioprio and weights, described at the beginning
  435. of the tunable section, is still valid, but all weights higher than
  436. IOPRIO_BE_NR*10 are mapped to ioprio 0.
  437. Recall that, if low-latency is set, then BFQ automatically raises the
  438. weight of the queues associated with interactive and soft real-time
  439. applications. Unset this tunable if you need/want to control weights.
  440. [1] P. Valente, A. Avanzini, "Evolution of the BFQ Storage I/O
  441. Scheduler", Proceedings of the First Workshop on Mobile System
  442. Technologies (MST-2015), May 2015.
  443. http://algogroup.unimore.it/people/paolo/disk_sched/mst-2015.pdf
  444. [2] P. Valente and M. Andreolini, "Improving Application
  445. Responsiveness with the BFQ Disk I/O Scheduler", Proceedings of
  446. the 5th Annual International Systems and Storage Conference
  447. (SYSTOR '12), June 2012.
  448. Slightly extended version:
  449. http://algogroup.unimore.it/people/paolo/disk_sched/bfq-v1-suite-
  450. results.pdf