kfuncs.rst 25 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173174175176177178179180181182183184185186187188189190191192193194195196197198199200201202203204205206207208209210211212213214215216217218219220221222223224225226227228229230231232233234235236237238239240241242243244245246247248249250251252253254255256257258259260261262263264265266267268269270271272273274275276277278279280281282283284285286287288289290291292293294295296297298299300301302303304305306307308309310311312313314315316317318319320321322323324325326327328329330331332333334335336337338339340341342343344345346347348349350351352353354355356357358359360361362363364365366367368369370371372373374375376377378379380381382383384385386387388389390391392393394395396397398399400401402403404405406407408409410411412413414415416417418419420421422423424425426427428429430431432433434435436437438439440441442443444445446447448449450451452453454455456457458459460461462463464465466467468469470471472473474475476477478479480481482483484485486487488489490491492493494495496497498499500501502503504505506507508509510511512513514515516517518519520521522523524525526527528529530531532533534535536537538539540541542543544545546547548549550551552553554555556557558559560561562563564565566567568569570571572573574575576577578579580581582583584585586587588589590591592593594595596597598599600601602603604605606607608609610611612613614615616617618619620621622623624625626627628629630631632633634635636637638639640641642643644645646647648649650651652653654655656657658659660661662663664665666667668669670671672673674675676677678
  1. .. SPDX-License-Identifier: GPL-2.0
  2. .. _kfuncs-header-label:
  3. =============================
  4. BPF Kernel Functions (kfuncs)
  5. =============================
  6. 1. Introduction
  7. ===============
  8. BPF Kernel Functions or more commonly known as kfuncs are functions in the Linux
  9. kernel which are exposed for use by BPF programs. Unlike normal BPF helpers,
  10. kfuncs do not have a stable interface and can change from one kernel release to
  11. another. Hence, BPF programs need to be updated in response to changes in the
  12. kernel. See :ref:`BPF_kfunc_lifecycle_expectations` for more information.
  13. 2. Defining a kfunc
  14. ===================
  15. There are two ways to expose a kernel function to BPF programs, either make an
  16. existing function in the kernel visible, or add a new wrapper for BPF. In both
  17. cases, care must be taken that BPF program can only call such function in a
  18. valid context. To enforce this, visibility of a kfunc can be per program type.
  19. If you are not creating a BPF wrapper for existing kernel function, skip ahead
  20. to :ref:`BPF_kfunc_nodef`.
  21. 2.1 Creating a wrapper kfunc
  22. ----------------------------
  23. When defining a wrapper kfunc, the wrapper function should have extern linkage.
  24. This prevents the compiler from optimizing away dead code, as this wrapper kfunc
  25. is not invoked anywhere in the kernel itself. It is not necessary to provide a
  26. prototype in a header for the wrapper kfunc.
  27. An example is given below::
  28. /* Disables missing prototype warnings */
  29. __bpf_kfunc_start_defs();
  30. __bpf_kfunc struct task_struct *bpf_find_get_task_by_vpid(pid_t nr)
  31. {
  32. return find_get_task_by_vpid(nr);
  33. }
  34. __bpf_kfunc_end_defs();
  35. A wrapper kfunc is often needed when we need to annotate parameters of the
  36. kfunc. Otherwise one may directly make the kfunc visible to the BPF program by
  37. registering it with the BPF subsystem. See :ref:`BPF_kfunc_nodef`.
  38. 2.2 Annotating kfunc parameters
  39. -------------------------------
  40. Similar to BPF helpers, there is sometime need for additional context required
  41. by the verifier to make the usage of kernel functions safer and more useful.
  42. Hence, we can annotate a parameter by suffixing the name of the argument of the
  43. kfunc with a __tag, where tag may be one of the supported annotations.
  44. 2.2.1 __sz Annotation
  45. ---------------------
  46. This annotation is used to indicate a memory and size pair in the argument list.
  47. An example is given below::
  48. __bpf_kfunc void bpf_memzero(void *mem, int mem__sz)
  49. {
  50. ...
  51. }
  52. Here, the verifier will treat first argument as a PTR_TO_MEM, and second
  53. argument as its size. By default, without __sz annotation, the size of the type
  54. of the pointer is used. Without __sz annotation, a kfunc cannot accept a void
  55. pointer.
  56. 2.2.2 __k Annotation
  57. --------------------
  58. This annotation is only understood for scalar arguments, where it indicates that
  59. the verifier must check the scalar argument to be a known constant, which does
  60. not indicate a size parameter, and the value of the constant is relevant to the
  61. safety of the program.
  62. An example is given below::
  63. __bpf_kfunc void *bpf_obj_new(u32 local_type_id__k, ...)
  64. {
  65. ...
  66. }
  67. Here, bpf_obj_new uses local_type_id argument to find out the size of that type
  68. ID in program's BTF and return a sized pointer to it. Each type ID will have a
  69. distinct size, hence it is crucial to treat each such call as distinct when
  70. values don't match during verifier state pruning checks.
  71. Hence, whenever a constant scalar argument is accepted by a kfunc which is not a
  72. size parameter, and the value of the constant matters for program safety, __k
  73. suffix should be used.
  74. 2.2.3 __uninit Annotation
  75. -------------------------
  76. This annotation is used to indicate that the argument will be treated as
  77. uninitialized.
  78. An example is given below::
  79. __bpf_kfunc int bpf_dynptr_from_skb(..., struct bpf_dynptr_kern *ptr__uninit)
  80. {
  81. ...
  82. }
  83. Here, the dynptr will be treated as an uninitialized dynptr. Without this
  84. annotation, the verifier will reject the program if the dynptr passed in is
  85. not initialized.
  86. 2.2.4 __opt Annotation
  87. -------------------------
  88. This annotation is used to indicate that the buffer associated with an __sz or __szk
  89. argument may be null. If the function is passed a nullptr in place of the buffer,
  90. the verifier will not check that length is appropriate for the buffer. The kfunc is
  91. responsible for checking if this buffer is null before using it.
  92. An example is given below::
  93. __bpf_kfunc void *bpf_dynptr_slice(..., void *buffer__opt, u32 buffer__szk)
  94. {
  95. ...
  96. }
  97. Here, the buffer may be null. If buffer is not null, it at least of size buffer_szk.
  98. Either way, the returned buffer is either NULL, or of size buffer_szk. Without this
  99. annotation, the verifier will reject the program if a null pointer is passed in with
  100. a nonzero size.
  101. 2.2.5 __str Annotation
  102. ----------------------------
  103. This annotation is used to indicate that the argument is a constant string.
  104. An example is given below::
  105. __bpf_kfunc bpf_get_file_xattr(..., const char *name__str, ...)
  106. {
  107. ...
  108. }
  109. In this case, ``bpf_get_file_xattr()`` can be called as::
  110. bpf_get_file_xattr(..., "xattr_name", ...);
  111. Or::
  112. const char name[] = "xattr_name"; /* This need to be global */
  113. int BPF_PROG(...)
  114. {
  115. ...
  116. bpf_get_file_xattr(..., name, ...);
  117. ...
  118. }
  119. .. _BPF_kfunc_nodef:
  120. 2.3 Using an existing kernel function
  121. -------------------------------------
  122. When an existing function in the kernel is fit for consumption by BPF programs,
  123. it can be directly registered with the BPF subsystem. However, care must still
  124. be taken to review the context in which it will be invoked by the BPF program
  125. and whether it is safe to do so.
  126. 2.4 Annotating kfuncs
  127. ---------------------
  128. In addition to kfuncs' arguments, verifier may need more information about the
  129. type of kfunc(s) being registered with the BPF subsystem. To do so, we define
  130. flags on a set of kfuncs as follows::
  131. BTF_KFUNCS_START(bpf_task_set)
  132. BTF_ID_FLAGS(func, bpf_get_task_pid, KF_ACQUIRE | KF_RET_NULL)
  133. BTF_ID_FLAGS(func, bpf_put_pid, KF_RELEASE)
  134. BTF_KFUNCS_END(bpf_task_set)
  135. This set encodes the BTF ID of each kfunc listed above, and encodes the flags
  136. along with it. Ofcourse, it is also allowed to specify no flags.
  137. kfunc definitions should also always be annotated with the ``__bpf_kfunc``
  138. macro. This prevents issues such as the compiler inlining the kfunc if it's a
  139. static kernel function, or the function being elided in an LTO build as it's
  140. not used in the rest of the kernel. Developers should not manually add
  141. annotations to their kfunc to prevent these issues. If an annotation is
  142. required to prevent such an issue with your kfunc, it is a bug and should be
  143. added to the definition of the macro so that other kfuncs are similarly
  144. protected. An example is given below::
  145. __bpf_kfunc struct task_struct *bpf_get_task_pid(s32 pid)
  146. {
  147. ...
  148. }
  149. 2.4.1 KF_ACQUIRE flag
  150. ---------------------
  151. The KF_ACQUIRE flag is used to indicate that the kfunc returns a pointer to a
  152. refcounted object. The verifier will then ensure that the pointer to the object
  153. is eventually released using a release kfunc, or transferred to a map using a
  154. referenced kptr (by invoking bpf_kptr_xchg). If not, the verifier fails the
  155. loading of the BPF program until no lingering references remain in all possible
  156. explored states of the program.
  157. 2.4.2 KF_RET_NULL flag
  158. ----------------------
  159. The KF_RET_NULL flag is used to indicate that the pointer returned by the kfunc
  160. may be NULL. Hence, it forces the user to do a NULL check on the pointer
  161. returned from the kfunc before making use of it (dereferencing or passing to
  162. another helper). This flag is often used in pairing with KF_ACQUIRE flag, but
  163. both are orthogonal to each other.
  164. 2.4.3 KF_RELEASE flag
  165. ---------------------
  166. The KF_RELEASE flag is used to indicate that the kfunc releases the pointer
  167. passed in to it. There can be only one referenced pointer that can be passed
  168. in. All copies of the pointer being released are invalidated as a result of
  169. invoking kfunc with this flag. KF_RELEASE kfuncs automatically receive the
  170. protection afforded by the KF_TRUSTED_ARGS flag described below.
  171. 2.4.4 KF_TRUSTED_ARGS flag
  172. --------------------------
  173. The KF_TRUSTED_ARGS flag is used for kfuncs taking pointer arguments. It
  174. indicates that the all pointer arguments are valid, and that all pointers to
  175. BTF objects have been passed in their unmodified form (that is, at a zero
  176. offset, and without having been obtained from walking another pointer, with one
  177. exception described below).
  178. There are two types of pointers to kernel objects which are considered "valid":
  179. 1. Pointers which are passed as tracepoint or struct_ops callback arguments.
  180. 2. Pointers which were returned from a KF_ACQUIRE kfunc.
  181. Pointers to non-BTF objects (e.g. scalar pointers) may also be passed to
  182. KF_TRUSTED_ARGS kfuncs, and may have a non-zero offset.
  183. The definition of "valid" pointers is subject to change at any time, and has
  184. absolutely no ABI stability guarantees.
  185. As mentioned above, a nested pointer obtained from walking a trusted pointer is
  186. no longer trusted, with one exception. If a struct type has a field that is
  187. guaranteed to be valid (trusted or rcu, as in KF_RCU description below) as long
  188. as its parent pointer is valid, the following macros can be used to express
  189. that to the verifier:
  190. * ``BTF_TYPE_SAFE_TRUSTED``
  191. * ``BTF_TYPE_SAFE_RCU``
  192. * ``BTF_TYPE_SAFE_RCU_OR_NULL``
  193. For example,
  194. .. code-block:: c
  195. BTF_TYPE_SAFE_TRUSTED(struct socket) {
  196. struct sock *sk;
  197. };
  198. or
  199. .. code-block:: c
  200. BTF_TYPE_SAFE_RCU(struct task_struct) {
  201. const cpumask_t *cpus_ptr;
  202. struct css_set __rcu *cgroups;
  203. struct task_struct __rcu *real_parent;
  204. struct task_struct *group_leader;
  205. };
  206. In other words, you must:
  207. 1. Wrap the valid pointer type in a ``BTF_TYPE_SAFE_*`` macro.
  208. 2. Specify the type and name of the valid nested field. This field must match
  209. the field in the original type definition exactly.
  210. A new type declared by a ``BTF_TYPE_SAFE_*`` macro also needs to be emitted so
  211. that it appears in BTF. For example, ``BTF_TYPE_SAFE_TRUSTED(struct socket)``
  212. is emitted in the ``type_is_trusted()`` function as follows:
  213. .. code-block:: c
  214. BTF_TYPE_EMIT(BTF_TYPE_SAFE_TRUSTED(struct socket));
  215. 2.4.5 KF_SLEEPABLE flag
  216. -----------------------
  217. The KF_SLEEPABLE flag is used for kfuncs that may sleep. Such kfuncs can only
  218. be called by sleepable BPF programs (BPF_F_SLEEPABLE).
  219. 2.4.6 KF_DESTRUCTIVE flag
  220. --------------------------
  221. The KF_DESTRUCTIVE flag is used to indicate functions calling which is
  222. destructive to the system. For example such a call can result in system
  223. rebooting or panicking. Due to this additional restrictions apply to these
  224. calls. At the moment they only require CAP_SYS_BOOT capability, but more can be
  225. added later.
  226. 2.4.7 KF_RCU flag
  227. -----------------
  228. The KF_RCU flag is a weaker version of KF_TRUSTED_ARGS. The kfuncs marked with
  229. KF_RCU expect either PTR_TRUSTED or MEM_RCU arguments. The verifier guarantees
  230. that the objects are valid and there is no use-after-free. The pointers are not
  231. NULL, but the object's refcount could have reached zero. The kfuncs need to
  232. consider doing refcnt != 0 check, especially when returning a KF_ACQUIRE
  233. pointer. Note as well that a KF_ACQUIRE kfunc that is KF_RCU should very likely
  234. also be KF_RET_NULL.
  235. .. _KF_deprecated_flag:
  236. 2.4.8 KF_DEPRECATED flag
  237. ------------------------
  238. The KF_DEPRECATED flag is used for kfuncs which are scheduled to be
  239. changed or removed in a subsequent kernel release. A kfunc that is
  240. marked with KF_DEPRECATED should also have any relevant information
  241. captured in its kernel doc. Such information typically includes the
  242. kfunc's expected remaining lifespan, a recommendation for new
  243. functionality that can replace it if any is available, and possibly a
  244. rationale for why it is being removed.
  245. Note that while on some occasions, a KF_DEPRECATED kfunc may continue to be
  246. supported and have its KF_DEPRECATED flag removed, it is likely to be far more
  247. difficult to remove a KF_DEPRECATED flag after it's been added than it is to
  248. prevent it from being added in the first place. As described in
  249. :ref:`BPF_kfunc_lifecycle_expectations`, users that rely on specific kfuncs are
  250. encouraged to make their use-cases known as early as possible, and participate
  251. in upstream discussions regarding whether to keep, change, deprecate, or remove
  252. those kfuncs if and when such discussions occur.
  253. 2.5 Registering the kfuncs
  254. --------------------------
  255. Once the kfunc is prepared for use, the final step to making it visible is
  256. registering it with the BPF subsystem. Registration is done per BPF program
  257. type. An example is shown below::
  258. BTF_KFUNCS_START(bpf_task_set)
  259. BTF_ID_FLAGS(func, bpf_get_task_pid, KF_ACQUIRE | KF_RET_NULL)
  260. BTF_ID_FLAGS(func, bpf_put_pid, KF_RELEASE)
  261. BTF_KFUNCS_END(bpf_task_set)
  262. static const struct btf_kfunc_id_set bpf_task_kfunc_set = {
  263. .owner = THIS_MODULE,
  264. .set = &bpf_task_set,
  265. };
  266. static int init_subsystem(void)
  267. {
  268. return register_btf_kfunc_id_set(BPF_PROG_TYPE_TRACING, &bpf_task_kfunc_set);
  269. }
  270. late_initcall(init_subsystem);
  271. 2.6 Specifying no-cast aliases with ___init
  272. --------------------------------------------
  273. The verifier will always enforce that the BTF type of a pointer passed to a
  274. kfunc by a BPF program, matches the type of pointer specified in the kfunc
  275. definition. The verifier, does, however, allow types that are equivalent
  276. according to the C standard to be passed to the same kfunc arg, even if their
  277. BTF_IDs differ.
  278. For example, for the following type definition:
  279. .. code-block:: c
  280. struct bpf_cpumask {
  281. cpumask_t cpumask;
  282. refcount_t usage;
  283. };
  284. The verifier would allow a ``struct bpf_cpumask *`` to be passed to a kfunc
  285. taking a ``cpumask_t *`` (which is a typedef of ``struct cpumask *``). For
  286. instance, both ``struct cpumask *`` and ``struct bpf_cpmuask *`` can be passed
  287. to bpf_cpumask_test_cpu().
  288. In some cases, this type-aliasing behavior is not desired. ``struct
  289. nf_conn___init`` is one such example:
  290. .. code-block:: c
  291. struct nf_conn___init {
  292. struct nf_conn ct;
  293. };
  294. The C standard would consider these types to be equivalent, but it would not
  295. always be safe to pass either type to a trusted kfunc. ``struct
  296. nf_conn___init`` represents an allocated ``struct nf_conn`` object that has
  297. *not yet been initialized*, so it would therefore be unsafe to pass a ``struct
  298. nf_conn___init *`` to a kfunc that's expecting a fully initialized ``struct
  299. nf_conn *`` (e.g. ``bpf_ct_change_timeout()``).
  300. In order to accommodate such requirements, the verifier will enforce strict
  301. PTR_TO_BTF_ID type matching if two types have the exact same name, with one
  302. being suffixed with ``___init``.
  303. .. _BPF_kfunc_lifecycle_expectations:
  304. 3. kfunc lifecycle expectations
  305. ===============================
  306. kfuncs provide a kernel <-> kernel API, and thus are not bound by any of the
  307. strict stability restrictions associated with kernel <-> user UAPIs. This means
  308. they can be thought of as similar to EXPORT_SYMBOL_GPL, and can therefore be
  309. modified or removed by a maintainer of the subsystem they're defined in when
  310. it's deemed necessary.
  311. Like any other change to the kernel, maintainers will not change or remove a
  312. kfunc without having a reasonable justification. Whether or not they'll choose
  313. to change a kfunc will ultimately depend on a variety of factors, such as how
  314. widely used the kfunc is, how long the kfunc has been in the kernel, whether an
  315. alternative kfunc exists, what the norm is in terms of stability for the
  316. subsystem in question, and of course what the technical cost is of continuing
  317. to support the kfunc.
  318. There are several implications of this:
  319. a) kfuncs that are widely used or have been in the kernel for a long time will
  320. be more difficult to justify being changed or removed by a maintainer. In
  321. other words, kfuncs that are known to have a lot of users and provide
  322. significant value provide stronger incentives for maintainers to invest the
  323. time and complexity in supporting them. It is therefore important for
  324. developers that are using kfuncs in their BPF programs to communicate and
  325. explain how and why those kfuncs are being used, and to participate in
  326. discussions regarding those kfuncs when they occur upstream.
  327. b) Unlike regular kernel symbols marked with EXPORT_SYMBOL_GPL, BPF programs
  328. that call kfuncs are generally not part of the kernel tree. This means that
  329. refactoring cannot typically change callers in-place when a kfunc changes,
  330. as is done for e.g. an upstreamed driver being updated in place when a
  331. kernel symbol is changed.
  332. Unlike with regular kernel symbols, this is expected behavior for BPF
  333. symbols, and out-of-tree BPF programs that use kfuncs should be considered
  334. relevant to discussions and decisions around modifying and removing those
  335. kfuncs. The BPF community will take an active role in participating in
  336. upstream discussions when necessary to ensure that the perspectives of such
  337. users are taken into account.
  338. c) A kfunc will never have any hard stability guarantees. BPF APIs cannot and
  339. will not ever hard-block a change in the kernel purely for stability
  340. reasons. That being said, kfuncs are features that are meant to solve
  341. problems and provide value to users. The decision of whether to change or
  342. remove a kfunc is a multivariate technical decision that is made on a
  343. case-by-case basis, and which is informed by data points such as those
  344. mentioned above. It is expected that a kfunc being removed or changed with
  345. no warning will not be a common occurrence or take place without sound
  346. justification, but it is a possibility that must be accepted if one is to
  347. use kfuncs.
  348. 3.1 kfunc deprecation
  349. ---------------------
  350. As described above, while sometimes a maintainer may find that a kfunc must be
  351. changed or removed immediately to accommodate some changes in their subsystem,
  352. usually kfuncs will be able to accommodate a longer and more measured
  353. deprecation process. For example, if a new kfunc comes along which provides
  354. superior functionality to an existing kfunc, the existing kfunc may be
  355. deprecated for some period of time to allow users to migrate their BPF programs
  356. to use the new one. Or, if a kfunc has no known users, a decision may be made
  357. to remove the kfunc (without providing an alternative API) after some
  358. deprecation period so as to provide users with a window to notify the kfunc
  359. maintainer if it turns out that the kfunc is actually being used.
  360. It's expected that the common case will be that kfuncs will go through a
  361. deprecation period rather than being changed or removed without warning. As
  362. described in :ref:`KF_deprecated_flag`, the kfunc framework provides the
  363. KF_DEPRECATED flag to kfunc developers to signal to users that a kfunc has been
  364. deprecated. Once a kfunc has been marked with KF_DEPRECATED, the following
  365. procedure is followed for removal:
  366. 1. Any relevant information for deprecated kfuncs is documented in the kfunc's
  367. kernel docs. This documentation will typically include the kfunc's expected
  368. remaining lifespan, a recommendation for new functionality that can replace
  369. the usage of the deprecated function (or an explanation as to why no such
  370. replacement exists), etc.
  371. 2. The deprecated kfunc is kept in the kernel for some period of time after it
  372. was first marked as deprecated. This time period will be chosen on a
  373. case-by-case basis, and will typically depend on how widespread the use of
  374. the kfunc is, how long it has been in the kernel, and how hard it is to move
  375. to alternatives. This deprecation time period is "best effort", and as
  376. described :ref:`above<BPF_kfunc_lifecycle_expectations>`, circumstances may
  377. sometimes dictate that the kfunc be removed before the full intended
  378. deprecation period has elapsed.
  379. 3. After the deprecation period the kfunc will be removed. At this point, BPF
  380. programs calling the kfunc will be rejected by the verifier.
  381. 4. Core kfuncs
  382. ==============
  383. The BPF subsystem provides a number of "core" kfuncs that are potentially
  384. applicable to a wide variety of different possible use cases and programs.
  385. Those kfuncs are documented here.
  386. 4.1 struct task_struct * kfuncs
  387. -------------------------------
  388. There are a number of kfuncs that allow ``struct task_struct *`` objects to be
  389. used as kptrs:
  390. .. kernel-doc:: kernel/bpf/helpers.c
  391. :identifiers: bpf_task_acquire bpf_task_release
  392. These kfuncs are useful when you want to acquire or release a reference to a
  393. ``struct task_struct *`` that was passed as e.g. a tracepoint arg, or a
  394. struct_ops callback arg. For example:
  395. .. code-block:: c
  396. /**
  397. * A trivial example tracepoint program that shows how to
  398. * acquire and release a struct task_struct * pointer.
  399. */
  400. SEC("tp_btf/task_newtask")
  401. int BPF_PROG(task_acquire_release_example, struct task_struct *task, u64 clone_flags)
  402. {
  403. struct task_struct *acquired;
  404. acquired = bpf_task_acquire(task);
  405. if (acquired)
  406. /*
  407. * In a typical program you'd do something like store
  408. * the task in a map, and the map will automatically
  409. * release it later. Here, we release it manually.
  410. */
  411. bpf_task_release(acquired);
  412. return 0;
  413. }
  414. References acquired on ``struct task_struct *`` objects are RCU protected.
  415. Therefore, when in an RCU read region, you can obtain a pointer to a task
  416. embedded in a map value without having to acquire a reference:
  417. .. code-block:: c
  418. #define private(name) SEC(".data." #name) __hidden __attribute__((aligned(8)))
  419. private(TASK) static struct task_struct *global;
  420. /**
  421. * A trivial example showing how to access a task stored
  422. * in a map using RCU.
  423. */
  424. SEC("tp_btf/task_newtask")
  425. int BPF_PROG(task_rcu_read_example, struct task_struct *task, u64 clone_flags)
  426. {
  427. struct task_struct *local_copy;
  428. bpf_rcu_read_lock();
  429. local_copy = global;
  430. if (local_copy)
  431. /*
  432. * We could also pass local_copy to kfuncs or helper functions here,
  433. * as we're guaranteed that local_copy will be valid until we exit
  434. * the RCU read region below.
  435. */
  436. bpf_printk("Global task %s is valid", local_copy->comm);
  437. else
  438. bpf_printk("No global task found");
  439. bpf_rcu_read_unlock();
  440. /* At this point we can no longer reference local_copy. */
  441. return 0;
  442. }
  443. ----
  444. A BPF program can also look up a task from a pid. This can be useful if the
  445. caller doesn't have a trusted pointer to a ``struct task_struct *`` object that
  446. it can acquire a reference on with bpf_task_acquire().
  447. .. kernel-doc:: kernel/bpf/helpers.c
  448. :identifiers: bpf_task_from_pid
  449. Here is an example of it being used:
  450. .. code-block:: c
  451. SEC("tp_btf/task_newtask")
  452. int BPF_PROG(task_get_pid_example, struct task_struct *task, u64 clone_flags)
  453. {
  454. struct task_struct *lookup;
  455. lookup = bpf_task_from_pid(task->pid);
  456. if (!lookup)
  457. /* A task should always be found, as %task is a tracepoint arg. */
  458. return -ENOENT;
  459. if (lookup->pid != task->pid) {
  460. /* bpf_task_from_pid() looks up the task via its
  461. * globally-unique pid from the init_pid_ns. Thus,
  462. * the pid of the lookup task should always be the
  463. * same as the input task.
  464. */
  465. bpf_task_release(lookup);
  466. return -EINVAL;
  467. }
  468. /* bpf_task_from_pid() returns an acquired reference,
  469. * so it must be dropped before returning from the
  470. * tracepoint handler.
  471. */
  472. bpf_task_release(lookup);
  473. return 0;
  474. }
  475. 4.2 struct cgroup * kfuncs
  476. --------------------------
  477. ``struct cgroup *`` objects also have acquire and release functions:
  478. .. kernel-doc:: kernel/bpf/helpers.c
  479. :identifiers: bpf_cgroup_acquire bpf_cgroup_release
  480. These kfuncs are used in exactly the same manner as bpf_task_acquire() and
  481. bpf_task_release() respectively, so we won't provide examples for them.
  482. ----
  483. Other kfuncs available for interacting with ``struct cgroup *`` objects are
  484. bpf_cgroup_ancestor() and bpf_cgroup_from_id(), allowing callers to access
  485. the ancestor of a cgroup and find a cgroup by its ID, respectively. Both
  486. return a cgroup kptr.
  487. .. kernel-doc:: kernel/bpf/helpers.c
  488. :identifiers: bpf_cgroup_ancestor
  489. .. kernel-doc:: kernel/bpf/helpers.c
  490. :identifiers: bpf_cgroup_from_id
  491. Eventually, BPF should be updated to allow this to happen with a normal memory
  492. load in the program itself. This is currently not possible without more work in
  493. the verifier. bpf_cgroup_ancestor() can be used as follows:
  494. .. code-block:: c
  495. /**
  496. * Simple tracepoint example that illustrates how a cgroup's
  497. * ancestor can be accessed using bpf_cgroup_ancestor().
  498. */
  499. SEC("tp_btf/cgroup_mkdir")
  500. int BPF_PROG(cgrp_ancestor_example, struct cgroup *cgrp, const char *path)
  501. {
  502. struct cgroup *parent;
  503. /* The parent cgroup resides at the level before the current cgroup's level. */
  504. parent = bpf_cgroup_ancestor(cgrp, cgrp->level - 1);
  505. if (!parent)
  506. return -ENOENT;
  507. bpf_printk("Parent id is %d", parent->self.id);
  508. /* Return the parent cgroup that was acquired above. */
  509. bpf_cgroup_release(parent);
  510. return 0;
  511. }
  512. 4.3 struct cpumask * kfuncs
  513. ---------------------------
  514. BPF provides a set of kfuncs that can be used to query, allocate, mutate, and
  515. destroy struct cpumask * objects. Please refer to :ref:`cpumasks-header-label`
  516. for more details.