[dpdk-dev] [PATCH v17 00/11] Add PMD power management
Anatoly Burakov
anatoly.burakov at intel.com
Thu Jan 14 15:46:02 CET 2021
This patchset proposes a simple API for Ethernet drivers to cause the
CPU to enter a power-optimized state while waiting for packets to
arrive. There are multiple proposed mechanisms to achieve said power
savings: simple frequency scaling, idle loop, and monitoring the Rx
queue for incoming packages. The latter is achieved through cooperation
with the NIC driver that will allow us to know address of wake up event,
and wait for writes on that address.
On IA, this is achieved through using UMONITOR/UMWAIT instructions. They
are used in their raw opcode form because there is no widespread
compiler support for them yet. Still, the API is made generic enough to
hopefully support other architectures, if they happen to implement
similar instructions.
To achieve power savings, there is a very simple mechanism used: we're
counting empty polls, and if a certain threshold is reached, we employ
one of the suggested power management schemes automatically, from within
a Rx callback inside the PMD. Once there's traffic again, the empty poll
counter is reset.
This patchset also introduces a few changes into existing power
management-related intrinsics, namely to provide a native way of waking
up a sleeping core without application being responsible for it, as well
as general robustness improvements. There's quite a bit of locking going
on, but these locks are per-thread and very little (if any) contention
is expected, so the performance impact shouldn't be that bad (and in any
case the locking happens when we're about to sleep anyway).
Why are we putting it into ethdev as opposed to leaving this up to the
application? Our customers specifically requested a way to do it with
minimal changes to the application code. The current approach allows to
just flip a switch and automatically have power savings.
Things of note:
- Only 1:1 core to queue mapping is supported, meaning that each lcore
must at most handle RX on a single queue
- Support 3 type policies. Monitor/Pause/Frequency Scaling
- Power management is enabled per-queue
- The API doesn't extend to other device types
v17:
- Added exception for ethdev driver-only ABI
- Added memory barriers for monitor/wakeup (Konstantin)
- Fixed compiled issues on non-x86 platforms (hopefully!)
v16:
- Implemented Konstantin's suggestions and comments
- Added return values to the API
v15:
- Fixed incorrect check in UMWAIT callback
- Fixed accidental whitespace changes
v14:
- Fixed ARM/PPC builds
- Addressed various review comments
v13:
- Reworked the librte_power code to require less locking and handle invalid
parameters better
- Fix numerous rebase errors present in v12
v12:
- Rebase on top of 21.02
- Rework of power intrinsics code
Anatoly Burakov (5):
eal: uninline power intrinsics
eal: avoid invalid API usage in power intrinsics
eal: change API of power intrinsics
eal: remove sync version of power monitor
eal: add monitor wakeup function
Liang Ma (6):
ethdev: add simple power management API
power: add PMD power management API and callback
net/ixgbe: implement power management API
net/i40e: implement power management API
net/ice: implement power management API
examples/l3fwd-power: enable PMD power mgmt
devtools/libabigail.abignore | 3 +
doc/guides/prog_guide/power_man.rst | 44 +++
doc/guides/rel_notes/release_21_02.rst | 15 +
.../sample_app_ug/l3_forward_power_man.rst | 35 ++
drivers/event/dlb/dlb.c | 10 +-
drivers/event/dlb2/dlb2.c | 10 +-
drivers/net/i40e/i40e_ethdev.c | 1 +
drivers/net/i40e/i40e_rxtx.c | 25 ++
drivers/net/i40e/i40e_rxtx.h | 1 +
drivers/net/ice/ice_ethdev.c | 1 +
drivers/net/ice/ice_rxtx.c | 26 ++
drivers/net/ice/ice_rxtx.h | 1 +
drivers/net/ixgbe/ixgbe_ethdev.c | 1 +
drivers/net/ixgbe/ixgbe_rxtx.c | 25 ++
drivers/net/ixgbe/ixgbe_rxtx.h | 1 +
examples/l3fwd-power/main.c | 89 ++++-
.../arm/include/rte_power_intrinsics.h | 40 --
lib/librte_eal/arm/meson.build | 1 +
lib/librte_eal/arm/rte_power_intrinsics.c | 40 ++
.../include/generic/rte_power_intrinsics.h | 88 ++---
.../ppc/include/rte_power_intrinsics.h | 40 --
lib/librte_eal/ppc/meson.build | 1 +
lib/librte_eal/ppc/rte_power_intrinsics.c | 40 ++
lib/librte_eal/version.map | 3 +
.../x86/include/rte_power_intrinsics.h | 115 ------
lib/librte_eal/x86/meson.build | 1 +
lib/librte_eal/x86/rte_power_intrinsics.c | 215 +++++++++++
lib/librte_ethdev/rte_ethdev.c | 28 ++
lib/librte_ethdev/rte_ethdev.h | 25 ++
lib/librte_ethdev/rte_ethdev_driver.h | 22 ++
lib/librte_ethdev/version.map | 3 +
lib/librte_power/meson.build | 5 +-
lib/librte_power/rte_power_pmd_mgmt.c | 364 ++++++++++++++++++
lib/librte_power/rte_power_pmd_mgmt.h | 90 +++++
lib/librte_power/version.map | 5 +
35 files changed, 1155 insertions(+), 259 deletions(-)
create mode 100644 lib/librte_eal/arm/rte_power_intrinsics.c
create mode 100644 lib/librte_eal/ppc/rte_power_intrinsics.c
create mode 100644 lib/librte_eal/x86/rte_power_intrinsics.c
create mode 100644 lib/librte_power/rte_power_pmd_mgmt.c
create mode 100644 lib/librte_power/rte_power_pmd_mgmt.h
--
2.25.1
More information about the dev
mailing list