[PATCH] net/bonding: fix bond startup failure when NUMA is -1
humin (Q)
humin29 at huawei.com
Fri Jun 16 13:57:32 CEST 2023
On 2023/6/16 14:08, Chaoyong He wrote:
>> On 2023/6/16 11:20, Chaoyong He wrote:
>>> From: Zerun Fu <zerun.fu at corigine.com>
>>>
>>> After the mainline Linux kernel commit
>>> fe205d984e7730f4d21f6f8ebc60f0698404ac31 ("ACPI: Remove side effect
>>> of partly creating a node in acpi_map_pxm_to_online_node") by Jonathan
>>> Cameron, the "socket_id" is expected to be -1 when the system does not
>>> support NUMA. The BOND PMD only accepts a "socket_id" greater than or
>>> equal to zero, so starting the bond fails when DPDK checks the
>>> validity of the "socket_id". This commit fixes the bug.
>>>
>>> Fixes: f294e04851fd ("net/bonding: fix socket ID check")
>>> Cc: stable at dpdk.org
>>>
>>> Signed-off-by: Zerun Fu <zerun.fu at corigine.com>
>>> Reviewed-by: Peng Zhang <peng.zhang at corigine.com>
>>> Reviewed-by: Chaoyong He <chaoyong.he at corigine.com>
>>> Reviewed-by: Long Wu <long.wu at corigine.com>
>>> ---
>>> drivers/net/bonding/rte_eth_bond_args.c | 6 ++++++
>>> drivers/net/bonding/rte_eth_bond_pmd.c | 2 +-
>>> 2 files changed, 7 insertions(+), 1 deletion(-)
>>>
>>> diff --git a/drivers/net/bonding/rte_eth_bond_args.c
>>> b/drivers/net/bonding/rte_eth_bond_args.c
>>> index 6553166f5c..c137efd55f 100644
>>> --- a/drivers/net/bonding/rte_eth_bond_args.c
>>> +++ b/drivers/net/bonding/rte_eth_bond_args.c
>>> @@ -212,6 +212,12 @@ bond_ethdev_parse_socket_id_kvarg(const char *key __rte_unused,
>>> if (*endptr != 0 || errno != 0)
>>> return -1;
>>>
>>> + /* SOCKET_ID_ANY also consider a valid socket id */
>>> + if ((int8_t)socket_id == SOCKET_ID_ANY) {
>>> + *(int *)extra_args = SOCKET_ID_ANY;
>>> + return 0;
>>> + }
>>> +
>>> /* validate socket id value */
>>> if (socket_id >= 0 && socket_id < RTE_MAX_NUMA_NODES) {
>>> *(int *)extra_args = (int)socket_id;
>>> diff --git a/drivers/net/bonding/rte_eth_bond_pmd.c
>>> b/drivers/net/bonding/rte_eth_bond_pmd.c
>>> index f0c4f7d26b..390a5b4271 100644
>>> --- a/drivers/net/bonding/rte_eth_bond_pmd.c
>>> +++ b/drivers/net/bonding/rte_eth_bond_pmd.c
>>> @@ -3604,7 +3604,7 @@ static int
>>> bond_alloc(struct rte_vdev_device *dev, uint8_t mode)
>>> {
>>> const char *name = rte_vdev_device_name(dev);
>>> - uint8_t socket_id = dev->device.numa_node;
>>> + int socket_id = dev->device.numa_node;
>> Well, other places should also be modified, for example:
>>
>> ***
>>
>> "socket %u.", name, bonding_mode, socket_id);
>>
>> ***
>>
>> %u --> %d.
> Okay, I will send a v2 patch to fix this.
>
>> BTW, I think there is no need to add args like "socket_id=-1..." if we know
>> this server does not support NUMA.
>>
>> The default socket id is already -1, so passing it explicitly is redundant.
>>
> We found this bug when running the 'dperf' app, and it is 'dperf' that adds the 'socket_id=-1' argument.
> Maybe the 'dperf' app should change its logic?
> Please correct me if I misunderstood, thanks.
I agree with your patch.
What I mentioned is just for further discussion.
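
For the 'socket_id=-1' argument discussed above, a minimal standalone sketch of the parsing logic may help. It is not the DPDK code: sketch_parse_socket_id, SKETCH_SOCKET_ID_ANY and SKETCH_MAX_NUMA_NODES are hypothetical stand-ins for the kvarg handler, SOCKET_ID_ANY and RTE_MAX_NUMA_NODES, and the limit of 32 is only an assumed value. Unlike the quoted patch, it compares against -1 at full width instead of through an (int8_t) cast, which on typical platforms would also map an input such as 255 to -1.

```c
#include <errno.h>
#include <stdlib.h>

/* Stand-ins for the DPDK definitions; the values are assumptions for this sketch. */
#define SKETCH_SOCKET_ID_ANY (-1)
#define SKETCH_MAX_NUMA_NODES 32

/* Hypothetical helper: returns 0 and stores the id on success, -1 on error. */
static int
sketch_parse_socket_id(const char *value, int *out)
{
	char *endptr;
	long socket_id;

	if (value == NULL || out == NULL)
		return -1;

	errno = 0;
	socket_id = strtol(value, &endptr, 10);
	/* Reject empty input, trailing garbage and out-of-range numbers. */
	if (endptr == value || *endptr != '\0' || errno != 0)
		return -1;

	/* Accept "any socket" (-1) explicitly, comparing at full width. */
	if (socket_id == SKETCH_SOCKET_ID_ANY) {
		*out = SKETCH_SOCKET_ID_ANY;
		return 0;
	}

	/* Otherwise the id must name a valid NUMA node. */
	if (socket_id >= 0 && socket_id < SKETCH_MAX_NUMA_NODES) {
		*out = (int)socket_id;
		return 0;
	}

	return -1;
}
```

With this shape, "-1" and "0" are accepted, while "255", "-2" and "abc" are rejected.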
>
>>> struct bond_dev_private *internals = NULL;
>>> struct rte_eth_dev *eth_dev = NULL;
>>> uint32_t vlan_filter_bmp_size;
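
To illustrate why the declaration change and the %u --> %d change belong together, here is a small standalone sketch. It is not taken from the driver: sketch_format_socket is a hypothetical helper that only mimics the log formatting, and it assumes the NUMA node is reported as -1 on a non-NUMA system.

```c
#include <stdint.h>
#include <stdio.h>

/*
 * Hypothetical helper: formats the socket id the way a log line might,
 * once through a uint8_t (old declaration) and once through an int (patched).
 */
static void
sketch_format_socket(int numa_node, char *narrow, char *wide, size_t len)
{
	uint8_t socket_u8 = (uint8_t)numa_node; /* -1 wraps to 255 */
	int socket_int = numa_node;             /* -1 is preserved */

	snprintf(narrow, len, "socket %u", (unsigned int)socket_u8);
	snprintf(wide, len, "socket %d", socket_int);
}
```

Called with numa_node = -1, the uint8_t path yields "socket 255" and the int path yields "socket -1", which is why a signed variable needs a signed format specifier in the log message.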