[dpdk-stable] [dpdk-dev] [PATCH v4] lib/table: fix cache alignment issue

Dumitrescu, Cristian cristian.dumitrescu at intel.com
Wed Jul 29 15:54:03 CEST 2020



> -----Original Message-----
> From: David Marchand <david.marchand at redhat.com>
> Sent: Wednesday, July 29, 2020 2:28 PM
> To: Dumitrescu, Cristian <cristian.dumitrescu at intel.com>
> Cc: Xu, Ting <ting.xu at intel.com>; dev <dev at dpdk.org>; dpdk stable
> <stable at dpdk.org>; Kevin Traynor <ktraynor at redhat.com>; Luca Boccassi
> <bluca at debian.org>
> Subject: Re: [dpdk-stable] [dpdk-dev] [PATCH v4] lib/table: fix cache
> alignment issue
> 
> On Wed, Jul 29, 2020 at 3:14 PM Dumitrescu, Cristian
> <cristian.dumitrescu at intel.com> wrote:
> > > Please correct me if I am wrong, but it simply means this part of the
> > > table library never worked for 32-bit.
> > > It seems more adding 32-bit support rather than a fix and then I
> > > wonder if it has its place in rc3.
> > >
> >
> > Functionally. the code works, but performance is affected.
> >
> > The only thing that prevents the code from working is the check in the
> table create function that checks the size of the above structure is 64 bytes,
> which caught this issue.
> 
> Yes, and that's my point.
> It was not working.
> It was not tested.
> 
> 

Not sure when this code was last tested on 32-bit systems, I'll let the validation folks comment on this, but I cannot rule out a change in compiler behavior either.

This is a low complexity and low impact change, hence low risk IMO.

> This patch asks for backport in stable branches, I will let Kevin and
> Luca comment.
> 
> 
> >
> > >
> > >
> > > Now, looking at the details:
> > >
> > > For 64-bit on my x86, we have:
> > >
> > > struct rte_bucket_4_8 {
> > >     uint64_t                   signature;            /*     0     8 */
> > >     uint64_t                   lru_list;             /*     8     8 */
> > >     struct rte_bucket_4_8 *    next;                 /*    16     8 */
> > >     uint64_t                   next_valid;           /*    24     8 */
> > >     uint64_t                   key[4];               /*    32    32 */
> > >     /* --- cacheline 1 boundary (64 bytes) --- */
> > >     uint8_t                    data[];               /*    64     0 */
> > >
> > >     /* size: 64, cachelines: 1, members: 6 */
> > > };
> > >
> > >
> > > For 32-bit, we have:
> > >
> > > struct rte_bucket_4_8 {
> > >     uint64_t                   signature;            /*     0     8 */
> > >     uint64_t                   lru_list;             /*     8     8 */
> > >     struct rte_bucket_4_8 *    next;                 /*    16     4 */
> > >     uint64_t                   next_valid;           /*    20     8 */
> > >     uint64_t                   key[4];               /*    28    32 */
> > >     uint8_t                    data[];               /*    60     0 */
> > >
> > >     /* size: 60, cachelines: 1, members: 6 */
> > >     /* last cacheline: 60 bytes */
> > > } __attribute__((__packed__));
> > >
> > > ^^ it is interesting that a packed attribute ends up here.
> > > I saw no such attribute in the library code.
> > > Compiler black magic at work I guess...
> > >
> >
> > Where do you see the packet attribute? I don't see it in the code.
> 
> That's pahole reporting this.
> Maybe the tool extrapolates this attribute based on the next_valid
> field placement... I don't know.
> 
> > A packet attribute would explain this issue, i.e. why did the compiler decide
> not to insert an expected padfing of 4 bytes right after the "next" field, that
> would allow the field "next_valid" to be aligned to its natural boundary of 8
> bytes.
> 
> Or a 64-bit field on 32-bit has a special alignment that I am not aware of.
> 
> 
> >
> > >
> > > >
> > > > Fixes: 8aa327214c ("table: hash")
> > > > Cc: stable at dpdk.org
> > > >
> > > > Signed-off-by: Ting Xu <ting.xu at intel.com>
> > > >
> > > > ---
> > > > v3->v4: Change design based on comment
> > > > v2->v3: Rebase
> > > > v1->v2: Correct patch time
> > > > ---
> > > >  lib/librte_table/rte_table_hash_key16.c | 17 +++++++++++++++++
> > > >  lib/librte_table/rte_table_hash_key32.c | 17 +++++++++++++++++
> > > >  lib/librte_table/rte_table_hash_key8.c  | 16 ++++++++++++++++
> > > >  3 files changed, 50 insertions(+)
> > > >
> > > > diff --git a/lib/librte_table/rte_table_hash_key16.c
> > > b/lib/librte_table/rte_table_hash_key16.c
> > > > index 2cca1c924..c4384b114 100644
> > > > --- a/lib/librte_table/rte_table_hash_key16.c
> > > > +++ b/lib/librte_table/rte_table_hash_key16.c
> > > > @@ -33,6 +33,7 @@
> > > >
> > > >  #endif
> > > >
> > > > +#ifdef RTE_ARCH_64
> > > >  struct rte_bucket_4_16 {
> > > >         /* Cache line 0 */
> > > >         uint64_t signature[4 + 1];
> > > > @@ -46,6 +47,22 @@ struct rte_bucket_4_16 {
> > > >         /* Cache line 2 */
> > > >         uint8_t data[0];
> > > >  };
> > > > +#else
> > > > +struct rte_bucket_4_16 {
> > > > +       /* Cache line 0 */
> > > > +       uint64_t signature[4 + 1];
> > > > +       uint64_t lru_list;
> > > > +       struct rte_bucket_4_16 *next;
> > > > +       uint32_t pad;
> > > > +       uint64_t next_valid;
> > > > +
> > > > +       /* Cache line 1 */
> > > > +       uint64_t key[4][2];
> > > > +
> > > > +       /* Cache line 2 */
> > > > +       uint8_t data[0];
> > > > +};
> > > > +#endif
> > >
> > > The change could simply be:
> > >
> > > @@ -38,6 +38,9 @@ struct rte_bucket_4_16 {
> > >         uint64_t signature[4 + 1];
> > >         uint64_t lru_list;
> > >         struct rte_bucket_4_16 *next;
> > > +#ifndef RTE_ARCH_64
> > > +       uint32_t pad;
> > > +#endif
> > >         uint64_t next_valid;
> > >
> > >         /* Cache line 1 */
> > >
> > > It avoids duplicating the whole structure definition (we could miss
> > > updating one side of the #ifdef later).
> > > Idem for the other "8" and "32" structures.
> 
> 
> What about this comment?
> 
> 
> --
> David Marchand



More information about the stable mailing list