eal: fix out of bound access when no cpu is available

Message ID 1547730777-20405-1-git-send-email-david.marchand@redhat.com (mailing list archive)
State Accepted, archived
Delegated to: Thomas Monjalon
Headers
Series eal: fix out of bound access when no cpu is available |

Checks

Context Check Description
ci/intel-Performance-Testing success Performance Testing PASS
ci/mellanox-Performance-Testing success Performance Testing PASS
ci/Intel-compilation success Compilation OK

Commit Message

David Marchand Jan. 17, 2019, 1:12 p.m. UTC
  In the unlikely case when the dpdk application is started with no cpu
available in the [0, RTE_MAX_LCORE - 1] range, the master_lcore is
automatically chosen as RTE_MAX_LCORE which triggers an out of bound
access.

Either you have a crash then, or the initialisation fails later when
trying to pin the master thread on it.
In my test, with RTE_MAX_LCORE == 2:

$ taskset -c 2 ./master/app/testpmd --no-huge -m 512 --log-level *:debug
[...]
EAL: pthread_setaffinity_np failed
PANIC in eal_thread_init_master():
cannot set affinity
7: [./master/app/testpmd() [0x47f629]]

Bugzilla ID: 19
Signed-off-by: David Marchand <david.marchand@redhat.com>
---
 lib/librte_eal/common/eal_common_options.c | 2 ++
 1 file changed, 2 insertions(+)
  

Comments

Thomas Monjalon Jan. 17, 2019, 4:34 p.m. UTC | #1
17/01/2019 14:12, David Marchand:
> In the unlikely case when the dpdk application is started with no cpu
> available in the [0, RTE_MAX_LCORE - 1] range, the master_lcore is
> automatically chosen as RTE_MAX_LCORE which triggers an out of bound
> access.
> 
> Either you have a crash then, or the initialisation fails later when
> trying to pin the master thread on it.
> In my test, with RTE_MAX_LCORE == 2:
> 
> $ taskset -c 2 ./master/app/testpmd --no-huge -m 512 --log-level *:debug
> [...]
> EAL: pthread_setaffinity_np failed
> PANIC in eal_thread_init_master():
> cannot set affinity
> 7: [./master/app/testpmd() [0x47f629]]
> 
> Bugzilla ID: 19
> Signed-off-by: David Marchand <david.marchand@redhat.com>

We should backport this fix.
When this bug has been introduced?
  
David Marchand Jan. 17, 2019, 5:17 p.m. UTC | #2
On Thu, Jan 17, 2019 at 5:34 PM Thomas Monjalon <thomas@monjalon.net> wrote:

> 17/01/2019 14:12, David Marchand:
> > In the unlikely case when the dpdk application is started with no cpu
> > available in the [0, RTE_MAX_LCORE - 1] range, the master_lcore is
> > automatically chosen as RTE_MAX_LCORE which triggers an out of bound
> > access.
> >
> > Either you have a crash then, or the initialisation fails later when
> > trying to pin the master thread on it.
> > In my test, with RTE_MAX_LCORE == 2:
> >
> > $ taskset -c 2 ./master/app/testpmd --no-huge -m 512 --log-level *:debug
> > [...]
> > EAL: pthread_setaffinity_np failed
> > PANIC in eal_thread_init_master():
> > cannot set affinity
> > 7: [./master/app/testpmd() [0x47f629]]
> >
> > Bugzilla ID: 19
> > Signed-off-by: David Marchand <david.marchand@redhat.com>
>
> We should backport this fix.
> When this bug has been introduced?
>

Indeed, at first, I thought the problem had always been there, but it
should be starting 17.02:
Fixes: 2eba8d21f3c9 ("eal: restrict cores auto detection")

+ CC stable

Do you want a v2 ?
  
Thomas Monjalon Jan. 17, 2019, 5:38 p.m. UTC | #3
17/01/2019 18:17, David Marchand:
> On Thu, Jan 17, 2019 at 5:34 PM Thomas Monjalon <thomas@monjalon.net> wrote:
> 
> > 17/01/2019 14:12, David Marchand:
> > > In the unlikely case when the dpdk application is started with no cpu
> > > available in the [0, RTE_MAX_LCORE - 1] range, the master_lcore is
> > > automatically chosen as RTE_MAX_LCORE which triggers an out of bound
> > > access.
> > >
> > > Either you have a crash then, or the initialisation fails later when
> > > trying to pin the master thread on it.
> > > In my test, with RTE_MAX_LCORE == 2:
> > >
> > > $ taskset -c 2 ./master/app/testpmd --no-huge -m 512 --log-level *:debug
> > > [...]
> > > EAL: pthread_setaffinity_np failed
> > > PANIC in eal_thread_init_master():
> > > cannot set affinity
> > > 7: [./master/app/testpmd() [0x47f629]]
> > >
> > > Bugzilla ID: 19
> > > Signed-off-by: David Marchand <david.marchand@redhat.com>
> >
> > We should backport this fix.
> > When this bug has been introduced?
> >
> 
> Indeed, at first, I thought the problem had always been there, but it
> should be starting 17.02:
> Fixes: 2eba8d21f3c9 ("eal: restrict cores auto detection")
> 
> + CC stable
> 
> Do you want a v2 ?

Applied, thanks
  

Patch

diff --git a/lib/librte_eal/common/eal_common_options.c b/lib/librte_eal/common/eal_common_options.c
index 3796dbf..7aad303 100644
--- a/lib/librte_eal/common/eal_common_options.c
+++ b/lib/librte_eal/common/eal_common_options.c
@@ -1383,6 +1383,8 @@  static int xdigit2val(unsigned char c)
 	/* default master lcore is the first one */
 	if (!master_lcore_parsed) {
 		cfg->master_lcore = rte_get_next_lcore(-1, 0, 0);
+		if (cfg->master_lcore >= RTE_MAX_LCORE)
+			return -1;
 		lcore_config[cfg->master_lcore].core_role = ROLE_RTE;
 	}