JDK-8175813 : PPC64: "mbind: Invalid argument" when -XX:+UseNUMA is used
  • Type: Bug
  • Component: hotspot
  • Sub-Component: gc
  • Affected Version: 9
  • Priority: P4
  • Status: Resolved
  • Resolution: Fixed
  • OS: linux
  • CPU: ppc
  • Submitted: 2017-02-24
  • Updated: 2018-02-27
  • Resolved: 2017-05-07
The Version table provides details related to the release that this issue/RFE will be addressed.

Unresolved : Release in which this issue/RFE will be addressed.
Resolved: Release in which this issue/RFE has been resolved.
Fixed : Release in which this issue/RFE has been fixed. The release containing this fix may be available for download as an Early Access Release or a General Availability Release.

To download the current JDK release, click here.
JDK 10 JDK 8
10 b21Fixed 8u152Fixed
Related Reports
Relates :  
Relates :  
Relates :  
Relates :  
Description
The JVM code used to discover which numa nodes it can bind assumes that nodes are consecutive and tries to bind from 0 to numa_max_node(), i.e. 0 to the highest node number available on the system. However, at least on PPC64 that assumption is not always true. For instance, the following numa
topology:

available: 4 nodes (0-1,16-17)
node 0 cpus: 0 8 16 24 32
node 0 size: 130706 MB
node 0 free: 145 MB
node 1 cpus: 40 48 56 64 72
node 1 size: 0 MB
node 1 free: 0 MB
node 16 cpus: 80 88 96 104 112
node 16 size: 130630 MB
node 16 free: 529 MB
node 17 cpus: 120 128 136 144 152
node 17 size: 0 MB
node 17 free: 0 MB
node distances:
node 0 1 16 17
0: 10 20 40 40
1: 20 10 40 40
16: 40 40 10 20
17: 40 40 20 10

Also, some nodes can also be present in the topology but contain no memory (a memory-less node in libnuma terms).
Comments
Review thread: http://mail.openjdk.java.net/pipermail/hotspot-dev/2017-March/026156.html
21-03-2017

You are right. There can be many reasons for EINVAL from mbind and the two underlying causes seem likely to be different.
01-03-2017

Hi @dholmes. The issue reported by Sangheon on x64 seems different in its root cause from the one I reported on PPC64 since, AFAICS, numa nodes on x64 are always consecutive. If so, is it really a dup?
01-03-2017

This is a duplicate of JDK-8163796. One of them should be closed.
27-02-2017

Initial discussion: http://mail.openjdk.java.net/pipermail/ppc-aix-port-dev/2017-February/002904.html
24-02-2017