This can run into problems with missing task isolation when thread-local variables are used while iterating over nodes. It is also likely to just add overhead: the number of nodes should already provide enough parallelism.