Bleh ... why? This needs to be calculated much more lazily than this,
or you're going to kick the hell out of any cache affinity. Can you
recalc this in the rebalance code or something instead?
> 2: I have no idea how tasks sharing the mm structure will behave. I'd
> like them to run on different nodes (that's why node_mem is not in mm),
> but they could (legally) free pages which they did not allocate and
> have wrong values in node_mem[].
Yes, that really ought to be per-process, not per task. Which means
locking or atomics ... and overhead. Ick.
For the first cut of the NUMA sched, maybe you could just leave page
allocation alone, and do that seperately? or is that what the second
patch was meant to be?
M.
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/