I suspect the reason for the big difference is runqueue lock contention
(as in the original chat benchmark).
It might be interesting to keep the benchmark load the same and
rerun with a varying number of CPUs. My 'guess' is that with the
default scheduler you will see better numbers with less CPUs. When
we were working with chat on an 8-way Intel, we saw benchmark
numbers peak at around 4 CPUs. As you added more than 4 CPUs,
your benchmark numbers got worse. You can see this graphicly at:
http://lse.sourceforge.net/scheduling/ols2001/img54.htm
-- Mike - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/