>
> On Mon, 23 Sep 2002, Con Kolivas wrote:
>
> > Agreed. There probably is no statistically significant difference in the
> > different gcc versions.
> >
> > Contest is very new and I appreciate any feedback I can get to make it
> > as worthwhile a benchmark as possible to those who know.
>
> your measurements are really useful i think, and people like Andrew
Thank you. I was beginning to wonder on this.
> started to watch those numbers - this is why at this point a bit more
> effort can/should be taken to filter out fluctuations better. Ie. a single
> fluctuation could send Andrew out on a wild goose chase while perhaps in
> reality his kernel was the fastest. Running every test twice should at
> least give a ballpart figure wrt. fluctuations, without increasing the
> runtime unrealistically.
Absolutely. In my real profession I deal with statistics all the time so I'm
acutely aware of the problem.
> i agree that only the IO benchmarks are problematic from this POV - things
> like the process load and your other CPU-saturating numbers look perfectly
> valid.
Yes, the IO load is proving to be a pain and I'm afraid it will take numerous
measurements to get some idea of the real average. So far the trends in the
results I've reported I think are still correct. The variability, though, that's
another matter.
> obviously another concern to to make testing not take days to accomplish.
> This i think is one of the hardest things - making timely measurements
> which are still meaningful and provide stable results.
I know. Some of the changes I've made to make results reproducible I have
already had complaints about; the situation will only get worse :(
Con
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/