I think I found a solution. At the very least, I've found something
that drastically affects reliability of this hardware combo.
The combo in question is a KT133 chipset (Phoenix BIOS), Athlon 1.3GHz,
2 Promise Ultra100 IDE controllers with an IBM 75gb drive on each channel
(4 drives). Doing anything that beat on all 4 drives sufficiently
(such as software RAID5) would hang the system hard.
The magic settings that had a drastic impact on reliability were the PCI
device latency timers. The early settings I tried just changed how long
the system would run before it crashed (in some cases making things *much*
worse). Then after more of something one could loosely term "research", I
hit on some settings that seem to have resulted in a fully stable system!
Forthwith and to wit:
setpci -v -d *:* latency_timer=b0
setpci -v -d 105a:* latency_timer=ff
Yes, that's a baseline setting of 176 for everything, then max settings
for the two Promise cards. Rather drastic? Perhaps, but it works.
More research and tweaking is probably in order.
I wanted to get the news out -- even if a bit premature -- in hopes that
it might relieve someone else's grief. It's really sucked having all
this hardware for three months and not being able to put it to good use
(unless crash testing counts...)
-cw-
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/