I've seen this at least 5 times now in one month.. One of our boxes die
when postgresql maintenance script AND backup cron jobs are ran at the same
time (by mistake - normally they are not ran at the same time)..
So it seems to be related to high disk i/o. The adapter is
HP NetRAID 1M with latest firmware. There is one RAID5 array with 3 disks
configured to it.
Usually this crash happens 1 to 2 times a week.. always when cron starts to
run the stuff at the night. The console will be flooded with "Blocked
mailbox......!!" text (which surprisingly means that the megaraid firmware
has stopped responding.. according to google)
This box doesn't have high disk i/o at the daytime, only at the night when
cron starts to do things..
When the cron jobs are not ran at the same time, the box is stable.
Other box using the same kind of adapter, but RAID1 array instead, having
high disk i/o all the time doesn't have any problems.. (with the same kernels).
Kernels are compiled with gcc 2.95.4 (Debian 3.0).
Any thoughts?
-- Pasi Kärkkäinen
^
. .
Linux
/ - \
Choice.of.the
.Next.Generation.
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/