We installed a 2.4.7 kernel on a Compaq Proliant server and it makes the
machine freeze totally at various moments. We've been running 2.4 on several
other non-server machines for a while without any problem at all, so we
tried to figure out what may be causing the problem. Here is a list of what
we know so far:
INITIAL SYMPTOMS
At totally unpredictable moments the server hangs. Nothing on the display, no
keyboard LED response, no disk activity, no ping response on the net.
2.2.19 KERNEL
The machine ran a 2.2.19 kernel prior to the 2.4.7 kernel without any
problems at all. Only kernel upgrades forced us to do reboots.
CPQHEALTH
There were Compaq cpqhealth modules in the kernel. Those are binary modules
from Compaq that help to gather health information about the machine. The
modules didn't load in a 2.4.7 kernel (cpqhealth 2.1 even created kernel
stack dumps when loaded in a 2.4.7 kernel on another machine) so we removed
the cpqhealth software. It didn't help.
CPQARRAY
This is the first machine with a Compaq SMART 3200 RAID controller we tried
with 2.4.7. I don't believe this is causing the problem, but I think it's
worth mentioning.
MONITOR/MOUSE/KEYBOARD SWITCH
This is also the first machine with an electronic MONITOR/MOUSE/KEYBOARD
SWITCH. Last weekend the machine ran w/o any problems at all, but today
(Monday) it had a hangup. We can be sure that nobody touched the switch
during the weekend, and today it was most certainly used, so there may be a
relation between the switch and the hanging.
Of course we'll go on investigating this, but because the machine has a
semi-production status we have to do this with care. However, suggestions are
welcome.
Rolf