Yes, the remote server was shut down and caused this problem.
> > I also have this in my kernel log:
> > May 26 06:33:16 fileserver kernel: Uhhuh. NMI received. Dazed and confused, but trying to continue
> > May 26 06:33:16 fileserver kernel: You probably have a hardware problem with your RAM chips
>
> However, this error could (but I don't really know what the effects are of
> this) potentially stop a process at some random point. If a process
> crashes, for example an oops, while holding the semaphore that semaphore
> will still be held and everyone trying to get in will stop in D state.
>
I will resove this issue soon, but don't forget that the processes stuck in
D state has been happening for a while on another machine also.
>
> There are some patches here:
> http://www.hojdpunkten.ac.se/054/samba/index.html
>
> But that server appears to be down right now.
>
> There is one patch that uses poll to help with the problem of a server
> that is gone, and another that changes a lot of how smbfs sends requests
> and additionaly makes the user processes always(?) be interruptible.
>
Do these require any changes to the samba userspace?
> But if the NMIs are killing things at random points then none of those
> patches will help.
AFAICT, no processes have been killed. I'm going to try to reproduce this
on another machine and I'll post the sysrq+t ksymoops output from that.
I'll probably have to do it next week though.
Mike
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/