> When this happens, there is always a file left in
> /var/lib/nfs/sm (normally there are no files in there for any
> of the clients, even when they are on) for the hanging box. Is
> this normal?
It means that rpc.statd did not manage to unmonitor the NFS locks
before it shut down. There are several possible reasons for this, for
instance the client crashing and/or rebooting. It might also
indicate that the server could not be contacted in order to unmonitor
the lock.
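
To see which hosts rpc.statd still considers monitored, you can simply
list that directory. A minimal sketch in Python, assuming the state
directory is /var/lib/nfs/sm (as in your mail) and that each entry is
named after a monitored host. The function name is made up for
illustration, and reading the directory may require root.

```python
import os


def leftover_monitored_hosts(sm_dir="/var/lib/nfs/sm"):
    """Return the sorted entry names found in statd's monitor directory.

    Assumption: each entry is named after a host that rpc.statd is
    still monitoring. A missing directory is reported as "no entries".
    """
    try:
        return sorted(os.listdir(sm_dir))
    except FileNotFoundError:
        return []


if __name__ == "__main__":
    for host in leftover_monitored_hosts():
        print("still monitored:", host)
```

An entry that stays there after the client has released its locks
points at a failed unmonitor for that host.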
> We also occasionally get a log message in the server for this
> box like:
> kernel:Aug 10 17:39:22 anthem kernel: lockd: cannot monitor
> 192.168.99.7
This means that the kernel was unable to contact rpc.statd, or that
rpc.statd was unable to contact the server's rpc.statd for some reason.
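
A first check is whether rpc.statd is registered with the portmapper on
both ends. A rough sketch in Python that parses the output of
"rpcinfo -p" and looks for RPC program 100024 (the "status" service).
The helper name and the default host are illustrative only, and the
sketch assumes the rpcinfo binary is installed.

```python
import subprocess
import sys

STATD_PROGRAM = 100024  # RPC program number of the NSM "status" service


def statd_registrations(rpcinfo_output):
    """Return (version, proto, port) tuples for statd in `rpcinfo -p` text."""
    found = []
    for line in rpcinfo_output.splitlines():
        fields = line.split()
        # Data lines look like: program vers proto port [service]
        if len(fields) >= 4 and fields[0] == str(STATD_PROGRAM):
            found.append((int(fields[1]), fields[2], int(fields[3])))
    return found


if __name__ == "__main__":
    host = sys.argv[1] if len(sys.argv) > 1 else "localhost"
    result = subprocess.run(
        ["rpcinfo", "-p", host], capture_output=True, text=True
    )
    regs = statd_registrations(result.stdout)
    if regs:
        print(host, "statd registrations:", regs)
    else:
        print(host, "has no rpc.statd registration")
```

Running it against both the server and the hanging client shows whether
either side is missing the status service.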
> Trond, can I get you more troubleshooting information, or
> should I try 2.4.20-pre on server *and* clients? This is a bit
> weird, but since I don't know a lot of what went on in the last
> changes, I'm not sure where to start looking.
There is nothing in the above to suggest that this must be a kernel
problem. The fact that you are seeing files in /var/lib/nfs/sm
in these cases rather suggests that the problem lies with rpc.statd.
Can you see any reason in your setup why it should be failing?
Cheers,
Trond