Re: [PATCH] BUG(): sched.c: Line 944

Steven Cole (elenstev@mesatop.com)
17 Sep 2002 14:32:28 -0600


On Tue, 2002-09-17 at 14:06, Robert Love wrote:
> On Tue, 2002-09-17 at 15:54, Steven Cole wrote:
>
> Thank you for the testing, Steven.
>
> > Running dbench 3 resulted in the dbench clients hanging and being
> > unkillable with kill -9 in the D state.
>
> Hrm, I cannot reproduce this. I just successfully completed a `dbench
> 16'. Can you find where they are hanging? You can get a trace via
> sysrq. You can also see where they are in the kernel via the wchan
> field of ps: "ps -ewo user,pid,priority,%cpu,stat,command,wchan" is a
> favorite of mine.

I rebooted the box, but had to SYSRQ-S, SYSRQ-B, due to shutdown -r now
hanging up. On reboot, the init scripts froze after "starting atd", but
fortunately this was after sshd, so I ssh'd to the box several times,
and did your ps command before hanging it with dbench 3 (dbench 1 and 2
were successful again). I have that ps output if you want it. I could
NOT do that, or even ls, after the dbench 3 hang. When I shutdown,
SYSRQ-S only partially synced (just one partition).
>
> Sure it does not happen with a stock kernel (no preempt)?

I'll try 2.5.35-bk3 (no other patches) without preempt shortly.
I'll report any failures, but no news is good news.
>
> What if you replace the printk() and dump_stack() in schedule() with a
> no-op (but not something that will optimize away the conditional, i.e.
> try a cpu_relax()).

I'll try that if I get the time. But I wasn't getting any dump info in
dmesg for those boots.
>
> Oh, is the previous patch fully backed out? None of that do_exit muck
> anymore, right?
Yep, plain 2.5.35, then patched with 2.5.35-bk3, then with your patch
posted at 15:23. Nothing else.
>
> > Test box is 2-way pIII, kernel SMP.
>
> I too am SMP with kernel preemption, dual Athlon MP.

Steven

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/