Re: [PATCH] natsemi compiler workaround & cleanup

Manfred Spraul (manfred@colorfullife.com)
Thu, 12 Jul 2001 07:07:35 +0200

Messages sorted by: [ date ][ thread ][ subject ][ author ]
Next message: John Alvord: "Re: Switching Kernels without Rebooting?"
Previous message: Davide Libenzi: "Re: Improving (network) IO performance ..."
Maybe in reply to: Manfred Spraul: "[PATCH] natsemi compiler workaround & cleanup"

Donald Becker wrote:
>
> On Wed, 11 Jul 2001, Manfred Spraul wrote:
>
> > The rx setup in init_ring() in drivers/net/natsemi.c in 2.4.6 is
> > miscompiled by several gcc-2.95 versions.
> >
> > I could reproduce it with 2.95.1, and I received bug reports with
> > gcc version 2.95.2 20000220 (Debian GNU/Linux)
> > gcc 2.95.3 19991030 from Mandrake 7.2
>
> I don't agree with all of the patch, but I can address this specific point.
>
> > egcs-1.12 and rh gcc 2.96-85 are not affected.
>
> My version had the code structured as
>
> /* Initialize all Rx descriptors. */
> for (i = 0; i < RX_RING_SIZE; i++) {
> np->rx_ring[i].next_desc = virt_to_le32desc(&np->rx_ring[i+1]);
> np->rx_ring[i].cmd_status = DescOwn;
> np->rx_skbuff[i] = 0;
> }
>
> This code does not trigger the difference in compiler behavior. (I'm not
> certain that the less-than-transparent behavior could be accurately
> called a bug.)
>

It is a bug. np->rx_ring[i].next_desc is initialized by 2.4.6 with
gcc-2.95.1 to
[0]: 0x...210
[1]: 0x00000000
[2]: 0x...220
[3]: 0x...230
[4]: 0x...240.
etc.

> I realize the 2.4 code was changed to have the descriptor ring base
> be pre-translated from a virtual address to PCI bus-accessable physical
> memory address, and used offsets from that base. I'm pointing out that
> this problem doesn't exist in the 2.2. Note that the code above
> explicitly translates each descriptor ring entry to a physical address
> individually.
>
Correct. The problem was introduce by the virt_to_desc to pci_dma
conversion.

> > The patch also cleans up the suspend/resume synchronization and removes
> > 2 superflous (& wrong) spin_unlock calls.
>
> The (large) patch seems to add some unnecessary locking.
>
It's possible that some locking outside of the tx and rx codepath is not
required, I'm concentrating on a race free suspend & resume
implementation.
It shouldn't affect the critical functions:
rx interrupts run without a spinlock, and start_tx only acquires the
lock around "status = DescOwn;np->cur_rx++". netdev_tx_done() during
start_tx is an idea for tx interrupt mitigation, it's not yet finished.

--
	Manfred
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

Next message: John Alvord: "Re: Switching Kernels without Rebooting?"
Previous message: Davide Libenzi: "Re: Improving (network) IO performance ..."
Maybe in reply to: Manfred Spraul: "[PATCH] natsemi compiler workaround & cleanup"