Sorry, yes - I was doing single connection tests with no changes to the
default Netperf settings. Each machine was running a copy of the server and
then I ran the test with "netperf -H x.x.x.x" to each machine (or just
"netperf" for a localhost speed check).
>However, we did run into serious performance problems with
>the Netgear FA311/2 (tulip). Found that the link lost
>connectivity because of card lockups and transmit timeout
>failures - and some of these were silent. However, I moved
>to the 3C905C (3c59x driver) which behaved like a champ, and
>we didnt see the problems any more, so have stuck to that card.
>This was back in the 2.4.0 time frame, and there have many
>patches since then to various drivers, so not sure if the
>problem(s) have been resolved or not (likely to have been,
>extensively reported). Both your cards might actually be
>underperforming..
I'm a little confused here - do you mean the FA310TX ("tulip" driver) or the
FA311/2 ("natsemi" driver)? I have not had any connection problems with
either the FA310 or the FA311 cards. I haven't noticed any speed problems
with the FA311 card, but I haven't benchmarked it, either. The FA310 is so
horribly slow, I couldn't help but notice. Unfortunately, the same is true
of the 3cSOHO.
While I am willing to accept that both the FA310 and FA311 cards are
underperforming, I think it is more than a little strange that the 3cSOHO
card would turn in the same performance numbers. Also, keep in mind that I
was only seeing horrible receive performance, TX performance seemed to be
ok.
I didn't post FTP numbers (both machines are running FTP servers). While the
FTP performance numbers are probably not as "scientific" as Netperf, they do
seem to agree from what I have observered. I.e. retrieving files from the
P-90 machine is ok (~3MB/sec) but sending files to it is very slow
(~100K/sec). This roughly agrees with the Netperf numbers I saw.
FTP transfers to the FA311 machine (P2-350) are OK in both directions.
>Are you seeing any errors reported in /var/log/messages?
>Are you monitoring your connection via tcpdump, for example?
>You might sometimes see long gaps in transmission...Are
>there any abnormal numbers in /proc/net/ stats? I dont remember
>seeing that high frame errors, although there were a few.
No, I don't see anything in /var/log/messages.
I looked at tcpdump to try and figure it out, and it appeared that the P-90
was taking a very long time to ACK some packets. I am not a TCP/IP guru by
any stretch, but my guess at the time was that the packets that were taking
forever to get ACK'ed were the ones causing a framing error on the P-90, but
again, I'm not an expert.
The only unusual stat is the framing errors. There are a lot of them under
heavy receive load. The machine will go for weeks without a single framing
error, but if I blast some netperf action at it (or FTP send to it, etc.)
then I get about 1/3 of the incoming packets (to the P-90) with framing
errors. I see no other errors at all except a TX overrun error (maybe 1 in
100000 packets).
>HW checksumming for the kind of test you are doing (tcp, mostly
>fast path) will not buy you any real performance gain, the
>checksum is actually consumed by the user-kernel copy routine.
Ok, I'll take your word for it. The P-90 isn't a very fast machine to begin
with, so I thought it could use all the HW assistance it could get (that and
the 3cSOHO card was really cheap :-).
I am very disappointed that TCP/IP performance on this machine is so lousy,
but the problem is clearly with the kernel - just look at the performance
numbers for 2.4.3 vs 2.2.19 (or 2.2.16). Those numbers aren't exactly great,
but they are a lot better than 2.4.3.
>You can also run the tests on a profiling kernel and compare
>results...
>
>Nivedita
>
>---
>Nivedita Singhvi (503) 578-4580
>Linux Technology Center nivedita@us.ibm.com
>IBM Beaverton, OR nivedita@sequent.com
Thanks for the assistance. Based on the benchmark information I have, I
would say that there is a problem with the kernel and would like to pursue
getting that fixed. I just can't justify why 2.4.3 should be 600% slower
than 2.2.19.
- John
_________________________________________________________________
Get your FREE download of MSN Explorer at http://explorer.msn.com
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/