OK, number seven on the list was going to be thinking about tracking
timeouts at the block layer.
Tag starvation handling is a property of each individual SCSI HBA (and I
know it shouldn't be). Some do a good job, some don't. However, it's
implicitly also handled in the SCSI error handler because on timeout we
quiesce the device (which finally causes the starved tag to execute). I
have thought about doing it properly in the error handler, but it's a
radical divergence---you basically need to begin to throttle the queue,
but wait to see if your starved tag gets serviced and then increase the
feed again (as opposed to our current quiesce then handle model).
> As I mentioned, Andrew actually had some really bad latency numbers to
> prove that this is a real issue. SCSI with more than 4 tags or so results
> in potentially _major_ starvation, because the disks themselves are not
> even trying to avoid it.
Tag starvation isn't really a problem for the majority of modern SCSI
drives (big iron vendors yelled and kicked about this a while ago). You
can still find some of the older drives that do have these issues (I
keep several for the purposes of exciting the error handling).
> Also, even aside from the starvation issue with unfair disks, just from a
> "linear head movement" standpoint, you really want to sort the queue
> according to what is going on _now_ in the disk. If the disk eats the
> requests immediately (but doesn't execute them immediately), the sorting
> has nothing to go on, and you get tons of back-and-forth movements.
Most scsi discs have a non linear geometry anyway, so "head scheduling"
is pretty useless to them. All SCSI really wants is I/O consolidation.
i.e. we'd like to use fewer larger sized requests.
James
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/