current as segment register was Re: Using %cr2 to reference "current"

Andi Kleen (ak@suse.de)
06 Nov 2001 20:02:09 +0100

Next message: Gunther Mayer: "Re: 2.4.12-ac3 floppy module requires 0x3f0-0x3f1 ioports"
Previous message: Dennis J.A. Bijwaard: "kernel panic smp & usb scanner, with ksymoops, superseded"
Next in thread: Keith Owens: "Re: current as segment register was Re: Using %cr2 to reference "current""
Reply: Keith Owens: "Re: current as segment register was Re: Using %cr2 to reference "current""

Linus Torvalds <torvalds@transmeta.com> writes:

> There are some people who think that the 5kB stack we have now is too
> small ;(

It was too small on x86-64 ;)

>
> So it would basically be a small per-CPU/thread area, not just the "struct
> task_struct".

x86-64 has such a per-CPU area. It is accessed via %gs and is needed to make
the SYSCALL entry code work. So far I haven't moved many fields into it, to
keep the patches small, but current is just a field in the PDA (processor
data area). x86-64 has special instructions to quickly save/restore %gs on
kernel entry.

One nasty thing is that lea doesn't work with segment registers, so you
cannot simply take the address of a PDA field. It needs a quite ugly inline
assembly mess, similar to get/put_user, to implement read/write/add_pda
functions for the fundamental types. To implement that properly without
hardcoded offsets you also need an offset.c, like many other ports have,
that puts structure offsets into an include file at compile time.
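
A rough user-space sketch of the offset.c idea, to make the point concrete.
The struct layout, the field names and the pda_ prefix are assumptions made
up for illustration, not the real x86-64 PDA; a real port would run
something like this at build time and redirect the output into a header
that the assembly code includes.

```c
#include <assert.h>
#include <stddef.h>
#include <stdio.h>

struct task_struct;                 /* opaque here; only a pointer is stored */

/* Hypothetical per-CPU area. Field names are illustrative only. */
struct pda_sketch {
    struct task_struct *pcurrent;   /* the "current" task */
    unsigned long kernelstack;      /* kernel stack pointer for SYSCALL entry */
    int cpunumber;                  /* what smp_processor_id() would return */
};

/* Format one "#define NAME OFFSET" line into buf; returns snprintf's result. */
int emit_offset(char *buf, size_t len, const char *name, size_t off)
{
    assert(buf != NULL && name != NULL);
    return snprintf(buf, len, "#define %s %zu\n", name, off);
}

/* Emit one line for a field, named pda_<field>, using offsetof(). */
#define EMIT(out, field) do {                                   \
        char line[64];                                          \
        emit_offset(line, sizeof line, "pda_" #field,           \
                    offsetof(struct pda_sketch, field));        \
        fputs(line, (out));                                     \
    } while (0)

/* Write the generated header body to out; returns the number of lines. */
int emit_pda_offsets(FILE *out)
{
    assert(out != NULL);
    EMIT(out, pcurrent);
    EMIT(out, kernelstack);
    EMIT(out, cpunumber);
    return 3;
}
```

Calling emit_pda_offsets(stdout) prints one #define per field. The inline
assembly can then use the numeric constant as a %gs-relative displacement
instead of a hand-maintained magic number.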

Also, all system entry points have to be entered with interrupts off to
avoid races when switching %gs.

I have not moved the task_struct into a slab cache yet, but that is a
trivial step now because all the infrastructure is there.

The code size difference between current as rsp & mask and current as
%gs:current_offset seems to be minimal. smp_processor_id() is much cheaper,
however. The existing x86 implementation of that is quite bad.
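
The two ways of finding current can be compared in a small user-space
sketch. Everything here is an assumption for illustration: an 8 kB stack
block aligned to its own size with the task at its base, and a made-up
struct pda_sketch whose base pointer is passed explicitly (in the kernel it
would be implicit in %gs).

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Assumed: stack block is 8 kB, a power of two, and aligned to its size. */
#define THREAD_SIZE ((uintptr_t)8192)

struct task_sketch { int pid; };

/* Hypothetical per-CPU area holding current and the CPU number. */
struct pda_sketch {
    struct task_sketch *pcurrent;
    int cpunumber;
};

/* "rsp & mask": the task lives at the base of the aligned stack block,
 * so clearing the low bits of the stack pointer finds it. */
uintptr_t stack_base(uintptr_t sp)
{
    return sp & ~(THREAD_SIZE - 1);
}

/* PDA style: current is a plain load from a fixed offset in the area. */
struct task_sketch *current_from_pda(const struct pda_sketch *pda)
{
    assert(pda != NULL);
    return pda->pcurrent;
}

/* The CPU number is likewise one load, with no detour through the task. */
int cpu_from_pda(const struct pda_sketch *pda)
{
    assert(pda != NULL);
    return pda->cpunumber;
}
```

For example, stack_base(0x12345678) yields 0x12344000. Both lookups for
current are only an instruction or two, which fits the observation that the
code size is about the same; the gain is for fields such as the CPU number
that no longer have to be reached through the task.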

Long term I hope more per-CPU data (e.g. in networking, in the scheduler or
in interrupt handling) will migrate into the per-CPU data area.
Accessing the data there is much more efficient than indexing a cache-line
padded array, and it saves a lot of cache too because one CPU's fields can
share cache lines.
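
A small sketch of the cache argument, under stated assumptions: a 64-byte
cache line, four CPUs, and three made-up counters standing in for
networking, scheduler and interrupt statistics. None of these names come
from the kernel.

```c
#include <assert.h>
#include <stddef.h>

#define NR_CPUS_SKETCH 4
#define CACHE_LINE     64   /* assumed line size */

/* Style 1: every subsystem has its own array, padded so that two CPUs
 * never share a line. Each counter costs a full line per CPU. */
struct padded_counter {
    unsigned long value;
    char pad[CACHE_LINE - sizeof(unsigned long)];
};

struct padded_counter net_stat_arr[NR_CPUS_SKETCH];
struct padded_counter sched_stat_arr[NR_CPUS_SKETCH];
struct padded_counter irq_stat_arr[NR_CPUS_SKETCH];

/* Access means computing base + cpu * CACHE_LINE first. */
void padded_inc(struct padded_counter *arr, int cpu)
{
    assert(cpu >= 0 && cpu < NR_CPUS_SKETCH);
    arr[cpu].value++;
}

/* Style 2: one per-CPU area; one CPU's fields sit next to each other. */
struct pda_sketch {
    unsigned long net_stat;
    unsigned long sched_stat;
    unsigned long irq_stat;
};

/* In the kernel the base would be implicit in %gs; here it is passed in. */
void pda_inc_net(struct pda_sketch *pda)
{
    assert(pda != NULL);
    pda->net_stat++;
}

/* Bytes of cache one CPU needs for the three counters in each style. */
size_t bytes_per_cpu_padded(void) { return 3 * sizeof(struct padded_counter); }
size_t bytes_per_cpu_pda(void)    { return sizeof(struct pda_sketch); }
```

With these assumptions the padded arrays take three cache lines per CPU
while the packed area fits in one, and the access needs no index
computation.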

-Andi
