Re: SMP/cc Cluster description

Henning Schmiedehausen (hps@intermeta.de)
07 Dec 2001 09:54:58 +0100

Next message: Kiril Vidimce: "Re: kernel: ldt allocation failed"
Previous message: rohit prasad: "kernel size"
In reply to: David S. Miller: "Re: SMP/cc Cluster description"
Next in thread: Larry McVoy: "Re: SMP/cc Cluster description"
Reply: Larry McVoy: "Re: SMP/cc Cluster description"

On Thu, 2001-12-06 at 22:30, Daniel Phillips wrote:
> On December 6, 2001 09:21 pm, Larry McVoy wrote:
> > On Thu, Dec 06, 2001 at 12:15:54PM -0800, David S. Miller wrote:
> > > If you aren't getting rid of this locking, what is the point?
> > > That is what we are trying to talk about.
> >
> > The points are:
> >
> > a) you have to thread the entire kernel, every data structure which is a
> > problem. Scheduler, networking, device drivers, everything. That's
> > thousands of locks and uncountable bugs, not to mention the impact on
> > uniprocessor performance.
> >
> > b) I have to thread a file system.
>
> OK, this is your central point. It's a little more than just a mmap, no?
> We're pressing you on your specific ideas on how to handle the 'peripheral'
> details.

How about making one node the "master" and writing a "cluster network
filesystem" that uses shared memory as its "network layer"?

Then boot all other nodes diskless from this cluster network
filesystem.

You can still have shared mmap (which I believe is Larry's toy point)
between the nodes, but you avoid all of the filesystem locking issues
because you're going over a (hopefully superfast) memory network
filesystem.
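
To make the "shared memory as network layer" idea a bit more concrete,
here is a minimal sketch, not anything proposed in this thread: a
single-producer, single-consumer message ring living in one shared
mapping. The names ShmRing, send and recv are hypothetical, the region
is an anonymous mmap inside one process, and the sketch assumes
cache-coherent memory; a real inter-node transport would also need
memory barriers and some wakeup mechanism.

```python
import mmap
import struct

# Header at the start of the shared region: read counter, write counter.
# Both only ever grow; the position in the ring is counter % capacity.
HEADER = struct.Struct("<QQ")
LENGTH = struct.Struct("<I")   # per-message length prefix


class ShmRing:
    """Toy single-producer/single-consumer message ring (hypothetical)."""

    def __init__(self, buf, capacity):
        self.buf = buf            # mmap object (or any writable buffer)
        self.cap = capacity       # bytes available for message data
        self.base = HEADER.size   # data area starts after the header

    def _write(self, pos, data):
        start = pos % self.cap
        first = min(len(data), self.cap - start)
        self.buf[self.base + start:self.base + start + first] = data[:first]
        rest = data[first:]
        if rest:                  # wrapped around the end of the ring
            self.buf[self.base:self.base + len(rest)] = rest

    def _read(self, pos, n):
        start = pos % self.cap
        first = min(n, self.cap - start)
        out = self.buf[self.base + start:self.base + start + first]
        return out + self.buf[self.base:self.base + (n - first)]

    def send(self, payload):
        head, tail = HEADER.unpack_from(self.buf, 0)
        need = LENGTH.size + len(payload)
        if need > self.cap - (tail - head):
            return False          # ring full, caller has to retry
        self._write(tail, LENGTH.pack(len(payload)) + payload)
        HEADER.pack_into(self.buf, 0, head, tail + need)
        return True

    def recv(self):
        head, tail = HEADER.unpack_from(self.buf, 0)
        if head == tail:
            return None           # ring empty
        (n,) = LENGTH.unpack(self._read(head, LENGTH.size))
        payload = self._read(head + LENGTH.size, n)
        HEADER.pack_into(self.buf, 0, head + LENGTH.size + n, tail)
        return payload


if __name__ == "__main__":
    region = mmap.mmap(-1, HEADER.size + 64)   # anonymous, zero-filled
    ring = ShmRing(region, 64)
    ring.send(b"read /etc/passwd")
    print(ring.recv())
```

A filesystem client on a diskless node would put its requests into one
such ring and read replies from a second one, so the only shared state
is the two counters instead of the filesystem's own data structures.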

Or go iSCSI and attach a network device to each of the cluster nodes. Or
go 802.1q, attach a virtual network device to each cluster node, pull
all of them out over GigE and let some Cisco outside sort these out
again. :-)

What I don't like about the approach is the assumption that all nodes
should share the same file system. IMHO one does not want this, at least
not for /etc. The "all nodes share the same FS" model works fine for your
number cruncher clusters where every node runs more or less the same
software.

It does not work for cluster boxes like the Sun Starfire or the IBM
S/390. There you want to boot totally different OS images on the nodes.
There is no sense in implementing a "threaded file system" that is not
used.

Regards
Henning

--
Dipl.-Inf. (Univ.) Henning P. Schmiedehausen -- Geschaeftsfuehrer
INTERMETA - Gesellschaft fuer Mehrwertdienste mbH     hps@intermeta.de

Am Schwabachgrund 22  Fon.: 09131 / 50654-0   info@intermeta.de
D-91054 Buckenhof     Fax.: 09131 / 50654-20

