Re: File copy system call proposal

Quinn Harris (quinn@nmt.edu)
07 Dec 2001 23:03:59 -0700


On Fri, 2001-12-07 at 21:00, H. Peter Anvin wrote:
> Followup to: <1007782956.355.2.camel@quinn.rcn.nmt.edu>
> By author: Quinn Harris <quinn@nmt.edu>
> In newsgroup: linux.dev.kernel
> >
> > All kernel copy:
> > Commands like cp and install open the source and destination file using
> > the open sys call. The data from the source is copied to the
> > destination by repeatedly calling the read then write sys calls. This
> > process involves copying the data in the file from kernel memory space
> > to the user memory space and back again. Note that all this copying is
> > done by the kernel upon calling read or write. I would expect if this
> > can be moved completely into the kernel no memory copy operations would
> > be performed by the processor by using hardware DMA.
> >
>
> mmap(source file);
> write(target file, mmap region);
>
> -hpa

mmap will indeed get a file into user mem space without any memcopy
operation. But as far as I can tell from examining generic_file_write
(in mm/filemap.c) used by ext2 and I asume many others, a write will
copy the memory even if it was mapped via mmap. Am I missing
something? This isn't true if kiobuf is used but as I understand it,
this bypasses the buffer cache.

I would like to see a zero-memcopy file copy. A file copy that would
read the file into the buffer cache if its not already there similar to
a normal read then write it back to disk from the cache without
duplicating the pages. This would probably lead to modifying the buffer
cache to allow multiple buffer_heads to refer to the same data which
might not be worth the overhead.

One might implement such a thing by attempting in the generic_file_read
to determine if the memory range is an actual page or pages that can be
eventually written to the disk without a memcopy. This could
conceivably make duplications between different files to not take up
duplicate pages. But what is the chance that there are any sizeable
number of identical pages in the buffer cache do to anything but a file
copy?

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/