On Sat, Jun 14, 2008 at 03:15:47AM +0100, Jamie Lokier (jamie@shareable.org) wrote:

Moreover, that's true :)

I regularly run and post various benchmarks comparing POHMELFS, NFS, XFS and Ext4. The main goal of POHMELFS at this stage is to be essentially as fast as the underlying local filesystem. And it is... Though there is a single place where it is not there yet (random reading; all other loads reached local FS speed, so it is from 10 to 300% faster than NFS in various loads :), but I'm working on it. I think it is not the server's side though.

Maybe it was a bit naive though :) But I checked lots of implementations, and all of them use the send()/recv() approach. NFSv4 uses a slightly different one, but it is cryptic, and at least from its names it is not clear: like nfs_pagein_multi() -> nfs_pageio_complete() -> add_stats. Presumably we add stats when we have data handy... CIFS/SMB use a synchronous approach. Of the projects which are not in the kernel, like CRFS and CEPH, the former uses an async receiving thread, while the latter is synchronous, but can select different servers for reading, more like NFSv4.1 leases.

It covers all operations, including reading, directory listing, lookups, attribute changes and so on. Its main goal is to allow transparent failover, so it has to be done for reading too.

Oplocks and leases are essentially a lock on a given file, which allows one client to operate on it. POHMELFS does not have locks now, and they will be created depending on how the distributed server will require them. In the simplest case it can just lock the file for writing and not allow updates to it from other clients. Lock acquisition can be done at write_begin time.

Without a lock, and with the writeback cache, in your case writeback for file Y can happen before writeback for file X. But if the client does not only write, but also syncs after its write, then yes, other clients will see later updates after earlier ones.

POHMELFS does not broadcast its interest in the file content until real writing happens, i.e. at writeback time. Although I can add a mode where the same is done at write_begin() time. In that case your example will work without sync.

-- 
	Evgeniy Polyakov
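For illustration only, the "write, then sync" ordering mentioned above can be sketched from userspace. This is a minimal sketch, not POHMELFS code: the helper names write_and_sync() and ordered_update() are hypothetical, and it assumes that fsync() on the mounted filesystem pushes the dirty pages of file X to the server before the call returns, so that writeback for Y cannot overtake X.

```python
import os


def write_and_sync(path, data):
    """Write data to path and block until fsync() reports it flushed.

    Hypothetical helper: on a network filesystem with a writeback cache,
    the assumption is that fsync() forces writeback of this file now
    instead of at some later, unordered time.
    """
    fd = os.open(path, os.O_WRONLY | os.O_CREAT | os.O_TRUNC, 0o644)
    try:
        view = memoryview(data)
        while view:
            # os.write() may write fewer bytes than requested.
            written = os.write(fd, view)
            view = view[written:]
        os.fsync(fd)
    finally:
        os.close(fd)


def ordered_update(path_x, data_x, path_y, data_y):
    """Update X, make it durable, and only then touch Y."""
    write_and_sync(path_x, data_x)
    write_and_sync(path_y, data_y)
```

Without the fsync() after X, both files would just sit dirty in the cache and the order in which they are written back would be up to the kernel.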
