Re: state or future of LFS?

To: Miles Nordin <carton%Ivy.NET@localhost>
Subject: Re: state or future of LFS?
From: Chuck Swiger <cswiger%mac.com@localhost>
Date: Mon, 13 Apr 2009 11:22:26 -0700

On Apr 12, 2009, at 12:40 AM, Miles Nordin wrote:


   cs>       F_FULLFSYNC Does the same thing as fsync(2) then asks
   cs> the drive to flush all buffered data

yeah well, whatever.

Yeah, well, relevant facts don't care about opinions: whether youacknowledge them or wish to ignore them is up to you.

This is what I was talking about:

http://oss.sgi.com/archives/xfs/2005-02/msg00395.html

which says that you have to use some special OS X only API to achieve
an fsync that's functionally equivalent to what every other Unix
normally gives you.  I don't think it's fair to mention all the other
detail without mentioning this well-known problem.

That's absolutely right-- only, I mentioned the special API (that'sthe F_FULLFSYNC bit you quoted above).

As far as I can tell (ie, from looking at the code), by default OSXdoes synchronous updates to data and async updates to filesystemmetadata because it trusts the journaling mechanism to keep themetadata consistent-- and this is well-documented, along with what anapp like a database should do to obtain ACID semantics from thefilesystem.


But let's focus on just what other Unices do with fsync():

and what every other Unix normally gives you is not really so thorough
as one might like (may incl. I think NetBSD? does not propogate SYNC
CACHE command all the way to the disk (ZFS does), or discards said
disk commands in the software RAID layer (Linux LVM2), iSCSI
correctness problems, u.s.w.), but is still more useful than what OS X
gives without the special option.

Whether the data actually gets written and the on-disk cache itselfflushed seems to depend on a sysctl called hw.ata.wc for FreeBSD orthe dkctl setting in NetBSD; write-caching seems to always default toon because otherwise people scream bloody murder about the factor often reduction in write performance with it off. Further, by default(ie, FFSv2 with soft updates), data changes are synced out when you doan fsync(), but metadata changes are done asynchronously-- which isexactly what OSX does.

I'm sure their API circus made them look great in filebench or bonnie
or fsstress or whatever benchmarks don't know about their special API,
though.  It's thoroughly bullshit, IMO.


Be careful of throwing stones-- from the authoritative source:

http://www.usenix.org/publications/library/proceedings/usenix2000/general/full_papers/seltzer/seltzer_html/index.html

"Both journaling and Soft Updates systems ensure the integrity of meta-data operations, but they provide slightly different semantics. Thefour areas of difference are the durability of meta-data operationssuch as create and delete, the status of the file system after areboot and recovery, the guarantees made about the data in files afterrecovery, and the ability to provide atomicity.

The original FFS implemented meta-data operations such as create,delete, and rename synchronously, guaranteeing that when the systemcall returned, the meta-data changes were persistent. Some FFSvariants (e.g., Solaris) made deletes asynchronous and other variants(e.g., SVR4) made create and rename asynchronous. However, on FreeBSD,FFS does guarantee that create, delete, and rename operations aresynchronous.FFS-async makes no such guarantees, and furthermore does not guaranteethat the resulting file system can be recovered (via fsck) to aconsistent state after failure. Thus, instead of being a viablecandidate for a production file system, FFS-async provides an upperbound on the performance one can expect to achieve with the FFSderivatives.

Soft Updates provides looser guarantees than FFS about when meta-datachanges reach disk. Create, delete, and rename operations typicallyreach disk within 45 seconds of the corresponding system call, but canbe delayed up to 90 seconds in certain boundary cases (a newly createdfile in a hierarchy of newly created directories)."


--
-Chuck

Follow-Ups:
- Re: state or future of LFS?
  - From: Miles Nordin

References:
- state or future of LFS?
  - From: Niels Dettenbach
- Re: state or future of LFS?
  - From: Wouter Klouwen
- Re: state or future of LFS?
  - From: Niels Dettenbach
- Re: state or future of LFS?
  - From: Wouter Klouwen
- Re: state or future of LFS?
  - From: Miles Nordin
- Re: state or future of LFS?
  - From: Chuck Swiger
- Re: state or future of LFS?
  - From: Miles Nordin

Prev by Date: Re: state or future of LFS?
Next by Date: Marvell 88E8038 ethernet controller gives "no carrier" under NetBSD 5, isn't detected under NetBSD 4
Previous by Thread: Re: state or future of LFS?
Next by Thread: Re: state or future of LFS?
Indexes:

Home | Main Index | Thread Index | Old Index