> Logging filesystems are beautifully elegant. With a checksum, we can easily detect power loss and fall back to the previous state by ignoring failed appends.
An edge case when designing a log-structured file system is that a corrupt checksum in a log entry could actually mean one of three things: power loss while writing the last transaction, a misdirected write or bitrot, or a misdirected read.
If you simply read the log up until the corrupt checksum, assuming this can only mean one thing, then you might fall too far back and lose data that has already been acknowledged to the user as synced. This could in turn break the guarantees required for implementing things like Paxos, and ultimately wreak havoc with a distributed system.
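A toy sketch of that naive recovery policy (the names and framing here are mine, not from any particular filesystem): replay entries until the first bad checksum and discard everything after it. Note how a bit flip in a middle entry silently drops acknowledged data written after it.

```python
import zlib

def append(log, payload: bytes):
    # Each entry carries its payload plus a CRC32 over that payload.
    log.append((payload, zlib.crc32(payload)))

def naive_replay(log):
    # Accept entries until the first checksum mismatch, then stop.
    # This cannot tell a torn final append from corruption of an older
    # entry, so mid-log bitrot discards every entry written after it.
    recovered = []
    for payload, crc in log:
        if zlib.crc32(payload) != crc:
            break
        recovered.append(payload)
    return recovered
```

Corrupting the second of three entries here makes recovery return only the first, even though the third was written (and possibly acknowledged) intact.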
To detect power loss, you need more than just a checksum. You still need the checksum, but you also need a way to "snap" the new log tail onto the old log tail in a two-phase commit. So you write out all your data, then you write out your new log tail, then you sync; then you snap your new log tail on using two atomic sector writes, one of which needs to be synced.
The number of syncs is the same as before; you haven't actually added any additional syncs. But this way you ensure that your log references data which exists, and you can now tell the difference between a power failure and a corrupt log entry.
A power failure is recoverable (roll back), but a corrupt log entry is unrecoverable (assuming no log entry replicas) and means the file system is unmountable.
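Here is a toy model of that two-phase commit (my own sketch; the atomic sector writes and syncs are reduced to comments, and a commit pointer stands in for the "snap" described above). The key property is that recovery can now distinguish an interrupted append from real corruption:

```python
import zlib

class Log:
    def __init__(self):
        self.entries = []   # (payload, crc) records as laid out on disk
        self.committed = 0  # commit pointer, updated by one atomic write

    def append_and_commit(self, payloads):
        # Phase 1: write the new entries and sync. They are not yet
        # reachable, so a crash here just leaves ignorable garbage.
        for p in payloads:
            self.entries.append((p, zlib.crc32(p)))
        # (sync here)
        # Phase 2: "snap" the new tail on with a single atomic pointer
        # update, then sync. A crash before this point rolls back cleanly.
        self.committed = len(self.entries)
        # (sync here)

    def recover(self):
        # Entries past the commit pointer are an interrupted append:
        # roll back. A bad checksum *inside* the committed region is
        # real corruption, not power loss, and is fatal.
        for p, crc in self.entries[:self.committed]:
            if zlib.crc32(p) != crc:
                raise IOError("committed entry corrupt: unmountable")
        return [p for p, _ in self.entries[:self.committed]]
```

With this split, entries written but never snapped on are simply ignored at mount, while a checksum failure before the commit pointer is correctly reported as corruption rather than silently rolled back.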
I'm not sure I'm following this snap operation. What is a "log tail" in this context? Does the snap operation still work if we are copying over entries of the log lazily and we don't have a definite end-of-log?
You are right, though: a checksum used this way provides no protection against bit errors or misdirected writes/reads. But if you assume no bit errors, a checksum can provide power-loss protection as long as there's a fallback.
But doesn't this just move the problem somewhere else? Yes. In this case it moves error detection onto the block device. But it turns out performing error detection/correction at the block device level is simpler and more effective. Most NAND flash components even have built-in ECC hardware for this specific purpose.
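One way to picture that division of labor (a sketch of the idea, not of any particular controller): if the block layer verifies a per-block checksum or ECC on every read, silent bitrot can never reach the log layer, so the log's checksum is only ever tripped by a torn append. The `SPARE` out-of-band bytes below mimic the spare area where NAND keeps its ECC:

```python
import zlib

SPARE = 4  # out-of-band bytes per block, like the spare area on NAND

def block_write(dev, lba, data: bytes):
    # Store the data together with an out-of-band CRC, the way NAND
    # controllers keep ECC alongside each page.
    dev[lba] = data + zlib.crc32(data).to_bytes(SPARE, "little")

def block_read(dev, lba):
    raw = dev[lba]
    data, crc = raw[:-SPARE], int.from_bytes(raw[-SPARE:], "little")
    if zlib.crc32(data) != crc:
        # Real ECC hardware would try to correct the error first; the
        # point here is just that corrupt data is never returned silently.
        raise IOError(f"uncorrectable error in block {lba}")
    return data
```

Upper layers reading through this interface see either good data or a hard error, never silent corruption, which is exactly what lets the log layer interpret its own checksum failures as power loss.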
This is not my field of expertise, so forgive me if I'm mistaken, but isn't 'Ohad Rodeh B-trees'[1] a simpler and more elegant solution than a journal/WAL?
As far as I know it's already used in Linux's Btrfs and in LMDB, and I wonder, if they were designing this from scratch, why they didn't go for it in the first place. Familiarity, perhaps?
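For what it's worth, the appeal of the Rodeh-style copy-on-write approach can be sketched in a few lines (a toy binary tree standing in for a real B-tree, my own illustration): an update copies only the path from leaf to root, and the single atomic swap of the root pointer is the commit, so there is no separate journal to replay.

```python
class Node:
    def __init__(self, key, value, left=None, right=None):
        self.key, self.value, self.left, self.right = key, value, left, right

def cow_insert(node, key, value):
    # Never mutate an existing node: return a fresh copy of the
    # search path, sharing every untouched subtree with the old tree.
    if node is None:
        return Node(key, value)
    if key < node.key:
        return Node(node.key, node.value,
                    cow_insert(node.left, key, value), node.right)
    if key > node.key:
        return Node(node.key, node.value,
                    node.left, cow_insert(node.right, key, value))
    return Node(key, value, node.left, node.right)

# The "commit" is a single atomic root-pointer swap: readers see either
# the old root or the new one, never a half-applied update.
root = cow_insert(None, 2, "b")
new_root = cow_insert(root, 1, "a")  # old root remains fully intact
```

A crash at any point leaves the old root valid, which is why a COW tree gets crash consistency without a WAL, at the cost of rewriting the path to the root on every update.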
By the way, I have code that deals with the SQLite Btree directly, and reading your comment I now understand why there's a need for a two-phase commit, as expressed in:
> An edge case when designing a log-structured file system is that a corrupt checksum in a log entry could actually mean one of three things: power loss while writing the last transaction, a misdirected write or bitrot, or a misdirected read.
>
> If you simply read the log up until the corrupt checksum, assuming this can only mean one thing, then you might fall too far back and lose data that has already been acknowledged to the user as synced. This could in turn break the guarantees required for implementing things like Paxos, and ultimately wreak havoc with a distributed system.
>
> To detect power loss, you need more than just a checksum. You still need the checksum, but you also need a way to "snap" the new log tail onto the old log tail in a two-phase commit. So you write out all your data, then you write out your new log tail, then you sync; then you snap your new log tail on using two atomic sector writes, one of which needs to be synced.
>
> The number of syncs is the same as before; you haven't actually added any additional syncs. But this way you ensure that your log references data which exists, and you can now tell the difference between a power failure and a corrupt log entry.
>
> A power failure is recoverable (roll back), but a corrupt log entry is unrecoverable (assuming no log entry replicas) and means the file system is unmountable.