Journaling: Difference between revisions

From
Jump to navigation Jump to search
(first edit)
 
(added links)
 
(One intermediate revision by the same user not shown)
Line 1: Line 1:
== Introduction ==
== Introduction ==
The goal both in Journaling and [[Soft-Updates]]
The goal both in Journaling and [[Soft-Updates]]
is to speed up file writes, especially the writing of [[Basics#meta-data]].
is to speed up file writes, especially the writing of meta-data.


Journaling does this by writing alle the meta-data changes en bloc
Journaling does this by writing alle the meta-data changes en bloc
Line 14: Line 14:
this is the most efficient option.
this is the most efficient option.


A Journal contains three types of entries[ ]
A Journal contains three types of entries
* the actual meta-data
* the actual meta-data
* descriptors which say where the meta-data belongs, and
* descriptors which say where the meta-data belongs, and
Line 34: Line 34:
== Crash recovery ==
== Crash recovery ==
If a crash occurs during operations,
If a crash occurs during operations,
the file system may be in an [[Basics#inconsistent state]].
the file system may be in an inconsistent state.
This inconsistency may comee from any of the active operations in the log.
This inconsistency may comee from any of the active operations in the log.
To repair the system, the log is read,
To repair the system, the log is read,
Line 50: Line 50:


This effect is dwarfed by the huge savings
This effect is dwarfed by the huge savings
that arise from [[Basics#asynchronously]] writing the meta-data
that arise from asynchronously writing the meta-data
and by writing the journal in huge chunks.
and by writing the journal in huge chunks.


Line 62: Line 62:
Journaling yeilded an increase in 50 to 400 %
Journaling yeilded an increase in 50 to 400 %
versus standard operations.
versus standard operations.

== Links ==
*[http://www.usenix.org/publications/library/proceedings/usenix2000/general/full_papers/seltzer/seltzer.pdf Journaling versus Soft Updates: Asynchronous Meta-data Protection in File Systems]
*[http://pdos.csail.mit.edu/papers/cffs-usenix97.ps Embedded Inodes and Explicit Grouping: Exploiting Disk Bandwidth for Small files]
*[http://pdos.csail.mit.edu/6.824/papers/hagmann-fs.pdf Reimplementing the Cedar File System Using Logging and Group Commit]
*[http://www.hpl.hp.com/personal/Craig_Soules/papers/TOCS.softupdates.pdf Soft Updates: A Solution to the Metadata Update Problem in File Systems]
* http://en.wikipedia.org/wiki/Comparison_of_file_systems

Latest revision as of 15:07, 4 May 2007

Introduction

The goal both in Journaling and Soft-Updates is to speed up file writes, especially the writing of meta-data.

Journaling does this by writing alle the meta-data changes en bloc to a consecutive log before the data write. If a crash occurs during the write, the log is either replayed or undone.

The Journal

is a consecutive and circular buffer which stores the meta-data. It can be either a file in the current partition, a partition on its own or even a separate disk - this is the most efficient option.

A Journal contains three types of entries

  • the actual meta-data
  • descriptors which say where the meta-data belongs, and
  • a start and end marker

Before each write of real data, the meta-data is written to the log and the end-marker is moved.

After the data and the meta-data writes have completed, the start marker moves to reflect the new start of the journal.

(If a journal contains several entries, they may be processed out of their log sequence. If so, the system remembers the completed parts and moves the start marker forward that far after the first block is on disk, too.)

Crash recovery

If a crash occurs during operations, the file system may be in an inconsistent state. This inconsistency may comee from any of the active operations in the log. To repair the system, the log is read, the start and end are determined and the open operations are either all undone or completed.

This takes a small amount of time but much less than checking the entire fs.

Efficiency

On the first view, logging leads to an increase in disk writes: meta-data is written not once, but twice: one time to the journal, the second time to the real file system.

This effect is dwarfed by the huge savings that arise from asynchronously writing the meta-data and by writing the journal in huge chunks.

Both procedures reduce disk seeks, the expensive operation in writes. In addition, not all operations that are logged need to be carried out: if, for example, a directory is changed twice in short succession, only the second state needs to reach the disk.

In tests in the literature, Journaling yeilded an increase in 50 to 400 % versus standard operations.

Links