Post: File System


Overview

Introduction: this blog post covers the basics of an operating system’s file system.

Hardware:

In a spinning hard disk, data is encoded on circular platters.
To access a piece of data, the disk must spin until the data's location on the platter lines up with the head.

File data: the data associated with the file (excluding metadata)

Structure:

Array of bytes:
- All the data in a file can be seen as an array of raw bytes
- Each byte will have a unique offset
Records:
- File data is stored as a collection of records.
  - Each file has many records and each record has many bytes
- Variant 1: fixed length records:
  - Each record has a fixed length
  - Structured as an array of fixed length records
  - A record is easy to access: its byte offset is its index in the record array multiplied by the record length
- Variant 2: variable length records:
  - Each record can have a different length
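
To make the fixed length variant concrete, here is a minimal sketch. The record layout (a 4-byte id followed by an 8-byte name) and the helper names `record_offset` and `read_record` are assumptions made up for illustration, not part of any real file format.

```python
import io
import struct

# Hypothetical record layout: 4-byte unsigned id, then an 8-byte name field.
RECORD_FORMAT = "<I8s"
RECORD_SIZE = struct.calcsize(RECORD_FORMAT)  # 12 bytes per record


def record_offset(index):
    """Byte offset of record `index` in an array of fixed length records."""
    return index * RECORD_SIZE


def read_record(f, index):
    """Jump straight to one record and decode it."""
    f.seek(record_offset(index))
    raw = f.read(RECORD_SIZE)
    record_id, name = struct.unpack(RECORD_FORMAT, raw)
    return record_id, name.rstrip(b"\x00").decode("ascii")


# Build a small in-memory "file" holding three records.
data = io.BytesIO()
for i, name in enumerate([b"alice", b"bob", b"carol"]):
    data.write(struct.pack(RECORD_FORMAT, i, name))  # short names are zero-padded
```

Here `read_record(data, 2)` returns `(2, "carol")` without touching the first two records. With variable length records this multiplication no longer works, so a reader would typically have to scan from the start or keep a separate index of record offsets.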

Access method:

Sequential Access:
- Access the bytes in the file data sequentially
- Bytes must be read in order; none can be skipped
Random Access:
- Can access any byte in the file data
- Either via Read(offset), or via Seek(offset) followed by a read
Direct Access:
- Access any record directly, in any order
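
The difference between sequential and random access can be sketched with an in-memory file. This uses Python's `io.BytesIO` as a stand-in for a real file; the ten-byte content is made up for the example.

```python
import io

f = io.BytesIO(b"ABCDEFGHIJ")

# Sequential access: each read continues where the previous one stopped.
first = f.read(3)
second = f.read(3)

# Random access: move the current position to any byte offset, then read.
f.seek(7)
tail = f.read(2)
```

After this runs, `first` is `b"ABC"`, `second` is `b"DEF"`, and `tail` is `b"HI"`. The two sequential reads never name an offset, while the random read jumps over byte 6 entirely.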

Open: prepares all the information needed to operate on the file. Must be called before any other file operation
Create: creates a new file
Read: reads data, usually starting from the current position
Write: writes data, usually starting from the current position
Repositioning aka seek: moves the current position to a new location
Truncate: removes data from a given position to the end of the file
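
As a rough illustration, the operations above map onto Python's `os` module, which wraps the corresponding system calls. The file name `demo.txt` and its content are assumptions for the example. Note that `os.ftruncate` takes the new length explicitly rather than using the current position.

```python
import os
import tempfile

path = os.path.join(tempfile.mkdtemp(), "demo.txt")

# Create + open: O_CREAT creates the file if it is missing; we get a file descriptor back.
fd = os.open(path, os.O_RDWR | os.O_CREAT, 0o600)

# Write: starts at the current position (0 for a freshly opened file).
os.write(fd, b"hello world")

# Seek: move the current position back to the start of the file.
os.lseek(fd, 0, os.SEEK_SET)

# Read: returns up to 5 bytes and advances the current position.
head = os.read(fd, 5)

# Truncate: keep the first 5 bytes, dropping everything from offset 5 to the end.
os.ftruncate(fd, 5)
size_after = os.fstat(fd).st_size

os.close(fd)
```

Here `head` is `b"hello"` and `size_after` is `5`.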

Information needed for file operation:

Caveats:

Unix Implementation:

Reference Counting:

The system-wide open file table and the vnode table use reference counts to decide when to evict an entry
Each entry in a process's file descriptor table holds a reference to an entry in the system-wide open file table
Each entry in the system-wide open file table holds a reference to an entry in the vnode table
The OS uses the reference count to detect when no entry in a higher-level table still refers to a given entry
That entry is evicted once its reference count drops to 0
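
The eviction rule can be sketched with a toy model. Everything here (`ToyKernel`, `OpenFile`, `Vnode`, and the use of a plain dict as a per-process file descriptor table) is a hypothetical simplification for illustration, not how any real kernel lays out its tables.

```python
from dataclasses import dataclass


@dataclass(eq=False)  # compare by identity so list.remove() drops the exact entry
class Vnode:
    path: str
    ref_count: int = 0


@dataclass(eq=False)
class OpenFile:
    vnode: Vnode
    offset: int = 0
    ref_count: int = 0


class ToyKernel:
    """Hypothetical model of the system-wide open file table and the vnode table."""

    def __init__(self):
        self.vnodes = {}      # path -> Vnode
        self.open_files = []  # system-wide open file table

    @staticmethod
    def _lowest_free_fd(fd_table):
        fd = 0
        while fd in fd_table:
            fd += 1
        return fd

    def open(self, fd_table, path):
        vnode = self.vnodes.get(path)
        if vnode is None:
            vnode = self.vnodes[path] = Vnode(path)
        vnode.ref_count += 1  # one more open file entry refers to this vnode
        entry = OpenFile(vnode, ref_count=1)
        self.open_files.append(entry)
        fd = self._lowest_free_fd(fd_table)
        fd_table[fd] = entry
        return fd

    def dup(self, fd_table, fd):
        entry = fd_table[fd]
        entry.ref_count += 1  # a second descriptor now shares the same entry
        new_fd = self._lowest_free_fd(fd_table)
        fd_table[new_fd] = entry
        return new_fd

    def close(self, fd_table, fd):
        entry = fd_table.pop(fd)
        entry.ref_count -= 1
        if entry.ref_count == 0:  # no descriptor refers to the entry: evict it
            self.open_files.remove(entry)
            entry.vnode.ref_count -= 1
            if entry.vnode.ref_count == 0:  # no open file entry refers to the vnode
                del self.vnodes[entry.vnode.path]
```

In this model, opening a file and duplicating its descriptor leaves one open file entry with a reference count of 2. Closing one descriptor keeps the entry alive; closing the second drops the count to 0, which evicts the open file entry and, in turn, the vnode.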


Components:

Disk structure: a disk can be treated as a 1-D array of logical blocks
Logical block: the smallest accessible unit on the disk (usually 512 B to 4 KB)
Disk sector: each logical block is mapped onto one or more disk sectors
Master Boot Record (MBR): located at sector 0, the start of the disk
- Stores the OS boot-up information
Partition: located after the MBR
- Each partition stores the information on how its files are located and accessed
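
Treating the disk as a 1-D array of logical blocks reduces addressing to integer arithmetic. The sketch below assumes a 4 KB block size; `locate` and `blocks_needed` are hypothetical helper names.

```python
BLOCK_SIZE = 4096  # assumed logical block size in bytes


def locate(byte_offset, block_size=BLOCK_SIZE):
    """Return (logical block number, offset inside that block) for a byte offset."""
    return divmod(byte_offset, block_size)


def blocks_needed(num_bytes, block_size=BLOCK_SIZE):
    """Number of whole logical blocks needed to hold `num_bytes` (ceiling division)."""
    return -(-num_bytes // block_size)
```

For example, `locate(10000)` returns `(2, 1808)`, and `blocks_needed(10000)` returns `3`, so a 10,000-byte file leaves part of its last block unused.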

Problem: similar to memory allocation, file systems face the problem of external fragmentation.

Should the blocks be contiguous or non-contiguous?
(klement: internal fragmentation is not really a problem, as the waste per file is bounded by the logical block size)

Naive solutions:

Linked List:
- Each file will have a head block and each block will contain a pointer to the next block.
- Random access: O(N), since every block before the target must be visited
- Can be optimised by keeping the block pointers in memory (the traversal is still O(N), but it no longer needs disk reads).
Direct indexing:
- Each file will have a special index block
- Index block will contain an array of pointers to all the blocks that belong to the file.
- Cons: the file size is limited by the number of pointers that fit in one logical block.
- Can be optimised using a multi-level table (similar to multi-level paging)
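
The two naive schemes can be compared with a toy disk. The block numbers, payloads, and the 4 KB block / 4-byte pointer sizes below are assumptions chosen for illustration.

```python
# Linked list scheme: block number -> (payload, next block number or None).
linked_disk = {
    7: (b"AAAA", 2),
    2: (b"BBBB", 9),
    9: (b"CCCC", None),
}


def linked_lookup(disk, head, n):
    """Find the n-th block of a file by following n pointers from the head block."""
    block = head
    for _ in range(n):
        block = disk[block][1]
        if block is None:
            raise IndexError("file has fewer blocks than requested")
    return block


# Direct indexing scheme: the index block is an array of block numbers.
index_block = [7, 2, 9]


def indexed_lookup(index, n):
    """Find the n-th block of a file with a single array access."""
    return index[n]


def max_file_size(block_size=4096, pointer_size=4):
    """Largest file a single index block can describe, in bytes."""
    return (block_size // pointer_size) * block_size
```

Both `linked_lookup(linked_disk, 7, 2)` and `indexed_lookup(index_block, 2)` return block `9`, but the linked version walks through blocks 7 and 2 first. Under the assumed sizes, `max_file_size()` is `4194304` bytes (4 MiB), which is the limit that motivates a multi-level table.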

Combined scheme: uses both direct indexing and the multi-level scheme

Each inode has 15 block pointers
Pointers 0 - 11: direct pointers
- each direct pointer points to a data block on disk
Pointer 12: single indirect
- points to a block containing an array of direct pointers (direct indexing)
Pointer 13: double indirect
- points to a block containing an array of single indirect pointers
Pointer 14: triple indirect
- points to a block containing an array of double indirect pointers
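
To see how a file's block index is resolved under this layout, here is a minimal sketch. The 4 KB block size and 4-byte pointer size are assumptions (real file systems vary), and `classify` is a hypothetical helper that only reports which pointer level is used and the index at each level.

```python
BLOCK_SIZE = 4096    # assumed logical block size in bytes
POINTER_SIZE = 4     # assumed size of one block pointer in bytes
PTRS_PER_BLOCK = BLOCK_SIZE // POINTER_SIZE  # 1024 pointers fit in one block
NUM_DIRECT = 12      # pointers 0 - 11 in the inode

# Total number of data blocks reachable from one inode.
MAX_BLOCKS = NUM_DIRECT + PTRS_PER_BLOCK + PTRS_PER_BLOCK**2 + PTRS_PER_BLOCK**3


def classify(logical_block):
    """Return (level, indices) describing how to reach the given file block."""
    n = logical_block
    if n < 0:
        raise ValueError("block index must be non-negative")
    if n < NUM_DIRECT:
        return ("direct", (n,))
    n -= NUM_DIRECT
    if n < PTRS_PER_BLOCK:
        return ("single indirect", (n,))
    n -= PTRS_PER_BLOCK
    if n < PTRS_PER_BLOCK**2:
        return ("double indirect", divmod(n, PTRS_PER_BLOCK))
    n -= PTRS_PER_BLOCK**2
    if n < PTRS_PER_BLOCK**3:
        outer, rest = divmod(n, PTRS_PER_BLOCK**2)
        middle, inner = divmod(rest, PTRS_PER_BLOCK)
        return ("triple indirect", (outer, middle, inner))
    raise ValueError("block index exceeds the maximum file size")
```

For instance, `classify(11)` returns `("direct", (11,))`, while `classify(12)` returns `("single indirect", (0,))`, since block 12 is the first entry of the array behind pointer 12. Small files only ever use the direct pointers, so they pay no extra lookups; larger files pay one extra block read per level of indirection.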