Incomputeroperating systems,union mounting is a way of combining multiple directories into one that appears to contain their combined contents.[1] Union mounting is supported inLinux,BSD and several of its successors, andPlan 9, with similar but subtly different behavior.
As an example application of union mounting, consider the need to update the information contained on aCD-ROM orDVD. While a CD-ROM is not writable, one can overlay the CD'smount point with a writable directory in a union mount. Then, updating files in the union directory will cause them to end up in the writable directory, giving the illusion that the CD-ROM's contents have been updated.[1][2]
In the Plan 9 operating system fromBell Labs (mid-1980s onward), union mounting is a central concept, replacing several olderUnix conventions with union directories; for example, several directories containingexecutables, unioned together at a single/bin directory, replace thePATH variable for command lookup in the shell.[3]
Plan 9 union semantics are greatly simplified compared to the implementations forPOSIX-style operating systems: the union of two directories is simply the concatenation of their contents, so a directory listing of the union may display duplicate names. Also, no effort is made to recursively merge subdirectories, leading to an extremely simple implementation.[4] Directories are unioned in a controllable order;u/name, whereu is a union directory, denotes the file calledname in the first constituent directory that contains such a file.[4]
Unix/POSIX implementations of unions have requirements different from the Plan 9 implementation due to constraints in the traditional Unix file system behavior, which greatly complicates their implementation and often leads to compromises.[5] Problems that union mounting on Unix-like operating systems encounters include:
Early attempts to add unioning to Unix filesystems included the 3-d filesystem (Bell Labs) and the Translucent File Service inSunOS (Sun Microsystems, 1988[2]). An implementation of union mounting was added to the BSD version of Unix in version4.4 (1994), taking inspiration from these earlier attempts, Plan 9 and the stackable file systems inSpring (Sun, 1994).[1] 4.4BSD implements the stack-of-directories approach outlined above. As in Plan 9, operations traverse this stack top-down to resolve names, but unlike Plan 9, BSD union mounts are recursive, so that the contents of subdirectories appear merged in the union directory. Also unlike the Plan 9 version, all layers except the top are read-only: modifying files in the union causes their contents to first be copied into the top layer of the stack, where the modifications are then applied. Deletion of files is implemented by writing aspecial type of file called awhiteout to the top directory, which has the effect of marking the file name as non-existent and hiding files with the same name in the lower layers of the stack.[1] Whiteouts require support from the underlying file system.[4]
Union mounting was implemented for Linux 0.99 in 1993; this initial implementation was called the Inheriting File System, but was abandoned by its developer because of its complexity.[2] The next major implementation wasUnionFS, which grew out of theFiST project atStony Brook University.[6][5] An attempt to replace UnionFS,aufs, was released in 2006, followed in 2009 byOverlayFS.[2] In 2014OverlayFS union mount implementation was added to the standardLinux kernel source code.[7]
Similarly,GlusterFS offers the ability to mount different filesystems distributed across a network, rather than being located on the same machine.[8]
MergerFS, originally released in 2014, is an actively developed open-source FUSE plugin, allowing pooling of arbitrary directories.[9]
GlusterFS is an open source, distributed file system capable of scaling to several petabytes (actually, 72 brontobytes!) and handling thousands of clients. GlusterFS clusters together storage building blocks over Infiniband RDMA or TCP/IP interconnect, aggregating disk and memory resources and managing data in a single global namespace. GlusterFS is based on a stackable user space design and can deliver exceptional performance for diverse workloads.