Movatterモバイル変換

[0]ホーム

Jump to content

GPFS

Edit links

From Wikipedia, the free encyclopedia

High-performance clustered file system

This article has multiple issues. Please helpimprove it or discuss these issues on thetalk page.(Learn how and when to remove these messages)

This articlecontainspromotional content. Please helpimprove it by removingpromotional language and inappropriateexternal links, and by adding encyclopedic text written from aneutral point of view.(May 2024) (Learn how and when to remove this message)

This articleneeds additional citations forverification. Please helpimprove this article byadding citations to reliable sources. Unsourced material may be challenged and removed.
Find sources: "GPFS" – news ·newspapers ·books ·scholar ·JSTOR(May 2024) (Learn how and when to remove this message)

(Learn how and when to remove this message)

GPFS
Limits
Developer(s)	IBM
Full name	IBM Spectrum Scale
Introduced	1998; 27 years ago (1998) withAIX
Max volume size	8 YB
Max file size	8 EB
Maxno. of files	2⁶⁴ per file system
Features
File system permissions	POSIX
Transparent encryption	yes
Other
Supported operating systems	AIX,Linux,Windows Server

GPFS (General Parallel File System, brand nameIBM Storage Scale and previouslyIBM Spectrum Scale)^[1] is a high-performanceclustered file system software developed byIBM. It can be deployed inshared-disk orshared-nothing distributed parallel modes, or a combination of these. It is used by many of the world's largest commercial companies, as well as some of thesupercomputers on theTop 500 List.^[2]For example, it is the filesystem of theSummit^[3]atOak Ridge National Laboratory which was the #1 fastest supercomputer in the world in the November 2019 Top 500 List.^[4] Summit is a 200Petaflops system composed of more than 9,000POWER9 processors and 27,000 NVIDIAVolta GPUs. The storage filesystem is called Alpine.^[5]

Like typical cluster filesystems, GPFS provides concurrent high-speed file access to applications executing on multiple nodes of clusters. It can be used withAIX clusters,Linux clusters,^[6] on MicrosoftWindows Server, or a heterogeneous cluster of AIX, Linux and Windows nodes running onx86,Power orIBM Z processor architectures.

History

[edit]

GPFS began as theTiger Shark file system, a research project at IBM'sAlmaden Research Center as early as 1993. Tiger Shark was initially designed to support high throughput multimedia applications. This design turned out to be well suited to scientific computing.^[7]

Another ancestor is IBM'sVesta filesystem, developed as a research project at IBM'sThomas J. Watson Research Center between 1992 and 1995.^[8] Vesta introduced the concept of file partitioning to accommodate the needs of parallel applications that run on high-performancemulticomputers withparallel I/O subsystems. With partitioning, a file is not a sequence of bytes, but rather multiple disjoint sequences that may be accessed in parallel. The partitioning is such that it abstracts away the number and type of I/O nodes hosting the filesystem, and it allows a variety of logically partitioned views of files, regardless of the physical distribution of data within the I/O nodes. The disjoint sequences are arranged to correspond to individual processes of a parallel application, allowing for improved scalability.^[9]^[10]

Vesta was commercialized as the PIOFS filesystem around 1994,^[11] and was succeeded by GPFS around 1998.^[12]^[13] The main difference between the older and newer filesystems was that GPFS replaced the specialized interface offered by Vesta/PIOFS with the standardUnix API: all the features to support high performance parallel I/O were hidden from users and implemented under the hood.^[7]^[13] GPFS also shared many components with the related products IBM Multi-Media Server and IBM Video Charger, which is why many GPFS utilities start with the prefixmm—multi-media.^[14]^: xi

In 2010, IBM previewed a version of GPFS that included a capability known as GPFS-SNC, where SNC stands for Shared Nothing Cluster. This was officially released with GPFS 3.5 in December 2012, and is now known as FPO^[15] (File Placement Optimizer).

Architecture

[edit]

This sectionneeds additional citations forverification. Please helpimprove this article byadding citations to reliable sources in this section. Unsourced material may be challenged and removed.
Find sources: "GPFS" – news ·newspapers ·books ·scholar ·JSTOR(January 2020) (Learn how and when to remove this message)

It is aclustered file system. It breaks a file into blocks of a configured size, less than 1 megabyte each, which are distributed across multiple cluster nodes.

The system stores data on standard block storage volumes, but includes an internal RAID layer that can virtualize those volumes for redundancy and parallel access much like a RAID block storage system. It also has the ability to replicate across volumes at the higher file level.

Features of the architecture include

Distributed metadata, including the directory tree. There is no single "directory controller" or "index server" in charge of the filesystem.
Efficient indexing of directory entries for very large directories.
Distributed locking. This allows for fullPOSIX filesystem semantics, including locking for exclusive file access.
Partition Aware. A failure of the network may partition the filesystem into two or more groups of nodes that can only see the nodes in their group. This can be detected through a heartbeat protocol, and when a partition occurs, the filesystem remains live for the largest partition formed. This offers a graceful degradation of the filesystem — some machines will remain working.
Filesystem maintenance can be performed online. Most of the filesystem maintenance chores (adding new disks, rebalancing data across disks) can be performed while the filesystem is live. This maximizes the filesystem availability, and thus the availability of the supercomputer cluster itself.

Other features include high availability, ability to be used in a heterogeneous cluster, disaster recovery, security,DMAPI,HSM andILM.

Compared to Hadoop Distributed File System (HDFS)

[edit]

Hadoop's HDFS filesystem, is designed to store similar or greater quantities of data on commodity hardware — that is, datacenters withoutRAID disks and astorage area network (SAN).

HDFS also breaks files up into blocks, and stores them on different filesystem nodes.
GPFS has full Posix filesystem semantics.
GPFS distributes its directory indices and other metadata across the filesystem. Hadoop, in contrast, keeps this on the Primary and Secondary Namenodes, large servers which must store all index information in-RAM.
GPFS breaks files up into small blocks. Hadoop HDFS likes blocks of64 MB or more, as this reduces the storage requirements of the Namenode. Small blocks or many small files fill up a filesystem's indices fast, so limit the filesystem's size.

Information lifecycle management

[edit]

Storage pools allow for the grouping of disks within a file system. An administrator can create tiers of storage by grouping disks based on performance, locality or reliability characteristics. For example, one pool could be high-performanceFibre Channel disks and another more economical SATA storage.

A fileset is a sub-tree of the file system namespace and provides a way to partition the namespace into smaller, more manageable units. Filesets provide an administrative boundary that can be used to set quotas and be specified in a policy to control initial data placement or data migration. Data in a single fileset can reside in one or more storage pools. Where the file data resides and how it is migrated is based on a set of rules in a user defined policy.

There are two types of user defined policies: file placement and file management. File placement policies direct file data as files are created to the appropriate storage pool. File placement rules are selected by attributes such as file name, the user name or the fileset. File management policies allow the file's data to be moved or replicated or files to be deleted. File management policies can be used to move data from one pool to another without changing the file's location in the directory structure. File management policies are determined by file attributes such as last access time, path name or size of the file.

The policy processing engine is scalable and can be run on many nodes at once. This allows management policies to be applied to a single file system with billions of files and complete in a few hours.^{[citation needed]}

References

[edit]

^"GPFS (General Parallel File System)". IBM. Retrieved2020-04-07.
^Schmuck, Frank; Roger Haskin (January 2002)."GPFS: A Shared-Disk File System for Large Computing Clusters"(PDF).Proceedings of the FAST'02 Conference on File and Storage Technologies. Monterey, California, US: USENIX. pp. 231–244.ISBN 1-880446-03-0. Retrieved2008-01-18.
^"Summit compute systems". Oak Ridge National Laboratory. Retrieved2020-04-07.
^"November 2019 top500 list". top500.org. Archived fromthe original on 2020-01-02. Retrieved2020-04-07.
^"Summit FAQ". Oak Ridge National Laboratory. Retrieved2020-04-07.
^Wang, Teng; Vasko, Kevin; Liu, Zhuo; Chen, Hui; Yu, Weikuan (Nov 2014). "BPAR: A Bundle-Based Parallel Aggregation Framework for Decoupled I/O Execution".2014 International Workshop on Data Intensive Scalable Computing Systems. IEEE. pp. 25–32.doi:10.1109/DISCS.2014.6.ISBN 978-1-4673-6750-9.S2CID 2402391.
^^a ^bMay, John M. (2000).Parallel I/O for High Performance Computing. Morgan Kaufmann. p. 92.ISBN 978-1-55860-664-7. Retrieved2008-06-18.
^Corbett, Peter F.; Feitelson, Dror G.; Prost, J.-P.; Baylor, S. J. (1993). "Parallel access to files in the Vesta file system".Proceedings of the 1993 ACM/IEEE conference on Supercomputing - Supercomputing '93. Portland, Oregon, United States: ACM/IEEE. pp. 472–481.doi:10.1145/169627.169786.ISBN 978-0818643408.S2CID 46409100.
^Corbett, Peter F.; Feitelson, Dror G. (August 1996)."The Vesta parallel file system"(PDF).ACM Transactions on Computer Systems.14 (3):225–264.doi:10.1145/233557.233558.S2CID 11975458. Archived from the original on 2012-02-12. Retrieved2008-06-18.{{cite journal}}: CS1 maint: bot: original URL status unknown (link)
^Teng Wang; Kevin Vasko; Zhuo Liu; Hui Chen; Weikuan Yu (2016). "Enhance parallel input/output with cross-bundle aggregation".The International Journal of High Performance Computing Applications.30 (2):241–256.doi:10.1177/1094342015618017.S2CID 12067366.
^Corbett, P. F.; D. G. Feitelson; J.-P. Prost; G. S. Almasi; S. J. Baylor; A. S. Bolmarcich; Y. Hsu; J. Satran; M. Snir; R. Colao; B. D. Herr; J. Kavaky; T. R. Morgan; A. Zlotek (1995)."Parallel file systems for the IBM SP computers"(PDF).IBM Systems Journal.34 (2):222–248.CiteSeerX 10.1.1.381.2988.doi:10.1147/sj.342.0222. Archived from the original on 2004-04-19. Retrieved2008-06-18.{{cite journal}}: CS1 maint: bot: original URL status unknown (link)
^Barris, Marcelo; Terry Jones; Scott Kinnane; Mathis Landzettel Safran Al-Safran; Jerry Stevens; Christopher Stone; Chris Thomas; Ulf Troppens (September 1999).Sizing and Tuning GPFS(PDF). IBM Redbooks, International Technical Support Organization. see page 1 ("GPFS is the successor to the PIOFS file system"). Archived from the original on 2010-12-14. Retrieved2022-12-06.{{cite book}}: CS1 maint: bot: original URL status unknown (link)
^^a ^bSnir, Marc (June 2001)."Scalable parallel systems: Contributions 1990-2000"(PDF). HPC seminar, Computer Architecture Department, Universitat Politècnica de Catalunya. Retrieved2008-06-18.
^General Parallel File System Administration and Programming Reference Version 3.1(PDF). IBM. April 2006.
^"IBM GPFS FPO (DCS03038-USEN-00)"(PDF). IBM Corporation. 2013. Retrieved2012-08-12.^{[permanent dead link‍]}

File systems

Disk and
non-rotating

ADFS
AdvFS
Amiga FFS
Amiga OFS
APFS
AthFS
bcachefs
BFS
- Be File System
- Boot File System
- Byte File System (z/VM)
Btrfs
CVFS
CXFS
DFS
EFS
- Encrypting File System
- Extent File System
Episode
ext
- ext2
- ext3
- ext3cow
- ext4
FAT
- exFAT
Files-11
Fossil
GPFS
HAMMER
- HAMMER2
HFS (Classic Mac OS)
HFS (MVS)
HFS+
HPFS
HTFS
JFS
LFS
MFS
- Macintosh File System
- TiVo Media File System
MINIX
NetWare File System
Next3
NILFS
- NILFS2
NSS
NTFS
OneFS
OpenZFS
PFS
QFS
QNX4FS
ReFS
ReiserFS
- Reiser4
Reliance
Reliance Nitro
RFS
SFS
- Shared File System (VM)
- Smart File System
SNFS
Soup (Apple)
Tux3
UBIFS
UFS/UFS2
- soft updates
- WAPBL
VxFS
WAFL
Xiafs
XFS
Xsan
zFS (z/OS)
ZFS (Sun)

Optical disc

Flash memory andSSD

host-sidewear leveling	CHFS JFFS JFFS2 LogFS NILFS NILFS2 YAFFS UBIFS

Distributed parallel

NAS

Specialized

Aufs AXFS Boot File System Compact Disc File System cramfs Davfs2 EROFS FTPFS FUSE Lnfs LTFS NOVA MVFS SquashFS UMSDOS OverlayFS UnionFS
Pseudo	configfs devfs debugfs kernfs procfs specfs sysfs tmpfs WinFS
Encrypted	eCryptfs EncFS EFS Rubberhose SSHFS ZFS

Types

Features

Case preservation Copy-on-write Data deduplication Data scrubbing Execute in place Extent File attribute Extended file attributes File change log Fork Inode Links Hard Symbolic
Access control	Access-control list Filesystem-level encryption Permissions Modes Sticky bit