842 (compression algorithm)

From Wikipedia, the free encyclopedia

Lossless compression algorithm

842, 8-4-2, or EFT is a lossless data compression algorithm. It is a variation on Lempel–Ziv compression with a limited dictionary length. With typical data, 842 gives 80 to 90 percent of the compression of LZ77 with much faster throughput and less memory use.[1] Hardware implementations also use little energy and little chip area.

842 compression can be used for virtual memory compression, for databases (especially column-oriented stores), and for streaming input/output, for example when making backups or writing to log files.

Algorithm


The algorithm operates on blocks of 8 bytes, each of which is divided into sub-phrases of 8, 4 and 2 bytes. Each phrase is hashed, and the hash is used to look up a table of offsets into a sliding-window buffer of previously encoded data. A phrase that matches earlier data can be replaced by its offset, so the result for each block can be a mixture of references to matched data and new literal data.[2][1][3]
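The phrase-matching idea can be illustrated with a minimal Python sketch. This is not the real 842 bit format: the token tuples, the function names `encode` and `decode`, the use of Python dictionaries in place of a hardware hash table, and the unbounded (rather than sliding) history are all assumptions made for illustration. Each 8-byte block is tried as one 8-byte phrase, then as 4-byte halves, then as 2-byte quarters, and anything unmatched is emitted as a literal.

```python
BLOCK = 8  # bytes per block, as described in the text


def _encode_span(block, start, size, tables):
    """Encode block[start:start+size] as a reference if seen before,
    otherwise split it in half, down to 2-byte literals."""
    phrase = block[start:start + size]
    offset = tables[size].get(phrase)
    if offset is not None:
        return [("ref", size, offset)]      # matched earlier data
    if size == 2:
        return [("lit", phrase)]            # smallest phrase: emit literally
    half = size // 2
    return (_encode_span(block, start, half, tables)
            + _encode_span(block, start + half, half, tables))


def encode(data: bytes):
    """Return one token list per 8-byte block (illustrative format only)."""
    if len(data) % BLOCK:
        raise ValueError("this sketch only handles whole 8-byte blocks")
    # One table per phrase size: phrase bytes -> offset of latest occurrence.
    # A real implementation would bound this history to a sliding window.
    tables = {8: {}, 4: {}, 2: {}}
    blocks = []
    for base in range(0, len(data), BLOCK):
        block = data[base:base + BLOCK]
        blocks.append(_encode_span(block, 0, BLOCK, tables))
        # Index this block's aligned phrases only after encoding it,
        # so references always point at already-decoded data.
        for size in (8, 4, 2):
            for i in range(0, BLOCK, size):
                tables[size][block[i:i + size]] = base + i
    return blocks


def decode(blocks) -> bytes:
    """Rebuild the original bytes from the token lists."""
    out = bytearray()
    for tokens in blocks:
        for token in tokens:
            if token[0] == "lit":
                out += token[1]
            else:
                _, size, offset = token
                out += out[offset:offset + size]
    return bytes(out)
```

For example, encoding `b"ABCDEFGH" * 2 + b"ABCDxxGH"` yields a block of four literals, then a single 8-byte reference, then a mixed block containing a 4-byte reference, a 2-byte literal and a 2-byte reference.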

Implementations


IBM added hardware accelerators and instructions for 842 compression to their Power processors from POWER7+ onward.[4] In addition, POWER9 and Power10 added hardware acceleration for the RFC 1951 Deflate algorithm, which is used by zlib and gzip.[5]

A device driver for hardware-assisted 842 compression on a POWER processor was added to the Linux kernel in 2011.[6] More recently, Linux gained the ability to fall back to a software implementation, which is much slower.[7] zram, a Linux kernel module for compressed RAM drives, can be configured to use 842.
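As a rough illustration of the zram case, the kernel exposes the compression choice through the sysfs file `/sys/block/<device>/comp_algorithm`, which lists the available algorithms with the active one in square brackets. The sketch below parses that format to check whether 842 is offered. The device name `zram0`, the helper names, and the sample string in the usage note are assumptions; which algorithms appear depends on how the kernel was built.

```python
from pathlib import Path


def parse_comp_algorithm(text: str):
    """Split the contents of comp_algorithm into (available, selected).

    The selected algorithm is the one shown in square brackets.
    """
    available = []
    selected = None
    for word in text.split():
        if word.startswith("[") and word.endswith("]"):
            word = word[1:-1]
            selected = word
        available.append(word)
    return available, selected


def read_zram_algorithms(device: str = "zram0"):
    """Read and parse the algorithm list of a zram device (assumed to exist)."""
    path = Path("/sys/block") / device / "comp_algorithm"
    return parse_comp_algorithm(path.read_text())
```

Given a line such as `lzo [lzo-rle] lz4 842`, the parser reports `lzo-rle` as selected and `842` as available. Selecting 842 is then done by writing the string `842` to the same file, with root privileges, before the device's disk size is set.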

Researchers have implemented 842 using graphics processing units and found about 30 times faster decompression using dedicated GPUs.[8] An open source library provides 842 for CUDA and OpenCL.[9] An FPGA implementation of 842 demonstrated 13 times better throughput than a software implementation.[10]

References


[1] Plauth, Max; Polze, Andreas. "Towards Improving Data Transfer Efficiency for Accelerators using Hardware Compression".
[2] Franaszek, Peter A; Lastras-Montaño, Luis A; Peng, Song; Robinson, John T (14 September 2016). "Data Compression with Restricted Parsings". IBM Research. IBM. Retrieved 2021-07-13.
[3] Blaner, B.; Abali, B.; Bass, B. M.; Chari, S.; Kalla, R.; Kunkel, S.; Lauricella, K.; Leavens, R.; Reilly, J. J.; Sandon, P. A. (November 2013). "IBM POWER7+ processor on-chip accelerators for cryptography and active memory expansion". IBM Journal of Research and Development. 57 (6): 3:1–3:16. doi:10.1147/JRD.2013.2280090. Retrieved 2021-07-13.
[4] "POWER NX842 Compression for Db2" (PDF). IBM. Retrieved 2021-07-13.
[5] Veale, Brian F (14 March 2022). "GZip Acceleration with AIX on Power Systems". IBM Power Community. IBM. Retrieved 2022-10-22.
[6] "Torvalds/Linux". GitHub. 12 February 2022.
[7] "Torvalds/Linux". GitHub. 12 February 2022.
[8] Plauth, Max; Polze, Andreas (2019). "GPU-Based Decompression for the 842 Algorithm". 2019 Seventh International Symposium on Computing and Networking Workshops (CANDARW). pp. 97–102. doi:10.1109/CANDARW.2019.00025. ISBN 978-1-7281-5268-4. S2CID 210694935.
[9] "Lib842". GitHub. 3 November 2020.
[10] Sukhwani, Bharat; Abali, Bulent; Brezzo, Bernard; Asaad, Sameh (2011). "High-Throughput, Lossless Data Compression on FPGAs". 2011 IEEE 19th Annual International Symposium on Field-Programmable Custom Computing Machines. IEEE Xplore. pp. 113–116. doi:10.1109/FCCM.2011.56. ISBN 978-1-61284-277-6. S2CID 7828316.


Retrieved from "https://en.wikipedia.org/w/index.php?title=842_(compression_algorithm)&oldid=1338700537"
