Movatterモバイル変換

[0]ホーム

Jump to content

Transform coding

Edit links

From Wikipedia, the free encyclopedia

Data compression

This articleneeds additional citations forverification. Please helpimprove this article byadding citations to reliable sources. Unsourced material may be challenged and removed.
Find sources: "Transform coding" – news ·newspapers ·books ·scholar ·JSTOR(September 2018) (Learn how and when to remove this message)

Transform coding is a type ofdata compression for "natural" data likeaudio signals or photographicimages. The transformation is typically lossless (perfectly reversible) on its own but is used to enable better (more targeted)quantization, which then results in a lower quality copy of the original input (lossy compression).

In transform coding, knowledge of the application is used to choose information to discard, thereby lowering itsbandwidth. The remaining information can then be compressed via a variety of methods. When the output is decoded, the result may not be identical to the original input, but is expected to be close enough for the purpose of the application.

Colour television

[edit]

Further information:YIQ

NTSC

[edit]

One of the most successful transform encoding system is typically not referred to as such—the example beingNTSC colortelevision. After an extensive series of studies in the 1950s,Alda Bedford showed that the human eye has high resolution only for black and white, somewhat less for "mid-range" colors like yellows and greens, and much less for colors on the end of the spectrum, reds and blues.

Using this knowledge allowedRCA to develop a system in which they discarded most of the blue signal after it comes from the camera, keeping most of the green and only some of the red; this ischroma subsampling in theYIQ color space.

The result is a signal with considerably less content, one that would fit within existing 6 MHz black-and-white signals as a phase modulated differential signal. The average TV displays the equivalent of 350 pixels on a line, but the TV signal contains enough information for only about 50 pixels of blue and perhaps 150 of red. This is not apparent to the viewer in most cases, as the eye makes little use of the "missing" information anyway.

PAL and SECAM

[edit]

The PAL and SECAM systems use nearly identical or very similar methods to transmit colour. In any case both systems are subsampled.

Digital

[edit]

The term is much more commonly used indigital media anddigital signal processing. The most widely used transform coding technique in this regard is thediscrete cosine transform (DCT),^[1]^[2] proposed byNasir Ahmed in 1972,^[3]^[4] and presented by Ahmed with T. Natarajan andK. R. Rao in 1974.^[5] This DCT, in the context of the family of discrete cosine transforms, is the DCT-II. It is the basis for the commonJPEG image compression standard,^[6] which examines small blocks of the image and transforms them to thefrequency domain for more efficient quantization (lossy) anddata compression. Invideo coding, theH.26x andMPEG standards modify this DCT image compression technique across frames in a motion image usingmotion compensation, further reducing the size compared to a series of JPEGs.

Inaudio coding, MPEG audio compression analyzes the transformed data according to apsychoacoustic model that describes the human ear's sensitivity to parts of the signal, similar to the TV model.MP3 uses a hybrid coding algorithm, combining themodified discrete cosine transform (MDCT) andfast Fourier transform (FFT).^[7] It was succeeded byAdvanced Audio Coding (AAC), which uses a pure MDCT algorithm to significantly improve compression efficiency.^[8]

The basic process ofdigitizing an analog signal is a kind of transform coding that usessampling in one or more domains as its transform.

References

[edit]

^Muchahary, D.; Mondal, A. J.; Parmar, R. S.; Borah, A. D.; Majumder, A. (2015). "A Simplified Design Approach for Efficient Computation of DCT".2015 Fifth International Conference on Communication Systems and Network Technologies. pp. 483–487.doi:10.1109/CSNT.2015.134.ISBN 978-1-4799-1797-6.S2CID 16411333.
^Chen, Wai Kai (2004).The Electrical Engineering Handbook.Elsevier. p. 906.ISBN 9780080477480.
^Ahmed, Nasir (January 1991)."How I Came Up With the Discrete Cosine Transform".Digital Signal Processing.1 (1):4–5.doi:10.1016/1051-2004(91)90086-Z.
^Stanković, Radomir S.; Astola, Jaakko T. (2012)."Reminiscences of the Early Work in DCT: Interview with K.R. Rao"(PDF).Reprints from the Early Days of Information Sciences.60. Retrieved13 October 2019.
^Ahmed, Nasir; Natarajan, T.; Rao, K. R. (January 1974), "Discrete Cosine Transform",IEEE Transactions on Computers,C-23 (1):90–93,doi:10.1109/T-C.1974.223784,S2CID 149806273
^"T.81 – Digital compression and coding of continuous-tone still images – Requirements and guidelines"(PDF).CCITT. September 1992. Retrieved12 July 2019.
^Guckert, John (Spring 2012)."The Use of FFT and MDCT in MP3 Audio Compression"(PDF).University of Utah. Retrieved14 July 2019.
^Brandenburg, Karlheinz (1999)."MP3 and AAC Explained"(PDF).Archived(PDF) from the original on 2017-02-13.

Data compression methods

Lossless
type

Entropy	Adaptive coding Arithmetic Asymmetric numeral systems Golomb Huffman Adaptive Canonical Modified Range Shannon Shannon–Fano Shannon–Fano–Elias Tunstall Unary Universal Exp-Golomb Fibonacci Gamma Levenshtein
Dictionary	Byte-pair encoding Lempel–Ziv 842 LZ4 LZJB LZO LZRW LZSS LZW LZWL Snappy
Other	BWT CTW CM Delta Incremental DMC DPCM Grammar Re-Pair Sequitur LDCT MTF PAQ PPM RLE
Hybrid	LZ77 + Huffman Deflate LZX LZS LZ77 + ANS LZFSE LZ77 + Huffman + ANS Zstandard LZ77 + Huffman + context Brotli LZSS + Huffman LHA/LZH LZ77 + Range LZMA LZHAM RLE + BWT + MTF + Huffman bzip2

Lossy
type

Transform	Discrete cosine transform DCT MDCT DST FFT Wavelet Daubechies DWT SPIHT
Predictive	DPCM ADPCM LPC ACELP CELP LAR LSP WLPC Motion Compensation Estimation Vector Psychoacoustic

Audio

Concepts	Bit rate ABR CBR VBR Companding Convolution Dynamic range Latency Nyquist–Shannon theorem Sampling Silence compression Sound quality Speech coding Sub-band coding
Codec parts	A-law μ-law DPCM ADPCM DM FT FFT LPC ACELP CELP LAR LSP WLPC MDCT Psychoacoustic model

Image

Concepts	Chroma subsampling Coding tree unit Color space Compression artifact Image resolution Macroblock Pixel PSNR Quantization Standard test image Texture compression
Methods	Chain code DCT Deflate Fractal KLT LP RLE Wavelet Daubechies DWT EZW SPIHT

Video

Concepts	Bit rate ABR CBR VBR Display resolution Frame Frame rate Frame types Interlace Video characteristics Video quality
Codec parts	DCT DPCM Deblocking filter Lapped transform Motion Compensation Estimation Vector Wavelet Daubechies DWT