Movatterモバイル変換


[0]ホーム

URL:


Jump to content
WikipediaThe Free Encyclopedia
Search

Audio coding format

From Wikipedia, the free encyclopedia
Digitally coded format for audio signals
Comparison of coding efficiency between popular audio formats

Anaudio coding format[1] (or sometimesaudio compression format) is a encoded format ofdigital audio, such as indigital television,digital radio and in audio and video files. Examples of audio coding formats includeMP3,AAC,Vorbis,FLAC, andOpus. A specific software or hardware implementation capable ofaudio compression and decompression to/from a specific audio coding format is called anaudio codec; an example of an audio codec isLAME, which is one of several different codecs which implements encoding and decoding audio in theMP3 audio coding format in software.

Some audio coding formats are documented by a detailedtechnical specification document known as anaudio coding specification. Some such specifications are written and approved bystandardization organizations astechnical standards, and are thus known as anaudio coding standard. The term "standard" is also sometimes used forde facto standards as well as formal standards.

Audio content encoded in a particular audio coding format is normally encapsulated within acontainer format. As such, the user normally doesn't have a rawAAC file, but instead has a .m4aaudio file, which is aMPEG-4 Part 14 container containing AAC-encoded audio. The container also containsmetadata such as title and other tags, and perhaps an index for fast seeking.[2] A notable exception isMP3 files, which are raw audio coding without a container format. De facto standards for adding metadata tags such as title and artist to MP3s, such asID3, arehacks which work by appending the tags to the MP3, and then relying on the MP3 player to recognize the chunk as malformed audio coding and therefore skip it. In video files with audio, the encoded audio content is bundled with video (in avideo coding format) inside amultimedia container format.

An audio coding format does not dictate allalgorithms used by acodec implementing the format. An important part of how lossy audio compression works is by removing data in ways humans can't hear, according to apsychoacoustic model; the implementer of an encoder has some freedom of choice in which data to remove (according to their psychoacoustic model).

Lossless, lossy, and uncompressed audio coding formats

[edit]
Spectral analysis comparison between losslessFLAC (top) and lossyOpus (bottom) files for the same audio clip. The 20-24 kHz range is absent in the lossy audio file.

Alossless audio coding format reduces the total data needed to represent a sound but can be de-coded to its original, uncompressed form. Alossy audio coding format additionally reduces thebit resolution of the sound on top of compression, which results in far less data at the cost of irretrievably lost information.

Transmitted (streamed) audio is most often compressed using lossy audio codecs as the smaller size is far more convenient for distribution. The most widely used audio coding formats areMP3 andAdvanced Audio Coding (AAC), both of which are lossy formats based onmodified discrete cosine transform (MDCT) andperceptual coding algorithms.

Lossless audio coding formats such asFLAC andApple Lossless are sometimes available, though at the cost of larger files.

Uncompressed audio formats, such aspulse-code modulation (PCM, or .wav), are also sometimes used. PCM was the standard format forCompact Disc Digital Audio (CDDA).

History

[edit]
Solidyne 922: The world's first commercial audio bit compressionsound card for PC, 1990

In 1950,Bell Labs filed the patent ondifferential pulse-code modulation (DPCM).[3]Adaptive DPCM (ADPCM) was introduced by P. Cummiskey,Nikil S. Jayant andJames L. Flanagan atBell Labs in 1973.[4][5]

Perceptual coding was first used forspeech coding compression, withlinear predictive coding (LPC).[6] Initial concepts for LPC date back to the work ofFumitada Itakura (Nagoya University) and Shuzo Saito (Nippon Telegraph and Telephone) in 1966.[7] During the 1970s,Bishnu S. Atal andManfred R. Schroeder atBell Labs developed a form of LPC calledadaptive predictive coding (APC), a perceptual coding algorithm that exploited the masking properties of the human ear, followed in the early 1980s with thecode-excited linear prediction (CELP) algorithm which achieved a significant compression ratio for its time.[6] Perceptual coding is used by modern audio compression formats such asMP3[6] andAAC.

Discrete cosine transform (DCT), developed byNasir Ahmed, T. Natarajan andK. R. Rao in 1974,[8] provided the basis for themodified discrete cosine transform (MDCT) used by modern audio compression formats such as MP3[9] and AAC. MDCT was proposed by J. P. Princen, A. W. Johnson and A. B. Bradley in 1987,[10] following earlier work by Princen and Bradley in 1986.[11] The MDCT is used by modern audio compression formats such asDolby Digital,[12][13]MP3,[9] andAdvanced Audio Coding (AAC).[14]

List of lossy formats

[edit]

General

[edit]
Basic compression algorithmAudio coding standardAbbreviationIntroductionMarket share(2023)[15]Ref
ProductionStreaming
Modified discrete cosine transform (MDCT)Dolby Digital (AC-3)AC3199136–54%[n 1]37–61%[n 1][12][18]
Dolby Digital Plus (E-AC-3)EAC32004[19][20]
Adaptive Transform Acoustic CodingATRAC1992UnknownUnknown[12]
MPEG Layer IIIMP3199315%19%[9][21]
Advanced Audio Coding (MPEG-2 /MPEG-4)AAC199783%87%[14][12]
Windows Media AudioWMA1999UnknownUnknown[12]
OggVorbisOgg20006%4%[22][12]
Constrained Energy Lapped TransformCELT2011[23]
OpusOpus201212%9%[24]
Dolby AC-4AC42014UnknownUnknown[25]
LDACLDAC2015UnknownUnknown[26][27]
Adaptive differential pulse-code modulation (ADPCM)aptX / aptX-HDaptX1989UnknownUnknown[28]
Digital Theater SystemsDTS19908%6%[29][30]
Master Quality AuthenticatedMQA2014UnknownUnknown
Sub-band coding (SBC)MPEG-1 Audio Layer IIMP21993UnknownUnknown[31]
MusepackMPC1997
SBCSBC2003UnknownUnknown[32]

Speech

[edit]
Further information:Speech coding

List of lossless formats

[edit]

See also

[edit]

Notes

[edit]
  1. ^abThe report combines AC-3 & E-AC-3 and separatesDolby Atmos from its market share calculation. Dolby Atmos can be encoded either lossily with E-AC-3/AC-4[16] or losslessly withDolby TrueHD.Music andvideo streaming providers typically use Dolby Digital Plus augmented with Dolby Atmos, whereasdigital downloads andBlu-ray discs typically use Dolby TrueHD augmented with Dolby Atmos.[17]

References

[edit]
  1. ^The term "audio coding" can be seen in e.g. the nameAdvanced Audio Coding, and is analogous to the termvideo coding
  2. ^"Video – Where is synchronization information stored in container formats?".
  3. ^US patent 2605361, C. Chapin Cutler, "Differential Quantization of Communication Signals", issued 1952-07-29 
  4. ^Cummiskey, P.; Jayant, N. S.; Flanagan, J. L. (1973)."Adaptive Quantization in Differential PCM Coding of Speech".Bell System Technical Journal.52 (7):1105–1118.doi:10.1002/j.1538-7305.1973.tb02007.x.
  5. ^Cummiskey, P.; Jayant, Nikil S.; Flanagan, J. L. (1973). "Adaptive quantization in differential PCM coding of speech".The Bell System Technical Journal.52 (7):1105–1118.doi:10.1002/j.1538-7305.1973.tb02007.x.ISSN 0005-8580.
  6. ^abcSchroeder, Manfred R. (2014)."Bell Laboratories".Acoustics, Information, and Communication: Memorial Volume in Honor of Manfred R. Schroeder. Springer. p. 388.ISBN 9783319056609.
  7. ^Gray, Robert M. (2010)."A History of Realtime Digital Speech on Packet Networks: Part II of Linear Predictive Coding and the Internet Protocol"(PDF).Found. Trends Signal Process.3 (4):203–303.doi:10.1561/2000000036.ISSN 1932-8346.
  8. ^Nasir Ahmed; T. Natarajan; Kamisetty Ramamohan Rao (January 1974)."Discrete Cosine Transform"(PDF).IEEE Transactions on Computers.C-23 (1):90–93.doi:10.1109/T-C.1974.223784.S2CID 149806273. Archived fromthe original(PDF) on 2016-12-08. Retrieved2019-10-20.
  9. ^abcGuckert, John (Spring 2012)."The Use of FFT and MDCT in MP3 Audio Compression"(PDF).University of Utah. Retrieved14 July 2019.
  10. ^Princen, J.; Johnson, A.; Bradley, A. (1987). "Subband/Transform coding using filter bank designs based on time domain aliasing cancellation".ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing. Vol. 12. pp. 2161–2164.doi:10.1109/ICASSP.1987.1169405.S2CID 58446992.
  11. ^Princen, J.; Bradley, A. (1986). "Analysis/Synthesis filter bank design based on time domain aliasing cancellation".IEEE Transactions on Acoustics, Speech, and Signal Processing.34 (5):1153–1161.doi:10.1109/TASSP.1986.1164954.
  12. ^abcdefLuo, Fa-Long (2008).Mobile Multimedia Broadcasting Standards: Technology and Practice.Springer Science & Business Media. p. 590.ISBN 9780387782638.
  13. ^Britanak, V. (2011). "On Properties, Relations, and Simplified Implementation of Filter Banks in the Dolby Digital (Plus) AC-3 Audio Coding Standards".IEEE Transactions on Audio, Speech, and Language Processing.19 (5):1231–1241.Bibcode:2011ITASL..19.1231B.doi:10.1109/TASL.2010.2087755.S2CID 897622.
  14. ^abBrandenburg, Karlheinz (1999)."MP3 and AAC Explained"(PDF).Archived(PDF) from the original on 2017-02-13.
  15. ^"Video Developer Report 2019"(PDF).Bitmovin. 2019. Retrieved5 November 2019.
  16. ^"Does Dolby AC-4 support Dolby Atmos?".Dolby Professional Support. 2023-05-23. Retrieved2024-11-08.
  17. ^"Just wait until you hear lossless Dolby Atmos Music".Digital Trends. 2023-05-03. Retrieved2024-11-08.
  18. ^Britanak, V. (2011). "On Properties, Relations, and Simplified Implementation of Filter Banks in the Dolby Digital (Plus) AC-3 Audio Coding Standards".IEEE Transactions on Audio, Speech, and Language Processing.19 (5):1231–1241.Bibcode:2011ITASL..19.1231B.doi:10.1109/TASL.2010.2087755.S2CID 897622.
  19. ^Andersen, Robert Loring; Crockett, B.; Davidson, G.; Davis, Mark; Fielder, L.; Turner, Stephen C.; Vinton, M.; Williams, P. (1 October 2004)."Introduction to Dolby Digital Plus, an Enhancement to the Dolby Digital Coding System"(PDF).Journal of The Audio Engineering Society. Archived fromthe original(PDF) on 2016-11-19.
  20. ^Digital Audio Compression (AC-3, Enhanced AC-3) Standard(PDF), European Telecommunications Standards Institute, 20 September 2017, ETSI TS 102 366 V1.4.1 (2017-09, retrieved21 September 2023
  21. ^Stanković, Radomir S.; Astola, Jaakko T. (2012)."Reminiscences of the Early Work in DCT: Interview with K.R. Rao"(PDF).Reprints from the Early Days of Information Sciences.60. Retrieved13 October 2019.
  22. ^Xiph.Org Foundation (2009-06-02)."Vorbis I specification - 1.1.2 Classification". Xiph.Org Foundation. Retrieved2009-09-22.
  23. ^Terriberry, Timothy B.Presentation of the CELT codec. Archived fromthe original on 2011-08-07.Presentation(PDF). Retrieved2019-11-10.
  24. ^Valin, Jean-Marc; Maxwell, Gregory; Terriberry, Timothy B.; Vos, Koen (October 2013).High-Quality, Low-Delay Music Coding in the Opus Codec. 135th AES Convention.Audio Engineering Society.arXiv:1602.04845.
  25. ^"Dolby AC-4: Audio Delivery for Next-Generation Entertainment Services"(PDF).Dolby Laboratories. 2015-06-01. Archived fromthe original(PDF) on 2015-12-04. Retrieved2016-04-26.
  26. ^Darko, John H. (2017-03-29)."The inconvenient truth about Bluetooth audio".DAR__KO. Archived fromthe original on 2018-01-14. Retrieved2018-01-13.
  27. ^Ford, Jez (2015-08-24)."What is Sony LDAC, and how does it do it?".AVHub. Retrieved2018-01-13.
  28. ^Ford, Jez (2016-11-22)."aptX HD - lossless or lossy?".AVHub. Retrieved2018-01-13.
  29. ^"Digital Theater Systems Audio Formats".Library of Congress. 27 December 2011. Retrieved10 November 2019.
  30. ^Spanias, Andreas; Painter, Ted; Atti, Venkatraman (2006).Audio Signal Processing and Coding.John Wiley & Sons. p. 338.ISBN 9780470041963.
  31. ^"ISO/IEC 11172-3:1993 – Information technology — Coding of moving pictures and associated audio for digital storage media at up to about 1,5 Mbit/s — Part 3: Audio". ISO. 1993. Retrieved2010-07-14.
  32. ^Bluetooth SIG, Specification of the Bluetooth System, Profiles, Advanced Audio Distribution Profile version 1.3.https://www.bluetooth.org/docman/handlers/DownloadDoc.ashx?doc_id=260859&vId=290074
Video
compression
ISO,IEC,
MPEG
ITU-T,VCEG
SMPTE
TrueMotion and AOMedia
Chinese Standard
  • AVS1 P2/AVS+(GB/T 20090.2/16)
  • AVS2 P2(GB/T 33475.2,GY/T 299.1)
    • HDR Vivid(GY/T 358)
  • AVS3 P2(GY/T 368)
Others
Audio
compression
ISO,IEC,
MPEG
ITU-T
IETF
3GPP
ETSI
Bluetooth SIG
Chinese Standard
Others
Image
compression
IEC,ISO,IETF,
W3C,ITU-T,JPEG
Others
Containers
ISO,IEC
ITU-T
IETF
SMPTE
Others
Collaborations
Methods
Lists
SeeCompression methods for techniques andCompression software for codecs
Retrieved from "https://en.wikipedia.org/w/index.php?title=Audio_coding_format&oldid=1316804690"
Category:
Hidden categories:

[8]ページ先頭

©2009-2025 Movatter.jp