Movatterモバイル変換

CELT

From Wikipedia, the free encyclopedia

Audio codec and compression format

This article is about the audio codec. For the proposed telescope, seeCalifornia Extremely Large Telescope. For the online database of Irish texts, seeCorpus of Electronic Texts. For other uses, seeCelt (disambiguation).

CELT
Developed by	Xiph.Org Foundation
Type of format	Audio
Contained by	Ogg
Extended to	Opus
Standard	Documentation

libcelt
Developers	Xiph.org Foundation, Jean-Marc Valin

Preview release	0.11.1 / February 15, 2011; 15 years ago (2011-02-15)

Type	Audio codec,reference implementation
License	2-clause BSD
Website	opus-codec.org

Constrained Energy Lapped Transform (CELT) is an open,royalty-free lossy audio compression format and afree software codec with especially low algorithmic delay for use inlow-latency audio communication. The algorithms are openly documented and may be used free ofsoftware patent restrictions. Development of the format was maintained by theXiph.Org Foundation (as part of theOgg codec family) and later coordinated by theOpus working group of theInternet Engineering Task Force (IETF).

CELT was meant to bridge the gap betweenVorbis andSpeex for applications where both high quality audio and low delay are desired.^[1] It is suitable for both speech and music. It borrows ideas from theCELP algorithm, but avoids some of its limitations by operating in thefrequency domain exclusively.^[1]

The original stand-alone CELT has been merged intoOpus.Therefore, CELT as stand-alone format is now abandoned and obsolete. Development is going on only for its hybridised form as a layer of Opus, integrated withSILK.This article covers the historic, stand-alone format; for the integrated form and its evolution since the integration into Opus see the article on Opus.

Properties

[edit]

CELT's central feature is low algorithmic delay. It allows for latencies of typically 3 to 9 ms but is configurable to below 2 ms at the price of more bitrate to reach a similar audio quality.^[2] CELT supports mono and stereo audio and is applicable to both speech and music. It can use asampling rate from 32 kHz to 48 kHz and above and an adaptive bitrate from 24 kbit/s to 128 kbit/s per channel and above.^[2]

There are no knownintellectual property issues pertaining to the CELT algorithm, and its reference implementation is published under a permissive open-source license (the2-clause BSD).^[1]^[3]

LikeVorbis, CELT is a fullband (entire humanhearing range) general-purpose codec, i.e. not specialized for special types of audio signals and therefore different from its sibling projectSpeex. The format enables fortransparent results at high bitrates, as well as very decent quality at lower bitrates. All in all, the compression capabilities are said to be significantly superior to those ofMP3, and as another useful feature for realtime applications like telephony, CELT's audio quality at lower bitrates are even on par withHE-AACv1, thanks to the band folding.^[4]^[5] In comparative double-blind listening tests it proved to be noticeably superior to HE-AACv1 at ~64 kbit/s.^[6]

It has a comparably low computational complexity that resembles that of the low-delay variant ofAAC (AAC-LD) and stays significantly below the complexity of Vorbis.^[7]

It enables forconstant andvariable bitrate. If the signal disappears into the noise floor in speech pauses and similar cases, the transmission can be limited to signal the output ofcomfort noise to the decoder. Most settings of the naturally streaming-enabled format can be changed on the fly without interrupting transmission.

The format is robust to transmission errors. Loss of whole packets as well as bit errors can be masked with a steady degradation of audio quality (packet loss concealment, PLC).

Technology

[edit]

CELT is atransform codec based on themodified discrete cosine transform (MDCT) and concepts fromCELP (with a code book for excitation, but in the frequency domain).

The initialPCM-coded signal is handled in relatively small, overlapping blocks for the MDCT (window function) and transformed to frequency coefficients. Choosing an especially short block size on the one hand enables for a low latency, but also leads to poor frequency resolution that has to be compensated. For a further reduction of the algorithmic delay to the expense of a minor sacrifice in audio quality, the by nature 50% of overlap between the blocks is practically cut down to half by silencing the signal during one eight at both ends of a block, respectively.^[2]

The coefficients are grouped to resemble thecritical bands of the human auditory system. The entire amount of energy of each group is analysed and the valuesquantised fordata reduction and compressed through prediction by only transmitting the difference to the predicted values (delta encoding).

The (unquantised) band energy values are removed from the raw DCT coefficients (normalisation). The coefficients of the resulting residual signal (so-called “band shape”) are coded byPyramid Vector Quantisation (PVQ, a sphericalvector quantisation).^[8] This encoding leads to code words of fixed (predictable) length, which in turn enables for robustness against bit errors and leaves no need forentropy encoding.^[5] Finally, all output of the encoder are coded to one bitstream by arange encoder.^[9] In connection with the PVQ, CELT uses a technique known as band folding, which delivers a similar effect tospectral band replication (SBR) by reusing coefficients of lower bands for higher ones, but has much less impact on the algorithmic delay and computational complexity than the SBR. This works against“birdie” artifacts by preserving more richness in the appropriate frequency bands.

The decoder unpacks the individual components from the range coded bitstream, multiplies the band energy to the band shape coefficients and transforms them back (via iMDCT) to PCM data. The individual blocks are rejoined using weightedoverlap-add (WOLA). Many parameters are not explicitly coded, but instead reconstructed by using the same functions as the encoder.

For thechannel coupling CELT may useM/S stereo orintensity stereo. Blocks can be described independent from adjacent frames (Intra-frame); for example to enable a decoder to jump into a running stream. With transform codecs so-called pre-echo artifacts can get audible, because the quantisation error of sharp, energy-heavy sounds (transients) can spread over the entire DCT block and the transient doesn't mask them backward in time as well as forward. With CELT each block can be further divided to thwart such artifacts.

History

[edit]

First work on plans and drafts for a Vorbis successor was done in 2005 atXiph.org as part of the Ghost project (initially talked about as “Vorbis II”). This discussion together with Vorbis creatorChristopher Montgomery led to Jean-Marc Valin′s interest in a particularly low-latency codec. Valin has worked on CELT since 2007.^[5] In December 2007, the first draft version of libcelt was published as version 0.0.1, initially named “Code-Excited Lapped Transform”.^[10]^[11] CELT was established as anIETF technology in July 2009^[3]^[12]^[13]^[14] under the "ietfcodec" working group. In May 2009, a draft ofRTP payload format for the CELT Codec was published.^[15]

In version 0.9, the pitch prediction operating in the frequency domain used until then was replaced by a less complex solution with a pre- and postfilter pair in time domain,^[16] which was contributed by Raymond Chen ofBroadcom.^[5]

With CELT 0.11 from February 4, 2011 the format was tentatively frozen (“soft freeze”) – reserving the possibility of unexpectedly necessary last changes.

Shortly after the advent of the CELT/SILK hybrid codecOpus (formerly known as Harmony), the development of CELT as a separate project was halted, instead living on the basis of Opus,^[17] which aims to treat the lower part of the spectral range in the time domain withlinear prediction (SILK) and the higher part in the frequency domain with theMDCT. The draft for Opus has been registered at the IETF since September 2010.

Software

[edit]

Thesoftware librarylibcelt serves as thereference implementation for CELT, written inC and published asfree software under Xiph's own 3-clause BSD-ish license.

Despite the format not being finally frozen, it was being used in manyVoIP applications such asEkiga^[18] andFreeSWITCH,^[19] which switched to CELT upon entering soft-freeze in January 2009, as well asMumble,TeamSpeak and other^[20] software. In April 2011, support for CELT was included inFFmpeg.^[21]^[22]

CELT is also supported or used by:^[20]

References

[edit]

^^a ^b ^cXiph.OrgThe CELT ultra low-delay audio codec - home page Archived 2018-08-31 at theWayback Machine, Retrieved 2009-09-01
^^a ^b ^cPresentation of the codec Archived 2011-08-07 at theWayback Machine by Timothy B. Terriberry (65 minutes of video in ~100 MiB OggTheora+Vorbis, see alsopresentation slides Archived 2011-08-10 at theWayback Machine in PDF, ~2,3 MiB)
^^a ^bValin, Jean-Marc; Terriberry, Timothy B.; Maxwell, Greg; Montgomery, Christopher (8 July 2010)."Constrained-Energy Lapped Transform (CELT) Codec".
^Fiona Glaser (2010-11-18)."Important: upcoming CELT bitstream freeze!".ffmpeg-devel.mplayerhq.hu - FFmpeg development discussions and patches mailing list. mplayerhq.hu. Retrieved2012-06-11.
^^a ^b ^c ^dChristopher Montgomery (2010-12-23)."next generation audio: CELT update 20101223".Monty's demo pages. Xiph.Org. Archived fromthe original on 2013-08-23. Retrieved2012-06-11.
^Dirk Bösel (2011-04-18)."CELT beeindruckt beim 64 kb/s Multiformat Hörtest (2011)".MPeX.net (in German). MPeX.net GmbH. Retrieved2011-04-25.
^Valin, Jean-Marc; Terriberry, Timothy B.;Montgomery, Christopher; Maxwell, Gregory (17 April 2009),"A High-Quality Speech and Audio Codec With Less Than 10 ms Delay"(PDF),IEEE Transactions on Audio, Speech, and Language Processing, vol. 18, no. 1, IEEE Signal Processing Society, retrieved2011-02-16
^Fischer, Thomas R. (July 1986), "A pyramid vector quantizer",IEEE Transactions on Information Theory, vol. 32, no. 4, pp. 568–583,doi:10.1109/TIT.1986.1057198
^second version of the draft of the specification
^Jean-Marc Valin (2007-12-09)."Experimental release of Ghost/CELT 0.0.1".Hydrogenaudio Forums. Retrieved2012-06-11.
^Xiph.Org (2007-12-08)CELT releases – celt-0.0.1.tar.gz, Retrieved 2009-09-01
^Monika Ermert (2009-11-13)."IETF kümmert sich um lizenzfreien Audiocodec".heise online. Retrieved2011-02-12.
^Valin, Jean-Marc; Terriberry, Timothy; Maxwell, Gregory; Montgomery, Christopher (8 July 2010)."Constrained-Energy Lapped Transform (CELT) Codec".IETF Datatracker.
^IETF - AVT Working Group (2009-07-04)Constrained-Energy Lapped Transform (CELT) Codec, Retrieved 2009-09-01
^IETF - AVT Working Group (2009-05-08)RTP Payload Format for the CELT Codec, Retrieved 2009-09-01
^Jean-Marc Valin (2011-02-15)."CELT decoder complexity".CELT-dev. Xiph.Org. Archived fromthe original on 2012-04-02. Retrieved2012-06-11.
^Jean-Marc Valin, Koen Vos (October 2010)."Definition of the Opus Audio Codec".IETF Internet-Drafts. IETF Network Working Group. Retrieved2012-06-11.
^"Ekiga 3.1.0 available". Archived fromthe original on 2011-09-30. Retrieved2009-02-27.
^FreeSWITCH: New Release For The New Year
^^a ^b"Software that uses or supports CELT".CELT website. Xiph.Org. Retrieved2012-06-12.
^George, Nicolas (April 20, 2011)."[FFmpeg-devel] [PATCH] Support for Xiph CELT/Opus decoding using libcelt".
^"git.videolan.org Git - ffmpeg.git/commit".git.videolan.org.
^"Patch Notes - Dota2 Dev".
^"Team Fortress 2".www.teamfortress.com.

External links

[edit]

Official homepage

v t e Xiph.Org Foundation
Ogg Project codecs	Vorbis Daala Theora FLAC Opus CELT Speex OggPCM Ogg Writ XSPF Annodex
Media tools	cdparanoia Icecast Tremor
Related articles	Chris Montgomery CMML Ogg page Ogg Squish Ogg formats in HTML5 Vorbis comment List of open-source codecs

Multimedia compression andcontainer formats

Video
compression

ISO,IEC, MPEG	DV MJPEG Motion JPEG 2000 MPEG-1 MPEG-2 Part 2 MPEG-4 Part 2 / ASP Part 10 / AVC Part 33 / IVC MPEG-H Part 2 / HEVC MPEG-I Part 3 / VVC MPEG-5 Part 1 / EVC Part 2 / LCEVC
ITU-T,VCEG	H.120 H.261 H.262 H.263 H.264 / AVC H.265 / HEVC H.266 / VVC H.267 / Enhanced Compression Model
SMPTE	VC-1 VC-2 VC-3 VC-5 VC-6
TrueMotion and AOMedia	TrueMotion S VP3 VP6 VP7 VP8 VP9 AV1 AV2
Chinese Standard	AVS1 P2/AVS+(GB/T 20090.2/16) AVS2 P2(GB/T 33475.2,GY/T 299.1) HDR Vivid(GY/T 358) AVS3 P2(GY/T 368)
Others	Apple Video AVS Bink Cinepak Daala DVI FFV1 Huffyuv Indeo Lagarith Microsoft Video 1 MSU Lossless OMS Video Pixlet ProRes 422 4444 QuickTime Animation Graphics RealVideo RTVideo SheerVideo Smacker Sorenson Video/Spark Theora Thor Ut WMV XEB YULS

Audio
compression

ISO,IEC, MPEG	MPEG-1 Layer II Multichannel MPEG-1 Layer I MPEG-1 Layer III (MP3) AAC HE-AAC AAC-LD MPEG Surround MPEG-4 ALS MPEG-4 SLS MPEG-4 DST MPEG-4 HVXC MPEG-4 CELP MPEG-D USAC MPEG-H 3D Audio
ITU-T	G.711 A-law µ-law G.718 G.719 G.722 G.722.1 G.722.2 G.723 G.723.1 G.726 G.728 G.729 G.729.1
IETF	Opus iLBC Speex Vorbis FLAC
3GPP	AMR AMR-WB AMR-WB+ EVRC EVRC-B EVS GSM-HR GSM-FR GSM-EFR
ETSI	AC-3 AC-4 DTS
Bluetooth SIG	SBC LC3
Chinese Standard	AVS1 P10(GB/T 20090.10) AVS2 P3(GB/T 33475.3) Audio Vivid(GY/T 363) DRA(GB/T 22726) ExAC(SJ/T 11299.4)
Others	ACELP ALAC Asao ATRAC CELT Codec 2 iSAC Lyra MELP Monkey's Audio MT9 Musepack OptimFROG OSQ QCELP RCELP RealAudio SD2 SHN SILK Siren SMV SVOPC TTA True Audio TwinVQ VMR-WB VSELP WavPack WMA MQA aptX aptX HD aptX Low Latency aptX Adaptive LDAC LHDC LLAC TrueHD

Image
compression

IEC,ISO,IETF, W3C,ITU-T,JPEG	CCITT Group 4 GIF HEIC / HEIF HEVC JBIG JBIG2 JPEG JPEG 2000 JPEG-LS JPEG XL JPEG XR JPEG XS JPEG XT PNG APNG TIFF TIFF/EP TIFF/IT
Others	AV1 AVIF BPG DjVu EXR FLIF ICER MNG PGF QOI QTVR WBMP WebP

Containers

ISO,IEC	MPEG-ES MPEG-PES MPEG-PS MPEG-TS ISO/IEC base media file format MPEG-4 Part 14 (MP4) Motion JPEG 2000 MPEG-21 Part 9 MPEG media transport
ITU-T	H.222.0 T.802
IETF	RTP Ogg Matroska
SMPTE	GXF MXF
Others	3GP and 3G2 AMV ASF AIFF AVI AU BPG Bink Smacker BMP DivX Media Format EVO Flash Video HEIF IFF M2TS Matroska WebM QuickTime File Format RatDVD RealMedia RIFF WAV MOD and TOD VOB, IFO and BUP

Collaborations

Methods

Lists

SeeCompression methods for techniques andCompression software for codecs

Data compression software

Archivers with
compression
(comparison)

Free and open-source	7-Zip Ark Expander File Roller FreeArc Info-ZIP KGB Archiver PAQ pax PeaZip XAD (decompression only) Xarchiver Zipeg ZPAQ
Freeware	Filzip LHA StuffIt Expander (decompression only) The Unarchiver (decompression only) TUGZip ZipGenius
Commercial	ARC ALZip Archive Utility ARJ BetterZip MacBinary PKZIP/SecureZIP PowerArchiver StuffIt WinAce WinRAR WinZip

Non-archiving
compressors

Generic	bzip2 compress gzip lzip lzop pack rzip Snappy XZ Utils zstd
For code	UPX

Audio
compression
(comparison)

Lossy	AAC Fraunhofer FDK AAC Nero AAC Codec FAAC Helix DNA Producer MP3 l3enc LAME TooLAME libavcodec libcelt libopus libspeex Musepack libvorbis Windows Media Encoder
Lossless	ALAC FLAC libavcodec Monkey's Audio mp4als OptimFROG Shorten WavPack

Video
compression
(comparison)

Lossy

MPEG-4 ASP	3ivx DivX Nero Digital FFmpeg HDX4 Xvid
H.264	CoreAVC Blu-code DivX FFmpeg Nero Digital OpenH264 QuickTime x264
HEVC	DivX x265
Others	CineForm Cinepak Daala DNxHD Helix DNA Producer Indeo libavcodec Schrödinger (Dirac) SBC Sorenson VP7 libtheora libvpx Windows Media Encoder