This article has multiple issues. Please helpimprove it or discuss these issues on thetalk page.(Learn how and when to remove these messages) (Learn how and when to remove this message)
|
CIF (Common Intermediate Format orCommon Interchange Format), also known asFCIF (Full Common Intermediate Format), is a standardized format for thepicture resolution,frame rate,color space, andcolor subsampling ofdigital video sequences used invideo teleconferencing systems. It was first defined in theH.261 standard in 1988.

As the word "common" in its name implies, CIF was designed as a common compromise format to be relatively easy to convert for use either withPAL orNTSC standard displays and cameras. CIF defines a video sequence with a resolution of 352 × 288, which has a simple relationship to the PAL picture size, but with a frame rate of 30000/1001 (roughly 29.97) frames per second like NTSC, with color encoded using aYCbCr representation with4:2:0 color sampling. It was designed as a compromise between PAL and NTSC schemes, since it uses a picture size that corresponds most easily to PAL, but uses the frame rate of NTSC. The compromise was established as a way to reach international agreement so that video conferencing systems in different countries could communicate with each other without needing two separate modes for displaying the received video.
The simple way to convert NTSC video to CIF is to capture every otherfield (e.g., the top fields) ofinterlaced video,downsample it by 2:1 horizontally to convert 704 samples per line to 352 samples per line, andupsample it vertically by a ratio of 6:5 vertically to convert 240 lines to 288 lines. The simple way to convert PAL video to CIF is to similarly capture every other field, downsample it horizontally by 2:1, and introduce somejitter in the frame rate by skipping or repeating frames as necessary. Since H.261 systems typically operated at lowbit rates, they also typically operated at low frame rates by skipping many of the camera source frames, so introducing some jitter in the frame rate tended not to be noticeable. More sophisticated conversion schemes (e.g., usingdeinterlacing to improve the vertical resolution from an NTSC camera) could also be used in higher quality systems.
In contrast to the CIF compromise that originated with the H.261 standard, there are two variants of the SIF (Source Input Format) that was first defined in theMPEG-1 standard. SIF is otherwise very similar to CIF. SIF on 525-line ("NTSC") based systems is 352 × 240 with a frame rate of 30000/1001 frames per second, and on 625-line ("PAL") based systems, it has the same picture size as CIF (352 × 288) but with a frame rate of 25 frames per second.
Some references to CIF are intended to refer only to itsresolution (352 × 288), without intending to refer to its frame rate.
The YCbCr color representation had been previously defined in the first standard digital video source format,CCIR 601, in 1982. However, CCIR 601 uses4:2:2 color sampling, which subsamples the Cb and Cr components only horizontally. H.261 additionally used vertical color subsampling, resulting in what is known as 4:2:0.
QCIF means "Quarter CIF". To have one quarter of the area, as "quarter" implies, the height and width of the frame are halved.
Terms also used areSQCIF (Sub Quarter CIF, sometimesSub-QCIF),SCIF (sometimesSub-CIF ),4CIF (4 × CIF),9CIF (9 × CIF) and16CIF (16 × CIF). The resolutions for all of these formats are summarized in the table below.
| Format | Video Resolution | Storage aspect ratio (SAR) |
|---|---|---|
| SQCIF | 128 × 96 | 4:3 |
| QCIF | 176 × 144 | 11:9 |
| SCIF | 256 x 192 | 4:3 |
| SIF(525) | 352 x 160 | 11:5 (≈13:6) |
| CIF/SIF(625) | 352 × 288 | 11:9 |
| 4SIF(525) | 704 x 320 | 11:5 (≈13:6) |
| 4CIF/4SIF(625) | 704 × 576 | 11:9 |
| 9CIF | 1056 × 864 | 11:9 |
| 16CIF | 1408 × 1152 | 11:9 |
xCIFpixels are not square, instead having a ″native″ aspect ratio (pixel aspect ratio (PAR)) of 12:11 (PAR =DAR : SAR =4/3 : 11/9 = 12/11), as with the standard for 625-line systems (seeCCIR 601). On square-pixel displays (e.g., computer screens and many modern televisions) xCIF rasters should be rescaled so that the picture covers a 4:3 area, in order to avoid a "stretched" look: CIF content expanded horizontally by 12:11 results in a 4:3 raster of 384 × 288 square pixels (384 = 352 * 12/11). (This can happen on larger graphics displays of any aspect ratio in awindow of384 × 288 square pixels or enlarged to full screen on any larger 4:3 graphic display.)[citation needed]
The CIF and QCIF picture dimensions were specifically chosen to be multiples of 16 because of the way thatdiscrete cosine transform based video compression/decompression was handled in H.261, using 16 × 16macroblocks and 8 × 8 transform blocks. So a CIF-size image (352 × 288) contains 22 × 18 macroblocks and a QCIF image (176 × 144) contains 11 × 9 macroblocks. The 16 × 16 macroblock concept was later also used in other compression standards such asMPEG-1,MPEG-2,MPEG-4 Part 2,H.263, andH.264/MPEG-4 AVC.